Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

Non-coresident family as a driver of migration change in a crisis:
The case of the COVID-19 pandemic

Unchitta Kan1,*, Jericho McLeod1, Eduardo López1


1 Department of Computational and Data Sciences, George Mason University, VA, USA

* ukanjana@gmu.edu


Original submitted draft: October 23, 2023
Current revision: April 1, 2024

Abstract

Changes in U.S. migration during the COVID-19 pandemic show that many moved to less populated cities from larger cities, deviating from previous trends. In this study, building on prior work in the literature showing that the abundance of family ties is inversely related to population size, we analyze these migration changes with a focus on the crucial, yet overlooked factor of extended family. Employing two large-scale data sets, census microdata and mobile phone GPS relocation data, we show a collection of empirical results that paints a picture of migration change affected by kin. Namely, we find that people migrated closer to family at higher rates after the COVID-19 pandemic started. Moreover, even controlling for factors such as population density and cost of living, we find that changes in net in-migration tended to be larger and positive in cities with larger proportions of people who can be parents to adult children (our proxy for parental family availability, which is also inversely related to population size). Our study advances the demography-disaster nexus and amplifies ongoing literature highlighting the role of broader kinship systems in large-scale socioeconomic phenomena.


Keywords: inter-city migration, kinship, family distribution, cities, demography, mobility data

1 Introduction

The COVID-19 pandemic brought about a profound change in how people live and work. Among other things, the closure of facilities and attractions as well as the shift to remote work weakened the connection between people’s place of residence and work. Many individuals were no longer constrained to live in a particular city for economic opportunities, and some were incentivized by financial or personal reasons to relocate. It is not surprising, then, that notable changes in U.S. domestic migration trends were observed after the pandemic started. Our analysis, as well as others (Coven et al., 2023; Haslag and Weagley, 2022), suggests that a substantial proportion of these deviations can be attributed to a modification in the destinations of people’s migration: more people moved to smaller (less populated) cities, especially from larger cities, and fewer people moved to large cities.

What characteristics of cities of different sizes attracted or repelled people during the pandemic? Less crowding, lower cost of living, measures introduced to control the pandemic, and even weather may have helped to explain the disruption of typical migration patterns and why people were moving to smaller cities. In this study, we investigate a factor that has been overlooked: extended family who reside elsewhere. We ask the question, can some of these migration changes be attributable to individuals relocating to be closer to kin?

Our question is motivated by a recent empirical finding (McLeod, Kan, and López, 2023) that an individual is more likely to have kin ties located in smaller cities, as well as lines of research that point to the critical role of kin ties (David-Barrett et al., 2023; Furstenberg, 2020; Mulder, 2018) and how they are particularly activated during crises (Reed et al., 2023; Völker, 2023). Importantly, COVID-19 was a major societal event that, at least temporarily, uncoupled geography and economic aspirations—the latter of which is in tension with extended family orientation (Miller, 1976). If family-driven migration was indeed happening at higher rates, flows to smaller cities would follow as a consequence of the distribution of non-coresident family ties in the U.S. (McLeod, Kan, and López, 2023).

In this paper, we investigate empirically the connection between changes in inter-city migration during the COVID-19 crisis, city population, and the role that non-coresident family played in both large-scale patterns and individual-level migration dynamics. Our analyses show these factors to be related, emphasizing that the distribution of people’s non-coresident family is relevant in order to understand human migration. Here, for reasons explained below, our test of the influence of family on migration is for parental relationships. Our results point to important implications in the context of cities, and amplify ongoing literature which highlights the need for more research on the role of broader kinship in large-scale socioeconomic phenomena (David-Barrett, 2019; David-Barrett et al., 2023), especially non-household kin that is outside of the nuclear family (Furstenberg, 2020; Reed et al., 2023).

1.1 Non-coresident family as a driver of migration in a crisis

There are at least three lines of research that help us understand how non-household kin may influence migration decisions in normal times, and why it is plausible that people would relocate to be closer to them in times of crisis. The first line is referred to as the ‘family ties perspective’ (Mulder, 2007, 2018). The second line is the literature which focuses on how social ties are activated in a crisis such as the COVID-19 pandemic (Reed et al., 2023; Völker, 2023). The third line is the well-known tension between personal economic and social achievements (Miller, 1976).

1.1.1 Importance of kin

The ‘family ties perspective’ (Mulder, 2007, 2018) argues that family ties need to be taken into account in migration analyses because of three crucial ingredients they offer: support, need for proximity, and uniqueness.

Family is central to social and support networks of people (Wellman, 1979; Dunbar and Spoors, 1995; Plickert, Côté, and Wellman, 2007; Rözer, Mollenhorst, and Poortman, 2016). While individuals can, and do, maintain ties with family via calls after moving away (David-Barrett et al., 2023; Lambiotte et al., 2008), family face-to-face interaction and transfer of practical or physical support cannot be fully replaced by virtual interaction; they require geographical proximity and thus incentivize migration.

Kin is fundamentally different from other types of relationships because it is given rather than chosen. Also, an important characteristic of kinship that is less prominent in other relationships is the feeling of responsibility that kin feels towards each other (Mulder, 2018). From transfers of resources to help with mundane tasks to emotional support to aversion of crises, people look to their kin. For example, parents and adult children are more than ten times as likely to give or receive major assistance compared to other types of relationships (Wellman, 1979). In 2012 alone, it was estimated that informal care, which includes kin-based intergenerational care, amounted to over one billion hours of unpaid work per week in the U.S. (Dukhovnov and Zagheni, 2015). People place greater importance on kin than non-kin when it comes to interpersonal contacts (David-Barrett et al., 2023), especially women in their child-rearing years; collectively, individuals may even prioritize family over themselves (Krys et al., 2023).

Research has also shown that merely having family in one’s social network can influence the composition of that network itself (Dunbar and Spoors, 1995; Rözer, Mollenhorst, and Poortman, 2016). At the same time, the composition of one’s support network can also become focused on kin ties in non-routine situations.

1.1.2 Crises and kin ties

In times of crises, kin ties may be particularly activated. For example, as formal care facilities shut down during the pandemic (Lee and Parolin, 2021), informal intergenerational care (e.g. day care of young children by grandparents or eldercare by adult children) was likely to be even more important. This may be evidenced in the study by Völker (2023), which indicated that while people’s social networks became smaller and focused on core ties after the COVID-19 pandemic started, the network of practical helpers of the elderly proportionally consisted more of their children; similarly, among individuals aged 18–35, parents made up a larger share of their practical helpers networks.

Simultaneously, during the pandemic, Reed et al. (2023) found an increase in communications with non-coresident kin. Tunçgenç, Mulukom, and Newson (2023) found family bonds to positively influence well-being, and that among close social ties, only family bonds were positively linked to engagement in health behaviors. These findings are in line with another study by Lee, Lee, and Hartmann (2023) showing the stability and strengthening of bonds with kin compared to other ties after the start of the pandemic.

The activation of kin ties in systemic crises is not a phenomenon specific to just COVID-19. Other examples dating decades earlier include Shavit, Fischer, and Koresh (1994) who have found similar results in the context of the Gulf War, and Hurlbert, Haines, and Beggs (2000) in the context of hurricane Andrew.

1.1.3 Extended kin in migration decisions

While previous literature findings have established that non-household kin can influence migration decisions (see, e.g.,  Kan (2007); Spring et al. (2017)), kinship factors may be in tension with economic aspirations in the propensity to migrate (Miller, 1976). In other words, people may have to choose between being closer to economic opportunities or to extended family. However, it can be argued that COVID-19 is a unique case in that it is a systemic crisis that enabled more mobility, not less. At a time where individuals and families might look to their family the most, the pandemic also decoupled geography and employment, allowing people to achieve proximity to their extended family to a greater extent than before.

1.1.4 Parental ties

Among kin ties, intergenerational ties are understood in the literature to be the “important arena of action in Western kinship systems” (Furstenberg, 2020). The majority of support from kin flows vertically (typically in the downward direction, i.e., from parents to children or grandchildren). That about 75% of people in the U.S. whose parents or children are still alive reside within 30 miles from one of them (Choi et al., 2020) reflects this situation. Parents-in-law often act similarly to parents in terms of the support they provide (Wellman and Wortley, 1989; Compton and Pollak, 2014), and in many cultures they can be considered consequential in terms of one’s social network and cooperation (David-Barrett, 2023). While there is some modest evidence of horizontal transfer of time and resources between family members such as siblings (Wellman and Wortley, 1989; White, 2001), relatively little is known about contacts and exchanges between extended kin such as aunts, uncles, cousins, etc. (Furstenberg, 2020).

Aside from the clear evidence of their prominence, parental ties are a very compelling variable to study as a “pull factor” in pandemic-migration for the reason that migration is highly age- and life course-specific as well as context-dependent (Millington, 2000).

Individuals aged between 18 and 44 tend to have the highest propensity to migrate (Rogerson and Kim, 2005; Molloy, Smith, and Wozniak, 2011). This propensity declines with age (Castro and Rogers, 1984), which may have to do with the accumulation of social capital (Kan, 2007); it also declines with the household family life cycle stage (Miller, 1976).

At the same time, individuals in the 18 – 44 age range may also be the most likely to need support from or give support to parental family. For example, dual-earner parents returning to live near their own parents for childcare assistance is an identified phenomenon in the literature (Bailey, Blake, and Cooke, 2004). Grandparents are known to be important providers of early childcare (Furstenberg, 2020; Dukhovnov and Zagheni, 2015; Mulder, 2018), and evidence suggests proximity to them increases labor force participation among mothers with young children (Compton and Pollak, 2014). In the other care direction, individuals in their late child-rearing period—who belong to the so-called “sandwich generation”—may also find themselves needing to care for their aging parents.

Therefore, if it is largely the individuals in their twenties, thirties, and forties that were migrating for familial reasons, it is likely that they would look to where their parents were. For this reason, we focus in this study primarily on parental family.

2 Materials and methods

2.1 Design of study

Our study design seeks to provide evidence for the existence of a three-way relation between pandemic-migration, city population size, and non-coresident family. First, we analyze large-scale relocation data and show the increased migration flow to cities with smaller population after the pandemic started. Then, through three empirical investigations, we link migration to parental family as well as parental family to population—the first is done at the individual level, while the other two are done at the city level.

In Investigation 1, we examine whether there are differences in the individual-level migration rates to move towards parental family (or “return to home”) after the pandemic began in 2020. In Investigation 2, we construct a proxy variable that estimates the abundance of people who can be parents to adult children in each U.S. city and relate it to both city population and migration variables. In Investigation 3, we estimate an empirical model which tests whether cities with higher parental family availability saw a higher net population influx after the pandemic started.

Overall, Investigation 1 serves to validate our line of inquiry (linking migrational change to kin) as well as our study assumptions, while Investigations 2 and 3 connect parental family availability to both city population and changes in net migration of cities. The empirical model controls for relevant migration factors and mitigates potential confounders. Put together, our study examines whether the trend to migrate to smaller cities is partially a consequence of people moving to be closer to family, coupled with the heterogenous spatial distribution of parental family ties in the U.S.

We now elaborate on these analyses and the methodology we employ, beginning with the data used in this study.

2.2 Data sets

The two primary, large-scale data sets we employ in this study are the county-to-county relocation index aggregated from anonymized, opted-in mobile phone GPS data provided by the location intelligence company Spectus (2021), and the yearly U.S. Census American Community Survey Public Use Microdata Samples (ACS PUMS).

The Spectus data set has an advantage over administrative place-level migration data (e.g., from the U.S. Internal Revenue Service) due to its higher temporal resolution (weekly) as well as its ability to capture real time migration by algorithmically detecting new home location from mobile phone GPS instead of relying on individuals to report their change of address to governmental agencies (which could occur with a significant delay or not occur at all). Our data encompass nearly one-fourth of all possible origin-destination city pairs in the U.S.

Our second large-scale data set, ACS PUMS, contains individual-level information on a subsample of the U.S. Census ACS respondents, including the demographics of each person in the household, their place of birth, and their current place of residence. The microdata sample also includes survey questions about migration in the past year, such as whether a respondent has moved and the location from which they moved. For our study, we obtain PUMS yearly samples for the years 2016 to 2021 (corresponding to six separate samples) from IPUMS USA (Ruggles et al., 2023) which integrates and harmonizes PUMS data across all survey years. Across the six years, our IPUMS USA sample contains in total N=18,694,272𝑁18694272N=18,694,272italic_N = 18 , 694 , 272 person-records from N=8,247,978𝑁8247978N=8,247,978italic_N = 8 , 247 , 978 households.

Supplementarily, we obtain city population sizes, as well as control variables used in our empirical model from the ACS aggregated estimates (U.S. Census Bureau, 2020) and the U.S. Bureau of Economic Analysis (2020).

2.3 Calculations of inter-city migration from Spectus data

In the raw form, the Spectus data measures the weekly relocation flows between pairs of counties in the U.S. as an aggregated index called the Relocation Index. To calculate the Relocation Index rhk(t)subscript𝑟𝑘𝑡r_{hk}(t)italic_r start_POSTSUBSCRIPT italic_h italic_k end_POSTSUBSCRIPT ( italic_t ) during week t𝑡titalic_t between county hhitalic_h (old county) and county k𝑘kitalic_k (new county), Spectus uses the following formula:

rhk(t)=devices in h with a new home in k in week ttotal number of devices in h.subscript𝑟𝑘𝑡devices in h with a new home in k in week ttotal number of devices in hr_{hk}(t)=\frac{\text{devices in $h$ with a new home in $k$ in week $t$}}{% \text{total number of devices in $h$}}.italic_r start_POSTSUBSCRIPT italic_h italic_k end_POSTSUBSCRIPT ( italic_t ) = divide start_ARG devices in italic_h with a new home in italic_k in week italic_t end_ARG start_ARG total number of devices in italic_h end_ARG . (1)

Home location is detected using an algorithm that identifies persistent night-time GPS location. For movers, this algorithm detects a change in the home location. Note that the Spectus data preserves the privacy of the users as they are aggregated at the county level. Our data spans from January 2019 to December 2020.

Our study is focused on inter-city migration, and we take core-based statistical areas (CBSAs) to be the geographic unit of study. CBSAs are urban areas delineated by the U.S. Office of Management and Budget, and can be either micropolitan areas (with population between 10,000 and 49,999 people) or metropolitan areas (with population of at least 50,000 people). We estimate dyadic relocation flows at the CBSA level by performing additional aggregations on the county-level Spectus data. In particular, to estimate the number Rij(t)subscript𝑅𝑖𝑗𝑡R_{ij}(t)italic_R start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ( italic_t ) of moves (as opposed to an index) from city i𝑖iitalic_i to city j𝑗jitalic_j during week t𝑡titalic_t, we use the formula

Rij(t)=hikjπhphrhk(t),subscript𝑅𝑖𝑗𝑡subscript𝑖subscript𝑘𝑗subscript𝜋subscript𝑝subscript𝑟𝑘𝑡R_{ij}(t)=\sum_{h\in i}\sum_{k\in j}\pi_{h}p_{h}r_{hk}(t),italic_R start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ( italic_t ) = ∑ start_POSTSUBSCRIPT italic_h ∈ italic_i end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_k ∈ italic_j end_POSTSUBSCRIPT italic_π start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT italic_r start_POSTSUBSCRIPT italic_h italic_k end_POSTSUBSCRIPT ( italic_t ) , (2)

where the sums are over the counties that have a geographic correspondence to the appropriate CBSAs (i.e., counties that are part of the CBSAs), phsubscript𝑝p_{h}italic_p start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT is the population of county hhitalic_h, and πhsubscript𝜋\pi_{h}italic_π start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT is the estimated device sampling rate for county hhitalic_h. We calibrated πhsubscript𝜋\pi_{h}italic_π start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT by calculating and comparing yearly Spectus flow at the county level to the average yearly county-to-county migration flow derived from the 5-year, 2016–2019 migration data published by the U.S. Census Bureau (2021).

We also calculate the number Rij(θ)subscript𝑅𝑖𝑗𝜃R_{ij}(\theta)italic_R start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ( italic_θ ) of relocations from city i𝑖iitalic_i to city j𝑗jitalic_j spanning over a certain time period as

Rij(θ)=tθRij(t)=tθhikjπhphrhk(t).subscript𝑅𝑖𝑗𝜃subscript𝑡𝜃subscript𝑅𝑖𝑗𝑡subscript𝑡𝜃subscript𝑖subscript𝑘𝑗subscript𝜋subscript𝑝subscript𝑟𝑘𝑡R_{ij}(\theta)=\sum_{t\in\theta}R_{ij}(t)=\sum_{t\in\theta}\sum_{h\in i}\sum_{% k\in j}\pi_{h}p_{h}r_{hk}(t).italic_R start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ( italic_θ ) = ∑ start_POSTSUBSCRIPT italic_t ∈ italic_θ end_POSTSUBSCRIPT italic_R start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ( italic_t ) = ∑ start_POSTSUBSCRIPT italic_t ∈ italic_θ end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_h ∈ italic_i end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_k ∈ italic_j end_POSTSUBSCRIPT italic_π start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT italic_r start_POSTSUBSCRIPT italic_h italic_k end_POSTSUBSCRIPT ( italic_t ) . (3)

We use θ=0𝜃0\theta=0italic_θ = 0 to indicate the period between April 2019 – December 2019, which we take to be the baseline period in this study, and θ=1𝜃1\theta=1italic_θ = 1 to indicate the period between April 2020 – December 2020, the pandemic comparison period. Because we are only concerned with inter-city migration and not intra-city flows, we set Rii(t)subscript𝑅𝑖𝑖𝑡R_{ii}(t)italic_R start_POSTSUBSCRIPT italic_i italic_i end_POSTSUBSCRIPT ( italic_t ) and Rii(θ)subscript𝑅𝑖𝑖𝜃R_{ii}(\theta)italic_R start_POSTSUBSCRIPT italic_i italic_i end_POSTSUBSCRIPT ( italic_θ ) to 0.

In total, there are 211,902 origin-destination city pairs for the baseline period and 192,946 origin-destination pairs for the comparison period; both periods comprise 926 unique origin CBSAs and 926 unique destination CBSAs. The smallest city in the data set is Lamesa, TX (with population around 13,000 people) and the largest city is the New York metropolitan area (with population around 19.3 million people).

2.4 Analysis of trend to move to smaller cities during COVID-19

To illustrate the trend to move to smaller cities during COVID-19—the phenomenon which motivated this study—we define z(P;P,θ)𝑧superscript𝑃𝑃𝜃z(P^{\prime};P,\theta)italic_z ( italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ; italic_P , italic_θ ), the probability that movers from origin CBSAs whose log-population sizes fall into the log-population bin P+ΔP𝑃Δ𝑃P+\Delta Pitalic_P + roman_Δ italic_P would relocate to a destination CBSA whose log-population falls into the P+ΔPsuperscript𝑃Δ𝑃P^{\prime}+\Delta Pitalic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT + roman_Δ italic_P bin during period θ𝜃\thetaitalic_θ (where ΔPΔ𝑃\Delta Proman_Δ italic_P is the bin size). Binning is employed to ease interpretation of the results and to manage fluctuations from the sparsity of samples of city population sizes, and we use log-population instead of raw population due to the skewed nature of city sizes in the U.S. (Ioannides and Skouras, 2013). Using the migration quantity derived from the Spectus data (equation 3), we calculate z(P;P,θ)𝑧superscript𝑃𝑃𝜃z(P^{\prime};P,\theta)italic_z ( italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ; italic_P , italic_θ ) as follows:

z(P;P,θ)=iPjPRij(θ)iPjRij(θ),𝑧superscript𝑃𝑃𝜃subscript𝑖𝑃subscript𝑗superscript𝑃subscript𝑅𝑖𝑗𝜃subscript𝑖𝑃subscript𝑗subscript𝑅𝑖𝑗𝜃z(P^{\prime};P,\theta)=\frac{\sum_{i\in P}\sum_{j\in P^{\prime}}R_{ij}(\theta)% }{\sum_{i\in P}\sum_{j}R_{ij}(\theta)},italic_z ( italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ; italic_P , italic_θ ) = divide start_ARG ∑ start_POSTSUBSCRIPT italic_i ∈ italic_P end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_j ∈ italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_R start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ( italic_θ ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i ∈ italic_P end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_R start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ( italic_θ ) end_ARG , (4)

where the notation iPsubscript𝑖𝑃\sum_{i\in P}∑ start_POSTSUBSCRIPT italic_i ∈ italic_P end_POSTSUBSCRIPT indicates summation over cities i𝑖iitalic_i whose log-population sizes belong to a bin P+ΔP𝑃Δ𝑃P+\Delta Pitalic_P + roman_Δ italic_P.

To capture the changes in z𝑧zitalic_z after the pandemic started, we calculate the ratio z(P;P,θ=1)/z(P;P,θ=0)𝑧superscript𝑃𝑃𝜃1𝑧superscript𝑃𝑃𝜃0z(P^{\prime};P,\theta=1)/z(P^{\prime};P,\theta=0)italic_z ( italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ; italic_P , italic_θ = 1 ) / italic_z ( italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ; italic_P , italic_θ = 0 ) for all P,Psuperscript𝑃𝑃P^{\prime},Pitalic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_P.

2.5 Investigation 1: Micro-level analysis of return-to-home movers

Migration rate λm(t)subscript𝜆𝑚𝑡\lambda_{m}(t)italic_λ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_t ) Migration type Who moved Where moved
λ1(t)(m=1)subscript𝜆1𝑡𝑚1\lambda_{1}(t)\quad(m=1)italic_λ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) ( italic_m = 1 ) Type 1 Individual movers parents’ home
λ2(t)(m=2)subscript𝜆2𝑡𝑚2\lambda_{2}(t)\quad(m=2)italic_λ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_t ) ( italic_m = 2 ) Type 2 Individual movers native place
(not joining parents’)
λ3(t)(m=3)subscript𝜆3𝑡𝑚3\lambda_{3}(t)\quad(m=3)italic_λ start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ( italic_t ) ( italic_m = 3 ) Type 3 Family household movers native place
Table 1: Types of moves identified in the ACS PUMS microdata, analyzed across the years 2016–2021. These types of moves represent relocations towards parental and extended family ties (native place is used in the literature as a proxy for presence of family ties (Compton and Pollak, 2014)). Details and discussion are given in Sections 2.5 and 3.2.

As a first investigation to examine potential links between non-coresident family and migration change during the COVID-19 pandemic, we employ the IPUMS USA microdata samples to study and compare the micro-level behavior of movers across the years 2016 to 2021. We examine whether there are differences in migration rates to move towards parental family after the pandemic began in 2020. In this exercise, we do not yet relate migration to specific cities or city sizes due to limitations in the data (namely, the PUMS variables of interest are only available at the levels of U.S. state and Census Public Use Microdata Areas). However, this exercise validates our line of inquiry (linking migrational change to kin) as well as study assumptions that will be relevant to our next investigation.

We classify three types of family-driven, “return-to-home” movers in the IPUMS USA data to analyze their patterns: (1) individuals moving into their parents’ households; (2) individuals moving back to their native state from elsewhere but not joining their parents’ household; and (3) family household units moving back to their native state from elsewhere. We call these types of moves Type 1, 2, and 3, respectively (see Table 1).

Type 1 movers are relatively simple to interpret: these are individuals who moved back in with their parents. For some adults and households that are a family unit, however, moving in with their parents may not be an option. Instead, they may choose to relocate to be within the locality of their parents. Precise information about the residence of one’s parents is highly identifiable and not publicly available. Therefore, we follow the literature (Compton and Pollak, 2014) and use one’s native state as a proxy variable for proximity to or presence of family ties. The residence of one’s parents is often the home in which one grew up, which is in turn often located in one’s state of birth. We refer to such returns to place of nativity performed by individuals as Type 2 moves, and we refer to such moves performed by family household units as Type 3 moves.

As a technical point which applies to both Type 1 and 2 movers, we consider individual movers to be persons who had migrated in the past year at the time of the PUMS survey, who were either a household of one person or a person who resided in a household in which not everyone had moved. If a household were labeled in the survey as a family household, were larger than one person, and every householder has moved, then we consider the household to be a family household unit mover (which is relevant to identifying Type 3 movers).

To identify Type 1 movers, we look at individual movers who had migrated within the same state or between states in the past year at the time of the survey (i.e., whose ‘MIGRATE1’ variable values are either 2 or 3 in the encoding of the IPUMS USA data). For each person in the sample, IPUMS USA includes variables that identify the mother (‘MOMLOC’) and father (‘POPLOC’) of that person if they live in the same household. (These variables are calculated probabilistically by IPUMS USA as they are not present in the regular ACS PUMS.) For each individual mover, if at least one parent was present in their current household and the parent(s) had not also moved in the past year, we classify that individual as a Type 1 mover.

To identify Type 2 movers, we look at IPUMS USA individual movers who had migrated from a different state in the past year at the time of the survey (‘MIGRATE1’ = 2). If their current state of residence (‘STATEFIP’) was the same as their state of birth (‘BPL’), we label them as Type 2 movers (i.e., individual movers moving back to place of nativity from elsewhere). We note that, strictly speaking, Type 1 moves are not mutually exclusive from Type 2 because one’s parental household may be in one’s native state (so Type 2 moves may contain some Type 1 moves). However, for our study, we exclude Type 1 moves from Type 2 moves.

Finally, we classify family households (‘HHTYPE’ is 1, 2, or 3) larger than one person whose every member had migrated in the past year to be Type 3 movers if their migration destination (current state of residence) was the place of birth of at least one householder.

We introduce the quantity λm(t)subscript𝜆𝑚𝑡\lambda_{m}(t)italic_λ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_t ), where m{1,2,3}𝑚123m\in\{1,2,3\}italic_m ∈ { 1 , 2 , 3 } indexes the type of movers, to capture the rates that their respective types of migration occur in each year t𝑡titalic_t. Table 1 provides a summary (who and where moved) of these move types and their corresponding λm(t)subscript𝜆𝑚𝑡\lambda_{m}(t)italic_λ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_t ). For notational brevity, we assume the dependence on t𝑡titalic_t in λm(t)subscript𝜆𝑚𝑡\lambda_{m}(t)italic_λ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_t ) is implicit and write λmsubscript𝜆𝑚\lambda_{m}italic_λ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT interchangeably. We calculate λ1subscript𝜆1\lambda_{1}italic_λ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and λ2subscript𝜆2\lambda_{2}italic_λ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT for each year by dividing the number of Type 1 and Type 2 movers, respectively, by the total number of individual movers. We calculate λ3subscript𝜆3\lambda_{3}italic_λ start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT by dividing the number of Type 3 movers by the total number of household movers. Our calculations are with consideration to sampling weights (see Section 4 of the publication by U.S. Census Bureau (2021) for ACS PUMS sampling weights). Explicitly, we use the formula

λm(t)=us(t)wu𝟙ind(u)𝟙m(u)us(t)wu𝟙ind(u)(m=1,2)subscript𝜆𝑚𝑡subscript𝑢𝑠𝑡subscript𝑤𝑢subscript1ind𝑢subscript1𝑚𝑢subscript𝑢𝑠𝑡subscript𝑤𝑢subscript1ind𝑢𝑚12\lambda_{m}(t)=\frac{\sum_{u\in s(t)}w_{u}\mathbbm{1}_{\text{ind}}(u)\mathbbm{% 1}_{m}(u)}{\sum_{u\in s(t)}w_{u}\mathbbm{1}_{\text{ind}}(u)}\quad\quad\quad(m=% 1,2)italic_λ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_t ) = divide start_ARG ∑ start_POSTSUBSCRIPT italic_u ∈ italic_s ( italic_t ) end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT blackboard_1 start_POSTSUBSCRIPT ind end_POSTSUBSCRIPT ( italic_u ) blackboard_1 start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_u ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_u ∈ italic_s ( italic_t ) end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT blackboard_1 start_POSTSUBSCRIPT ind end_POSTSUBSCRIPT ( italic_u ) end_ARG ( italic_m = 1 , 2 ) (5)

to calculate λ1subscript𝜆1\lambda_{1}italic_λ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and λ2subscript𝜆2\lambda_{2}italic_λ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, where u𝑢uitalic_u denotes persons in the IPUMS USA sample s(t)𝑠𝑡s(t)italic_s ( italic_t ) in year t𝑡titalic_t, 𝟙indsubscript1ind\mathbbm{1}_{\text{ind}}blackboard_1 start_POSTSUBSCRIPT ind end_POSTSUBSCRIPT and 𝟙msubscript1𝑚\mathbbm{1}_{m}blackboard_1 start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT are binary variables (=1absent1=1= 1 if a condition is met and 0 if it is not) indicating whether u𝑢uitalic_u is an individual mover and Type m𝑚mitalic_m mover, respectively. The variable wusubscript𝑤𝑢w_{u}italic_w start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT corresponds to the person sampling weight (‘PERWT’) of u𝑢uitalic_u.

For λ3subscript𝜆3\lambda_{3}italic_λ start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT, we use

λ3(t)=qs(t)wq𝟙hh(q)𝟙3(q)qs(t)wq𝟙hh(q),subscript𝜆3𝑡subscript𝑞𝑠𝑡subscript𝑤𝑞subscript1hh𝑞subscript13𝑞subscript𝑞𝑠𝑡subscript𝑤𝑞subscript1hh𝑞\lambda_{3}(t)=\frac{\sum_{q\in s(t)}w_{q}\mathbbm{1}_{\text{hh}}(q)\mathbbm{1% }_{3}(q)}{\sum_{q\in s(t)}w_{q}\mathbbm{1}_{\text{hh}}(q)},italic_λ start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ( italic_t ) = divide start_ARG ∑ start_POSTSUBSCRIPT italic_q ∈ italic_s ( italic_t ) end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT blackboard_1 start_POSTSUBSCRIPT hh end_POSTSUBSCRIPT ( italic_q ) blackboard_1 start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ( italic_q ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_q ∈ italic_s ( italic_t ) end_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT blackboard_1 start_POSTSUBSCRIPT hh end_POSTSUBSCRIPT ( italic_q ) end_ARG , (6)

where q𝑞qitalic_q denotes households in the yearly IPUMS USA sample, 𝟙hhsubscript1hh\mathbbm{1}_{\text{hh}}blackboard_1 start_POSTSUBSCRIPT hh end_POSTSUBSCRIPT and 𝟙3subscript13\mathbbm{1}_{3}blackboard_1 start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT are binary variables indicating whether q𝑞qitalic_q is a family household mover and Type 3 mover, respectively, and wqsubscript𝑤𝑞w_{q}italic_w start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT corresponds to the household sampling weight (‘HHWT’) of q𝑞qitalic_q.

We note that for the 2020 sample, IPUMS USA uses experimental weights published by U.S. Census Bureau to address data collection and quality issues associated with the COVID-19 pandemic. Although the results for 2020 should still be interpreted with caution, we include them because they nevertheless provide valuable information.

2.6 Investigation 2: City-level analysis of parental family availability, population size, and net migration

If people were to migrate back to parental family after the COVID-19 shock, then we should see also larger flows to cities which have a larger abundance of parental family households. McLeod, Kan, and López (2023) refer to this abundance as availability. For our study, we require measures of parental family availability and net migration at the city level, described in this section.

2.6.1 Constructing parental family availability proxy variable from IPUMS USA

As mentioned earlier, there is considerable scarcity of data linking people to the location of their parents. Therefore, in this exercise we introduce a proxy variable visubscript𝑣𝑖v_{i}italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT which estimates the stock of households in each city i𝑖iitalic_i whose householder(s) can be parents to adult children, capturing the notion of parental family availability.

Motivated by the literature and by methodological reasons, we design our proxy variable visubscript𝑣𝑖v_{i}italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT as follows. Because PUMS data do not contain information on kin who do not reside in the same household as the sample individuals, we cannot directly infer the city in which an individual’s parents may be located. At the same time, based on the literature we discussed in Section 1.1.4 (Miller, 1976; Castro and Rogers, 1984; Millington, 2000; Rogerson and Kim, 2005; Kan, 2007; Molloy, Smith, and Wozniak, 2011; Mulder, 2018), we assume that relocating individuals tend to generally be in their twenties, thirties, or forties, because these demographic groups tend to have the highest propensities to migrate. (We validate this assumption using statistics of the age of “return-to-home” movers in our previous analysis.) With this assumption, we expect that their parents would be at least a generation older. Therefore, we base our parental household estimation on certain age and marriage criteria.

Explicitly, within the IPUMS USA 2019 sample, we identify households in the sample that satisfy the following criteria:

  1. 1.

    Family households in which the head of household or their spouse (if married and spouse is present) is at least 50 years old, or

  2. 2.

    Non-family households in which the head of household is at least 50 years old and is either married but no spouse present, separated, divorced, or widowed.

The specific age of 50 was chosen based on the literature finding that “the vast majority of American parents who are older than the age of 50 provide support to children and grandchildren” (Furstenberg, 2020). Summing the household sampling weights (‘HHWT’) of these households estimates the total number of such households in the sample.

The most detailed geographic identifier in the ACS PUMS is the Public Use Microdata Area (PUMA) which consists of one or more contiguous counties and census tracts. Because our goal is to analyze migration patterns at the city level, we perform a PUMA-to-CBSA geo-allocation mapping algorithm to obtain city estimates of the parental family proxy variable. The mapping algorithm relies on the geographic correspondence file between PUMAs and CBSAs obtained from the Geocorr application maintained by the Missouri Census Data Center (2018). In this correspondence file, each entry is a PUMA–CBSA intersection along with an allocation factor which represents the proportion of the population living in this intersection out of the entire PUMA. Using these factors, we allocate the weighted total number of households satisfying the above criteria in each PUMA to each CBSA that intersects with it. Finally, we divide this number in each CBSA by the weighted total number of households in the CBSA to obtain the share of parental family households visubscript𝑣𝑖v_{i}italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT.

2.6.2 City-level net migration

From our Spectus inter-city flows Rij(t)subscript𝑅𝑖𝑗𝑡R_{ij}(t)italic_R start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ( italic_t ) and Rij(θ)subscript𝑅𝑖𝑗𝜃R_{ij}(\theta)italic_R start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ( italic_θ ), defined in equations (2) and (3) respectively, we derive two net migration quantities: yi(t)subscript𝑦𝑖𝑡y_{i}(t)italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ), and yi(θ)subscript𝑦𝑖𝜃y_{i}(\theta)italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_θ ).

The quantity yi(t)subscript𝑦𝑖𝑡y_{i}(t)italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) is a measure of net in-migration of a city and captures the inflow per outflow of city i𝑖iitalic_i during week t𝑡titalic_t. It is given by

yi(t)=kRki(t)jRij(t),subscript𝑦𝑖𝑡subscript𝑘subscript𝑅𝑘𝑖𝑡subscript𝑗subscript𝑅𝑖𝑗𝑡y_{i}(t)=\frac{\sum_{k}R_{ki}(t)}{\sum_{j}R_{ij}(t)},italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) = divide start_ARG ∑ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_R start_POSTSUBSCRIPT italic_k italic_i end_POSTSUBSCRIPT ( italic_t ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_R start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ( italic_t ) end_ARG , (7)

where the numerator gives the total flow into a city i𝑖iitalic_i from all other cities during week t𝑡titalic_t and the denominator gives the total flow out of a city i𝑖iitalic_i to all other cities during week t𝑡titalic_t. Notice that yi(t)>1subscript𝑦𝑖𝑡1y_{i}(t)>1italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) > 1 implies that the migration inflow exceeds the migration outflow of city i𝑖iitalic_i during week t𝑡titalic_t (i.e., positive net influx). Consequently, yi(t)subscript𝑦𝑖𝑡y_{i}(t)italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) captures the direction and magnitude of the net migration flow of a city.

We also calculate the corresponding quantity spanning the comparison time period, yi(θ)subscript𝑦𝑖𝜃y_{i}(\theta)italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_θ ), using

yi(θ)=kRki(θ)jRij(θ).subscript𝑦𝑖𝜃subscript𝑘subscript𝑅𝑘𝑖𝜃subscript𝑗subscript𝑅𝑖𝑗𝜃y_{i}(\theta)=\frac{\sum_{k}R_{ki}(\theta)}{\sum_{j}R_{ij}(\theta)}.italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_θ ) = divide start_ARG ∑ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_R start_POSTSUBSCRIPT italic_k italic_i end_POSTSUBSCRIPT ( italic_θ ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_R start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ( italic_θ ) end_ARG . (8)

Similar to the approach in Section 2.4, we can calculate the ratio yi(θ=1)/yi(θ=0)subscript𝑦𝑖𝜃1subscript𝑦𝑖𝜃0y_{i}(\theta=1)/y_{i}(\theta=0)italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_θ = 1 ) / italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_θ = 0 ) to capture the changes in the net in-migration between the two time periods. We relate both yi(t)subscript𝑦𝑖𝑡y_{i}(t)italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) and yi(θ=1)/yi(θ=0)subscript𝑦𝑖𝜃1subscript𝑦𝑖𝜃0y_{i}(\theta=1)/y_{i}(\theta=0)italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_θ = 1 ) / italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_θ = 0 ) to our parental family proxy visubscript𝑣𝑖v_{i}italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT.

2.7 Investigation 3: Empirical model

Finally, we estimate an empirical model to help control for other factors that may have been at play in pandemic-migration. We use a difference-in-differences (DiD) strategy with a continuous treatment variable (namely, visubscript𝑣𝑖v_{i}italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT). DiD is an econometric model used to estimate the effect of a treatment by comparing the outcomes in the treated and untreated groups between two time periods (Lee, 2016); a continuous treatment is used to model increasing intensity of treatment instead of splitting units into treated and untreated groups. Applied to our study, with our outcome being changes in net in-migration, we can model parental family availability as a continuous treatment and take the two time periods to be before and after the COVID-19 shock.

Our model, which pools data from both before and during the pandemic, can be written as

ln[yi(θ=1)yi(θ=0)]=βvi+γ+aρaCai,subscript𝑦𝑖𝜃1subscript𝑦𝑖𝜃0𝛽subscript𝑣𝑖𝛾subscript𝑎subscript𝜌𝑎subscript𝐶𝑎𝑖\ln\left[\frac{y_{i}(\theta=1)}{y_{i}(\theta=0)}\right]=\beta v_{i}+\gamma+% \sum_{a}\rho_{a}C_{ai},roman_ln [ divide start_ARG italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_θ = 1 ) end_ARG start_ARG italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_θ = 0 ) end_ARG ] = italic_β italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + italic_γ + ∑ start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT italic_ρ start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT italic_a italic_i end_POSTSUBSCRIPT , (9)

where the dependent variable is the log-ratio of the net flux into a city (see equation (8)) after and before the pandemic started, measuring the changes in migration. C𝐶Citalic_C denotes scalar control variables, indexed by a𝑎aitalic_a. Notice that the city-specific differences that existed in the dependent variable across cities before the COVID-19 shock are accounted for by the denominator in the log-ratio, constituting the city fixed effect.

Our coefficient of interest, β𝛽\betaitalic_β, is city-dependent and measures whether higher parental family availability v𝑣vitalic_v would lead to higher net flux after the pandemic started. The coefficient γ𝛾\gammaitalic_γ is city-independent and accounts for the effects that COVID-19 alone had on the dependent variable, constituting the time fixed effect. Finally, the city-dependent control variables Casubscript𝐶𝑎C_{a}italic_C start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT are included because we cannot ignore the possibility that the changes in migration behavior may have also been influenced by other factors whose importance may vary after the pandemic started. Using population density as an example, individuals may experience a higher desire to move to less dense places during the pandemic to avoid being infected, in which case the ρ𝜌\rhoitalic_ρ coefficient for population density will be negative. We include control variables that are relevant to relocation decisions: population size and density, median home value; median income; employment level (number of jobs per person); and the share of single family homes (SFH) in the city (home ownership is a major aspiration in the American life, and with work-from-home policies, households may have had more flexibility to seek a location where SFH were more available). All control variables except the share of SFH are in natural log scale.

3 Results

3.1 Increased migration from large to small cities

Refer to caption
Figure 1: Changes in relocation patterns after the COVID-19 pandemic started in terms of city population size and parental family availability. Panel (a) shows the changes in the probability z𝑧zitalic_z to relocate between (binned) city sizes before and during COVID-19 and suggests that movers from large cities were much more likely to relocate to small cities after the pandemic started than during the 2019 baseline period (red region in the bottom right corner). Panel (b) shows a binned scatterplot of our parental family availability proxy v𝑣vitalic_v in relation to log-population (blue, left vertical axis) and net in-migration changes (grey, right vertical axis) after COVID-19 shock. The dots represent the mean vertical axis values given the horizontal axis bins (with 50 discrete, equidistant bins along the v𝑣vitalic_v-axis in total); error bars represent the 95% CI of the means. The grey line and blue curves are fitted regression lines for the means (linear and order-2 polynomial, respectively) with the shaded regions corresponding to the 95% CI of the regression estimates. The grey vertical axis is log[y(θ=1)/y(θ=0)]𝑦𝜃1𝑦𝜃0\log\left[y(\theta=1)/y(\theta=0)\right]roman_log [ italic_y ( italic_θ = 1 ) / italic_y ( italic_θ = 0 ) ], which measures how much more (or less) of an attractor the cities in each bin became after the pandemic started (larger positive values indicate that on average the cities saw larger inflow per outflow after the pandemic started). Panel (c) shows a time-series of the average inflow per outflow of cities (log scale) grouped by quintiles of parental family availability v𝑣vitalic_v (shaded regions correspond to the 95% CI of the means), suggesting proportionally high increases in net in-migration to cities in the high v𝑣vitalic_v-quintiles after the COVID-19 shock in April 2020 compared to the corresponding time in the prior year.

We first present the changes in the probability to relocate to cities with log-population bin Psuperscript𝑃P^{\prime}italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT from an origin city with log-population bin P𝑃Pitalic_P. In Fig. 1, we visualize the ratio z(P;P,θ=1)/z(P;P,θ=0)𝑧superscript𝑃𝑃𝜃1𝑧superscript𝑃𝑃𝜃0z(P^{\prime};P,\theta=1)/z(P^{\prime};P,\theta=0)italic_z ( italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ; italic_P , italic_θ = 1 ) / italic_z ( italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ; italic_P , italic_θ = 0 ) as a function of P𝑃Pitalic_P and Psuperscript𝑃P^{\prime}italic_P start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT (see Section 2.4 and equation (4)). We select b=10𝑏10b=10italic_b = 10 equisized population bins to achieve granularity without significant sparsity (however, we find qualitatively consistent results for b=5,,10𝑏510b=5,...,10italic_b = 5 , … , 10). Fig. 1a reveals that there is a considerable increase in the probability for movers from large U.S. cities to migrate to small cities after the pandemic started (red region in the bottom right corner of Fig. 1a). At the same time, people from large cities were also less likely to migrate to another large city.

In numbers, our estimates derived from the Spectus data (see Section 2.3) indicate cities that were smaller than 500,000500000500,000500 , 000 in population had an influx of almost 52,0005200052,00052 , 000 people from cities that had over 500,000500000500,000500 , 000 in population in excess of what was observed during the baseline period in 2019. In total, between April 2020 – December 2020, these cities saw an increase in their net in-migration by 80 percent as compared to the same period in 2019 (60,0006000060,00060 , 000 versus 100,000100000100,000100 , 000). In other words, the excess influx from cities larger than 500,000 in population accounts for 95% of the increase in the net in-migration to these cities whose population is smaller than 500,000500000500,000500 , 000. Meanwhile, the top 10 largest cities saw twice as much net out-migration between April 2020 and December 2020 as compared to the same period in 2019 (82,00082000-82,000- 82 , 000 versus 45,60045600-45,600- 45 , 600).

If it is the case that part of this migration change is due to by family-driven migration, the increased migration to smaller cities may have followed as a consequence of the uneven distribution of family availability across the U.S. (McLeod, Kan, and López, 2023), where people in larger cities are less likely to have non-coresident family living nearby (hence would migrate elsewhere towards family). And because of the population distribution of cities in the U.S., in which the total number of people living across all cities with population size in a bin around P𝑃Pitalic_P decays with P𝑃Pitalic_P (Ioannides and Skouras, 2013), there is a bias in the direction of more extended family being located in cities with progressively smaller P𝑃Pitalic_P. To see if this is a valid line of inquiry (i.e., to check the premise), we determine if individuals or households made the decision to “move back home” to be closer to family at a higher rate once the pandemic started in comparison to the pre-pandemic period.

3.2 Micro-level dynamics of “moving back home”

Refer to caption
Figure 2: Results for the micro-level analysis of movers in the IPUMS data, showing the rates for individual movers to (a) move into their parents’ households (λ1subscript𝜆1\lambda_{1}italic_λ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, Type 1 movers in the microdata) and (b) move back to their place of birth (POB) from elsewhere but not joining their parents’ household (λ2subscript𝜆2\lambda_{2}italic_λ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, Type 2 movers). Panel (c) shows λ3subscript𝜆3\lambda_{3}italic_λ start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT, the rates for Type 3 moves, i.e., family household units moving back to their native place from elsewhere.

We analyze the rates λm(t)subscript𝜆𝑚𝑡\lambda_{m}(t)italic_λ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_t ), where m{1,2,3}𝑚123m\in\{1,2,3\}italic_m ∈ { 1 , 2 , 3 }, at which the three types of return-to-home migration we classified in the IPUMS USA microdata occurred in each year t𝑡titalic_t between 2016 and 2021 (for methodology, see Investigation 1, Section 2.5). Table 1 provides a summary of these move types and their corresponding λm(t)subscript𝜆𝑚𝑡\lambda_{m}(t)italic_λ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_t ). If parental households became a more important “insurance” destination under the COVID-19 crisis, we should expect to see a spike in λ1subscript𝜆1\lambda_{1}italic_λ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT in the year 2020. Similarly, we should see increases in λ2subscript𝜆2\lambda_{2}italic_λ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT and λ3subscript𝜆3\lambda_{3}italic_λ start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT after 2019.

In Fig. 2a, we indeed observe a jump in the Type 1 migration rate λ1subscript𝜆1\lambda_{1}italic_λ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT in 2020. However, this rate dropped back to pre-pandemic levels in 2021. A possible interpretation of this result is that individuals who were able to (and perhaps needed to) move back in with their parents did so promptly after the pandemic started.

Analyzing the demographics of Type 1 movers in our IPUMS USA samples, we find that they tend to be young (median age of 25) and have low income (mean income of $25,266). More than half of them did not have a college degree, and about 60 percent were employed. Interestingly, the mean and median income of movers, as well as the percentage of college degree holders, were higher in 2020 and 2021 than in pre-pandemic years.

On the other hand, λ2subscript𝜆2\lambda_{2}italic_λ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, capturing the rate for Type 2 moves in which individuals moved to place of birth from elsewhere, saw a decrease in 2020 but an increase in 2021 (Fig. 2b). While not attempting to provide an explanation, a possible interpretation is that those individuals who did not have the option to move in with parents waited until 2021 to move back to their native state. Compared to Type 1 movers, Type 2 movers tended to be older (median age 29) and with a higher mean income. Similar to Type 1 movers, the mean and median income of Type 2 movers, as well as the proportion of college degree holders, were higher in 2020 and 2021 than in pre-pandemic years.

In Fig. 2c, we observe an increase in λ3subscript𝜆3\lambda_{3}italic_λ start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT in 2020, and even a larger one in 2021, indicating that the pattern for family households to move back to their native states from elsewhere (Type 3 moves) increased in prevalence in 2020 and continued to do so in 2021. Plausibly, relocating an entire family household requires more logistical planning and “wait-and-see”, which could help explain the continued increase in the rate to move back to native place.

An interesting temporal-demographic dynamic we observe is that households that performed Type 3 moves in 2021 tended to have slightly older householders compared to prior years. Moreover, compared to prior years, a smaller proportion of family movers in 2020 and 2021 had eldest children who were younger than 5 or between 5 to 10 years old, but a slightly larger proportion of them had eldest children who were in their teens. In other words, families with young children were less likely to perform a move to native state during the pandemic, whereas those with older children (which would also indicate parents who are at the end of their child-rearing years) perform such moves at a marginally higher rate. This provides a modest support for the possibility that eldercare was a more prominent direction of care for the “sandwich generation” during the pandemic and, perhaps, a more important driver of family-related migration at the time.

Finally, we note that the demographics of movers here conform to our expectations (that they tend be inbetween their twenties and forties), which also helps to validate study assumptions that we rely on when constructing our family proxy variable v𝑣vitalic_v in our next analysis.

3.3 Cities with higher parental family availability observed larger positive changes in net in-migration

Up to now, we have shown that people moved more to smaller places and that moves towards place of origin also increased. In this section, we show in more detail that the parental family availability variable visubscript𝑣𝑖v_{i}italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is both negatively correlated with population size and positively correlated with an increase in net in-migration during the pandemic in comparison to the baseline pre-pandemic period (Investigation 2, see Section 2.6).

Before analyzing visubscript𝑣𝑖v_{i}italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, we explore how it relates to the only other known systematic quantity about distribution of family in the U.S. Namely, we check how visubscript𝑣𝑖v_{i}italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT relates to ϕisubscriptitalic-ϕ𝑖\phi_{i}italic_ϕ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, an estimate constructed based on a direct survey by McLeod, Kan, and López (2023) of the probability that an individual living in city i𝑖iitalic_i reports having non-household family nearby. We note that ϕitalic-ϕ\phiitalic_ϕ is not employed more broadly in this article because it is only available for 258 CBSAs, whereas our proxy v𝑣vitalic_v is calculated for all CBSAs in the U.S.

Following the methodology in  McLeod, Kan, and López (2023), we perform a modal regression of visubscript𝑣𝑖v_{i}italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT as a function of ϕisubscriptitalic-ϕ𝑖\phi_{i}italic_ϕ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. Modal regression identifies the typical behavior of a random variable as a function of some independent variable using a smoothing kernel, picking up functional relationships that traditional regressions may otherwise miss (Chen et al., 2016). The method constructs 2-d kernel density estimates (KDE) (Hastie et al., 2009) from our set of data points (ϕi,vi)subscriptitalic-ϕ𝑖subscript𝑣𝑖(\phi_{i},v_{i})( italic_ϕ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ). Then, conditioned on each value on the horizontal axis (ϕ)italic-ϕ(\phi)( italic_ϕ ), it calculates the conditional density of the KDE along the vertical axis (v𝑣vitalic_v), and extracts the local mode of the conditional density. These local modes of v𝑣vitalic_v are displayed as the white curve in Fig. 3 (values of KDE are visualized using a color scale). Fig. 3 shows that our parental family availability proxy v𝑣vitalic_v is monotonically related to ϕitalic-ϕ\phiitalic_ϕ, i.e. cities with larger ϕitalic-ϕ\phiitalic_ϕ also tend to have larger v𝑣vitalic_v.

Refer to caption
Figure 3: A heatmap showing normalized density of our parental availability proxy variable v𝑣vitalic_v conditioned on the general family availability ϕitalic-ϕ\phiitalic_ϕ obtained from McLeod, Kan, and López (2023) (see Sections 3.3 for discussion and methodology).

Now, relating visubscript𝑣𝑖v_{i}italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT to city population size in Fig. 1b (blue curve, left vertical axis), we find that parental family availability exhibits a decaying trend with city population (i.e., the share of households whose householders can be parents to adult children are larger in small cities than in large cities), which is consistent with the finding in McLeod, Kan, and López (2023) for general family availability ϕitalic-ϕ\phiitalic_ϕ.

Grouping cities by quintiles of parental family availability v𝑣vitalic_v, we see a notable increase in yi(t)subscript𝑦𝑖𝑡y_{i}(t)italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ), a measure of net flux into city i𝑖iitalic_i, for those cities in the top three quintiles right after COVID-19 broke out in the U.S. when compared to the corresponding time period the year prior (Fig. 1c). A similar pattern is not seen in the bottom v𝑣vitalic_v-quintile.

In Fig. 1b (grey line), we observe: a) an increasing relationship between visubscript𝑣𝑖v_{i}italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and log[yi(θ=1)/yi(θ=0)]subscript𝑦𝑖𝜃1subscript𝑦𝑖𝜃0\log\left[y_{i}(\theta=1)/y_{i}(\theta=0)\right]roman_log [ italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_θ = 1 ) / italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_θ = 0 ) ], and b) log[yi(θ=1)/yi(θ=0)]>0subscript𝑦𝑖𝜃1subscript𝑦𝑖𝜃00\log\left[y_{i}(\theta=1)/y_{i}(\theta=0)\right]>0roman_log [ italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_θ = 1 ) / italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_θ = 0 ) ] > 0 for cities with higher levels of parental availability. This indicates that, for those cities with larger parental family availability v𝑣vitalic_v, their net influx tended to be larger during the pandemic compared to before. (By definition, the larger yi(θ)subscript𝑦𝑖𝜃y_{i}(\theta)italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_θ ) is, the larger the inflow per outflow; hence, yi(θ=1)/yi(θ=0)>1subscript𝑦𝑖𝜃1subscript𝑦𝑖𝜃01y_{i}(\theta=1)/y_{i}(\theta=0)>1italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_θ = 1 ) / italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_θ = 0 ) > 1 indicates that inflow per outflow was larger during the pandemic than before.)

These results are consistent with our proposition that kin partially drove the migration changes that we see at the population level. However, other population effects may be at play. For example, lower population density and cost of living may be driving people to move to smaller cities which tends to coincide with larger visubscript𝑣𝑖v_{i}italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. To perform a more in-depth test of the situation, we next estimate the empirical model, described in Section 2.7, with these factors as control variables.

3.4 Empirical model

β𝛽\betaitalic_β (1) (2) (3) (4)
v𝑣vitalic_v (parental family availability) 0.783*** 0.666*** 0.681*** 0.631***
(0.066) (0.068) (0.067) (0.066)
ρasubscript𝜌𝑎\rho_{a}italic_ρ start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT
Population -0.018*** -0.010* -0.008
(0.004) (0.005) (0.005)
Population density -0.004 -0.007 -0.007
(0.006) (0.006) (0.006)
Median home value -0.026* -0.014
(0.014) (0.015)
Median income -0.096*** -0.118***
(0.036) (0.037)
Employment (jobs per person) 0.003 0.003
(0.003) (0.003)
Share of single-family homes 0.139**
(0.062)
Observations 1852 1852 1846 1846
R2superscript𝑅2R^{2}italic_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT Within 0.138 0.183 0.216 0.221
Adj. R2superscript𝑅2R^{2}italic_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT Within 0.137 0.180 0.210 0.215
* p <<< 0.1, ** p <<< 0.05, *** p <<< 0.01. Standard errors in parentheses.
Table 2: Empirical model (equation (9)) results showing the positive and statistically significant effect of parental family availability on changes in the net influx into a city after the COVID-19 shock relative to before. Model (4), which includes all of the control variables, estimates that a city that is 10 percentage points higher in v𝑣vitalic_v than another city would see a 6.5 percent higher (positive) change in inflow per outflow after the pandemic started. All models include city and time (before and after COVID-19 shock) fixed effects. All control variables except the share of SFH are in natural log scale.

We show the results of our empirical model (equation (9)) in Table 2. In all of the models with different sets of control variables, our coefficient of interest, β𝛽\betaitalic_β, is positive and statistically significant (bold numbers in Table 2). This strongly indicates that cities with larger parental family availability, v𝑣vitalic_v, saw a higher increase in inflow per outflow. The result from Fig. 1b suggested that having higher parental family availability led a city to one of three scenarios: i) in-migration was higher during the pandemic period compared to before; or ii) out-migration was lower during the pandemic period; or iii) a combination of the two previous scenarios. The regression results suggest that these scenarios would happen to a greater extent for cities with larger v𝑣vitalic_v compared to cities with smaller v𝑣vitalic_v. The complete model, Model (4), estimates that a city that is 10-percentage-points higher in v𝑣vitalic_v than another city would see a 6.5 percent higher (positive) change in inflow per outflow after the pandemic started.

The coefficients of the controls in each model align with our intuition of the general relocation behavior during the pandemic. For example, we expected that both population (Models (2) and (3)) and median home value (Models (3) and (4)) would have a significant negative effect on the dependent variable as movers sought less populated and cheaper destinations. We also find that the share of SFH has a positive effect on the dependent variable.

4 Discussion

Overall, our empirical results support the proposition that kin ties played a role in the shift in migration to smaller cities during the COVID-19 pandemic. To the best of our knowledge, such an attempt to connect pandemic-migration to non-coresident family has not been done. Our study adds to both the migration literature and the family ties perspective by showing that while socioeconomic and physical factors such as population density and cost of living may have been at play in pandemic-migration, the picture would be incomplete if family ties are neglected (Table 2). The migrational mechanism that this study casts light upon may help in migration modeling—for example, family ties or place of nativity for subpopulations could be incorporated in models such as the generalized gravity model for human migration (Park et al., 2018).

Qualitatively, if the migration decision process is thought of as “a hierarchically ordered set of values” or priorities (Miller, 1976), our study suggests that family became more highly ranked against other factors in a systemic crisis such as the COVID-19 pandemic. The fact that we see increased out-migration from large cities and increased migration towards family or place of nativity after the pandemic started supports previous literature findings (Miller, 1976) that economic aspirations and extended family proximity are in tension. At the same time, this would suggest that the comparative success of large cities (see, e.g., the literature on the scaling of productivity and innovation with city size (Bettencourt et al., 2007)) may come at a social and personal cost to individuals who have moved to these cities: they may have needed to replace relatively distant kin with local non-kin in their social network due to the cost of maintaining distant relationships, losing much of the remarkable support family provides—see also David-Barrett (2019) who explains this phenomenon.

Beyond these contributions, our study advances the emerging study of the demography-disasters nexus. As is argued by Karácsonyi, Taylor, and Bird (2021), perhaps even more important than enumerating death tolls, “the key to understanding impacts [of disasters] and avoiding them in the future is to understand the relationships between disasters and population change, both prior to and after a disaster.” Our observations linking city population to certain trends in population realignment show how the heterogeneity in the location of extended family across the U.S. is a source of vulnerability for cities. This heterogeneity, which existed prior to the pandemic, may be due to differences in demographic, socioeconomic, or infrastructural factors. Better social or institutional support for those lacking local non-coresident family could potentially help to neutralize the effects. On the other hand, this pandemic-migration may also contribute to irreversible changes in talent availability, real estate usage, and the growth of certain industries. In this regard, future research may focus on understanding these consequences in the long run.

At the destination cities, the prioritization of face-to-face interactions with family during the lockdown stages of the pandemic suggested by survey data (see Feehan and Mahmud (2021)) might have led to elevated transmission risks in these smaller cities and, when looked at together with other factors, may help to explain why these cities experienced comparatively worse epidemiological outcomes than in large, dense cities in later waves (Cheng et al., 2020; Koh et al., 2020; Pew Research Center, 2022). These epidemiological consequences can be long-lasting if we consider, e.g., the increased prevalence of long-COVID.

Our study is not without limitations. Most notably, part of our results rely on proxy variables of family because other large-scale data are not available that allow us to directly construct networks of movers and their family ties that also contain detailed geographic information. To this end, we combine multiple analyses at different levels in this study to provide more robust evidence of the effects of family on pandemic-migration. The lack of better data about extended family location is compatible with well-justified needs for individual privacy. At the same time, it does suggest that better sources of data that explore the spatial distribution of family across the U.S. are needed. The last systematic study of extended family across the U.S. was the now-discontinued National Survey of Families and Households; other surveys such as the Panel Study on Income Dynamics, although helpful, is not specifically designed to study non-coresident family patterns. Our results are mostly with respect to parental family, but the effects could be larger if we include other extended kin—here, again, new data would enable us to gain more insights (Furstenberg, 2020).

To summarize, in this study we present coherent empirical evidence, using multiple sources of data, that the increased preference to migrate to smaller cities may be partly driven by the increased migration towards non-coresident family, coupled with the heterogeneous distribution of family ties in the U.S. in which people are more likely to have family ties located in smaller cities (McLeod, Kan, and López, 2023). On a larger scale, our study amplifies ongoing literature highlighting the role of broader kinship systems (not limited to just the nuclear family) in macro-level socioeconomic phenomena (David-Barrett, 2019; David-Barrett et al., 2023; Furstenberg, 2020; Reed et al., 2023).

References

  • Bailey, Blake, and Cooke (2004) Bailey AJ, Blake MK, Cooke TJ (2004) Migration, care, and the linked lives of dual-earner households. Environ Plan A 36(9):1617—1632
  • Bettencourt et al. (2007) Bettencourt LM, Lobo J, Helbing D, Kühnert C, West GB (2007) Growth, innovation, scaling, and the pace of life in cities. Proc Natl Acad Sci 104(17):7301–7306
  • Castro and Rogers (1984) Castro LJ, Rogers A (1984) What the age composition of migrants can tell us. Popul Bull UN 1983
  • Chen et al. (2016) Chen YC, Genovese CR, Tibshirani RJ, Wasserman L (2016) Nonparametric modal regression. Ann Stat 44(2):489—514. https://doi.org/10.1214/15-AOS1373
  • Cheng et al. (2020) Cheng HY, Jian SW, Liu DP, Ng TC, Huang WT, Lin HH (2020) Contact tracing assessment of COVID-19 transmission dynamics in Taiwan and risk at different exposure periods before and after symptom onset. JAMA Intern Med 180(9):1156–1163
  • Choi et al. (2020) Choi H, Schoeni RF, Wiemers EE, Hotz VJ, Seltzer JA (2020) Spatial distance between parents and adult children in the United States. J Marriage Fam 82(2):822–840
  • Compton and Pollak (2014) Compton J, Pollak RA (2014) Family proximity, childcare, and women’s labor force attachment. J Urban Econ 79:72–90
  • Coven et al. (2023) Coven J, Gupta A, Yao I (2023) JUE Insight: urban flight seeded the COVID-19 pandemic across the United States. J Urban Econ 133:103489
  • David-Barrett (2019) David-Barrett T (2019) Network effects of demographic transition. Sci Rep 9(1):2361
  • David-Barrett et al. (2023) David-Barrett T, Diaz S, Rodriguez-Sickert C, Behncke I, Rotkirch A, Kertész J, Bravo L (2023) In a society of strangers, kin is still key: identified family relations in large-scale mobile phone data. arXiv preprint arXiv:2307.03547
  • David-Barrett (2023) David-Barrett T (2023) Network ecology of marriage. arXiv preprint arXiv:2310.05928
  • Dukhovnov and Zagheni (2015) Dukhovnov D, Zagheni E (2015) Who takes care of whom in the United States? Time transfers by age and sex. Popul Dev Rev 41(2):183–206
  • Dunbar and Spoors (1995) Dunbar RI, Spoors M (1995) Social networks, support cliques, and kinship. Hum Nat 6:273–290
  • Feehan and Mahmud (2021) Feehan DM, Mahmud AS (2021) Quantifying population contact patterns in the United States during the COVID-19 pandemic. Nat Commun 12(1):893
  • Furstenberg (2020) Furstenberg FF (2020) Kinship reconsidered: research on a neglected topic. J Marriage Fam 82(1):364–382
  • Haslag and Weagley (2022) Haslag PH, Weagley D (2022) From LA to Boise: how migration has changed during the COVID-19 pandemic. Available at SSRN 3808326
  • Hastie et al. (2009) Hastie T, Tibshirani R, Friedman JH, Friedman JH (2009) The elements of statistical learning: data mining, inference, and prediction (2nd ed). Springer, New York
  • Hurlbert, Haines, and Beggs (2000) Hurlbert JS, Haines VA, Beggs JJ (2000) Core networks and tie activation: What kinds of routine networks allocate resources in nonroutine situations? Am Soc Rev 598-618
  • Ioannides and Skouras (2013) Ioannides Y, Skouras S (2013) US city size distribution: robustly Pareto, but only in the tail. J Urban Econ 73(1):18–29. https://doi.org/10.1016/j.jue.2012.06.005
  • Kan (2007) Kan K (2007) Residential mobility and social capital. J Urban Econ 61(3):436–457
  • Karácsonyi, Taylor, and Bird (2021) Karácsonyi D, Taylor A, Bird D (2021) The demography of disasters: impacts for population and place. Springer, Cham
  • Koh et al. (2020) Koh WC, Naing L, Chaw L, Rosledzana MA, Alikhan MF, Jamaludin SA, Amin F, Omar A, Shazli A, Griffith M, Pastore R (2020) What do we know about SARS-CoV-2 transmission? A systematic review and meta-analysis of the secondary attack rate and associated risk factors. Plos One 15(10):e0240205
  • Krys et al. (2023) Krys K, Chun Yeung J, Haas BW, Van Osch Y, Kosiarczyk A, Kocimska-Zych A, Torres C, Selim HA, Zelenski JM, Bond MH, Park J (2023) Family first: evidence of consistency and variation in the value of family versus personal happiness across 49 different cultures. J Cross-Cult Psychol 54(3):323–339
  • Lambiotte et al. (2008) Lambiotte R, Blondel VD, De Kerchove C, Huens E, Prieur C, Smoreda Z, Van Dooren P (2008) Geographical dispersal of mobile communication networks. Physica A 387(21):5317–5325
  • Lee (2016) Lee Mj (2016) Matching, regression discontinuity, difference in differences, and beyond. Oxford University Press
  • Lee and Parolin (2021) Lee EK, Parolin Z (2021) The care burden during COVID-19: A national database of child care closures in the United States. Socius 7:23780231211032028
  • Lee, Lee, and Hartmann (2023) Lee B, Lee K, Hartmann B (2023) Transformation of social relationships in COVID-19 America: Remote communication may amplify political echo chambers. Sci Adv 9(51):eadi1540
  • McLeod, Kan, and López (2023) McLeod J, Kan U, López E (2023) Origins of face-to-face interaction with kin in US cities. arXiv preprint arXiv:2305.07944
  • Miller (1976) Miller SJ (1976) Family life cycle, extended family orientations, and economic aspirations as factors in the propensity to migrate. Sociol Q 17(3):323–335
  • Millington (2000) Millington J (2000) Migration and age: the effect of age on sensitivity to migration stimuli. Reg Stud 34(6):521–533
  • Missouri Census Data Center (2018) Missouri Census Data Center (2018) Geocorr 2018: geographic correspondence engine. https://mcdc.missouri.edu/applications/geocorr2018.html. Accessed 30 Jun 2023
  • Molloy, Smith, and Wozniak (2011) Molloy R, Smith CL, Wozniak A (2011) Internal migration in the United States. J Econ Perspect 25(3):173–196
  • Mulder (2007) Mulder CH (2007) The family context and residential choice: a challenge for new research. Popul Space Place 13(4):265–278
  • Mulder (2018) Mulder CH (2018) Putting family centre stage: ties to nonresident family, internal migration, and immobility. Dem Res 39:1151–1180
  • Park et al. (2018) Park HJ, Jo WS, Lee SH, Kim BJ (2018) Generalized gravity model for human migration. New J Phys 20(9):093018
  • Pew Research Center (2022) Pew Research Center (2022) The changing political geography of COVID-19 over the last two years. https://www.pewresearch.org/politics/2022/03/03/the-changing-political-geography-of-covid-19-over-the-last-two-years/. Accessed 5 Sep 2023
  • Plickert, Côté, and Wellman (2007) Plickert G, Côté RR, Wellman B (2007) It’s not who you know, it’s how you know them: who exchanges what with whom? Soc Netw 29(3):405–429
  • Reed et al. (2023) Reed MN, Li L, Pesando LM, Harris LE, Furstenberg FF, Teitler JO (2023) Communication with kin in the wake of the COVID-19 pandemic. Socius 9:23780231231199388
  • Rogerson and Kim (2005) Rogerson PA, Kim D (2005) Population distribution and redistribution of the baby-boom cohort in the United States: recent trends and implications. Proc Natl Acad Sci 102(43):15319–15324
  • Rözer, Mollenhorst, and Poortman (2016) Rözer J, Mollenhorst G, Poortman AR (2016) Family and friends: which types of personal relationships go together in a network? Soc Indic Res 127:809–826
  • Ruggles et al. (2023) Ruggles S, Flood S, Sobek M, Brockman D, Cooper G, Richards S, Schouweiler M (2023) IPUMS USA: Version 13.0 [dataset]. IPUMS, Minneapolis. https://doi.org/10.18128/D010.V13.0. Accessed 11 Oct 2021 and 20 Jun 2023
  • Shavit, Fischer, and Koresh (1994) Shavit Y, Fischer CS, Koresh Y (1994) Kin and nonkin under collective threat: Israeli networks during the gulf war. Soc Forces 72(4):1197–1215
  • Silverman (1986) Silverman BW (1986) Density estimation for statistics and data analysis, vol 26. CRC press, Boca Raton
  • Spectus (2021) Spectus (2021) Social impact. https://spectus.ai/social-impact/
  • Spring et al. (2017) Spring A, Ackert E, Crowder K, South SJ (2017) Influence of proximity to kin on residential mobility and destination choice: examining local movers in metropolitan areas. Demography 54(4):1277–1304
  • Tunçgenç, Mulukom, and Newson (2023) Tunçgenç B, Mulukom V van, Newson M (2023) Social bonds are related to health behaviors and positive well-being globally. Sci Adv 9(2):eadd3715
  • U.S. Bureau of Economic Analysis (2020) U.S. Bureau of Economic Analysis (2020) CAEMP25N Total full-time and part-time employment by NAICS industry. https://apps.bea.gov/iTable/?reqid=70#eyJhcHBpZCI6NzAsInN0ZXBzIjpbMSwyOV0sImRhdGEiOltbIlRhYmxlSWQiLCIzMyJdXX0=. Accessed 25 Nov 2021
  • U.S. Census Bureau (2020) U.S. Census Bureau (2020) 2016—2019 American Community Survey 5-year estimates. https://data.census.gov. Accessed 30 Jun 2023
  • U.S. Census Bureau (2021) U.S. Census Bureau (2021) County-to-county migration flows: 2015-2019 ACS. https://www.census.gov/data/tables/2019/demo/geographic-mobility/county-to-county-migration-2015-2019.html. Accessed 23 Sep 2022
  • U.S. Census Bureau (2021) U.S. Census Bureau (2021) Understanding and using the American Community Survey Public Use Microdata Sample files: what data users need to know. https://www.census.gov/programs-surveys/acs/library/handbooks/pums.html. Accessed 19 Jan 2024
  • Völker (2023) Völker B (2023) Networks in lockdown: the consequences of COVID-19 for social relationships and feelings of loneliness. Soc Netw 72:1–12
  • Wellman (1979) Wellman B (1979) The community question: the intimate networks of East Yorkers. Am J Soc 84(5):1201–1231
  • Wellman and Wortley (1989) Wellman B, Wortley S (1989) Brothers’ keepers: situating kinship relations in broader networks of social support. Sociol Perspect 32(3):273–306
  • White (2001) White L (2001) Sibling relationships over the life course: a panel analysis. J Marriage Fam 63(2):555–568

Acknowledgements

We thank Spectus for providing the relocation data, as well as Brennan Lake and Éadaoin Ilten for interfacing on behalf of Spectus. We are also grateful to Professors Noel D. Johnson, David W. S. Wong, and Sam G. B. Roberts for their helpful comments.

Ethics declarations

Ethical approval

We accessed our GPS-based relocation data under a strict agreement with Spectus; the agreement precludes attempts to de-anonymize or disaggregate the data. Spectus reviewed our current study prior to journal submission.

While the IPUMS USA data are publicly available, we follow Census Bureau principles by using these data solely for statistical purposes and not attempting to disaggregate or identify any individual within the data samples.

Informed consent

The Spectus data are collected from de-identified mobile-phone users who have opted in to provide access to their mobility data anonymously through a California Consumer Privacy Act (CCPA) compliant framework. The data provider, Spectus, always asks users to share their location before collecting it. Spectus also requires its application partners to disclose their relationship with Spectus and present users with options to opt out. Prior to sharing data with researchers, Spectus aggregates data to the county level. In order to further preserve privacy, Spectus discards data from counties with low thresholds of user counts.

The sampled individuals in the U.S. Census ACS are required by law to respond to the survey, but the Census Bureau is also required by law to protect respondents’ privacy, including ensuring that any identifiable information is removed before publicizing the data.

Competing interests

The authors declare no competing interests.

Authors’ contributions

Conceptualization: UK, EL; Methodology: UK, JM, EL; Software: UK, JM; Investigation: UK; Supervision: EL; Writing- original draft: UK; Writing- review and editing: UK, JM, EL.

Data availability

The U.S. Census, IPUMS USA, Geocorr, and BEA data sets used in this study are publicly available and can be downloaded from the respective organizations’ websites. The Spectus data were used under licence for the current study and are not publicly available. The code used to perform analyses of publicly available data during the current study is available at the online repository https://anonymous.4open.science/r/covid_family_migration.