Significance
Long ties, which connect individuals who lack mutual contacts in a social network, are known to be important for accessing valuable information about economic opportunities. Using population-scale communication data, we provide evidence of a robust positive relationship between the share of long ties and economic outcomes both at the individual and local levels in the United States and Mexico. Furthermore, we uncover that the formation of long ties is associated with disruptive life events such as migrating to another state or transferring to a different high school, which points to potential mechanisms responsible for forming and maintaining these valuable connections.
Keywords: social networks, weak ties, structural diversity, economic outcomes
Abstract
Social networks shape and reflect economic life. Prior studies have identified long ties, which connect people who lack mutual contacts, as a correlate of individuals’ success within firms and places’ economic prosperity. However, we lack population-scale evidence of the individual-level link between long ties and economic prosperity, and why some people have more long ties remains obscure. Here, using a social network constructed from interactions on Facebook, we establish a robust association between long ties and economic outcomes and study disruptive life events hypothesized to cause formation of long ties. Consistent with prior aggregated results, administrative units with a higher fraction of long ties tend to have higher-income and economic mobility. Individuals with more long ties live in higher-income places and have higher values of proxies for economic prosperity (e.g., using more Internet-connected devices and making more donations). Furthermore, having stronger long ties (i.e., with higher intensity of interaction) is associated with better outcomes, consistent with an advantage from the structural diversity constituted by long ties, rather than them being weak ties per se. We then study the role of disruptive life events in the formation of long ties. Individuals who have migrated between US states, have transferred between high schools, or have attended college out-of-state have a higher fraction of long ties among their contacts many years after the event. Overall, these results suggest that long ties are robustly associated with economic prosperity and highlight roles for important life experiences in developing and maintaining long ties.
Social networks can shape access to information and opportunities (1–4) and, thus, economic prosperity for individuals and places. Likewise, economic activity can affect the formation and maintenance of social ties. Such reciprocal causal relationships can produce robust and even self-reinforcing associations between network structure and economic prosperity, both for individuals and geographic regions. Indeed, recent empirical work relying on digital interaction data at a population-wide scale has documented some notable associations between the characteristics of social connections in a region and economic outcomes in that region. In particular, places with a higher structural and geographic diversity of connections have better economic outcomes (5–8). For example, the relative frequency of Facebook friendship links between every county-pair in the United States is a strong predictor of socioeconomic characteristics such as income, education, teenage birth rate, or life expectancy at the county level (8).
These aggregate associations between network structure and economic propensity are consistent with a broader theoretical and empirical literature on network structure. Long ties, which connect individuals who do not share any mutual contact (i.e., local bridges, ties with embeddedness of 0), are thought to facilitate the circulation of novel and valuable information and behaviors between multiple communities (9–14). Structurally diverse networks, which comprise numerous long ties, can provide individuals with an information advantage which leads to better managerial performance (11, 15–18), a higher propensity to find jobs (19, 20), more innovative teams (21–23), and faster adoption of new technologies (13, 24–26). Despite the extensive literature arguing for benefits of structural diversity (i.e., having a network with more long ties) in specialized professional contexts, the relationship between such network structures and general measures of economic prosperity is underexplored. Population-scale evidence has largely relied on ecological regressions with network structures and economic outcomes aggregated to telecommunication (6) or administrative regions (8), and association with network structure has often been characterized with univariate correlations and/or by combining multiple measures of network structure into a composite index (6). Furthermore, the question of who has more long ties and why they form and maintain these ties is largely unexplored despite their importance.
We have two main contributions: establishing a robust association between long ties and important measures of economic prosperity and linking major, disruptive life events to formation of long ties. First, we present population-scale evidence on the relationship between structural diversity and socioeconomic outcomes. Individual Facebook users in the United States with more long ties have better outcomes along four proxies of socioeconomic well-being, including living in richer zip codes, having more internet-connected devices, more expensive phones, and donating more often to charitable fundraising campaigns. Furthermore, we examine the additional role of tie strength, which captures the intensity of interactions between individuals. Individuals with stronger long ties tend to have better economic outcomes, even after accounting for the prevalence of these ties and their total intensity of interactions. Both of these results are also reflected in geographically aggregated results in the United States and Mexico. Structural diversity appears to importantly characterize the networks of higher-income people and places.
Given the robust association between long ties and economic outcomes, we then probe some potential origins in specific life events. Few studies have focused on the determinants of long ties. Existing works have shown that brokers, who connect two otherwise disconnected communities by filling a structural hole, are less risk-averse, more entrepreneurial, and more prone to change (10, 27, 28). They have a higher ability to integrate with various social groups and form structurally diverse networks arguably because they are less likely to define themselves and others in terms of socially distinct categories (29, 30). While these studies suggest that personality traits are associated with the presence of long ties, personality traits are affected by life experience (31) and network measures related to long ties have large nonheritable components (32, 33). We therefore study life events that expose individuals to multiple communities and push them to establish connections with individuals who might be very different from their previous contacts: i) migrating to another state, ii) transferring between high schools, and iii) attending an out-of-state college. In all three cases, experiencing such disruptions is associated with having more structurally diverse networks years later, even excluding the ties most directly implicated (e.g., ties to alumni of the involved schools). Though we flexibly adjust for covariates, we cannot attribute these associations entirely to causal effects of these events; however, in the case of transferring between high schools, a quasi-experimental analysis focusing on school closures yields qualitatively similar results. Together these results provide a view of long ties as consistently linked to disruptive life events, even long after those events.
Long Ties and Economic Outcomes
How does structural diversity relate to economic outcomes? To answer this question, we first count all the public comments posted on Facebook between December 2020 and June 2021 between users living in the United States to form a network of interactions between users; this is the network used throughout the paper (Materials and Methods for details). As a first measure of structural diversity, we compute the fraction of each individual’s ties that are long ties (i.e., lack any mutual network neighbors). We begin with a brief presentation of geographically aggregated results in the United States and Mexico, before highlighting our main results at the individual level in the United States.
Pooling the networks of all Facebook users living in the same region provides an overview of variation in this measure of network structure (Fig. 1A and SI Appendix, Fig. S3A). The least structurally diverse regions with available data, such as Appalachia in the United States or Zacatecas in Mexico, are economically disadvantaged. Zip codes in the United States with more long ties have higher household incomes (Fig. 1B), even after flexibly controlling for their overall population and racial composition, the number of Facebook users residing in the zip code, and their average degree (Materials and Methods for details). This positive relationship holds for additional measures of economic prosperity in the United States, including social mobility and unemployment rates, and for a wealth index in Mexico (SI Appendix, Fig. S4).
Fig. 1.

(A) Fraction of long ties, measured across counties in the United States, after controlling for average degree. The adjusted (residualized) fraction of long ties across administrative units in each country is binned into 10 deciles, with lower values shown in purple and higher values shown in green. Administrative units with less than 200 Facebook users sufficiently active over the measurement period or with less than 500 individuals are shown in white. (B) The fraction of long ties and (C) the tie-strength weighted fraction of long ties predict median income in US zip codes. The binned regression plots are generated according to models (6) and (7) in Materials and Methods, which adjust for the number of nodes in the zip code, its racial composition, and average degree; the regression with weighted fraction of long ties also adjusts for the fraction of long ties and the sum of the weights. Solid lines are local smoothers of second degree; bars are 95% confidence intervals, which are cluster-robust at the county level.
The findings from ecological regressions such as the analysis above do not necessarily imply a specific individual-level association. Thus, our primary analyses examine associations at the level of individuals’, rather than places’, networks. As typical measures of economic outcomes (e.g., income, wealth) are not available at an individual level linked to Facebook users, we rely on proxy measures of economic prosperity (34). For individuals living in the six most populous US states, we divide the fraction and weighted fraction of long ties each into 12 equal-sized categories and match on degree, level of daily activity, gender, and age (Materials and Methods for details). Individuals with a higher fraction of long ties live in zip codes with higher median family income, use more distinct devices to access Facebook, and donate more often to fundraising campaigns on Facebook (Fig. 2), consistent with previous findings that richer people donate more often (35). They similarly use more expensive mobile phones and have visited more countries in short trips (SI Appendix, Fig. S6).
Fig. 2.

(A) Fraction of long ties and (B) tie strength weighted fraction of long ties in ego networks versus individual-level variables that are positively correlated with proxies for economic outcomes for users who reside in the Top 6 most populous states in the United States. Users are matched on degree, average degree of their contacts, daily login activity, average daily login activity of their contacts, age, and gender; see Materials and Methods. The matching for the bottom row additionally incorporates the fraction of long ties and average edge weight. Values in the y-axis correspond to the difference with the baseline represented with a value of zero. Bars are 95% confidence intervals for these differences assuming independent observations; robustness to within-state dependence is illustrated by the multiple state-specific results.
By using a network weighted by the number of interactions, we examine the interplay between structural diversity and tie strength. Weak ties, which connect people who infrequently interact with one another, are often described as providing access to more novel—and thus incrementally valuable—information than strong ones (9). However, while weak ties tend to be long ties in general, there is substantial variation, and even some very long ties spanning distant parts of a network can be strong (36). This motivates an explicit distinction between tie strength and tie length (10, 18, 37). So, as a second measure of structural diversity, in addition to the fraction of long ties, we compute a tie-strength weighted fraction of long ties; Materials and Methods for details. When comparing individuals with a similar fraction of long ties, those with stronger long ties (i.e., higher values of weighted fraction of long ties) live in higher-income zip codes, use more unique devices, and make more donations (Fig. 2B). Similar results hold for additional proxies (SI Appendix, Fig. S6B), and at a geographically aggregated level for median income in the United States (Fig. 1C) and other economic outcomes in the United States and Mexico (SI Appendix, Fig. S4).
To jointly examine the contribution of the fraction and weighted fraction of long ties, we fit random forests (38) that use these fractions and other covariates to predict each of the economic outcomes. The average marginal change in the predicted outcomes from increasing the fraction and weighted fraction of long ties are positive and are of similar magnitude to each other (Fig. 3) where averages are computed over US states, covariates, and the distribution of the long tie measures. These results are quite consistent in direction when considering US states individually (SI Appendix, Fig. S7) suggesting that having long and strong ties provides a robust signal for predicting economic prosperity.
Fig. 3.
Estimated average marginal change in measures of individual economic prosperity given a 1% point (0.01) increase in the fraction or weighted fraction of long ties. These are computed from random forest models fit to each state and DC and including degree, average degree of contacts, daily login activity of the user, average daily login activity of their contacts, average weight of user’s edges, age, and gender. Bars are 95% confidence intervals that are cluster-robust at the state level.
Taken together, we find evidence that people and places with more long ties and stronger long ties are more economically prosperous. Having stronger long ties, rather than weak ties per se, predicts economic prosperity.
Life Events and Long Ties
Having established the existence of a robust cross-sectional relationship between the prevalence of long ties and economic prosperity, we turn our focus to processes responsible for their formation. We study three life events, each involving mobility between institutions and/or places, that we hypothesized would enable individuals to build connections to disparate communities: migrating to a new state, attending an out-of-state college, and transferring to a different high school. In all three cases, we compare the structural diversity of the network—many years later—surrounding individuals who experienced such events with those who did not.
Interstate Migration.
To study interstate migration and structural diversity, we examine Facebook users born in the United States and who have lived in the same state since 2012. We compare the fraction of long ties for individuals who were born in a different state from the one they currently live to that for individuals living in the state where they were born, matching on degree, age, gender, and home county income (Materials and Methods for details). Interstate migrants have on average 13% more long ties relative to nonmigrants (33.7% vs. 29.7% in absolute percentage terms of all their contacts with difference-in-means P < 10−10 using cluster-robust standard errors at the county level). While this difference is in part driven by migrants having accumulated long ties in their previous state, we also find that migrants have 4.0% more long ties with residents of their current state relative to nonmigrants (28.7% vs. 27.6% in absolute percentage terms with difference-in-means P = 0.011 using county-level cluster-robust standard errors). To account for differences in ego network sizes, we compare migrants’ propensity to form long ties with that of nonmigrants who have a similar degree. Migrants consistently have more long ties than nonmigrants with a similar degree (Fig. 4A). This difference in structural diversity between migrants and nonmigrants increases with the number of contacts and persists when we consider only long ties to contacts living in their current state. Thus, interstate migrants form long ties at a higher rate than similar nonmigrants independently of their number of contacts, and even in the state where they currently live.
Fig. 4.
Conditional on degree, people who (A) are interstate migrants or (B) attended multiple high schools have more long ties than matched controls. Qualitatively similar relationships hold when restricting the analysis to less directly implicated ties—those within the current state (A inset) and those outside of high school (B inset). All estimates are poststratified by gender, age, and hometown county income bins.
Out-of-State College.
To analyze attending an out-of-state college or university, we study Facebook users born in the United States and who attended college outside of their home state, and compare them with users of the same cohort who attended college within their home state. Attending an out-of-state college is associated with having a higher fraction of long ties years after attending college, and the effect persists even when we focus on connections to friends made outside of college and when we match on whether they are currently interstate migrants (SI Appendix, Fig. S12). See SI Appendix, section D.4 for more detailed analysis of this life event.
Transferring High Schools.
Adolescence involves learning how to expand one’s social network (39), and therefore disruptive experiences during this period are likely to shape an individual’s network structure later in life (40). We focus on Facebook users who were born between 1961 and 1991 in the United States, went to high school, and are currently living in the United States. We then compare the ego networks of users who report attending multiple high schools within the same state with those who attended a single high school (thus ruling out the effect of interstate relocation), while matching on degree, age, gender, and income of their home county ( Materials and Methods for details). People who transferred to a different high school on average have 10% more long ties relative to those who attended a single high school (35.2% vs. 31.9% in absolute percentage terms of all their contacts with difference-in-means P < 10−10 using cluster-robust standard errors at the county level). Some of this difference could be due to an increase in the immediate opportunity of meeting new friends. We therefore conduct an analysis restricted to long ties formed with contacts outside of high school, finding a fraction of long ties for users who transferred to a different high school that remains 9% larger relative to those who did not transfer (34.8% vs. 31.8% in absolute terms with P < 10−10 using county-level cluster-robust standard errors). Users who attended multiple high schools consistently form more long ties across all values of the ego network degree (Fig. 4B), even when we consider only contacts outside of high school.
Even conditional on covariates, any causal effects of transferring high schools (like the other two life events) on network structure are likely confounded with other unobserved variables. Here, we consider some plausibly exogenous variation in transferring high schools by identifying high schools that closed prior to 2016. Among attendees of these high schools around the time of the closure, later cohorts report that they have attended multiple high schools 20% more relative to earlier cohorts (17.3% vs 14.3% in absolute percentage terms); both fractions are high, likely reflecting various factors affecting transfers prior to outright closure. Within this restricted sample, and matching on school and age, we find people in the closure-affected cohorts have 6.1% more long ties relative to before-closure cohorts (46.4% vs. 43.7% in absolute percentage terms, Fig. 5). In a linear model with fixed effects for school and age, we estimate that being in a closure-affected cohort increases the fraction of long ties by 1.2 percentage points (95% CI [0.4, 1.9], SI Appendix, Table S1). Similar results hold when considering only the ties made outside high school. Together, these findings suggest that transferring to a different high school shapes the proclivity to connect with different communities.
Fig. 5.
People who attended a high school during its closure have more long ties than those who attended the same high school prior to closure. The estimates are poststratified by school and age. Bars are bootstrap 95% confidence intervals that are cluster-robust at the school level.
Finally, in SI Appendix, section D.6 we provide some evidence that individuals having each of these life events above is also associated with better economic outcomes measured with the same proxy variables as in prior analyses (SI Appendix, Figs. S14, S15, and S16).
Discussion
There is an extensive literature on the role played by social network structures in constituting social capital (2, 34, 41–43). A long tie is potentially a source of information not as directly available through any other contact, therefore individuals with structurally diverse networks ought to have an informational advantage (10, 11, 15–18, 27). Our study documents a strong cross-sectional relationship between long ties and measures of economic prosperity at both geographically aggregated and individual levels. The literature has sometimes equated weak ties and long ties, for example, by treating the number of mutual contacts as a measure of tie strength (44–46), such that long ties are all counted as “weak”. Our measure of tie strength allows us to clarify that it is long ties and stronger long ties (rather than weak ties per se) that are particularly associated with prosperity, consistent with the mechanisms posited by both Granovetter (9) and subsequent work (10, 18). Our results suggest that tie strength and length should be treated as conceptually distinct dimensions, each with its own links to economic prosperity.
Prompted by this link between long ties and economic outcomes, we consider the origins of long ties and factors that enable people to create and maintain these ties. Seminal work by Simmel (47) suggests that individuals with long ties are mediators who can resolve potential conflicts between disconnected communities due to their unique involvement with both communities. Simmel discussed the ability of these mediators as a natural consequence of their structural position in the network; however, it is perceivable that certain individuals have a higher tendency to occupy these network positions (SI Appendix, Fig S8). The existing literature on the origins of long ties has characterized the personality of individuals with long ties to be entrepreneurial, risk-taking, and thriving on change (28, 29, 48).
The studies mentioned above treat the determinants of long ties as a matter of personality or even genetic heritability (32), rather than a skill or disposition to be acquired, even though many of the personality correlates of long ties can develop through experiences. We examine the link between experiencing major life disruptions—interstate migration, transferring high schools and attending an out-of-state college—and the formation of these ties. The experiences from these major disruptions are similar in the sense that they expose individuals to diverse communities, require them to form new ties with others who are different than their old network, and potentially cause development of the social skills to do so. We find that individuals who experienced such disruptions have more structurally diverse networks years later, in ways that are not mechanically due to geographic mobility or exposure to a new community during high school or college. These findings suggest that the tendency to form long ties is not only a matter of innate personality but also life experiences. Shaping one’s personality and acquiring the necessary social skills is one potential mechanism behind the effect of these disruptive life experiences.
These results are consistent with findings showing that the benefits of long ties come from oscillation between making deep investments and brokering in multiple communities (49). Brokers with network oscillation have developed the skill for successfully moving in and out of groups, much similar to the experience of migration or relocation during high school:
“Experience with change is preparation for change... [Oscillators] can be expected to be flexible in moving between identities ... and so develop the adaptive self-monitoring associated with network brokers. The image that comes to mind is people who grew up in multiple countries, or in families that frequently moved between cities.” (49, p. 387]
Our findings have important limitations. First, we use communication activity on Facebook as a measure of social networks more generally (i.e., interaction offline and via other media). Our choice for network edges, i.e., reciprocal interactions, is meant to limit the network to meaningful ties; nevertheless, it is possible that our network measurements do not accurately capture off-Facebook interactions that confer economic advantage. Second, while the analyses of network structure and economic outcomes flexibly adjust for or match on relevant covariates, some of the observed associations can be due to unobserved factors. Furthermore, the observed associations likely reflect causal processes in both directions; that is, social networks affect economic outcomes, and also economic prosperity affects network formation and maintenance. For example, wealthier people may be more able to form and maintain long ties (by, e.g., traveling, dining out). Similarly, selection into some professions can lead to more opportunities to form and maintain long ties. There may also be important feedback between processes in these two directions if wealth enables forming long ties, which are then advantageous economically, etc. Our observational data cannot unambiguously disentangle these mechanisms. Overall, our results fit with the growing literature on benefits of long ties in specific settings and outcomes, and some of these studies rely on randomized interventions that affect network structure (46). We regard both causal directions as of interest for further study.
While the three disruptive life events we studied were selected because we hypothesized they would increase structural diversity, there can also be confounding factors between groups here. However, having three different types of events—involving different processes by which people are selected into the disruption—pointing to the same conclusions makes for a more robust and notable pattern. In addition, as we examine the network interactions of an individual many years after experiencing a disruptive event, we rule out endogeneities occurring on a short time scale. Finally, we argue that transferring high schools, especially when due to high school closure, is the most plausibly exogenous event. Transferring high schools is often either a decision made by parents on behalf of students or results from external factors outside a student’s control (50–52). It may even result from their current school closing, which we also found is associated with forming more long ties. Taken together, we believe that our findings on the role of disruptive life events are unlikely to be fully attributed to confounding.
Understanding the origins and consequences of long ties can have important implications for public policy. Intergenerational economic mobility has been declining in the United States (53), but long ties (SI Appendix, Figs. S4 and S5) and ties spanning class boundaries are associated with higher economic mobility (34). Some of those interclass ties are likely attributable to the design of institutions, as, e.g., smaller high schools have more such ties (43). More generally, we posit that dispositions to form long ties—developed in part through the kind of disruptive life events studied here—are upstream of many ties that cut across groups, including interclass ties. Policies aimed at fostering structural diversity could have a positive impact on social mobility and economic prosperity more generally. For these reasons, we hope that this study will encourage further research into the mechanisms of formation of long ties, particularly as it pertains to skills that can be taught, and their causal links with economic prosperity.
Materials and Methods
First, we describe the data and methods for examining the link between long ties and economic outcomes at the individual level, with additional notes for the geographically aggregated level. We then explain the data collection and analysis of the determinants of long ties.
Individual-Level Long Ties and Economic Outcomes.
Ego networks.
For each user currently residing in the United States, we construct their ego network from public commenting communication data spanning a period of 6 mo from December 2020 to June 2021. An ego network includes all the edges between the user (the ego) and their contacts (alters) and the edges among contacts themselves. In the commenting network, there is a directed edge between two users if one has mentioned or replied to a post made by the other. In constructing the ego networks, we retain only reciprocal edges (with at least one comment exchanged in each direction) to exclude one-off interactions between individuals who would be otherwise unrelated. Furthermore, nonreciprocal edges have been shown to be less important in diffusion and peer influence processes in networks (54, 55). To ensure a minimal level of activity by each user, we discard those with degree less than 10 (have communicated with less than 10 people over the 6-mo period). Commenting activities are more appropriate for the construction of the network than the actual friendships on Facebook for two reasons: commenting contacts are more likely to be strong and relevant to each user, as opposed to many friends on Facebook who are only acquaintances without any meaningful interaction over a long period. But more importantly, Facebook users extensively use commenting as a form of communication as there are about 195 million daily active users on the Facebook website or its mobile application in the United States and Canada.*
Measuring long ties.
Structural diversity is a purely topological concept that designates a multiplicity of local communities, but which has been operationalized in multiple ways, even within a single study (13). The fraction of long ties, those without any mutual contacts, is a conceptually simple and computationally practical operationalization. Long ties are sometimes also called local bridges and are those ties that have an edge embeddedness of 0 (56, §3.3]. Long ties closely resemble the widely used concept of structural holes or social bridges and their overall prevalence may indicate the extent of novel information received.
The local clustering coefficient is a related measure to the fraction of long ties and has been widely used as a node-level measure of embeddedness (56). While the clustering coefficient aggregates edge embeddedness, the fraction of long ties would be a more appropriate measure for the frequency of completely unembedded ties. Thus, our focus is on the fraction of long ties; however, we reached similar conclusions using the clustering coefficient.
Given an ego network, we compute the fraction of long ties among all ties from the ego to their contacts:
| [1] |
where Ni is the set of i’s neighbors, I(.) is the indicator function, and mcij is the number of mutual contacts between i and j.
We also consider what fraction of total interaction by an individual occurs along long ties. The normalized count of exchanged comments is our measure of (normalized) tie strength:
| [2] |
where tij is the number of comments i has exchanged with j in the 6 mo measurement period and ti = ∑j ∈ Nitij is the total number of comments user i has exchanged with all their contacts. Similar definitions of tie strength are widely used [e.g., refs. 57 and 58], and are known to correlate with other survey-based measures of tie strength (59, 60).
Given this definition of tie strength, a simple and intuitive measure for the strength of long ties is their weighted sum. The analysis on the strength of long ties uses the weighted fraction of long ties in which each long tie is weighted by its normalized tie strength defined in Eq. 2:
| [3] |
Both li and liw can range from 0 to 1.
Economic measures.
Direct measures of economic well-being are not available at the individual level linked to Facebook users. However, various proxy measures that are related to economic outcomes are available at the individual level. Our analysis relies on the following three proxy outcome variables in the main text:
-
1.
The median and mean household income of the user’s residential zip code: The income value is obtained from 2018 ACS and the user zip code of residence is based on a prediction that incorporates various pieces of information including internet connection. Note that this is not strictly an individual-level outcome.
-
2.
Unique devices: A user connection to Facebook logs their device model. We count the number of unique devices a user has used more than once to connect to Facebook from 2019 to 2021.
-
3.
Number of donations to fundraisers: Facebook has a personal fundraising platform where each user can request for financial assistance either for personal reasons or to a charity organization. Since 2016 when fundraisers launched to 2021, about $5 billion dollars have been donated through Facebook fundraising platform, involving about 85 million people.† We count the number of times a user has made a donation to a fundraising campaign over a 2-y period from June 2019 to June 2021. We expect that users with a higher disposable income tend to make more donations.
In SI Appendix, we provide further evidence of this relationship by examining the following two proxy outcomes:
-
1.
The user’s phone price: A user connection to Facebook logs their mobile phone model. We match their phone model against a database of phone prices in dollars.
-
2.
The user’s international touristic trips: The number of countries, other than Canada and Mexico, visited by a user in trips shorter than two weeks since 2012 captures the user’s tourism activity. A visit to a country is inferred based on the connection IP of the user while on a trip.
Matching and estimation.
We compare individuals with different values of the focal structural diversity measures but similar values of other covariates. We discretize the fraction (or weighted) long ties into 12 equal-sized levels. Generalized full matching (61) is an appropriate matching framework here as it allows for more than two treatment arms (12 in our case), assigns all units to exactly one matching group, and is computationally tractable. Fig. 2 shows the difference between each level of fraction of long ties or weighted long ties and the fourth bin which is used as the base level. The matching is done once per each state, rather than the whole country, which uses only the data from users whose predicted current residence is in that state. This allows us to discover potential heterogeneities that might exist between the states.
The Euclidean distance metric in the matching procedure is studentized so that different variables with different ranges contribute equally to the distance metric. It is constructed based on the following variables when the independent variable is the fraction of long ties (Top row of Fig. 2):
-
1.
Log user degree and log average degree of their contacts. These two variables ensure comparisons are made between ego networks of similar sizes.
-
2.
Daily login activity of the user, measured in terms of the number of days over a month with a login, and the average daily login activity of their contacts. These two variables ensure that the ego and alters have similar levels of activity, which might affect the fraction of long ties in the ego network.
-
3.
Age and gender of the user.
When the independent variable is weighted long ties (bottom row of Fig. 2), the distance metric incorporates the following additional variables:
-
1.
Log average weight of user’s edges (). This ensures that matched users have similar levels of commenting activity since, as this affects the feasible ranges of weighted long ties liw
-
2.
Fraction of long ties li.
Predictive model.
SI Appendix, Section B.1 in reports on a different estimation approach based on the average marginal effect of fraction of or weighted long ties in a random forest model. Fig. 3 reports summaries of these models fit to data from each state. The results of this alternative procedure are consistent with the matching results presented above.
Geographically Aggregated Long Ties and Economic Outcomes.
Data on network interactions.
These analyses use the same communication network as in the individual-level analyses. For each Facebook user, we also observe the predicted city of residence, which relies on the user’s information and activity, for example, their self-reported city on their profile and their internet connection information. These residential predictions allow us to match users to zip codes, counties, and states and construct the corresponding zip code network, consisting of users residing in the zip code.
Construction of geographically aggregated networks.
The nodes in each zip code network consist of all users who reside in that zip code and all their contacts. (Throughout we focus on zip codes, though the maps in Fig. 1A and SI Appendix, Fig. S3 use aggregation to counties with otherwise identical logic.) As with the individual-level analyses, we consider only reciprocal ties to filter out one-off communication events that do not signify a meaningful relationship. Thus, the existence of an edge in the zip code network means there has been at least one communication event in each direction. As opposed to the individual-level results which used directed networks, edges in the aggregate zip code networks are undirected with weights that correspond to the total number of communication events in either direction from December 2020 to June 2021. SI Appendix, Fig. S1 shows a schematic view of the zip code networks. In our analysis, we consider only zip codes with at least 150 users who reside in that zip code and for which we have census data available. This leaves us with 26,429 zip codes in the United States (excluding PO Boxes or businesses) and 15,157 zip codes in Mexico for which we have both network and economic indicator data. SI Appendix, Fig. S2 shows the histogram of some basic characteristics of these networks in the United States, along with a few economic indicators.
Measuring long ties in geographically aggregated networks.
We use the fraction of long ties among all edges that involve at least one node inside the zip code as our primary measure of zip code structural diversity:
| [4] |
where Z is the set of nodes residing inside the zip code. These long ties are shown with dotted black lines in SI Appendix, Fig. S1.
As with the individual-level analysis, we also define a version of this measure weighted by tie strength:
| [5] |
Since the measure of tie strength is normalized, lzw ranges between 0 and 1 and can be treated as weighted fraction of long ties. SI Appendix, Figs. S4 and S5 compare both lz and lzw measures against zip code outcomes.
Data on economic indicators.
In the United States, the zip code outcome variables consist of a set of economic indicators obtained from the census 5-year estimates of the 2018 American Community Survey (ACS) at the level of zip code tabulation areas (ZCTA). The economic indicators we consider here are median household income, fraction of households with income below $25K, and the unemployment rate. We also consider measures of social mobility provided by the Atlas of Opportunity project at the ZCTA level (62):
-
1.
the probability of a child born in the zip code and from the bottom 25% of the income distribution in the United States reaching the top 20% of the income distribution and
-
2.
the percentile income rank of a child born in the zip code from the bottom 25% of the income distribution.
Finally, as a robustness check, we replicate these analyses in Mexico. Due to the lack of income data at the zip code level, we constructed a wealth index for each zip code using the first principal component of the following variables from the 2020 census: percentage of illiterate population aged 15 or more, population aged 6 to 14 not attending school, population aged 15 or more without complete elementary school, population with no access to health services, log of average occupants per room, and percentage of households with dirt floor, no toilet, no piped water, no sewer system, no electricity, no washing machine, or no fridge. The wealth index for each zip code is the value of the first component rescaled to between 0 and 1. Effectively, our outcome indicator in Mexico is a summary measure on the intensity of economic, health, and education prosperity.
Covariates.
There is variation in the size of zip code networks in ways that can influence the range of reasonable values for lz and affect its statistical relationship with the outcomes. Thus, we control for the following variables in our analysis: population of the zip code, number of network nodes residing in the zip code, and mean number of edges from a node inside the zip code (mean degree). All the economic indicators we consider are correlated with racial composition of the zip code. Thus, to ensure that our network measure explains any outcome variation above and beyond racial composition, we also control for the racial composition (percent white) in each zip code. The data from Mexico do not include zip code-level racial composition, so the model for Mexico simply controls for the network covariates mentioned above.
Our second goal is to establish the relationship between the strength of long ties and the economic outcomes given a fixed number of long ties. Since lzw is highly correlated with the unweighted measure of structural diversity, lz, our analysis must control for it. Thus, our second analysis controls for the fraction of long ties, lw, in addition to those mentioned above.
Model and estimation.
To enable discovering potential nonlinearity in relationships between the zip code outcome and network structure, we use a modern binscatter method, binsreg (63). In addition to visualizing the data with cluster-robust confidence intervals at each bin, binscatter provides a principled way to adjust for our control variables such that it does not suffer from problems of residual-based approaches (64).
Binscatter regression with covariate adjustment fits the data according to a linear model on the covariates. In order to avoid any assumption on the functional form of the covariates, they are binned and the model includes an indicator for each bin. For example, the model on the relationship between the zip code outcome and the fraction of long ties is:
| [6] |
where yz is an outcome measure (e.g., zip code mean household income), lz is the fraction of long ties, pz is the zip code population bin, nz is the binned number of nodes inside the zip code, dz is the binned average degree of nodes inside the zip code, and rz is the percent white in the zip codes. The model attempts to make comparisons between networks of similar sizes by controlling for the combination of nz and dz, such that dz bins are determined conditional or within the nz bin. Binscatter regression reveals the shape of function μ(.) and computes the standard errors of each bin. Overall the model above includes 87 indicators as covariates.
We employ a similar approach for the model on the relationship between zip code outcomes and the strength of long ties, with the additional controls for the fraction of long ties as formulated below.
| [7] |
where lzw is the weighted fraction of long ties, lz is the binned unweighted fraction of long ties. The model above includes 94 indicators as covariates.
Life Events and Long Ties.
Data on ego networks.
The same network described in the previous sections is used to measure the fraction of long ties.
Data on interstate migration.
The current state of each user can be inferred based on Facebook’s internal prediction of residential location (based on activity and self-reported location information on their profile) as also used above in construction of zip-code-level networks. Given these data, we construct a monthly panel of predicted state location for each user for 9 y starting from January 2012. To determine whether a user has experienced interstate migration, we also need information on their hometown state, for which we rely on the optionally listed hometown on the user’s Facebook profile and ensure it can be matched against a known location. Overall, 72% of the active users in the data have provided this self-reported information on their profile. Given the monthly residence and hometown states, we restrict our attention to users with a hometown city in the United States, aged between 30 and 60 who have not migrated since 2012 (i.e., their monthly residence state has remained the same since January 2012). We then divide these users to migrants or those with a different hometown state than their current state and nonmigrants with the same current and hometown states. Therefore in our analysis, migrants are users who moved to their current state prior to 2012 and have lived in their current state continuously since 2012. Similarly, nonmigrants are users who have continued residing in their home state since 2012. The final data consist of 20.6 million users, out of which 16.7 million users (81%) are local or nonmigrant and 3.9 million users (19%) are migrant.
Data on high school attendance.
We obtain high school attendance data on each user based on the self-reported high school names and pages on their profiles. Multiple pages referring to the same high school by different users are consolidated to a single high school and then matched against the database of high schools in the United States provided by the National Center for Education Statistics (NCES) to ensure the validity of these self-reported schools. About 65% of the users we examined have reported their high school information. Our analysis is based on users aged between 30 and 60 who currently live in the United States, have indicated their hometown city in the United States, and have attended their high schools in the United States and all within a single state. We divide this population of users into two groups of single or multiple high schools and join each user with their ego network constructed from communication data. The final population consists of 24.4 million users, out of which 23.8 million (97%) attended a single high school and 673 thousand (3%) attended multiple high schools all within a single state.
Analysis.
Our goal is to compare the average fraction of long ties over the population of two groups: migrants vs. locals or multiple vs. single high school attendees. A difference in means within each degree bin, however, does not account for differences between the two populations. In our analysis, we account for observable differences using poststratification (i.e., many-to-many matching). In our poststratification, the combination of binned age (3 bins), gender (2 bins), and hometown county income (15 bins) constitute a stratum (a total of 90 strata). Fig. 4A and B, and SI Appendix, Fig. S12 show these poststratified means within each degree bin (with a total of 30 equal-size degree bins). We perform the poststratification conditional on degree for two reasons: first, to show how the range of common values for the fraction of long ties varies by degree and, second, to show that the difference between two groups (migrants vs. locals or single vs. multiple high schools attendance) is consistent across the whole range of ego degrees.
While migrant and local populations have similar degree and gender distributions, they vary significantly by their age, mean household income of where they grew up, and current state (SI Appendix, Fig. S9), hence the need for stratification. Similarly, users who have attended multiple high schools tend to have a slightly higher degree and be of younger age than the single high school users (SI Appendix, Fig. S10). Our analysis does not include migration status in the strata definition for two reasons: First, as we see in SI Appendix, Fig. S10, there is no notable difference between the two high school groups in terms of migration. Second, our analysis requires that neither group experiences an interstate migration event during high school.
High School Closures and Long Ties.
Data.
The National Council for Education Statistics provides data on high schools that closed in each year from 1995 to 2016. The analysis on high school closures involves Facebook users who had reported attending any of these closed high schools and have provided the exact time period of their attendance. The users whose reported end date at the school is within one year of its closure are considered to be in the “During Closure” condition and those whose reported end date at the school is between 2 and 5 y prior to its closure constitute the “Before Closure” condition. The final data consist of 33,813 users who had attended 285 closed high schools, out of which 11,309 (22,504) users attended the high schools during (before) their closure.
Analysis.
The main result reported above is based on SI Appendix, Table S1 which uses a two-way fixed effect model with fixed effects for both age in whole years and schools and cluster-robust standard errors at the school level. The average fractions of long ties in Fig. 5 are estimated using poststratification with the combination of school and age in 4-y bins as the strata.
Supplementary Material
Appendix 01 (PDF)
Acknowledgments
We thank S. Aral, A. Pentland, M. O. Jackson, D. G. Rand, R. Reagans, C. Roth, four anonymous referees, and seminar participants at University of Michigan, University of Pennsylvania, Stanford University, Northwestern University, Max Planck Institute for Human Development, Centro de Estudios Monetarios y Financieros, Fondation des Treilles, and Massachusetts Institute of Technology for their helpful comments. We especially appreciate Linxi Wang and Matthew Jacob for their help in our analysis. E.J. was supported by a grant to D.E. from the World Bank. S.P.F. was supported by the SDG Partnership Fund.
Author contributions
E.J., S.P.F., M.B., and D.E. designed research; E.J. performed research; E.J. and M.B. analyzed data; and E.J., S.P.F., M.B., and D.E. wrote the paper.
Competing interests
E.J. was a contractor of Meta Platforms, which operates Facebook, while conducting this research. D.E. has received funding from Meta for other research. Meta has sponsored a conference that D.E. organizes. D.E. was until recently a consultant to Twitter.
Footnotes
This article is a PNAS Direct Submission.
Although PNAS asks authors to adhere to United Nations naming conventions for maps (https://www.un.org/geospatial/mapsgeo), our policy is to publish maps as provided by the authors.
*These numbers are from a recent earnings report: https://investor.fb.com/financials/default.aspx.
Contributor Information
Eaman Jahani, Email: eaman@berkeley.edu.
Dean Eckles, Email: eckles@mit.edu.
Data, Materials, and Software Availability
Anonymized replication materials (code and sufficient statistics for all analyses in the paper) have been deposited in Github (https://github.com/eamanj/long_ties_economic_outcomes) (65). In addition, the publication of this paper is accompanied by the public release of geographically-aggregated measures of network structure as used in Fig. 1 and other associated analyses (https://socialmediaarchive.org/record/27). In order to protect the privacy of individuals and comply with relevant regulations, this data release has used methods that ensure the aggregated data does not reveal much about any individual person.
Supporting Information
References
- 1.Granovetter M., Economic action and social structure: The problem of embeddedness. Am. J. Sociol. 91, 481–510 (1985). [Google Scholar]
- 2.Jackson M. O., An overview of social networks and economic applications. Handbook Soc. Econ. 1, 511–585 (2011). [Google Scholar]
- 3.García-Jimeno C., Iglesias A., Yildirim P., Information networks and collective action: Evidence from the women’s temperance crusade. Am. Econ. Rev. 112, 41–80 (2022). [Google Scholar]
- 4.Banerjee A., Chandrasekhar A. G., Duflo E., Jackson M. O., The diffusion of microfinance. Science 341, 1236498 (2013). [DOI] [PubMed] [Google Scholar]
- 5.Zhang Y., Wang L., Zhu J. J., Wang X., Pentland A. S., The strength of structural diversity in online social networks. Research 2021, 9831621 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Eagle N., Macy M., Claxton R., Network diversity and economic development. Science 328, 1029–1031 (2010). [DOI] [PubMed] [Google Scholar]
- 7.Luo S., Morone F., Sarraute C., Travizano M., Makse H. A., Inferring personal economic status from social network location. Nat. Commun. 8, 15227 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Bailey M., Cao R., Kuchler T., Stroebel J., Wong A., Social connectedness: Measurement, determinants, and effects. J. Econ. Perspect. 32, 259–280 (2018). [PubMed] [Google Scholar]
- 9.Granovetter M., The strength of weak ties. J. Sociol. 78, 1360–1380 (1973). [Google Scholar]
- 10.Burt R. S., Structural Holes: The Social Structure of Competition (Harvard University Press, Cambridge, MA, 1992). [Google Scholar]
- 11.Rodan S., Structural holes and managerial performance: Identifying the underlying mechanisms. Soc. Networks 32, 168–179 (2010). [Google Scholar]
- 12.Borgatti S. P., Structural holes: Unpacking Burt’s redundancy measures. Connections 20, 35–38 (1997). [Google Scholar]
- 13.Ugander J., Backstrom L., Marlow C., Kleinberg J., Structural diversity in social contagion. Proc. Natl. Acad. Sci. U.S.A. 109, 5962–5966 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.D. Eckles, E. Mossel, M. A. Rahimian, S. Sen. Long ties accelerate noisy threshold-based contagions. arXiv [Preprint] (2018). arXiv:1810.03579 [DOI] [PubMed]
- 15.Reagans R., Zuckerman E. W., Networks, diversity, and productivity: The social capital of corporate R&D teams. Org. Sci. 12, 502–517 (2001). [Google Scholar]
- 16.Hansen M. T., The search–transfer problem: The role of weak ties in sharing knowledge across organization subunits. Adm. Sci. Q. 44, 82–111 (1999). [Google Scholar]
- 17.Reagans R., McEvily B., Network structure and knowledge transfer: The effects of cohesion and range. Adm. Sci. Q. 48, 240–267 (2003). [Google Scholar]
- 18.Aral S., Van Alstyne M., The diversity-bandwidth trade-off. Am. J. Sociol. 117, 90–171 (2011). [Google Scholar]
- 19.Yakubovich V., Weak ties, information, and influence: How workers find jobs in a local Russian labor market. Am. Sociol. Rev. 70, 408–421 (2005). [Google Scholar]
- 20.Castilla E. J., Lan G. J., Rissing B. A., Social networks and employment: Mechanisms (Part 1). Sociol. Compass 7, 999–1012 (2013). [Google Scholar]
- 21.Mehra A., Dixon A. L., Brass D. J., Robertson B., The social network ties of group leaders: Implications for group performance and leader reputation. Org. Sci. 17, 64–79 (2006). [Google Scholar]
- 22.Obstfeld D., Social networks, the tertius iungens orientation, and involvement in innovation. Adm. Sci. Q. 50, 100–130 (2005). [Google Scholar]
- 23.de Vaan M., Stark D., Vedres B., Game changer: The topology of creativity. Am. J. Sociol. 120, 1144–1194 (2015). [DOI] [PubMed] [Google Scholar]
- 24.I. Kloumann, L. Adamic, J. Kleinberg, S. Wu, The lifecycles of apps in a social ecosystem in Proceedings of the 24th International Conference on World Wide Web (IW3C2) (2015), pp. 581–591.
- 25.Aral S., Walker D., Tie strength, embeddedness, and social influence: A large-scale networked experiment. Manag. Sci. 60, 1352–1370 (2014). [Google Scholar]
- 26.M. Bailey, D. M. Johnston, T. Kuchler, J. Stroebel, A. Wong, “Peer effects in product adoption” (National Bureau of Economic Research Technical Report, 2019).
- 27.R. S. Burt et al., Structural holes and good ideas. Am. J. Sociol. 110, 349–399 (2004).
- 28.Burt R. S., Jannotta J. E., Mahoney J. T., Personality correlates of structural holes. Soc. Networks 20, 63–87 (1998). [Google Scholar]
- 29.Kalish Y., Robins G., Psychological predispositions and network structure: The relationship between individual predispositions, structural holes and network closure. Soc. Networks 28, 56–84 (2006). [Google Scholar]
- 30.Kalish Y., Bridging in social networks: Who are the people in structural holes and why are they there? Asian J. Soc. Psychol. 11, 53–66 (2008). [Google Scholar]
- 31.Specht J., Egloff B., Schmukle S. C., Stability and change of personality across the life course: The impact of age and major life events on mean-level and rank-order stability of the Big Five. J. Pers. Soc. Psychol. 101, 862–82 (2011). [DOI] [PubMed] [Google Scholar]
- 32.Fowler J. H., Dawes C. T., Christakis N. A., Model of genetic variation in human social networks. Proc. Natl. Acad. Sci. U.S.A. 106, 1720–1724 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Neugart M., Yildirim S., Heritability in friendship networks. J. Econ. Behav. Org. 194, 41–55 (2022). [Google Scholar]
- 34.R. Chetty et al., Social capital. I: Measurement and associations with economic mobility. Nature 608, 108–121 (2022). [DOI] [PMC free article] [PubMed]
- 35.P. Wiepking, R. Bekkers, Who gives? A literature review of predictors of charitable giving. Part two: Gender, family composition and income. Voluntary Sect. Rev. 3, 217–245 (2012).
- 36.Park P. S., Blumenstock J. E., Macy M. W., The strength of long-range ties in population-scale social networks. Science 362, 1410–1413 (2018). [DOI] [PubMed] [Google Scholar]
- 37.Lyu D., Yuan Y., Wang L., Wang X., Pentland A., Investigating and modeling the dynamics of long ties. Commun. Phys. 5, 1–9 (2022). [Google Scholar]
- 38.Breiman L., Random forests. Mach. Learn. 45, 5–32 (2001). [Google Scholar]
- 39.J. Saramäki et al., Persistence of social signatures in human communication. Proc. Natl. Acad. Sci. U.S.A. 111, 942–947 (2014). [DOI] [PMC free article] [PubMed]
- 40.Wrzus C., Hänel M., Wagner J., Neyer F. J., Social network changes and life events across the life span: A meta-analysis. Psychol. Bull. 139, 53 (2013). [DOI] [PubMed] [Google Scholar]
- 41.Jackson M. O., Rogers B. W., Zenou Y., The economic consequences of social-network structure. J. Econ. Lit. 55, 49–95 (2017). [Google Scholar]
- 42.Granovetter M., The impact of social structure on economic outcomes social networks and economic outcomes: Core principles. J. Econ. Pers. 19, 33–50 (2005). [Google Scholar]
- 43.R. Chetty et al., Social capital. II: Determinants of economic connectedness. Nature 608, 122–134 (2022). [DOI] [PMC free article] [PubMed]
- 44.Gee L. K., Jones J., Burke M., Social networks and labor markets: How strong ties relate to job finding on Facebook’s social network. J. Labor Econ. 35, 485–518 (2017). [Google Scholar]
- 45.Gee L. K., Jones J. J., Fariss C. J., Burke M., Fowler J. H., The paradox of weak ties in 55 countries. J. Econ. Behav. Org. 133, 362–372 (2017). [Google Scholar]
- 46.Rajkumar K., Saint-Jacques G., Bojinov I., Brynjolfsson E., Aral S., A causal test of the strength of weak ties. Science 377, 1304–1310 (2022). [DOI] [PubMed] [Google Scholar]
- 47.G. Simmel, The Sociology of Georg Simmel (Simon and Schuster, 1950), vol. 92892.
- 48.Totterdell P., Holman D., Hukin A., Social networkers: Measuring and examining individual differences in propensity to connect with others. Soc. Networks 30, 283–296 (2008). [Google Scholar]
- 49.Burt R. S., Merluzzi J., Network oscillation. Acad. Manag. Dis. 2, 368–391 (2016). [Google Scholar]
- 50.Welsh R. O., School hopscotch: A comprehensive review of K-12 student mobility in the United States. Rev. Ed. Res. 87, 475–511 (2017). [Google Scholar]
- 51.Rumberger R. W., Palardy G. J., Test scores, dropout rates, and transfer rates as alternative indicators of high school performance. Am. Ed. Res. J. 42, 3–42 (2005). [Google Scholar]
- 52.Brummet Q., The effect of school closings on student achievement. J. Public Econ. 119, 108–124 (2014). [Google Scholar]
- 53.R. Chetty, N. Hendren, P. Kline, E. Saez, Where is the land of opportunity? The geography of intergenerational mobility in the United States. Q. J. Econ. 129, 1553–1623 (2014).
- 54.Almaatouq A., Radaelli L., Pentland A., Shmueli E., Are you your friends’ friend? Poor perception of friendship ties limits the ability to promote behavioral change PLoS ONE 11, 1–13 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Y. X. Zhu et al., Influence of reciprocal links in social networks. PLoS ONE 9, 1–8 (2014). [DOI] [PMC free article] [PubMed]
- 56.D. Easley et al., Networks, Crowds, and Markets (Cambridge University Press, Cambridge, 2010), vol. 8.
- 57.E. Bakshy, D. Eckles, R. Yan, I. Rosenn, “Social influence in social advertising: Evidence from field experiments” in Proceedings of the 13th ACM Conference on Electronic Commerce (2012), pp. 146–161.
- 58.R. M. Bond et al., A 61-million-person experiment in social influence and political mobilization. Nature 489, 295–298 (2012). [DOI] [PMC free article] [PubMed]
- 59.J. J. Jones et al., Inferring tie strength from online directed behavior. PLoS ONE 8, e52168 (2013). [DOI] [PMC free article] [PubMed]
- 60.Burke M., Kraut R. E., The relationship between Facebook use and well-being depends on communication type and tie strength. J. Comput.-Med. Commun. 21, 265–281 (2016). [Google Scholar]
- 61.Sävje F., Higgins M. J., Sekhon J. S., Generalized full matching. Polit. Anal. 29, 423–447 (2021). [Google Scholar]
- 62.R. Chetty, J. N. Friedman, N. Hendren, M. R. Jones, S. R. Porter, “The opportunity atlas: Mapping the childhood roots of social mobility” (National Bureau of Economic Research Working Paper 25147, 2018).
- 63.Cattaneo M. D., Crump R. K., Farrell M. H., Feng Y., Binscatter regressions. Stat. J. vv, 1–49 (2021). [Google Scholar]
- 64.M. D. Cattaneo, R. K. Crump, M. H. Farrell, Y. Feng, On binscatter (2021).
- 65.E. Jahani, Long ties, disruptive life events and economic prosperity. Github and ICPSR. https://github.com/eamanj/long_ties_economic_outcomes. Deposited 16 June 2023. [DOI] [PMC free article] [PubMed]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Appendix 01 (PDF)
Data Availability Statement
Anonymized replication materials (code and sufficient statistics for all analyses in the paper) have been deposited in Github (https://github.com/eamanj/long_ties_economic_outcomes) (65). In addition, the publication of this paper is accompanied by the public release of geographically-aggregated measures of network structure as used in Fig. 1 and other associated analyses (https://socialmediaarchive.org/record/27). In order to protect the privacy of individuals and comply with relevant regulations, this data release has used methods that ensure the aggregated data does not reveal much about any individual person.



