Skip to main content
Data in Brief logoLink to Data in Brief
. 2021 Jan 7;34:106717. doi: 10.1016/j.dib.2021.106717

Data on race, inequality, and social capital in the U.S. counties

Dongkyu Kim 1,, Mi-son Kim 1, Natasha Altema McNeely 1
PMCID: PMC7807210  PMID: 33490335

Abstract

This article presents data on social capital at the United States’ county-level. Following Rupasingha et al. (2006), the social capital index captures the common factor among density measures of 10 different types of associations, voter turnout rates, U.S. decennial census participation rates, and the number of non-profit organizations. Based on Knack (2003), we create associational densities measures as a proxy for both bridging and bonding social capital. Including data on income inequality, racial diversity, minority group size, average household income, educational attainment, the ratio of a family household, the size of migration population, and female labor market participation rates, the data covers 3,104 U.S. counties for both 2009 and 2014. This paper includes descriptive statistics and figures. This data article is associated with the article “Race, Inequality, and Social Capital in the U.S. Counties.”

Keywords: Racial diversity, Income inequality, Social capital, U.S. counties

Specifications Table

Subject Sociology and Political Science
Specific subject area Racial diversity, income inequality, social capital in the U.S. counties
Type of data Comma-separated values, tables, figures
How data were acquired The original data are from the websites of the Northeast Regional Center for Rural Development at Penn State University, the American Community Survey of the U.S. Census Bureau, and the U.S. Department of Agriculture.
Data format Comma Separated values & Analysed
Parameters for data collection All U.S. counties for both 2009 and 2014
Description of data collection The Northeast Regional Center for Rural Development at Penn State University provides raw data on the social capital index. Based on the two most recent social capital data, in both 2009 and 2014, which share the same component measures, other county-level data were added from the American Community Survey of the U.S. Census Bureau and the U.S. Department of Agriculture.
Data source location There are three primary data sources: Northeast Regional Center for Rural Development at Penn State University, the U.S. Census Bureau's American Community Survey, and the Economic Research Service of the United States Department of Agriculture. All variables were separately downloaded and merged.
Data accessibility Repository name: Mendeley Data
Data identification number: 10.17632/ps8mtmtmvv.2
Direct URL to data: https://data.mendeley.com/datasets/ps8mtmtmvv/2
Related research article Mi-son Kim, Dongkyu Kim, and Natasha Altema McNeely, “Race, Inequality, and Social Capital in the U.S. Counties”
https://doi.org/10.1080/03623319.2020.1799178

Value of the Data

  • Social scientists who are interested in the dynamics created by income inequality, racial diversity, and social capital in the U.S. Counties can easily utilize the dataset.

  • This dataset also provides other county-level covariates that can be utilized by social science and humanities research.

  • This dataset provides the most comprehensive measure of social capital of U.S. Counties for two time periods.

1. Data Description

This Data in Brief article is associated with the article “Race, Inequality, and Social Capital in the U.S. Counties.” [2] The data provided in this article were constructed to understand the variations of social capital across U.S. counties by examining the interaction between income inequality and ethnic diversity. Although the concept of social capital has been much debated, it can be largely defined as intangible social assets that individuals can utilize or enjoy by engaging with others. In that regard, Putnam [6] defines the concept as “networks, norms, and trust that enable participants to act together more effectively to pursue shared objectives.” Following Rupasingha et al. [7], the social capital index measures the extent to which individuals engage with others at the county-level.

The social capital index measures the common factor among four different types of variables: (1) the associational density of 10 different types of organizations (civic organizations, bowling centers, golf clubs, fitness centers, sports organizations, religious organizations, political organizations, labor unions, business organizations, and professional organizations), (2) the turnout rates for the previous presidential elections, (3) the response rate to the Census Bureau's decennial census, and (4) the number of non-profit organizations. The data are provided by the Northeast Regional Center for Rural Development at Penn State University. The index data has been updated four times since 1990. As the index has adopted a new associational typology for the 2000s data points, we only included data with a consistent typology. Thus, we have a social capital index for both 2009 and 2014, the two most recent data points. Table 1 reports all component measures’ summary statistics for each year, while Fig. 1 displays each county's average scores on the map.

Table 1.

Social capital index components.

2009
2014
N mean SD min max N mean SD min max
Civic orgs 3106 9.0 21.7 0 538 3139 8.3 20.5 0 546
Bowling centers 3106 1.4 3.0 0 58 3139 1.2 2.6 0 48
Golf clubs 3106 3.8 7.3 0 142 3139 3.6 7.3 0 141
Fitness centers 3106 9.7 30.1 0 738 3139 10.1 33.6 0 845
Sport orgs 3106 0.3 1.1 0 29 3139 0.3 1.3 0 37
Religious orgs 3106 57.7 123.1 0 3258 3139 58.5 125.2 0 3275
Political orgs 3106 0.7 3.0 0 66 3139 0.8 3.8 0 76
Labor orgs 3106 4.8 15.0 0 292 3139 4.5 14.2 0 283
Business orgs 3106 5.3 14.4 0 323 3139 5.0 14.0 0 290
Professional orgs 3106 2.1 8.9 0 214 3139 2.1 9.0 0 210
Voter turnout 3106 0.6 0.1 0.17 2.079 3139 0.7 0.1 0.35 1.116
Census rate 3106 0.7 0.1 0 0.95 3139 0.7 0.1 0 0.95
NGOs 3104 489.1 1472.6 1 41,125 3139 458.4 1381.6 0 37,547

Fig. 1.

Fig 1

The Social Capital Index across U.S. counties. The average scores for both 2009 and 2014. The darker the region is, the more social capital there is in each county. The decile cut-off values differentiate colors.

One of the independent variables is racial diversity. From data provided by the U.S. Census Bureau's American Community Survey (ACS), the racial diversity index was calculated as one minus the Herfindahl index of 7 ethnic groups (Non-Hispanic white, Hispanic, Black, Indian, Asian, Hawaiian, and two-more). It measures the probability that two people randomly chosen from a county belong to different ethnic groups (see, e.g., Alesina et al. 1999). Fig. 2 displays each county's average scores of diversity index on the map. Another key independent variable for the associated article is income inequality. Based on data also provided by the ACS, the variable measures the Gini index, which takes 0 for a perfectly equal distribution of income and 1 for perfectly unequal income distribution. Fig. 3 shows the geographical distribution of income inequality across the U.S. Counties. Table 2 shows the list of counties at both the top and the bottom ten ranks for these three key variables in 2014.

Fig. 2.

Fig 2

Racial diversity across U.S. counties. The average scores for both 2009 and 2014. The darker the region is, the more diversity there is in each county. The decile cut-off values differentiate colors.

Fig. 3.

Fig 3

Income inequality across U.S. counties. The average scores for both 2009 and 2014. The darker the region is, the more inequality there is in each county. The decile cut-off values differentiate colors.

Table 2.

2014 Rankings of social capital, inequality, and diversity.

Social capital: top 10 Income Inequality: top 10 Racial Diversity: top 10
Hinsdale County, CO Randolph County, GA Aleutians West Census Area, AK
Lexington city, VA Calhoun County, GA Queens County, NY
Mineral County, CO McMullen County, TX Maui County, HI
Motley County, TX New York County, NY Alameda County, CA
Thomas County, NE Borden County, TX Aleutians East Borough, AK
Hooker County, NE Baylor County, TX Hawaii County, HI
Griggs County, ND Orleans Parish, LA Fort Bend County, TX
Grant County, NE Corson County, SD Kauai County, HI
Kiowa County, KS Campbell County, SD Solano County, CA
Smith County, KS Eastland County, TX Honolulu County, HI

Social Capital: bottom 10 Income Inequality: bottom 10 Racial Diversity: bottom 10

Sioux County, ND Yakutat City and Borough, AK Tyler County, WV
Jim Hogg County, TX Bristol Bay Borough, AK Jackson County, KY
Webb County, TX Spencer County, KY Holmes County, OH
Hancock County, TN Emery County, UT Magoffin County, KY
Zavala County, TX Lake of the Woods County, MN Dickenson County, VA
Loving County, TX Sublette County, WY Osage County, MO
Maverick County, TX Chattahoochee County, GA Lincoln County, WV
Starr County, TX Grant County, NE Leslie County, KY
Shannon County, SD Power County, ID Blaine County, NE
Chattahoochee County, GA Clark County, ID Keya Paha County, NE

We further measured two different types of social capital by utilizing ten associational density variables. Scholars in the literature suggest that social capital has two different types: bridging and bonding social capital [3], [4], [5]. According to Putnam [5], bridging social capital can be defined as an open network that crosscuts, thus bridges, the existing social cleavages while bonding social capital is an inward-looking network that fortifies existing social interests. We labeled the former as ‘Putnam-type’ and the latter ‘Olson-type’ following Knack [3]. Based on Knack [3] and Rupasingha et al. [7], we measured bridging social capital (Putnam-type) with the associational density of the first six organizations (religious organizations, civic organizations, bowling centers, fitness centers, golf clubs, and sports organization) and bonding social capital (Olson-type) with the same density of the remaining associations (business organization, labor union, political organizations, and professional organizations). Both Fig. 4 and Fig. 5 display each variable on the map respectively.

Fig. 4.

Fig 4

Putnam type (bridging) social capital index across U.S. counties. The average scores for both 2009 and 2014. The darker the region is, the more bridging social capital there is in each county. The decile cut-off values differentiate colors.

Fig. 5.

Fig 5

Olson type (bonding) social capital index across U.S. counties. The average scores for both 2009 and 2014. The darker the region is, the more boding social capital there is in each county. The decile cut-off values differentiate colors.

We included other correlates of social capital in the dataset. Following the typology provided by the U.S. Department of Agriculture, the Rural-Urban Continuum Codes (RUCC), the urban and rural variables were dummy coded by taking suburban counties as a reference category. The RUCC scheme provides nine categories that distinguish metropolitan counties by population, and nonmetropolitan counties by population and adjacency to the metro area. We utilized three categories of metropolitan counties to construct a dummy variable for urban counties while using two categories of nonmetropolitan counties that are not adjacent to the metro area to construct the rural indicative variable. It is often believed that rural areas provide a favorable environment for social capital. In the statistical estimation of the associated article, the remaining category was considered as suburban areas and omitted in the regression analysis.

All other county-level variables are compiled by utilizing the ACS database. For the income variable, we used the mean income in the past 12 months with the inflation-adjusted dollars. Then, we transformed the average household income with the natural logarithm. The dataset also has the educational attainment variable that measures the percentage of residents who have at least some college education per county. It is well known that socioeconomic status is positively associated with social capital. Because social capital would be difficult to form in a fluid county, we include the share of the non-migratory population in our dataset. From the ACS's county-to-county migration flow data, we calculated the percentage of non-movers out of the county population. In a similar vein, it is expected that the family-oriented community would provide a good environment for social capital. Thus, the dataset includes the percentage of family households out of the total number of households for each county. Lastly, we include the size of the female workforce. The theoretical explanations about how traditional gender roles affect social capital are unsettled. Following Rupasingha et al. [7], we considered this variable to test the effect of women's traditional role as housewives empirically. Table 3 presents the summary statistics for all covariates over 3139 counties for both 2009 and 2014.

Table 3.

Descriptive statistics.

Variable N mean SD Min Max
Social capital index 6245 −0.007 1.250 −3.925 9.149
Racial diversity index 6245 0.286 0.183 0 0.769
Income Inequality index 6245 0.436 0.036 0.207 0.652
Urban 6244 0.372 0.483 0 1
Rural 6244 0.271 0.444 0 1
Ln(Average Household Income) 6245 10.942 0.224 10.259 11.934
Education (some college) ratio 6245 0.483 0.109 0.181 0.886
Non-migration population ratio 6234 0.859 0.046 0.478 0.997
Female workforce ratio 6245 0.701 0.076 0.361 1
Family household ratio 6245 0.523 0.068 0.233 0.902
Putnam (bridging associations) 6245 1.254 0.653 0 6.887
Olson (bonding associations) 6245 0.142 0.151 0 2.253

2. Experimental Design, Materials and Methods

Data construction for the associated article was constrained by the availability of data on social capital. Given social capital data for both 2009 and 2014, all relevant variables were compiled utilizing various data sources. Table 4 provides detailed information about all variables included in the dataset, including primary sources. These raw data are publicly available. However, putting them together to create correlates of social capital at the county-level requires careful handling of the data to align both temporal and geographical units. The Federal Information Processing Standards (FIPS), a four-digit county code, were used to match data points across different data sources. Furthermore, all data provided by the U.S. Census Bureau's ACS utilize the 5-year average estimates so that the dataset contains the least amount of missing values. With the constructed dataset, the associated article examined the variations of social capital at the county-level by utilizing two-stage multilevel regression analysis with year fixed effect [1,8,9]. Researchers could easily re-use or expand our dataset to better understand the variation of social capital at the county-level.

Table 4.

Variable description and data sources.

Variable Description Data source
FIPS - Federal Information Processing Standard, four-digit county codes National Institute of Standards and Technology

sk - Social capital index – 13 components + population data Northeast Regional Center for Rural Development
1. assn – Associational density of 10 types of organizations (per 1000 people)
1. relig (# of religious organization), 2. civic (# of civic organization), 3. bus (# of business organization, 4. pol (# of political organization), 5. prof (# of professional organization), 6. labor (# of labor unions), 7. bowl (# of bowling centers), 8. fitns (# of fitness centers), 9. golf (# of golf clubs), 10. sport (# of sports organization), & 11. pop (County population)
2. pvote – previous presidential election turnout
3. respn – US Census response rate
4. nccs – # of non-profit organizations

gini – Gini coefficient American Community Survey, US Census Bureau

eth_div – Ethnic diversity: 7-category diversity measure American Community Survey, US Census Bureau
1. p_white (Non-Hispanic white%), 2. p_hispanic (Hispanic%), 3. p_black (Black%), 4. p_indian (American Indians%), 5. p_asian (Asian%), 6. p_hawaiian (Pacific Islander%), 7 p_tomore (Other%)

urban & rural – Dummy variables for urban and rural counties Economic Research Service, USDA

fam_household –% of family household American Community Survey – US Census Bureau

female_wforce –% of female labor market participation American Community Survey – US Census Bureau

educ –% of people with at least some college education American Community Survey – US Census Bureau

income – Average household income American Community Survey – US Census Bureau

Nonmover –% of non-migratory population American Community Survey – US Census Bureau

Putnam – Bridging social capital: associational density for 6 components of sk: relig, civic, bowl, fitns, sport, & golf Authors’ calculation

Olson – Bonding social capital: associational density for 4 components of sk: bus, pol, prof, & labor Authors’ calculation

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships which have, or could be perceived to have, influenced the work reported in this article.

Acknowledgments

The authors would like to thank the anonymous reviewers for their feedback. We would also like to thank the Center for Survey Research and Policy Analysis at the University of Texas Rio Grande Valley for providing the financial support for the open access publication.

Footnotes

Supplementary material associated with this article can be found in the online version at doi:10.1016/j.dib.2021.106717.

Appendix. Supplementary materials

mmc1.docx (2.1MB, docx)

References

  • 1.Gelman A., Hill J. Cambridge University Press; 2007. Data Analysis Using Regression and Multilevel/Hierarchical Models. [Google Scholar]
  • 2.M. Kim, D. Kim, & N. Altema McNeely. (2020). Race, inequality, and social capital in the U.S. counties, Soc. Sci. J. [DOI] [PMC free article] [PubMed]
  • 3.Knack S. Groups, growth and trust: cross-country evidence on the Olson and Putnam hypotheses. Public Choice. 2003;117(3–4):341–355. [Google Scholar]
  • 4.Patulny R.V., Svendsen G.L.H. Exploring the social capital grid: bonding, bridging, qualitative, quantitative. Int. J. Sociol. Soc. Policy. 2007;27(1/2):32–51. [Google Scholar]
  • 5.Putnam R. Simon and Schuster; 2001. Bowling Alone: The Collapse and Revival of American Community. [Google Scholar]
  • 6.Putnam R. Tuning in, tuning out: the strange disappearance of social capital in America. Polit. Sci. Polit. 1995;28(4):664–683. [Google Scholar]
  • 7.Rupasingha A., Goetz S., Freshwater D. The production of social capital in US counties. J. Socio Econ. 2006;35:83–101. [Google Scholar]
  • 8.Snijders T.A.B., Bosker R.J. SAGE; 2011. Multilevel Analysis: An Introduction to Basic and Advanced Multilevel Modeling. [Google Scholar]
  • 9.Steenbergen M.R., Jones B.S. Modeling multilevel data structures. Am. J. Pol. Sci. 2002;46(1):218–237. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

mmc1.docx (2.1MB, docx)

Articles from Data in Brief are provided here courtesy of Elsevier

RESOURCES