Crowding Effects Dominate Demographic Attributes in COVID-19 Cases

Awi Federgruen; Sherin Naha

2020 Nov 17;102:509–516. doi: 10.1016/j.ijid.2020.10.063

PMCID: PMC7833246 PMID: 33217575

Keywords: COVID-19, Socio-economic factors, Demographic factors, Average household size, Population density, Racial/ethnic factors

Highlights

• Cross-sectional studies of infection rates are critical to understand the drivers behind the COVID epidemic.
• This study sought to identify demographic and socio-economic indicators that drive the incidence rate of COVID-19.
• The paper assesses these factors at zip-code granularity.
• The average household size is the single most important explanatory variable; the percentage of the population 65 or older, and that below the poverty line are also strongly positively associated.
• Population density, per se, does not have a significantly positive impact on incidence rates.

Abstract

Objective

With an eye toward possible public policy implications, our objective is to identify the socio-economic and demographic factors that drive the large variation in COVID-19 incidence rates observed within relatively compact geographic regions, and to quantify the relative impact of each of these factors. We use international comparisons as a starting point.

Methods

New York City, consisting of some 175 zip codes, is an ideal setting for this study, given the large variation in case incidence rates across zip codes. We conducted systematic regression studies employing data with zip code granularity. Our model specifications are based on a well-established epidemiologic model that explains the effects of household sizes on R0.

Results

Average household size emerges as the single most important driver behind the large variation in COVID-19 incidence rates. It independently explains 62% of the variation. The percentage of the population above the age of 65 and the percentage below the poverty line are also strongly positively associated with zip code incidence rates. As to ethnic/racial characteristics, the percentages of African Americans, Hispanics and Asians within the population are significantly associated, but the magnitude of the impact is smaller. (The proportion of Asians within a zip code has a negative association.) Contrary to common belief, population density, by itself, does not have a significantly positive impact (other than when a high population density is driven by large household sizes).

Conclusion

Our findings support implemented and proposed policies to quarantine patients and separate infected individuals from families or dormitories; they also support newly revised nursing home admission policies.

Introduction

The world continues to search for a fundamental understanding of the dynamics of the current pandemic. For example, we try to understand why, in the United States (US), as of May 17, 2020, 192,000 COVID-19 cases have been reported in New York City (NYC) alone, while the country-wide total has been mercifully restricted to 1.5 million. Thus, a city representing 2.5% of the US population accounted for 12.8% of the reported cases. Identifying the main drivers of the disease spread has important implications for public policies to contain the current epidemic and mitigate the widely expected “second wave”.
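The shares quoted above follow directly from the reported counts. As a minimal sketch of that arithmetic (the variable names are ours and purely illustrative; the figures are those given in the text):

```python
# Figures quoted in the Introduction (as of May 17, 2020).
nyc_cases = 192_000
us_cases = 1_500_000
nyc_population_share = 0.025  # NYC as a fraction of the US population

# Share of all reported US cases that occurred in NYC.
nyc_case_share = nyc_cases / us_cases

# How many times larger the case share is than the population share.
overrepresentation = nyc_case_share / nyc_population_share

print(f"NYC share of US cases: {nyc_case_share:.1%}")
print(f"Ratio of case share to population share: {overrepresentation:.2f}")
```

Under these figures, the city's share of cases is roughly five times its share of the population.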

Population density is widely believed to be a main driving force. This theory has intuitive appeal. After all, the number of infections in a given region depends on the basic reproductive number R0: the average number of cases directly generated by one individual in a population where all individuals are susceptible. R0, in turn, depends in part on the number of individuals with whom a single case has physical contact during the time interval in which she is contagious, a number that is presumably positively correlated with the population density.

However, the theory is challenged, first of all, on an international basis. Many cities with a population density comparable to or greater than NYC’s 10,198 residents per square kilometre (sq. km) have reported much lower case rates. For example, Manila, Baghdad, Mumbai, Seoul, Mexico City and Singapore have, respectively, 46,128, 32,874, 32,103, 16,000, 9,800 and 8,358 residents/sq. km. Their case incidence rates per 100,000 residents vary, however, between 9.4 (Seoul) and 635.4 (Singapore), as compared to NYC’s rate of 2,286 cases per 100,000 residents.

These cities’ lower rates may be explained by ex-ante differences in international traffic patterns in and out of the country, affecting the cluster of “imported” cases, or by specific containment and testing policies adopted by the respective governments. However, among states within the US, California has one of the highest population densities (251.3 residents per square mile) but one of the lowest COVID-19 mortality rates (8 per 100,000 residents), while Louisiana, with a population density 2.5 times lower than that of California, has reported 49 COVID-19 deaths per 100,000 residents.

And stark differences are apparent within NYC itself. Wadhera et al. (2020) recently reported that, among the city’s five boroughs, Manhattan had by far the fewest hospitalizations per 100,000 residents but the greatest population density, 2.5 times the citywide average (25,106 vs 10,198 residents/sq. km). (Because the percentage of confirmed cases that require hospitalization is remarkably robust throughout the country, hospitalization rates can be viewed as proxies for incidence rates.) In fact, at zip code granularity, the rates of reported cases and the population densities are negatively correlated, with a correlation coefficient of −0.33 (Table 2).
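The densities in the text are quoted per square kilometre, whereas the tables report residents per square mile. A minimal conversion sketch, assuming the international mile of 1.609344 km (the function name is hypothetical and not taken from the paper):

```python
KM_PER_MILE = 1.609344               # international mile, in kilometres
SQ_KM_PER_SQ_MILE = KM_PER_MILE ** 2  # about 2.59 sq. km per sq. mile


def per_sq_km_to_per_sq_mile(density_per_sq_km: float) -> float:
    """Convert a density in residents/sq. km to residents/sq. mile."""
    return density_per_sq_km * SQ_KM_PER_SQ_MILE


# Densities quoted in the text (residents per sq. km).
citywide = per_sq_km_to_per_sq_mile(10_198)
manhattan = per_sq_km_to_per_sq_mile(25_106)

print(f"NYC citywide: {citywide:,.0f} residents/sq. mile")
print(f"Manhattan:    {manhattan:,.0f} residents/sq. mile")
```

The conversion only rescales the units; ratios such as Manhattan versus the citywide average are unaffected.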

Table 2.

Correlation coefficients among the variables.

	Population density (residents per sq. mile)	Average household size	Percent below poverty line	Percent above age 65	Confirmed cases per 100,000 residents
Population density (residents per sq. mile)	1	−0.324	0.285	−0.064	−0.325
Average household size	−0.324	1	0.189	−0.207	0.622
Percent below poverty line	0.285	0.189	1	−0.367	0.229
Percent above age 65	−0.064	−0.207	−0.367	1	0.092
Confirmed cases per 100,000 residents	−0.325	0.622	0.229	0.092	1
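The entries of Table 2 can be read programmatically. The sketch below transcribes the matrix as printed, checks that it is symmetric, and ranks the four explanatory variables by the absolute value of their correlation with the case rate. It is an illustrative reading aid with our own variable names, not part of the authors' analysis.

```python
labels = [
    "Population density",
    "Average household size",
    "Percent below poverty line",
    "Percent above age 65",
    "Confirmed cases per 100,000 residents",
]

# Correlation coefficients transcribed from Table 2 (rows/columns follow `labels`).
corr = [
    [1.0, -0.324, 0.285, -0.064, -0.325],
    [-0.324, 1.0, 0.189, -0.207, 0.622],
    [0.285, 0.189, 1.0, -0.367, 0.229],
    [-0.064, -0.207, -0.367, 1.0, 0.092],
    [-0.325, 0.622, 0.229, 0.092, 1.0],
]

n = len(labels)

# A correlation matrix must be symmetric with a unit diagonal.
is_symmetric = all(corr[i][j] == corr[j][i] for i in range(n) for j in range(n))
has_unit_diagonal = all(corr[i][i] == 1.0 for i in range(n))

# Rank the explanatory variables by |correlation| with the case rate (last row).
case_row = corr[-1]
ranked = sorted(zip(labels[:-1], case_row[:-1]), key=lambda p: abs(p[1]), reverse=True)

for name, r in ranked:
    print(f"{name}: {r:+.3f}")
```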

	Minimum	Maximum	Average
# confirmed cases per 100,000 residents	436.91	4,226.51	2,077.79
Population density (residents per sq. mile)	1,389.08	126,067.69	38,480.75
Average household size	1.57	3.97	2.65
Percentage below poverty line	2.20	45.90	16.83
Percentage above the age of 65	0.46	28.98	14.26
Percentage of total Hispanic population	1.12	75.77	26.35
Percentage total Black non-Hispanic population	0.37	90.51	20.06
Percentage total Black population	0.37	93.81	23.69
Percentage total Asian population	0.07	72.62	14.78

	Coefficients	Standard Error	t Stat	P-value	Lower 95%	Upper 95%
Intercept	450.51	163.46	2.76	0.006	127.86	773.15
avg_household_size_adj	986.90	94.79	10.41	5.41E-20	799.80	1174.01
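The columns of the regression table above are linked by simple identities: the t statistic is the coefficient divided by its standard error, and the 95% interval is the coefficient plus or minus a critical multiplier times the standard error. The following sketch checks the household-size row for internal consistency and backs out the implied multiplier; it assumes a symmetric interval and uses illustrative variable names.

```python
# Household-size row of the single-regressor table above.
coef = 986.90
std_err = 94.79
lower_95, upper_95 = 799.80, 1174.01

# t statistic: coefficient divided by its standard error.
t_stat = coef / std_err

# For a symmetric interval, half its width equals multiplier * standard error.
half_width = (upper_95 - lower_95) / 2
implied_multiplier = half_width / std_err

# The interval midpoint should reproduce the coefficient (up to rounding).
midpoint = (upper_95 + lower_95) / 2

print(f"t statistic:        {t_stat:.2f}")
print(f"implied multiplier: {implied_multiplier:.3f}")
print(f"interval midpoint:  {midpoint:.2f}")
```

An implied multiplier slightly above the large-sample normal value of 1.96 is what one would expect from a t distribution with a finite number of degrees of freedom.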

	Coefficients	Standard Error	t Stat	P-value	Lower 95%	Upper 95%
Intercept	−284.40	353.05	−0.805	0.42	−981.40	412.59
Population_density	−0.006	0.003	−1.81	0.07	−0.01	0.001
Average_household_size	896.83	180.56	4.97	1.66E-06	540.36	1253.29
%_below_poverty_level	24.37	5.36	4.55	1.03E-05	13.79	34.94
% age>65	49.94	9.80	5.10	9.29E-07	30.59	69.28
Population_density * (average_household_size)²	−3.52E-05	0.001	−0.03	0.98	−0.002	0.003

	Coefficients	Standard Error	t Stat	P-value	Lower 95%	Upper 95%
Intercept	−549.53	246.39	−2.23	0.027	−1035.93	−63.14
avg_household_size_adj	1051.69	89.00	11.82	7.17E-24	875.99	1227.38
pct_below_poverty_level	25.48	5.43	4.692	5.57E-06	14.76	36.20
pct age>65	48.62	9.73	4.997	1.44E-06	29.41	67.82
pct_below_poverty_level * pct age>65	−0.004	0.001	−3.03	0.003	−0.006	−0.001

	Coefficients	Standard Error	t Stat	P-value	Lower 95%	Upper 95%
Intercept	−607.02	239.93	−2.53	0.0123	−1080.66	−133.37
avg_household_size_adj	918.81	90.26	10.18	0.0000	740.64	1096.99
pct_below_poverty_level	3.82	5.95	0.64	0.5220	−7.93	15.56
pct age>65	54.71	9.57	5.72	0.0000	35.83	73.60
%total_hispanic	12.35	3.03	4.08	0.0001	6.37	18.32

	Coefficients	Standard Error	t Stat	P-value	Lower 95%	Upper 95%
Intercept	−593.85	245.16	−2.42	0.0165	−1077.82	−109.87
avg_household_size_adj	961.76	90.63	10.61	0.0000	782.84	1140.67
pct_below_poverty_level	14.79	5.01	2.95	0.0036	4.89	24.69
pct age>65	50.42	9.71	5.19	0.0000	31.26	69.59
%total_black_nonhispanic	5.87	1.94	3.03	0.0028	2.05	9.70

	Coefficients	Standard Error	t Stat	P-value	Lower 95%	Upper 95%
Intercept	−600.22	244.74	−2.45	0.0152	−1083.37	−117.08
avg_household_size_adj	961.57	90.44	10.63	0.0000	783.04	1140.11
pct_below_poverty_level	13.79	5.08	2.71	0.0073	3.76	23.83
pct age>65	50.75	9.70	5.23	0.0000	31.61	69.89
%total_black	5.77	1.86	3.10	0.0022	2.10	9.43

	Coefficients	Standard Error	t Stat	P-value	Lower 95%	Upper 95%
Intercept	−484.66	238.91	−2.03	0.0441	−956.29	−13.02
avg_household_size_adj	1081.07	86.49	12.50	0.0000	910.34	1251.80
pct_below_poverty_level	12.27	4.89	2.51	0.0131	2.61	21.93
pct age>65	55.48	9.45	5.87	0.0000	36.82	74.13
%total_asian_alone	−14.73	3.17	−4.64	0.0000	−20.99	−8.47

	Average household size	Percent below poverty line	Percent above age 65	Percentage of total Hispanic population	Percentage total Black non-Hispanic population	Percentage total Black population	Percentage total Asian population	# confirmed cases/100,000 residents
Average household size	1.00	0.19	−0.21	0.26	−0.56	0.26	0.34	0.62
Percent below poverty line	0.19	1.00	−0.37	0.65	0.28	0.34	−0.29	0.23
Percent above age 65	−0.21	−0.37	1.00	−0.34	−0.13	−0.16	0.19	0.09
Percentage of total Hispanic population	0.34	0.65	−0.34	1.00	−0.02	0.05	−0.26	0.42
Percentage total Black non-Hispanic population	0.26	0.28	−0.13	−0.02	1.00	0.99	−0.48	0.34
Percentage total Black population	0.26	0.34	−0.16	0.05	0.99	1.00	−0.51	0.35
Percentage total Asian population	0.07	−0.29	0.19	−0.26	−0.48	−0.51	1.00	−0.19
# confirmed cases/100,000 residents	0.62	0.23	0.09	0.42	0.34	0.35	−0.19	1.00

	avg_household_size_adj	pct_below_poverty_level	pct_age>65	%total_hispanic
Minimum	875.61725	2.6686427	51.994143	1165.5236
Average	911.73546	4.0941206	55.349999	1238.7307
Maximum	931.62228	5.226449	58.726034	1310.935

	avg_household_size_adj	pct_below_poverty_level	pct_age>65	%total_black_non_hispanic
Minimum	916.27133	13.036153	47.943724	553.63002
Average	955.9674	15.172045	51.047048	578.35984
Maximum	981.29695	17.115713	54.177515	600.34054

	avg_household_size_adj	pct_below_poverty_level	pct_age>65	%total_black
Minimum	916.17228	12.10048	48.251701	542.78349
Average	955.76292	14.187076	51.369147	567.9981
Maximum	980.97525	16.112607	54.503164	588.8834

	avg_household_size_adj	pct_below_poverty_level	pct_age>65	%total_asian_alone
Minimum	1030.0448	10.525976	7.94E-09	−1514.9971
Average	1074.2913	12.603313	56.092721	−1471.4693
Maximum	1101.9448	14.40379	59.369881	−1421.1716
