Territorial differences in the spread of COVID-19 in European regions and US counties

Fabrizio Natale; Stefano Maria Iacus; Alessandra Conte; Spyridon Spyratos; Francesco Sermi

PLoS One. 2023 Feb 8;18(2):e0280780. doi: 10.1371/journal.pone.0280780

Editor: Tzai-Hung Wen

PMCID: PMC9907802 PMID: 36753502

Abstract

This article explores the territorial differences in the onset and spread of COVID-19 and the excess mortality associated with the pandemic, with a focus on European regions and US counties. Both in Europe and in the US, the pandemic arrived earlier and recorded higher Rt values in urban regions than in intermediate and rural ones. A similar gap is also found in the data on excess mortality. In the weeks during the first phase of the pandemic, urban regions in EU countries experienced excess mortality of up to 68 pp more than rural ones. We show that, during the initial days of the pandemic, territorial differences in Rt by the degree of urbanisation can be largely explained by the level of internal, inbound and outbound mobility. The differences in the spread of COVID-19 by rural-urban typology and the role of mobility are less clear during the second wave. This could be linked to the fact that the infection is widespread across territories, to changes in mobility patterns during the summer period as well as to the different containment measures which reverse the link between mobility and Rt.

Introduction

The COVID-19 pandemic is creating severe social and economic consequences, with some places experiencing disproportionately high levels of mortality and economic losses. Urban regions, and particularly large cities, have been severely affected by the spread of the pandemic in its early stages. Public discussion on the territorial impact of the pandemic requires a greater understanding of the way the pandemic is affecting regions that are diversely vulnerable and will require different recovery plans. Analyses of the role of population density and city size on the virus spread have led to mixed results [1–3]. While these analyses primarily look at the population scale as a whole, other analyses have examined disparities within the urban environment, looking in particular at the intensity of social contacts related to urban organisation and life that would make some places more prone to infection in the first phase. In particular, some of the factors considered relevant for virus transmission are the connectivity of cities as hubs of national and international transport systems [4–6], and the structure of industry and the concentration of essential jobs in certain areas [7, 8]. In addition, it has been documented that COVID-19 transmission is highest in family environments where people tend to be in close contact and where multi-generational family members live together [9, 10]. Our paper aims to gain a deeper understanding of the links between COVID-19, urban-rural typologies, territorial conditions, and mobility, which is critical for designing effective public health policy responses. We first explore the heterogeneity of COVID-19 patterns in its onset, spread, and associated excess mortality by comparing the results by the level of urbanisation of European regions and counties in the US. For the EU we use Eurostat NUTS3 rural-urban typologies and for the US we use the Rural-Urban Continuum Codes reduced to 3 classes. 
The classification in the EU and the US according to rural-urban typologies follows harmonised criteria of population density and size of the urban centres. On the basis of the share of the rural population, regions at Territorial Level 3 (i.e. NUTS3 in the EU and counties in the US) are classified as predominantly rural regions, intermediate regions and predominantly urban regions. These classifications are routinely used by National Statistical Offices, the OECD and the European Commission to publish territorial statistics. The results of our comparison of the spread of COVID-19 across regions show that the pandemic started earlier in urban regions than in intermediate and rural ones. Urban regions had the highest Rt values in both Europe and the US during the first wave, whereas rural counties were more affected than urban counties in the second wave. Analysis of excess mortality, calculated using Eurostat statistics and obtained from the difference between reported fatalities and a baseline model based on historical data between 2011 and 2019, also shows a large gap by urbanisation level during the first wave, with a median excess mortality up to 73% for urban regions, 18% for intermediate regions, and 11% for rural regions. In a second phase, we empirically examine the impact of mobility on virus spread. We model population mobility in European regions through a harmonised mobility index derived from mobile phone data. For our purpose of comparison by rural-urban typologies, these data are unique because they provide not only the relative temporal variation of mobility within each region with respect to a reference date but also information about absolute differences across regions. Due to the lack of similar data for the US, our analysis of the effect of mobility on Rt is limited to the EU. We examine the geographical distribution of mobility changes through regression models for the weeks in the first and second virus waves. 
Our results show that, on the one hand, higher mobility explains most of the variation in values in the weekly Rt during the first wave, with internal, inbound, and outbound mobility positively affecting Rt. The effect of the per capita internal mobility, in particular, is more pronounced than that of the degree of urbanisation, and remains significant even when population and population density are taken into account. On the other hand, the same regression models replicated for the second wave show a negative role of mobility on the local spread of the virus, as well as a higher prevalence of the infection in rural regions compared to large cities. The paper is organised as follows. The data section describes the data and methods used in the analyses. In the results and discussion section, we present how the COVID-19 pandemic spread in rural, intermediate, and urban regions during the first and the second wave, and the conclusions are outlined in the final section.

Data and methods

In this section we present the data sources and the methods we used to assess the spread of the COVID-19 pandemic in rural, intermediate and urban regions. EU regions are classified into three types of areas based on the share of the local population living in urban clusters and city centres: urban (densely populated areas), intermediate (intermediate density areas), and rural areas (sparsely populated areas). (For more details on the Eurostat classification, see https://ec.europa.eu/eurostat/web/degree-of-urbanisation/background. For a general overview of the different approaches to the delineation of a city, see Rozenblat C. (2020).) US counties follow a similar classification scheme that distinguishes metropolitan counties by the population size of their metropolitan area and non-metropolitan counties by the degree of urbanisation and their proximity to a metropolitan area (see https://www.ers.usda.gov/data-products/rural-urban-continuum-codes.aspx). In this analysis, the variable grouping the 2013 rural-urban codes has been reclassified into 3 categories: urban (codes 1–2), intermediate (codes 3–4), and rural counties (codes 5–9). We calculated the reproductive number (Rt) as an indicator to assess how fast the virus spread across different types of geographical areas. We estimated the excess mortality to monitor in quantitative terms the evolution and impacts of the COVID-19 pandemic. We used fully anonymised and regionally aggregated mobility data to gain insights into the different regional mobility patterns. Finally, we fitted a linear regression model to assess the relationship between mobility and Rt during the first and the second wave.
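The three-class reduction of the 2013 Rural-Urban Continuum Codes described above can be sketched as follows. This is a minimal Python illustration rather than the authors' code (which is in R); the function name `rural_urban_class` is hypothetical.

```python
def rural_urban_class(rucc_code: int) -> str:
    """Collapse a 2013 Rural-Urban Continuum Code (1-9) into three classes.

    Mapping taken from the text: codes 1-2 urban, 3-4 intermediate, 5-9 rural.
    """
    if not 1 <= rucc_code <= 9:
        raise ValueError(f"RUCC code must be between 1 and 9, got {rucc_code}")
    if rucc_code <= 2:
        return "urban"
    if rucc_code <= 4:
        return "intermediate"
    return "rural"


# Classify every valid code once to show the full mapping.
mapping = {code: rural_urban_class(code) for code in range(1, 10)}
```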

Rt

Rt is the main real-time indicator used to assess the evolution of the pandemic, design containment measures and monitor their effectiveness. During the pandemic several governments and administrations have established systems for the automatic triggering of restriction measures based on a weekly monitoring of regional Rt values. Technically, Rt measures the average number of new infections caused by each individual infected at time t in a partially susceptible population. Values above one indicate that the number of cases will increase, while with values below one the epidemic will die out. A time-dependent reproduction number, Rt, was calculated for each day and region with the R package ‘R0’ [11]. For the calculation we followed a likelihood-based estimation procedure that derives the probability of infection from the analysis of the epidemic curve of the observed cases using sliding temporal windows [12]. This estimation procedure relies on a parameter describing the time between infection and the manifestation of symptoms, which in our case was obtained from data reported during the early phases of the pandemic in China [13]. The data on confirmed COVID-19 cases at regional level were obtained through the ‘COVID19’ R package [14] and updated until the end of 2020. To analyse territorial differences at a descriptive level, the daily Rt values were averaged by consecutive weeks and across regions classified according to their rural-urban typology.
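The final averaging step (daily Rt values averaged by consecutive weeks and by rural-urban typology) might look like the sketch below. It assumes the daily Rt values have already been estimated; the record layout and the name `weekly_mean_rt` are hypothetical, and the Rt estimation itself is not reproduced here.

```python
from collections import defaultdict
from statistics import mean


def weekly_mean_rt(records):
    """Average daily Rt by (typology, week index).

    `records` is an iterable of (typology, day_index, rt) tuples, where
    day_index counts days from a common origin, starting at 0. Each block
    of 7 consecutive days forms one week.
    """
    buckets = defaultdict(list)
    for typology, day_index, rt in records:
        buckets[(typology, day_index // 7)].append(rt)
    return {key: mean(values) for key, values in buckets.items()}


# Illustrative values only, not data from the study.
example = [
    ("urban", 0, 2.0),
    ("urban", 6, 1.0),
    ("urban", 7, 3.0),
    ("rural", 0, 1.0),
]
weekly = weekly_mean_rt(example)
```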

Excess mortality

The baseline for mortality was calculated with Generalised Additive Models fitted independently for each region. In the models we included a seasonal component to account for the increase in mortality during the winter months linked to influenza outbreaks, and a linear time trend to account for long-term changes in mortality due to demographic dynamics. The excess mortality was measured as the difference between the reported data in 2020 and the estimated baseline, for all occurrences falling below the lower or above the upper bound of the 95% confidence interval of the estimated baseline. The weekly mortality data were obtained from Eurostat (demo_r_mweek3) and covered 900 regions in 26 EU Member States and the UK, with time series which, depending on the Member State, started in 2001 or 2015 and ran until the end of 2020.
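As a rough illustration of the rule above, the sketch below counts excess deaths only when the observed value falls outside the baseline's 95% interval. The baseline and its bounds are assumed to come from an already fitted model; the function names are hypothetical, and expressing the excess as a percentage of the baseline is an assumption, since the text reports percentages without stating the denominator.

```python
def excess_deaths(observed, baseline, lower, upper):
    """Observed minus baseline, counted only outside the [lower, upper] interval."""
    if lower <= observed <= upper:
        return 0.0  # within the 95% interval: not treated as excess
    return float(observed - baseline)


def excess_percent(observed, baseline, lower, upper):
    """Excess deaths as a percentage of the baseline (assumed definition)."""
    return 100.0 * excess_deaths(observed, baseline, lower, upper) / baseline


# Illustrative weekly values only, not data from the study.
above = excess_percent(observed=150, baseline=100, lower=90, upper=110)
inside = excess_percent(observed=105, baseline=100, lower=90, upper=110)
```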

Mobility

In this study we used fully anonymised and aggregated mobility data shared with the European Commission (EC) by European Mobile Network Operators (MNOs). These mobility data comply with the ‘Guidelines on the use of location data and contact tracing tools in the context of the COVID-19 outbreak’ by the European Data Protection Board [15]. The mobility data were in the form of Origin-Destination Matrix (ODM) [16, 17] and they provided valuable insights into mobility patterns across geographical areas. The data has been used to derive mobility insights and build tools to inform better targeted containment measures, in a Mobility Visualisation Platform, available to the Member States [18].

Given the high variation in the spatial and temporal aggregation across countries and operators, the original ODMs were harmonised at standardised spatial and temporal granularity into the derived Mobility Indicators [19]. We further aggregated the Mobility Indicators at weekly intervals, and we normalised the Mobility Indicators to enable a better cross-country comparison. The normalisation was performed by expressing the number of movements for each NUTS3 area and each type of movement (internal, inbound, outbound) relative to the average mobility levels between February 10 and March 8, 2020. The reason for this normalisation was to capture the relative decrease or increase of mobility compared to pre-lockdown levels. In addition to normalised mobility, we also estimated the per-capita internal mobility by dividing the number of movements recorded using mobility data in a NUTS3 region by the population size reported by Eurostat as of 1 January 2018. The number of movements recorded by each Mobile Network Operator depends on its methodology and its penetration rate in each country. Thus, to enable cross-country comparison, we normalised the per-capita internal mobility by assigning, for each country, the value of one to the NUTS3 region with the highest per-capita mobility over the reference time period February to December 2020, and the value of zero to the NUTS3 region with the lowest per-capita mobility over the same time period. The limitations of our proposed indicators are the following. First, we assume that the penetration rate of each MNO in each country is the same across rural, intermediate and urban areas and that it remains stable across the time period that we analyse. Second, we assume that the population of the NUTS3 areas remained stable during the period that we analyse.
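The two normalisations can be sketched as below. This is an illustrative reading, not the authors' implementation: it assumes the relative indicator is a ratio to the mean of the baseline weeks, and that the per-capita rescaling is a within-country min-max applied to one value per region. All names are hypothetical.

```python
from statistics import mean


def relative_mobility(weekly_movements, baseline_movements):
    """Weekly movements as a ratio to the mean of the baseline-period movements."""
    base = mean(baseline_movements)
    return [m / base for m in weekly_movements]


def min_max_by_country(per_capita):
    """Rescale per-capita mobility to [0, 1] within each country.

    `per_capita` maps (country, region) -> per-capita mobility. The region
    with the lowest value in a country gets 0, the highest gets 1.
    """
    bounds = {}
    for (country, _region), value in per_capita.items():
        lo, hi = bounds.get(country, (value, value))
        bounds[country] = (min(lo, value), max(hi, value))
    scaled = {}
    for (country, region), value in per_capita.items():
        lo, hi = bounds[country]
        # A country with a single distinct value cannot be rescaled; use 0.
        scaled[(country, region)] = 0.0 if hi == lo else (value - lo) / (hi - lo)
    return scaled


# Illustrative values only, not data from the study.
rel = relative_mobility([50, 100], baseline_movements=[100, 100])
scaled = min_max_by_country(
    {("IT", "A"): 2.0, ("IT", "B"): 4.0, ("IT", "C"): 3.0, ("FR", "X"): 10.0}
)
```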

Regression

To support our intuition about the territorial heterogeneity in the spread of COVID-19 during the first and second waves, we examine the effect of different mobility patterns through OLS regression models. The models have the Rt values recorded in each European region as the dependent variable; the rural-urban typology of the region and the internal, inbound, outbound and internal per capita mobility as the main independent variables; and the logs of the population and population density of the region as control variables. We run two sets of models: one over the 28 days since the onset of the pandemic in each region, to capture effects during the first wave, and one for the weeks after August 2020, for the second wave. All specifications include country fixed effects to account for differences in virus transmission resulting from time-invariant country characteristics. The fitting of the regression models was constrained by the necessity of having regional data on COVID-19 cases for the calculation of Rt and mobility indicators for the same periods. Data on population were obtained from Eurostat (demo_r_pjangrp3 and demo_r_d3dens). Overall, the regressions are based on around 3500 observations in 654 regions for the first wave, and 10500 observations in 551 regions for the second wave.
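One way to write the fullest specification (the one combining typology, per capita internal mobility and the demographic controls) is given below. The notation is ours and is only a sketch: r indexes regions, w weeks, c(r) is the country of region r, urban regions are the reference category, and the three-week lag of mobility follows the "-3W" labels used in the regression tables.

```latex
R_{t,\,r,w} = \alpha
  + \beta_1\,\mathrm{Intermediate}_r
  + \beta_2\,\mathrm{Rural}_r
  + \gamma\,M_{r,\,w-3}
  + \delta_1 \log(\mathrm{Pop}_r)
  + \delta_2 \log(\mathrm{Dens}_r)
  + \mu_{c(r)}
  + \varepsilon_{r,w}
```

Here M is the mobility indicator, \mu_{c(r)} is the country fixed effect and \varepsilon is the error term. The simpler specifications drop some of these terms.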

Results and discussion

The COVID-19 pandemic started earlier in urban regions

Fig 1 describes the pandemic onset in the NUTS3 regions of some European countries and counties in the US, clustered by the three levels of urbanisation. We measure the pandemic onset in each region by the number of days between the registration of the first 20 confirmed cases of Coronavirus disease and the beginning of the year 2020. In both Europe and the United States, urban regions are more vulnerable to the pandemic’s onset. The pandemic started earlier in most urban regions, while we observe a later onset in intermediate and rural regions in the first wave. The choice of the threshold of 20 cases was to avoid influences from sporadic events. We performed a sensitivity check using thresholds between 1 and 500 cases and the finding of an earlier onset in urban regions in respect of intermediate and rural ones holds for all values of the threshold.
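The onset measure can be illustrated with the short sketch below, which returns the number of days between 1 January 2020 and the first date on which cumulative confirmed cases reach the threshold. The function name and the input layout are hypothetical; the default threshold of 20 cases is the one stated above.

```python
from datetime import date


def onset_day(daily_cumulative, threshold=20, origin=date(2020, 1, 1)):
    """Days from `origin` to the first date with cumulative cases >= threshold.

    `daily_cumulative` is an iterable of (date, cumulative_confirmed_cases).
    Returns None if the threshold is never reached.
    """
    for day, cumulative in sorted(daily_cumulative):
        if cumulative >= threshold:
            return (day - origin).days
    return None


# Illustrative series only, not data from the study.
series = [
    (date(2020, 2, 20), 3),
    (date(2020, 2, 21), 19),
    (date(2020, 2, 22), 25),
]
onset = onset_day(series)
```

Re-running the function with other `threshold` values mirrors the sensitivity check described in the text.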

The infection has spread faster in urban regions during the first wave

Fig 2 displays the Rt values for the first and second waves of the pandemic. Rt is calculated from daily confirmed cases in 807 NUTS3 regions in the UK, Netherlands, Germany, Italy, Spain, France, Czech Republic and Austria (left) and in 3100 counties in the US (right). The indicator is averaged across regions grouped by rural-urban type and aggregated by days since the first reported case in each region (upper Figure), and weeks since the start of the second wave of the pandemic (lower Figure). Looking first at the upper figure, we observe that urban regions in Europe and the US recorded higher Rt values than those found in intermediate and rural regions at the start of the pandemic. This indicates that the disease spread faster in urban regions and that containment was more difficult in more densely populated areas. Approximately 56 days after the start of the pandemic, we find a general decline in the Rt and a reduction in the differences in Rt between the three groups of regions. At the start of the pandemic, the rural-urban divide in Rt values is more pronounced in the US counties. However, even in this case, the disparity in the pandemic spread by level of urbanisation has narrowed among the three regional groups, with the Rt index close to 1 at the end of the first wave. The lower part of Fig 2 shows the median Rt values across regions and counties in the weeks following the summer period, when the pandemic began to spread in a second wave of infections. In the European regions, we observe an initially higher Rt in the urban regions and increasing and higher values in the intermediate and rural regions as the second wave progresses. In contrast, in the US, rural and intermediate counties are the most vulnerable to virus spread for most of the weeks during the second wave, with a slight change in trend in the last weeks of the period.

The excess mortality linked to COVID-19 is higher in the European urban regions in the first wave

Fig 3 shows the trend in the excess mortality for the European regions during the year 2020. The increase in weekly mortality compared to past trends is used as an indirect measure to monitor the evolution of COVID-19. This indicator has the downside of including fatalities not necessarily linked to COVID-19, such as those caused by the saturation of hospital capacity, but has the advantage of being less influenced by the underestimation of the real infection rate due to asymptomatic cases or differences in testing strategies over time and regions [20]. The bars in Fig 3 show the weekly total excess mortality calculated from Eurostat statistics for most EU countries and the UK. The excess mortality is obtained from the difference between the reported fatalities and a modelled baseline estimated from historical data until 2019. The number of weekly fatalities attributable to COVID-19 peaked at the beginning of April, with about 41 400 deaths in excess compared to the baseline. (This peak represents 21 600 more cases than the excess mortality recorded in the same countries during the second week of January 2017, corresponding to a particularly severe year for the seasonal flu.) The lines in the figure show the median excess mortality in the NUTS3 regions classified according to their degree of urbanisation. In the third week of April, the median excess mortality in urban regions reached its peak of 73%, which was 58 pp higher than in intermediate regions and 68 pp higher than in rural regions in the same week. In the second wave of the pandemic, the disparities among regions appear less pronounced. There is also a reversal in the trend of excess mortality, with rural and intermediate regions having higher rates, 38% and 32% respectively, than urban regions, where the excess mortality rate was 26%.

Mobility is higher in urban regions

One possible explanation for the higher Rt and excess mortality in urban regions is that in cities the infection can spread more rapidly given the higher population density, greater use of public transportation and higher number of social interactions. The intensity of social interaction is reflected in mobility indicators which can be calculated from mobile phone data. In fact, the relation between intensity of social contacts, mobility and infection is the basis of the mobility restrictions that most governments have put in place to contain the pandemic. We analyse the patterns of mobility within, from and toward European regions with anonymised and aggregated mobility indicators derived from mobile phone data, as described in the Data and methods section. Fig 4 shows the median patterns of weekly mobility of 1033 NUTS3 regions in 22 EU countries, grouped by rural, intermediate and urban typology, in relative terms compared to the pre-lockdown levels (upper chart) and in per capita terms (lower chart). The trends in the two charts in Fig 4 reflect the implementation of generalised lockdown until April, the reopening during the summer period and the new restrictions on mobility after summer. The upper chart in Fig 4 shows that mobility was reduced more in relative terms in urban areas than in intermediate and rural ones. The lower chart in Fig 4 shows that during the first wave, and independently of the implementation of the restriction measures, the level of per capita mobility was higher in urban regions than in intermediate and rural ones. During the second wave, the per capita mobility is almost equal across all areas, indicating a substantial reduction of mobility in urban and intermediate regions at the beginning of summer and an increase in rural regions. 
This shift in mobility patterns is exemplified in Fig 5 showing the weekly relative changes in mobility for each Italian region (rows) in respect of the levels recorded during the last week of February. In this case, regions are sorted on the basis of their proximity to the sea or mountains to better appreciate the mobility linked to domestic tourism. In May, after the lifting of lockdown, all Italian regions recorded an increase of mobility to the levels of February. However, during summer, in coastal and mountain regions mobility increased to higher values than at the beginning of the year. The highest increase was recorded in the second week of August in the renowned region of Olbia in Sardinia (+373%). The fact that there was high mobility from urban to coastal and mountainous areas could have contributed to spreading the disease from cities to intermediate and rural areas. With the re-opening of schools in September, the level of mobility started again to increase uniformly across all regions.

The higher mobility in urban regions may explain a large part of the territorial gaps in Rt during the first wave

Table 1 shows the results of regressions on the first wave of infection considering Rt values in the 28 days after the start of the pandemic. Table 2 presents results for the second wave on the Rt values in the weeks after August. The results of the regressions show a significant relationship between the effective reproduction number, Rt, and the levels of urbanisation (Column 1 in Table 1). During the first wave of the pandemic, Rt values are lower in rural and intermediate regions than in the urban regions used as reference. Urban regions are therefore the most affected in the first weeks of the pandemic in terms of number of cases due to their high population density and large concentration of social interactions, as well as their high local and global connectivity (Balcan et al., 2009). In Columns 2–4 we include the mobility controls separately, i.e. internal, inbound and outbound mobility, given the correlation between these measures within countries. We use the three-week lagged value of each mobility variable in the regressions to account for the delay between the mobility-driven infection and the positive case confirmation and to mitigate a potential reverse causality problem between the two variables. A sensitivity check for the choice of alternative lag periods is shown in Fig 6. We selected a lag of three weeks, which maximises the positive coefficient of mobility during the second wave. Positive lags produce, as expected, negative coefficients, since mobility is reacting to restriction measures rather than driving the infection. In all specifications, each mobility indicator is positively correlated with Rt values, indicating that higher mobility is associated with higher transmission. The coefficient on lagged mobility ranges from 1.52 to 1.82, depending on the specification. Mobility is also analysed using a per capita mobility indicator (Column 5), which captures the daily movements per capita in a NUTS3 region. 
The positive and significant coefficient of the per capita mobility confirms a pattern of Rt that increases as the internal mobility measured on the total population increases. The demographic controls of the (log) total population and density, presented in Columns 6 and 7, also exert a positive effect on Rt in the first wave. Finally, in Columns 8 and 9, we simultaneously estimate the effect of the per capita internal mobility, the level of urbanisation of the regions and the population density and size. The main result of the estimates is that the increase in internal mobility is positively and significantly associated with the number of cases, with a stable coefficient across the different specifications. The coefficient of internal mobility indeed remains significant and positive even when we include the other control variables. Internal mobility appears to be a critical determinant of the rate of COVID-19 cases during the first wave, positively influencing the spread of the virus possibly through increased social interactions. These results confirm that a large part of the territorial characteristics influencing the higher epidemiological risk at the onset of the pandemic in urban regions can be explained by the role of mobility.
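The three-week lag applied to the mobility variables can be illustrated as follows. This is a minimal sketch with hypothetical names, showing how each week's Rt is paired with the mobility recorded three weeks earlier; weeks without a lagged value are dropped.

```python
def lag_series(weekly_values, lag_weeks=3):
    """Shift a weekly series so position w holds the value from week w - lag_weeks.

    Positions with no earlier value available are filled with None.
    """
    if lag_weeks < 0:
        raise ValueError("lag_weeks must be non-negative")
    n = len(weekly_values)
    return [None] * min(lag_weeks, n) + list(weekly_values[: max(n - lag_weeks, 0)])


def pair_rt_with_lagged_mobility(rt, mobility, lag_weeks=3):
    """Return (Rt, lagged mobility) pairs, skipping weeks without a lagged value."""
    lagged = lag_series(mobility, lag_weeks)
    return [(r, m) for r, m in zip(rt, lagged) if m is not None]


# Illustrative weekly values only, not data from the study.
pairs = pair_rt_with_lagged_mobility(
    rt=[1.5, 1.4, 1.3, 1.2, 1.1],
    mobility=[10, 20, 30, 40, 50],
)
```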

Table 1. Regression on Rt during the first wave (28 days since onset).

	Rt first wave
	(1)	(2)	(3)	(4)	(5)	(6)	(7)	(8)	(9)
Intermediate	−0.13**							0.03	0.05
	(0.05)							(0.03)	(0.04)
Rural	−0.17***							0.06*	0.08
	(0.06)							(0.03)	(0.05)
Internal -3W		1.82***
		(0.07)
Inbound -3W			1.52***
			(0.05)
Outbound -3W				1.53***
				(0.05)
Internal pca -3W					0.85***			0.89***	0.95***
					(0.09)			(0.09)	(0.10)
log(Population)						0.11***			0.06***
						(0.03)			(0.02)
log(Population density)							0.05***		−0.02
							(0.02)		(0.01)
AIC	11126.3	5134.8	4933.1	4941.3	5715.6	11118.2	11124.6	5716.7	5711
Observations	3,563	3,025	3,025	3,025	3,025	3,563	3,563	3,025	3,025
R²	0.02	0.31	0.35	0.35	0.16	0.02	0.02	0.16	0.17
Adjusted R²	0.01	0.31	0.35	0.35	0.16	0.02	0.01	0.16	0.16
F Statistic	7.42***	192.71***	235.70***	233.89***	83.74***	9.37***	8.44***	65.47***	54.58***
df	8; 3554	7; 3017	7; 3017	7; 3017	7; 3017	7; 3555	7; 3555	9; 3015	11; 3013

Note: *p<0.1; **p<0.05; ***p<0.01. Standard errors in parentheses.

Table 2. Regression on Rt during the second wave (after August).

	Rt second wave
	(1)	(2)	(3)	(4)	(5)	(6)	(7)	(8)	(9)
Intermediate	0.004***							0.004***	−0.004**
	(0.001)							(0.001)	(0.002)
Rural	0.01***							0.01***	−0.001
	(0.002)							(0.002)	(0.002)
Internal -3W		0.02***
		(0.003)
Inbound -3W			0.01***
			(0.001)
Outbound -3W				0.01***
				(0.001)
Internal pca -3W					−0.03***			−0.03***	−0.03***
					(0.01)			(0.01)	(0.01)
log(Population)						−0.01***			−0.01***
						(0.001)			(0.001)
log(Population density)							−0.004***		−0.002***
							(0.001)		(0.001)
AIC	-29920.5	-29875.7	-29856.1	-29871.3	-29884.7	-29957.2	-29933.6	-29934.3	-30013.5
Observations	10,720	10,716	10,711	10,713	10,717	10,720	10,720	10,717	10,717
R²	0.07	0.06	0.06	0.06	0.06	0.07	0.07	0.07	0.08
Adjusted R²	0.07	0.06	0.06	0.06	0.06	0.07	0.07	0.07	0.08
F Statistic	95.50***	103.95***	102.92***	104.58***	104.82***	114.48***	110.86***	87.88***	80.04***
df	8; 10711	7; 10708	7; 10703	7; 10705	7; 10709	7; 10712	7; 10712	9; 10707	11; 10705

Note: *p<0.1; **p<0.05; ***p<0.01. Standard errors in parentheses.

Table 2 examines the relationship between Rt and different mobility patterns in the European regions in a similar way to Table 1 but with data for the second wave (from August). The estimates show an inversion of sign from the first wave, with a positive association between the virus spread and the rural and intermediate regions compared to large cities. These results may reflect a behavioural response as well as more severe containment measures in the most severely affected areas. In the second wave, different mobility patterns are associated with lower Rt values, presenting a weaker relationship than in the first wave, as presented in Columns 2–6. The results thus indicate that the relationship between mobility and regional virus transmission has changed over time and that shifts in mobility were used to control the pandemic. However, these changes were not sufficient to prevent a second wave of infection in most of the regions analysed. The demographic variables have a significant and negative association with the virus spread. The models that simultaneously estimate the effect of internal mobility per capita and different regional characteristics show a negative relationship between this mobility pattern and the virus transmission, as well as a higher prevalence of the infection in rural regions compared to large cities during the second wave. Fig 6 shows the regression coefficients with internal mobility shifted by 0 to 3 weeks before and after the Rt reference week. This study is not aimed at a causality analysis between the two variables; however, we quantify the different time-lag effects to detect their potential influence on transmission, which is useful for disease monitoring policies. During the first wave the relation between mobility and Rt is positive and peaks during the same week (week 0). During the second wave, the relation is constantly decreasing towards negative values. 
The fact that the relation during the first wave becomes clearer towards the reference week indicates that mobility is having an effect on Rt. In contrast, during the second wave there is an inversion in the relationship: mobility, rather than influencing Rt, seems to react to changes in Rt by moving in the opposite direction. Intuitively, this is in line with the consideration that during the advanced stages of the pandemic, mobility is highly conditioned by restriction measures and closures that are put in place in response to increases in Rt. (A specification linking the disease values to mobility may suffer from reverse causality. To mitigate this potential problem, we use the three-week lagged value of each mobility variable in the regressions.)

Conclusion

In this article we analysed the territorial differences in the onset and spread of COVID-19 and the associated excess mortality, across the European NUTS3 regions and US counties during the first and second COVID-19 waves. During the first wave, the COVID-19 pandemic arrived earlier, recorded higher Rt values and had a higher impact in terms of excess mortality in urban regions compared to the intermediate and the rural ones. In the first wave, mobility influenced the spread of COVID-19, since the higher mobility of urban regions largely explains the differences between the three groups of regions. The fact that these effects are more difficult to recognise in later stages of the pandemic can be tentatively explained by the wide diffusion of the infection, the implementation of restriction measures, often applied on a territorial basis, which invert the link between mobility and Rt, and the more complex mobility patterns experienced during the summer period.

Our findings are in line with previous studies identifying the role of mobility on virus spread in the early stages of the pandemic. To our knowledge, our research is unique in providing a broad geographic coverage and a high level of geographical detail, and in examining the role of regional mobility for the spread of COVID-19 through a unique data set derived from mobile phone data. In terms of policy implication, our research contributes to a better understanding of territorial characteristics of the spread of COVID-19, which is critical for designing effective public health policy responses, often decided at regional level.

Supporting information

S1 File. Data and R code used for the regressions presented in Tables 1 and 2.

The ZIP file contains the two data files “first_wave.csv” and “second_wave.csv”, the R code “Script.R” and a readme file with the data description, “data_ReadMe.txt”. Please note that, due to the commercial sensitivity of the mobility data, we have added skewed non-negative random noise to the mobility columns; therefore, the statistical results of the models that include mobility data are not replicable.

(ZIP)
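One way to add skewed, non-negative random noise to a data column, as mentioned in the description above, is sketched below in Python. The exact procedure and distribution used for S1 File are not stated, so the exponential distribution, the function name `add_skewed_noise`, the `mean_noise` parameter and the sample values are all assumptions made for illustration.

```python
import random


def add_skewed_noise(values, mean_noise, seed=None):
    """Return a copy of `values` with independent, non-negative, right-skewed noise added."""
    if mean_noise <= 0:
        raise ValueError("mean_noise must be positive")
    rng = random.Random(seed)
    # expovariate(lambd) draws from an exponential distribution with mean 1 / lambd,
    # which is non-negative and right-skewed (an assumed choice of distribution).
    return [v + rng.expovariate(1.0 / mean_noise) for v in values]


# Hypothetical mobility column (illustrative values only).
mobility = [120.0, 95.5, 210.0, 40.25]
noisy = add_skewed_noise(mobility, mean_noise=5.0, seed=42)
print(noisy)
```

Because the noise is never negative, every perturbed value is at least as large as the original, so coefficients estimated on such perturbed columns should not be expected to match the published ones.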

Acknowledgments

The authors acknowledge the support of European MNOs (among which 3 Group, part of CK Hutchison, A1 Telekom Austria Group, Altice Portugal, Deutsche Telekom, Orange, Proximus, TIM Telecom Italia, Tele2, Telefonica, Telenor, Telia Company and Vodafone) in providing access to aggregate and anonymised data. The authors would also like to acknowledge the GSMA (GSM Association of Mobile Network Operators) and colleagues from Eurostat and the ECDC (European Centre for Disease Prevention and Control, an agency of the European Union) for their input in drafting the data request.

Finally, the authors would also like to acknowledge the support from JRC colleagues, and in particular the E3 Unit, for setting up a secure environment to host and process the data provided by MNOs, as well as the E6 Unit (the “Dynamic Data Hub team”) for their valuable support in setting up the database.

Data Availability

All data used in this study, except the mobility data, are openly available, and the data sources are specified in the data and methods section of the manuscript. Mobility data cannot be shared publicly for legal reasons. We have made publicly available, in the SI, all data used for Tables 1 and 2 as well as the R code to produce the nine models presented in the respective tables. We added random noise (non-negative and skewed) to the mobility data since, due to their commercial sensitivity, we cannot publish them. Interested researchers who wish to reproduce the regressions with mobility variables can use openly available mobility data from other sources, such as the Google mobility data. Future researchers can request access to the mobility data from the Mobile Network Operators who participate in the European Commission’s initiative to fight the COVID-19 pandemic. More information about this cooperation agreement can be found in the “Letter of Intent for Cooperation”, which is available at https://www.gsma.com/gsmaeurope/wp-content/uploads/2021/02/Letter-of-Intent_final_16-April-2021.pdf.

Funding Statement

The authors received no specific funding for this work.

References

1. Stier AJ, Berman MG, Bettencourt LMA. COVID-19 attack rate increases with city size; 2020.
2. Ribeiro HV, Sunahara AS, Sutton J, Perc M, Hanley QS. City size and the spreading of COVID-19 in Brazil. PLOS ONE. 2020;15(9):1–12. doi: 10.1371/journal.pone.0239699
3. Heroy S. Metropolitan-scale COVID-19 outbreaks: how similar are they? Available from: http://arxiv.org/abs/2004.01248.
4. Mazzoli M, Mateo D, Hernando A, Meloni S. Effects of mobility and multi-seeding on the propagation of the COVID-19 in Spain; 2020.
5. Gerritse M. Cities and COVID-19 infections: Population density, transmission speeds and sheltering responses; 2020.
6. Chinazzi M, Davis JT, Ajelli M, Gioannini C, Litvinova M, Merler S, et al. The effect of travel restrictions on the spread of the 2019 novel coronavirus (COVID-19) outbreak. Available from: https://www.sciencemag.org/lookup/doi/10.1126/science.aba9757.
7. Almagro M, Orane-Hutchinson A. The determinants of the differential exposure to COVID-19 in New York City and their evolution over time.
8. Agnoletti M, Manganelli S, Piras F. Covid-19 and rural landscape: The case of Italy. Landscape and Urban Planning. 2020;204:103955. doi: 10.1016/j.landurbplan.2020.103955
9. Bayer C, Kuhn M. Intergenerational ties and case fatality rates; 2020. Available from: https://voxeu.org/article/intergenerational-ties-and-case-fatality-rates.
10. Belloc M, Buonanno P, Drago F. Cross-country correlation analysis for research on COVID-19. VOX, CEPR Policy Portal; 2020. Available from: https://voxeu.org/article/cross-country-correlation-analysis-research-covid-19.
11. Obadia T, Haneef R, Boëlle PY. The R0 package: a toolbox to estimate reproduction numbers for epidemic outbreaks; 2012. doi: 10.1186/1472-6947-12-147
12. Wallinga J. Different Epidemic Curves for Severe Acute Respiratory Syndrome Reveal Similar Impacts of Control Measures. Available from: https://academic.oup.com/aje/article-lookup/doi/10.1093/aje/kwh255.
13. Du Z, Xu X, Wu Y, Wang L, Cowling BJ, Meyers LA. Serial Interval of COVID-19 among Publicly Reported Confirmed Cases; 2020. Available from: https://wwwnc.cdc.gov/eid/article/26/6/20-0357_article.
14. Guidotti E, Ardia D. COVID-19 Data Hub; 2020.
15. EDPB. Guidelines 04/2020 on the use of location data and contact tracing tools in the context of the COVID-19 outbreak; 2020.
16. Mamei M, Bicocchi N, Lippi M, Mariani S, Zambonelli F. Evaluating Origin–Destination Matrices Obtained from CDR Data. Sensors. 2019;19:1440. doi: 10.3390/s19204470
17. Fekih M, Bellemans T, Smoreda Z, Bonnel P, Furno A, Galland S. A data-driven approach for origin–destination matrix construction from cellular network signalling data: a case study of Lyon region (France); 2020.
18. European Commission. Communication from the Commission to the European Parliament and the Council: Staying safe from COVID-19 during winter, COM/2020/786; 2020.
19. Santamaria C, Sermi F, Spyratos S, Iacus SM, Annunziato A, Tarchi D, et al. Measuring the impact of COVID-19 confinement measures on human mobility using mobile positioning data. A European regional analysis. Safety Science. 2020;132:104925. doi: 10.1016/j.ssci.2020.104925
20. Bartoszek K, Guidotti E, Iacus SM, Okrój M. Are official confirmed cases and fatalities counts good enough to study the COVID-19 pandemic dynamics? A critical assessment through the case of Italy. Nonlinear Dynamics. 2020;101(3):1951–1979. doi: 10.1007/s11071-020-05761-w

PLoS One. doi: 10.1371/journal.pone.0280780.r001

Decision Letter 0

Celine Rozenblat

27 Apr 2021

PONE-D-21-07940

Territorial differences in the spread of COVID-19 in European regions and US counties

PLOS ONE

Dear Dr. Natale,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

The paper is of high interest, and the two Reviewers suggest very specific improvements that are very relevant from my point of view and often easy to address (most of them are minor revisions asking for clarification on some points). I hope that you will agree with these proposals and that we will be able to publish your paper soon.

I would add 3 points.

1) In review 2, you will find a request to pay more attention (in a short discussion) to the urban/rural definition (point 10 of that review). It is essential to refer to the EU+UN-Habitat+OECD work on urban/rural delineations, on which I guess you based your typology?

- Applying the Degree of Urbanisation — A methodological manual to define cities, towns and rural areas for international comparisons — 2021 edition, DOI: 10.2785/706535

A paragraph on this discussion would be appropriate as it is fundamental in your model.

For a comparison of different approaches of "what is a city?", I would suggest a synthesis that I made recently (But I would not want to oblige you to quote my own work):

- Rozenblat C. (2020). Extending the concept of city for the delineation of large urban regions (LUR) for the cities of the world, Cybergeo, https://doi.org/10.4000/cybergeo.35411

2) For the approach on mobility: I think that you should distinguish the "local mobility" that you consider from the "global" connectedness that could be evaluated by the air passenger flows of airports. I think that urban areas are also affected earlier because they are highly globally connected. For this discussion I would suggest:

- Balcan, D., Colizza, V., Gonçalves, B., Hu, H., Ramasco, J. J., & Vespignani, A. (2009). Multiscale mobility networks and the spatial spreading of infectious diseases. Proceedings of the National Academy of Sciences, 106(51), 21484-21489.

3) As this is a spreading infection, the discussion could also interpret (with some limits) the COVID diffusion processes from urban to semi-urban and rural areas.

Please submit your revised manuscript by Jun 11 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.
A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.
An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Celine Rozenblat

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. We note that you have indicated that data from this study are available upon request. PLOS only allows data to be available upon request if there are legal or ethical restrictions on sharing data publicly. For more information on unacceptable data access restrictions, please see http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions.

In your revised cover letter, please address the following prompts:

a) If there are ethical or legal restrictions on sharing a de-identified data set, please explain them in detail (e.g., data contain potentially sensitive information, data are owned by a third-party organization, etc.) and who has imposed them (e.g., an ethics committee). Please also provide contact information for a data access committee, ethics committee, or other institutional body to which data requests may be sent.

b) If there are no restrictions, please upload the minimal anonymized data set necessary to replicate your study findings as either Supporting Information files or to a stable, public repository and provide us with the relevant URLs, DOIs, or accession numbers. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories.

We will update your Data Availability statement on your behalf to reflect the information you provide.

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: No

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: No

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: No

Reviewer #2: No

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: This paper studies determinants of COVID-19 spread at a regional level in Europe and, to a lesser extent, gives descriptive statistics for US counties. Results are interesting regarding the respective roles of density and mobility, and are directly useful for policy making. The paper is well written and structured, and the analyses are technically sound. I suggest that the paper be published after minor revisions.

A few remarks:

- L46: are there other possible indicators to quantify epidemic spreading? More background on using the reproduction number could be given for readers not familiar with standards in epidemiology.

- L53: citations for both R packages are not provided; more details on the estimation procedure could also be given for more clarity - for example is there a time-window on which the reproduction number is computed? In this case is there some optimal window size, and which size is used here?

- A discussion on data quality and possible biases (for cases, mortality, and mobility) would be useful to add.

- Linear statistical models are used here; could non-linear relationships be also considered?

- The results comparing Europe and the US contrast with the rest of the results, which focus on Europe and are more detailed. Furthermore, urban contexts and definitions of cities are quite different in Europe and the US, so the comparison may not be that straightforward. I would suggest putting US counties in the supplementary material and focusing the paper on Europe, with robust and comparable results.

- L241: using an instrumental variable to disentangle causalities between mobility and reproduction number is suggested; more details could be provided, in particular which kind of variable could be used?

- L244: as lagged regressions are used, Granger causality tests could be provided here, or a study of lagged correlations.

- Although mobility data can not be made publicly available for privacy reasons, source code should be provided (with synthetic mobility data for example) for replication on other case studies.

Reviewer #2: Summary:

The submitted paper aims to examine urban-rural gradients of COVID-19 spread and excess mortality. The authors first present data suggesting that the pandemic started earlier, spread faster, and was more deadly in more urban regions (at least for the first wave). They then show that mobility is generally higher in more urban areas and attempt to explain the increased spread and mortality as a result of this higher mobility.

Paper Strengths:

The broad coverage of geographical areas in the paper’s data is impressive and of general interest. In addition, the paper aims to highlight the nuance of infectious disease spread in different types of human settlements. These details are likely important to keep in mind as we continue to manage the spread of COVID-19, roll out vaccination programs, and plan for future infectious disease outbreaks.

Paper Weaknesses:

Despite the strengths of this paper and the interest of the data it presents, there are major methodological concerns, clarity issues, and discrepancies between the data and conclusion that need to be addressed.

1) The authors should include a brief summary of the parameters used while calculating Rt. Because the authors aggregate to weekly averages, it is important for readers to understand that these are sliding windows.

2) The authors should justify their choice of 9 regression models and provide statistics to compare between models. If drawing conclusions from multiple models, multiple comparison correction should be employed.

3) The authors include a model called “Internal pca” which is not described in the caption or mentioned in the main text or methods.

4) The authors measure pandemic onset by the number of days between the beginning of 2020 and when the 20th confirmed case occurred. How was this number chosen? Are the results sensitive to different choices?

5) When comparing differences in Rt, mobility, and excess mortality, across urban-rural gradients, the authors should employ appropriate statistical tests and multiple comparison corrections (they certainly have enough data).

6) The methods section claims that the excess mortality time series spans from 2001 to 2015 but the results section claims that the baseline was calculated from data between 2011 and 2019. The authors should clarify and clearly state how the baseline was calculated (and with which data).

8) I would suggest the authors use log-population instead of population in their regression models as that is often a better indicator of ecological measures (e.g. Rt and mobility).

9) In the introduction the authors cite three papers which look at the relationship between city population and the spread of covid-19, and claim that these studies refer to population density. In fact, these studies are all based on urban scaling theory (Bettencourt, 2013) and specifically aim to understand the role of social network density in covid-19 spread. The distinction between social network density and population density is an important theoretical one and should be acknowledged.

10) Also in the introduction and the results it is crucial that the authors discuss the definitions of urban, intermediate, and rural regions being used. There are a number of different ways to decide what is a city (Taubenböck, 2012), and these choices might impact the interpretation of results.

11) There is inconsistency in the terminology used for the three mobility indicators. I would suggest sticking to a single choice for each indicator to improve clarity.

12) The authors should justify the comparison of models with different temporal shifts and put forth a hypothesis for how the results of these comparisons might indicate causality. As it stands, their data does not suggest any causal relationship between reproductive numbers and mobility. Part of this is that the methodology needs to be more clear.

13) The authors say that “mobility changes may respond endogenously to Rt values”. I question whether people actually respond to Rt values rather than perceived risk. It might be the case that people are checking Rt values on the internet and changing their behavior, but if so citations or additional evidence are needed.

14) Without proper statistics for model comparison, the discussion of the meaning of regression coefficients is difficult to justify. The authors should also address the fact that their regression models explain such a small proportion of the total variance in reproductive numbers.

15) Figure 2 should have some sort of indication of the error in the calculated Rt values.

Figure 3 is confusing and the excess mortality axis should be colored to match the bars.

Figure 5 is missing a description of what the color bar means.

Figure 6 is missing an x axis label.

16) The authors state that “mobility explains entirely the differences between the three groups of regions”. While it is true that differences in Rt by regional characterization are non-significant when conditioning on mobility, this statistical model treats regions with different levels of urbanization similarly. I suggest that the authors run an additional model where the level of urbanization is treated as a random effect to allow for different relationships between mobility and Rt in the different types of regions: a 50% reduction of mobility in a sparsely populated rural area may have only a very small effect on Rt, while a similar reduction in mobility might drastically impact spread in a large city.
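A simplified way to look at the heterogeneity raised in point 16 is to fit the mobility slope separately for each type of region. The sketch below does this in Python on made-up data. It is a stand-in for, not an implementation of, the random-effects model suggested above, and the function names `slope` and `slopes_by_group` as well as all numbers are hypothetical.

```python
from collections import defaultdict
from statistics import mean


def slope(x, y):
    """Least-squares slope of y on x."""
    mx, my = mean(x), mean(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    if sxx == 0:
        raise ValueError("x has no variation")
    return sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / sxx


def slopes_by_group(rows):
    """Fit one mobility slope per region type from (group, mobility, rt) rows."""
    grouped = defaultdict(list)
    for group, mobility, rt in rows:
        grouped[group].append((mobility, rt))
    return {
        group: slope([m for m, _ in obs], [r for _, r in obs])
        for group, obs in grouped.items()
    }


# Synthetic rows: Rt responds strongly to mobility in urban regions, weakly in rural ones.
rows = [("urban", m, 1 + 0.2 * m) for m in (1, 2, 3, 4)]
rows += [("rural", m, 1 + 0.02 * m) for m in (1, 2, 3, 4)]

group_slopes = slopes_by_group(rows)
print(group_slopes)
```

Separate fits ignore the pooling across groups that a mixed model provides, so they only give a first impression of whether the mobility effect differs by typology.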

Bibliography:

Bettencourt, L. M. (2013). The origins of scaling in cities. Science, 340(6139), 1438-1441.

Taubenböck, H., Esch, T., Felbier, A., Wiesner, M., Roth, A., & Dech, S. (2012). Monitoring urbanization in mega cities from space. Remote sensing of Environment, 117, 162-176.

**********

6. PLOS authors have the option to publish the peer review history of their article. If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2023 Feb 8;18(2):e0280780. doi: 10.1371/journal.pone.0280780.r002

Author response to Decision Letter 0

7 Jun 2021

We thank the editor and reviewers for the very useful suggestions provided. We provided detailed replies to each comment and results of sensitivity checks and additional analyses in the replies to reviewers document.

Attachment

Submitted filename: Replies to reviewers.docx

PLoS One. doi: 10.1371/journal.pone.0280780.r003

Decision Letter 1

Tzai-Hung Wen

6 Nov 2022

PONE-D-21-07940R1

Territorial differences in the spread of COVID-19 in European regions and US counties

PLOS ONE

Dear Dr. Natale,

Please submit your revised manuscript by Dec 21 2022 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.
A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.
An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Tzai-Hung Wen, Ph.D.

Academic Editor

PLOS ONE

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #3: All comments have been addressed

Reviewer #4: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

Reviewer #3: Yes

Reviewer #4: (No Response)

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #3: Yes

Reviewer #4: (No Response)

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

Reviewer #3: Yes

Reviewer #4: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

Reviewer #3: Yes

Reviewer #4: Yes

**********

6. Review Comments to the Author

Reviewer #3: In the current work the authors study the territorial differences during the spread of COVID-19 using indicators like Rt and the excess mortality for European regions and US counties. Then, they study the linear relation between Rt and mobility to confirm their hypothesis.

The paper is innovative and technically sound, the results strongly support the authors' claims, and the scientific problems were researched thoroughly. More specifically, their hypothesis that mobility is a major factor creating the territorial differences is verified in the first wave.

In my opinion, this work should be accepted as is, as the authors revised the paper according to the previous reviewers’ comments and answered all the questions convincingly. Given that the produced results differ for different waves it would be interesting to observe the application of the same analysis to the 3rd or 4th wave. Moreover, another extension of this work would be to focus the study on the reasons of the differences between the waves.

Reviewer #4: The revised manuscript entitled “Territorial differences in the spread of COVID-19 in European regions and US counties” evaluated transmission dynamics in Europe and the US. In general, the authors have responded to all the comments from the previous reviewers. Below are some additional comments:

2. What kind of regression did you apply in the analysis? The approach should be mentioned in the methods, but you described OLS and other details in the results. Why do you include 3-week lags in the model? Is there any scientific evidence to support the 3-week duration?

3. Although the title indicates that the analysis will compare Europe and the US, it seems that the analysis of the US only appears in Figures 1 and 2. The excess mortality and mobility analyses focus only on Europe. The authors need to describe this clearly.

4. Tables 1 and 2: what is the number in parentheses for each coefficient? The p-value?

5. The adjusted R-squared values in the regression models are very low. Are you able to draw reliable conclusions from the models?

**********

7. PLOS authors have the option to publish the peer review history of their article. If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #3: No

Reviewer #4: No

**********

PLoS One. 2023 Feb 8;18(2):e0280780. doi: 10.1371/journal.pone.0280780.r004

Author response to Decision Letter 1

15 Nov 2022

Dear Editor,

The paper was subject to two full rounds of peer review and has long been awaiting publication. We believe that we have fully addressed the comments of the first two reviewers and, in this last round, also the points raised by reviewer 4. We thank you and all the anonymous reviewers for their time and interest in revising the manuscript.

Reviewer 3

Response: The paper was originally submitted in March 2021 and has undergone a long review process. During this long period the pandemic has progressed into 3rd or 4th waves. We have checked the mortality data for the EU, and our findings about the lack of rural-urban differences are confirmed in the subsequent periods. As we indicate in the discussion, the major difference between the first and second waves is linked to the widespread territorial diffusion of the infection following its initial onset in cities. We expect that a similar effect continues to explain the rural-urban inversion in the later periods. We will consider the suggestions for further work.

Reviewer 4

1. What is the motivation to compare the COVID-19 transmission in Europe and the US? What’s the definition of NUTS3 region? What is Rural-Urban Continuum Codes applied to the US? Although you indicated the detail in the footnote, I suggest giving brief description about this information.

Response: We considered two areas of the world for which sufficiently detailed data on COVID-19 cases were available at weekly and regional levels, together with a harmonised classification allowing a comparison by rural-urban typology. We added further details on the classification of regions and rural-urban typologies in the EU and US.

2. What kind of regression you applied in the analysis?? The approach should be mentioned in the method but you described OLS and other detail in the result.

Response: We provided more details on the regression and moved its description to the methods section.

Why do you include 3-week lags in the model?? Any scientific evidence to support the 3-week duration?

Response: In the regression analyses, we use the mobility variables lagged by 3 weeks relative to the Rt value to account for the delay between infection and confirmation of the positive case. A sensitivity analysis of the lag is given in Figure 6. We selected a lag of 3 weeks in the past, which maximises the positive coefficient of mobility during the second wave. Positive lags produce, as expected, negative coefficients, since mobility is then reacting to the spread of infection through restrictions rather than driving it. We added more details on the choice of the lag in the text. Also, for Italy, Carteni et al. (2020) report that trips made three weeks earlier are the main determinants of new daily cases.

With data on counties in the US between January and April 2020, Badr et al. (2020) show that declining mobility is strongly correlated with lower COVID-19 case growth rates, and that the impact may not be evident for up to three weeks. This is consistent with the incubation period of the first virus variants plus the additional time for reporting.
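The lag sensitivity check described above can be illustrated with a minimal sketch. It is not the R code from S1 File: the data are synthetic, the true delay of 3 weeks and the effect size of 0.5 are assumptions built into the simulation, and the helper names `ols_slope` and `slope_by_lag` are hypothetical. Here a lag of k means that Rt in week t is regressed on mobility in week t - k, so positive k refers to past mobility; this sign convention is chosen for the sketch only.

```python
import random


def ols_slope(x, y):
    """Slope of a one-regressor OLS fit of y on x (with intercept)."""
    n = len(x)
    mx = sum(x) / n
    my = sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    return sxy / sxx


def slope_by_lag(mobility, rt, lags):
    """Regress Rt in week t on mobility in week t - lag, for each lag."""
    n = len(rt)
    out = {}
    for lag in lags:
        # Keep only the weeks for which the shifted mobility value exists.
        pairs = [(mobility[t - lag], rt[t]) for t in range(n) if 0 <= t - lag < n]
        x, y = zip(*pairs)
        out[lag] = ols_slope(x, y)
    return out


# Synthetic weekly series (assumption): Rt responds to mobility 3 weeks earlier.
random.seed(0)
TRUE_LAG = 3
n_weeks = 300
mobility = [random.uniform(-1, 1) for _ in range(n_weeks)]
rt = [
    1.0
    + (0.5 * mobility[t - TRUE_LAG] if t >= TRUE_LAG else 0.0)
    + random.gauss(0, 0.1)
    for t in range(n_weeks)
]

slopes = slope_by_lag(mobility, rt, range(-3, 4))
best_lag = max(slopes, key=slopes.get)  # lag with the largest positive coefficient
for lag in sorted(slopes):
    print(f"lag {lag:+d}: slope {slopes[lag]:.3f}")
print("lag with the largest coefficient:", best_lag)
```

Because the simulated mobility is independent from week to week and does not react to infections, the sketch only shows how the coefficient peaks at the true delay; it does not reproduce the negative coefficients that the response attributes to mobility reacting to restrictions.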

Response: We added a sentence clearly stating that the analysis of mobility covers only the EU due to the lack of sufficiently detailed mobility data for the US or other parts of the world.

4. Table 1 and 2. What is the number in the parentheses of each coefficient? P-value?

Response: We added to the table captions a sentence specifying that the numbers in parentheses are standard errors.

5. The adjusted R-squared values in the regression models are very low. Are you able to draw reliable conclusions from the models??

Response: Our purpose is not to predict the spread of the pandemic; for that, we agree that a higher R-squared would be needed. Predicting the spread of infections would require, rather than simple OLS, proper epidemiological models able to capture the typical temporal dynamics of epidemics. In our exercise we ignore these temporal dynamics and focus on the territorial differences, which explains the relatively low R-squared. The main aim of the paper is to see whether the difference between rural and urban regions is significant and what role mobility plays in explaining that difference. We tested an alternative formulation of the model that includes a time dimension as a control variable. This dramatically increases the R-squared, reflecting the time dependency in Rt values, but has the downside of cancelling out and absorbing most of the effect of our variables of interest.
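The point about R-squared can be illustrated with a small synthetic sketch. It is not the model from Tables 1 and 2: the panel of 200 regions over 40 weeks, the common weekly component, the urban gap of 0.15 and the noise level are all assumptions made for the example. It shows that a pooled regression of Rt on an urban dummy can have a very low R-squared while the urban-rural gap is still estimated precisely, and that adding week fixed effects raises the R-squared sharply. In this balanced toy panel the urban dummy does not vary over time, so the sketch does not reproduce the absorption of the variables of interest mentioned in the response.

```python
import math
import random


def mean(values):
    return sum(values) / len(values)


# Synthetic panel (assumption): Rt = 1 + common weekly component + urban gap + noise.
random.seed(1)
n_weeks, n_regions = 40, 200
rows = []  # (week, urban dummy, Rt)
for w in range(n_weeks):
    week_effect = 0.5 * math.sin(2 * math.pi * w / n_weeks)
    for r in range(n_regions):
        urban = 1 if r < n_regions // 2 else 0
        rows.append((w, urban, 1.0 + week_effect + 0.15 * urban + random.gauss(0, 0.1)))

y = [row[2] for row in rows]
y_bar = mean(y)
tss = sum((v - y_bar) ** 2 for v in y)

# Pooled OLS on the urban dummy: fitted values are the two group means.
y1 = [row[2] for row in rows if row[1] == 1]
y0 = [row[2] for row in rows if row[1] == 0]
m1, m0 = mean(y1), mean(y0)
gap = m1 - m0
rss_pooled = sum((v - m1) ** 2 for v in y1) + sum((v - m0) ** 2 for v in y0)
r2_pooled = 1 - rss_pooled / tss
se_gap = math.sqrt(rss_pooled / (len(y) - 2) * (1 / len(y1) + 1 / len(y0)))
t_stat = gap / se_gap

# Week fixed effects via within-week demeaning of Rt and of the dummy.
week_y = {w: mean([row[2] for row in rows if row[0] == w]) for w in range(n_weeks)}
week_u = {w: mean([row[1] for row in rows if row[0] == w]) for w in range(n_weeks)}
yd = [row[2] - week_y[row[0]] for row in rows]
ud = [row[1] - week_u[row[0]] for row in rows]
slope_fe = sum(a * b for a, b in zip(ud, yd)) / sum(a * a for a in ud)
rss_fe = sum((b - slope_fe * a) ** 2 for a, b in zip(ud, yd))
r2_fe = 1 - rss_fe / tss  # share of total variance explained by weeks plus dummy

print(f"pooled: gap={gap:.3f}, t={t_stat:.1f}, R2={r2_pooled:.3f}")
print(f"week fixed effects: gap={slope_fe:.3f}, R2={r2_fe:.3f}")
```

Absorption of the kind the authors describe would be expected when the regressor itself moves mostly with time, as mobility did under restrictions; that case is deliberately left out of this sketch.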

Attachment

Submitted filename: Reply to reviewers2.docx


PLoS One. doi: 10.1371/journal.pone.0280780.r005

Decision Letter 2

Tzai-Hung Wen

10 Jan 2023

Territorial differences in the spread of COVID-19 in European regions and US counties

PONE-D-21-07940R2

Dear Dr. Natale,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Tzai-Hung Wen, Ph.D.

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

Reviewer #4: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

Reviewer #4: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #4: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

Reviewer #4: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

Reviewer #4: Yes

**********

6. Review Comments to the Author

Reviewer #4: (No Response)

**********

7. PLOS authors have the option to publish the peer review history of their article. If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #4: No

**********

PLoS One. doi: 10.1371/journal.pone.0280780.r006

Acceptance letter

Tzai-Hung Wen

13 Jan 2023

PONE-D-21-07940R2

Territorial differences in the spread of COVID-19 in European regions and US counties

Dear Dr. Natale:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Tzai-Hung Wen

Academic Editor

PLOS ONE

Associated Data


Supplementary Materials

S1 File. Data and R code used for the regressions presented in Tables 1 and 2.

(ZIP)


