Abstract
We study the spatio-temporal spread of SARS-CoV-2 in Santiago de Chile using anonymized mobile phone data from 1.4 million users, 22% of the whole population in the area, characterizing the effects of non-pharmaceutical interventions (NPIs) on the epidemic dynamics. We integrate these data into a mechanistic epidemic model calibrated on surveillance data. As of August 1, 2020, we estimate a detection rate of 102 cases per 1000 infections (90% CI: [95–112 per 1000]). We show that the introduction of a full lockdown on May 15, 2020, while causing a modest additional decrease in mobility and contacts with respect to previous NPIs, was decisive in bringing the epidemic under control, highlighting the importance of a timely governmental response to COVID-19 outbreaks. We find that the impact of NPIs on individuals’ mobility correlates with the Human Development Index of comunas in the city. Indeed, more developed and wealthier areas became more isolated after government interventions and experienced a significantly lower burden of the pandemic. The heterogeneity of COVID-19 impact raises important issues in the implementation of NPIs and highlights the challenges that communities affected by systemic health and social inequalities face adapting their behaviors during an epidemic.
Subject terms: Viral infection, Epidemiology, Complex networks
Fine-grained studies of epidemic spread and of the effect of nonpharmaceutical interventions are still needed to underpin demographic and socio-economic effects. Here, the authors study the spatial and temporal spread of SARS-CoV-2 in Santiago de Chile using anonymized mobile phone data.
Introduction
As of September 1, 2020, Chile has reported more than 400,000 cases and 590 SARS-CoV-2 deaths per million, becoming one of the worst COVID-19 epidemic globally1. Officially, the first SARS-CoV-2 case in Chile was detected on March 3, 20202. Although other cases were rapidly confirmed all over the country, the urban area of the capital city, Santiago Metropolitan Region, quickly became the epicenter of the national epidemic. Indeed, as of September 1, 2020, about 70% of the total cases in the nation have been reported in the comunas (i.e., municipalities) of Santiago, making it one of the largest urban COVID-19 outbreak in the world. The first set of nonpharmaceutical interventions (NPIs) were put in place in mid-March, when schools were closed, public gatherings were banned, and passengers traveling from high-risk countries were mandated to self-isolate for 14 days. However, the adopted measures were not able to contain the contagion: after a sharp increase in cases, a full lockdown was instituted to the whole Metropolitan Region on May 153.
In this work, we model the spatial and temporal spread of SARS-CoV-2 in 37 comunas of the urban area of Santiago which comprises 6.4 million individuals. We aim to provide a data-driven characterization of the unfolding of the COVID-19 epidemic and an estimation of the impact of NPIs on its spreading. Research on similar geographical scales has been conducted, for example, for Boston4, Wanzhou5, New York City6, and London7. While COVID-19 is clearly a global issue, the measures adopted to mitigate and/or suppress the spread of SARS-CoV-2 have been quite heterogeneous across countries and often across sub-regions within the same country8–10. Hence, modeling specific contexts and spatial scales is key to identify separate effects of NPIs, their practical impact when revoked, and possibly reintroduced over an extended period of time. We study the reduction of mobility and contacts inferred from mobile devices as input for a spatially and age-structured epidemic model. In fact, mobile device data can be used to evaluate, in near real-time, the effects of interventions and self-initiated behavioral changes on the mobility of people and to inform large scale epidemic models11–15. Here, we use anonymised data provided by a major mobile phone operator in South America (Telefónica Movistar), with a market share of 24.61% as of March 2020.
To characterize the changes in mobility and physical contacts during the outbreak, we used anonymised data from 1.4 million mobile devices (about 22% of the total population in the comunas under consideration). We find consistent downward trends coinciding with the NPIs issued by local and national authorities. We estimate that the first set of NPIs issued on March 16 led to a reduction of about 48% in the number of travels between comunas. An additional 17% reduction is observed with the introduction of the full lockdown on May 15.
We develop a stochastic mechanistic epidemic model integrating mobility, physical contacts, and census data. The model suggests that the full lockdown, while causing a modest additional decrease in mobility and physical contacts with respect to the NPIs already in place, was decisive in bringing the epidemic under control. This relatively small additional decrease in mobility and contacts was enough to push the effective reproductive number below the critical value of 1, a clear example of the threshold effects characterizing epidemic dynamics on structured mobility networks16. We estimate that the full lockdown prevented an additional 34.7% (95% CI: [27.2%, 44.1%]) increase in the total number of deaths. In addition, we estimate the critical impact of the timing of the full lockdown through counterfactual scenarios: an additional week of delay would have corresponded to an 18.1% (95% CI: [6.0%, 34.0%]) more intense incidence peak according to our estimates.
Despite being regarded as a high-income country, Chile and its capital city show concerning social and economic inequalities17. Unfortunately, this makes Santiago a natural experiment to investigate the link between socioeconomic disparities and the burden of the pandemic. We explore this important dimension finding that changes in mobility patterns strongly correlate with the economic and development indicators of comunas. Furthermore, the model links these observations with heterogeneous burden of COVID-19 across comunas which is also observed in the epidemiological data reported by the national surveillance. More precisely, our results suggest higher attack and death rates in disadvantaged areas. Due to challenges faced in reducing their mobility and contacts, communities exhibiting systemic social disparities are affected in a differential way by government-mandated NPIs and disease’s burden. This observation raises the key issue of health disparities in the management of emerging infectious diseases such as COVID-19.
Results
We evaluate the effects of NPIs policies, government-mandated mobility limitations by integrating mobile phone data and an epidemic model. We identify three phases of the epidemic management in Santiago: (i) before March 16 (business as usual, baseline), (ii) between March 16 and May 15 (first set of NPIs), and (iii) after May 15 (full lockdown). For convenience, we will refer to the period March 16 to May 15 as the partial lockdown and to the period after May 15 as the full lockdown.
It is important to notice how the timeline of interventions is fairly complex. It includes night curfews, dynamic quarantine, and lockdowns restricted to a few comunas across the region studied here and in other parts of Chile2. However, as we see below, the data suggest that those measures did not have a significant impact on people’s behaviors, thus for simplicity we consider the two main sets of NPIs only. We characterize the three phases outlined above in terms of (a) mobility among comunas, and (b) contacts reduction between individuals. Mobility describes the (varying) rates at which people travel among different comunas, while contacts reduction parameters estimate to what extent physical contacts varied in time in each comuna (more details in the “Methods” section).
Effects of NPIs and social inequalities
In Fig. 1a, we provide an overview of mobility in Santiago during the period of study. As a proxy for general mobility, we consider the number of devices visiting a comuna that is different from their home one (see “Methods” section). We observe a sharp drop following the first set of interventions on March 16. Afterward, mobility remains fairly constant until the introduction of the full lockdown on May 15, when we observe an additional 17% decrease. As we will show below, this intervention represented an important tipping point of the epidemic in Santiago.
More in detail, we represent changes in mobility flows across comunas in Fig. 1b. The partial lockdown causes an average drop of about 48%, while, with the introduction of the full lockdown, mobility across comunas drops by 65% with respect to the baseline. For each comuna we also consider the mean percentage decrease in mobility after March 16 and compare it with the Human Development Index (HDI), a coefficient that measures key aspects of human development, such as life expectancy, education, and per capita income18 (see the “Methods” section for more details on HDI calculation). In Fig. 1c, we observe that a greater decrease in mobility is generally associated with a higher HDI (Pearson correlation coefficient ρ = − 0.80, p < 0.001). The same trend is observed in the absolute change of mobility (see Supplementary Fig. 5) suggesting that wealthier and more developed comunas became more isolated after the interventions. This result is in line with previous studies that showed how changes in mobility patterns following government-issued interventions, and the extent to which people can afford social distancing, vary across different socio-demographic groups15,19–22.
In Fig. 1d, we represent contacts reduction parameters. Across the board, contacts drop by 36% with the first set of NPIs policies and by an additional 11% with the lockdown. Since all points are below the diagonal, we conclude that, with the introduction of the full lockdown, contacts decrease further in all comunas. Also, the decrease is consistent with the existing reductions after the partial lockdown. Indeed and show a high significant correlation (Kendall rank correlation coefficient τ = 0.79, p < 0.001).
The spread of COVID-19 in Santiago
We use the mobility data to develop and inform a stochastic mechanistic metapopulation epidemic model (see “Methods” for details) and simulate the spread of COVID-19 in the comunas of Santiago. The model is calibrated on official surveillance data and takes as initial seeding the realistic projections of active cases on March 1, 2020, in the Metropolitan area of Santiago from ref. 14.
We use an Approximate Bayesian Computation (ABC) approach23,24 (see details in the “Methods” section) to find the posterior distribution of the reproductive number in Santiago (median R0 = 2.66, 95% CI: [2.58, 2.72]), which is in line with previous findings that identify the value of R0 of SARS-CoV-2 to be in the range between 2 and 3 in different countries25–27. In Fig. 2a, we report the number of weekly deaths projected by the model together with official figures (used for calibration). The two time series show a good agreement with a high correlation (ρ = 0.99, p < 0.001) and a median absolute percentage error of 12%. Interestingly, the agreement between data and model starts to deviate in the last two data points (end of July early August). We can speculate that, at least in part, this might be due to a lockdown fatigue. In fact, while the official restrictions were relaxed later (mid of August), a decrease in compliance, linked to the reduction of cases/deaths, could have taken place earlier. The period is outside our current data coverage. Hence, we leave testing such hypothesis to future work.
As of August 1, 2020, the median projected fraction of infected individuals in the area under study is 38.7% (95% CI: [35.1%, 41.6%]). This estimate is about tenfold the official reported figures. We are not aware of publicly available seroprevalence studies that we can use as a comparison and validation. For this reason, we can only consider qualitative evidence hinting that the Chilean outbreak has affected a significant fraction of the population. For example, the share of positive COVID-19 tests peaked at 59.10% on June 18, and in the Santiago Metropolitan Region the occupation of ICU beds reached almost saturation level (95%) in May. Also, a recent epidemiological study aimed at characterizing the first wave in Chile showed significant under-reporting of symptomatic cases (around 50%) based on estimates of the Case Fatality Rate2. Another recent modeling study focused on the COVID-19 outbreak in Santiago estimates a number of infected individuals 5–10 times larger than the official figures28. Finally, previous seroprevalence studies conducted, for example, in the United States29, Spain30, Italy31, Brazil32, and Iran33, showed that the actual number of COVID-19 infections is several times (factors vary from 4 to 20) those reported by the official surveillance. As a sensitivity check, we repeated the calibration considering the upper limit of the 95% credible interval for the Infection Fatality Rate from ref. 34. This leads to a projected median prevalence of 28.3% (95% CI: [23.5%, 32.3%]) but to a sensibly worse fit of the data (ρ = 0.82, median percentage error of 49%).
Projected cases present a significant correlation with official numbers as can be observed in Fig. 2b (ρ = 0.84, p < 0.001). Besides, we compare the dates when 200 infections have been reached in different comunas according to our model and official surveillance, finding a significant correlation (Kendall rank correlation coefficient τ = 0.61, p < 0.001). Similar results are found considering instead the dates when 50, 100, and 500 cases have been reached (see Supplementary Fig. 6). Interestingly, the same dates estimated through modeling are much earlier, hinting that many of the infections in the initial phase of the spreading went unreported. In Fig. 2c, we show the attack rates versus the HDI of different comunas. We find a strong correlation between attack rates computed on officially reported cases and HDI (ρ = −0.74, p < 0.001), providing evidence that wealthier comunas experienced significantly smaller outbreaks. In addition, the very same picture emerges from our modeling results. Indeed, simulated attack rates present a high significant correlation with HDI (ρ = −0.69, p < 0.001). Finally, in Fig. 2d we show the number of deaths per 1000 versus the HDI of different comunas. We find a significant correlation between HDI and both simulated (ρ = −0.50, p < 0.002) and officially reported (ρ = −0.40, p < 0.02) deaths, hinting that wealthier comunas experienced also a smaller burden in terms of casualties. We note that the correlations obtained in this case are lower than those found previously for the attack rates. This may be due to the interplay between diverse age distributions and age-dependent mortality rates. Indeed comunas with higher HDI have a higher mean age and the infection fatality rate for COVID-19 is significantly higher in older age brackets.
In Section 2 of the Supplementary Information, we compare our modeling approach and results with three simpler models. The first neglects the mobility between comunas. In other words, we attempt to fit the evolution of deaths in the region by considering each comuna as a separate population. Interestingly, this approach leads to a median absolute percentage error of 38% which implies a worse performance than the main model discussed above (12%). This result highlights the importance of capturing the coupling between areas and accounting for spatio-temporal heterogeneities in spreading patterns which are shaped, among other factors35, by mobility. The second and third consider the entire metropolitan area as a single, age-structured, population. Similar approaches have been proposed to model the spreading of SARS-CoV-2 and the impact of NPIs in cities7, regions15, and countries36. The difference between these two models lays on the data we used to capture the effects of NPIs on contact matrices. In one, we use estimates measured from the mobile phone data. In the other instead, we use the Google Mobility Reports37 and the Oxford COVID-19 Government Response Tracker38,39 (see Section 2 of the Supplementary Information for details). Both models lead to inferior performance, with a median absolute percentage error of 18% and 43%, respectively.
Overall, out of these three simpler models, the second is the closest to the performance of our main modeling approach described above. However, by construction, it does not provide any information about the heterogeneous spread and impact of the virus across comunas.
Counterfactual scenarios
To assess the impact of heterogeneous responses to the spreading we run a hypothetical scenario in which mobility and contacts decrease uniformly across comunas, starting on March 16. More in detail, we apply to all comunas the average reduction in mobility and contacts observed for the 4th quartile of HDI (i.e., 25% comunas with higher HDI). See Section 3 of the Supplementary Information for more details. According to our simulations, this leads to a significant decrease of cases and deaths: −83.8% (95% CI: [−77.6%, −88.6%]) fewer cases and −70.5% (95% CI: [−55.0%, −80.9%]) fewer deaths as of May 15, the date when the full lockdown was enforced. Interestingly, the uniform reduction we are imposing implies a relatively modest additional decrease in mobility and contacts with respect to the ones estimated through mobile phone data. In this hypothetical scenario, with the partial lockdown mobility rates drop by 55% and contacts by 49% (versus, respectively, the 48% and the 36% estimated in our main analysis). Although such homogeneous reduction across comunas is a theoretical exercise that does not consider complex socioeconomic constraints, that go from the collective need to keep key supply chains active to the individual imperative to feed their own family, it crystallizes the dramatic effects of inequality on disease spreading on the one side, and it shows the positive benefits of equal, early, and strong responses on the other.
We also use the model to investigate counterfactual scenarios aimed at estimating the impact of NPIs on the spread of COVID-19 in Santiago. As a first counterfactual scenario, we simulated the epidemic in the absence of a full lockdown. From Fig. 3a, we observe that this leads on average to a 21.6% (95% CI: [7.5%, 41.3%]) more intense incidence peak and 34.7% (95% CI: [27.2%, 44.1%]) more deaths. To estimate the impact of the timing of the full lockdown, we run simulations where we anticipate or delay it up to 4 weeks. According to results in Fig. 3b, an earlier lockdown implies a less intense incidence peak (from around −20% to −35%). It is interesting to note, however, that a delay of 1–2 weeks has a very similar effect of no intervention at all in terms of incidence peak intensity. More specifically, 1 week delay causes a 18.1% (95% CI: [6.0%, 34.0%]) while two weeks delay cause a 21.6% (95% CI: [7.4%, 41.1%]) more intense incidence peak. The timing of the full lockdown also has a significant effect on the number of deaths. According to our estimates in Fig. 3b, just 1 week of delay implies a 7.7% (95% CI: [1.3%, 13.7%]) increase in mortality.
Effective reproduction number
In Fig. 3c we show the evolution of the effective reproduction number Rt estimated using the method from ref. 40 on the simulated and the official reported incidence. In the simulated Rt time series, we observe the two discontinuities after the implementation of government-issued NPIs. However, we note that the partial lockdown had the sole effect of slowing down the epidemic. Indeed, after March 16, the estimated Rt is still >1. After the full lockdown, instead, Rt was pushed below 1 making the containment possible. This is visible both in the simulated and the reported time series. This result underlines the importance of the full lockdown that, despite causing a relatively small effect on mobility, had a decisive role in bringing the outbreak under control. It is worth stressing this result. The full lockdown constituted a key tipping point for the evolution of the epidemic pushing the reproductive number below its critical threshold. A similar finding has been recently reported for the evolution of the pandemic in Germany41. Indeed, also in that context, only the subsequent compounding of interventions was able to bring the reproductive number below one thus curbing spreading of the virus.
Discussion
The analysis presented here shows that the effects of NPIs issued by the government strongly correlate with a measure of human development, such as the HDI. In particular, comunas with higher HDI were able to reduce more significantly their mobility. This, in turn, is reflected in both data and modeling estimates by a lower burden of COVID-19 (i.e., cases, deaths) in the comunas characterized by a higher HDI. The combination of these results raises policy-making concerns. Indeed, while lockdowns are unquestionably effective in mitigating the epidemic activity, they may as well augment social and health inequalities, penalizing more vulnerable communities. Other studies have found that mobility restrictions unequally affected different regions of France15, Italy20, United States19, Colombia, Mexico, and Indonesia21 with a higher income being associated with a larger capacity to afford social distancing. Furthermore, observations in the United States19,22,42,43, Singapore44, and the UK45,46 show that socioeconomic inequalities are linked to worst health outcomes during the current pandemic.
Our data-driven analysis also shows that the timeliness of NPIs is just one variable influencing the outcome of the mitigation effort. The case of Santiago is emblematic. NPIs were introduced early with respect to other countries. Only 2 days after the first 50 confirmed cases. For comparison, Denmark introduced measures after five, Austria after nine, Italy and Germany after 15 days of reaching that threshold47. These measures were followed by a considerable reduction in mobility and contacts, but cases soared anyway in the metropolitan area. According to our analysis, the first set of NPIs significantly slowed down transmissions but not enough to stop the epidemic. It was only after the introduction of the second additional lockdown that the Santiago outbreak was brought under control. This indicates that earlier implementation of more stringent NPIs may be beneficial in quickly mitigating the outbreak without extending for a long time policies that potentially might unequally affect communities. Similar results have been reported in the context of China48, New Zealand49, and the USA8,50.
The present work comes with limitations. First, we focused only on the epidemic evolution within the Santiago Metropolitan Area and we overlooked both national and international importations after March 1, 2020. While it is reasonable to assume that after this date the epidemic was largely sustained by the internal spreading (especially considering the various restrictions on international and national mobility), we acknowledge this as a possible limitation. Second, compared to other approaches4,51,52, we considered a relatively simple disease dynamics. However, as shown in Section 2 of the Supplementary Information, our results are robust respect to the compartmental structure adopted. Third, our mobility measures also have limitations. Indeed, we considered the overall reduction in mobility and contacts. However, not all mobility translates equally in terms of transmission risk. Some types of locations such as for example restaurants and hotels have been linked to higher chances of infection than others22. We acknowledge that our mobile phone users’ sample was not selected to be representative of the whole population. However, Telefónica Movistar data well represent the different socio-demographic groups of Santiago53. Last, mobile phone data streams are dependent on the distribution of antennas. Nonetheless, this issue is, at least partially, solved by the geographical level of aggregation we use here which is that of the comunas. In fact, for all 342 comunas of continental Chile, the Pearson correlation coefficient of census data and “home location” inferred from mobile phone data is 0.97 (more details in Section 5 of the Supplementary Information).
Overall, our study characterizes the unfolding of SARS-CoV-2 in one of the largest metropolitan areas in South America; a region that so far has received far less attention than others. It quantifies the unequal effects across communities of behavioral changes introduced by governmental measures as well as individual (re)actions and provides evidence that even small delays in the implementation of NPIs can have a significant impact on the unfolding of the epidemic.
Methods
Measuring mobility and contacts
In this work, we use phone data in the form of eXtended Detail Records (XDR). This stream records every interaction (e.g., packet request) between devices and antennas. An entry in our dataset can be formalized as a tuple 〈d, t, a〉 indicating a packet request to antenna a made by device d at time t. We approximate the position of d with that of the antenna, which has a fixed latitude and longitude. We also assign a home antenna to each device by finding the most active antenna during night hours. Finally, we assign antennas to correspondent comunas according to their position. The dataset includes data for the period February 27, 2020 to June 1, 2020, for 1.4 million devices which correspond to the 22% of the total population in the area considered. To preserve the privacy of device owners, we analyze and display only anonymous and aggregated results. Furthermore, no other information about the users (i.e., gender, age etc.) was used or available.
As specified in the previous sections, by considering the governmental response and observing variations in the overall mobility, we identify three phases of nonpharmaceutical interventions and characterize them in terms of mobility and contacts reduction. Mobility is measured considering the fraction of devices traveling between comunas. Formally, for each day t we build a mobility rate matrix whose element σij(t) is the fraction of devices living in comuna i that visited j on day t. It is important to mention how we do not have information about points of interest (POI). Hence, the data capture all types of mobility such as commuting for work, grocery runs, and/or recreational activities. We average these daily rates during the three phases to obtain three distinct matrices describing mobility (i) before any restrictions, (ii) during the partial lockdown, and (iii) during the full lockdown.
The data we use do not provide direct information about physical contacts between users. Having the privacy of the users in mind, and considering the various non-trivial assumptions one would need to make, we do not attempt to estimate/infer such contacts. Instead, we focus on a metric that allows us to capture the variation before and after the various interventions. As we describe below, the epidemic model considers a homogeneous mixing approximation within each subpopulation (i.e., comuna) hence the only important variable is an estimate of contacts reduction rather than the actual contacts. To this end, contacts reduction is estimated by looking at the variation in the number of users co-located in the same antenna. Each antenna a in comuna j has a resident population . On day t, the total number of visitors from the same comuna is . Assuming homogeneous mixing, the maximum number of contacts in antenna a is . Then, we assume the reduction of contact during the partial and full lockdown to be equal to:
1 |
In other words, the reduction of contacts during the partial and full lockdown is considered as the variation of the maximum number of contacts before and after each intervention. Finally, we aggregate at the level of comunas taking the median of these quantities over all antennas located in the same comuna.
Modeling the spread of SARS-CoV-2 in Santiago
The model used in this work to simulate the spread of SARS-CoV-2 is largely inspired by the global epidemic and mobility model (GLEAM)54–56. In this section, we present the conceptual framework but a full mathematical description is provided in Section 4 of the Supplementary Information.
The comunas of Santiago are represented as distinct subpopulations forming a metapopulation network. Inside each one, we divide individuals into K = 16 5-year age brackets respecting the demographic of different comunas57 and we use the country-specific contact matrix from ref. 58 to define the rates at which different age groups mix with each other. Individuals are also divided into compartments according to their health status. We consider a SLIR (Susceptible, Latent, Infectious, Removed) compartmentalization setup. A similar approach has been used in several modeling studies in the context of COVID-1914,59,60. Interacting with Infectious, Susceptibles move to the Latent stage in which they are not infectious yet. Only after the latent period, Latent become Infectious. Last, Infectious transit to the Removed compartment at a rate inversely proportional to the infectious period. In Section 2 of the Supplementary Information we repeat the analyses for another compartmentalization that includes pre-symptomatic and asymptomatic transmission. The findings are not impacted by a different compartmental setup. Furthermore, the simpler SLIR scheme leads to a closer reproduction of the observed deaths.
Individuals can get the infection interacting with infected in their home and in other connected metapopulations. To model this aspect, we consider the mobility network previously introduced to describe an effective coupling (i.e., the strength of connection) between comunas. Technically, we use a time-scale separation technique and approximation55,61 to define a “force of infection” that expresses the infection rate for individuals in age group k residing in comuna j:
2 |
τ defines the timescale of mobility and σj = ∑i≠jσji is the total mobility rate of population j. The first term in Eq. (2) represents the contribution from active infections in comuna j, and the sum instead describes the effective contribution from cases in other connected comunas i. As mentioned above, the data we use capture all types of movements across comunas. Hence, the coupling between subpopulations accounts for mobility linked to work (i.e., commuting), grocery, and other activities. In our formulation, we assume that such movements take place over a timescale that is much smaller than the disease timescale and the temporal resolution of the epidemic data. This assumption is supported by the data. In fact, as shown in Supplementary Fig. 12, the average duration of trips outside the home comuna is 4.5 h and 85% of them take place within 8 h. Although users may travel outside of their comuna for more than 8 h, the probability of a trip to last more than 1 day is <3%. The time-scale separation approach allows us to integrate the faster dynamics—that is, the mobility—and consider their contribution to the spreading processes without simulating individual mobility patterns and thus considerably simplifying the computational costs of the model. It is important to notice how such time-separation approximation is exact only in the case τ−1 → 0. However, it holds as long as the faster timescale is shorter than the transition rates of the disease61. As mentioned in details below, for COVID-19 these are of the order of several days. We set the value of τ−1 = 1/3 day to also account for commuting patterns. In Section 4 of the Supplementary Information we provide the full derivation and more details.
Government interventions are implemented by changing the mobility network on March 16 (partial lockdown) and again on May 15 (full lockdown) to reflect the corresponding variations in the coupling between comunas. Similarly, on these dates, we multiply the age contact matrix of each comuna by and , respectively. We do not explicitly account for school closure since its effect is already included in the changes inferred from mobile devices data.
The model is fully stochastic and transitions among compartments are simulated through chain binomial processes62–64. More in detail, on a given time step the number of individuals transiting from compartment to compartment is extracted from a binomial distribution: . In the main text, we present results for a latent period of 4 days and an infectious period of 2.5 days, which imply a generation time TG = 6.5 days, in line with current estimates65,66. In Section 1 of the Supplementary Information we show a sensitivity analysis to these choices which do not substantially impact the findings. We simulate deaths considering the estimates of the infection fatality rate from ref. 34 and a delay Δ after the transition to the Removed compartment. This delay is included to account for the time span between isolation of acute cases (i.e., hospitalization), death, and official reporting, which can be more than 2 weeks67.
Initial seeding is done using the projections of active cases on March 1, 2020, in the Metropolitan area of Santiago from ref. 14 and assigning infections to different comunas proportionally to the population distribution. The calibration is performed on weekly deaths using an Approximate Bayesian Computation (ABC) Rejection method23,24. At each step of the ABC algorithm, a set of parameters θ is sampled from a prior distribution and an instance of the model is generated using these parameters. Then, an output quantity of the model is compared to the corresponding real quantity E using a distance measure . If this distance is greater (smaller) than a predefined tolerance ϵ, then the sampled set of parameters is discarded (retained). After a sufficient number of iterations, the distribution of accepted sets will approximate the posterior distribution of parameters P(θ, E) given the evidence E from the data. In this work, we set a flat uniform prior on the parameters the basic reproduction number R0 ∈ [2, 4] (in steps of 0.02) and on the delay in deaths Δ ∈ [14, 21] days (in steps of 1 day). We perform calibration using the median absolute percentage error as a distance metric with a tolerance of 20% on weekly deaths (in the Supplementary Information we present a sensitivity analysis on this tolerance). We run 140,000 iterations which correspond to about 200 stochastic realizations for each possible parameter set. We use the official data issued by the Department of Statistics of the Chilean Minister of Health68. We consider both COVID-19 “confirmed” and “suspected” deaths to perform the calibration. In Supplementary Fig. 1 we support this decision showing that considering only confirmed COVID-19 deaths we still obtain a significant anomaly in mortality. Nonetheless, we also repeat the calibration only on “confirmed” deaths showing that the posterior distribution of R0 is smaller but not significantly different. Model projections are produced sampling parameter sets directly from the posterior distribution and generating an ensemble of trajectories. In this work, we generate model estimates sampling 5000 sets on which we compute median and confidence intervals.
Measuring socioeconomic differences
We measure socioeconomic differences using the Human Development Index (HDI). The HDI is a coefficient that measures the level of achievement of key aspects of human development (including life expectancy, education, and per capita income) in the population considered18. It is released regularly for different countries by the United Nations in the Human Development Report69. In Chile, the last official calculation of the HDI at the level of comunas was done in 200070. However, to rely on more updated estimates, we computed the HDI using census data for the period 2013–2015, following the guidelines provided in ref. 18. These estimates have been previously used to study the socioeconomic determinants of mobility in Santiago71. The code used for the calculation of HDI can be found in ref. 72. In Supplementary Fig. 4 we repeat the analyses for other socioeconomic indicators. In particular, we consider separately the components of the HDI, namely the Life Expectancy Index (LEI), the Education Index (EI), and the Income Index. The general pattern presented in the main text holds also for the other indicators considered.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Supplementary information
Acknowledgements
All authors thank the High Performance Computing facilities at Greenwich University. L.F. thanks Víctor Navarro and Loreto Bravo. N.G. acknowledges support from the Doctoral Training Alliance. M.T. acknowledges support from the Lagrange Project of the Institute for Scientific Interchange Foundation (ISI Foundation) funded by Fondazione Cassa di Risparmio di Torino (Fondazione CRT). LF acknowledges the funding and support of Telefónica R&D Chile and CISCO Chile. M.T. acknowledges support from EPIPOSE - “Epidemic intelligence to minimize COVID-19’s public health, societal and economical impact” H2020-SC1-PHE-CORONAVIRUS-2020 call. M.C. and A.V. acknowledge support from Google Cloud and Google Cloud Research Credits program, the McGovern Foundation, and the Bill & Melinda Gates Foundation (award number INV006010).
Author contributions
N.G., M.T., L.F., N.P., and A.V. designed the study. N.G. and N.P. analyzed the data. N.G. implemented and run the epidemic model. L.F. mined and provided the anonymised mobile phone data. M.C. provided the importation data to initialize the model. All authors interpreted the results, wrote and approved the manuscript.
Data availability
The raw data analyzed in the study are not publicly available due to privacy reasons. All the aggregated mobility data needed to run the model are available at ref. 73. Analysis of the anonymised mobile phone data was performed on mobile operator’s systems without transferring it outside. Only aggregated mobility patterns across municipalities were provided to researchers outside Chile and only these have been used for the results presented here. The study was deemed exempt (IRB #20-10-05) by the Northeastern University Internal Review Board.
Code availability
The main epidemic model described in the manuscript has been implemented with the programming language C++. The alternative models, detailed in Section 2 of the Supplementary Information, have been implemented in C++ or in Python. All codes are available at ref. 73, on GitHub https://github.com/ngozzi/covid19-santiago and on Zenodo https://zenodo.org/record/4607303.
Competing interests
A.V. and M.C. report grants from Metabiota Inc., outside the submitted work; M.T. reports personal fees from GSK, outside the submitted work. All other authors declare no competing interests.
Footnotes
Peer review information Nature Communications thanks Lin Wang, Jizhe Xia and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
The online version contains supplementary material available at 10.1038/s41467-021-22601-6.
References
- 1.Covid-19 in Chile. https://www.gob.cl/coronavirus/cifrasoficiales/ (2020).
- 2.Canals, M. et al. Epidemic trends, public health response and health system capacity: the Chilean experience in four months of the COVID-19 pandemic. Rev. Panam Salud Publica. 44, e99 (2020). [DOI] [PMC free article] [PubMed]
- 3.Chilean government interventions against COVID-19. https://www.gob.cl/coronavirus/plandeaccion/ (2020).
- 4.Aleta A, et al. Modelling the impact of testing, contact tracing and household quarantine on second waves of COVID-19. Nat. Hum. Behav. 2020;4:964–971. doi: 10.1038/s41562-020-0931-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Shi, Q. et al. Effective control of SARS-CoV-2 transmission in Wanzhou, China. Nat. Med.27, 86–93 (2021). [DOI] [PubMed]
- 6.Kissler, S. et al. Reductions in commuting mobility correlate with geographic differences in SARS-CoV-2 prevalence in New York City. Nat. Commun.11, 4674 (2020). [DOI] [PMC free article] [PubMed]
- 7.Goscé L, Phillips PA, Spinola P, Gupta DRK, Abubakar PI. Modelling SARS-CoV2 spread in London: approaches to lift the lockdown. J. Inf. 2020;81:260 – 265. doi: 10.1016/j.jinf.2020.05.037. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.White ER, Hébert-Dufresne L. State-level variation of initial COVID-19 dynamics in the united states. PLoS ONE. 2020;15:e0240648. doi: 10.1371/journal.pone.0240648. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Haug N, et al. Ranking the effectiveness of worldwide COVID-19 government interventions. Nat. Hum. Behav. 2020;4:1303–1312. doi: 10.1038/s41562-020-01009-0. [DOI] [PubMed] [Google Scholar]
- 10.Cheng C, Barceló J, Hartnett AS, Kubinec R, Messerschmidt L. COVID-19 government response event dataset (CoronaNet v.1.0) Nat. Hum. Behav. 2020;4:756–768. doi: 10.1038/s41562-020-0909-7. [DOI] [PubMed] [Google Scholar]
- 11.Oliver, N. et al. Mobile phone data for informing public health actions across the COVID-19 pandemic life cycle. Sci. Adv.6, eabc0764 (2020). [DOI] [PMC free article] [PubMed]
- 12.Zhou Y, et al. Effects of human mobility restrictions on the spread of COVID-19 in Shenzhen, China: a modelling study using mobile phone data. Lancet Digital Health. 2020;2:e417–e424. doi: 10.1016/S2589-7500(20)30165-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Badr, H. S. et al. Association between mobility patterns and COVID-19 transmission in the USA: a mathematical modelling study. Lancet Infect. Dis.20, 1247–1254 (2020). [DOI] [PMC free article] [PubMed]
- 14.Chinazzi M, et al. The effect of travel restrictions on the spread of the 2019 novel coronavirus (COVID-19) outbreak. Science. 2020;368:395–400. doi: 10.1126/science.aba9757. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Pullano, G., Valdano, E., Scarpa, N., Rubrichi, S. & Colizza, V. Population mobility reductions during COVID-19 epidemic in France under lockdown. Lancet Digital Health10.1101/2020.05.29.20097097 (2020). [DOI] [PMC free article] [PubMed]
- 16.Bajardi P, et al. Human mobility networks, travel restrictions, and the global spread of 2009 H1N1 pandemic. PLoS ONE. 2011;6:e16591. doi: 10.1371/journal.pone.0016591. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.OECD. Income inequality (indicator). https://data.oecd.org/inequality/income-inequality.htm (2021).
- 18.Klugman, J. Human Development Report 2010—20th anniversary edition. The Real Wealth of Nations: Pathways to Human Development (2010).
- 19.Weill JA, Stigler M, Deschenes O, Springborn MR. Social distancing responses to COVID-19 emergency declarations strongly differentiated by income. Proc. Natl Acad. Sci. USA. 2020;117:19658–19660. doi: 10.1073/pnas.2009412117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Bonaccorsi G, et al. Economic and social consequences of human mobility restrictions under COVID-19. Proc. Natl Acad Sci. USA. 2020;117:15530–15535. doi: 10.1073/pnas.2007658117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Fraiberger, S. P. et al. Uncovering socioeconomic gaps in mobility reduction during the COVID-19 pandemic using location data. Preprint at https://arxiv.org/abs/2006.15195 (2020).
- 22.Chang, S. et al. Mobility network models of COVID-19 explain inequities and inform reopening. Nature589, 82–87 (2021). [DOI] [PubMed]
- 23.Sunnåker M, et al. Approximate bayesian computation. PLoS Comput. Biol. 2013;9:1–10. doi: 10.1371/journal.pcbi.1002803. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Minter A, Retkute R. Approximate Bayesian Computation for infectious disease modelling. Epidemics. 2019;29:100368. doi: 10.1016/j.epidem.2019.100368. [DOI] [PubMed] [Google Scholar]
- 25.Zhao S, et al. Preliminary estimation of the basic reproduction number of novel coronavirus (2019-nCoV) in China, from 2019 to 2020: a data-driven analysis in the early phase of the outbreak. Int. J. Infect. Dis. 2020;92:214 – 217. doi: 10.1016/j.ijid.2020.01.050. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Liu Y, Gayle AA, Wilder-Smith A, Rocklöv J. The reproductive number of COVID-19 is higher compared to SARS coronavirus. J. Travel Med. 2020;27:Taaa021. doi: 10.1093/jtm/taaa021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Giordano G, et al. Modelling the COVID-19 epidemic and implementation of population-wide interventions in Italy. Nat. Med. 2020;26:1–6. doi: 10.1038/s41591-020-0883-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Mena, G. et al. Socioeconomic status determines COVID-19 incidence and related mortality in Santiago, Chile. medRxiv10.1101/2021.01.12.21249682 (2021). [DOI] [PMC free article] [PubMed]
- 29.Havers, F. P. et al. Seroprevalence of antibodies to SARS-CoV-2 in 10 sites in the United States, March 23–May 12, 2020. JAMA Int. Med.180, 1576–1586 (2020). [DOI] [PubMed]
- 30.Pollán, M. et al. Prevalence of SARS-CoV-2 in Spain (ENE-COVID): a nationwide, population-based seroepidemiological study. Lancet396, 535–544 (2020). [DOI] [PMC free article] [PubMed]
- 31.ISTAT, Primi risultati dell’indagine di sieroprevalenza sul SARS-CoV-2. https://www.istat.it/it/archivio/246156 (2020).
- 32.Buss LF, et al. Three-quarters attack rate of SARS-CoV-2 in the Brazilian amazon during a largely unmitigated epidemic. Science. 2021;371:288–292. doi: 10.1126/science.abe9728. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Poustchi, H. et al. SARS-CoV-2 antibody seroprevalence in the general population and high-risk occupational groups across 18 cities in Iran: a population-based cross-sectional study. Lancet Infect. Dis.21, 473–481 (2021). [DOI] [PMC free article] [PubMed]
- 34.Verity, R. et al. Estimates of the severity of coronavirus disease 2019: a model-based analysis. Lancet Infect. Dis.20, 669–677 (2020). [DOI] [PMC free article] [PubMed]
- 35.Rader B, et al. Crowding and the shape of COVID-19 epidemics. Nat. Med. 2020;26:1829–1834. doi: 10.1038/s41591-020-1104-0. [DOI] [PubMed] [Google Scholar]
- 36.Walker PG, et al. The impact of COVID-19 and strategies for mitigation and suppression in low-and middle-income countries. Science. 2020;369:413–422. doi: 10.1126/science.abc0035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Google llc. Google COVID-19 community mobility reports. https://www.google.com/covid19/mobility/ (2020).
- 38.Hale, T. et al. A global panel database of pandemic policies (Oxford COVID-19 Government Response Tracker). Nat. Hum. Behav. 1–10, 10.1038/s41562-021-01079-8 (2021). [DOI] [PubMed]
- 39.Oxford COVID-19 Government Response Tracker. https://www.bsg.ox.ac.uk/research/research-projects/coronavirus-government-response-tracker (2020).
- 40.Cori A, Ferguson NM, Fraser C, Cauchemez S. A new framework and software to estimate time-varying reproduction numbers during epidemics. Am. J. Epidemiol. 2013;178:1505–1512. doi: 10.1093/aje/kwt133. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Dehning, J. et al. Inferring change points in the spread of COVID-19 reveals the effectiveness of interventions. Science369, eabb9789 (2020). [DOI] [PMC free article] [PubMed]
- 42.Blundell R, Costa Dias M, Joyce R, Xu X. Covid-19 and inequalities. Fis. Stud. 2020;41:291–319. doi: 10.1111/1475-5890.12232. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Muñoz-Price LS, et al. Racial disparities in incidence and outcomes among patients with COVID-19. JAMA Netw. Open. 2020;3:e2021892–e2021892. doi: 10.1001/jamanetworkopen.2020.21892. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Yi, H. et al. Health equity considerations in COVID-19: geospatial network analysis of the COVID-19 outbreak in the migrant population in Singapore. J. Travel Med.28, taaa159 (2021). [DOI] [PMC free article] [PubMed]
- 45.Mathur, R. et al. Ethnic differences in COVID-19 infection, hospitalisation, and mortality: an OpenSAFELY analysis of 17 million adults in England. medRxiv10.1101/2020.09.22.20198754 (2020).
- 46.Topriceanu, C.-C. et al. Inequality in access to health and care services during lockdown—findings from the COVID-19 survey in five UK national longitudinal studies. medRxiv10.1101/2020.09.12.20191973 (2020). [DOI] [PMC free article] [PubMed]
- 47.Li C, Romagnani P, von Brunn A, Anders H-J. SARS-CoV-2 and Europe: timing of containment measures for outbreak control. Infection. 2020;48:483. doi: 10.1007/s15010-020-01420-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Tian H, et al. An investigation of transmission control measures during the first 50 days of the COVID-19 epidemic in China. Science. 2020;368:638–642. doi: 10.1126/science.abb6105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Jefferies S, et al. Covid-19 in New Zealand and the impact of the national response: a descriptive epidemiological study. Lancet Public Health. 2020;5:e612–e623. doi: 10.1016/S2468-2667(20)30225-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Auger KA, et al. Association between statewide school closure and COVID-19 incidence and mortality in the US. JAMA. 2020;324:859. doi: 10.1001/jama.2020.14348. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Di Domenico L, Pullano G, Sabbatini CE, Boëlle P-Y, Colizza V. Impact of lockdown on COVID-19 epidemic in Île-de-France and possible exit strategies. BMC Med. 2020;18:240. doi: 10.1186/s12916-020-01698-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Gatto M, et al. Spread and dynamics of the COVID-19 epidemic in Italy: effects of emergency containment measures. Proc. Natl Acad. Sci. USA. 2020;117:10484–10491. doi: 10.1073/pnas.2004978117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Gauvin L, et al. Gender gaps in urban mobility. Humanit. Soc. Sci. Commun. 2020;7:1–13. doi: 10.1057/s41599-020-0500-x. [DOI] [Google Scholar]
- 54.Balcan D, et al. Modeling the spatial spread of infectious diseases: The global epidemic and mobility computational model. J. Comput. Sci. 2010;1:132–145. doi: 10.1016/j.jocs.2010.07.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Balcan D, et al. Seasonal transmission potential and activity peaks of the new influenza A(H1N1): a Monte Carlo likelihood analysis based on human mobility. BMC Med. 2009;7:45. doi: 10.1186/1741-7015-7-45. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.y Piontti, A. P., Perra, N., Rossi, L., Samay, N. & Vespignani, A. Charting the next Pandemic: Modeling Infectious Disease Spreading in the Data Science Age (Springer, 2018).
- 57.Chile 2017 census. https://www.censo2017.cl (2020).
- 58.Prem K, Cook AR, Jit M. Projecting social contact matrices in 152 countries using contact surveys and demographic data. PLoS Comput. Biol. 2017;13:1–21. doi: 10.1371/journal.pcbi.1005697. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Lai, S. et al. Effect of non-pharmaceutical interventions to contain COVID-19 in China. Nature585, 410–413 (2020). [DOI] [PMC free article] [PubMed]
- 60.Wu, J., Leung, K. & Leung, G. Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan, China: a modelling study. Lancet395, 689–697 (2020). [DOI] [PMC free article] [PubMed]
- 61.Keeling MJ, Rohani P. Estimating spatial coupling in epidemiological systems: a mechanistic approach. Ecol. Lett. 2002;5:20–29. doi: 10.1046/j.1461-0248.2002.00268.x. [DOI] [Google Scholar]
- 62.M. Longini Jr, I. Chain Binomial Model (American Cancer Society, 2014).
- 63.Scalia Tomba G. Emilia Vynnycky and Richard G White. An introduction to infectious disease modelling. Eur. J. Public Health. 2010;22:295–295. doi: 10.1093/eurpub/ckq163. [DOI] [Google Scholar]
- 64.Becker, N. A general chain binomial model for infectious diseases. Biometrics37, 251–258 (1981). [PubMed]
- 65.Backer, J. A., Klinkenberg, D. & Wallinga, J. Incubation period of 2019 novel coronavirus (2019-nCoV) infections among travellers from Wuhan, China, 20–28 January 2020. Eurosurveillance25, 2000062 (2020). [DOI] [PMC free article] [PubMed]
- 66.Kissler SM, Tedijanto C, Goldstein E, Grad YH, Lipsitch M. Projecting the transmission dynamics of SARS-CoV-2 through the postpandemic period. Science. 2020;368:860–868. doi: 10.1126/science.abb5793. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Centers for Disease Control and Prevention, COVID-19 Pandemic Planning Scenarios. https://www.cdc.gov/coronavirus/2019-ncov/hcp/planning-scenarios.html#table-1 (2021).
- 68.Departamento de Estadisticas e Información de Salud. https://deis.minsal.cl/#datosabiertos (2020).
- 69.Global Human Development Indicators, United Nations Development Programme. http://hdr.undp.org/en/countries (2020).
- 70.Human development index estimates for Chile. http://www.desarrollosocialyfamilia.gob.cl/btca/txtcompleto/mideplan/des_humano_comunas2000.pdf (2000).
- 71.Beiró, M. et al. Shopping mall attraction and social mixing at a city scale. EPJ Data Sci.7, 28 (2018).
- 72.Code for human development index calculation at the comunas level for Chile. https://zenodo.org/record/1305169#X7fExC2ZPq1 (2020).
- 73.Gozzi, N. et al. Code for the paper “Estimating the effect of social inequalities in the mitigation of COVID-19 across communities in Santiago de Chile”. 10.5281/zenodo.4607303 (2021). [DOI] [PMC free article] [PubMed]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The raw data analyzed in the study are not publicly available due to privacy reasons. All the aggregated mobility data needed to run the model are available at ref. 73. Analysis of the anonymised mobile phone data was performed on mobile operator’s systems without transferring it outside. Only aggregated mobility patterns across municipalities were provided to researchers outside Chile and only these have been used for the results presented here. The study was deemed exempt (IRB #20-10-05) by the Northeastern University Internal Review Board.
The main epidemic model described in the manuscript has been implemented with the programming language C++. The alternative models, detailed in Section 2 of the Supplementary Information, have been implemented in C++ or in Python. All codes are available at ref. 73, on GitHub https://github.com/ngozzi/covid19-santiago and on Zenodo https://zenodo.org/record/4607303.