Abstract
Human mobility is strongly associated with the spread of SARS-CoV-2 via air travel on an international scale and with population mixing and the number of people moving between locations on a local scale. However, these conclusions are drawn mostly from observations in the context of the global north where international and domestic connectivity is heavily influenced by the air travel network; scenarios where land-based mobility can also dominate viral spread remain understudied. Furthermore, research on the effects of nonpharmaceutical interventions (NPIs) has mostly focused on national- or regional-scale implementations, leaving gaps in our understanding of the potential benefits of implementing NPIs at higher granularity. Here, we use Chile as a model to explore the role of human mobility on disease spread within the global south; the country implemented a systematic genomic surveillance program and NPIs at a very high spatial granularity. We combine viral genomic data, anonymized human mobility data from mobile phones and official records of international travelers entering the country to characterize the routes of importation of different variants, the relative contributions of airport and land border importations, and the real-time impact of the country's mobility network on the diffusion of SARS-CoV-2. The introduction of variants which are dominant in neighboring countries (and not detected through airport genomic surveillance) is predicted by land border crossings and not by air travelers, and the strength of connectivity between comunas (Chile's lowest administrative divisions) predicts the time of arrival of imported lineages to new locations. A higher stringency of local NPIs was also associated with fewer domestic viral importations. Our analysis sheds light on the drivers of emerging respiratory infectious disease spread outside of air travel and on the consequences of disrupting regular movement patterns at lower spatial scales.
Keywords: genomic epidemiology, viral importations, spatial invasion
Significance Statement.
Global preparedness for pandemic threats requires an understanding of the global variations of spatiotemporal transmission dynamics. Regional differences are important because the local context sets the conditions for the unfolding of local epidemics, which in turn affect transmission dynamics at a broader scale. Knowledge gaps from the SARS-CoV-2 pandemic remain for regions like South America, where distinct sets of viral variants emerged and spread from late 2020 onwards, and where changes in human behavior resulted in epidemics which differed from those observed in other regions. Our interdisciplinary analysis of the SARS-CoV-2 epidemic in Chile provides insights into the spatiotemporal trends of viral diffusion in the region which shed light on the drivers that can influence future epidemic waves and pandemics.
Introduction
The epidemic dynamics of SARS-CoV-2 are context-dependent. Intensity of transmission has been shown to vary greatly due to a combination of factors related to population mixing, international travel, socioeconomic indicators, and SARS-CoV-2 lineage composition (1–3). The emergence of novel variants of interest (VOIs) and variants of concern (VOCs) was typically associated with epidemic waves in the country of their first report and later caused epidemics either regionally (Gamma and Beta in Brazil and South Africa, respectively) or globally (Alpha, Delta, and Omicron in England, India, and South Africa, respectively; (1, 4–7)).
During the emergence of Gamma, Beta, and Alpha in late 2020, the likelihood of a variant being detected in a country could be predicted by the number of passengers arriving from the countries reporting initial outbreaks (8). During that time, international and local travel was still restricted compared with prepandemic levels (9). Studies have shown how increased rates of local mixing impacted SARS-CoV-2 growth rates and total numbers of cases (10, 11) and suggest that different types of human mobility contributed to epidemic spread (12, 13).
Despite these general insights, the transmission dynamics of multiple variants in Latin America remain understudied and it is still unknown why some variants spread in this region and others did not. Multiple factors likely contributed to SARS-CoV-2 lineage composition (14), including high transmission rates during the times when some variants emerged: the Gamma variant in the Brazilian Amazon (5), the Lambda variant in Peru and Chile (15), and the Mu variant in Colombia (16, 17). Various other lineages not designated as VOIs or VOCs in regions like Mexico (18) and the United States (e.g. Iota and Epsilon (19–21) were also detected in the continent but did not reach high frequencies).
Here, we analyze 787 SARS-CoV-2 genomes collected via dedicated airport genomic surveillance at Santiago de Chile Airport (SCL) and 7,957 genomes generated through community genomic surveillance (22, 23). These data are combined with human mobility data from mobile phones (covering 24.3% (24) of Chile's mobile phone subscribers) and records of international arrivals into the country (both airport arrivals and total land border crossings during 2021) to explore how international arrivals, airport testing, and local interventions (implemented at the lowest administrative divisions in the country, comunas) impacted the transmission dynamics of SARS-CoV-2 lineages in Chile during 2020–2021. We also assess how Chile's mobility network predicts viral spread following new importations. While similar analyses have been performed for other locations and contexts, Chile provides a distinct scenario to study these phenomena. Its genomic surveillance program at the largest international airport in the country (that received the vast majority of international travelers which were all tested upon arrival during 2021), combined with its network of land international ports of entry across its large border with Argentina (and smaller borders with Bolivia and Peru), results in an ideal scenario to evaluate the routes by which individual variants were imported into the country. Also, the highly localized application of nonpharmaceutical interventions (NPIs) during the pandemic (25–28) was a unique phenomenon which has not been evaluated in its efficacy in limiting the geographic spread of SARS-CoV-2.
The SARS-CoV-2 landscape in Chile and South America in 2021
Different SARS-CoV-2 variants emerged in South America during late 2020 and early 2021 (29). The earliest VOC described in South America was the Gamma variant, which emerged in Manaus, Brazil (5), and quickly spread to the southern half of the continent, into Argentina, Chile, Uruguay, Bolivia, and Paraguay (Fig. 1A). Simultaneously, the Lambda variant emerged and rapidly increased in frequency in Peru and Chile, while the Mu variant appeared later, around April 2021, in Colombia, Ecuador, and Venezuela (Fig. 1A). The epidemiological trends across different countries during this time suggest that these emerging variants drove national epidemic peaks, as increases in the number of cases followed increases in the prevalence of each variant between February and May 2021 across the region. Further epidemic peaks followed in June and July, which might have been driven by viral lineage replacement caused by regional movements of various South American variants or by the importation of the Delta variant, which swept across the world from spring 2021 onwards (Fig. 1A). Variants that became dominant in some other regions of the world, like Alpha and Beta, did not dominate in South America (Fig. S1).
These variant dynamics across other countries and their connectivity to Chile likely played a role on the country's SARS-CoV-2 epidemic trends (Fig. 1B–D). Data on monthly land border crossings and international airport arrivals show that both Peru and Argentina were important contributors to the total incoming travelers to Chile during 2021, albeit with different trends; while Peru accounted for the majority of air travelers arriving every month via SCL (up to 22.7% of all air passengers per month, Fig. 1B), more total travelers arrived from Argentina through land travel (14 land border crossings with Chile were operationally active during 2021, which recorded between 67 and 100% of all incoming travelers from Argentina into the country each month; Fig. 1B). We note that travelers from Argentina increased considerably from May 2021 onwards possibly reducing the impact of air travel restrictions in the flow of people from that country during that time (specifically May, June, and July 2021; Fig. 1B).
During the study period between November 2020 and October 2021, Chile experienced three epidemic waves, peaking in February, April, and July, respectively. Each wave was followed by periods during which individual comunas enacted lockdowns of different stringency (i.e. the degree to which people could move freely; Supporting Text) and were implemented as a response to the changing epidemic trends. This resulted in a fluctuation of the number of comunas and total people under restrictions over time (Fig. 1C).
International importations of viral lineages depend on country-level prevalence and human mobility
Chile implemented separate genomic surveillance programs for incoming international travelers at SCL and within the community. The proportions of different variants in both datasets show that for Alpha, Lambda, Mu, and Delta, airport detections preceded the detection of these variants in the community by 6, 4, 5, and 2 weeks, respectively. Gamma, however, was rarely observed using data from airport surveillance yet consistently exceeded 25% of sequences from cases sampled in the community from 2021 April 25 until the end of August, when Delta started to replace all other lineages (Fig. 1D).
From these observations, we hypothesize that variants like Gamma were imported to Chile through different pathways other than international air travel. We performed phylogenetic analysis of virus genomic data from all five variants to map the number and timing of importations in Chile between late 2020 and throughout 2021 and found frequent and sustained importations across the study period. Alpha and Lambda importations peaked simultaneously around April, while Gamma saw sustained importations between May and August 2021. Mu had a peak in importations around July, followed by large numbers of Delta importations between August and October (Fig. 2A). Phylodynamic analysis of each variant reveals an estimated ∼648 independent introductions (excluding introductions of all other non-VOI/VOC viral lineages). For Delta, we detect the highest number of importations (median 336 importations; 95% HPD: 327–344), considerably higher than for Lambda (median 111 importations; 95% HPD: 95–124), Mu (median 86 importations; 95% HPD: 79–92), Gamma (median 74 importations; 95% HPD: 64–83), and Alpha (median 41 importations; 95% HPD: 35–43; Fig. 2B). These estimates are likely affected by sampling intensity heterogeneities across countries and by the limited proportion of sampled cases in Chile, which also exhibit local sampling variation across space and time. We note a general trend whereby an increasing proportion of cases were sequenced coincided with declining numbers of cases (Fig. S2). We accounted for these potential biases by using a mobility-informed subsampling approach that considers human movements from international destinations into Chile (see Materials and methods and (18)).
An estimated 43% of introductions led to detectable onward transmission in Chile, hereafter called “transmission lineages” (TLs). Across variants, only for Gamma and Lambda that more than 50% of viral introductions resulted in onward local transmission (Fig. S3). As with previous studies, the size distribution of these TLs is heavily skewed, with a small portion of importations leading to large TLs, while the vast majority resulted in limited detectable onward transmission (Fig. S4); this size distribution might be affected by the study period cutoff date, as the persistence of TLs over time is right-censored (Fig. S5). The median number of sequences in a TL tends to be higher as the numbers of cases during the time of the importation was lower, although this pattern is not observable for Mu and for Gamma (Fig. S3).
Because phylogenetic inferences are influenced by sampling biases and model specification, we use independent datasets to correlate estimates made from genomic data. Previous studies have shown that the number of international travelers coming into a territory can be used to help infer the expected number of SARS-CoV-2 case introductions (8, 30). We calculated an estimated importation intensity index (EII) based on the estimated cases for each variant in a potential source country and the total movements from that country into Chile. We used a Granger causality (GC) test to ask whether EIIs can “forecast” the inferred importations of each variant over time (see Materials and methods). We found that the weekly number of viral importations inferred from genomic data followed the EII for each variant (Table 1). Interestingly, Gamma introductions are best predicted by an EII that only includes land-based border crossings (lEII), while the remaining four variants are predicted by an EII based on air-based human movements (aEII) rather than land border crossings (Table 1). These results may explain why Gamma was rarely detected during airport genomic surveillance.
Table 1.
VOC | Source location | aEII | lEII | ||||
---|---|---|---|---|---|---|---|
Lag (model support) | F statistic | P value | Lag (model support) | F statistic | P value | ||
Alpha | Florida (USA) | 12 (AIC = −6.74) | 10.65 | 0.001 | — | — | — |
France | 3 (SC = −2.65) | 7.47 | 0.0005 | — | — | — | |
Spain | 12 (AIC = −6.83) | 1.77 | 0.21 | — | — | — | |
Argentina | 12 (AIC = −9.89) | 7.42 | 0.001 | 1 (SC = −2.24) | 0.04 | 0.84 | |
Bolivia a | — | — | — | — | — | — | |
Peru | 2 (SC = −5.84) | 8.95 | 0.000594 | 1 (SC = −8.53) | 14.83 | 0.0004 | |
Combined b | 12 (AIC = 3.42) | 20.98 | 0.0001 | 1 (SC = −2.24) | 0.07 | 0.79 | |
Gamma | Rio de Janeiro (BRA) | 6 (SC = −4.29) | 0.39 | 0.88 | — | — | — |
Santa Catarina (BRA) | 11 (SC = −14.99 | 1.07 | 0.46 | — | — | — | |
Sao Paulo (BRA) | 1 (SC = 0.47) | 0.03 | 0.86 | — | — | — | |
Colombia | 12 (SC = −3.31) | 1.00 | 0.52 | — | — | — | |
Argentina | 1 (SC = −3.62) | 3.50 | 0.07 | 1 (SC = 1.63) | 11.14 | 0.002 | |
Bolivia | 1 (SC = −6.05) | 0.44 | 0.51 | 1 (SC = 0.52) | 3.61 | 0.06 | |
Peru | 1 (SC = −1.19) | 0.17 | 0.68 | 1 (SC = −3.91) | 5.75 | 0.02 | |
Combined b | 1 (SC = 8.36) | 0.03 | 0.86 | 1 (SC = 2.29) | 9.80 | 0.003 | |
Lambda | Argentina | 1 (SC = −4.36) | 1.98 | 0.17 | 1 (SC = 1.39) | 0.009 | 0.92 |
Bolivia | 1 (SC = −11.22) | 0.53 | 0.47 | 1 (SC = −3.02) | 0.04 | 0.85 | |
Peru | 10 (SC = −1.95) | 3.84 | 0.01 | 1 (SC = −2.84) | 53.21 | 4.24E-09 | |
Combined b | 10 (SC = 5.21) | 3.68 | 0.01 | 1 (SC = 1.42) | 0.64 | 0.43 | |
Mu | Colombia | 1 (SC = 0.52) | 7.04 | 0.01 | — | — | — |
Argentina | 1 (SC = −1.11) | 0.17 | 0.68 | 1 (SC = −4.77) | 1.19 | 0.28 | |
Bolivia | 2 (SC = −7.25) | 1.00 | 0.38 | 2 (SC = −0.64) | 1.57 | 0.22 | |
Peru | 12 (SC = −7.80) | 5.08 | 0.006 | 10 (SC = −9.39) | 3.27 | 0.02 | |
Combined b | 1 (SC = 7.54) | 7.50 | 0.009 | 2 (SC = −0.61) | 1.82 | 0.18 | |
Delta | Sao Paulo (BRA) | 12 (SC = −16.32) | 1459.4 | 5.47E-12 | — | — | — |
Florida (USA) | 12 (SC = −8.71) | 321.79 | 2.28E-09 | — | — | — | |
Georgia (USA) | 12 (SC = −15.77) | 873.46 | 4.25E-11 | — | — | — | |
New York (USA) | 12 (SC = −54.78) | 9.93E + 15 | 2.20E-16 | — | — | — | |
Texas (USA) | 12 (SC = −16.01) | 3553.6 | 1.56E-13 | — | — | — | |
France | 12 (SC = −89.96) | 122.5 | 1.06E-07 | — | — | — | |
Netherlands | 12 (SC = −8.89) | 449.03 | 6.05E-10 | — | — | — | |
Spain | 12 (SC = −7.68) | 188.35 | 1.92E-08 | — | — | — | |
Argentina | 12 (SC = −6.04) | 2.32 | 1.20E-01 | 12 (SC = −3.71) | 102.46 | 1.71E-09 | |
Bolivia | 9 (SC = −6.71) | 3.37E+ 26 | 2.20E-16 | 9 (SC = −64.23) | 3.44E+ 27 | 2.20E-16 | |
Peru | 12 (SC = −71.38) | NAc | NAc | 12 (SC = −73.92) | NAc | NAc | |
Combined b | 12 (SC = 4.35) | 8.43 | 0.003 | 12 (SC = −4.05) | 121.25 | 6.87E-10 |
Countries with italized names share a land border with Chile. Nonneighboring countries included based on at least one epidemiological week where EII > 1. Cells in orange show significant results at α = 0.01.
aNo genomic surveillance data for this line.
b aEII estimated from all countries with direct flights into Chile (including those minor contributors not shown in the table) and lEII estimates from all neighboring countries.
cUnavailable test result due to multicolinearity.
Airport testing and genomic surveillance were implemented as a public health measure and were combined with requirements for returning travelers of self-isolation following a positive test result (Supporting Text). Therefore, this surveillance scheme aimed to minimize and ideally contain transmission from infected incoming travelers. Its effectiveness depends on the sensitivity and accuracy of the testing itself and the proportion of viral introductions through the airport rather than by other routes. We consequently hypothesize that cases identified during airport surveillance would result in more limited transmission in the community. To explore this hypothesis, we used a discrete phylogeographic approach to estimate the number of viral movements between countries other than Chile and the SCL International Airport, the movements from SCL airport into the community within Chile, and importations from other countries directly into community circulation in Chile (i.e. not detected or mediated via airport surveillance). The importation dynamics appear specific for individual variants: Gamma and Lambda importations were predominantly inferred directly from international destinations into the community with no evidence of airport mediation, while Alpha and Mu show an equivalent number of importations into the SCL International Airport as they did into the community. Delta shows a higher proportion of importations through the airport compared with directly into the community. Overall, negligible numbers of transitions from the airport into the community were inferred (Fig. 2D). We note, however, that importations with no evidence of airport mediation could still have been imported through the airport but went undetected due to the timing of testing (no detectable virus at arrival) or that positive cases were not sequenced (although ∼95% of positive cases from airport surveillance during 2021 were sequenced, those excluded were those where sample quality prohibited genome sequencing and likely occurred at random across the collection of samples).
An independent estimation of both lEII and aEII for individual source countries per variant shows the likely contributors of viral importations over time (Fig. 2C), with Gamma and Mu introductions likely derived from a single source country (Argentina and Colombia, respectively), while Alpha and Delta likely being imported from multiple countries (or states within the United States and Brazil) simultaneously. Lambda exhibits a distinct pattern, in that its predominant source changed from Peru (via air travel) during early 2021, to Argentina (via land border crossings) after April 2021 (Fig. 2C), although the expected importation wave from Argentina was not observed using genomic data (Fig. 2A, Table 1). In the case of Gamma, only Argentina and Peru lEIIs show a significant correlation with inferred viral introductions, further suggesting that land mobility played a bigger role in the seeding of this variant in Chile compared with air travel. This is corroborated by the increased number of Gamma viral imports directly into the community compared with the airport-mediated importations (Fig. 2D). Given that two consecutive epidemic peaks in Chile were likely driven by Lambda (which peaked around 2021 April 11) and Gamma (which peaked around 2021 June 6) (Fig. S6), it is possible that the earliest Lambda peak was partially driven by airport-mediated importations from Peru and the following Gamma peak (also representing a shift in the dominant variant circulating in Chile) was driven by land importations from Argentina. Interestingly, both the aEII and lEII for Peru significantly correlate with the observed importations for all variants, suggesting that both land and air routes of entry played an important role in how seeding events from this country into Chile took place (even though there is a single land border crossing point between Peru and Chile; Table 1). Again, the relatively high number of introductions that show no evidence of being airport-mediated (Fig. 2D) for Alpha, Lambda, and Mu suggests that land importations were also commonplace during this time. All global EII estimates (combining both lEII and aEII, Fig. S7) and the phylogenetically inferred viral imports are temporally associated (FAlpha = 15.00, PAlpha < 0.001; FGamma = 7.19, PGamma = 0.01; FLambda = 12.97, PLambda < 0.001; FMu = 5.46, PMu = 0.02; FDelta = 32.28, PDelta < 0.001).
Human mobility drives SARS-CoV-2 spatial invasion across comunas
Given the limited ports of entry for different variants into Chile, the arrival of individual lineages to new comunas after their introduction is expected to be driven by infected people moving within the country and seeding new local epidemics. We estimate the impact of local scale human mobility on the invasion dynamics of different TLs by comparing comuna-to-comuna arrival times extracted from a continuous phylogeographic approach (31) (see Materials and methods). The spread of these TLs took place during periods of changing mobility in the country as lockdowns were enacted as part of the Paso a paso plan, which established a tiered system for the implementation of mobility restrictions to mitigate viral transmission (the plan employed various stringency tiers that can be summarized as (i) a high stringency full lockdown with mobility restrictions every day of the week, (ii) a mid-stringency weekend lockdown with mobility restrictions only during weekends, and (iii) a low stringency lockdown with no mobility restrictions; Supporting Text). This plan gave individual comunas the authority to determine the appropriate stringency for these NPIs and the appropriate times to change stringency. Consequently, real-time mobility data (estimated from mobile phone data) are important in accurately estimating the contribution of human mobility to viral spread. Between January and October, there were at least three periods when increasing numbers of comunas were placed under lockdown (Fig. 1C); an accompanying reduction in mobility followed, reaching its lowest point around the April lockdown season, when up to 80% of comunas were under lockdown (Fig. 3A). The bounce-back following this downward trend also coincided with a policy implemented on May 26, when individuals with a complete immunization schedule were issued a “mobility pass” which allowed them free movement, even within comunas under lockdown (Fig. 3A).
We identified three distinct periods of human mobility within which the largest observed TLs were introduced into Chile: (i) epoch 1, between January and March when large Alpha, Lambda, and Gamma TLs were imported, (ii) epoch 2, between May and July, when major Mu TLs were imported, and (iii) epoch 3, between July and September, when major Delta TLs were imported (Fig. S8). The arrival times of TLs to new comunas correlated with the mobility flow between the origin and destination comunas in the epoch during which the TLs were imported; this pattern is consistent for all epochs and for all variants (Fig. 3B). The strength of the correlation is generally greater for variants introduced during epoch 1, which predates the issuing of the mobility passes.
TLs spread rapidly across the country as a result of these seeding events, predominantly in central Chile and in large urban centers either to the south (such as Temuco and Concepción) or to the north (such as Antofagasta; Fig. 4A). Following importation into the country, each TL spread via short- and long-range domestic seeding events into new comunas which occurred early in the invasion timeline of a TL. Some of the observed epidemic synchronicity between comunas (Fig. S9) is likely the result of these early seeding events taking place to locations which are both close and distant from the origin of a TL. While the timing of domestic seeding contributes to epidemic dynamics across the country, local transmission is likely affected by human mobility and mixing within specific locations which are in turn driven by different NPIs in effect before and during the occurrence of viral seeding events.
Targeted NPIs and their effects on the spatial dynamics and persistence of SARS-CoV-2 lineages
An additional important factor in the unfolding of these epidemic waves is the highly localized NPIs implemented during 2021 following the national Paso a Paso plan. Despite the decentralized authority on their implementation, the stringency of these NPIs tended to be synchronized across the country (more strongly among comunas in the same region) and followed national epidemiological COVID-19 trends (Fig. S10). This created a scenario in which viral movements could occur between comunas under different lockdown stringency tiers. We hypothesize that movements within comunas should decrease with higher stringency lockdown tiers, which might also confer a protective effect on other comunas due to fewer viral imports.
To test this, we implement a negative binomial model to estimate whether higher lockdown stringency levels (n = 3) are associated with fewer inferred viral movements (modeling details are given in Materials and methods). Our results show that lockdown stringency is significantly negatively correlated with the inferred number of viral movements after accounting for new reported cases, both within comunas (χ2 = 7.95, P = 0.02) and between comunas (χ2 = 23.24, P < 0.001). Compared with comunas under full lockdown, comunas under no lockdown receive a greater number of inferred viral movements from other comunas (IRRbetween_comunas = 1.16, 95% CI: 1.06–1.26); comunas under weekend lockdown receive approximately the same number of viral movements from other comunas (IRRbetween_comunas = 0.97, 95% CI: 0.88–1.08). These inferred viral movements are dependent on and limited by the probability of new imports generating cases that can be detected through genomic surveillance and the genomic surveillance intensity across the country (i.e. the likelihood of detecting a TL given the sampling process that produces the genomic data).
The effects of lockdown stringency on inferred viral movements within the same comuna exhibit a similar pattern: a limited and nonsignificant reduction is observed for comunas under no lockdown compared with comunas under full lockdown (IRRwithin_comunas = 1.05, 95% CI: 0.98–1.14) and comunas under weekend lockdowns show no significant differences compared with comunas under full lockdown (IRRwithin_comunas = 0.95, 95% CI: 0.88–1.04). The magnitudes of these differences are small due in part to the large variation in the proportion of viral movements under each lockdown tier: essentially large numbers of viral movements are inferred for some comunas, while for others only rare and sparse viral importations are inferred (Fig. 4B).
A separate factor that could affect viral lineage movements in Chile is the implementation of the mobility pass for fully vaccinated individuals, to whom official lockdown restrictions did not apply. We expanded the previous model to account for the times before and after the implementation of the mobility pass on May 26. No evidence was found that the implementation of the mobility pass was associated with significant changes in the number of viral movements within comunas (χ2 = 1.04, P = 0.31), but we find a near two-fold increase (IRRbetween_comunas_mobility_pass = 1.73, 95% CI = 1.42−2.09) in the numbers of viral movements between comunas (χ2 = 29.87, P < 0.001).
Lineage size distributions have been shown to vary due to importation times, such that earlier importations lead to larger TLs, and these larger lineages persist for longer after implementations of NPIs (30). We measure persistence using an ad hoc analysis of the Bayesian posterior tree distribution for the largest Gamma lineage ((32), see Materials and methods). We find that the proportion of persisting branches per comuna decays over time across all lockdown tiers (Fig. S11). The rates at which persisting branches decay over weeks vary when assessed by a fitted exponential curve (44.85, 22.72, and 19.74% weekly reduction for comunas under full, weekend, and no lockdown, respectively) but wide credible intervals for all cases limit the interpretability of these differences between lockdown tiers. Future work should assess the location- and context-specific factors driving TL decay rates, including sampling intensities, geography, climate, and social interactions.
Discussion
Our results provide an interdisciplinary evaluation of the utility of airport genomic surveillance and the role of targeted interventions in detecting and predicting viral spread of multiple viral variants. Airport and community genomic surveillance in Chile presented an opportunity to investigate the context-specific spatial dynamics of SARS-CoV-2 variants. We find that new viral importations (as inferred from genomic data) are correlated with estimates of the expected intensity of viral importations. These measurements use independent data sources, with the former relying on phylogenetically inferred viral importations and the latter combining the flows of passengers, prevalence of the variant in the source country, and the number of cases. While this has been demonstrated in other settings, previous studies have typically focused on one viral variant at a time (1, 4, 30, 33). Incorporating the changing landscape of variant prevalence across South America revealed not only the importance of air travel but also that of land-based transport from neighboring countries, especially Chile's large, shared border with Argentina, which facilitated direct viral importations into the community, as observed for the Gamma variant. While a previous study has shown the importance of land-based transport in Jordan and the Middle East (13), the role of these land-based viral importations in Chile is particularly important given that the largest wave of viral importations from Argentina via land border crossings preceded the second epidemic wave of 2021 dominated by the Gamma variant (Fig. S6). This shows that the effects of viral importations across land borders can be highly impactful to the trajectory of an epidemic. Airport surveillance, established by the precedent of the importance of travelers in driving the importation of viral lineages into a country (e.g., (34)), appeared to be effective in detecting lineages at the airport, and we found some evidence that it prevented these lineages from further circulating in the community (Fig. 2D). The earliest location of detection of a new importation does not necessarily cluster geographically with its possible source, likely due to limited spatiotemporal resolution of genomic sampling and surveillance. However, a low prevalence at the inferred time of a new importation generally resulted in higher numbers of infections being derived from such importation for most variants (no such pattern was observed for Mu and an inverse pattern was observed for Gamma, Fig. S3), as was observed early on during the summer of 2020 in Europe when low disease prevalence at a time when international travel was reopened resulted in larger viral TLs (32).
The frequencies of SARS-CoV-2 variants during late 2020 and 2021 provide important context to our findings; in some regions, the pandemic was characterized by a sequence of genetic sweeps during which new variants displaced the circulating viral genetic lineages. In North America and Europe, this process started with Alpha (4, 35–38) which was subsequently replaced by Delta (1, 39–42) and ultimately Omicron (33, 43–46). Similarly, in South Africa epidemic waves were driven by emerging and imported VOCs; starting with Beta, followed by Delta and finally Omicron (6, 7). It is also notable that other countries saw the dominance of viral lineages which did not significantly expand across their national borders during the early epidemic waves (e.g. lineage B.1.398 in Lebanon (47), lineages B.1.1.222 and B.1.1.519 in Mexico (18)) and that regionally important variants that did not reach global significance were also identified (e.g. variant Eta in Nigeria and West Africa (48)). Given the high connectivity of various South American countries with European countries and US states (8, 18, 49, 50), SARS-CoV-2 lineages that were dominant in the northern hemisphere heavily influenced the viral lineage composition of the region during the early epidemic waves (51–60). Nonetheless, locally emerging variants played an important role in the viral dynamics of the continent, as exemplified by the emergence and spread of Gamma (5, 61). A recent quantification of the differences in importation and transmission dynamics in the region (62) shows that air travel volume is a strong predictor for viral spread for variants like Gamma and Lambda and highlights the heterogeneity of variant dominance across countries. Particularly, it emphasizes how Chile and Argentina show remarkable synchronicity between variant-specific epidemic waves. Our results expand this picture and disentangle the effects of different types of cross-border mobility in the spread of viral lineages. The possibility of extending this framework to predict which variants would become dominant in a country given the regional and global context would also require the characterization of both the unique immunity landscape of regions and the antigenic characteristics of novel variants.
Human mobility patterns within the country and the implementation of highly targeted NPIs played a role in the domestic spread of viral lineages within Chile. The time delay between the introduction of a new lineage in Chile and the local introduction of the lineage in a different comuna within Chile is negatively correlated with the intensity of local human movements from the source; this observation sets the basis for a probabilistic measure of viral invasion during the early stages of an epidemic as more remote comunas, which are less central in the country's mobility network, would expect a greater delay between the epidemic taking off in central locations of the country and their own epidemic seeding.
Links between human mobility and viral seeding may also have consequences for the efficacy of NPIs, such as the highly localized comuna-level lockdowns. The reactive nature of the implementation of higher lockdown stringency levels likely resulted in a synchronized pattern of NPI stringency across the country (with strong regional synchronicity, Fig. S10). Viral genomic data suggest that the patterns of lineage movements across comunas show some limited differences dependent on lockdown stringency, but the limited spatiotemporal coverage from genomic surveillance poses a considerable limitation to the interpretation of these differences. Furthermore, the observed decay in the proportion of persisting lineages across all lockdown tiers follows patterns observed in other contexts (30), and it is therefore unclear the extent to which these are attributable to the NPIs themselves. A detailed analysis on the effects of these stringency levels should be paired with the quantification of the mobility changes produced by NPIs to fully understand the causal relationship of lockdowns, changes in human behavior, and the infection dynamics.
Our study has several limitations. Firstly, after the emergence of Alpha, Gamma, and Beta, Chile implemented and increased their airport and community surveillance during 2021. This meant that the proportion of cases sequenced increased which was visible especially during the declining Delta wave at the end of 2021 (Fig. S2). The significantly larger number of inferred Delta introductions therefore needs to be interpreted with caution. We did, however, also estimate an increase of Delta EII during that time. Secondly, our EII calculations are biased due to variable case reporting across space and time. Previous work has proposed that sequencing ∼5% of all cases allows the detection of a new viral lineage with a detection probability >80%, and that the sequencing intensities displayed by lower middle– and upper middle–income countries (as is the case for many cases in South America) allow for the estimation of a lineage prevalence with a small margin of error (63). It is unclear whether lower levels of sequencing observed for certain countries resulted in biases in genomic prevalence estimates (i.e. the true prevalence of a variant could be either higher or lower than expected), which adds further uncertainty to our EII estimates. Thirdly, the mobile phone data represent roughly one quarter of the population of Chile and it has been shown that they over represent urban areas and higher income groups. Accounting and adjusting for these biases will be an important area of future work. Fourthly, following an importation, viral lineages spread in the country widely and circulated more intensely in key regions in north and central Chile. While continuous phylogeographic analyses are prone to sampling biases (64, 65), the distribution and circulation of these lineages around important urban areas and locations attractive for tourists suggests that viral movements in the country follow from human movements, as has been reported previously (1, 4, 8, 9, 30, 66, 67). The invasion process appears to be explained by the connectivity between comuna pairs, making human movement estimates from mobile phone usage an important predictor of arrival times of a new viral lineage into different comunas. Nonetheless, the effects of the heterogeneous NPI landscape in the country during the study period are also meaningful and likely represent an atypical mobility regime for Chile. A full description of the link between lockdown tiers and the true changes in human movement patterns (i.e. compliance with NPIs) is required to clarify the link between viral movements, human mobility, and NPIs. It is also crucial to account for the effects of socioeconomic differences both at an individual and geographic level (26). The early COVID-19 epidemic waves in Santiago in 2020 were sparked in wealthier comunas which also became more isolated following the implementation of the first lockdowns in the city (25). Exploring these patterns during subsequent epidemic waves and under comuna-level lockdowns requires additional follow-up work.
Coordinated genomic surveillance and the use of human mobility data can aid in the monitoring and prediction of viral spread during a large-scale national epidemic of directly transmitted pathogens. The weight of each of these data streams in the inference of pathogen lineage dynamics is an important question to address, as scaling up genomic pathogen surveillance can be costly, and data on human movements can be unavailable, or sensitive when not aggregated appropriately. Nonetheless, we find that surrogate measures such as EIIs correlate with inferred introductions, although EIIs require the collection and accessibility of epidemiological (and sometimes genomic) data from other countries. Identifying which locations act as key sources of viral importations can help to prioritize surveillance (68). Furthermore, our findings on the effect of human movement on the arrival times of viral lineages to new locations and the mitigating effects of mobility restrictions at targeted spatial scales can inform consideration of NPI packages that have a minimal effect on human freedom of movement while maintaining a high efficacy in epidemic control.
Materials and methods
SARS-CoV-2 sampling from community and airport surveillance in Chile
Chile implemented a systematic genomic surveillance program in April 2021. During epidemiological week (epiweek) 47 of 2020 (2020 November 21 to 2020 November 27), randomization of community samples for the early identification of VOI was implemented through PCR testing of variant-associated mutations (69, 70). This strategy considers random, representative sampling without incorporating clinical or epidemiological criteria to estimate the prevalence of SARS-CoV-2 variants and lineages circulating in Chile, using a weekly sample size based on the number of new cases registered the previous week. During epiweek 5 of 2022 (2022 January 30 to 2022 February 5), this strategy was updated for sampling from Arturo Merino Benítez International Airport (in this work referred to as the Santiago International Airport or SCL for short) and the laboratory requirements for submitting samples to the Instituto de Salud Publica (ISP) for genomic sequencing of SARS-CoV-2 cases (71). The strategy provided specific details regarding the sample size, which was based on the number of cases in travelers during the previous epiweek (with a novel variant prevalence set at 0.50%, or 1/200, and considering a 95% confidence level; (72)). This approach aimed to identify novel variants entering the country and estimate the incidence of circulating variants in travelers. From this surveillance program, we included 787 complete genome sequences from the airport surveillance scheme (40 Alpha genomes, 27 Gamma genomes, 62 Lambda genomes, 68 Mu genomes, 403 Delta genomes, and 187 genomes from other viral lineages) and 7,957 genome sequences from the community surveillance scheme (129 Alpha genomes, 2,100 Gamma genomes, 1,418 Lambda genomes, 745 Mu genomes, 1,796 Delta genomes, and 1,769 genomes from other viral lineages).
Sequencing was performed following details reported in (73). Total RNA was extracted from nasopharyngeal swabs using the Zybio EXM 6000 automated system (Chongqing, China), transcribed to cDNA using the SuperScript III One-step RT-PCR System with Platinum Taq Kit and RNase OUT (Invitrogen), and amplified using the COVID-19 ARTIC Illumina v.3 primer set. Sequencing libraries from PCR products were produced using the Nextera Flex Library Prep Kit (Illumina, San Diego, CA, USA), purified with Agencourt AMPure XP beads (Beckman Coulter, Brea, CA, USA), and quantified using a Quant-it dsDNA HS Assay Kit (Invitrogen). Purified libraries were sequenced in a MiSeq sequencer using a 300-cycle kit (Illumina, San Diego, CA, USA). Sequencing quality control was performed using the FastQC v0.11.8 software, reads were filtered and trimmed using the BBDuk software, and genome assembly was performed using the IRMA v0.9.3 software.
Data from COVID-19 epidemics in Chile and connected countries
Epidemiological, genomic, and human mobility data were collected between November 2020 (which includes the earliest estimated ancestor of a TL in Chile with a TPMRCA on 2020 November 1; TPMRCA referring to the time of the parent node of the earliest tree node inferred to have occurred in Chile, followed by local viral transmission inside the country) and October 2021 (which includes the latest sample collection date for our study time period on 2021 October 12). This time span is henceforth referred to as the “study period”.
The monthly numbers of air travelers entering Chile during the study period were collected from the National Civil Aviation Agency and the National Ministry of Transportation and Telecommunications (Junta Aeronáutica Civil, Ministerio de Transportes y Telecomunicaciones) at http://www.jac.gob.cl/estadisticas/informes-estadisticos-mensuales-del-trafico-aereo/(retrieved on 2022 August 3). These data include the source city from which passengers arrived into Chile, as well as the airport to which they arrived. Given that 99.24% of returning travelers entered the country through the Santiago de Chile International Airport (SCL) and that all other international airports only served travelers from eight countries (Argentina, Colombia, Peru, Paraguay, Venezuela, Bolivia, Haiti, and Uruguay) out of 23 countries where travelers arrived from, data from airports outside of SCL were not considered for further analyses as their overall contribution was negligible. Furthermore, genomic data from airport surveillance were only collected at SCL, making the use of these data more comparable to our phylogenetic inferences. Given that counts for incoming air travelers were only available on a monthly basis, the total number of passengers per month was divided equally across the weeks that make up each month to obtain weekly estimates. This transformation assumes equal numbers of passengers entering every week from each origin city, which is justified by the periodic frequencies of airline scheduling practices (74). The monthly number of individuals entering the country through land border crossings was obtained through an Information Transparency request to the government of Chile data portal (https://datos.gob.cl). These data included the name of the specific border crossing station at which the numbers were recorded, from which we can identify the source country from where travelers entered and the comuna in Chile where they first arrived. We also divided the numbers of travelers equally across weeks in any given month for convenience, even if the periodic flight schedule assumption is unlikely to apply to land border crossing data.
Anonymized, individually reported COVID-19 cases from Chile were provided by the ISP of Chile. For each case, the sample collection date and comuna (lowest administrative level in Chile equivalent to adm3) of residence of the patient are recorded. Cases are aggregated daily and by comuna to produce epidemiological time series for the country. During the study period, Chile established a unique decentralized system for NPIs where individual comunas had autonomous authority to place their area under one of three tiers of stringency on limitations to human movements and gatherings (Supporting Text). Daily records for the NPI stringency tier for each comuna were used to estimate the total population under lockdown on any given day by multiplying the number of comunas under lockdown by their respective population sizes (provided within the same data files, obtained through the public COVID-19 GitHub repository of the Government of Chile, (no longer available online) at https://github.com/MinCiencia/Datos-COVID19/tree/master/output/producto24). Population sizes and aggregated case counts per country for locations from which incoming travelers were recorded into Chile during the study period were obtained from the Our World in Data COVID-19 dashboard at https://ourworldindata.org/covid-cases (retrieved on 2021 December 5; (75)).
Genomic data from Chile were generated through two distinct genomic surveillance programs and are described in the SARS-CoV-2 sampling from community and airport surveillance in Chile section. Metadata for these sequences separated by surveillance program were provided by the ISP, including an individual sequence ID which matches the ID assigned to viral genomes when uploaded to GISAID (76). Genome sequences from Chile were downloaded from GISAID (retrieved on 2022 June 30) and filtered by matching individual IDs with the metadata provided by the ISP. The proportion of sequences for each variant observed under the community surveillance scheme during each epidemic week was used to estimate the number of cases attributable to each of the variants over time.
Generation of a background genomic dataset utilizing international human mobility data
Global genome datasets assigned to each variant under investigation (Alpha, Gamma, Lambda, Mu, and Delta) were downloaded jointly with the Chile sequences from the aforementioned GISAID dataset. To gain an overview for the global introductory events into Chile, we curated a bespoke global dataset using openly available human mobility data. We identified the five South American countries with the highest intensity of human mobility into Chile (Argentina, Bolivia, Brazil, Colombia, and Peru) and the seven countries outside of South America with direct flights and the highest intensity of human mobility into Chile (Mexico, Canada, France, Panama, Spain, the United States, and the United Kingdom). For each variant, all sequences from these locations were identified then randomly sampled, uniformly across time per epidemiological week over the study period. This resulted in datasets with an approximate 1:1:1 ratio of sequences from Chile vs. sequences from countries with the highest relative mobility vs. countries with the most direct flights from outside of South America: 609 sequences for Alpha, 4,826 sequences for Gamma, 3,164 sequences for Lambda, 1,720 sequences for Mu, and 6,983 sequences for Delta.
Extraction of mobility estimates from mobile phone network data
We analyzed up to 3.5 million individual daily trajectories (which correspond approximately to 24% of the Chilean mobile phone subscription market share) from a total of 6.5 million Telefonica mobile phone users. We then aggregate individual trajectories into mobility indicators, selecting three main mobility aggregation methods, and we proceed by evaluating the interplay of the mobility process, with the spatial invasion. The first aggregating method counts the number of displacements between any two consecutive eXtended Detail Records (XDRs) made by a user, and it accounts for the full trajectory of each individual (D). The second coupling matrix connects the residence location of each user to all their visited locations, with a coupling force that is proportional to the number of XDRs made in each location (L). The third coupling matrix (L1) connects the residence location of each user to all their visited locations, with a coupling force that is proportional to the time spent in each location (77). We quantified the associated coupling matrices (D, L, and L1) for each month during the study period in Chile at the comuna level.
Phylogenetic inference of SARS-CoV-2 viral importations into Chile
Variant-specific genomic datasets were aligned to the Hu-1 reference genome (78, 79) using Minimap2 (80), and a maximum likelihood (ML) phylogenetic tree was estimated for each alignment using IQtree (81) under a GTR substitution model and by modeling the evolutionary rate variation across sites using a Gamma distribution with four site categories. The temporal signal for each phylogeny was evaluated by estimating the regression between root-to-tip lengths and sample collection dates using TempEst (82), after establishing the most likely root for each phylogeny by a minimization of residuals from each regression. Sequences for which the residuals were estimated to fall more than two standard deviations from the mean were removed, and the ML tree re-estimated under the same conditions described above. For these phylogenies, polytomies were not randomly resolved but rather maintained in the trees.
Branch lengths for the inferred ML trees were re-scaled to fit a temporal timescale using TreeTime (83). For all phylogenies, a fixed evolutionary rate of 7.4 × 10−4 subst./site/year was used for these analyses, following the empirical findings by Ghafari et al. (84). Given that individual analyses were performed per variant, the use of a fixed evolutionary rate allows us to ignore the likely inflated evolutionary rate observed prior to the emergence of individual variants (85) while also minimizing potential comparability issues among datasets due to the increased rate variation among lineages due to the different dataset sizes and sampling periods which can obscure the temporal signal (86).
Time-scaled ML phylogenies were further analyzed using an alignment-free likelihood estimation method implemented in BEAST v1.10.5 (prerelease; commit:d1a45) commonly known as Thorney BEAST, and described in further detail in (30). The time-scaled ML phylogenies were used as starting trees, while the previously described polytomic ML phylogenies were used as data trees; Thorney BEAST samples from different node heights and resolutions for polytomies to generate a posterior sample where branch length (in genetic terms; i.e. number of mutations) likelihood estimates are calculated as a function of a Poisson distribution with mean equal to the evolutionary rate multiplied by the branch length (in time units; (1)). Individual MCMC chains for each dataset were run for 100 million steps with a burn-in of ∼10% of the initial steps. All runs had a fixed evolutionary rate of 7.4 × 10−4 subst./site/year and a nonparametric Skygrid tree prior (87) with breakpoints every two weeks of the sampling period for the specific variant being analyzed. Trees were sampled every 5,000 steps, and convergence of all parameters was evaluated using Tracer (88) and defined as parameter effective sample sizes higher than 200.
The resulting empirical tree distributions were reanalyzed using BEAST 1.10.4 to estimate the number of importations into Chile using a discrete trait analysis (DTA) phylogeographic approach. Sequences collected from outside of Chile were labeled as “non-Chile”; sequences collected within Chile were further identified as “airport” or “community” sequences according to whether they were collected through airport surveillance of returning travelers or community surveillance inside Chile, respectively. Transitions between discrete states were estimated using an asymmetric model, and individual counts of transitions between states were estimated by stochastic mapping in the form of Markov jumps and rewards (89, 90). These DTAs were run for 10 million MCMC steps and sampled every 10,000 steps. Summary maximum clade credibility (MCC) trees were generated for all datasets using TreeAnnotator.
Phylogeographic reconstruction of the domestic spread of viral TLs in the country
From the discrete phylogeographic analyses, individual importations were identified following the rationale used by du Plessis et al. (30): subtrees that descend from a node inferred to have occurred inside Chile, which in turn descends from a node inferred to have occurred outside Chile. These subtrees, referred to as TLs (TLs, (30)), were extracted using Fertree (https://github.com/jtmccr1/fertree). We analyze the 20 largest TLs (Alpha, n = 1; Gamma, n = 2; Lambda, n = 5; Mu, n = 5; Delta, n = 7) which correspond to those with sizes >93rd percentile of all identified lineages within the country (>61 tips). These TLs range in size from the largest being Gamma TL 35 (n = 1827) for Gamma (corresponding to the Chile-specific PANGO designated Gamma sub lineage N.4) to the smallest being Mu TL 61 (n = 61) for Mu. We extracted these larger TLs from the summary MCC trees and performed individual continuous phylogeographic analyses for each to map their spread across comunas within Chile.
Sequence metadata were available for sequences up to the comuna (adm3) level, but no individual georeferenced coordinates were collected to ensure patient privacy. For each individual sequence where the comuna sampling location was known, we randomly assigned a grid cell with sample weights proportional to the population density within the comuna obtained from WorldPop, at a resolution of 1 km2 grid (https://hub.worldpop.org/geodata/summary?id=44918). Sequences assigned to the same grid cell were further given random coordinates within the grid with uniform sampling to ensure that all sequences have unique coordinates as required by the continuous phylogeographic model. A final check was placed to ensure that the final coordinates fell within the comuna polygon.
For the 20 extracted subtrees, we pruned all sequences not collected in Chile and assigned the aforementioned coordinates to the remaining tips. We then used BEAST to estimate the continuous diffusion of the TLs using a relaxed random walk model with a Cauchy distribution to account for among-branch dispersal velocity heterogeneity (31). MCMC chains were run by duplicate for 50,000,000 steps each, with multiple runs performed when necessary to achieve convergence. Multiple chains for each TL were combined using LogCombiner, and the summary MCC trees for each TL were generated with TreeAnnotator. Mapping the dispersal of each TL in Chile was done with custom Python scripts (https://github.com/leoferres/spatial_dynamics_covid_chile).
EII indices from variant prevalence, case, and human travel data
To estimate the expected number of importations of each VOI/VOC into Chile over time independently from our phylogenetic inference, we compute a weekly EII index for every country from which incoming flights into Chile were recorded during late 2020 and 2021. Exceptions to the country-level EIIs were considered for the United States and Brazil where state-level estimates were produced instead to account for the broad geography of both countries and their heterogeneous epidemiological landscapes. Five states from the United States (Florida, New York, Texas, Georgia, and California) and seven states from Brazil (São Paulo, Rio de Janeiro, Goias, Parana, Rio Grande do Sul, Bahia, and Santa Catarina) were included as these are the only states in each country from which direct flights arrive to SCL.
For any variant under investigation X (Alpha, Gamma, Lambda, Mu, and Delta) and source location l, the EII over n epidemiological weeks is defined as
where Il,i is the aggregate number of new cases for epidemiological week i in location l, Pl is the population from location l, gX,l,i is the proportion of viral genome sequences (available in GISAID) that were sampled at location l in week i and that are assigned to variant X, and Al,i is the total air passenger volume coming into Chile from location l on week i. For the three neighboring countries (Argentina, Bolivia, and Peru) that share a land border (and therefore border crossings) with Chile, variation of the EII was estimated (lEII) as
where Ll,i is the number of land travelers entering Chile from location l on week i. For these countries, the air travel–based EII described previously (referred to as aEII when relevant) was combined with the lEII to estimate a global EII as
Since this formulation of the EII estimates the numbers of individuals expected to enter the country infected with variant X, we also derive a combined cEII from all locations from which international air travelers come to Chile:
Vector autoregressive models to test forecasting power of EIIs on viral imports per variant
Given the weekly estimates, every EII variation for each individual viral variant X is concatenated into a time series covering t = 48 epidemiological weeks, describing the expected number of introductions per variant over time. These EII time series are compared with time series of inferred observations of viral importations from the genomic data P, taken as the weekly aggregate counts of phylogenetic nodes which are the most recent common ancestors of individual Chile TLs. We formulate a bivariate vector autoregression model from both time series:
where yx,t is a vector of both P and EII values at week t and k is the maximum lag, equal to 12 to represent a maximum allowed delay between expected and inferred importations of approximately three months. From this, the system of equations is:
The optimum value for k is evaluated by comparing information criteria from each lag value using the vars package in R; we specifically evaluate the Akaike information criteria (AIC), Hannan–Quinn information criteria, Schwartz criteria (SC), and Akaike's final prediction error. We take the lowest value among the four indicators as the indicator of the optimum lag value indicator, which in most cases corresponds to SC or AIC (Table 1). To test whether phylogenetically inferred importations P are forecasted by the EII estimates, we perform Granger causality (GC) tests with α = 0.05. Contrary to what its name suggests, GC tests do not establish causality between the time series but rather temporally dependent correlation; we therefore refer to our findings as one time series Granger-causing the other, which is not true causality.
Estimation of the effects of human mobility at the comuna level on viral movements
Following a viral importation, some TLs spread domestically within Chile. The comunas where these local TLs were first detected are not a reliable measure of where the introduction likely took place due to delays between importation and detection as has been shown previously (30) (Fig. S12 shows the numbers of “first detection” events in the country and the location of land border crossings in the country; the lag between the TMRCAs and the first detection of TLs in Chile was estimated to be 9.59 ± 8.95 days). Instead, we estimate the impact of local scale human mobility on the invasion dynamics of different TLs by comparing comuna-to-comuna-specific arrival times extracted from our continuous phylogeographic analysis to human movements.
For each epoch v, we compute the metric of mobility exchange between the source comuna s and location j, namely as:
where j represents comunas, t runs from t0, day of first detection of the variant defined in epoch v in the source comuna s, to T, days to consider for the mobility aggregation, L1sj,t is the time spent by residents of s in j on day t. accounts for the total time spent, respectively, by users of the source location in comuna j and vice versa. Epochs v are defined as 60-day intervals starting on the first date of detection of a variant (which defines the epoch) at comuna level and are named after the variant itself. The time window of mobility aggregation T must be shorter than the epoch; in Fig. 4, we chose T = 2 weeks. For each epoch, only transmission events with both transmission start date and respective source time t0 falling within the epoch time window are chosen. Epochs may include multiple TLs spreading at the same time. Arrival times at comunas j are then compared with the human mobility between comuna j and the mobility between j and source s.
Estimation of the effects of NPI stringency at the comuna level on viral movements and viral lineage persistence
We formulated simple negative binomial models to test whether the stringency of NPIs implemented at the comuna level had a measurable effect on the estimated numbers of viral movements within a single comuna or between comunas. For every inferred viral movement (represented by a branch in the continuous phylogeographic analysis, with the ancestral node location accounting for the source comuna for that movement and the descendant node or tip location accounting for the destination comuna for that movement), we determine the lockdown stringency tier that corresponds to the estimated median age of the source and destination comunas, and aggregate all movements based on the lockdown tier of the destination comunas. The models estimate the aggregated counts of viral movements (either into each comuna or within a comuna) explained by both the lockdown stringency tier (three levels: no lockdown, weekend lockdown, or full lockdown) and the numbers of new cases reported during the duration of that lockdown. We confirmed the appropriateness of a negative binomial model compared with a Poisson regression model (i.e. the assumption of conditional means not being equal to conditional variances) by performing a likelihood ratio test between both model fit methods (DF = 1, P < 0.001). From the fitted model coefficients, we estimate the incidence rate ratios (IRR) to quantify the effect of lockdown tier stringency on viral movement counts. We further expand these models to also account for the effect of the implementation of a mobility pass for fully vaccinated individuals on 2021 May 26; results for both model formulations are shown even if a model which does not account for the mobility pass better fits the data for viral movements within comunas (LRT = 1.04, P = 0.31) while accounting for the mobility pass is a better fit for the data of viral movements between comunas (LRT = 29.87, P < 0.001).
Following the arrival of viral lineages to a new location (comuna), they could either persist and be detected within that location or become extinct. The number of newly arrived lineages and previously circulating lineages in a location can be used to estimate the proportions of persisting lineages (32, 49). We analyze the posterior tree distributions from the continuous phylogeographic analyses using PersistenceSummarizer (32) to estimate the number of persisting lineages within the 14 comunas with the highest number of observed viral movements (five comunas in the Santiago Metropolitan Area: Pirque, Puente Alto, Santiago, San Jose de Maipo, and San Bernardo; nine comunas from the rest of the country: Arica, Los Angeles, Requinoa, Copiapo, Iquique, Valparaiso, Antofagasta, Concepcion, and Puerto Montt) under different lockdown stringency tiers. We aggregate comunas under the same lockdown tier and set the date when the tier was enacted as the ancestral time and follow up with sequential evaluation times on every week following the implementation of the lockdown stringency tier. This way, we can estimate persistence trends in different comunas under the same lockdown stringency tier despite the fact that these were implemented at different points in time, and that stringency tiers were enacted and relaxed on multiple occasions across our study period. Full and weekend lockdowns were implemented for a maximum continuous time of 16 and 20 weeks, respectively (Fig. S13); we therefore perform lineage persistence analyses for a maximum of 25 weeks following the implementation of a specific stringency tier. The remaining comunas beyond the 14 mentioned above are excluded as they display few viral movements and therefore yield insufficient information regarding lineage persistence.
To estimate the rate of decay of persisting lineages under different stringency tiers, we fit an exponential model to the median number of persisting lineages over every week following the implementation of the specific tier. The model is formulated as:
where y is the median proportion of persisting lineages from Gamma TL 35 and x is the number of weeks since the implementation of that lockdown tier. Decay rates are estimated as model parameter b.
Supplementary Material
Acknowledgments
The contents of this publication are the sole responsibility of the authors and do not necessarily reflect the views of the European Commission or any other funder. The authors would like to thank and acknowledge all data contributors, i.e. the Authors and their Originating laboratories responsible for obtaining the specimens, and their Submitting laboratories for generating the genetic sequence and metadata and sharing via the GISAID Initiative. A complete list of the Authors and Originating laboratories can be found at the public GitHub repository.
Contributor Information
Bernardo Gutierrez, Department of Biology, University of Oxford, Oxford OX1 3SZ, United Kingdom; Pandemic Sciences Institute, University of Oxford, Oxford OX3 7DQ, United Kingdom; Colegio de Ciencias Biológicas y Ambientales, Universidad San Francisco de Quito USFQ, Quito 170901, Ecuador.
Joseph L -H Tsui, Department of Biology, University of Oxford, Oxford OX1 3SZ, United Kingdom.
Giulia Pullano, Department of Biology, Georgetown University, Washington, DC 20057, USA; INSERM, Sorbonne Université, Institut Pierre Louis d’Epidémiologie et de Santé Publique, IPLESP, 75012 Paris, France.
Mattia Mazzoli, INSERM, Sorbonne Université, Institut Pierre Louis d’Epidémiologie et de Santé Publique, IPLESP, 75012 Paris, France; ISI Foundation, 10126 Turin, Italy.
Karthik Gangavarapu, Department of Human Genetics, University of California Los Angeles, Los Angeles, CA 90095, USA.
Rhys P D Inward, Department of Biology, University of Oxford, Oxford OX1 3SZ, United Kingdom.
Sumali Bajaj, Department of Biology, University of Oxford, Oxford OX1 3SZ, United Kingdom.
Rosario Evans Pena, Department of Biology, University of Oxford, Oxford OX1 3SZ, United Kingdom.
Simon Busch-Moreno, Department of Biology, University of Oxford, Oxford OX1 3SZ, United Kingdom.
Marc A Suchard, Department of Human Genetics, University of California Los Angeles, Los Angeles, CA 90095, USA; Department of Biostatistics, University of California Los Angeles, Los Angeles, CA 90095, USA; Department of Biomathematics, University of California Los Angeles, Los Angeles, CA 90095, USA.
Oliver G Pybus, Department of Biology, University of Oxford, Oxford OX1 3SZ, United Kingdom; Pandemic Sciences Institute, University of Oxford, Oxford OX3 7DQ, United Kingdom; Department of Pathobiology and Population Science, Royal Veterinary College, London AL9 7TA, United Kingdom.
Alejandra Dunner, Instituto de Salud Pública de Chile, 7780050 Santiago, Chile.
Rodrigo Puentes, Instituto de Salud Pública de Chile, 7780050 Santiago, Chile.
Salvador Ayala, Instituto de Salud Pública de Chile, 7780050 Santiago, Chile.
Jorge Fernandez, Instituto de Salud Pública de Chile, 7780050 Santiago, Chile.
Rafael Araos, Facultad de Medicina Clínica Alemana, Instituto de Ciencias e Innovación en Medicina (ICIM), Universidad del Desarrollo, 7610671 Santiago, Chile.
Leo Ferres, ISI Foundation, 10126 Turin, Italy; Data Science Institute, Universidad del Desarrollo, 7610671 Santiago, Chile; Telefónica, 7500775 Santiago, Chile.
Vittoria Colizza, INSERM, Sorbonne Université, Institut Pierre Louis d’Epidémiologie et de Santé Publique, IPLESP, 75012 Paris, France; Tokyo Tech World Research Hub Initiative, Institute of Innovative Research, Tokyo Institute of Technology, Tokyo 152-8550, Japan.
Moritz U G Kraemer, Department of Biology, University of Oxford, Oxford OX1 3SZ, United Kingdom; Pandemic Sciences Institute, University of Oxford, Oxford OX3 7DQ, United Kingdom.
Supplementary Material
Supplementary material is available at PNAS Nexus online.
Funding
M.U.G.K. acknowledges funding from The Rockefeller Foundation, Google.org, the Oxford Martin School Programmes in Pandemic Genomics (also O.G.P. and B.G.) and Digital Pandemic Preparedness, European Union’s Horizon Europe programme projects MOOD (#874850) (also V.C., R.P.D.I., B.G., and R.E.P.) and E4Warning (101086640) (also S.B.-M.), the John Fell Fund, a Branco Weiss Fellowship, and Wellcome Trust grant nos. 225288/Z/22/Z, 226052/Z/22/Z, and 228186/Z/23/Z (also B.G.), United Kingdom Research and Innovation (#APP8583), and the Medical Research Foundation (MRF-RG-ICCH-2022-100069). L.F. and the authors thank the funding and support of Telefónica/Movistar Chile and CISCO Chile. This research was supported by the Fondo Nacional de Desarrollo Científico y Tecnológico (FONDECYT) of Chile grant no. 1221315 (to L.F.). L.F. also acknowledges financial support from the Lagrange Project of the Institute for Scientific Interchange Foundation (ISI Foundation), funded by Fondazione Cassa di Risparmio di Torino (Fondazione CRT).
Author Contributions
B.G., J.L.-H.T., M.U.G.K., V.C., L.F., G.P., and M.M. conceived and planned the research. A.d., R.P., S.A., and J.F. contributed an extension of the genomic and economic data, and preprocessed/cleaned the datasets. R.A. contributed socioeconomic data. B.G., J.L.-H.T., M.M., R.E.P., R.D.I., K.G., G.P., and L.F. analyzed the data. B.G. and M.U.G.K. wrote the first draft of the manuscript. All authors subsequently edited, read, and approved the manuscript.
Preprints
This manuscript was originally posted on a preprint server here: https://doi.org/10.1101/2024.01.18.24301504.
Data Availability
Publicly available input data (epidemiological, flight data, and land border crossings data) and scripts used to generate the analyses and plots of this manuscript are available in GitHub at https://github.com/BernardoGG/SARS2_Chile.git. Raw mobile phone data from Telefonica cannot be shared publicly due to data policy requirements. For replicability purposes, the mobility dataset is available upon reasonable request through direct contact with L.F. at lferres@udd.cl.
References
- 1. McCrone JT, et al. 2022. Context-specific emergence and growth of the SARS-CoV-2 Delta variant. Nature. 610:154–160. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Sun K, et al. 2021. Transmission heterogeneities, kinetics, and controllability of SARS-CoV-2. Science. 371:eabe2424. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. de Souza WM, et al. 2020. Epidemiological and clinical characteristics of the COVID-19 epidemic in Brazil. Nat Hum Behav. 4:856–865. [DOI] [PubMed] [Google Scholar]
- 4. Kraemer MUG, et al. 2021. Spatiotemporal invasion dynamics of SARS-CoV-2 lineage B.1.1.7 emergence. Science. 373:889–895. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Faria NR, et al. 2021. Genomics and epidemiology of the P.1 SARS-CoV-2 lineage in Manaus, Brazil. Science. 372:815. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Tegally H, et al. 2021. Detection of a SARS-CoV-2 variant of concern in South Africa. Nature. 592:438–443. [DOI] [PubMed] [Google Scholar]
- 7. Viana R, et al. 2022. Rapid epidemic expansion of the SARS-CoV-2 omicron variant in Southern Africa. Nature. 603:679–686. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Tegally H, et al. 2023. Dispersal patterns and influence of air travel during the global expansion of SARS-CoV-2 variants of concern. Cell. 186:3277–3290. 10.1016/j.cell.2023.06.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Tegally H, Khan K, Huber C, de Oliveira T, Kraemer MUG. 2022. Shifts in global mobility dictate the synchrony of SARS-CoV-2 epidemic waves. J Travel Med. 29:taac134. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Rader B, et al. 2020. Crowding and the shape of COVID-19 epidemics. Nat Med. 26:1829–1834. [DOI] [PubMed] [Google Scholar]
- 11. Mazzoli M, et al. 2021. Interplay between mobility, multi-seeding and lockdowns shapes COVID-19 local impact. PLoS Comput Biol. 17:e1009326. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Schlosser F, et al. 2020. COVID-19 lockdown induces disease-mitigating structural changes in mobility networks. Proc Natl Acad Sci U S A. 117:32883–32890. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Parker E, et al. 2022. Regional connectivity drove bidirectional transmission of SARS-CoV-2 in the Middle East during travel restrictions. Nat Commun. 13:4784. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Volz E. 2023. Fitness, growth and transmissibility of SARS-CoV-2 genetic variants. Nat Rev Genet. 24:724–734. 10.1038/s41576-023-00610-z [DOI] [PubMed] [Google Scholar]
- 15. Padilla-Rojas C, et al. 2021. Genomic analysis reveals a rapid spread and predominance of lambda (C.37) SARS-COV-2 lineage in Peru despite circulation of variants of concern. J Med Virol. 93:6845–6849. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Laiton-Donato K, et al. 2021. Characterization of the emerging B.1.621 variant of interest of SARS-CoV-2. Infect Genet Evol. 95:105038. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Jimenez-Silva C, et al. 2023. Genomic epidemiology of circulating SARS-CoV-2 variants during first two years of the pandemic in Colombia. Commun Med (Lond). 3:97. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Castelán-Sánchez HG, et al. 2023. Comparing the evolutionary dynamics of predominant SARS-CoV-2 virus lineages co-circulating in Mexico. eLife. 12:e82069. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. West AP, et al. 2021. Detection and characterization of the SARS-CoV-2 lineage B.1.526 in New York. Nat Commun. 12:4886. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Annavajhala MK, et al. 2021. Emergence and expansion of SARS-CoV-2 B.1.526 after identification in New York. Nature. 597:703–708. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Deng X, et al. 2021. Transmission, infectivity, and neutralization of a spike L452R SARS-CoV-2 variant. Cell. 184:3426–3437.e8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Castillo AE, et al. 2020. Phylogenetic analysis of the first four SARS-CoV-2 cases in Chile. J Med Virol. 92:1562–1566. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Castillo AE, et al. 2020. Geographical distribution of genetic variants and lineages of SARS-CoV-2 in Chile. Front Public Health. 8:562615. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Subtel . “Especial Análisis Nueva Tecnología 5G en Internet Móvil y crecimiento Tecnología Fibra en Internet Fija” (Ministerio de Transportes y Telecomunicaciones, 2022) (October 12, 2023). https://www.subtel.gob.cl/wp-content/uploads/2022/06/PPT_Series_MARZO_2022_V0.pdf
- 25. Gozzi N, et al. 2021. Estimating the effect of social inequalities on the mitigation of COVID-19 across communities in Santiago de Chile. Nat Commun. 12:2429. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Mena GE, et al. 2021. Socioeconomic status determines COVID-19 incidence and related mortality in Santiago, Chile. Science. 372:eabg5298. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Li Y, Undurraga EA, Zubizarreta JR. 2022. Effectiveness of localized lockdowns in the COVID-19 pandemic. Am J Epidemiol. 191:812–824. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Cuadrado C, et al. Impact of small-area lockdowns for the control of the COVID-19 pandemic. medRxiv 2020. 10.1101/2020.05.05.20092106, preprint: not peer reviewed. [DOI]
- 29. Molina-Mora JA, et al. 2023. Overview of the SARS-CoV-2 genotypes circulating in Latin America during 2021. Front Public Health. 11:1095202. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. du Plessis L, et al. 2021. Establishment and lineage dynamics of the SARS-CoV-2 epidemic in the UK. Science. 371:708–712. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Lemey P, Rambaut A, Welch JJ, Suchard MA. 2010. Phylogeography takes a relaxed random walk in continuous space and time. Mol Biol Evol. 27:1877–1885. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Lemey P, et al. 2021. Untangling introductions and persistence in COVID-19 resurgence in Europe. Nature. 595:713–717. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Tsui JL-H, et al. 2023. Genomic assessment of invasion dynamics of SARS-CoV-2 omicron BA.1. Science. 381:336–343. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Filipe AdS, et al. 2021. Genomic epidemiology reveals multiple introductions of SARS-CoV-2 from mainland Europe into Scotland. Nat Microbiol. 6:112–122. [DOI] [PubMed] [Google Scholar]
- 35. Volz E, et al. 2021. Assessing transmissibility of SARS-CoV-2 lineage B.1.1.7 in England. Nature. 593:266–269. [DOI] [PubMed] [Google Scholar]
- 36. Buenestado-Serrano S, et al. 2022. First confirmation of importation and transmission in Spain of the newly identified SARS-CoV-2 B.1.1.7 variant. Enferm Infecc Microbiol Clin. 40:546–549. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Alpert T, et al. 2021. Early introductions and transmission of SARS-CoV-2 variant B.1.1.7 in the United States. Cell. 184:2595–2604.e13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Michaelsen TY, et al. 2022. Introduction and transmission of SARS-CoV-2 lineage B.1.1.7, alpha variant, in Denmark. Genome Med. 14:47. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Vaidyanathan G. 2021. Coronavirus variants are spreading in India—what scientists know so far. Nature. 593:321–322. 10.1038/d41586-021-01274-7 [DOI] [PubMed] [Google Scholar]
- 40. Vöhringer HS, et al. 2021. Genomic reconstruction of the SARS-CoV-2 epidemic in England. Nature. 600:506–511. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Earnest R, et al. 2022. Comparative transmissibility of SARS-CoV-2 variants Delta and alpha in New England, USA. Cell Rep Med. 3:100583. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Campbell F, et al. 2021. Increased transmissibility and global spread of SARS-CoV-2 variants of concern as at June 2021. Euro Surveill. 26:2100509. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Kozlov M. 2022. How does omicron spread so fast? A high viral load isn’t the answer. Nature. 10.1038/d41586-022-00129-z [DOI] [PubMed] [Google Scholar]
- 44. Sofonea MT, et al. 2022. Analyzing and modeling the spread of SARS-CoV-2 omicron lineages BA.1 and BA.2, France, September 2021-February 2022. Emerg Infect Dis. 28:1355–1365. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Ito K, Piantham C, Nishiura H. 2022. Relative instantaneous reproduction number of omicron SARS-CoV-2 variant with respect to the Delta variant in Denmark. J Med Virol. 94:2265–2268. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46. Eggink D, et al. 2022. Increased risk of infection with SARS-CoV-2 omicron BA.1 compared with Delta in vaccinated and previously infected individuals, the Netherlands, 22 November 2021 to 19 January 2022. Euro Surveill. 27:2101196. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. Nour D, et al. 2022. The role of Lebanon in the COVID-19 butterfly effect: the B.1.398 example. Viruses. 14:1640. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48. Ozer EA, et al. 2022. Multiple expansions of globally uncommon SARS-CoV-2 lineages in Nigeria. Nat Commun. 13:688. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49. Matteson NL, et al. 2023. Genomic surveillance reveals dynamic shifts in the connectivity of COVID-19 epidemics. Cell. 186:5690–5704.e20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. Kraemer MUG, et al. 2020. Mapping global variation in human mobility. Nat Hum Behav. 4:800–810. [DOI] [PubMed] [Google Scholar]
- 51. Laiton-Donato K, et al. 2020. Genomic epidemiology of severe acute respiratory syndrome coronavirus 2, Colombia. Emerg Infect Dis. 26:2854–2862. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52. Candido DS, et al. 2020. Evolution and epidemic spread of SARS-CoV-2 in Brazil. Science. 369:1255–1260. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53. Franco D, et al. 2021. Early transmission dynamics, spread, and genomic characterization of SARS-CoV-2 in Panama. Emerg Infect Dis. 27:612–615. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54. Juscamayta-López E, et al. 2021. Phylogenomics reveals multiple introductions and early spread of SARS-CoV-2 into Peru. J Med Virol. 93:5961–5968. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55. Salazar C, et al. , Multiple introductions, regional spread and local differentiation during the first week of COVID-19 epidemic in Montevideo, Uruguay. bioRxiv (2020) 10.1101/2020.05.09.086223, preprint: not peer reviewed. [DOI] [Google Scholar]
- 56. Elizondo V, et al. 2021. SARS-CoV-2 genomic characterization and clinical manifestation of the COVID-19 outbreak in Uruguay. Emerg Microbes Infect. 10:51–65. 10.1080/22221751.2020.1863747 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57. Gutierrez B, et al. 2021. Genomic epidemiology of SARS-CoV-2 transmission lineages in Ecuador. Virus Evol. 7:veab051. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58. Loureiro CL, et al. 2021. SARS-CoV-2 genetic diversity in Venezuela: predominance of D614G variants and analysis of one outbreak. PLoS One. 16:e0247196. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59. Santiago GA, et al. 2022. Genomic surveillance of SARS-CoV-2 in Puerto Rico enabled early detection and tracking of variants. Commun Med. 2:100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60. Giovanetti M, et al. 2022. Genomic epidemiology of the SARS-CoV-2 epidemic in Brazil. Nat Microbiol. 7:1490–1500. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61. Torres C, et al. 2021. Cost-effective method to perform SARS-CoV-2 variant surveillance: detection of alpha, gamma, lambda, delta, epsilon, and zeta in Argentina. Front Med. 8:755463. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62. Gräf T, et al. 2024. Dispersion patterns of SARS-CoV-2 variants gamma, lambda and mu in Latin America and the Caribbean. Nat Commun. 15:1837. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63. Brito AF, et al. 2022. Global disparities in SARS-CoV-2 genomic surveillance. Nat Commun. 13:7003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64. Kalkauskas A, et al. 2021. Sampling bias and model choice in continuous phylogeography: getting lost on a random walk. PLoS Comput Biol. 17:e1008561. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65. Liu P, Song Y, Colijn C, MacPherson A. 2022. The impact of sampling bias on viral phylogeographic reconstruction. PLOS Glob Public Health. 2:e0000577. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66. Bogoch II, et al. 2020. Pneumonia of unknown aetiology in Wuhan, China: potential for international spread via commercial air travel. J Travel Med. 27:taaa008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67. Candido DDS, et al. 2020. Routes for COVID-19 importation in Brazil. J Travel Med. 27:taaa042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68. Bastani H, et al. 2021. Efficient and targeted COVID-19 border testing via reinforcement learning. Nature. 599:108–113. [DOI] [PubMed] [Google Scholar]
- 69. Ministerio de Salud de Chile . Instructivo sobre estrategia de vigilancia genómica mediante mutaciones asociadas a variantes a través de RT–PCR. Ordinario B51 N°338 del 26 de enero 2022.
- 70. Ministerio de Salud de Chile . Proyecto de Vigilancia Genómica de SARS- CoV-2. Ordinario Res. Ex. N°403 del 27 de abril 2021.
- 71. The University of Texas COVID-19 modeling consortium . Variant detection calculator. https://covid-19.tacc.utexas.edu/dashboards/variants/
- 72. Meyer's Lab, University of Texas at Austin . User’s guide to variant detection calculators. March 2, 2021.
- 73. Departamento de Epidemiología, Ministerio de Salud de Chile . Informe epidemiológico No. 12: Vigilancia genómica de SARS-CoV-2 (COVID-19), 22 de agosto de 2021.
- 74. Zhou L, Liang Z, Chou C-A, Chaovalitwongse WA. 2020. Airline planning and scheduling: models and solution methodologies. Front Eng Manag. 7:1–26. [Google Scholar]
- 75. Mathieu E, et al. 2020. Coronavirus pandemic (COVID-19). OurWorldInData.org
- 76. Shu Y, McCauley J. 2017. GISAID: global initiative on sharing all influenza data—from vision to reality. Euro Surveill. 22:30494. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77. Pullano G. 2021. “Human mobility and epidemics,” Sorbonne Université (September 26, 2023).
- 78. Wu F, et al. 2020. A new coronavirus associated with human respiratory disease in China. Nature. 579:265–269. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79. Zhu N, et al. 2020. A novel coronavirus from patients with pneumonia in China, 2019. N Engl J Med. 382:727–733. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80. Li H. 2018. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics. 34:3094–3100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81. Minh BQ, et al. 2020. IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era. Mol Biol Evol. 37:1530–1534. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82. Rambaut A, Lam TT, Carvalho LM, Pybus OG. 2016. Exploring the temporal structure of heterochronous sequences using TempEst (formerly Path-O-Gen). Virus Evol. 2:vew007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83. Sagulenko P, Puller V, Neher RA. 2018. TreeTime: maximum-likelihood phylodynamic analysis. Virus Evol. 4:vex042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84. Ghafari M, et al. 2022. Purifying selection determines the short-term time dependency of evolutionary rates in SARS-CoV-2 and pH1N1 influenza. Mol Biol Evol. 39:msac009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85. Tay JH, Porter AF, Wirth W, Duchene S. 2022. The emergence of SARS-CoV-2 variants of concern is driven by acceleration of the substitution rate. Mol Biol Evol. 39:msac013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86. Duchene S, et al. 2020. Temporal signal and the phylodynamic threshold of SARS-CoV-2. Virus Evol. 6:veaa061. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87. Gill MS, et al. 2013. Improving Bayesian population dynamics inference: a coalescent-based model for multiple loci. Mol Biol Evol. 30:713–724. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88. Rambaut A, Drummond AJ, Xie D, Baele G, Suchard MA. 2018. Posterior summarization in Bayesian phylogenetics using tracer 1.7. Syst Biol. 67:901–904. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89. Minin VN, Suchard MA. 2008. Fast, accurate and simulation-free stochastic mapping. Philos Trans R Soc Lond B Biol Sci. 363:3985–3995. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90. Minin VN, Suchard MA. 2008. Counting labeled transitions in continuous-time Markov models of evolution. J Math Biol. 56:391–412. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Publicly available input data (epidemiological, flight data, and land border crossings data) and scripts used to generate the analyses and plots of this manuscript are available in GitHub at https://github.com/BernardoGG/SARS2_Chile.git. Raw mobile phone data from Telefonica cannot be shared publicly due to data policy requirements. For replicability purposes, the mobility dataset is available upon reasonable request through direct contact with L.F. at lferres@udd.cl.