Abstract
The basic reproductive number (R0) is a fundamental measure used to quantify the transmission potential of an epidemic in public health practice. However, R0 cannot reflect the time-varying nature of an epidemic. A time-varying effective reproductive number Rt can provide more information because it tracks the subsequent evolution of transmission. However, since it neglects individual-level geographical variations in exposure risk, Rt may smooth out interpersonal heterogeneous transmission potential, obscure high-risk spreaders, and hence hamper the effectiveness of control measures in spatial dimension. Therefore, this study proposes a new method for quantifying spatially adjusted (time-varying) reproductive numbers that reflects spatial heterogeneity in transmission potential among individuals. This new method estimates individual-level effective reproductive numbers (Rj) and a summarized indicator for population-level time-varying reproductive number (Rt). Data from the five most severe dengue outbreaks in southern Taiwan from 1998–2015 were used to demonstrate the ability of the method to highlight early spreaders contributing to the geographic expansion of dengue transmission. Our results show spatial heterogeneity in the transmission potential of dengue among individuals and identify the spreaders with the highest Rj during the epidemic period. The results also reveal that super-spreaders are usually early spreaders that locate at the edges of the epidemic foci, which means that these cases could be the drivers of the expansion of the outbreak. Therefore, our proposed method depicts a more detailed spatial-temporal dengue transmission process and identifies the significant role of the edges of the epidemic foci, which could be weak spots in disease control and prevention.
Subject terms: Infectious diseases, Epidemiology, Computational science
Introduction
The basic reproductive number (R0) is a fundamental measure used to quantify the transmission potential of an epidemic1. It is defined as the number of infections caused by an index case within a completely susceptible population, i.e., a population in which there is no pre-existing immunity. R0 is a summary index suggesting both the intrinsic transmissibility of a pathogen and the infrastructure that allows the disease to spread in a given setting. In particular, the value of R0 is affected by transmission probability, contact rate and duration of infectiousness2.
Based on deterministic homogenous-mixing epidemic models3, public health practitioners usually regard R0 = 1 as a useful threshold for ensuring the development of an outbreak, referred to as the epidemic threshold4. The health authorities also use R0 for predicting final epidemic size3,5 and assessing the resources required to contain the outbreak, e.g., determining what proportion of people should be vaccinated2. Therefore, it is of great practical importance to estimate R0 for a possible outbreak in public health practice. The value of R0 is estimated from the initial growth rate of an epidemic based on mechanistic models, such as the susceptible-infected-recovered model3,6. For example, R0 has been estimated at approximately 1.5 for 2009 H1N1 influenza7, at 3 for the 2003 severe acute respiratory syndrome (SARS) outbreak8, and at 12–18 for historical measles outbreaks9. As an epidemic unfolds, however, the depletion and recovery of the susceptible population cause fluctuating effective transmissibility, so the value of R0 cannot reflect the time-varying nature of an epidemic. Moreover, when an epidemic occurs in a realistic contact network, R0 can also fail to characterize the transmission potential at the initial stage10.
On the other hand, the effective reproductive number (R) is defined as the number of infections caused by any case11. The difference between R and R0 is that the value of R does not depend on the assumption that the population is completely susceptible, which is often violated in later stages of an outbreak or in a situation in which the population has been exposed to the pathogen previously. Therefore, R aims to characterize the progression of an epidemic in a realistic scenario. Intuitively, counting the branches of infection on transmission trees (i.e., who infects whom6) precisely quantifies the value of R; however, doing this is impractical in most circumstances except for confined outbreaks where contact tracing is feasible12. The epidemic curve (an illustration of the number of new infections) does not encode transmission dynamics and cannot be used to infer R directly. The renewal equation11, which describes the temporal transmission relationship between propagating generations of infected cases, is therefore proposed to estimate R from incidence data in different epidemic periods. More specifically, the renewal equation estimates the values of the time-varying effective reproductive number (Rt), defined as the population-level transmission potential at time t11,13. In a fully susceptible population, early values of Rt should approximate R0, but Rt is more informative in that it tracks the subsequent evolution of transmission potential during the course of an outbreak. The values of Rt can also reveal the time when an outbreak was contained by monitoring the epidemic threshold. During the 2014 Ebola outbreak in West Africa, Rt captured a distinct temporal pattern of transmission potential in different countries, signaling the different levels of control measures needed14. As the change in transmission potential is highly correlated to control measures, public health practitioners can evaluate the effectiveness of control measures by determining the change in Rt after implementation. For example, sudden drops in Rt (from 3 to approximately 0.7) after the issuance of a global alert during the 2003 SARS outbreak indicated the effectiveness of the alert15.
Time-varying effective reproductive numbers can be estimated by the renewal equation with a weighted ratio of infectors and infectees11. The generation interval, defined as the infection time interval between a potential infector and an infectee, is the major component of the process of determining the temporal weights of possible transmission. A major pitfall of the renewal equation method is that it automatically assumes homogeneous mixing of individuals. In effect, the temporal weight assumes that transmission probability is determined only by the temporal difference in onset of illness between the infector and infectees, regardless of possible variations due to geographical and social proximity. Human mobility and contact patterns are highly structured in the real world16–18 and inevitably violate the homogeneous mixing assumption19. Studies have demonstrated spatial heterogeneities in the transmission of dengue20, cholera21, influenza22, and foot-and-mouth disease23. Recent studies have also shown that spatial distance between cases strongly influences short-term movement24 and, hence, the spread of these diseases. Moreover, short-range transmission has served the essential mode of disease transmission25, i.e., patients typically have a higher chance of infecting others nearby. Although this spatial effect has been considered in the aforementioned studies and in the transmission tree reconstruction of some diseases26,27, the effect of spatial distance on the estimation of the effective reproductive number has not yet been explored. The spatial relationship between individuals can be utilized to calculate the discriminative transmission potential for each individual (Fig. 1; see the methods section for details). Neglecting the spatial variation, conversely, may smooth out heterogeneities in transmission potential, obscure high-risk spreaders, and hamper the effectiveness of control measures. In summary, the effect of spatial proximity or distance should be accounted for in the estimation of effective reproductive numbers.
Therefore, the objective of this study is to propose such method for quantifying spatially adjusted reproductive numbers that reflects spatial heterogeneity in exposure risk. It generates individual-level effective reproductive numbers (Rj) and a summarized indicator for the whole population (Rt) by the transmission probability estimated for all infector-infectee pairs, based on both temporal and spatial characteristics. Temporal weighting functions account for the fact that infected cases can only transmit the disease effectively within a certain time window (the generation interval). When only temporal weighting is considered, our method is equivalent to the renewal equation, assuming homogeneous mixing. The spatial weighting function, on the other hand, accounts for the decaying chance of transmission when the distance between individuals increases. The value of spatially adjusted Rj can provide more information regarding spatial heterogeneities in transmission potential and can better aid in the implementation of control measures. Data from dengue epidemics in southern Taiwan are used to demonstrate the ability of this method to identify early spreaders contributing to the geographic expansion of dengue transmission.
Methods
Study Area
Taiwan (23.778°N, 120.930°E) is an island country at the border of tropical and subtropical climate zones. Due to its geographic proximity to dengue endemic countries in Southeast Asia (Fig. 2A), dengue outbreaks in Taiwan are triggered by imported index cases from endemic regions. Large outbreaks have occurred in the tropical monsoon regions of southern Taiwan, particularly in the cities of Tainan (TN) and Kaohsiung (KH). These two large tropical cities feature high temperatures and high humidity, high population density, and highly urbanized landscapes, all of which provide appropriate breeding habitats for Aedes aegypti, the main vector mosquitoes of dengue virus. Thus, as dengue outbreaks occur frequently in late summer and wane in winter, the two cities form natural settings for the repetitive observation of dengue invasion and epidemic propagation.
Dengue data
Dengue fever is listed as a notifiable infectious disease in Taiwan. This means that physicians are mandated to report suspected cases in which the patient lives in or has a history of travel to a dengue-affected area and has corresponding symptoms, including fever, rash, eye pain, leukopenia, etc. The reported cases are then confirmed by standard laboratory tests, including real-time PCR, ELISA, and antigen detection28. These surveillance data are recorded and provided by Taiwan Centers for Disease Control, Ministry of Health and Welfare29. The dengue database starts in 1998, when the electronic reporting system was implemented, and we include all records since 1998-01-01. Individual-level information, including date of disease onset and X-Y coordinates of residence, is also provided. Each pair of residence coordinates is listed as the center of a basic statistical area, which is the smallest geographic unit for socioeconomic surveying in Taiwan. There was a total of 78,749 confirmed dengue cases (4.4% of them were imported cases) in Taiwan from 1998 to 2017, 92.5% of which occurred in Tainan and Kaohsiung cities.
Figure 2B shows the historical records of dengue epidemics in Taiwan since 1998; the five most severe outbreaks were included in this study. Among the five outbreaks, two occurred in Tainan (2007TN, 2015TN), and three occurred in Kaohsiung (2002KH, 2014KH, 2015KH). The sizes of the outbreaks ranged from 22,784 cases in 2015TN to 1,183 cases in 2011KH. Generally, dengue outbreaks start in June, reach their peak around September, and end near the end of the year or early the next January. In this study, we analyzed the largest outbreak, 2015TN, to demonstrating our method, while detailed results for the other four epidemics are presented in our supplementary results. Of the 22,784 total infections in 2015TN, only 19 cases (<0.1%) were imported from foreign countries, suggesting that indigenous transmission was well established. In contrast with Kaohsiung city, very few outbreaks occurred in Taiwan before 2015.
Quantifying the temporal transmission dynamics
The renewal equation specifies the relationship between generations of incident cases, as shown in Eq. (1). It can be applied to estimate time-varying effective reproductive numbers due to its simplicity and generality regarding the temporal transmission dynamics11,13,30.
1 |
Rt represents the population-level time-varying effective reproductive number at time t. denotes the number of incident cases at time x, and in the denominator denotes the number of potential infectors generated at a days (a ≥ 0) prior to time x. The hat notations stand for observed case numbers. g(a) stands for the distribution of the generation interval, which is used as a weighting function for representing transmission potential at time interval a. It can be specified by the nature of disease transmission process.
Importantly, the generation interval, also known as generation time, is the period between the infection of an infector and the infection of its infectees. It is a fundamental parameter reflecting the natural history of pathogens. Since the generation interval may vary between individuals, it can be described by a probability distribution, g(a), where a is a time interval. Therefore, the function can be regarded as the transmission weight between a pair of cases whose observed generation interval equals a (Fig. 1B). The representation of transmission weight (also referred to as transmission likelihood15) as g(a) is the basic component of the process of estimating effective reproductive numbers in the renewal equation and the following proposed method. The generation interval of dengue, in particular, is composed if four periods: host incubation, vector incubation, host infectious period and vector infectious period31. Nonetheless, showing compatibility with empirical data and mathematical convenience, gamma distributions are frequently used to model the generation interval of dengue collectively32. In this study, we used a gamma distribution with mean = 20 days and standard deviation = 9 days in our analysis, in accordance with a previous study25.
Individual reproductive number
In order to capture spatial variations among individuals, we adopted another method developed by Wallinga and Teunis15 to estimate individual reproductive numbers (Rj). We refer to this method as the non-adjusted method (i.e., non-adjusted for spatial effect). The transmission probability between pair of cases can be described mathematically as
2 |
where pij is the probability that case j infects case i. Note that we use superscripts to index time-invariant, individual (or pairwise) attributes (e.g., Rj, pij) and subscripts for time-varying attributes (e.g., Rt). g(·) is the temporal weighting function, representing generation interval. Given the onset time interval , is the transmission weight of the case pair (i, j). The pairwise transmission weight is then normalized by all received transmission weights of case i (from all potential infectors k ≠ i) to produce consistent estimation of transmission probability. The resulting pij is interpreted as the probability of individual i being infected by individual j. Therefore, Rj as the average number of secondary cases caused by individual j is the sum of all pij involving j as the infector, as shown in Fig. 1A and Eq. (3).
3 |
Spatially adjusted reproductive number
In this study, we extend a previous method with a spatial weighting function, , in order to account for the effect of spatial variation on dengue infections. We refer to this extended method as the (spatially) adjusted method. The transmission probability for case pair (i, j) becomes
4 |
where is the distance between the pair (i, j) and f is a function relating transmission weight to the distance between cases. The specifications of the temporal difference and g remain the same as in the previous method. Likewise, the spatially adjusted individual reproductive number is the sum of those involving j as the infector (Eq. 5).
5 |
The spatial weighting function is also called the transmission kernel and is a monotonically decaying function with respect to distance, reflecting neighborhood transmission32. Thus, unlike the generation interval, which is typically marked with a temporally lagged effect because of the latency period of infectiousness, the spatial weighting function decreases monotonically as distance increases, which means people would be easily get infections if they live near each other. It reflects people in the nearby neighborhood may share common environmental sources of dengue infection. There are several kinds of spatial kernels used in the literature, including exponential decay25 and power-law decay33. In the context of dengue transmission, we adopted an exponential decaying kernel with mean = 125 m27. This approach accounts for both temporal and spatial relationships when estimating individual-level reproductive numbers. Apart from the component of the generation interval inherited from the previous method, the distance-decayed spatial weighting function captures the spatial risk of dengue infection.
To calculate the population-level effective reproductive number from individual estimates, the Rj values can be further aggregated to form the Adjusted Rt given a specified time step τ13:
6 |
which is simply the average of all the values of Adjusted Rj for which tj falls in the time window [t − 0.5τ, t + 0.5τ]. As long as the weighting functions are specified, this method allows a quick estimation of population (Adjusted Rt) and individual (Adjusted Rj) effective reproductive numbers using observed data where onset dates are available.
Individual transmission distance
We defined individual transmission distance Dj as the average of geographic distances from an infector j to all potential infectees i, weighted by their pair-wise transmission probabilities (pij), as shown in Eq. (7). Dj could reflect the infection range transmitted by a particular infector j.
7 |
Quantifying the preconditions of spreaders
To explore the phenomenon of spatial expansion, two types of spreaders are of interest: early spreaders and succeeding spreaders. Early spreaders are defined as those cases that trigger a new cluster (by introducing the disease into an unaffected area). There are usually few or no cases of the disease, and the cluster tendency is low until several early spreaders emerge. Succeeding spreaders are defined as those cases that were infected by early cases nearby. Although there is a clear distinction between the two spreaders in theory, most individuals lie on a spectrum between these two extremes. We have therefore quantified this property according to the clustering tendency at the location and time immediately before the emergence of that spreader. In other words, clustering tendency is used as the measure for the precondition of a dengue case, and the median clustering tendency is used as the cutoff to classify all dengue cases. If the precondition clustering tendency is higher than the median, the case is categorized as a succeeding spreader; otherwise, it is characterized as an early spreader.
We used kernel density estimation (KDE) to quantify spatial clustering tendency in order to (1) determine the preconditions of a specific spreader and (2) compare the distributions of spatially adjusted effective reproductive numbers and case clustering patterns. The clustering tendency at the location of case j is defined as34
8 |
where h is called the kernel bandwidth or smoothing parameter (i.e., the parameter controlling the extent of smoothness), denotes the distance between case i and the locality of case j, and K(·) is a spatial smoothing function characterizing the contribution of each individual over the relative distance .
Results
The incidence of the 2015TN dengue outbreak are shown in Fig. 3A. The outbreak emerged in May with a handful of sporadic cases and finished the next January with a total of 22,784 cases. We divided the outbreak into six stages: emerging (I), growing (II and III), peak plateau (IV), and decaying (V and VI), to depict the spatiotemporal evolutions of the outbreak. Figure 3B shows the Adjusted Rt and Non-Adjusted Rt of the outbreak. These Rt curves are median or mean estimates of the adjusted and non-adjusted individual reproductive numbers, summarizing the temporal evolution of population-level transmission potential. Adjusted Rt provides further information: the shaded area represents the interquartile range (IQR) of adjusted individual reproductive numbers, reflecting the spatial individual heterogeneity of the transmission potential. In contrast, individual reproductive numbers of incident cases that shared identical dates of illness onset are constants in Non-Adjusted Rt. This indicates that the incident cases during this period were highly geographically heterogeneous in their transmission potential. The distributions of Adjusted Rj in these stages are also multimodal and right-skewed, which indicates that some incident cases could have higher transmission potential (i.e. super-spreaders).
Figure 3C shows the spatial distributions of incident cases in these stages. We can identify the linkages between time-varying reproductive numbers (Fig. 3B) and spatial-temporal distributions of incident cases (Fig. 3A,C) in different stages. In the emerging stage, the irregular growth of Adjusted Rt reflects the initial growth of the outbreak with a sporadic distribution of incident cases and a small emerging cluster (Fig. 3C). Subsequently, in the growing stages (from stage II to III), the epidemic curve started to show exponential growth (Fig. 3A); the larger IQR of the individual reproductive numbers (Fig. 3B) reflects some incident cases with higher transmission potential that occurred and resulted in vigorous expansion of disease clustering (Fig. 3C). In the stage of peak plateau (stage IV), Adjusted Rt dropped under the epidemic threshold (<1), indicating that the outbreak was contained. The spatial distributions of cases also show that the clustering areas remained and stopped expanding. In sum, the time-varying reproductive numbers, Adjusted Rt and Non-adjusted Rt, can reflect the timing of the outbreak containment according to the epidemic threshold. The Adjusted Rt can further reflect the timing of epidemic expansion when the IQR of individual reproductive numbers increases.
In Fig. 4, the distributions of individual transmission distance were compared between the two methods: Non-adjusted and Adjusted Rj. Based on the neighborhood transmission setting of Adjusted Rj, the transmission distance of an infector is around 200–300 meters and few long-range transmission links (longer than 2 kilometers). Furthermore, we used Adjusted Rj = 10 as a threshold to categorize super- (Adjusted Rj > 10) and normal- (Adjusted Rj < 10) spreaders. The figure also shows that long-range transmission links are from super-spreaders. The result indicates that dengue cases with high transmissibility have the ability to spread pathogens to geographically distant areas.
To further explore spatial relationships of super-spreaders and dengue epidemic expansion, we illustrated the locations of super-spreaders and clustering tendency of the dengue epidemic during the rapidly growing period (stages II and III), as shown in Fig. 5. Except the very beginning of the stage II (June-25–July-6), the figure shows that super-spreaders tend to distribute at or outside the edge of the main clusters from July-06 to August-23. In addition, the circle size of the super-spreaders in Fig. 5 represent their transmission range. Therefore, the locations of larger circles would reflect long-rage transmission occurred at the edge of dengue clusters. It implied that the role of super-spreaders could be the drivers of geographic expansion of the dengue epidemic.
In order to profile the roles of different spreaders in detailed spatial transmission/expansion process, we classified the cases into two types, early spreaders and succeeding spreaders. Early spreaders are regarded as the sources of new emerging clusters, and succeeding spreaders are those that come after early spreaders. Figure 6 presents a succeeding spreader a (panel A) and an early spreader b (panel B). In each panel, we also compared different methods (spatially adjusted and non-adjusted) that estimate the transmission probability from a spreader to its potential infectees. The time of the incident cases on these maps is the 30th day after the onset day of the given spreader. Infectees labeled with darker colors have a higher probability of becoming infected by spreader a or b. By the non-adjusted method, the two spreaders have identical transmission probabilities and individual reproductive number because of the homogenous mixing assumption (Non-adjusted Ra = Non-adjusted Rb = 4.15). However, they should play different roles in the spatial expansion of the outbreak. Spreader a (the succeeding spreader) occurred in an ongoing cluster; thus, this case is unlikely to be the primary source that triggered this local outbreak. Spreader b (the early spreader), on the contrary, initiated a new cluster where no case had occurred before, and the following cases that emerged were centered on spreader b. Thus, a subsequent local outbreak can be logically attributed to spreader b as the primary ancestor. The adjusted method (left-side maps in Fig. 6), which takes into account the spatial-temporal relationships of incident cases, differentiates the individual reproductive numbers of early and succeeding spreaders (Adjusted Ra = 1.84, Adjusted Rb = 5.74). It also yields more reasonable spatial transmission potential by upweighting the potential infectees proximity to spreader a in Fig. 6A.
To clarify the distinct roles of early and succeeding spreaders in the outbreak expansion process, we compared the distributions of Adjusted Rj between different types of spreaders in Fig. 7A. We found that all the super spreaders (Adjusted Rj > 10) are early spreaders and most of succeeding spreaders have low Adjusted Rj. To examine the generalization of this pattern, we estimated Adjusted Rj of dengue cases in top five largest dengue epidemics since 1990 in Taiwan, including KH2002, TN2007, KH2014, KH2015, TN2015, as shown in Fig. 7B. Non-adjusted Rj were used as the baseline to control for the fluctuations across different stages of the outbreak. The figure showed consistent patterns among these large-scale dengue outbreaks, which means early spreaders with high transmissibility can be generally highlighted by the spatially adjusted method.
Discussion
The effective time-varying reproductive number is a commonly used indicator for measuring disease transmissibility. However, the index conventionally does not capture spatial dynamics of disease transmission. We proposed a new method of calculating the spatially adjusted effective reproductive number by incorporating a spatial-weighting function that captures the nature of heterogeneous mixing. Unlike the averaged time-varying reproductive number, this method estimates different individual-level reproductive numbers (Rj) for given onset times and locations of dengue infections. Thus, it can reflect spatial heterogeneity in transmission potential among individuals and identify the possible super-spreaders35 with high Rj during the rapidly growing period. Our results also reveal that dengue cases with high transmission potential and long-transmission distance are usually located at the edges of the epidemic foci, which means they could be the drivers of further outbreak expansion. Therefore, our proposed method depicts a more detailed spatial-temporal dengue transmission process and identifies the significant role of the edges apart from the epidemic foci, which could be the weak spots in disease control and prevention. Our spatially adjusted method also could further apply to assess individual-level transmission potential of other acute contagious diseases, such as influenza and Zika virus infection, with observed generation interval and the context of neighborhood transmission.
The effective reproductive number of dengue as estimated in past studies reflected an averaged overall epidemic trend, which did not take into account the spatial heterogeneity of transmissibility. Hsieh36 estimated the effective reproductive number at the initial stage of the 2015TN outbreak to be 6.84, which is similar to our averaged estimates (Fig. 3B). Hsieh’s study also showed that Kaohsiung consistently possessed lower effective reproductive numbers (from 1.29 to 2.87), implying that Tainan city may have epidemiological characteristics, such as a lack of herd immunity, that make it more prone to dengue transmission than Kaohsiung. Internationally, Guzzetta25 reported that the time-varying reproductive number of non-endemic urban cities in Brazil is much smaller than the estimates from Tainan and Kaohsiung cities (maximum Rt is approximately 2.2). Codeco31 and Pinho37 also estimated time-varying reproductive numbers in Salvador and Brazil, with a maximum Rt of approximately 4.5. Comparing the values of Rt among different cities may be subject to possible confounding factors, such as weather conditions, host immunity and circulating viral strands.
Some studies considered location information to determine reproductive numbers for quantifying transmissibility of foot-and-mouth disease (FMD)38,39. These models considered spatial distribution of farms, however, it was difficult to capture FMD disease transmission process among animals. In other words, the time-varying reproductive numbers in these models are difficult to reflect nature course of FMD transmission in terms of implementing general interval and renewal equation. Furthermore, farm-level models for FMD outbreaks can be categorized the farms into infected and susceptible ones to determine relative transmissibility of each farm at a specific time. It may be difficult to estimate the amount of susceptible persons living around the infected cases for human infectious diseases, such as influenza and dengue fever. Other studies have also explored spatial heterogeneity by performing stratified analysis of Rt with respect to different administrative regions and with respect to region-to-region transmission40,41. These studies used a spatial weighting function to emphasize the interregional transmission process but still assumed a homogeneous-mixing model within each region. Thus, these studies did not address the spatial heterogeneous mixing issue when estimating reproductive numbers. Our study considered individual-level spatial heterogeneity and used the spatially adjusted reproductive number to measure the transmission potential of each individual.
Like the basic reproductive number, Rj can be regarded as a function of duration of infectiousness, incubation period, transmission probability, vector mosquito density, and host-vector contact rate. The adjusted Rj in this study could reflect vector mosquito density and host-vector contact rate, which are also highly heterogeneous in space42. Meanwhile, the dispersal of vector mosquitoes is largely confined to neighboring areas (average radius of 28–199 meters43), providing an effective infectious zone of an infector. An exponential spatial weighting function herein represents this infectious zone. Guzzetta25 estimated the mean transmission distance of dengue in a metropolis area to be approximately 127 m and further indicated that an exponential distribution described the data better than a radiation model which is a more dispersed distribution effectively describes human mobility44. Kissler, et al.45 also reported the aptness of the exponential distribution when an outbreak is typically dominated by short-range transmission. Therefore, the exponential spatial weighting function in this study is an appropriate substitute for the effective infectious zone for measuring dengue transmission. The exponential distance-decayed function also avoids overestimating the transmission probabilities of infector-infectee pairs with long geographic distances, especially for large-scale dengue epidemics. In sum, areas with high Rj are potential risk areas for high dengue transmission; knowledge of such areas is important for spatial targeting during dengue epidemics.
Spatial epidemiological studies focused for many years on developing methods for identifying significant disease clustering in time and space, such as space-time scan statistics46 or point pattern analysis47. Hotspot areas identified are usually regarded as significant risk areas and as high-priority sites for intervention strategies aimed at mitigating an epidemic48,49. The significance of the study is to introduce the perspective of individual-level transmission potential to disease risk mapping. We found that locations of individuals with high transmission potential are usually located at the edges of growing disease clusters, which can easily be neglected when intervention resources focus on epidemic clusters. A previous study found that urban villages that were originally at the edge of the city but are now enclosed by urbanized lands act as transfer stations for dengue outbreaks50. In this study, we further provide a better understanding of outbreak expansion by categorizing different types of spreaders. Early spreaders with high transmission potential may initiate new source of infection at the edges of the main cluster, resulting in geographic expansion at the exponential growing stages of the outbreak. Therefore, the edge of the outbreak should be a priority of spatial targeting to contain the outbreak regarding both range and magnitude. Succeeding spreaders are indeed still important in tallying morbidity and fatality. However, their high-density clustering patterns make them prone to the depletion of local susceptible populations and degenerating transmission potential. In summary, the center and edges of epidemic clusters play different roles in developing epidemic progression in terms of different types of spreaders (succeeding vs. early) and different patterns of epidemic growth (intensifying vs. expanding). These findings provide important insights for implementing different interventions in the center and on the edges of epidemic clusters.
There are several limitations to this study. First, the method of estimating time-varying effective reproductive numbers is a retrospective procedure that uses observed infectee generation to estimate the Rj of the infector generation. In other words, it cannot be used for predicting future epidemic progression in real time. Nonetheless, the method is helpful for understanding the course of an epidemic and studying the possible mechanisms of geographical expansion. Second, this study considers only geographical distance as a factor in transmission potential. Other factors influence the spatial spreading of dengue. Among these, host heterogeneity (including variations in density51, mobility52, and susceptibility53) strongly modulates the transmission dynamic42 and should be considered in further studies. Finally, the spatial weighting function reflects the assumption of distance-decayed properties (neighborhood transmission). However, the assumption may not reflect long-distance transmission54 and complex urban transport and mobility52. The question of how to develop more detailed spatial weighting schemes that capture realistic mobility patterns warrants further investigation.
Supplementary information
Acknowledgements
The research was supported by the grants of Ministry of Science and Technology (MOST 107-2410-H-002-167-MY3; MOST 108-2638-H-002-002-MY2) and National Health Research Institutes (NHRI-108A1-MRCO-0619191) in Taiwan. The authors also acknowledge the financial support provided by Infectious Diseases Research and Education Center, Ministry of Health and Welfare (MOHW) and National Taiwan University (NTU) and the NTU Research Center for Future Earth from The Featured Areas Research Center Program within the framework of the Higher Education Sprout Project by the Ministry of Education (MOE) in Taiwan. The funders had no role in study design, data collection and analysis, or preparation of the manuscript.
Author contributions
T.H.W. conceived of the main conceptual ideas. T.C.N. and T.H.W. developed the theory, analyzed the results, and wrote the manuscript. T.C.N. performed the experiments in discussions with T.H.W.
Data availability
The dengue surveillance dataset used in the current study are publicly available in Taiwan CDC Open Data Portal, https://data.cdc.gov.tw. The data analysis tutorial is included in Supplementary Information files.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
is available for this paper at 10.1038/s41598-019-55574-0.
References
- 1.Heesterbeek J, Dietz K. The concept of Ro in epidemic theory. Statistica Neerlandica. 1996;50:89–110. doi: 10.1111/j.1467-9574.1996.tb01482.x. [DOI] [Google Scholar]
- 2.Dietz K. The estimation of the basic reproduction number for infectious diseases. Statistical methods in medical research. 1993;2:23–41. doi: 10.1177/096228029300200103. [DOI] [PubMed] [Google Scholar]
- 3.Vynnycky, E. & White, R. An introduction to infectious disease modelling. OUP oxford, (2010).
- 4.Heffernan JM, Smith RJ, Wahl LM. Perspectives on the basic reproductive ratio. Journal of the Royal Society Interface. 2005;2:281–293. doi: 10.1098/rsif.2005.0042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Yi, W. & JinDe, C. Final size of network epidemic models: Properties and connections. SCIENCE CHINA Information Sciences. Advance online publication. doi:10.1007/s11432-019-2656-2 (2019).
- 6.Cauchemez, S., Hoze, N., Cousien, A. & Nikolay, B. How Modelling Can Enhance the Analysis of Imperfect Epidemic Data. Trends in parasitology (2019). [DOI] [PMC free article] [PubMed]
- 7.Fraser C, et al. Pandemic potential of a strain of influenza A (H1N1): early findings. Science. 2009;324:1557–1561. doi: 10.1126/science.1176062. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Lipsitch M, et al. Transmission dynamics and control of severe acute respiratory syndrome. Science. 2003;300:1966–1970. doi: 10.1126/science.1086616. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Guerra FM, et al. The basic reproduction number (R0) of measles: a systematic review. The Lancet Infectious Diseases. 2017;17:e420–e428. doi: 10.1016/S1473-3099(17)30307-9. [DOI] [PubMed] [Google Scholar]
- 10.Liu Q-H, et al. Measurability of the epidemic reproduction number in data-driven contact networks. Proceedings of the National Academy of Sciences. 2018;115:12680–12685. doi: 10.1073/pnas.1811115115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Wallinga J, Lipsitch M. How generation intervals shape the relationship between growth rates and reproductive numbers. Proceedings of the Royal Society B: Biological Sciences. 2006;274:599–604. doi: 10.1098/rspb.2006.3754. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Faye O, et al. Chains of transmission and control of Ebola virus disease in Conakry, Guinea, in 2014: an observational study. The Lancet Infectious Diseases. 2015;15:320–326. doi: 10.1016/S1473-3099(14)71075-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Cori A, Ferguson NM, Fraser C, Cauchemez S. A new framework and software to estimate time-varying reproduction numbers during epidemics. American journal of epidemiology. 2013;178:1505–1512. doi: 10.1093/aje/kwt133. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Team WER. Ebola virus disease in West Africa—the first 9 months of the epidemic and forward projections. New England Journal of Medicine. 2014;371:1481–1495. doi: 10.1056/NEJMoa1411100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Wallinga J, Teunis P. Different epidemic curves for severe acute respiratory syndrome reveal similar impacts of control measures. American Journal of epidemiology. 2004;160:509–516. doi: 10.1093/aje/kwh255. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Balcan D, et al. Multiscale mobility networks and the spatial spreading of infectious diseases. Proceedings of the National Academy of Sciences. 2009;106:21484–21489. doi: 10.1073/pnas.0906910106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Mossong J, et al. Social contacts and mixing patterns relevant to the spread of infectious diseases. PLoS medicine. 2008;5:e74. doi: 10.1371/journal.pmed.0050074. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Roth C, Kang SM, Batty M, Barthélemy M. Structure of urban movements: polycentric activity and entangled hierarchical flows. PloS one. 2011;6:e15923. doi: 10.1371/journal.pone.0015923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Riley S. Large-scale spatial-transmission models of infectious disease. Science. 2007;316:1298–1301. doi: 10.1126/science.1134695. [DOI] [PubMed] [Google Scholar]
- 20.Favier C, et al. Influence of spatial heterogeneity on an emerging infectious disease: the case of dengue epidemics. Proceedings of the Royal Society B: Biological Sciences. 2005;272:1171–1177. doi: 10.1098/rspb.2004.3020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Tuite AR, et al. Cholera epidemic in Haiti, 2010: using a transmission model to explain spatial spread of disease and identify optimal control interventions. Annals of internal medicine. 2011;154:593–601. doi: 10.7326/0003-4819-154-9-201105030-00334. [DOI] [PubMed] [Google Scholar]
- 22.Viboud C, et al. Synchrony, waves, and spatial hierarchies in the spread of influenza. Science. 2006;312:447–451. doi: 10.1126/science.1125237. [DOI] [PubMed] [Google Scholar]
- 23.Savill NJ, et al. Topographic determinants of foot and mouth disease transmission in the UK 2001 epidemic. BMC Veterinary Research. 2006;2:3. doi: 10.1186/1746-6148-2-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Kraay AN, Trostle J, Brouwer AF, Cevallos W, Eisenberg JN. Determinants of Short-term Movement in a Developing Region and Implications for Disease Transmission. Epidemiology. 2018;29:117–125. doi: 10.1097/EDE.0000000000000751. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Guzzetta G, Marques-Toledo CA, Rosà R, Teixeira M, Merler S. Quantifying the spatial spread of dengue in a non-endemic Brazilian metropolis via transmission chain reconstruction. Nature communications. 2018;9:2837. doi: 10.1038/s41467-018-05230-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Ypma RJ, et al. Unravelling transmission trees of infectious diseases by combining genetic and epidemiological data. Proceedings of the Royal Society B: Biological Sciences. 2011;279:444–450. doi: 10.1098/rspb.2011.0913. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Wen T-H, Sabel CE, Wang I-H. A geo-computational algorithm for exploring the structure of diffusion progression in time and space. Scientific reports. 2017;7:12565. doi: 10.1038/s41598-017-12852-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Center for Diseases Control, M. o. H. a. W., Taiwan. Guidelines for dengue control. (2009).
- 29.Center for Diseases Control, M. o. H. a. W., Taiwan. Dengue daily Confirmed cases since 1998, https://data.cdc.gov.tw/dataset/dengue-daily-determined-cases-1998 (2019).
- 30.Champredon D, Dushoff J, Earn DJ. Equivalence of the Erlang-distributed SEIR epidemic model and the renewal equation. SIAM Journal on Applied Mathematics. 2018;78:3258–3278. doi: 10.1137/18M1186411. [DOI] [Google Scholar]
- 31.Codeço CT, Villela DA, Coelho FC. Estimating the effective reproduction number of dengue considering temperature-dependent generation intervals. Epidemics. 2018;25:101–111. doi: 10.1016/j.epidem.2018.05.011. [DOI] [PubMed] [Google Scholar]
- 32.Cowling BJ, Fang VJ, Riley S, Peiris JM, Leung GM. Estimation of the serial interval of influenza. Epidemiology. 2009;20:344. doi: 10.1097/EDE.0b013e31819d1092. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Meyer S, Held L. Power-law models for infectious disease spread. The Annals of Applied Statistics. 2014;8:1612–1639. doi: 10.1214/14-AOAS743. [DOI] [Google Scholar]
- 34.Shi X. Selection of bandwidth type and adjustment side in kernel density estimation over inhomogeneous backgrounds. International Journal of Geographical Information Science. 2010;24:643–660. doi: 10.1080/13658810902950625. [DOI] [Google Scholar]
- 35.Stein RA. Super-spreaders in infectious diseases. International Journal of Infectious Diseases. 2011;15:e510–e513. doi: 10.1016/j.ijid.2010.06.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Hsieh Y-H. Dengue outbreaks in Taiwan, 1998-2017: importation, serotype and temporal pattern. Asian Pacific Journal of Tropical Medicine. 2018;11:460. doi: 10.4103/1995-7645.240081. [DOI] [PubMed] [Google Scholar]
- 37.Pinho SD, et al. Modelling the dynamics of dengue real epidemics. Philosophical Transactions of the Royal Society A: Mathematical, Physical Engineering Sciences. 2010;368:5679–5693. doi: 10.1098/rsta.2010.0278. [DOI] [PubMed] [Google Scholar]
- 38.Ferguson NM, Donnelly CA, Anderson RM. Transmission intensity and impact of control policies on the foot and mouth epidemic in Great Britain. Nature. 2001;413:542. doi: 10.1038/35097116. [DOI] [PubMed] [Google Scholar]
- 39.Haydon DT, et al. The construction and analysis of epidemic trees with reference to the 2001 UK foot–and–mouth outbreak. Proceedings of the Royal Society of London. Series B: Biological Sciences. 2003;270:121–127. doi: 10.1098/rspb.2002.2191. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Backer JA, Wallinga J. Spatiotemporal analysis of the 2014 Ebola epidemic in West Africa. PLoS computational biology. 2016;12:e1005210. doi: 10.1371/journal.pcbi.1005210. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Yamauchi T, Takeuchi S, Yamano Y, Kuroda Y, Nakadate T. Estimation of the effective reproduction number of influenza based on weekly reports in Miyazaki Prefecture. Scientific reports. 2019;9:2539. doi: 10.1038/s41598-019-39057-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Cummins B, Cortez R, Foppa IM, Walbeck J, Hyman JM. A spatial model of mosquito host-seeking behavior. PLoS computational biology. 2012;8:e1002500. doi: 10.1371/journal.pcbi.1002500. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Harrington LC, et al. Dispersal of the dengue vector Aedes aegypti within and between rural communities. The American journal of tropical medicine. 2005;72:209–220. doi: 10.4269/ajtmh.2005.72.209. [DOI] [PubMed] [Google Scholar]
- 44.Simini F, González MC, Maritan A, Barabási A-L. A universal model for mobility and migration patterns. Nature. 2012;484:96. doi: 10.1038/nature10856. [DOI] [PubMed] [Google Scholar]
- 45.Kissler SM, et al. Geographic transmission hubs of the 2009 influenza pandemic in the United States. Epidemics. 2019;26:86–94. doi: 10.1016/j.epidem.2018.10.002. [DOI] [PubMed] [Google Scholar]
- 46.Kulldorff M, Heffernan R, Hartman J, Assunçao R, Mostashari F. A space–time permutation scan statistic for disease outbreak detection. PLoS medicine. 2005;2:e59. doi: 10.1371/journal.pmed.0020059. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Gatrell, A. C., Bailey, T. C., Diggle, P. J. & Rowlingson, B. S. Spatial point pattern analysis and its application in geographical epidemiology. Transactions of the Institute of British geographers, 256–274 (1996).
- 48.Cuadros DF, Awad SF, Abu-Raddad LJ. Mapping HIV clustering: a strategy for identifying populations at high risk ofHIV infection in sub-Saharan Africa. International journal of health geographics. 2013;12:28. doi: 10.1186/1476-072X-12-28. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Liu Y, et al. Detecting spatial-temporal clusters of HFMD from 2007 to 2011 in Shandong Province, China. PloS one. 2013;8:e63447. doi: 10.1371/journal.pone.0063447. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Ren H, Wu W, Li T, Yang Z. Urban villages as transfer stations for dengue fever epidemic: A case study in the Guangzhou, China. PLoS neglected tropical diseases. 2019;13:e0007350. doi: 10.1371/journal.pntd.0007350. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Caraco T, Duryea MC, Glavanakov S, Maniatty W, Szymanski BK. Host spatial heterogeneity and the spread of vector-borne infection. Theoretical Population Biology. 2001;59:185–206. doi: 10.1006/tpbi.2000.1517. [DOI] [PubMed] [Google Scholar]
- 52.Stoddard ST, et al. The role of human movement in the transmission of vector-borne pathogens. PLoS neglected tropical diseases. 2009;3:e481. doi: 10.1371/journal.pntd.0000481. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Smith D, Dushoff J, Snow R, Hay S. The entomological inoculation rate and Plasmodium falciparum infection in African children. Nature. 2005;438:492. doi: 10.1038/nature04024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Wang Y, Cao J, Li X, Alsaedi A. Edge-based epidemic dynamics with multiple routes of transmission on random networks. Nonlinear Dynamics. 2018;91:403–420. doi: 10.1007/s11071-017-3877-3. [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The dengue surveillance dataset used in the current study are publicly available in Taiwan CDC Open Data Portal, https://data.cdc.gov.tw. The data analysis tutorial is included in Supplementary Information files.