Abstract
Background
The required efforts, feasibility and predicted success of an intervention strategy against an infectious disease are partially determined by its basic reproduction number, R0. In its simplest form R0 can be understood as the product of the infectious period, the number of infectious contacts and the per-contact transmission probability, which in the case of vector-transmitted diseases necessarily extend to the vector stages. As vectors do not usually recover from infection, they remain infectious for life, which places high significance on the vector’s life expectancy. Current methods for estimating the R0 for a vector-borne disease are mostly derived from compartmental modelling frameworks assuming constant vector mortality rates. We hypothesised that some of the assumptions underlying these models can lead to unrealistic high vector life expectancies with important repercussions for R0 estimates.
Methodology and principal findings
Here we used a stochastic, individual-based model which allowed us to directly measure the number of secondary infections arising from one index case under different assumptions about vector mortality. Our results confirm that formulas based on age-independent mortality rates can overestimate R0 by nearly 100% compared to our own estimate derived from first principles. We further provide a correction factor that can be used with a standard R0 formula and adjusts for the discrepancies due to erroneous vector age distributions.
Conclusion
Vector mortality rates play a crucial role for the success and general epidemiology of vector-transmitted diseases. Many modelling efforts intrinsically assume these to be age-independent, which, as clearly demonstrated here, can lead to severe over-estimation of the disease’s reproduction number. Our results thus re-emphasise the importance of obtaining field-relevant and species-dependent vector mortality rates, which in turn would facilitate more realistic intervention impact predictions.
Author summary
Many infectious diseases of public health concern, such as dengue, Zika and malaria, are transmitted by insect vectors. Control effort required to curb their continued spread or even prevent their establishment in the first place are partially determined by the disease’s basic reproduction number, R0. Of particular importance for estimating R0 is the duration at which the vector can transmit the disease, which is limited by its life expectancy. Many R0 estimation methods are based on mathematical frameworks that assume constant vector mortality rates. Here we demonstrate how the resulting exponential distribution in life-expectancy can lead to significant over-estimations. By means of an individual-based model we elucidate the effect of vector mortality on R0 estimates and derive a correction factor that alleviates some of the discrepancies due to erroneous vector age distributions. Our results clearly demonstrate the need to obtain more realistic, i.e. field-relevant vector mortality rates in order to generate robust estimates of a disease’s reproduction number and provide guidance for setting-specific disease control.
Introduction
Over the last few decades there has been a global rise in the emergence and re-emergence of vector-borne infectious diseases [1]. The continuing threat of Plasmodium falciparum malaria [2, 3] and dengue [4], the rapid, near pandemic spread of Zika virus [5] or the recent epizootic outbreak of Yersinia pestis (plague) in Madagascar [6] are just some examples of pathogens transmitted by insect vectors that pose a major threat to global public health. Their dependence on insects for transmission between vertebrate hosts has a number of important implications. First, they are frequently subject to strong spatial and temporal fluctuations due to environmental and climatic variations, such as seasonality in rainfall or temperature. Second, these pathogens should be amenable to vector control. That is, disease transmission can, at least in theory, be interrupted simply by removing the insect vector (e.g. use of insecticides) or by preventing contact between the vector and the host (e.g. use of bednets). Furthermore, it has been suggested that only a fraction of insects need to be removed or vector-host contacts to be prevented for the disease to die out. This concept is largely based on mathematical theory that can be traced back to the first formal description and mathematical treatment of the malaria life-cycle by Ross [7]. Unfortunately, translating theoretical predictions to practical applications, especially with regards to disease elimination through vector control, has only resulted in partial success.
The epidemiological reasoning behind the theory relies on a particular threshold condition involving the so-called basic reproduction number, R0, which denotes the expected number of secondary cases arising from a single infection in a totally susceptible population [8]. To date, R0 is frequently used either to predict the extent of an epidemic outbreak or to derive the necessary conditions to prevent this outbreak from happening, e.g. by means of vaccination. The crux of the problem is how to robustly derive or estimate this number in the first place. Compartmentalised systems of ordinary differential equations (ODEs) have been in use for decades to understand infectious diseases at the population level and provide the backbone for most formulas for R0 [9]. These allow the reproduction number to be computed either exclusively using empirically informed parameter estimates or from the initial growth rate of an outbreak [10]. Although the latter is the more common approximation method for directly-transmitted disease [11–14], it has equally been applied to vector-borne pathogens [15–17].
An important consideration for R0 estimates of vector-borne pathogens is that these can vary substantially across space and time. For example, reported R0 estimates for the complete transmission cycle of Plasmodium falciparum in Africa range from 1 to more than 3,000 [18, 19]. Based on nine epidemics in Brazil between 1996 and 2003, the reproduction number for dengue has been estimated to be somewhere between 2 to 103 [20], and median estimates for Zika range between 2.6–4.8 in French Polynesia [21] and 4–9 in Rio de Janiero [22]. The reasons for such wide variations are manifold. As mentioned earlier, the dependence on insect vectors for transmission can naturally introduce large spatio-temporal heterogeneities. That is, a disease introduced during the dry season will behave very differently to the same disease being introduced during the rainy seasons. Equally, an outbreak in a densely populated urban area will likely take a different course than an outbreak in a sparsely populated rural area. Here we argue that in addition to these natural variations and potential differences in data collection and analyses, the actual methodologies used to derive R0 estimates can also introduce substantial discrepancies.
A crucial component of the reproduction number for a vector-borne disease is the mean time that an infected vector is able to transmit to a host, or the infectious vector-to-host transmission period (VHTP) [23]. As infectious vectors are assumed to continue to transmit the disease until death, the VHTP is determined both by the life expectancy of the vector and the extrinsic incubation period of the pathogen. For mathematical simplicity, most epidemiological models of vector-borne diseases assume that vectors have a constant (daily) mortality rate. However, this assumption is in stark contrast to findings from lab-based and field mark-and-recapture studies. For example, survival probabilities of the dengue mosquito vector Aedes aegypti and the principal malaria vectors Anopheles stephensi and An. gambia have been shown to be strongly age-dependent [24–27]. Although it should be clear that current lab and field-based studies of vector survivorship come with their own set of limitations and uncertainties, constant, i.e. age-independent mortality rates are biologically less likely than assuming a general decrease in the survival probability with age.
Previous work has looked into the effects of logistic mortality rates on the vectorial capacity, the mosquito-related components of R0 [28]. However, the effects of assuming constant vector mortality on R0 in a system where death rates are strongly age-dependent have not yet been explored. Here we compared a commonly used R0 formula based on continuous-time differential equation model using constant mortality rates to an R0 estimate derived from first principles under relaxed assumptions about vector mortality. Using a stochastic, individual-based simulation model (IBM), which permits the direct measurement of the average number of secondary cases, we demonstrate how the underlying assumptions of vector survivorship can significantly inflate R0 estimates. We further show how estimates based on endemic equilibria are generally more robust and derive a correction factor to ameliorate R0-inflations in estimation methods based on epidemic growth curves.
Methods
Model frameworks
We derived R0 estimates from two different epidemiological frameworks: (i) a simple, single-strain vector-borne disease model based on ordinary differential equations (ODE), where vector mortality is assumed to be constant, leading to an exponentially distributed vector survivorship, and (ii) a stochastic individual-based model (IBM), which permits more explicit control over the demographic processes regulating birth and death rates.
ODE model
The classical ODE approach to model infectious diseases is obtained by dividing the population into those that are susceptible (S), exposed but not yet infectious (E), infectious (I) and recovered (R). The same principle is then applied to extend these models to vector-transmitted diseases, except for the fact that vectors usually do not recover from infection but are instead assumed to remain infectious until death. This model can be realised by the following set of differential equations
(1) |
(2) |
(3) |
(4) |
(5) |
(6) |
(7) |
Here, 1/μH,V are the mean host and vector life expectancies; β is the daily biting rate; pH,V are the per-bite transmission probabilities from vector to human and human to vector, respectively; 1/ϵH,V are the incubation periods in the host and vector, respectively; and 1/γ is the mean infectious period in the host. This model is illustrated by means of a flow diagram in Fig 1A.
Stochastic IBM
We employed a spatially-explicit IBM (motivated by the one previously proposed by [29]), which, similarly to the ODE approach, assumes that individual hosts are either susceptible (SH), exposed (EH), infectious (IH), or recovered (RH). Individual vectors are equally set to be either susceptible (SV), exposed (EV) or infectious (IV), in which state they remain until death. Vectors are assumed to bite at a constant per day rate β. Infectious vectors transmit the infection to a host with probability pH, and susceptible vectors become infected upon biting an infectious host with probability pV. For simplicity we fixed the extrinsic incubation period (1/ϵV days), the intrinsic incubation period (1/ϵH days) as well as the length of infections in the host (1/γ days).
In contrast to the ODE model, we assumed age-dependent mortalities for both the hosts and the vectors, governed a Weibull distribution:
(8) |
where ci and di are the scale and shape parameters, with i ∈ {H, V} denoting the parameters for host and vectors, respectively. Setting cV = 1 results in an exponential vector age distribution (Fig 1B), i.e. where a vector’s risk of death is age-independent equivalent to the above ODE formulation. Defining cV > 1 results in a sigmoid age profile (Fig 1D) with the vector death rate being age-dependent. This allowed us to investigate the effect of different vector age distributions on the reproduction number R0 within this framework.
The host and vector populations were divided into a set of communities organized into a lattice. The distribution of humans and vectors was uniform across all communities, and it was assumed that individuals mix homogeneously within each community. Human movement was incorporated by allowing human individuals to temporarily visit any sub-population in the lattice with probability ω. The remaining proportion of transmission events from each community, 1 − ω, were dispersed to surrounding local sub-populations, thereby modelling the movement of vectors. Please refer to S1 Text for a more detailed description of the individual-based model.
Estimation of R0
ODE-based reproduction number
We derived the R0 estimates from the ODE model by applying the next generation approach [30], which relates the number of newly infected individuals in the compartments in consecutive generations to one another (see S1 Text for details), yielding
(9) |
where M is the vector:host ratio (NV: NH) and and are the probabilities of hosts and vectors surviving the intrinsic and extrinsic incubation period of the pathogen respectively.
For many vector-borne disease systems, such as malaria and dengue, both the human recovery rate, 1/γ, and the intrinsic incubation period 1/ϵH, are much shorter than the mean human life expectancy, 1/μH. We can therefore make the following approximation of the above formula, which, here, we define as .
(10) |
IBM-based reproduction number
The basic reproduction number for the individual-based model, , was derived from first principles using the transmission cycle of the pathogen, similar to [18] (Fig 1C). Starting with an infected host in an entirely susceptible population, this individual will infect on average MpV β vectors per day and will remain infected for 1/γ days. Therefore, a single infected host is expected to infect a total of MpV β/γ vectors.
A single infected vector will infect on average pHβ hosts per day (in a totally susceptible population). As vectors remain infectious for the rest of their lives, the infectious period is defined as the difference between the mean life expectancy, , and the mean age at which a vector becomes infectious, (see S1 Text for details), meaning that an infectious vector will have days to infect hosts.
Furthermore, the proportion of infected vectors that survive the extrinsic incubation period, denoted by , also depends on the vector mortality risk (see S1 Text for details). Combining all these terms, the basic reproduction number of the individual based model can be derived as
(11) |
Note, the first term is identical to the first term of . However, the second term, which denotes the infectious period of the vector, and the third term, which denotes the probability of vectors surviving the incubation period, differ between and . This is because the formula for the reproduction number derived from the transmission cycle in the individual based model takes into account alternative (Weibull distributed) vector mortality risks, whereas the ODE system assumes a constant mortality rate (an exponential distributed mortality risk).
Timeseries-based reproduction number
In addition to the direct R0 formulas derived above, we also considered two common methods to estimate the reproduction number from timeseries data: one approach based on the initial growth rate of an epidemic outbreak (S1A Fig), and one based on the dynamic equilibrium of an endemic scenario (S1B Fig). Both epidemic and endemic cases were simulated using the individual based model.
Epidemic growth rate. The epidemic outbreak method for estimating the reproduction number requires timeseries data for the introduction of the disease into a completely susceptible population [31]. The initial (exponential) growth rate, λ, was obtained by fitting a Poisson generalised linear model to the initial outbreak data (S1A Fig). Using the classical SEIR-SEI system of ordinary differential equations for a vector-borne disease, the formula for the basic reproduction number, , can be derived as
(12) |
where 1/μV is the mean life expectancy of the vector (see S1 Text for derivation).
Endemic equilibrium approach. The asymptotically stable steady state of susceptible individuals in an ODE-based SEIR system for a directly transmitted disease can be used to estimate the basic reproduction number [10] as
(13) |
where S* is the number of susceptible individuals at equilibrium. The directly transmitted disease R0 estimate was then used as an approximation for the basic reproduction number of an endemic vector-borne disease. For the stochastic IBM this required the system to reach a dynamic equilibrium, where the proportion of susceptibles oscillates around the deterministic equilibrium (as the inherent stochasticities prevent the system from reaching an equilibrium state). We calculated using susceptibility levels at a single time point, as well as the mean proportion of susceptible individuals over the final five years of the timeseries (S1B Fig).
Parameter values
Table 1 provides an overview of the parameters and parameter values used throughout this work (unless stated otherwise). The values were chosen to reflect the epidemiological dynamics of an arboviral disease, such as dengue or Zika. However, the results presented here are qualitatively independent of the particular choice of parameters; S2 Fig show the results of model sensitivity analyses with respect to the dependency of R0 estimates on particular parameter values.
Table 1. The default set of parameter values used in the simulation of the individual-based model describing the spread of a vector-borne disease.
Parameter | Description | Value [range†] |
---|---|---|
|C| | Number of communities in lattice | 400 [1, 16384] |
NH | Host population size | 100000 |
M | Vector to host ratio | 1.2 |
cH | Host mortality shape parameter | 6 |
dH | Host mortality scale parameter | 75 × 365 days |
cV | Vector mortality shape parameter | 4 [1–4] |
dV | Vector mortality scale parameter | 23 days [10, 40] |
1/μH | Mean human life expectancy in ODE | 70 years |
1/μV | Mean vector life expectancy in ODE | 21.3 days |
1/γ | Host recovery time | 4 days |
1/ϵH | Intrinsic incubation period | 6 days |
1/ϵV | Extrinsic incubation period | 5 days |
β | Per day contact rate | 0.6 days−1 |
pH | Pathogen transmission success to host | 0.5 |
pV | Pathogen transmission success to vector | 0.5 |
σ | Local disease dispersal kernel standard deviation | 2 |
ω | Long distance transmission probability | 10−4 |
ι | External infection rate per 100,000 hosts per day | 10−2 [10−5, 1] |
† range considered for sensitivity analyses.
Results
A multitude of the mathematical models put forward to study the dynamics of vector-borne diseases are based on compartmental models described by systems of ordinary differential equations (ODE). Crucial to these types of models is the assumption of constant death rates. As vectors are assumed to remain infectious for life, such assumptions influence not only the resulting dynamics but also the estimates of the disease’s basic reproduction number R0 (and relatedly the (time-varying) effective reproduction number Re(t)). Here we aimed to quantify the effects of relaxing the assumption of constant vector death rate on R0 estimates within the same theoretical setting. This was done by comparing the R0 values derived from an SEIR-SEI system of ODEs with a formula derived from first principles using the transmission cycle of a generic vector-borne disease using different assumptions about vector mortality rates (see Methods). We then verified these estimates by means of a stochastic individual-based model, which allowed us to directly measure R0 from running repeat simulations of introducing an infected individuals into a fully susceptible population. The same model was also used to derive R0 estimates from simulated timeseries data.
Constant vector mortality rates over-estimate R0
Assuming constant vector mortality rates leads to exponentially distributed age profiles (Fig 1B), which permit some vectors to live in excess of four times their mean life expectancy and potentially to transmit the pathogen for an unusually long period of time. Even more concerning is that the vector life expectancy in each compartment of the infection process is essentially the same (Fig 1A). That is, independent of when in its life a vector becomes infected and infectious, its remaining life expectancy remains exponentially distributed around the mean life expectancy. As a consequence, all infectious vectors have a vector-to-human transmission period (VHTP) equal to the mean life expectancy of all vectors, 1/μV, with obvious consequences for R0 estimates.
In contrast to ODE models, individual-based models (IBM) permit much greater control over vector mortality rates. Here we used (Weibull distributed) age-dependent vector death rates (see Methods), which yield a range of sigmoid age profiles (Fig 1D) but which all prevent vectors from living severely extended lives. More importantly, an individual vector’s remaining life expectancy remains unchanged when transitioning between susceptible and infected state or between infected and infectious state, resulting in shorter and more realistic infectious periods.
We demonstrate the effect of assuming different vector mortality rates by comparing the R0 estimates derived from the ODE model to the individual-based model (see Methods). As expected, using parameters as listed in Table 1 we find that the reproduction numbers from the ODE and IBM systems are similar under the assumption of constant vector mortality rates (Table 2). The small discrepancy between the two models is due to the ODE model’s assumption of an exponentially distributed extrinsic incubation period, whereas the IBM assumes this to be a fixed length of time. Using the IBM approach to track individual mosquitoes and infection events we also find that under this assumption the mean age at which vectors become infected is 20 days and infectious at an age of 25 days, i.e. days beyond their average life-expectancy. Furthermore, those vectors that have become infectious live for an average of 46 days, which means that their infectious period is 21 days (equal to the life expectancy of all vectors). This clearly highlights the discrepancy between model outputs based on constant mortality rates and biological reality. In contrast, assuming age-dependent mortality rates (Weibull shape parameter, cV = 4) results in biologically more reasonable infectious periods of 11 days (Table 2) and R0 estimates that are less than half of those based on a model with constant mortality.
Table 2. Comparison of the reproduction numbers derived from a system of ordinary differential equations and the individual-based model under the assumption of constant versus age-dependent mortality rates.
Model | Vector death rate | Life expectancy | Infectious period | R0 |
---|---|---|---|---|
ODE | Constant | 21 days | 21 days | 7.5 |
IBM | Constant | 21 days | 21 days | 7.2 |
IBM | Age-dependent | 21 days | 11 days | 3.2 |
To further demonstrate the dependency of R0 on different distributions of mosquito survivorship, we changed the Weibull distribution of vector mortality to transition smoothly between an exponential (cV = 1) and a sigmoid (cV > 1) age profiles and by keeping the average life expectancy constant. As illustrated in Fig 2, relaxing the assumption of constant mortality and resultant exponential age profile shortens the average infectious period and lowers the reproduction number as derived from the transmission cycle of the pathogen, i.e. . This clearly demonstrate that as well as the vector life expectancy, the actual shape of the survival curve strongly determines the estimated values of a pathogen’s reproduction number.
Comparing R0 estimates through direct measurement
The scenario defined by the reproduction number, whereby a single infectious case enters an entirely susceptible population, is arguably unrealistic for most diseases. Furthermore, disease transmission is an inherently stochastic process, such that each realisation of a disease introduction event is likely to take a different course. We should therefore expect that R0 estimates derived from such introductory events should come with a certain degree of variation. In order to better understand the variability of the expected number of secondary cases and then to directly compare the above formula-based R0 estimates, we simulated disease introduction events into a completely susceptible population using our IBM framework and kept track of all secondary host infections resulting from the index case.
As before we compared the two different assumptions regarding vector life expectancy: constant vs. age-dependent mortality rates. As shown in Fig 3, there is a wide distribution in the number of secondary infections, particularly when we assumed constant vector death rates (Fig 3A and 3B). In that case it was not unusual to observe 40-60 secondary infections, due to the aforementioned unrealistically high life-expectancies for some of the vectors, permitting the accumulation of secondary cases well after the primary human case has recovered (Fig 3A). The mean number of secondary infection (i.e. R0) across 500 model simulations was around 7, more than twice that of the model which assumed age-dependent mortalities. In the latter case we observed secondary infections in the range of 0 to 18 (due to the model’s stochastic nature where some vectors may be infected for their entire life) and with a mean of around 3.2 (Fig 3C and 3D), in line with theoretical expectations. Please refer to S3 Fig for sensitivity on model parameters on the direct measurement of mean secondary infections from the IBM.
An interesting observation is that under both assumptions of vector mortality, over 30% of our simulations resulted in zero secondary infections, as either the single primary case did not infect any vectors, the infected vectors failed to survive the extrinsic incubation period, or the infectious vectors failed to transmit the pathogen. Shorter infectious periods for both the host and the vector, a longer extrinsic incubation period, and lower transmissibility naturally decrease the overall likelihood of transmission from primary to secondary cases. Therefore, the proportion of failed outbreaks crucially depends on all these factors (S4 Fig).
Initial growth rate methods can lead to over-estimation of R0
In most cases, only successful disease introductions that lead to epidemic outbreaks are observed. These outbreaks can then be used to estimate the reproduction number based on the initial epidemic growth rate λ (see Methods). Formulas to calculate R0 from λ are usually based on ODE modelling frameworks assuming constant vector death rates. To investigate the effect of this assumption on estimating a disease’s basic reproduction number from epidemic growth rates, , we used our IBM framework to generate 100 epidemic outbreaks (discounting failed introductory events) under identical initial conditions for both constant and age-dependent mortality rates.
As illustrated in Fig 4A, estimating the reproduction number from initial outbreak data is fairly reliable as long as the empirical age profiles of the mosquitoes match the one assumed in the model. That is, if mosquito mortality was indeed independent of age, leading to exponentially distributed age profiles, then can provide good estimates of the real reproduction number. However, if the risk of dying does increase with age, then , as derived form the ODE framework, is once again significantly over-inflated. Likewise, assuming age-dependent death rates when mortality is in fact constant, this could lead to an underestimation of the true reproduction number; note, however, that the latter scenario is arguably less relevant in biological terms.
Noticeable in all situations is the considerable variance in . This is due to the stochastic nature of our spatial IBM framework, which to a certain extent should also reflect the natural stochasticities underlying real vector-borne disease systems. Changing the model’s spatial and demographic set-up will obviously affect the variance reported here; however, the results, related to the mean values, are to be understood as independent of the model’s underlying structure.
Correction for initial growth rate methods
As shown in Fig 4, using the initial epidemic growth rate is only appropriate when empirical vector mortality is indeed age-independent, whereas it can lead to significant over-estimations otherwise. In order to compensate for this and include age-dependent vector mortality rates into the ODE-derived formula for , we replaced this critical term by the vector to host transmission period (VHTP), denoted by , calculated directly from an assumed vector age profile (see Methods), which yields the corrected estimate
(14) |
where μV is the constant vector mortality rate in the classical system of ordinary differential equations. Crucially, a vector age profile has to be assumed explicitly to calculate the VHTP. And as before, if the assumed profile in matches the simulations’ profile, we find that the derived reproduction numbers are good estimates of the actual ones, with the same variance as before (Fig 4).
Endemic equilibrium can provide robust estimates of R0
Finally, we sought to estimate R0 from the dynamic equilibrium distribution of susceptibles in the human population (see Methods). Crucially, this approach does not require any a priori knowledge of mosquito survivorship and should therefore provide more robust estimates regardless of the underlying assumptions regarding vector mortality rates. Indeed, and as demonstrated in Fig 5, using the endemic state can provide reasonable estimates of a disease’s true (i.e. theoretical) R0 value, even though the formula itself was derived from a directly transmittable disease, which might explain why slightly underestimates R0.
As before we find a significant degree of variation around the mean estimates, due to the stochastic nature of disease transmission. This can somewhat be reduced by taken longer term averages (compare single time point estimates with 10 year average in Fig 5), which in reality will be limited due to data availability. Equally, the model and population structure itself, including as population size, importation rates, spatial structuring and mixing, all affect the stability of the dynamic equilibrium and with it the variance and hence robustness of (see S2 Fig). Although this method is only applicable for diseases that have reached at least a semi-endemic state, its parameter and assumption-free approach means that it should be considered as one of the most robust ways to estimate a disease’s reproduction number.
Discussion
Mathematical models describing the population dynamics of an infectious disease provide the necessary frameworks by which we can calculate an infectious disease’s reproduction number, R0, based on specific parameters related to infection and transmission probabilities. One of the most important factors influencing R0 is the length at which an individual remains infectious. For vector-transmitted diseases this places huge significance on vector mortality rates as vectors usually do not clear an infection and instead remain infectious for life. Many formulas to estimate R0 are based on systems of ordinary differential equations (ODEs), which commonly assume that vector mortality is constant, i.e. independent of age. As we have demonstrated here, the resulting exponential distribution and the effective resetting of life expectancies as individuals transition through the infection stages permit some vectors to live for an extraordinary length of time. As a result, vectors are potentially able to transmit the disease multiple times that of what should biologically be possible, leading to significantly inflated R0 estimates.
In comparison to ODE models, individual-based models (IBMs) provide much greater control over the dynamics that govern both demography and disease transmission. Here we used an individual-based modelling approach to elucidate the influence of vector mortality on R0 estimates and to highlight the discrepancy between model predictions based on constant vs. age-dependent mortality. Because individual infection events can easily be tracked within an IBM, the basic reproduction number can essentially be measured simply by counting the number of secondary infections arising from a single index case. This in turn not only allowed us to compare different formulas for estimating R0 but also provided us with a better understanding of the degree of uncertainty surrounding these estimates.
As demonstrated here, the assumption of constant vector death rates can lead to significant over-estimation of R0. Importantly, it is not so much that the formulas commonly used to estimate R0 are inherently wrong but rather that the underlying assumption of the models from which they are derived are not necessarily aligned with biological reality. We found that one of the most robust methods to estimate a pathogen’s R0 is based on the proportion of susceptible individuals at endemic equilibrium, as this is entirely parameter free and does not require any assumption about vector death rates. Unfortunately, this only works for diseases that are well established in a population, and its reliability is strongly dependent on the stochasticity of the underlying endemic equilibrium, i.e. the (multi-annual) variations around the mean. For emerging diseases this is obviously not practical and estimation methods in those cases usually make use of epidemic growth curves instead. However, these also implicitly assume exponential vector age profiles and are therefore subject to inflation. In order to account for this we have here derived a correction factor that can be applied to classical R0 estimation formulas and which adjusts for most of the discrepancy between the vector-to-human transmission period (VHTP) of the biological system and the assumed system with constant vector mortality.
In this work we made use of an individual-based modelling framework to test the effect of non-exponential vector age distributions on R0. Alternative methods that allow for the (partial) relaxation of the assumption regarding constant mortality or vector senescence have also been proposed, including lumped-age class models [32] or systems of partial differential equations [33]. However, these methods can still suffer from the same issues as simpler ODE models, where transition rates between life and infection stages are usually exponentially distributed and where information about individual ages is lost at every transition stage. The ease at which different distributions that govern host and vector mortality, infection recovery and other epidemiological factors can be incorporated, make IBM frameworks the natural choice to examine the influence of vector mortality or other such factors on R0 estimations. Here we only concentrated on the effect of vector mortality, whereas similar arguments are equally valid for the distribution underlying the extrinsic incubation period [34], for example. Nevertheless, our work strongly suggests that vector mortality rates, or rather our assumptions about the age-dependency of survivorship, are the predominant factors, as our correction term for R0 estimates based on epidemic growth essentially recovers the true value.
Another important observation from this study was that when simulating the spread of a disease from a single infected individual and then calculating R0 based on the number of secondary infections, the stochastic nature of such events resulted in very wide distributions in R0. Although the assumption of age-dependent mortality rates generally prevented extremely high values of secondary cases, and therefore R0, the variance was still in the region of twice the mean and included a significant proportion of zero cases. That is, in around a third of the simulations we observed no secondary case at all despite starting off with the same initial conditions. This then begs the question whether these events should be counted towards the estimated R0 or not, as in reality we never observe such failed introductions. Comparing the expected with the observed R0 value would suggest that zero cases should be counted, which on the other hand implies that even high values of the reproduction number are by no means a guarantee that an outbreak should ensue if a disease gets introduced in a fully susceptible population (sufficient conditions to prevent stochastic fade-out at the start of an epidemic have been previously discussed [35]). The high variation also suggests that control strategies based on R0 estimates generated from initial growth rates should be treated with caution and that estimations based on one particular setting might not be adequate to generalize and predict pathogen behaviour across all other spatial contexts [36].
We here concentrated solely on the basic reproduction number, which describes an arguably unusual and often artificial situation. However, it should be clear that the same arguments also hold for the effective reproduction number, Re, which is essentially R0 multiplied by the fraction of the population that is susceptible to a disease, as well as their time-dependent counterparts R(t) and Re(t). Furthermore, the serial and generation intervals, which can be understood as temporal analogues of the reproduction number, also rely on the vector to host transmission period and are usually assumed to be exponentially distributed [37, 38]. This implies that these intervals, and alternative R0 estimation methods that depend upon them [39–41], may equally be over-estimated.
Our work thus reiterates the importance of obtaining empirical vector mortality rates in the field. The original Ross-MacDonald model for the spread of Plasmodium falciparum and P. vivax malaria assumed constant vector mortality as laboratory and field studies seemed to suggest that death rates were age independent [42]. However, re-analysis of laboratory data showed that mosquito mortality is in fact age-dependent for several Anopholes species [43]. More recent studies also confirmed that mosquito mortality is dependent on age for Anopheles mosquitoes [27] and Aedes aegypti [24, 25]. What is clear is that more work needs to be done to fully elucidate realistic, i.e. field-relevant vector mortality rates, perhaps with more accurate spectroscopic methods [44], as well as their environmental drivers. That is, seasonal variations in temperature and rainfall have been shown to affect the birth and death rate of vectors [45–48], the vectorial competence [49] as well as the extrinsic incubation period [50–53]. It has also been emphasized that other spatio-temporal heterogeneities, such as community structures and host and vector movement, should be considered when assessing R0 [54, 55]. All this needs to be factored in if we are to develop better models to understand the epidemiological and ecological determinants of vector-borne diseases, guide outbreak prevention strategies or monitor ongoing intervention measures.
Supporting information
Data Availability
All relevant data are within the paper and its Supporting Information files.
Funding Statement
This work was supported by an EPSRC DTP studentship to WT. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Gubler DJ. Vector-borne diseases. Revue scientifique et technique (International Office of Epizootics). 2009;28(2):583–8. [DOI] [PubMed] [Google Scholar]
- 2.Trampuz A, Jereb M, Muzlovic I, Prabhu RM. Clinical review: Severe malaria. Critical care (London, England). 2003;7(4):315–323. 10.1186/cc2183 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Noor AM, Kinyoki DK, Mundia CW, Kabaria CW, Mutua JW, Alegana VA, et al. The changing risk of Plasmodium falciparum malaria infection in Africa: 2000-10: A spatial and temporal analysis of transmission intensity. The Lancet. 2014;383(9930):1739–1747. 10.1016/S0140-6736(13)62566-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Guzman MG, Halstead SB, Artsob H, Buchy P, Farrar J, Gubler DJ, et al. Dengue: a continuing global threat. Nature Reviews Microbiology. 2010;8(12):S7–S16. 10.1038/nrmicro2460 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Fauci AS, Morens DM. Zika Virus in the Americas—Yet Another Arbovirus Threat. New England Journal of Medicine. 2016;374(7):601–604. 10.1056/NEJMp1600297 [DOI] [PubMed] [Google Scholar]
- 6.World Health Organization. Plague—Madagascar; 2017. Available from: http://www.who.int/csr/don/02-november-2017-plague-madagascar/en/.
- 7.Ross R. The Preventation of malaria. London: John Murray; 1911. [Google Scholar]
- 8.Heesterbeek JAP, Dietz K. The concept of R0 in epidemic theory. Statistica Neerlandica. 1996;50(1):89–110. 10.1111/j.1467-9574.1996.tb01482.x [DOI] [Google Scholar]
- 9.Reiner RC, Perkins TA, Barker CM, Niu T, Chaves LF, Ellis AM, et al. A systematic review of mathematical models of mosquito-borne pathogen transmission: 1970-2010. Journal of the Royal Society, Interface. 2013;10(81):20120921 10.1098/rsif.2012.0921 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Dietz K. The estimation of the basic reproduction number for infectious diseases. Statistical Methods in Medical Research. 1993;2(1):23–41. 10.1177/096228029300200103 [DOI] [PubMed] [Google Scholar]
- 11.Cairns AJ. Model fitting and projection of the AIDS epidemic. Mathematical biosciences. 1991;107(2):451–89. 10.1016/0025-5564(91)90019-F [DOI] [PubMed] [Google Scholar]
- 12.Chowell G, Hengartner NW, Castillo-Chavez C, Fenimore PW, Hyman JM. The basic reproductive number of Ebola and the effects of public health measures: The cases of Congo and Uganda. Journal of Theoretical Biology. 2004;229(1):119–126. 10.1016/j.jtbi.2004.03.006 [DOI] [PubMed] [Google Scholar]
- 13.Lipsitch M, Cohen T, Cooper B, Robins JM, Ma S, James L, et al. Transmission dynamics and control of severe acute respiratory syndrome. Science (New York, NY). 2003;300(5627):1966–70. 10.1126/science.1086616 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Breban R, Vardavas R, Blower S. Theory versus data: How to calculate R0? PLoS ONE. 2007;2(3):0–3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Massad E, Coutinho Fa, Burattini MN, Lopez LF. The risk of yellow fever in a dengue-infested area. Transactions of the Royal Society of Tropical Medicine and Hygiene. 1996;95(4):370–374. 10.1016/S0035-9203(01)90184-1 [DOI] [PubMed] [Google Scholar]
- 16.Chowell G, Diaz-Dueñas P, Miller JC, Alcazar-Velazco A, Hyman JM, Fenimore PW, et al. Estimation of the reproduction number of dengue fever from spatial epidemic data. Mathematical Biosciences. 2007;208(2):571–589. 10.1016/j.mbs.2006.11.011 [DOI] [PubMed] [Google Scholar]
- 17.Towers S, Brauer F, Castillo-Chavez C, Falconar AKI, Mubayi A, Romero-Vivas CME. Estimate of the reproduction number of the 2015 Zika virus outbreak in Barranquilla, Colombia, and estimation of the relative role of sexual transmission. Epidemics. 2016;17:50–55. 10.1016/j.epidem.2016.10.003 [DOI] [PubMed] [Google Scholar]
- 18.Smith DL, McKenzie FE, Snow RW, Hay SI. Revisiting the basic reproductive number for malaria and its implications for malaria control. PLoS Biology. 2007;5(3):0531–0542. 10.1371/journal.pbio.0050042 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Bousema T, Griffin JT, Sauerwein RW, Smith DL, Churcher TS, Takken W, et al. Hitting hotspots: Spatial targeting of malaria for control and elimination. PLoS Medicine. 2012;9(1):1–7. 10.1371/journal.pmed.1001165 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Favier C, Degallier N, Rosa-Freitas MG, Boulanger JP, Costa Lima JR, Luitgards-Moura JF, et al. Early determination of the reproductive number for vector-borne diseases: The case of dengue in Brazil. Tropical Medicine and International Health. 2006;11(3):332–340. 10.1111/j.1365-3156.2006.01560.x [DOI] [PubMed] [Google Scholar]
- 21.Kucharski AJ, Funk S, Eggo RM, Mallet HP, Edmunds WJ, Nilles EJ. Transmission Dynamics of Zika Virus in Island Populations: A Modelling Analysis of the 2013-14 French Polynesia Outbreak. PLoS neglected tropical diseases. 2016;10(5):e0004726 10.1371/journal.pntd.0004726 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Villela DAM, Bastos LS, DE Carvalho LM, Cruz OG, Gomes MFC, Durovni B, et al. Zika in Rio de Janeiro: Assessment of basic reproduction number and comparison with dengue outbreaks. Epidemiology and infection. 2017;145(8):1649–1657. 10.1017/S0950268817000358 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Mendes Luz P, Torres Codeço C, Massad E, Struchiner CJ. Uncertainties Regarding Dengue Modeling in Rio de Janeiro, Brazil. Memorias do Instituto Oswaldo Cruz. 2003;98(7):871–878. [PubMed] [Google Scholar]
- 24.Styer LM, Carey JR, Wang JL, Scott TW. Mosquitoes do senesce: Departure from the paradigm of constant mortality. American Journal of Tropical Medicine and Hygiene. 2007;76(1):111–117. 10.4269/ajtmh.2007.76.111 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Harrington LC, Jones JJ, Kitthawee S, Sithiprasasna R, Edman JD, Scott TW. Age-Dependent Survival of the Dengue Vector Aedes aegypti (Diptera: Culicidae) Demonstrated by Simultaneous Release-Recapture of Different Age Cohorts. Journal of Medical Entomology. 2008;45(2):307–313. [DOI] [PubMed] [Google Scholar]
- 26.Hugo LE, Jeffery JAL, Trewin BJ, Wockner LF, Thi Yen N, Le NH, et al. Adult Survivorship of the Dengue Mosquito Aedes aegypti Varies Seasonally in Central Vietnam. PLoS Neglected Tropical Diseases. 2014;8(2). 10.1371/journal.pntd.0002669 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Dawes EJ, Churcher TS, Zhuang S, Sinden RE, Basáñez MG. Anopheles mortality is both age- and Plasmodium-density dependent: implications for malaria transmission. Malaria journal. 2009;8:228 10.1186/1475-2875-8-228 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Bellan SE. The importance of age dependent mortality and the extrinsic incubation period in models of mosquito-borne disease transmission and control. PLoS ONE. 2010;5(4). 10.1371/journal.pone.0010165 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Lourenço J, Recker M. Natural, Persistent Oscillations in a Spatial Multi-Strain Disease System with Application to Dengue. PLoS Computational Biology. 2013;9(10). 10.1371/journal.pcbi.1003308 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Diekmann O, Heesterbeek JAP, Roberts MG. The construction of next-generation matrices for compartmental epidemic models. Journal of The Royal Society Interface. 2010;7(47):873–885. 10.1098/rsif.2009.0386 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Ribeiro RM, Qin L, Chavez LL, Li D, Self SG, Perelson AS. Estimation of the initial viral growth rate and basic reproductive number during acute HIV-1 infection. Journal of virology. 2010;84(12):6096–6102. 10.1128/JVI.00127-10 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Hancock PA, Godfray HCJ. Application of the lumped age-class technique to studying the dynamics of malaria-mosquito-human interactions. Malaria Journal. 2007;6 10.1186/1475-2875-6-98 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Rock KS, Wood DA, Keeling MJ. Age- and bite-structured models for vector-borne diseases. Epidemics. 2015;12:20–29. 10.1016/j.epidem.2015.02.006 [DOI] [PubMed] [Google Scholar]
- 34.Brand SPC, Rock KS, Keeling MJ. The Interaction between Vector Life History and Short Vector Life in Vector-Borne Disease Transmission and Control. PLoS Computational Biology. 2016;12(4):1–21. 10.1371/journal.pcbi.1004837 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Hartfield M, Alizon S. Introducing the Outbreak Threshold in Epidemiology. PLoS Pathogens. 2013;9(6):9–12. 10.1371/journal.ppat.1003277 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Smith DL, Perkins TA, Reiner RC, Barker CM, Niu T, Chaves LF, et al. Recasting the theory of mosquito-borne pathogen transmission dynamics and control. Transactions of the Royal Society of Tropical Medicine and Hygiene. 2014;108(4):185–97. 10.1093/trstmh/tru026 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Siraj AS, Oidtman RJ, Huber JH, Kraemer MUG, Brady J, Johansson MA, et al. Temperature modulates dengue virus epidemic growth rates through its effects on reproduction numbers and generation intervals. PLoS Neglected Tropical Diseases. 2017; p. 1–19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Huber JH, Johnston GL, Greenhouse B, Smith DL, Perkins TA. Quantitative, model-based estimates of variability in the generation and serial intervals of Plasmodium falciparum malaria. Malaria Journal. 2016;15(1):490 10.1186/s12936-016-1537-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Cori A, Ferguson NM, Fraser C, Cauchemez S. A new framework and software to estimate time-varying reproduction numbers during epidemics. American Journal of Epidemiology. 2013;178(9):1505–1512. 10.1093/aje/kwt133 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Reiner RC, Le Menach A, Kunene S, Ntshalintshali N, Hsiang MS, Perkins TA, et al. Mapping residual transmission for malaria elimination. eLife. 2015;4(DECEMBER2015):1955–1969. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Perkins TA, Rodriguez-Barraquer I, Manore C, Siraj AS, España G, Barker CM, et al. What lies beneath: a spatial mosaic of Zika virus transmission in the 2015-2016 epidemic in Colombia. Epidemics. 2018. [Google Scholar]
- 42.MacDonald G. The analysis of the sporozoite rate. Tropical diseases bulletin. 1952;49(6):569–86. [PubMed] [Google Scholar]
- 43.Clements AN, Paterson GD. The Analysis of Mortality and Survival Rates in Wild Populations of Mosquitoes. The Journal of Applied Ecology. 1981;18(2):373. [Google Scholar]
- 44.Lambert B, Sikulu-Lord MT, Mayagaya VS, Devine G, Dowell F, Churcher TS. Monitoring the Age of Mosquito Populations Using Near-Infrared Spectroscopy. Scientific Reports. 2018;8(1):1–9. 10.1038/s41598-018-22712-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Alto BW, Juliano SA. Precipitation and temperature effects on populations of Aedes albopictus (Diptera: Culicidae): Implications for range expansion. Journal of Medical Entomology. 2001;38(5):646–656. 10.1603/0022-2585-38.5.646 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Brady OJ, Johansson Ma, Guerra Ca, Bhatt S, Golding N, Pigott DM, et al. Modelling adult Aedes aegypti and Aedes albopictus survival at different temperatures in laboratory and field settings. Parasites & vectors. 2013;6:351 10.1186/1756-3305-6-351 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Valdez LD, Sibona GJ, Diaz LA, Contigiani MS, Condat CA. Effects of rainfall on Culex mosquito population dynamics. Journal of Theoretical Biology. 2017;421:28–38. 10.1016/j.jtbi.2017.03.024 [DOI] [PubMed] [Google Scholar]
- 48.Yang HM, Macoris MLG, Galvani KC, Andrighetti MTM, Wanderley DMV. Assessing the effects of temperature on the population of Aedes aegypti, the vector of dengue. Epidemiology and Infection. 2009;137(8):1188–1202. 10.1017/S0950268809002040 [DOI] [PubMed] [Google Scholar]
- 49.Lambrechts L, Paaijmans KP, Fansiri T, Carrington LB, Kramer LD, Thomas MB, et al. Impact of daily temperature fl uctuations on dengue virus transmission by Aedes aegypti. Proceedings of the National Academy of Sciences of the United States of America. 2011;108(18):1–6. 10.1073/pnas.1101377108 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Beck-Johnson LM, Nelson WA, Paaijmans KP, Read AF, Thomas MB, Bjørnstad ON. The effect of temperature on Anopheles mosquito population dynamics and the potential for malaria transmission. PLoS ONE. 2013;8(11). 10.1371/journal.pone.0079276 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Brady OJ, Golding N, Pigott DM, Kraemer MUG, Messina JP, Reiner RC, et al. Global temperature constraints on Aedes aegypti and Ae. albopictus persistence and competence for dengue virus transmission. Parasites & vectors. 2014;7(1):338 10.1186/1756-3305-7-338 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Chan M, Johansson MA. The Incubation Periods of Dengue Viruses. PLoS ONE. 2012;7(11):1–7. 10.1371/journal.pone.0050972 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Focks DA, Daniels E, Haile DG, Keesling JE. A simulation model of the epidemiology of urban dengue fever: Literature analysis, model development, preliminary validation, and samples of simulation results. American Journal of Tropical Medicine and Hygiene. 1995;53(5):489–506. 10.4269/ajtmh.1995.53.489 [DOI] [PubMed] [Google Scholar]
- 54.Perkins TA, Scott TW, Le Menach A, Smith DL. Heterogeneity, Mixing, and the Spatial Scales of Mosquito-Borne Pathogen Transmission. PLoS Computational Biology. 2013;9(12). 10.1371/journal.pcbi.1003327 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Lloyd AL, Kitron U, Perkins TA, Gonzalo M. The Basic Reproductive Number for Disease Systems with Multiple Coupled Heterogeneities. 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All relevant data are within the paper and its Supporting Information files.