Skip to main content
UKPMC Funders Author Manuscripts logoLink to UKPMC Funders Author Manuscripts
. Author manuscript; available in PMC: 2011 Jul 1.
Published in final edited form as: Nature. 2005 Nov 24;438(7067):492–495. doi: 10.1038/nature04024

The entomological inoculation rate and Plasmodium falciparum infection in African children

D L Smith 1, J Dushoff 1,2, R W Snow 3,4, S I Hay 3,5
PMCID: PMC3128496  EMSID: UKMS35850  PMID: 16306991

Abstract

Malaria is an important cause of global morbidity and mortality. The fact that some people are bitten more often than others has a large effect on the relationship between risk factors and prevalence of vector-borne diseases13. Here we develop a mathematical framework that allows us to estimate the heterogeneity of infection rates from the relationship between rates of infectious bites and community prevalence. We apply this framework to a large, published data set that combines malaria measurements from more than 90 communities4. We find strong evidence that heterogeneous biting or heterogeneous susceptibility to infection are important and pervasive factors determining the prevalence of infection: 20% of people receive 80% of all infections. We also find that individual infections last about six months on average, per infectious bite, and children who clear infections are not immune to new infections. The results have important implications for public health interventions: the success of malaria control will depend heavily on whether efforts are targeted at those who are most at risk of infection.


Economic, public-health, and medical advances over the past 100 years halved the global geographical extent of malaria5. Nonetheless, population growth and failure to prevent infection or manage disease in this reduced extent means that Plasmodium falciparum remains a leading cause of global morbidity6 and mortality4,7,8. Central to the future of malaria control is more effective understanding of the relationships between risk factors, the frequency and longevity of infection, and disease outcome9. These relationships have been debated extensively in relation to disease control10,11.

An important aspect of this debate is the quantitative relationship that inevitably exists between the proportion of people who are infected with P. falciparum, (the parasite ratio, PR) and the rate at which people are bitten by infectious mosquitoes (the entomological inoculation rate, EIR). The EIR is rarely recorded in Africa4, limiting informed debate on the relationship between vector biology, transmission intensity, clinical disease and mortality risks. Conversely, the PR is a widely measured index of infection risk, allowing a more detailed investigation of the interaction between transmission intensity, age, and disease burden. Using mathematical models, we seek here to explore the factors that determine the complex, nonlinear association between EIR and PR, and use these models to highlight key features of the biology of malaria transmission that relate to future disease control.

Ross first developed a mathematical model for the relationship between EIR and PR12, but his model, as extended by Macdonald, performed poorly in the African Savannah13. Since then, several modifications to Ross’ original model have been proposed that are now considered important in malaria epidemiology. Informed by a century of theory, we have developed a large set of mathematical models, fitted them to a comprehensive data set, and selected the best of these models to describe the relationship between EIR and PR (Methods).

It was assumed by Ross that human populations were homogeneous, but in fact some people are bitten by mosquitoes more than others because of proximity to larval habitat14, differential attractiveness to mosquitoes15, or other reasons. Moreover, some people are more susceptible to infection, per bite. Heterogeneous infection rates have important implications for the dynamics and control of malaria13, and heterogeneity fundamentally changes the relationship between EIR and PR14. Those who are infected most play a role in malaria transmission that is analogous to the role of the most sexually active in transmission of sexually transmitted diseases16. We assume that relative infection rates follow a Γ distribution, with mean 1 and variance 1/k.

Ross assumed that infections clear at a constant rate, regardless of subsequent infections. The assumption was challenged by a growing consensus that superinfection with P. falciparum would increase the time to clear an infection1720. Let ε denote the annual EIR, b the transmission efficiency (the probability that a bite by an infectious mosquito results in an infection—either a new infection or superinfection) and 1/r the expected time to clear each infection. Assuming superinfection and assuming infections clear independently, clearance occurs at the rate /(ebε/r − 1) (refs 1719); thus, the time to clearance is longer when annual EIR is higher. We also considered immunity to reinfection by comparing SIS to SIRS dynamics (Methods). The full set of candidate models included SIS and SIRS models combined with heterogeneous infection and superinfection.

We identified 119 empirical estimates of EIR matched to coincidental measures of PR in African children under 15 years of age4; these data should be regarded with circumspection, as they were collected using different methods and for other purposes. One hundred and nine pairs measured positive EIR and PR. Ninety-one of these pairs also reported the sample sizes for the PR estimate. Two other modifications to these models were considered. First, microscopy errors bias estimates of PR and affect the analysis21. As the sensitivity and specificity of microscopy is not known, they were fitted along with the other model parameters. Second, the age ranges of the children sampled differed among studies. This introduces a potential bias if estimated prevalence varied substantially with age. We fitted each model with and without age corrections and microscopy errors (Methods).

Each candidate model was fitted to the data by maximum likelihood using R (ref. 22) and compared to a log-linear model5,23. Based on these fits, the best overall model was selected using Akaike’s information criterion (AIC)24 (Table 1, Fig 1a). The best overall model was a simple function that incorporated heterogeneous infection rates, with no immunity to reinfection:

PR=1(1+bεrk)k (1)

Equation (1) fitted better than the log-linear model (another two-parameter model) and had very strong evidential support over the log-linear model, with ΔAIC of 185 (refs 5, 23). Moreover, equation (1) was selected over the log-linear model in every case when the analysis was repeated on 1,000 bootstrapped data sets.

Table 1.

The models and fitted parameters

PR ΔAIC b/r 1/k bInline graphic α β a b Name
1(1+bεrk)k 0 0.45 4.2 96% 12% 0.008 −0.0016 SI°S
0(1esεr1+bϕεesεr)g(s)ds 2 0.45 4.2 0 96% 12% 0.008 −0.016 SI°RS
bεbε+r 213 0.16 78% 16% 0.01 −0.026 SIS
1esεr1+bϕεesεr 557 0.1 0.05 76% 21% 0.012 −0.028 SI°RS
1 − e−ε/r 612 0.077 75% 21% 0.01 −0.03 SI°S
c d
c logε +d 185 0.084 0.32 0.008 −0.012 Log-linear

Models are ranked by AIC. The analysis found strong evidential support for the model ∫SI°S described by equation (1), a model with heterogeneous infection rates (the symbol ∫ denotes heterogeneous infection) and superinfection (the default assumption is constant clearance; implies superinfection). The models that lacked heterogeneous infection, SIS, SI°S, SI°RS, and the log-linear model fit poorly, as judged by AIC. (We report the ΔAIC values.) Moreover, the parameters for the model ∫SI°S are biologically meaningful, unlike those for the log-linear model. Finally, the model ∫SI°RS fitted no better than ∫SI°S, and the fitted duration of immunity to infection was zero.

Figure 1. The data and fitted functions and a sensitivity analysis for the best overall model.

Figure 1

a, The best fit for SIS (tan), SI°S (yellow), SI°RS (orange), the log-linear model (grey), and the best-overall model ∫SI°S (blue). The best-fit ∫SI°RS model was not visually different from the ∫SI°S model with heterogeneity. b, In ∫SI°S, PR is more sensitive to the variance in infection rates (that is, doubling or halving k) at high EIR (solid lines, blue and green), and more sensitive to variability in recovery (doubling or halving r) for low EIR (dashed lines, yellow and green). Models are defined in Methods.

The model has six fitted parameters, including sensitivity, specificity, age-corrections, k and b/r: b and r are exactly co-linear—the mathematical relationship between EIR and PR in these models depends only on their ratio. For the best overall model, the fitted sensitivity and specificity were 95.8% and 88.4%, respectively. The estimates were different for each model. A sensitivity analysis demonstrates that PR is much more sensitive to changes in k, which determines variance in infection rates, when annual EIR is greater than about 1, but PR is more sensitive to b/r when EIR is lower (Fig. 1b).

A simple summary of heterogeneous infection is the fraction of all infections received by the subpopulation that is infected most frequently; for 1/k = 4.2, 20% of the population receives 80% of all infections, similar to a single study from Tanzania in which 20% of the population received 80% of all bites3. This represents an average across the 91 populations sampled—the distribution of infection rates in a particular population may be more or less heterogeneous, depending on the local ecology14.

The fitted parameter b/r is the product of transmission efficiency and persistence times; alternatively, it is the expected duration of an infection, per infectious bite. If transmission efficiency were perfect, the best-fit parameter would correspond to a duration of infection of approximately 166 days; if transmission efficiency were approximately 50%, then persistence would be 11 months. These estimates are consistent with estimates of persistence from simple infections induced for malaria therapy25 and with recent studies of persistence for natural infections26,27.

Consistent with other studies28,29 and the notion that immunity to clinical illness develops after repeated infection in early childhood, we found no evidence for immunity to infection among these populations of African children, as reflected in the relationship between EIR and PR. A direct comparison of SIS and SIRS models (Methods; equation (4) versus equations (6) or (7)) demonstrated that mathematical models for malaria infection in children should be SIS and not SIRS because children do not become immune to infection after clearing a single infection, but immunity requires repeated infection or possibly some change in immune function with age.

On the other hand, the analysis did reveal a strong decline in prevalence associated with the maximum age of the sample population. The lower bound for age was associated with a 0.8% increase in prevalence for each year of age, and the upper bound was associated with a 1.6% decrease in prevalence. On closer scrutiny, most of the effect was due to 16 studies for which the maximum age was larger than 12. One likely explanation is that these studies included many children who had become sufficiently immune to control the peripheral parasitaemia. Individuals who control peripheral parasitaemia may clear infections faster, or they may be more likely to return a false negative microscopy report, an issue that may be resolved with a more sensitive test such as PCR (polymerase chain reaction).

One objection to this analysis is that the heterogeneous composition of the population inevitably biases the study. The inferential perils of cross-level analysis can be avoided if the heterogeneous composition of the population is known in advance and incorporated into the analysis, but this is a practical impossibility. Our approach deals with this problem by assuming that the heterogeneity can be treated statistically. We specify a one-parameter family of distributions and allow a shape parameter to be fitted. This analysis corrects for the bias introduced by heterogeneity, assuming that the distribution is properly chosen, and that heterogeneous infection rates are the main source of bias.

We have found a simple approximating model for the relationship between EIR and PR. Our analysis suggests that heterogeneous biting or susceptibility to infection plays an important role in determining PR, that immunity to infection in early childhood does not, and that persistence times for malaria are at least six months, and possibly much longer.

Our findings have broad implications for malaria dynamics. Clearly, PR declines in adults28,29 owing to some kind of immunity that reduces infection, increases clearance, or that reduces apparent PR by lowering the parasite densities in peripheral parasitaemia and decreasing sensitivity by microscopy. The distinction between immunity to clinical disease, immunity to infection, transmission blocking immunity, and the ability of individuals to manage peripheral parasitaemia has been confusing in mathematical models; the epidemiological state often called ‘recovered and immune’ acts as a reservoir of P. falciparum in some models but not in others. A critical evaluation of the mathematical models of malaria is warranted.

The results here are consistent with earlier views of malaria epidemiology; very substantial reductions in EIR will be necessary to achieve modest reductions in PR throughout Africa. In quantitative terms, reducing EIR from 200 to 100 and then to 50 would reduce PR by 4% and then by an additional 5%. A corollary is that the effort required to reduce the disease burden in Africa would be enormous. Such findings have substantial implications for (1) the prospects of control with imperfect vaccines, (2) the development and persistence of anti-malarial drug resistance, and (3) public health interventions that aim to reduce disease without reducing PR. In particular, heterogeneous infection implies that malaria will be substantially more difficult to control if control measures are applied uniformly. Conversely, targeting malaria control at those who are bitten most, where practical, may provide a disproportionate impact and wider community benefits by reducing the frequency of asymptomatic infections, the sporozoite rate in the mosquito population and overall transmission13.

METHODS

Models

Ross’s population dynamic model describes the temporal relationship between EIR and PR, as well as the relationship at equilibrium12. Let x denote PR in a population as a function of time or age. Initially, we assume people become susceptible to infections after clearing an infection. Assuming EIR remains constant, PR changes according to the following equation:

x.=bε(1x)rx (2)

Ross’s relationship between EIR and PR is found by setting x.=0 and solving for x (Table 1). The model is called SIS, because those who are infected (I) become susceptible (S) again after clearing an infection.

For super-infection, PR is the equilibrium of the equation:

x.=bε(1x)f(bε,r)x (3)

where f(Λ, r) = Λ/(eΛ/r − 1). We call this an SI°S model, where I° implies the per capita time to clearance (that is, from I to S) is given by f.

In heterogeneous populations, let s index the population with expected infection rate bsε, and let x(s) denote the proportion of humans in that class that are infected. To describe the distribution of infection rates in the population, let g(s) denote the fraction of the population in class s, and without loss of generality, let g(s) denote a probability distribution function with mean 1. Thus, g(s) affects the distribution of infection rates without changing the mean; describes average infection rates, but individual expectations can vary substantially. The dynamics are described by the equation:

x.(s)=bsε(1x(s))f(bsε,r)x(s) (4)

The population prevalence is found by solving for the equilibrium in equation (4), denoted x(s), and integrating:

0x(s)g(s)ds (5)

Here, we let g(s, k) denote a Γ distribution, with mean 1 and variance 1/k. Thus, the average rate of infection in the population is and the variance of the infection rate is b2ε2/k;1/k is the coefficient of variation of the population infection rate. For this distribution, equation (5) has the closed form solution given by equation (1). This model is called ∫SI°S.

Ross’s model, the heterogeneous infection model, and the superinfection model are closely related. As expected, the functional relationship with super-infection is the limit of a heterogeneous infection model as the variance in expected infection rates approaches 0. Curiously, Ross’s original function is a special case of a heterogeneous infection model (equation (1)) with k = 1. A longer closed form expression can be derived for the model ∫SIS, the heterogeneous model with Ross’s assumption about clearance (not shown). The best fit model ∫SI°S is virtually identical to the Ross analogue of the best fit model ∫SIS but with a very different interpretation (results not shown). Thus, the super-infection clearance assumption does little, per se, to improve the model fit. On the other hand, it may provide a more accurate estimate of the time to clear an infection19.

For immunity to infection, let y denote the proportion of a population that has cleared P. falciparum infections and is immune to re-infection. Let Inline graphic denote the average duration of immunity to re-infection. The dynamics are described by the equations:

x.=bε(1xy)f(bε,r)xy.=f(bε,r)xyϕ (6)

Note that the fitted parameter is actually Inline graphic’ = bInline graphic (see Table 1). This model is called SI°RS, where R means recovered and immune.

For a heterogeneous population model with immunity to infection, let y(s) denote the proportion of recovered and immune hosts. The dynamics are described by the equations:

x.(s)=bsε(1x(s)y(s))f(bsε,r)x(s)y.(s)=f(bsε,r)x(s)y(s)ϕ (7)

We could not find a closed-form expression, so we fitted the function shown in Table 1; numerical integration was performed by R. This model is called ∫SI°RS. Age, microscopy errors and likelihood. Let α denote the sensitivity of microscopy and 1 − β the specificity. The estimated PR, Y, is related to the true PR by the formula Y = αX + β(1 − X); it is biased upwards at low prevalence by false positives (β) and downwards at high prevalence by false negatives (α).

Similarly, the differences in the age distribution of children sampled is a potential source of bias. As we have no information about the age distribution of children actually sampled, we use the bounds for bias correction.

Let Li and Ui be the lower and upper ages of the children from the ith study, and let Inline graphic and Û denote the mean lower and bounds from all the studies. The predicted PR in the ith sample, given the estimated EIR, is Xi = f(εi)+a(LiInline graphic)+b(UiÛ). The predicted estimate of PR in the ith sample, given the estimated EIR is Yi = αXi +β(1 − Xi). The log likelihood of the data, given the model, was:

i=191Pilog(Yi)+Nilog(1Yi) (8)

where Pi was the number of humans that tested positive and Ni the number that tested negative in the ith study. For each underlying function f, we verified that the full model was in fact the best model by backwards fitting (that is, we redid the analysis setting α, β, a and b to zero, one by one, then in combinations).

We have reported the fitted sensitivity and specificity, but we also repeated the analysis assuming a constant sensitivity and specificity for all models. The results and model ranking are similar to those reported here if the assumed values of sensitivity and specificity are high, close to their fitted values. In contrast, when sensitivity and specificity are much lower, the log-linear model is strongly favoured. As the lowest PR that could be measured is β, and as the PR in several studies was close to zero, specificity must be fairly high, at least when PR is low. Some of this information is captured by fitting α and β.

Uncertainty in EIR

The studies used different methods to estimate EIR, so we were uncertain how to estimate the study precision. Moreover, only 38 studies reported sample sizes for their estimates of the human biting rate and the sporozoite rate. To evaluate whether uncertainty in EIR would affect our analysis, we repeated the analysis 1,000 times using equation (1) and the log-linear model. In each instance, we multiplied the estimated EIR by 2ξ, where ξ was a uniform random variable drawn from the interval (−1, 1). The average best-fit parameter estimates changed, but equation (1) was selected as the best model in 98.8% of these trials. We repeated the analysis again multiplying EIR by 4ξ with similar results.

Acknowledgements

We thank D. Bradley, B. Grenfell, F. E. McKenzie, W. Prudhomme, S. Randolph, M. Recker, D. Rogers and L. Waller for comments and suggestions. We also thank the NCEAS working group for discussion. S.I.H. is funded by a Research Career Development Fellowship from the Wellcome Trust. R.W.S. is a Wellcome Trust Senior Research Fellow and acknowledges the support of the Kenyan Medical Research Institute (KEMRI). This work was partially conducted as part of the Environment and Disease Working Group supported by the National Center for Ecological Analysis and Synthesis, a Center funded by NSF, the University of California Santa Barbara, and the State of California. The views presented in this Letter represent the personal views of the authors and do not construe or imply any official position or policy of the Fogarty International Center, National Institutes of Health, Department of Health and Human Services, or the US government.

Footnotes

Author Information Reprints and permissions information is available at npg.nature.com/reprintsandpermissions.

The authors declare no competing financial interests.

References

  • 1.Dietz K. Models for vector-borne parasitic diseases. Lecture Notes Biomath. 1980;39:264–277. [Google Scholar]
  • 2.Dye C, Hasibeder G. Population dynamics of mosquito-borne disease: effects of flies which bite some people more frequently than others. Trans. Soc. Trop. Med. Hyg. 1986;80:69–77. doi: 10.1016/0035-9203(86)90199-9. [DOI] [PubMed] [Google Scholar]
  • 3.Woolhouse ME, et al. Heterogeneities in the transmission of infectious agents: implications for the design of control programs. Proc. Natl Acad. Sci. USA. 1997;94:338–342. doi: 10.1073/pnas.94.1.338. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Hay SI, Guerra CA, Tatem AJ, Atkinson PM, Snow RW. Urbanization, malaria transmission, and disease burden in Africa. Nature Rev. Microbiol. 2005;3:81–90. doi: 10.1038/nrmicro1069. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Hay SI, Guerra CA, Tatem AJ, Noor AM, Snow RW. The global distribution and population at risk of malaria: past, present and future. Lancet Infect. Dis. 2004;4:327–336. doi: 10.1016/S1473-3099(04)01043-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Snow RW, Guerra CA, Noor AM, Myint HY, Hay SI. The global distribution of clinical episodes of Plasmodium falciparum malaria. Nature. 2005;434:214–217. doi: 10.1038/nature03342. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Murray CJL, Lopez AD. Alternative projections of mortality and disability by cause 1990-2020: Global burden of disease study. Lancet. 1997;24:1498–1504. doi: 10.1016/S0140-6736(96)07492-2. [DOI] [PubMed] [Google Scholar]
  • 8.Jamison DT, Creese A, Prentice T. World Health Report, 1999: Making a Difference. World Health Organization; Geneva: 1999. [Google Scholar]
  • 9.Snow RW, Gilles HM. Ch. 5. In: Warrell DA, Gilles HM, editors. Essential Malariology. 4th edn Arnold; London: 2002. pp. 85–106. [Google Scholar]
  • 10.Snow RW, Marsh K. The consequences of reducing transmission of Plasmodium falciparum in Africa. Adv. Parasitol. 2002;52:235–264. doi: 10.1016/s0065-308x(02)52013-3. [DOI] [PubMed] [Google Scholar]
  • 11.Macdonald G. The Epidemiology and Control of Malaria. Oxford Univ. Press; London: 1957. [Google Scholar]
  • 12.Ross R. The Prevention of Malaria. John Murray; London: 1911. [Google Scholar]
  • 13.Nájera JA. A critical review of the field application of a mathematical model of malaria eradication. Bull. World Health Organ. 1974;50:449–457. [PMC free article] [PubMed] [Google Scholar]
  • 14.Smith DL, Dushoff J, McKenzie FE. The risk of a mosquito-borne infection in a heterogeneous environment. PLoS Biol. 2004;2:e368. doi: 10.1371/journal.pbio.0020368. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Takken W, Knols BGJ. Odor-mediated behaviour of Afrotropical malaria mosquitoes. Annu. Rev. Entomol. 1999;44:131–157. doi: 10.1146/annurev.ento.44.1.131. [DOI] [PubMed] [Google Scholar]
  • 16.Hethcote HW, Yorke JA. Lecture Notes in Biomathematics. Vol. 56. Springer; Berlin: 1984. pp. 1–105. [Google Scholar]
  • 17.Walton GA. On the control of Malaria in Freetown, Sierra Leone. I. Plasmodium falciparum and Anopheles gambiae in relation to malaria occurring in infants. Ann. Trop. Med. Parasitol. 1947;41:380–407. doi: 10.1080/00034983.1947.11685341. [DOI] [PubMed] [Google Scholar]
  • 18.Dietz K, Molineaux L, Thomas A. A malaria model tested in the African savannah. Bull. World Health Organ. 1974;50:347–357. [PMC free article] [PubMed] [Google Scholar]
  • 19.Aron JL, May RM. Ch. 5. In: Anderson RM, editor. Population Dynamics and Infectious Disease. London; Chapman and Hall: 1982. pp. 139–179. [Google Scholar]
  • 20.Bailey NTJ. The Biomathematics of Malaria. Oxford Univ. Press; Oxford: 1982. [Google Scholar]
  • 21.McKenzie FE, Sirichaisinthop J, Miller RS, Gasser RA, Wongsrichanalai C., Jr Dependence of malaria detection and species diagnosis by microscopy on parasite density. Am. J. Trop. Med. Hyg. 2003;69:372–376. [PMC free article] [PubMed] [Google Scholar]
  • 22.R Development Core Team . R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing; Vienna, Austria: 2004. ⟨ http://www.R-project.org.3-900051-07-0 ⟩ . [Google Scholar]
  • 23.Beier JC, Killeen GF, Githure JI. Short report: entomologic inoculation rates and Plasmodium falciparum malaria prevalence in Africa. Am. J. Trop. Med. Hyg. 1999;61:109–113. doi: 10.4269/ajtmh.1999.61.109. [DOI] [PubMed] [Google Scholar]
  • 24.Burnham KP, Anderson DA. Model Selection and Multimodel Inference: A Practical Information-Theoretic Approach. 2nd edn Springer; New York: 2002. [Google Scholar]
  • 25.Eyles DE, Young MD. The duration of untreated or inadequately treated Plasmodium falciparum infections in the human host. J. Nat. Malaria Soc. 1951;10:327–336. [PubMed] [Google Scholar]
  • 26.Gu W, et al. Low recovery rates stabilize malaria endemicity in areas of low transmission in coastal Kenya. Acta Trop. 2003;86:71–81. doi: 10.1016/s0001-706x(03)00020-2. [DOI] [PubMed] [Google Scholar]
  • 27.Sama W, Killeen G, Smith T. Estimating the duration of Plasmodium falciparum infection from trials of indoor residual spraying. Am. J. Trop. Med. Hyg. 2004;70:625–634. [PubMed] [Google Scholar]
  • 28.Wilson D. Bagster. Rural hyperendemic malaria in Tanganyika Territory. Trans. R. Soc. Trop. Med. Hyg. 1936;29:583–618. doi: 10.1016/0035-9203(62)90050-0. [DOI] [PubMed] [Google Scholar]
  • 29.Molineaux L, Gramiccia G. The Garki Project. World Health Organization; Geneva: 1980. [Google Scholar]

RESOURCES