Skip to main content
Springer Nature - PMC COVID-19 Collection logoLink to Springer Nature - PMC COVID-19 Collection
. 2005;438(7066):355–359. doi: 10.1038/nature04153

Superspreading and the effect of individual variation on disease emergence

J O Lloyd-Smith 1,2,, S J Schreiber 3, P E Kopp 4, W M Getz 1
PMCID: PMC7094981  PMID: 16292310

Coughs and sneezes...

From Typhoid Mary to SARS, it has long been known that some people spread disease more than others. But for diseases transmitted via casual contact, contagiousness arises from a plethora of social and physiological factors, so epidemiologists have tended to rely on population averages to assess a disease's potential to spread. A new analysis of outbreak data shows that individual differences in infectiousness exert powerful influences on the epidemiology of ten deadly diseases. SARS and measles (and perhaps avian influenza) show strong tendencies towards ‘superspreading events’ that can ignite explosive epidemics — but this same volatility makes outbreaks more likely to fizzle out. Smallpox and pneumonic plague, two potential bioterrorism agents, show steadier growth but still differ markedly from the traditional average-based view. These findings are relevant to how emerging diseases are detected and controlled.

Supplementary information

The online version of this article (doi:10.1038/nature04153) contains supplementary material, which is available to authorized users.

Abstract

Population-level analyses often use average quantities to describe heterogeneous systems, particularly when variation does not arise from identifiable groups1,2. A prominent example, central to our current understanding of epidemic spread, is the basic reproductive number, R0, which is defined as the mean number of infections caused by an infected individual in a susceptible population3,4. Population estimates of R0 can obscure considerable individual variation in infectiousness, as highlighted during the global emergence of severe acute respiratory syndrome (SARS) by numerous ‘superspreading events’ in which certain individuals infected unusually large numbers of secondary cases5,6,7,8,9,10. For diseases transmitted by non-sexual direct contacts, such as SARS or smallpox, individual variation is difficult to measure empirically, and thus its importance for outbreak dynamics has been unclear2,10,11. Here we present an integrated theoretical and statistical analysis of the influence of individual variation in infectiousness on disease emergence. Using contact tracing data from eight directly transmitted diseases, we show that the distribution of individual infectiousness around R0 is often highly skewed. Model predictions accounting for this variation differ sharply from average-based approaches, with disease extinction more likely and outbreaks rarer but more explosive. Using these models, we explore implications for outbreak control, showing that individual-specific control measures outperform population-wide measures. Moreover, the dramatic improvements achieved through targeted control policies emphasize the need to identify predictive correlates of higher infectiousness. Our findings indicate that superspreading is a normal feature of disease spread, and to frame ongoing discussion we propose a rigorous definition for superspreading events and a method to predict their frequency.

Supplementary information

The online version of this article (doi:10.1038/nature04153) contains supplementary material, which is available to authorized users.

Main

For sexually transmitted and vector-borne diseases, host contact rates have long served as surrogates for individual infectiousness3,12,13,14, leading to the assertion of a general ‘20/80 rule’ (whereby 20% of cases cause 80% of transmission13) and to the influential concept of high-risk ‘core groups’3,12,13. For directly transmitted infections, however, the overall infectiousness of each case—that is, the number of other individuals infected during the infectious lifetime of a single individual—arises from a complex mixture of host, pathogen and environmental factors (see Supplementary Notes). Consequently, the degree of infectiousness is distributed continuously in any population4,7,11,15,16 and, crucially, distinct risk groups often cannot be defined a priori2,11. This impedes the conventional approach to adding heterogeneity to epidemic models, in which populations are divided into homogeneous subgroups2,3,4,17. Research on continuous individual variation in infectiousness for directly transmitted infections has been largely restricted to within-household transmission18,19, or to variation in infectious period20,21 or social network22. Some recent studies have used contact tracing data to investigate specific questions in light of observed variation8,16, but a broad understanding of the role of individual variation in outbreak dynamics is lacking.

As a theoretical basis, we introduce the ‘individual reproductive number’, ν, as a random variable representing the expected number of secondary cases caused by a particular infected individual. Values for ν are drawn from a continuous probability distribution with population mean R0 that encodes all variation in infectious histories of individuals, including properties of the host and pathogen and environmental circumstances. In this framework, superspreading events (SSEs) are not exceptional events9, but important realizations from the right-hand tail of a distribution of ν (refs 7, 15). Stochastic effects in transmission are modelled using a Poisson process4, so that the number of secondary infections caused by each case, Z, is described by an ‘offspring distribution’ Pr(Z = k) where Z∼Poisson(ν).

By considering three possible distributions of ν, we generate three candidate models for the offspring distribution: (1) in generation-based models neglecting individual variation, ν = R0 for all cases, yielding Z∼Poisson(R0); (2) in differential-equation models with homogeneous transmission and constant recovery rates, ν is exponentially distributed, yielding Z∼geometric(R0); (3) in a more general formulation, we let ν be gamma-distributed with mean R0 and dispersion parameter k, yielding Z∼negative binomial(R0,k) (ref. 23). The negative binomial model includes the conventional Poisson (k → ∞) and geometric (k = 1) models as special cases. It has variance R0(1 + R0/k), so smaller values of k indicate greater heterogeneity.

We gathered empirical offspring distributions from detailed contact tracing or surveillance data sets, and challenged the candidate models using model selection techniques24 (see Supplementary Notes). For SARS outbreaks in Singapore and Beijing, the negative binomial model is unequivocally favoured (Fig. 1a and Supplementary Table 1). Conventional models assuming homogeneity cannot reproduce the observed transmission patterns. For the Singapore outbreak, the maximum-likelihood estimate is 0.16 (90% confidence interval 0.11–0.64), indicating an underlying distribution of ν that is highly overdispersed (Fig. 1a, inset). According to this analysis, the great majority of SARS cases in Singapore were barely infectious (73% had ν < 1) but a small proportion were highly infectious (6% had ν > 8). This is consistent with field reports from SARS-afflicted regions5,6 but contrasts with published SARS models9,10,25,26.

Figure 1. Evidence for variation in individual reproductive number ν.

Figure 1

a, Transmission data from the SARS outbreak in Singapore in 2003 (ref. 5). Bars show observed frequency of Z, the number of individuals infected by each case. Lines show maximum-likelihood fits for Z∼Poisson (squares), Z∼geometric (triangles), and Z∼negative binomial (circles). Inset, probability density function (solid) and cumulative distribution function (dashed) for gamma-distributed ν (corresponding to Z∼negative binomial) estimated from Singapore SARS data. b, Expected proportion of all transmission due to a given proportion of infectious cases, where cases are ranked by infectiousness. For a homogeneous population (all ν = R0), this relation is linear. For five directly transmitted infections (based on values in Supplementary Table 1), the line is concave owing to variation in ν. c, Proportion of transmission expected from the most infectious 20% of cases, for 10 outbreak or surveillance data sets (triangles). Dashed lines show proportions expected under the 20/80 rule (top) and homogeneity (bottom). Superscript ‘v’ indicates a partially vaccinated population. d, Reported superspreading events (SSEs; diamonds) relative to estimated reproductive number R (squares) for twelve directly transmitted infections. Lines show 5–95 percentile range of Z∼Poisson(R), and crosses show the 99th-percentile proposed as threshold for SSEs. Stars represent SSEs caused by more than one source case. ‘Other’ diseases are: 1, Streptococcus group A; 2, Lassa fever; 3, Mycoplasma pneumonia; 4, pneumonic plague; 5, tuberculosis. R is not shown for ‘other’ diseases, and is off-scale for monkeypox. See Supplementary Notes for details.

Comparing results for eight directly transmitted infections reveals the differing degree of individual variation among diseases and outbreak settings (Fig. 1b, c and Supplementary Tables 1, 2). The Poisson offspring distribution is almost always strongly rejected. The geometric model has considerable support for several data sets, which indicates significant individual variation in transmission rates because real infectious periods are less dispersed than the exponential distribution20,21. The negative binomial model is selected decisively for several data sets, and enables comparative study of diseases through the dispersion parameter. Like SARS, measles in highly vaccinated populations shows high variation in two surveillance data sets, with narrow confidence intervals excluding the conventional models (note that heterogeneous vaccination coverage is an important environmental factor contributing to this pattern). Monkeypox and smallpox viruses show intermediate variation, consistent across multiple data sets, and pneumonic plague transmission is slightly less variable. Data limitations prevent definitive conclusions for other diseases. Comparing our findings to the 20/80 rule proposed for sexually transmitted and vector-borne diseases13, no general rule emerges but the core principle of heterogeneous transmission is certainly supported (Fig. 1c).

Numerous reports of superspreading events provide further evidence for variation in ν. We reviewed 37 published accounts of SSEs for 11 directly transmitted infections (Fig. 1d; see Supplementary Notes). Unrecognized or misdiagnosed illness is the most common cause of these SSEs, followed by alternative modes of spread (especially airborne), high contact rates, and co-infections that aid transmission. High pathogen load or shedding rates are occasionally implicated, but are rarely measured. A consistent and general definition of SSEs is currently lacking—for SARS, an SSE has been arbitrarily defined as Z ≥ 8 (ref. 6), Z ≥ 10 (ref. 5), Z > 10 (ref. 26) or ‘many more than the average number’9, and different thresholds are surely needed for measles (R0∼11–18; ref. 3) or monkeypox (R0 < 1).

We propose this general protocol for defining a superspreading event: (1) estimate the effective reproductive number, R, for the disease and population in question; (2) construct a Poisson distribution with mean R, representing the expected range of Z due to stochasticity without individual variation; (3) define an SSE as any infected individual who infects more than Z(n) others, where Z(n) is the nth percentile of the Poisson(R) distribution. A 99th-percentile SSE is then any case causing more infections than would occur in 99% of infectious histories in a homogeneous population (Fig. 1d). This approach complements a priori identification of potential superspreaders when that is feasible, as for sexually transmitted diseases (where promiscuity drives risk)3,12. In addition, the definition enables prediction of the frequency of SSEs once R0 and k have been estimated (Supplementary Fig. 1)—an outstanding challenge in emerging disease epidemiology8,9.

To assess the effect of individual variation on disease outbreaks, we analyse a branching process model with negative binomial offspring distribution, corresponding to gamma-distributed ν (Fig. 2a; see Supplementary Notes). Of primary interest is the probability of stochastic extinction, q, after the introduction of a single infected individual (Fig. 2b). For R0 < 1, all invasions die out, as in standard models. For R0 > 1, increased variation strongly favours extinction8. For example, if R0 = 3 then q = 0.06 under the assumption of homogeneous ν (k → ∞), or q = 0.33 if k = 1, but if k = 0.16 (as estimated for SARS) then q = 0.76. Extinction risk rises owing to a higher proportion of non-transmitting cases when ν is overdispersed (Figs 1a, 2a and Supplementary Fig. 2a). This effect thwarts invasion by diseases that are very potent spreaders on average: for arbitrarily high R0, q → 1 as k → 0 (Supplementary Fig. 2b). The expected number of cases before extinction is hardly affected by k (Supplementary Fig. 2c), because low-k outbreaks that fail probably lacked SSEs and thus resemble homogeneous outbreaks with lower R0. Accordingly, when individual variation is large, extinction occurs rapidly or not at all (Supplementary Fig. 2d).

Figure 2. Outbreak dynamics with different degrees of individual variation in infectiousness.

Figure 2

a, The individual reproductive number ν is drawn from a gamma distribution with mean R0 and dispersion parameter k. Probability density functions are shown for six gamma distributions with R0 = 1.5 (‘k = Inf’ indicates k → ∞). b, Probability of stochastic extinction of an outbreak, q, versus population-average reproductive number, R0, following introduction of a single infected individual. The value of k increases from top to bottom (values and colours as in a). c, Growth of simulated outbreaks with R0 = 1.5 and one initial case, conditional on non-extinction. Boxes show median and interquartile range (IQR) of the first disease generation with 100 cases; whiskers show most extreme values within 1.5 × IQR of the boxes, and crosses show outliers. Percentages show the proportion of 10,000 simulated outbreaks that reached the 100-case threshold (roughly 1 - q).

For outbreaks avoiding stochastic extinction, epidemic growth rates strongly depend on variation in ν (Fig. 2c and Supplementary Fig. 2e, f). Diseases with high individual variation show infrequent but explosive epidemics after introduction of a single case. This pattern recalls SARS in 2003, for which many settings experienced no epidemic despite unprotected exposure to SARS cases27,28, whereas a few cities suffered explosive outbreaks8,9,10,15,26. Our results, using = 0.16 for SARS, explain this simply by the presence or absence of high-ν individuals in the early generations of each outbreak6. In contrast, conventional models (with k = 1 or k → ∞) cannot simultaneously generate frequent failed invasions and rapid growth rates without additional, subjective model structure.

Disease control interventions could increase or decrease individual variation in infectiousness. Infected individuals might reduce their number of non-essential contacts, or governments might impose quarantine or isolation on particular individuals. Here we explore several idealized cases theoretically, for an outbreak with offspring distribution Z∼negative binomial(R0,k) before control (see Supplementary Notes). Consider the effect of control effort c, where c = 0 reflects no control and c = 1 reflects complete blockage of transmission. Under population-wide control, the infectiousness of every individual in the population is reduced by a factor c (that is, Inline graphic for all individuals). Under random, individual-specific control, a proportion c of infected individuals (chosen at random) is traced and isolated completely such that they cause zero infections (that is Inline graphic for a proportion c of infected individuals, and Inline graphic for the rest). Individual-specific control raises the degree of heterogeneity in the outbreak as measured by the variance-to-mean ratio of Z, whereas population-wide control reduces heterogeneity. Both approaches yield effective reproductive number R = (1 - c)R0, so the threshold control effort for guaranteed disease extinction is c ≥ 1 - 1/R0 as in conventional models. For intermediate values of c, however, the individual-specific approach always works better (Fig. 3a and Supplementary Fig. 3a, b), consistent with our finding that higher variation favours disease extinction (Fig. 2b). Branching process theory confirms that qind > qpop whenever c∈(0,1 - 1/R0) (see Supplementary Notes).

Figure 3. Implications for control measures.

Figure 3

a, Increase in extinction probability (qind - qpop) under individual-specific control compared to population-wide control, for diseases with R0 = 3 and different degrees of individual variation, k, subject to control effort c. With population-wide control, the infectiousness of all individuals is reduced by a factor c. With individual-specific control, a proportion c of infectious individuals (selected at random) have their infectiousness reduced to zero. The outbreak is assumed to begin with one case, with control present from the outset. b, Estimates of and from outbreak data sets before and after control measures were initiated (joined by solid lines; Supplementary Table 2), and post-control values of kc estimated from theoretical models of control as described in the Supplementary Notes. c, Effect of random versus targeted control measures. The probability of outbreak containment (defined as never reaching the 100-case threshold) for four diseases with R0 = 3 and k = 0.1 (blue), k = 0.5 (green), k = 1 (black) or k → ∞ (purple). Control policies are population-wide (solid lines), random individual-specific (dotted lines), or targeted individual-specific (dashed lines, where half of all control effort is focused on the most infectious 20% of cases). For k → ∞, all individuals are identical, so targeting has no effect and dotted and dashed lines overlay one another. d, The factor by which targeting increases the effect of control on preventing a major outbreak, relative to random individual-specific control (see Supplementary Notes), when 20%, 40% or 60% of the total population is controlled. Results in c and d are the mean of 10,000 simulations, with control beginning in the second generation of cases.

To assess the realism of these idealized control scenarios, we analysed contact tracing data from four outbreaks before and after imposition of control measures. Control always lowered the estimated dispersion parameter (that is c < ) as predicted by the individual-specific model (Fig. 3b), although small sample sizes often led to overlapping confidence intervals (Supplementary Table 2). This increased skew in transmission arose chiefly from undiagnosed or misdiagnosed individuals, who continued to infect others (and even cause SSEs), whereas controlled individuals infected very few. To further examine our control theories, we calculated Inline graphic and Inline graphic for each data set; c was always closer to Inline graphic, although twice it fell between the two predictions, indicating a possible combination of control mechanisms (Fig. 3b). Real-world control thus seems to increase individual variation, favouring extinction but risking ongoing SSEs. Larger data sets are needed to establish this pattern definitively.

If highly infectious individuals can be identified predictively (see Supplementary Notes) then the efficiency of control could be greatly increased (Fig. 3c, d). Focusing half of all control effort on the most infectious 20% of cases is up to threefold more effective than random control (Fig. 3d). When k = 0.1 or 0.5, outbreak containment is assured for targeted control levels at roughly half the threshold level of c = 1 - 1/R0 for random control. Gains in efficiency increase with more intense targeting of high-ν cases, but saturate as overall coverage c increases (Supplementary Fig. 3c, d). Again, branching process theory generalizes these findings: for a given proportion c of individuals controlled, greater targeting of higher-ν individuals leads to lower effective reproductive number R and higher extinction probability q (see Supplementary Notes).

The data sets analysed here were collected from published literature, and may be subject to selection bias for successful invasions and SSEs rather than typical disease behaviour. Surveillance data sets are less vulnerable to this bias, but may under-report isolated cases. We urge that detailed transmission tracing data be collected and made public whenever possible, even if unexceptional. At a minimum, we propose a new measure for inclusion in outbreak reports: the proportion of cases not transmitting (p0), which, together with R0 is sufficient to estimate the degree of variation in ν (Supplementary Fig. 4). As more data become available, trends may emerge in the degree of variation present, for example, for different modes of spread or levels of virulence. Richer data sets may also enable testing of the branching process assumption that case outcomes are independent and identically distributed, by detecting possible correlations in ν values within transmission lineages or systematic changes as outbreaks progress.

Our results have broad implications for emerging disease epidemiology, and open challenges for further work. Explosive epidemics demand rapid action by authorities and can strain health infrastructures. High extinction probabilities indicate that disease introductions or host species jumps may be more frequent than currently suspected. Cluster-size surveillance for pathogen adaptation29 or dwindling population immunity30 should be tuned to observed levels of variation. Realization of targeted control measures requires a better understanding of factors determining individual infectiousness. This work must be integrated with established theory of sexually transmitted diseases and social networks, where high-risk groups exert nonlinear influence on R0 because contact rates affect infectiousness and susceptibility equally3,4,12,13,22. All diseases probably show intermediate degrees of covariation between infectiousness and susceptibility, a topic demanding empirical and theoretical study17. The central role of R0 in epidemic analysis is unassailable, but our findings show that emerging disease outbreaks cannot be fully understood if individual variation in infectiousness is neglected. Examination of other population processes dependent on small numbers of individuals may yield similar insights.

Methods

Analysis of disease data

For data sets including the full distribution of Z, we estimated 0 and using maximum-likelihood methods. The candidate models were compared using Akaike's information criterion (AICc) modified for small sample size. Confidence intervals for were estimated by bias-corrected non-parametric bootstrapping and corroborated by four other methods. For data sets including only estimates of 0 and the proportion of cases not transmitting (0), we estimated by solving 0 = (1 + 0/k)-k numerically, and evaluated the candidate models using confidence intervals calculated by two methods. Expected proportions of transmission due to particular groups of infectious individuals (Fig. 1b, c) were calculated using the gamma distribution of ν with estimated values of 0 and . See Supplementary Notes for details, and for descriptions of data sets.

Branching process analysis

Analysis of branching process models centres on the probability generating function (pgf) of the offspring distribution, Inline graphic, defined for |s| ≤ 1. When R0 > 1, the long-term probability of disease extinction after introduction of a single infected individual is the unique solution of q = g(q) on the interval (0,1). For a negative binomial offspring distribution Z ≈ NegB(R0,k), the pgf is Inline graphic. Under population-wide control, Inline graphic and therefore Inline graphic, and the variance-to-mean ratio is 1 + (1 - c)R0/k. Under random individual-specific control, the exact pgf is Inline graphic with variance-to-mean ratio 1 + R0/k + cR0. This scenario can be approximated by Inline graphic , where Inline graphic is the solution to Inline graphic and decreases monotonically as c increases. Further details, descriptions of outbreak simulations and formal analysis of control measures are found in the Supplementary Notes.

Supplementary information

Supplementary Notes (435.6KB, pdf)

This file contains additional discussion of factors contributing to individual variation in infectiousness, methodological details on statistical and modelling analyses, and details of outbreak datasets and superspreading events. (PDF 435 kb)

Supplementary Figures (322KB, pdf)

The file includes Supplementary Figures 1 to 4, with captions. The figures address the prediction of superspreading event frequency, further results on outbreak dynamics and control, and estimation of the dispersion parameter k with limited data. (PDF 322 kb)

Supplementary Table 1 (71.1KB, pdf)

A summary of results from our statistical analysis of uncontrolled outbreaks (corresponds to Figure 1a–c). (PDF 71 kb)

Supplementary Table 2 (94.5KB, pdf)

Detailed results from our statistical analysis of uncontrolled outbreaks (elaborating on Supplementary Table 1), and the analysis of data before and after control measures were applied in four outbreaks. (PDF 94 kb)

Acknowledgements

We are grateful for comments and data suggestions from B. Bolker, J. Edmunds, N. Ferguson, A. Galvani, R. Gani, N. Gay, J. Gog, B. Grenfell, H. Hethcote, D. Heymann, A. Hubbard, N. Jewell, J. Lauer, R. May, T. Porco, C. Roth, D. Smith and B. Williams. We thank R. Gani for providing unpublished data from a previous publication, and L. Matthews for sharing work ahead of print. Our research was supported by the NSF, NIH-NIDA, the James S. McDonnell Foundation, the NSF/NIH Ecology of Infectious Disease Program, and the South African Centre for Epidemiological Modelling and Analysis (SACEMA). Author Contributions J.O.L.-S. and W.M.G. conceived the study. J.O.L.-S. collected and analysed outbreak data, conducted dynamic modelling, and drafted and revised the text. S.J.S. conducted formal analysis of branching processes and control measures. W.M.G. provided technical input on superspreading and control analyses. All authors contributed conceptually, and edited or commented on the text.

Competing interests

Reprints and permissions information is available at npg.nature.com/reprintsandpermissions. The authors declare no competing financial interests.

References

  • 1.Levin SA, Grenfell B, Hastings A, Perelson AS. Mathematical and computational challenges in population biology and ecosystems science. Science. 1997;275:334–343. doi: 10.1126/science.275.5298.334. [DOI] [PubMed] [Google Scholar]
  • 2.Becker NG, Britton T. Statistical studies of infectious disease incidence. J. R. Stat. Soc. B. 1999;61:287–307. doi: 10.1111/1467-9868.00177. [DOI] [Google Scholar]
  • 3.Anderson RM, May RM. Infectious Diseases of Humans: Dynamics and Control. 1991. [Google Scholar]
  • 4.Diekmann O, Heesterbeek JAP. Mathematical Epidemiology of Infectious Diseases: Model Building, Analysis, and Interpretation. 2000. [Google Scholar]
  • 5.Leo YS, et al. Severe acute respiratory syndrome—Singapore, 2003. Morbid. Mortal. Wkly Rep. 2003;52:405–411. [PubMed] [Google Scholar]
  • 6.Shen Z, et al. Superspreading SARS events, Beijing, 2003. Emerg. Infect. Dis. 2004;10:256–260. doi: 10.3201/eid1002.030732. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Dye C, Gay N. Modeling the SARS epidemic. Science. 2003;300:1884–1885. doi: 10.1126/science.1086925. [DOI] [PubMed] [Google Scholar]
  • 8.Lipsitch M, et al. Transmission dynamics and control of severe acute respiratory syndrome. Science. 2003;300:1966–1970. doi: 10.1126/science.1086616. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Riley S, et al. Transmission dynamics of the etiological agent of SARS in Hong Kong: Impact of public health interventions. Science. 2003;300:1961–1966. doi: 10.1126/science.1086478. [DOI] [PubMed] [Google Scholar]
  • 10.Bauch CT, Lloyd-Smith JO, Coffee M, Galvani AP. Dynamically modeling SARS and other newly-emerging respiratory illnesses: past, present, future. Epidemiology. 2005;16:791–801. doi: 10.1097/01.ede.0000181633.80269.4c. [DOI] [PubMed] [Google Scholar]
  • 11.Koopman J. Modeling infection transmission. Annu. Rev. Public Health. 2004;25:303–326. doi: 10.1146/annurev.publhealth.25.102802.124353. [DOI] [PubMed] [Google Scholar]
  • 12.May RM, Anderson RM. Transmission dynamics of HIV infection. Nature. 1987;326:137–142. doi: 10.1038/326137a0. [DOI] [PubMed] [Google Scholar]
  • 13.Woolhouse MEJ, et al. Heterogeneities in the transmission of infectious agents: Implications for the design of control programs. Proc. Natl Acad. Sci. USA. 1997;94:338–342. doi: 10.1073/pnas.94.1.338. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Lloyd-Smith JO, Getz WM, Westerhoff HV. Frequency-dependent incidence in models of sexually transmitted diseases: portrayal of pair-based transmission and effects of illness on contact behaviour. Proc. R. Soc. Lond. B. 2004;271:625–634. doi: 10.1098/rspb.2003.2632. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Anderson RM, et al. Epidemiology, transmission dynamics and control of SARS: the 2002–2003 epidemic. Phil. Trans. R. Soc. Lond. B. 2004;359:1091–1105. doi: 10.1098/rstb.2004.1490. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Gani R, Leach S. Epidemiologic determinants for modeling pneumonic plague outbreaks. Emerg. Infect. Dis. 2004;10:608–614. doi: 10.3201/eid1004.030509. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Becker N, Marschner I. Stochastic Processes in Epidemic Theory. 1990. pp. 90–103. [Google Scholar]
  • 18.Becker N. Analysis of Infectious Disease Data. 1989. pp. 48–59. [Google Scholar]
  • 19.Bailey NTJ. The Mathematical Theory of Infectious Diseases. 1975. pp. 263–265. [Google Scholar]
  • 20.Keeling MJ, Grenfell BT. Disease extinction and community size: Modeling the persistence of measles. Science. 1997;275:65–67. doi: 10.1126/science.275.5296.65. [DOI] [PubMed] [Google Scholar]
  • 21.Lloyd AL. Destabilization of epidemic models with the inclusion of realistic distributions of infectious periods. Proc. R. Soc. Lond. B. 2001;268:985–993. doi: 10.1098/rspb.2001.1599. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Meyers LA, Pourbohloul B, Newman MEJ, Skowronski DM, Brunham RC. Network theory and SARS: predicting outbreak diversity. J. Theor. Biol. 2005;232:71–81. doi: 10.1016/j.jtbi.2004.07.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Boswell MT, Patil GP. Random Counts in Models and Structures. 1970. pp. 7–8. [Google Scholar]
  • 24.Burnham KP, Anderson DR. Model Selection and Multimodel Inference: A Practical Information-Theoretic Approach. 2002. [Google Scholar]
  • 25.Lloyd-Smith JO, Galvani AP, Getz WM. Curtailing transmission of severe acute respiratory syndrome within a community and its hospital. Proc. R. Soc. Lond. B. 2003;270:1979–1989. doi: 10.1098/rspb.2003.2481. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Wallinga J, Teunis P. Different epidemic curves for severe acute respiratory syndrome reveal similar impacts of control measures. Am. J. Epidemiol. 2004;160:509–516. doi: 10.1093/aje/kwh255. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Ha LD, et al. Lack of SARS transmission among public hospital workers, Vietnam. Emerg. Infect. Dis. 2004;10:265–268. doi: 10.3201/eid1002.030707. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Park BJ, et al. Lack of SARS transmission among healthcare workers, United States. Emerg. Infect. Dis. 2004;10:244–248. doi: 10.3201/eid1002.030793. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Ferguson NM, Fraser C, Donnelly CA, Ghani AC, Anderson RM. Public health risk from the avian H5N1 influenza epidemic. Science. 2004;304:968–969. doi: 10.1126/science.1096898. [DOI] [PubMed] [Google Scholar]
  • 30.Farrington CP, Kanaan MN, Gay NJ. Branching process models for surveillance of infectious diseases controlled by mass vaccination. Biostatistics. 2003;4:279–295. doi: 10.1093/biostatistics/4.2.279. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Notes (435.6KB, pdf)

This file contains additional discussion of factors contributing to individual variation in infectiousness, methodological details on statistical and modelling analyses, and details of outbreak datasets and superspreading events. (PDF 435 kb)

Supplementary Figures (322KB, pdf)

The file includes Supplementary Figures 1 to 4, with captions. The figures address the prediction of superspreading event frequency, further results on outbreak dynamics and control, and estimation of the dispersion parameter k with limited data. (PDF 322 kb)

Supplementary Table 1 (71.1KB, pdf)

A summary of results from our statistical analysis of uncontrolled outbreaks (corresponds to Figure 1a–c). (PDF 71 kb)

Supplementary Table 2 (94.5KB, pdf)

Detailed results from our statistical analysis of uncontrolled outbreaks (elaborating on Supplementary Table 1), and the analysis of data before and after control measures were applied in four outbreaks. (PDF 94 kb)


Articles from Nature are provided here courtesy of Nature Publishing Group

RESOURCES