Phylodynamic Inference and Model Assessment with Approximate Bayesian Computation: Influenza as a Case Study

Oliver Ratmann; Gé Donker; Adam Meijer; Christophe Fraser; Katia Koelle

doi:10.1371/journal.pcbi.1002835

. 2012 Dec 27;8(12):e1002835. doi: 10.1371/journal.pcbi.1002835

Phylodynamic Inference and Model Assessment with Approximate Bayesian Computation: Influenza as a Case Study

Oliver Ratmann ^1,^2,^*, Gé Donker ³, Adam Meijer ⁴, Christophe Fraser ², Katia Koelle ^1,⁵

Editor: Sergei L Kosakovsky Pond⁶

PMCID: PMC3531293 PMID: 23300420

Abstract

A key priority in infectious disease research is to understand the ecological and evolutionary drivers of viral diseases from data on disease incidence as well as viral genetic and antigenic variation. We propose using a simulation-based, Bayesian method known as Approximate Bayesian Computation (ABC) to fit and assess phylodynamic models that simulate pathogen evolution and ecology against summaries of these data. We illustrate the versatility of the method by analyzing two spatial models describing the phylodynamics of interpandemic human influenza virus subtype A(H3N2). The first model captures antigenic drift phenomenologically with continuously waning immunity, and the second epochal evolution model describes the replacement of major, relatively long-lived antigenic clusters. Combining features of long-term surveillance data from the Netherlands with features of influenza A (H3N2) hemagglutinin gene sequences sampled in northern Europe, key phylodynamic parameters can be estimated with ABC. Goodness-of-fit analyses reveal that the irregularity in interannual incidence and H3N2's ladder-like hemagglutinin phylogeny are quantitatively only reproduced under the epochal evolution model within a spatial context. However, the concomitant incidence dynamics result in a very large reproductive number and are not consistent with empirical estimates of H3N2's population level attack rate. These results demonstrate that the interactions between the evolutionary and ecological processes impose multiple quantitative constraints on the phylodynamic trajectories of influenza A(H3N2), so that sequence and surveillance data can be used synergistically. ABC, one of several data synthesis approaches, can easily interface a broad class of phylodynamic models with various types of data but requires careful calibration of the summaries and tolerance parameters.

Author Summary

The infectious disease dynamics of many viral pathogens like influenza, norovirus and coronavirus are inextricably tied to their evolution. This interaction between evolutionary and ecological processes complicates our ability to understand the infectious disease behavior of rapidly evolving pathogens. Most statistical methods for the analysis of these “phylodynamics” require that the likelihood of the data can be explicitly calculated. Currently, this is not possible for many phylodynamic models, so that questions on the interaction between viral variants cannot be well-addressed within this framework. Simulation-based statistical methods circumvent likelihood calculations. Considering interpandemic human influenza A virus subtype H3N2, we here illustrate the effectiveness of these methods to fit and assess complex phylodynamic models against both sequence and surveillance data. We find that combining molecular genetic and epidemiological data is key to estimate phylodynamic parameters reliably. Moreover, the information in the available data taken together is enough to expose quantitative model inconsistencies. Methods such as ABC which can combine sequence and surveillance data appear to be well-suited to fit and assess mechanistic hypotheses on the phylodynamics of RNA viruses.

Introduction

Many infectious pathogens, most notably RNA viruses, evolve on the same time scale as their ecological dynamics [1]. One of the perhaps best documented examples are human influenza A viruses, which cause substantial morbidity and mortality as they escape host immunity predominantly through the evolution of their surface antigens [2]. The resulting, dynamical interaction between the ecological and evolutionary processess can be better understood through the formulation and simulation of so-called “phylodynamic” mathematical models, e.g. [3]–[8]. However, while data on disease incidence as well as viral genetic and antigenic variation are increasing for many viruses, e.g. [9]–[13], fitting and assessing phylodynamic models to these data is still not commonly done.

Historically, epidemiological time series data have been pervasively used to analyze hypotheses of host-pathogen interactions at the population level [14]–[17]. However, time series data capture the underlying evolutionary processes of pathogens only very indirectly. For flu, this has limited the type of infectious disease models that can be statistically interfaced with time series data, and the number of epidemiological parameters that can be simultaneously estimated [18], [19]. Consequently, the disease behavior of rapidly evolving pathogens is increasingly studied under additional, complementary data sets [1], most typically in ways that attempt to qualitatively reproduce prominent disease attributes [3]–[8].

More recently, coalescent-based statistical methods have been used to elucidate the disease dynamics of RNA viruses from molecular genetic data alone [20]. These methods have been particularly useful to reconstruct epidemiological transmission histories, identifying when and where transmission occurred and how viral populations change over time. For example, coalescent-based analyses have highlighted the importance of the tropics in the complex circulation dynamics of human influenza A (H3N2) virus (in short: H3N2) [9], [21], [22]. However, most coalescent methods estimate past population dynamics within a class of flexible demographic functions including exponential and logistic growth as well as the nonparametric Bayesian skyride [23], [24]; but see also [25]. These demographic functions do not explicitly describe the non-linear population dynamics of RNA viruses. Thus, assessing which ecological interactions underlie observed patterns of sequence diversity, and estimating the respective strength of these interactions, is difficult within this framework.

Because of these limitations, we adopt a different statistical approach known as Approximate Bayesian Computation (ABC) to infer the phylodynamics of RNA viruses. ABC allows mechanistic phylodynamic models to be simultaneously fitted against both sequence and surveillance data. This method circumvents explicit likelihood calculations by simulating instead from the stochastic model that defines the likelihood [26]. Recent extensions of ABC allow for model assessment to be carried out at no further computational cost [27]. We further suggest incorporating variable selection procedures to quantify if and to what extent the data provide support for the inclusion of specific model components [28].

To demonstrate the utility of our approach, we consider the phylodynamics of interpandemic H3N2. We obtained weekly reports of H3N2 incidence in the Netherlands from 1994–2009 by combining influenza-like-illness (ILI) surveillance data with detailed records of associated, laboratory-confirmed cases of flu by type and subtype [29], [30], and similarly for France and the USA; see Figure 1 and the supplementary online material (Text S1). In addition, we reconstructed the ladder-like phylogeny of H3N2's haemagglutinin gene (HA) from dated European sequences collected in 1968–2009 (see Figure 1 and Text S1). To represent H3N2's global phylodynamics, we focus on a class of spatially structured phylodynamic compartmental models that formalize probabilistically how evolving, antigenic variants interact epidemiologically. These antigenic variants might correspond to the major antigenic clusters that are distinguishable in H3N2 antigenic maps [31], but can in principle also represent a different phenotypic resolution. The evolutionary dynamics of viral genotypes are separately formulated for each antigenic phenotype because genetic distances do not necessarily easily translate into phenotypic relationships [5]. Spatial substructure has been incorporated in several models of H3N2 phylodynamics to reflect the global circulation of the virus [4], [8], [32]. We adopt here a simple source-sink framework, where the sink is thought of as the Netherlands into which viral genetic diversity and antigenic strains are imported on a seasonal scale from a source population where the virus persists [9], [33]. We fit and assess two distinct models to the combined features of sequence and incidence data described in Figure 1 and Table 1. The first model captures H3N2's antigenic drift phenomenologically through gradual loss of immunity, and the second model describes the antigenic evolution of the virus explicitly with particular assumptions on the tempo of antigenic change.

(A) Weekly ILI time series from the Netherlands, and estimated time series of influenza A(H3N2) from weekly virological data. Type and subtype specific time series were estimated under an additive Negative Binomial regression model; see Text S1. (B) Reconstructed HA phylogeny from 776 European sequences with known times of isolation. The phylogeny was inferred with the BEAST program under a relaxed Exponential clock; see Text S1. (C) H3N2 seasonal attack rates (rATT), calculated from estimated H3N2 case report times series in the Netherlands in 1994–2009 (blue), and the USA (cyan) as well as France in 1997–2008 (black). (D) Ratio of consecutive case report attack rates on the log scale. (E) Autocorrelation of case report peaks. (F) Histogram of the duration of seasonal epidemics at half their peak size. (G) Number of estimated nucleotide substitutions of dated HA sequences from the root A/Bilthoven/16190/68 as in Smith et al. [31]. Nucleotide substitutions were estimated with BEAST under an Exponential clock (red) and Lognormal clock model (violet). (H) Histograms of pairwise nucleotide diversity among sequences collected in the same season. (I) Time series of the number of phylogenetic lineages circulating within the same month. (J) Time series of the time to the most recent common ancestor of phylogenetic lineages circulating within the same month. Colors from H to J are as in G.

Table 1. Basic phylodynamic summaries of H3N2 surveillance data and phylogenies, and calibrated weighting schemes.

shorthand	summary	data	distance	summary values and distances^*			weighting scheme^†
				Netherlands	France	USA	under the SEIRS model	under the epochal evolution model
-attack	average , where is the total case report incidence in season		log ratio	0.56%	1.9% (−1.26)	1.4%(−0.97)	Indicator (3)	Indicator (3)


-attack	standard deviation in		log ratio	1.68	2.78 (−0.5)	2.24 (−0.28)	Indicator (3)	Indicator (3)


explosiveness	average duration of reported seasonal epidemics at half their peak size	time series 1994–2009	log ratio	3.2	4.54 (−0.32)	5.81 (−0.57)	Indicator (3)	Indicator (3)


correlation	Pearson autocorrelation of case report peaks at a lag of 2 & 4 years		largest difference	0.07 & 0	0.06 & −0.27 (−0.27)	−0.06 & 0.23 (0.23)	Exponential (4)	Indicator (3)


pop-attack	largest seasonal population-level attack rate	Ref. [2]	difference		20%		Indicator (3)	Exponential (4)

Open in a new tab

Distances between summaries derived from the first listed and subsequent data sets are given in brackets.

^†

Weighting schemes differ across models to accommodate weak or strong inconsistencies; see also Table 3.

^‡

The number of dated HA sequences available before Inline graphic is very small, so that these years effectively do not contribute to the diversity. To make this sampling effect more apparent, all phylogenetic summaries except the divergence are only computed on the period 1991–2009.

Methods

Approximate Bayesian Computation

To perform phylodynamic inference and goodness-of-fit analyses for complex phylodynamic models, we adopt a simulation-based approach that has become known as Approximate Bayesian Computation (ABC) [26]. Our first goal is to estimate the posterior density

(1)

of epidemiological and evolutionary model parameters Inline graphic under approximations to the likelihood of observed population incidence and phylogenetic data . The prior density can be used to incorporate existing information or limit the range of plausible values of model parameters. Our second goal is to assess fitted phylodynamic models based on a recent extension of ABC [27].

ABC methods circumvent computations of the likelihood Inline graphic by comparing the observed data to simulated data in terms of many, lower-dimensional summary statistics , , such as those in Figure 1. Using a distance function that compares summaries, each simulation is weighted according to the magnitude of the summary error under a weighting scheme Inline graphic , and this value is used in place of the likelihood term in Monte Carlo algorithms. In essence, ABC is a particular auxiliary variable Monte Carlo method, where the summary errors take on the role of auxiliary variables. Integrating these errors out, the ABC likelihood approximation adopted here is

(2)

where the weighting scheme is typically the Indicator

(3)

with tolerance parameter Inline graphic or the Exponential

(4)

with Inline graphic . Intuitively, the summary errors indicate how well a parameterized model reproduces the observed data. Once Monte Carlo algorithms such as the Markov Chain Monte Carlo (MCMC) sampler proposed by Marjoram et al. [34] have converged, the magnitude of the summary errors can be used to diagnose goodness-of-fit with respect to each of the summaries Inline graphic . To use this detailed information on each summary, we prefer using (2) to the Mahanalobis approximation (see [26]). Although uncommon, we typically use the log ratio so that the errors can be uniformly interpreted as fold-deviations. Parameter inference using ABC is approximate in that the ABC target density Inline graphic approaches the posterior density (1) as tends to zero if the summaries are sufficient for [26]. We use a Monte Carlo algorithm that is very similar to the MCMC sampler in Figure 2. A full specification of the algorithm is given in Text S1.

Phylodynamic hypotheses are formulated into evolving, dynamical systems models. We used a two-tier model formulation whose genetic component is tied to its ecological component through the flows through the prevalence class. Existing knowledge on model parameters is incorporated through the prior , and Monte Carlo algorithms such as MCMC are used to fit the model to different types of data, e.g. incidence time series and reconstructed phylogenies (see Figure 1) with an ABC approach. ABC is based on likelihood approximations such as (2), which requires a specification of phylodynamic summaries (e.g. Table 1). The summary errors are used to diagnose if the fitted phylodynamic model is consistent with available data in terms of the specified summaries.

It is typically difficult to establish the sufficiency of phylodynamic summaries analytically, and instead a small set of summaries is chosen such that model parameters of interest can be estimated [26]. Table 1 lists basic features of H3N2 epidemiological and phylogenetic data that were primarily considered in this study. Phylodynamic models were fitted and assessed against the features of the Dutch incidence data and the viral phylogeny derived under the Exponential clock model. The differences between these summaries and those derived from the remaining data in Figure 1 were used to set the ABC tolerances large enough so that inference is robust to the choice of phylogenetic reconstruction method and reporting country. Although smaller tolerances can be computationally feasible, these were not supported by the additional data considered. We typically use the Indicator weighting scheme (3) with tolerances Inline graphic that encompass differences in summary values across reporting countries and/or reconstruction methods, see Table 1. When a model never fits a particular summary well, we use (4) to give a mild prior preference to small errors [27]. See Text S1 for further details.

Spatial two-tier models to represent H3N2 phylodynamics

Deterministic skeleton

ABC methods require that each phylodynamic simulation must run on the order of tens of seconds. To meet this computational requirement while still allowing for flexible modeling [6], [35], we adopt a two-tier approach that separates the genotypes of rapidly evolving viruses from their antigenic phenotypes [7]. The underlying rationale is that differences in genotype are only relevant from a population dynamic perspective if they translate into perceivable phenotypic differences. The first tier describes the dynamic interactions of antigenic variants in the host population, here in terms of coupled susceptible-exposed-infected-recovered-susceptible (SEIRS) equations that are further spatially structured into a strongly seasonally forced sink population and a re-seeding, weakly seasonally forced source population (denoted by Inline graphic and respectively). The second tier simulates a phylogeny that is consistent with the prevalence and incidence dynamics of each antigenic unit in the first tier. Assuming polarized immunity [3], the deterministic skeleton for the th antigenic unit is

graphic file with name pcbi.1002835.e085.jpg

(5a)

graphic file with name pcbi.1002835.e086.jpg

(5b)

(5c)

(5d)

where all model parameters are described in Table 2 or below. Two infectious subcompartments Inline graphic , are employed to obtain more realistic infectiousness profiles [36]. is the number of individuals infected with the th genotype of the th antigenic unit, , for convenience, and and for all .

Table 2. Phylodynamic model parameters, prior and estimated densities.

symbol	description	prior density	meanstd. dev., 95% conf. interval of
			posterior density under the SEIRS model	posterior density under the epochal evolution model
	Basic reproductive number	uninformative	3.030.55, [1.77, 4.14]	18.75.3, [9.2, 26.8]
	effective reproductive number	-	1.260.05, [1.17, 1.35]	1.420.12, [1.27, 1.51]
	Average incubation period in days	0.9
	Average infectiousness period in days	1.8
	Average duration of immunity in years	uninformative	9.81.8, [6.5, 12.2]	206103, [46, 380]
	Reporting rate	uninformative	0.150.06, [0.06, 0.26]	0.560.23, [0.25, 0.95]
	Residual selection	Exponential slab with mean 0.007 & Gaussian pseudo-prior centered at 0.09 [28]	0.10.16, [0.01, 0.44]	0.040.07, [0.001, 0.12]
	Inclusion probability of	uninformative	10, [1,1]	10, [1,1]
	Mutation rate,	uninformative	1.320.3, [1.0, 1.9]	3.381.2, [1.8, 5.4]
	Size of sink population	fixed to Dutch demographic data, http://statline.cbs.nl
	Size of source population	uninformative	1.280.95, [0.43, 3.6]10	2.91.6, [0.7, 5.7]10
	Birth/death rate in the sink population	fixed to Dutch demographic data
	Birth/death rate in the source population,	1/50; average lifespan of 60 years adjusted by net fertility rate in South East Asia
	Seasonal forcing in the sink population	see Text S1	0.420.14, [0.3, 0.6]	0.350.15, [0.12, 0.58]
	Seasonal forcing in the source population	; key assumption, see Text S1	0.010.007, [0.002, 0.02]	0.0130.006, [0.008,0.02]
	Number of travelers visiting the sink population	; encompassing lowest & highest annual records; http://statline.cbs.nl	8.52.8, [3.6, 14.1]10	9.93.4, [3.8, 14.6]10
	Fraction of re-seeding the source population		0.060.03, [0.01, 0.09]	0.060.03, [0.02, 0.09]
	Partial cross-immunity of mother-daughter variants	uninformative	-	0.760.05, [0.67, 0.85]
	Scale parameter of the antigenic emergence rate	uninformative	-	38697, [247, 533]
	Shape parameter of the antigenic emergence rate	2; Ref. [7]

Open in a new tab

In the first tier (5a–5b), competition between two antigenic variants Inline graphic , arises through resource depletion via partial cross-immunity that decays multiplicatively with kinship level , , where is the degree of cross-immunity between mother-daughter variants [3]. The emergence of antigenic variants is described phenomenologically with a per capita hazard function Inline graphic after the emergence time of the resident phenotype [7]. The hazard function is parameterized with a scale parameter and a shape parameter . The strength of sinusoidal seasonal forcing in the source population in the transmission parameter, , is assumed to be much smaller than in the asynchroneously forced sink population, Inline graphic , and is set so that transmission peaks at the winter solstice in the Northern hemisphere. is the number of infected visiting travelers from the source, while is the number of individuals that re-seed the source population. Thus, the source population can be interpreted as an interconnected, re-seeding tropical region whose population size Inline graphic is to be estimated. We further calibrate the sink population to represent the Netherlands, using demographic data to specify and over the study period 1968–2009. To fit model (5), we transform into at disease equilibrium of a single variant [14], and define , by and where is the number of infected individuals at disease equilibrium of a single variant.

In the second tier (5c–5d), the instantaneous loss in Inline graphic is proportional to genotype frequency, while the gain in is weighted by the fitness advantage of each genotype. As before [7], fitness is assumed to increase linearly with the number of nucleotide mutations between the th genotype and the founder genotype of the th antigenic variant. The total number of infections and losses Inline graphic and are the simulated transitions in and out of at time , so that (5c–5d) are tied to (5a–5b). New genotypes evolve at a rate , and a genealogy of the th antigenic unit is generated by recording the emergence times of each genotype along with their kinships. The branch length between offspring and parental genotype is always one. After extinct genotypes are pruned, the branch length between two genotypes gives the number of nucleotide substitutions between them. These genealogies are concatenated by connecting the root genotype with a genotype of the parental antigenic unit that is randomly drawn according to genotype frequencies at time Inline graphic . The residual selection parameter accounts phenomenologically for selection pressures between genetic variants that are evident from the shape of the virus phylogeny, but remain unexplained by a particular ecological model of antigenic variants (5a–5b). Once the distribution of has been inferred from population incidence and genetic data, we can then quantify how well a phylodynamic model describes patterns of continual immune selection mechanistically, and also compare alternative phylodynamic models in this respect.

Stochastic process model

To account for demographic stochasticity, Markov transition probabilites are derived from (5), assuming that the per capita rates are constant over a small time interval Inline graphic , and that transitions out of any state are independent and multinomially distributed. Generally, consider a state and all per capita rates out of . The state transitions out of into states ,, are

where Inline graphic is the total number of individuals leaving at time .

For the application to H3N2 phylodynamics, simulations were started in Inline graphic at the disease equilibrium of a single antigenic variant and generated under a multinomial Euler scheme with days. After the simulations of the first tier completed, the corresponding phylogeny was simulated based on the flows in and out of the prevalence compartments [7]. Simulated data were recorded after Inline graphic to match the time range of the observed summaries in Figure 1. We do not estimate the initial conditions of the state variables and assume that by 1990, the phylodynamic processes do not depend any longer on the initial values in 1968.

Stochastic observation model

To interface the two-tier model with observed case report data and phylogenies, we simulated reported incidence under a Poisson model with mean Inline graphic and drew a requested number of genotypes at specified sampling times without replacement according to the genotype frequencies at those times. Replacing the genotype emergence times with the corresponding sampling times and pruning non-sampled genotypes, we obtained a dated phylogeny with branch lengths encoding nucleotide substitution distances.

Inferring inclusion probabilities of model parameters

A frequent problem in phylodynamic modeling is to determine if a specific model parameter should be included. For example, it can be unclear which types of ecological interactions between antigenic variants underlie pathogen phylodynamics, or if the residual selection parameter Inline graphic in (5c–5d) is required in addition to a given ecological mechanism that induces immune selection. Following existing variable selection procedures [28], we use an additional indicator variable to denote whether a single model parameter is present () or absent () and estimate its posterior probability under equation (2). Here, we use a standard spike-and-slab variable selection procedure [28] to estimate inclusion probabilities of the residual selection parameter Inline graphic .

Results

Basic geographic framework for modeling H3N2 phylodynamics

To illustrate ABC methodology with the summaries in Table 1, we begin with a classical phenomenological model that implicitly accounts for antigenic drift through gradual loss of immunity [37]. H3N2 phylodynamics are represented with a spatial two-tier system of equations that is a special case of (5) when the antigenic emergence rate is set to

(6)

For simplicity, we will refer to (5) without antigenic variants as the SEIRS model.

Simulated data

We first tested ABC on simulated data generated under the SEIRS model and found that the subset Inline graphic , , , , , of model parameters can be reliably estimated with ABC tolerances that are smaller than those in Table 1 (see Text S1). Tigher tolerances on the population level attack rate contributed most to more reliable estimates of .

Parameter inference

The behavior of the spatial SEIRS model, when fitted to the case report and phylogenetic summaries in Table 1, is illustrated in Figure 3 with parameter estimates given in Table 2. On real data, the summary errors were considerably larger than on simulated data, so that the Inline graphic could not be used. Instead, we chose ABC tolerances with a data-driven approach that compares summary errors across different empirical data sets (see the Methods section and Table 1). Overall, we can simultaneously infer the epidemiological and evolutionary parameters , , , , , , . As shown in Figure 3A, the MCMC algorithm may get occasionally stuck in the tails of the target density (see Text S1 for further discussion). The posterior mean and standard deviation of Inline graphic , , are relatively large in comparison to estimates from previous studies [36], [38]–[40], and is positively correlated with the average duration of infection to yield realistic incidence time series. We back-calculated the effective reproductive number from the prevalence growth rate at the beginning of each season (see Text S1), and find that many combinations of Inline graphic and give a tight mean posterior in agreement with these studies. In the absence of any ecological mechanisms inducing strain competition, the mean residual selection parameter is large and always included in the SEIRS model. Generally, the sequence divergence imposes negative correlations between Inline graphic , (Figure 3E) and , (Figure 3F), and the posterior mean mutation rate /genome/year is much smaller than H3N2's substitution rate, 5.3–6.1/genome/year, when selection is incorporated into the model. Figure 4 illustrates that the fraction of susceptible individuals ranges within 15–40% and changes smoothly under seasonal forcing, thus leading to sustained oscillations in disease incidence. We failed to estimate Inline graphic , , , and recovered distributions close to the prior. Our prior assumptions are summarised in Table 2 and more fully discussed in Text S1.

(A–C) MCMC trajectories of the estimated , the calculated , and the TMRCA summary error of four chains that were started at overdispersed starting values (see Methods). Samples before iteration 1000 were discarded. (D–F) Two-dimensional histograms of parts of the ABC fit, illustrating the correlations between the estimated parameter pairs (, ), (, ) and (,). Throughout, histograms were computed from all samples across the four chains after burn-in. Color codings are separate for each subplot, with respective density values indicated in the contours. (G–I) Two-dimensional histograms of parts of the joint density of summary errors, illustrating goodness-of-fit with respect to the *correlation* and interannual variability of the case report data, as well as the *divergence*, *diversity* and the *TMRCA*'s of the HA phylogeny.

(A–B) Population-level weakly incidence in the sink and source population, respectively. (C) Corresponding weekly time series of the percentage of susceptible individuals in the sink population. (D) Simulated H3N2 weekly surveillance time series in the sink population (blue) and reconstructed H3N2 time series in the Netherlands (black). (E) Simulated and observed case report seasonal attack rates, and (F) autocorrelation function of case report peaks. Typically, simulations under the fitted model show sustained oscillations that follow a clear biennial pattern. (G) Simulated HA phylogeny under a large, estimated residual selection parameter. (H) Simulated and observed lineage profile, and (I) simulated and observed time series of the time to the most recent common ancestor of extant phylogenetic lineages. Despite a relatively high selection parameter, the number of lineages and the time to the most recent common ancestor are overall too high when compared to data. Model parameters are , , , , , , , , , , , , .

Sensitivity of parameter estimates to phylodynamic summaries

The extent to which phylodynamic parameters can be estimated depends mainly on the type of information that underlies the ABC summaries. As described more fully in Text S1, a broad range of epidemiological parameters are quantitatively consistent with summaries of the H3N2 case report data in Figure 1A because variable reporting rates can mask the true extent of population incidence when immunity is not permanent [19]. Detailed studies of closely monitored populations and serological data suggest interpandemic seasonal H3N2 attack rates between 10–20% in temperate regions [2], and we found that conditioning on a broad window of maximum seasonal population incidence attack rates (pop-attack) between 15–30% ensures that key epidemiological parameters can be well estimated (Figure S4 in Text S1).

Moreover, while the sequence divergence and diversity are standard descriptors of viral phylogenies [1], we found that they are not sufficient to infer the size of the source population Inline graphic when the mutation rate and the residual selection parameter are simultaneously estimated. Considering the narrowness of the phylogeny in terms of the number of circulating lineages, we could estimate the source population size (Figures S5–6 in Text S1). We can use the number of lineages despite their dependence on sampling effort because with ABC, we are free to sample simulated sequences exactly as in the observed data set, see the Methods section. Finally, the time to the most recent common ancestor (TMRCA) links the evolutionary dynamics with the ecological interactions between antigenically distinct viral variants because weak selective advantages invariably lead to coexistence and deep phylogenies in the face of high Inline graphic and weak . In the absence of sufficiently strong ecological interactions, the TMRCA's favor a larger residual selection parameter (Figure S7 in Text S1).

Goodness of fit

The summary errors reveal that the SEIRS model fails to reproduce the irregular interannual variability in winter season epidemics, and the narrowness and limited diversity of the HA phylogeny despite large Inline graphic (Figure 3G–I). However, the model can reproduce H3N2's high divergence rate. This is not the case for the SEIRS model without a separate, weakly seasonally forced source population (see Text S1).

Epochal evolution model of H3N2 phylodynamics

While several models have been able to simulate phylodynamics that are consistent with some aspects of the observed data, most notably the ladder-like phylogeny of H3N2's haemagglutinin gene [4], [5], [41], none have been quantitatively fitted and tested against a set of epidemiological and molecular genetic features such as those in Figure 1. Here, we focus on the epochal evolution model as formulated in [7] within the above spatial framework, which is identical to (5) when antigenic variants are interpreted as major antigenic clusters. To fit (5) to the serial replacement of 11 major antigenic clusters within 1968–2002 [31], we define an antigenic cluster as any antigenic unit that survives for at least Inline graphic years and use the summaries in Table 1 as well as the number of antigenic clusters generated in 1968–2002 (nclust). Following [7], the emergence rate is set to increase with age,

(7)

Inline graphic , and the scaling parameter is estimated. For simplicity, we refer to (5) with this antigenic emergence rate and an antigenic resolution that is determined by nclust as the epochal evolution model.