Abstract
While influenza virus diversity and antigenic drift have been well characterized on a global scale, the factors that influence the virus’ rapid evolution within and between human hosts are less clear. Given the modest effectiveness of seasonal vaccination, vaccine-induced antibody responses could serve as a potent selective pressure for novel influenza variants at the individual or community level. We used next generation sequencing of patient-derived viruses from a randomized, placebo-controlled trial of vaccine efficacy to characterize the diversity of influenza A virus and to define the impact of vaccine-induced immunity on within-host populations. Importantly, this study design allowed us to isolate the impact of vaccination while still studying natural infection. We used pre-season hemagglutination inhibition and neuraminidase inhibition titers to quantify vaccine-induced immunity directly and to assess its impact on intrahost populations. We identified 166 cases of H3N2 influenza over 3 seasons and 5119 person-years. We obtained whole genome sequence data for 119 samples and used a stringent and empirically validated analysis pipeline to identify intrahost single nucleotide variants at ≥1% frequency. Phylogenetic analysis of consensus hemagglutinin and neuraminidase sequences showed no stratification by pre-season HAI and NAI titer, respectively. In our study population, we found that the vast majority of intrahost single nucleotide variants were rare and that very few were found in more than one individual. Most samples had fewer than 15 single nucleotide variants across the entire genome, and the level of diversity did not significantly vary with day of sampling, vaccination status, or pre-season antibody titer. Contrary to what has been suggested in experimental systems, our data indicate that seasonal influenza vaccination has little impact on intrahost diversity in natural infection and that vaccine-induced immunity may be only a minor contributor to antigenic drift at local scales.
Author summary
Influenza is a significant global health problem. Vaccination is the best way to prevent influenza virus infection, and seasonal influenza vaccines are considered for reformulation each year in order to keep up with the virus’ evolution. Despite these efforts, vaccine recipients often develop an immune response that does not protect from infection. Given the current recommendation that all people over 6 months of age get vaccinated, it is important to understand how vaccination itself may impact viral evolution during natural human infection. We studied how vaccination may alter viral evolution within individuals, as each person harbors many highly-related influenza variants that differ in their ability to escape the immune response. We compared groups of people in a vaccine trial to determine the impact that vaccination has on viral diversity and variant selection within individuals. We did not detect significant differences in the number of variants detected or in the prevalence of mutations that could impact antibody binding based on vaccination group or antibody response. Our work suggests that vaccination is not a major factor in driving the emergence of new influenza strains at the level of the individual host.
Introduction
Despite recommendations for universal influenza vaccination and the ample availability of vaccines in the United States, influenza continues to cause significant morbidity and mortality [1]. This is, in part, a result of the modest effectiveness of current vaccines, so that considerable numbers of vaccine failures occur each year. Within individuals, influenza populations exist as a collection of closely related, and at times antigenically distinct, variants that may exhibit diverse phenotypes [2–6]. Intrahost single nucleotide variants (iSNV) can be transmitted as part of the infecting population [5,7–10] or generated over the course of an infection due to the virus’ low replication fidelity [11,12]. The evolutionary forces that shape the genetic structure of viral populations within hosts and ultimately give rise to novel antigenic variants at the host population level are poorly understood. A clear understanding of the intrahost diversity of influenza virus populations and its impact on influenza virus evolution is central to many questions of direct clinical and public health relevance [13].
Influenza vaccines are considered for reformulation each year to counter the viral antigenic drift that enables escape from the previous year’s vaccine [14]. Annual influenza vaccine effectiveness is 60% on average, and can be much lower during antigenically unmatched years [15,16]. While antigenic drift is monitored annually on a global scale, the source of antigenic variation is ultimately at the level of the individual host. Phylogenetic studies of whole genome sequences from cities and smaller communities have demonstrated that multiple lineages circulate over the course of a single influenza season [2,17], and individual hosts may harbor mixed infections that include antigenically novel variants [3,4]. While human hosts could be preferentially infected with one lineage over another based on pre-infection immune status, the degree to which circulating escape variants contribute to vaccine failure is currently unknown.
Host immune selection is a major driver of influenza virus evolution on the global scale. Both phylogenetic analysis and antigenic cartography have demonstrated that antibodies exert positive selective pressure on the viral hemagglutinin (HA) and neuraminidase (NA) proteins [18,19]. Vaccination and natural influenza infection often lead to partial, or non-sterilizing immunity, and post-vaccination antibody titers are only a moderate predictor of subsequent protection [20,21]. Previous work has demonstrated that sub-neutralizing concentrations of immune sera can promote the generation of antigenic variants, and some have argued that vaccination can accelerate the process of antigenic drift [22]. A recent study in vaccinated people suggested that novel antigenic variants could be present at low frequencies [23]. Importantly, humans often differ in their prior exposure to influenza viruses and vaccines, and pre-existing immunity may confound such studies [24]. Therefore, the extent to which partial immunity selects for antigenically relevant variants during natural infection in humans is unclear.
By necessity, most of the available data on vaccination and intrahost evolution have come from analyses of HA sequences in large animal models of infection, including horses, pigs, and dogs. These studies have suggested that intrahost populations include a number of somewhat rare single nucleotide variants that increase and decrease in frequency over the course an infection with sporadic fixation events occurring in some animals. The overall impact of vaccination on antigenic diversification was not clear [7,25–27], and there were differences between experimentally and naturally infected animals [7,8,26]. These differences are also likely to be an issue in extrapolating results from human experimental challenge models [28].
Some have suggested that high intrahost diversity reflects increased viral fitness, and mechanisms that alter intrahost diversity may impact evolutionary trajectories [29–31]. If the transmission bottleneck is sufficiently wide, low frequency variants that arise within a host can plausibly be transmitted and spread through host populations [5,10]. Understanding how intrahost diversity is generated and maintained and the extent to which host immune status impacts this diversity may be important for defining influenza virus’ larger evolutionary patterns [23,26,27].
Here we used next generation sequencing to define the impact of vaccine-induced immunity on the intrahost diversity of influenza virus during natural infection. We specifically asked: (i) whether influenza viruses in vaccinated individuals represent escape variants, (ii) whether novel antigenic variants are found in hosts with non-sterilizing immunity, and (iii) the degree to which vaccine-induced immunity impacts the overall diversity of intrahost populations. Because we analyzed influenza populations from individuals enrolled in a randomized, double-blind, placebo-controlled trial of influenza vaccine efficacy [32–34], we were uniquely positioned to define this aspect of immunity to natural infection.
Results
Study subjects and specimens
We utilized influenza A RT-PCR positive throat swab samples from a randomized, double-blind, placebo-controlled study of vaccine efficacy that took place during the 2004–2008 influenza seasons at six study sites in Michigan [32–34]. This trial measured vaccine efficacy of both the trivalent inactivated (IIV) and live attenuated influenza vaccine (LAIV) compared to placebo and each other. We sequenced patient-derived influenza populations without culturing from three seasons: 2004–2005, 2005–2006, and 2007–2008. The 2006–2007 influenza season did not have enough influenza-positive samples for our study (total n = 16). Influenza A (H3N2) strains dominated the other three seasons, and the circulating 2004–2005 virus was considered at the time to be only a modest mismatch with the vaccine strain. The other seasons were antigenically matched. The numbers of subjects each year were as follows: 2004–2005 season, 522 IIV, 519 LAIV, and 206 placebo [32]; 2005–2006 season, 867 IIV, 853 LAIV, and 338 placebo [33]; 2007–2008 season, 813 IIV, 814 LAIV, and 325 placebo [34]. Over these 5119 person-years of observation, 165 individuals had culture or RT-PCR confirmed influenza A (H3N2) infection and specimens available for analysis. Of these, 80 individuals had received LAIV, 42 had received IIV, and 43 had received placebo. For 2004–2005, flu-positive samples were available for 28 subjects: 7 IIV, 12 LAIV, and 9 placebo, and for 2005–2006, 32 samples were available to study: 13 IIV, 14 LAIV, and 5 placebo. During the 2007–2008 season, 105 flu-positive samples were available: 22 IIV, 54 LAIV, and 29 placebo. We were able to amplify and quantify genomes for 119 of the 165 influenza-positive samples (Table 1). The average age of this sequenced cohort from all years was 24.5, indicating that participants were generally young and likely shared similar influenza pre-exposure histories particularly after randomization. The age, sex and race of the cohort were similar to that of the overall study cohort for each of the 3 seasons.
Table 1. Samples analyzed over three FLU-VACS seasons.
2004–2005 | 2005–2006 | 2007–2008 | |
---|---|---|---|
H3N2 Vaccine Strain | Wyoming/3/2003 | California/7/2004 | Wisconsin/67/2005 |
Circulating Strain(s) | Wyoming/3/2003 | California/7/2004 | Wisconsin/67/2005 |
California/7/2004 | Wisconsin/67/2005 | Brisbane/10/2007 | |
Demographics | |||
Average Age (Years) | 30.0 | 28.8 | 22.8 |
Sex (% Female) | 71.4% | 83.3% | 70.1% |
Race (% White) | 92.9% | 94.4% | 86.2% |
Sequenced Samples | 14 | 18 | 87 |
IIV | 4 | 9 | 16 |
LAIV | 5 | 6 | 44 |
Placebo | 5 | 3 | 27 |
Intrahost SNV Data | 10 | 9 | 64 |
IIV | 3 | 6 | 11 |
LAIV | 4 | 1 | 30 |
Placebo | 3 | 2 | 23 |
Despite subject randomization, differences in pre-existing immunity due to prior vaccination or influenza infection could provide a dominant immune background that is not significantly altered by vaccination against that season’s strain. In the 2007–2008 season, 50.4% of individuals in the larger cohort and 44.3% in the sequenced cohort reported having ever received a prior influenza vaccination. To obtain a more reliable metric of strain-specific immunity, we measured pre-season antibody titers by hemagglutination inhibition (HAI) and neuraminidase inhibition (NAI) assays for all study participants against that season’s vaccine strain (S1 Fig). Vaccine-induced antibody titers and overall vaccine efficacy are generally stable for a single flu season [35]. Therefore, our pre-season HAI and NAI titers are likely to be similar to titers at the time of infection. Pre-season (post-vaccination) titers for individuals in the IIV group were above the geometric mean for the entire sequenced cohort, LAIV subjects had titers spanning the mean, and those in the placebo group were generally below the mean for all seasons. Since an HAI titer of 40 is typically considered to be associated with 50% protection given exposure [20,21,36], these data demonstrate that in the IIV group, and to a lesser degree, the LAIV group, individuals had strain-specific antibody levels that are sufficient to apply selective pressure against the infecting virus.
Viral load of sequenced samples
We have previously shown that viral load influences the sensitivity and specificity of iSNV detection [6]. In order to determine whether viral load was different among the IIV, LAIV, and placebo samples that we sequenced, we measured genome copy number by RT-qPCR for the 2004–2005, 2005–2006, and 2007–2008 seasons. For the 2007–2008 season, which had the most samples, there were no significant differences in copy number by vaccination group (Fig 1A). In agreement with the 2007–2008 data, we did not detect differences in copy number by vaccination group for the 2004–2005 and 2005–2006 seasons (S2 Fig). Since copy number is dependent on time from illness onset [37,38], we analyzed the data based on sample collection day (Fig 1B). Using days 2–4, for which there were at least 5 data points for each treatment group, we did not find any significant differences (p = 0.24–0.57 for days 2–4, non-parametric one way ANOVA). We divided the larger group of 2007–2008 subjects into groups based on pre-season HAI and NAI titers ≥40 or <40 against that season’s strain. This cutoff was identical to the HAI and NAI geometric mean titers for our sequenced cohort (61.9 and 34.5, respectively), given the dilutions used. We did not detect differences in copy number based on HAI or NAI titer (Fig 1C and 1D), even when accounting for day of symptom onset (p = 0.25 for HAI, p = 0.97 for NAI, Mann-Whitney U test). Because we only measured copy number in the subset of virus populations that were amplified and sequenced, these data should not be interpreted in the context of vaccination and overall shedding [39].
Deep sequencing of intrahost influenza populations
We used the Illumina platform to determine the whole genome consensus sequence and to identify intrahost single nucleotide variants for each patient-derived sample. Importantly, we have developed and rigorously benchmarked a variant calling pipeline that maintains high sensitivity for rare iSNV detection while dramatically reducing false positive variant calls [6] (S1 Table). We have found that the number of false positive iSNV calls is much higher in samples with genome copy numbers <103 per μl of transport media; therefore, we only report iSNV from a high quality dataset that includes 64 samples from the 2007–2008 season. High quality iSNV data for the 2004–2005 and 2005–2006 influenza seasons are included in Supplementary Figures and Tables. Our libraries yielded an average coverage above 20,000 reads per base with even coverage across the coding region of all segments for each season (S3 Fig).
HA and NA sequences do not cluster by vaccination status or pre-season antibody titer
Given the frequent observation of community-level diversity in circulating influenza viruses [2,17], we asked whether vaccinated individuals were infected with distinct strains relative to the placebo group. If vaccine failures were due to infection with an antigenically distinct variant, we would expect to see evidence of clustering by vaccination- or sero-status in HA and NA phylogenetic trees. We therefore analyzed HA and NA consensus sequences from all 87 individuals in the 2007–2008 season (Fig 2). There was very little diversity in either gene, and we found that sequences from individuals in each treatment group (e.g. IIV, LAIV, or placebo) were dispersed throughout the tree. More importantly, we found no evidence for clustering based on pre-season HAI and NAI titer (by colors in Fig 2, titers indicated at tips). We obtained similar results from the 2004–2005 and 2005–2006 seasons, albeit with fewer sequenced samples (S4 Fig). These data suggest that within-season and within-host antigenic drift due to higher levels of vaccine-induced antibodies (HAI or NAI >40) were not major determinants of vaccine failure in these seasons.
Intrahost diversity in vaccinated and unvaccinated individuals
We next analyzed the iSNVs present in 64 of our samples from the 2007–2008 season. We identified 360 minority variants across the entire genome, most of which were present at a frequency of <0.1 (Fig 3A). We did not observe many samples with a large number of higher frequency iSNV, which suggests that there were few, if any, mixed lineage infections in the samples from this season. The vast majority of iSNV were only found once (Fig 3B). We also evaluated whether partial immunity impacts viral diversity by comparing the number of iSNVs per sample in the HA and NA genes based on HAI and NAI titers, respectively. We did not observe a difference in iSNV count based on HAI and NAI titers for either HA or NA (Fig 3C and 3D, p = 0.20 for HA, p = 0.26 for NA, Mann Whitney U test; see also S5 Fig for iSNV stratified by titer). The average number of iSNV per sample was similar across the genome regardless of host treatment group (Table 2, p = 0.35, non-parametric one way ANOVA) or HAI and NAI titers (S6 Fig, p = 0.13 for HAI, p = 0.22 for NAI, Mann Whitney U test).
Table 2. Number of iSNV (mean ± interquartile range) by segment and treatment group for samples from the 2007–2008 season.
Segment | IIV (n = 11) | LAIV (n = 30) | Placebo (n = 23) |
---|---|---|---|
1 (PB2) | 1.91 ± (2) | 1.50 ± (1) | 1.57 ± (1) |
2 (PB1) | 0.82 ± (1.5) | 0.60 ± (1) | 0.70 ± (1) |
3 (PA) | 1.64 ± (1) | 1.57 ± (1) | 1.65 ± (1) |
4 (HA) | 0.91 ± (1.5) | 0.47 ± (1) | 0.61 ± (1) |
5 (NP) | 0.64 ± (1) | 0.27 ± (0) | 0.30 ± (0) |
6 (NA) | 0.27 ± (0.5) | 0.23 ± (0) | 0.22 ± (0) |
7 (M) | 0.09 ± (0) | 0.27 ± (0) | 0.35 ± (1) |
8 (NS) | 0.36 ± (1) | 0.23 ± (0) | 0.39 ± (1) |
Most iSNV from the 2004–2005 and 2005–2006 seasons were also found in only one sample each and at frequencies <0.1 (S7 Fig). The number of iSNV did not differ based on treatment group or HAI and NAI titer (S2 Table). We did not identify any variants specifically associated with vaccination group or HAI and NAI titers for any of the influenza seasons analyzed. Unlike experimental challenge studies, we only had one sample per person and could not evaluate changes in diversity at the level of the individual host. However, we did not identify any significant differences in diversity by day of infection across the cohort by vaccination status (Fig 4, p = 0.16–0.82 for days 2–4, non-parametric one way ANOVA) or antibody titer. We identified only a marginal difference in the number iSNV across the genome based on HAI titer on day 2 (uncorrected p-value 0.02 with >6 comparisons, Mann Whitney U test). These data suggest that our results are unlikely to be confounded by temporal sampling issues.
Evidence for low antigenic diversity within hosts
Some have suggested that vaccine-induced immunity will select for novel antigenic variants within hosts [23]. We therefore compared iSNV in the HA gene by vaccination group and serostatus. Of the 17 variants we identified that resulted in nonsynonymous changes within HA, 11 were in HA1 and 6 were in HA2 (Fig 5, S3 Table). Five antigenic sites, comprising 131 amino acid positions, have been described in HA1 for H3N2 viruses [40–42]. Six of the HA1 variants identified in our study were located in antigenic sites C and D (Fig 5B), one of which was found in two samples (S3 Table). Of these potential antigenic variants, three were found in vaccinated individuals (1 IIV, 2 LAIV) and four were found in those in the placebo group. When grouped by HAI titer, four were found in samples from individuals with titers ≥40, while three were found in individuals with titers <40. No variants were specific to vaccinated individuals, and antigenic diversity was similar across all groups. None of the identified mutations were observed in subsequent circulating H3N2 strains. There were no nonsynonymous iSNV in HA from the 2004–2005 samples and just 7 identified in individuals from the 2005–2006 season. Of these 7, N225D was observed in strains dominating prior, but not subsequent seasons.
Discussion
We set out to define the relationship between vaccine-induced immunity and the intrahost diversity of influenza virus. We hypothesized that non-sterilizing immunity could potentially select for novel antigenic variants and contribute to larger scale patterns of influenza evolution. We were able to definitively address these questions using samples derived from a randomized, placebo-controlled vaccine trial in healthy, young adults who likely had similar prior exposure to influenza viruses [32,34]. The availability of post-vaccination, preseason HAI and NAI titers allowed us to examine directly the impact of measured serologic antibody pressure rather than using vaccination alone as a surrogate marker. Because all individuals were infected naturally, our data provide a rare view of within-host influenza virus diversity in humans. We directly sequenced the samples without passage in cell culture, eliminating the possibility of culture-adapted mutations and employed a well-benchmarked variant calling pipeline [6] that dramatically reduces the false positive iSNV calls that often plague next generation sequencing studies. In this exhaustive and well-controlled study, we found no differences in intrahost influenza diversity based on vaccination status or HAI and NAI titers.
Our findings in a natural infection system are concordant with an equine influenza virus evolution study in vaccinated horses. Intrahost variation was similar between naïve and vaccinated horses, regardless of whether they were infected naturally or experimentally [27]. However, not all experimental infection studies mirror results seen during natural infection. A study investigating swine influenza virus found discrepancies in intrahost variation based on whether animals encountered natural infection or were experimentally infected [26]. Our data is in contrast with a study of experimentally-infected dogs that uncovered differences in intrahost diversity and evolution in antigenic sites based on vaccination status [25]. Two other studies of equine influenza virus found mixed infections of multiple influenza lineages during natural infection, which would not be seen in an experimental model but may be relevant to the transmission and spread of novel variants [7,8]. Overall, it is challenging to compare our results to those obtained in various animal models, where the hosts are often immunologically naïve and infected with a defined inoculum of a single genotype.
We did not detect phylogenetic clustering of HA and NA sequences based on vaccination status, type of administered vaccine, or pre-season HAI and NAI titers. These results are consistent with those of Dinis et al., who found no segregation of HA sequences based on vaccination status in a case test-negative study of vaccine effectiveness [23]. Together, these data suggest that vaccinated and unvaccinated individuals are infected with similar strains and that within season antigenic drift is not a major contributor to reduced vaccine efficacy in the seasons analyzed. Because we also stratified our analysis by pre-season HAI and NAI titer, we can similarly exclude viral escape from a non-sterilizing antibody response. Our data further suggest that pre-existing antibody against circulating strains does not apply sufficiently strong selective pressure to drive the emergence of antigenically distinct strains within a given host.
High intrahost diversity may be an important factor in viral evolution, since it increases the number of novel variants on which natural selection can act. Some have proposed that the intrahost diversity of RNA viruses is linked to virulence [29,30], suggesting that processes that act to restrict or enhance intrahost diversity may alter disease phenotypes. We found that within host diversity of influenza virus was quite low. Most iSNV were present at frequencies of less than 0.1, which means they would only plausibly be spread between hosts if the transmission bottleneck were reasonably large [5]. The number of iSNV in a given host was similar between vaccine and placebo groups, both across the genome and on the segments coding for HA and NA. Furthermore, there were no significant differences in diversity when samples were grouped by HAI and NAI titer, and we did not find evidence of treatment-specific iSNV. Together our data suggest that vaccine-induced immunity does not significantly influence intrahost diversity and is a relatively weak selective pressure at the level of the individual host.
While the number of iSNV was similar in all groups, we considered it possible that partial immunity may drive the emergence of specific antigenic variants. Vaccination and natural infection can induce a wide range of immune responses depending on host and viral factors. These can range from complete protection to leaky responses that allow infection, but influence disease severity or duration. Non-sterilizing immune responses have the potential to select for escape variants within each host. In the setting of current recommendations for universal vaccination, a highly vaccinated population could potentially select for antigenically evolved viruses more quickly than the spread of natural infection. For this reason, it is important to tease apart the role of vaccination on influenza evolution. We did find a number of variants in antigenic sites, but these were no more frequent than in other regions of the genome and did not vary with vaccination status or HAI and NAI titer. Importantly, our results are in conflict with earlier work in several model systems that demonstrate the rapid selection of antigenic variants in the presence of sub-neutralizing antibody or in experimentally infected animals [26,27,43–45]. This discrepancy may be due to differences in infectious dose, host genetic background, history of prior influenza infection, immune correlates not captured by HAI and NAI titers, or the strains tested. Animal studies are often performed in immunologically naïve or genetically identical animals, whereas humans have complex genetic backgrounds and immunological histories that could play a role in mediating population-wide immunity.
While global patterns of influenza transmission and evolution are complex, our study provides important insights regarding influenza evolution on intra- and interhost scales. We were able to define aspects of intrahost evolution within a geographically-constrained and relatively young cohort with similar vaccination histories and previous influenza exposures across groups, limiting potential confounding factors. Still, we acknowledge that by reducing these confounders, we may be missing important determinants of intrahost evolution. For example, intrahost diversity may be different in children, older adults, or those with high-risk conditions. Our study included 5119 person-years of observations to yield a dataset of 165 viruses, 119 of which were sequenced. Large, placebo-controlled, randomized influenza vaccine trials involving thousands of people are unlikely to be conducted in the future due to the recommendation for universal vaccination. Therefore, our sample set likely represents the best chance to directly assess the impact of vaccination on influenza evolution in the context of natural human infection.
Despite our well-controlled study, we found little evidence for vaccine-driven evolution in the context of the community that was sampled. None of the low frequency variants identified in the antigenic sites were found in subsequent seasons. While we cannot rule out the possibility that evolutionary patterns would be different in other geographic regions, different influenza seasons, or with other subtypes, we did not uncover differences between vaccinated and unvaccinated populations with respect to genome-wide or antigenic diversity in three H3N2 seasons. We were not able to evaluate the impact of vaccination on H1N1 or subtype B viruses. Furthermore, our sample set cannot discern whether there are differences in evolutionary pressure based on vaccine match/mismatch, as the vast majority of our samples were from antigenically matched seasons. A major limitation of our study was that only one sample was available for each participant, so we could not track changes in diversity or mutation accumulation within each individual over the course of their infection. However, we did not observe significant differences in the number of mutations over the first 6 days of infection, which is consistent with previous work in horses [27] and a deep sequencing study of 7 humans in an experimental challenge model [46]. Together with these works, our data suggest that within host dynamics are dominated by purifying selection with the transient appearance of minority variants and little sustained fixation. Selection and transmission of antigenically and epidemiologically important variants is likely to be a rare event when studied at this scale. Given that our data are derived from 165 incident infections in 5119 season-years of observation, detection of such events in the course of a natural infection would require an unrealistic sample size.
We evaluated the potential for vaccine-induced immunity to drive intrahost evolution, as this is an important issue in light of the current recommendation for universal influenza vaccination. We did not find evidence for vaccine-induced pressure on the intrahost or consensus level, despite employing several methods used to study evolutionary processes. Our study is larger than other reports and involves extensive analysis of both placebo and vaccination groups. While randomization, placebo control, and reliance on a young healthy population allowed us to address this question in a rigorous manner, these factors may have lessened person-to-person variation. By better defining the intrahost evolutionary mechanisms and how they impact population-wide influenza evolution, we hope to address questions of import to clinicians and public health workers, improve vaccine design, and develop more efficient epidemiological control measures.
Materials and methods
Ethics statement
This study was approved by the Institutional Review Board of the University of Michigan Medical School, and all human subjects provided informed consent.
Subjects and specimens
We characterized host-derived influenza populations archived from a randomized, double-blind, placebo-controlled, clinical trial of influenza vaccine efficacy that ran from the 2004–2005 through the 2007–2008 influenza seasons (ClinicalTrials.gov number, NCT00133523, [32–34]). Each year, healthy adults, ages 18–49, were randomized to receive trivalent inactivated influenza vaccine (IIV), live attenuated influenza vaccine (LAIV), or placebo. Throat swab specimens were collected from individuals with influenza-like illness within 7 days of onset; residual specimen material was stored in veal infusion broth (VIB) at -80°C. Viral RNA was extracted from 140μl of VIB using the QIAamp viral RNA mini kit (Qiagen 52906), eluted in 50μl buffer, and stored at -80°C. Hemagglutination inhibition (HAI) and Neuraminidase agglutination inhibition (NAI) titers for subjects in this study were previously measured and reported in [20,47].
Determination of genome copy number
Quantitative reverse transcription polymerase chain reaction (RT-qPCR) was performed on 5μl RNA from each sample using CDC RT-PCR primers InfA Forward, InfA Reverse, and InfA probe, which bind to a portion of the influenza M gene (CDC protocol, 28 April 2009). Each reaction contained 5.4μl nuclease-free water, 0.5μl each primer/probe, 0.5μl SuperScript III RT/Platinum Taq mix (Invitrogen 111732) 12.5μl PCR Master Mix, 0.1μl ROX, 5μl RNA. The PCR master mix was thawed and stored at 4°C, 24 hours before reaction set-up. A standard curve relating copy number to Ct values was generated based on 10-fold dilutions of a control plasmid run in duplicate.
Illumina library preparation and sequencing
We amplified cDNA corresponding to all 8 genomic segments from 3μl of the viral RNA using the SuperScript III One-Step RT-PCR Platinum Taq HiFi Kit (Invitrogen 12574). Reactions consisted of 0.5μl Superscript III Platinum Taq Mix, 12.5μl 2x reaction buffer, 8μl DEPC water, and 0.2μl of 10μM Uni12/Inf1, 0.3μl of 10μM Uni12/Inf3, and 0.5μl of 10μM Uni13/Inf1 universal influenza A primers [48]. The thermocycler protocol was: 42°C for 60 min then 94°C for 2 min then 5 cycles of 94°C for 30 sec, 44°C for 30 sec, 68°C for 3 min, then 28 cycles of 94°C for 30 sec, 57°C for 30 sec, 68°C for 3 min. Amplification of all 8 segments was confirmed by gel electrophoresis, and 750ng of each cDNA mixture were sheared to an average size of 300 to 400bp using a Covaris S220 focused ultrasonicator. Sequencing libraries were prepared using the NEBNext Ultra DNA library prep kit (NEB E7370L), Agencourt AMPure XP beads (Beckman Coulter A63881), and NEBNext multiplex oligonucleotides for Illumina (NEB E7600S). The final concentration of each barcoded library was determined by Quanti PicoGreen dsDNA quantification (ThermoFisher Scientific), and equal nanomolar concentrations were pooled. Residual primer dimers were removed by gel isolation of a 300-500bp band, which was purified using a GeneJet Gel Extraction Kit (ThermoFisher Scientific). Purified library pools were sequenced on an Illumina HiSeq 2500 with 2x125 nucleotide paired end reads. All raw sequence data have been deposited at the NCBI sequence read archive (BioProject submission ID: SUB1907046)
Variant detection
Sequencing reads that passed standard Illumina quality control filters were binned by index and aligned to the reference genome using Bowtie [49]. Single nucleotide variants (SNV) were identified and analyzed using DeepSNV [50], which relies on a clonal control to estimate the local error rate within a given sequence context and to identify strand bias in base calling. The clonal control was a library prepared in an identical fashion from 8 plasmids containing the genome for the respective circulating reference strain and sequenced in the same flow cell to control for batch effects. True positive SNV were identified from the raw output tables by applying the following filtering criteria in R: (i) Bonferonni corrected p value <0.01, (ii) average MapQ score on variant reads >30, (iii) average phred score on variant positions >35, (iv) average position of variant call on a read >32 and <94, (v) variant frequency >0.01. We only considered SNV identified in a single RT-PCR reaction and sequencing library for samples with copy number ≥105 genomes/μl transport media or in two separate RT-PCR reactions and sequencing libraries for samples with copy number 103−105 genomes per μl. For variants at a frequency of 50–99%, we called the minority base (i.e. 1–50%) as an iSNV. Our strategy for variant calling is described in [6] and all code can be found at https://github.com/lauringlab/variant_pipeline.
Phylogenetic analysis
Consensus nucleotide sequences for the HA and NA proteins were aligned using MUSCLE [51]. The best-fit models for nucleotide substitution was identified using jModelTest v2.1.10 [52]. Maximum likelihood phylogenetic trees were generated using RAxML v8 [53] with a GTRGAMMA model, Genbank sequences for vaccine strains as outgroups, and 1000 bootstraps. Trees were visualized and annotated using FigTree (v1.4.2).
Data analysis and statistics
All statistical analyses were performed using Prism 6 and R. Description of the analysis and annotated code are available at https://github.com/lauringlab/fluvacs_paper. HA structural models were generated and visualized with PyMol.
Supporting information
Acknowledgments
We thank Suzanne Ohmit, Ryan Malosh, Emily Toth Martin, and Robert Woods for helpful discussion.
Data Availability
All code is available at https://github.com/lauringlab/variant_pipeline and https://github.com/lauringlab/fluvacs_paper.
Funding Statement
This work was supported by a Clinician Scientist Development Award from the Doris Duke Charitable Foundation (CSDA 2013105) and R01 AI118886, both to ASL, and by U01 AI057853 and an unrestricted grant from Sanofi Pasteur, both to ASM. KD was supported by the Molecular Mechanisms in Microbial Pathogenesis Training Program (T32 AI007528) and by the Postdoctoral Translational Scholars Program of the University of Michigan Institute for Clinical and Health Research (UL1TR000433). JTM was supported by the Michigan Predoctoral Training Program in Genetics (T32 GM007544). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.World Health Organization. Fact sheet No 211. 2014 [Internet]. 211AD. http://www.who.int/mediacentre/factsheets/fs211/en/
- 2.Ghedin E, Fitch A, Boyne A, Griesemer S, DePasse J, Bera J, et al. Mixed infection and the genesis of influenza virus diversity. J Virol. 2009;83: 8832–8841. 10.1128/JVI.00773-09 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Ghedin E, Laplante J, DePasse J, Wentworth DE, Santos RP, Lepow ML, et al. Deep sequencing reveals mixed infection with 2009 pandemic influenza A (H1N1) virus strains and the emergence of oseltamivir resistance. J INFECT DIS. 2011;203: 168–174. 10.1093/infdis/jiq040 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Ghedin E, Holmes EC, Depasse JV, Pinilla LT, Fitch A, Hamelin M-E, et al. Presence of Oseltamivir-Resistant Pandemic A/H1N1 Minor Variants Before Drug Therapy With Subsequent Selection and Transmission. J INFECT DIS. 2012;206: 1504–1511. 10.1093/infdis/jis571 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Poon LLM, Song T, Rosenfeld R, Lin X, Rogers MB, Zhou B, et al. Quantifying influenza virus diversity and transmission in humans. Nat Genet. 2016;48: 195–200. 10.1038/ng.3479 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.McCrone JT, Lauring AS. Measurements of Intrahost Viral Diversity Are Extremely Sensitive to Systematic Errors in Variant Calling. J Virol. 2016;90: 6884–6895. 10.1128/JVI.00667-16 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Murcia PR, Baillie GJ, Daly J, Elton D, Jervis C, Mumford JA, et al. Intra- and interhost evolutionary dynamics of equine influenza virus. J Virol. 2010;84: 6943–6954. 10.1128/JVI.00112-10 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Hughes J, Allen RC, Baguelin M, Hampson K, Baillie GJ, Elton D, et al. Transmission of equine influenza virus during an outbreak is characterized by frequent mixed infections and loose transmission bottlenecks. PLoS Pathog. 2012;8: e1003081 10.1371/journal.ppat.1003081 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Wilker PR, Dinis JM, Starrett G, Imai M, Hatta M, Nelson CW, et al. Selection on haemagglutinin imposes a bottleneck during mammalian transmission of reassortant H5N1 influenza viruses. Nature Communications. 2013;4: 2636 10.1038/ncomms3636 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Varble A, Albrecht RA, Backes S, Crumiller M, Bouvier NM, Sachs D, et al. Influenza A virus transmission bottlenecks are defined by infection route and recipient host. Cell Host and Microbe. 2014;16: 691–700. 10.1016/j.chom.2014.09.020 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Parvin JD, Moscona A, Pan WT, Leider JM, Palese P. Measurement of the mutation rates of animal viruses: influenza A virus and poliovirus type 1. J Virol. 1986;59: 377–383. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Sanjuán R, Nebot MR, Chirico N, Mansky LM, Belshaw R. Viral mutation rates. J Virol. 2010;84: 9733–9748. 10.1128/JVI.00694-10 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Holmes EC. RNA virus genomics: a world of possibilities. J Clin Invest. 2009;119: 2488–2495. 10.1172/JCI38050 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Centers for Disease Control and Prevention (U S). Selecting viruses for the seasonal influenza vaccine [Internet]. http://www.cdc.gov/flu/about/season/vaccine-selection.htm
- 15.Osterholm MT, Kelley NS, Sommer A, Belongia EA. Efficacy and effectiveness of influenza vaccines: a systematic review and meta-analysis. Lancet Infect Dis. 2012;12: 36–44. 10.1016/S1473-3099(11)70295-X [DOI] [PubMed] [Google Scholar]
- 16.Skowronski DM, Chambers C, Sabaiduc S, De Serres G, Winter A-L, Dickinson JA, et al. Integrated Sentinel Surveillance Linking Genetic, Antigenic, and Epidemiologic Monitoring of Influenza Vaccine-Virus Relatedness and Effectiveness During the 2013–2014 Influenza Season. J INFECT DIS. 2015;212: 726–739. 10.1093/infdis/jiv177 [DOI] [PubMed] [Google Scholar]
- 17.Holmes EC, Ghedin E, Halpin RA, Stockwell TB, Zhang X-Q, Fleming R, et al. Extensive geographical mixing of 2009 human H1N1 influenza A virus in a single university community. J Virol. 2011;85: 6923–6929. 10.1128/JVI.00438-11 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Smith DJ, Lapedes AS, de Jong JC, Bestebroer TM, Rimmelzwaan GF, Osterhaus ADME, et al. Mapping the antigenic and genetic evolution of influenza virus. Science. 2004;305: 371–376. 10.1126/science.1097211 [DOI] [PubMed] [Google Scholar]
- 19.Nelson MI, Holmes EC. The evolution of epidemic influenza. Nat Rev Genet. 2007;8: 196–205. 10.1038/nrg2053 [DOI] [PubMed] [Google Scholar]
- 20.Ohmit SE, Petrie JG, Cross RT, Johnson E, Monto AS. Influenza hemagglutination-inhibition antibody titer as a correlate of vaccine-induced protection. J INFECT DIS. 2011;204: 1879–1885. 10.1093/infdis/jir661 [DOI] [PubMed] [Google Scholar]
- 21.Tsang TK, Cauchemez S, Perera RAPM, Freeman G, Fang VJ, Ip DKM, et al. Association between antibody titers and protection against influenza virus infection within households. J INFECT DIS. 2014;210: 684–692. 10.1093/infdis/jiu186 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.ARCHETTI I, HORSFALL FL. Persistent antigenic variation of influenza A viruses after incomplete neutralization in ovo with heterologous immune serum. J Exp Med. 1950;92: 441–462. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Dinis JM, Florek NW, Fatola OO, Moncla LH, Mutschler JP, Charlier OK, et al. Deep Sequencing Reveals Potential Antigenic Variants at Low Frequencies in Influenza A Virus-Infected Humans. Schultz-Cherry S, editor. J Virol. 2016;90: 3355–3365. 10.1128/JVI.03248-15 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Fonville JM, Wilks SH, James SL, Fox A, Ventresca M, Aban M, et al. Antibody landscapes after influenza virus infection or vaccination. Science. 2014;346: 996–1000. 10.1126/science.1256427 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Hoelzer K, Murcia PR, Baillie GJ, Wood JLN, Metzger SM, Osterrieder N, et al. Intrahost evolutionary dynamics of canine influenza virus in naive and partially immune dogs. J Virol. 2010;84: 5329–5335. 10.1128/JVI.02469-09 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Murcia PR, Hughes J, Battista P, Lloyd L, Baillie GJ, Ramirez-Gonzalez RH, et al. Evolution of an Eurasian avian-like influenza virus in naïve and vaccinated pigs. PLoS Pathog. 2012;8: e1002730 10.1371/journal.ppat.1002730 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Murcia PR, Baillie GJ, Stack JC, Jervis C, Elton D, Mumford JA, et al. Evolution of equine influenza virus in vaccinated horses. J Virol. 2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Killingley B, Enstone J, Booy R, Hayward A, Oxford J, Ferguson N, et al. Potential role of human challenge studies for investigation of influenza transmission. Lancet Infect Dis. 2011;11: 879–886. 10.1016/S1473-3099(11)70142-6 [DOI] [PubMed] [Google Scholar]
- 29.Vignuzzi M, Stone JK, Arnold JJ, Cameron CE, Andino R. Quasispecies diversity determines pathogenesis through cooperative interactions in a viral population. Nature. 2006;439: 344–348. 10.1038/nature04388 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Beck A, Tesh RB, Wood TG, Widen SG, Ryman KD, Barrett ADT. Comparison of the live attenuated yellow fever vaccine 17D-204 strain to its virulent parental strain Asibi by deep sequencing. J INFECT DIS. 2014;209: 334–344. 10.1093/infdis/jit546 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Lauring AS, Acevedo A, Cooper SB, Andino R. Codon usage determines the mutational robustness, evolutionary capacity, and virulence of an RNA virus. Cell Host and Microbe. 2012;12: 623–632. 10.1016/j.chom.2012.10.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Ohmit SE, Victor JC, Rotthoff JR, Teich ER, Truscon RK, Baum LL, et al. Prevention of antigenically drifted influenza by inactivated and live attenuated vaccines. N Engl J Med. 2006;355: 2513–2522. 10.1056/NEJMoa061850 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Ohmit SE, Victor JC, Teich ER, Truscon RK, Rotthoff JR, Newton DW, et al. Prevention of symptomatic seasonal influenza in 2005–2006 by inactivated and live attenuated vaccines. J INFECT DIS. 2008;198: 312–317. 10.1086/589885 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Monto AS, Ohmit SE, Petrie JG, Johnson E, Truscon R, Teich E, et al. Comparative efficacy of inactivated and live attenuated influenza vaccines. N Engl J Med. 2009;361: 1260–1267. 10.1056/NEJMoa0808652 [DOI] [PubMed] [Google Scholar]
- 35.Petrie JG, Ohmit SE, Truscon R, Johnson E, Braun TM, Levine MZ, et al. Modest Waning of Influenza Vaccine Efficacy and Antibody Titers During the 2007–2008 Influenza Season. J INFECT DIS. Oxford University Press; 2016;: jiw105. [DOI] [PubMed] [Google Scholar]
- 36.Potter CW, Oxford JS. Determinants of immunity to influenza infection in man. Br Med Bull. 1979;35: 69–75. [DOI] [PubMed] [Google Scholar]
- 37.Lee N, Chan PKS, Hui DSC, Rainer TH, Wong E, Choi K-W, et al. Viral loads and duration of viral shedding in adult patients hospitalized with influenza. J INFECT DIS. 2009;200: 492–500. 10.1086/600383 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Tsang TK, Cowling BJ, Fang VJ, Chan K-H, Ip DKM, Leung GM, et al. Influenza A Virus Shedding and Infectivity in Households. J INFECT DIS. 2015;212: 1420–1428. 10.1093/infdis/jiv225 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Petrie JG, Ohmit SE, Johnson E, Cross RT, Monto AS. Efficacy studies of influenza vaccines: effect of end points used and characteristics of vaccine failures. J INFECT DIS. 2011;203: 1309–1315. 10.1093/infdis/jir015 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Caton AJ, Brownlee GG, Yewdell JW, Gerhard W. The antigenic structure of the influenza virus A/PR/8/34 hemagglutinin (H1 subtype). Cell. 1982;31: 417–427. [DOI] [PubMed] [Google Scholar]
- 41.Skehel JJ, Brown E, Daniels RS, Douglas AR, Knossow M, Wilson IA, et al. Studies with monoclonal antibodies prepared against X-31 influenza virus haemagglutinin. Biochem Soc Trans. 1985;13: 12–14. [DOI] [PubMed] [Google Scholar]
- 42.Lee M-S, Chen JS-E. Predicting antigenic variants of influenza A/H3N2 viruses. Emerging Infect Dis. 2004;10: 1385–1390. 10.3201/eid1008.040107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.GERBER P, LOOSLI CG, HAMBRE D. Antigenic variants of influenza A virus, PR8 strain. I. Their development during serial passage in the lungs of partially immune mice. J Exp Med. 1955;101: 627–638. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.GERBER P, HAMRE D, LOOSLI CG. Antigenic variants of influenza A virus (PR8 strain). II. Serological and immunological characteristics of variants derived from variants. J Exp Med. 1956;103: 413–424. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.HAMRE D, LOOSLI CG, GERBER P. Antigenic variants of influenza A virus (PR8 strain) III. Serological relationships of a line of variants derived in sequence in mice given homologous vaccine. J Exp Med. 1958;107: 829–844. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Sobel Leonard A, McClain MT, Smith GJD, Wentworth D, Halpin RA, Lin X, et al. Deep Sequencing of Influenza A Virus from a Human Challenge Study Reveals a Selective Bottleneck and Only Limited Intrahost Genetic Diversification. J Virol. 2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Monto AS, Petrie JG, Cross RT, Johnson E, Liu M, Zhong W, et al. Antibody to Influenza Virus Neuraminidase: An Independent Correlate of Protection. J INFECT DIS. 2015;212: 1191–1199. 10.1093/infdis/jiv195 [DOI] [PubMed] [Google Scholar]
- 48.Zhou B, Donnelly ME, Scholes DT, St George K, Hatta M, Kawaoka Y, et al. Single-reaction genomic amplification accelerates sequencing and vaccine production for classical and Swine origin human influenza a viruses. J Virol. 2009;83: 10309–10313. 10.1128/JVI.01109-09 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Langmead B, Trapnell C, Pop M, Salzberg SL. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 2009;10: R25 10.1186/gb-2009-10-3-r25 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Gerstung M, Beisel C, Rechsteiner M, Wild P, Schraml P, Moch H, et al. Reliable detection of subclonal single-nucleotide variants in tumour cell populations. Nature Communications. 2012;3: 811 10.1038/ncomms1814 [DOI] [PubMed] [Google Scholar]
- 51.Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Research. 2004;32: 1792–1797. 10.1093/nar/gkh340 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Darriba D, Taboada GL, Doallo R, Posada D. jModelTest 2: more models, new heuristics and parallel computing. Nat Methods. 2012;9: 772. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30: 1312–1313. 10.1093/bioinformatics/btu033 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All code is available at https://github.com/lauringlab/variant_pipeline and https://github.com/lauringlab/fluvacs_paper.