Skip to main content
PLOS Biology logoLink to PLOS Biology
. 2024 Nov 12;22(11):e3002916. doi: 10.1371/journal.pbio.3002916

Deep mutational scanning of H5 hemagglutinin to inform influenza virus surveillance

Bernadeta Dadonaite 1, Jenny J Ahn 2, Jordan T Ort 3, Jin Yu 4, Colleen Furey 3, Annie Dosey 5,6, William W Hannon 1, Amy L Vincent Baker 7, Richard J Webby 8, Neil P King 5,6, Yan Liu 4, Scott E Hensley 3, Thomas P Peacock 9,10, Louise H Moncla 11, Jesse D Bloom 1,12,*
Editor: Katherine Kedzierska13
PMCID: PMC11584116  PMID: 39531474

Abstract

H5 influenza is considered a potential pandemic threat. Recently, H5 viruses belonging to clade 2.3.4.4b have caused large outbreaks in avian and multiple nonhuman mammalian species. Previous studies have identified molecular phenotypes of the viral hemagglutinin (HA) protein that contribute to pandemic potential in humans, including cell entry, receptor preference, HA stability, and reduced neutralization by polyclonal sera. However, prior experimental work has only measured how these phenotypes are affected by a handful of the >10,000 different possible amino-acid mutations to HA. Here, we use pseudovirus deep mutational scanning to measure how all mutations to a 2.3.4.4b H5 HA affect each phenotype. We identify mutations that allow HA to better bind α2-6-linked sialic acids and show that some viruses already carry mutations that stabilize HA. We also measure how all HA mutations affect neutralization by sera from mice and ferrets vaccinated against or infected with 2.3.4.4b H5 viruses. These antigenic maps enable rapid assessment of when new viral strains have acquired mutations that may create mismatches with candidate vaccine virus, and we show that a mutation present in some recent H5 HAs causes a large antigenic change. Overall, the systematic nature of deep mutational scanning combined with the safety of pseudoviruses enables comprehensive measurements of the phenotypic effects of mutations that can inform real-time interpretation of viral variation observed during surveillance of H5 influenza.


H5 influenza is considered a potential pandemic threat. Using deep mutational scanning, this work reveals how the >10,000 different possible amino-acid mutations in hemagglutinin affect cell entry, receptor preference, protein stability and neutralization by antibodies, which can inform real-time interpretation of viral mutations observed during surveillance of H5 influenza.

Introduction

Experimental research on H5 influenza and studies of past influenza pandemics have identified properties of several viral genes (e.g., hemagglutinin (HA) [13], polymerase basic 2 (PB2) [4,5], and nucleoprotein (NP) [68]) that increase the risk that animal influenza viruses pose to humans. Mutations in HA can increase pandemic risk by affecting several molecular phenotypes. One key phenotype is the ability of HA to use α2-6-linked rather than α2-3-linked sialic acid as its primary receptor [911], which is important since α2-6-linked sialic acids dominate the upper respiratory tract in humans [12]. Another relevant phenotype is HA stability, as mutations that stabilize HA are frequently present in viruses capable of transmitting by the airborne route [13,13,14]. HA’s antigenicity is another important phenotype: A component of pandemic preparedness is developing candidate strains for vaccine production [15], but new HA variants can contain mutations that reduce neutralization by antibodies elicited by mismatched vaccine strains. Finally, the ability of HA to mediate cell entry is an essential phenotype for viral fitness, so sites where mutations impair cell entry are good targets for HA-directed therapeutics [16,17].

While previous work has shown the importance of these HA phenotypes, the entire body of prior experimental research has only measured how these phenotypes are impacted by a small fraction of the many possible mutations to H5 HA. Therefore, it is often impossible to assess how newly observed HA mutations impact pandemic risk without performing experiments on each new mutation, an approach that lags behind virus evolution. Here, we address this challenge by prospectively measuring how all possible amino-acid mutations to HA affect the phenotypes of cell entry, α2-6-linked sialic-acid usage, stability, and serum antibody neutralization (Fig 1A). We make these measurements by applying a safe pseudovirus deep mutational scanning system (18) to the HA of a current WHO-recommended 2.3.4.4b clade H5 candidate vaccine virus (15) (Fig 1B and 1C). The resulting maps of mutational effects enable real-time interpretation of the phenotypic effects of HA mutations observed during viral surveillance.

Fig 1. Deep mutational scanning of H5 HA.

Fig 1

(A) HA phenotypes relevant to pandemic risk that were measured in this study. (B) Phylogenetic tree of H5Nx HAs. The A/goose/Guangdong/1/96-like lineage that was first identified as a highly pathogenic form of avian influenza in 1996 forms the large lower clade on the tree. The 2.3.4.4b clade that recently has spread globally is in red, and the A/American Wigeon/South Carolina/USDA-000345-001/2021 (H5N1) sequence used in our deep mutational scanning study is indicated. The tree file can be found in S1 Data. (C) Schematic of a library of genotype-phenotype linked pseudoviruses. See S1 Fig for more details on the library.

Results

H5 HA deep mutational scanning libraries

We designed H5 HA deep mutational scanning libraries to measure the effects of all 10,773 possible amino-acid mutations (567 sites × 19 amino-acid mutations/site). We created these libraries in the HA from the A/American Wigeon/South Carolina/USDA-000345-001/2021 (H5N1) strain, which is one of WHO’s candidate vaccine viruses [15] (Fig 1B). To generate pools of pseudoviruses with all the HA single amino-acid mutants, we used a lentivirus-based deep mutational scanning platform that generates pseudoviruses encoding different mutants of HA each linked to a nucleotide barcode in the lentiviral genome [18] (S1A and S1B Fig). These pseudoviruses also express the matched H5N1 neuraminidase (NA) on their surface; however, NA is expressed from a separate plasmid during pseudovirus production and only HA is mutagenized in our study (S1B Fig). The pseudoviruses can be studied at Biosafety Level 2 since they encode no viral genes other than HA and so cannot undergo multicycle replication.

We produced duplicate pseudovirus libraries that contained 95% and 93% of the possible HA amino-acid mutations, respectively (S1C Fig). Most HAs contained a single amino-acid mutation, although a small fraction contained no or multiple mutations (S1D Fig).

Effects of HA mutations on cell entry

To measure how mutations affect HA’s essential function of mediating viral entry into cells, we quantified HA-mediated pseudovirus entry into 293T cells (S2A Fig). These cells express both α2-3- and α2-6-linked sialic acids [19,20], which are typical receptors for avian and human influenza viruses, respectively. The measured effects of mutations on cell entry were highly correlated between replicate libraries (S2B Fig), demonstrating the reproducibility of the experiments.

There is wide variation in how HA mutations affect entry, with some sites tolerating many mutations and others under strong constraint (Fig 2 and interactive plot at https://dms-vep.org/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/cell_entry.html). Much of the constraint is shaped by how mutations affect HA’s folding and ability to mediate membrane fusion, since sites in the protein core including the stem helices in the fusion machinery [21] are especially intolerant to mutations (Fig 2B). HA’s globular head domain is more mutationally tolerant than the stalk region, consistent with the head’s greater variability among natural influenza viruses and prior experimental work [2225] (Fig 2B and 2C). Additionally, the known antigenic regions of the head [26,27] are more tolerant to mutations than the rest of the head (Fig 2C). However, there are some regions on HA’s surface that are intolerant of mutations (Fig 2B) and so represent attractive targets for efforts to design antibodies and other HA-directed therapeutics that are resistant to viral escape [16].

Fig 2. Effects of mutations on cell entry.

Fig 2

(A) Effects of HA mutations on pseudovirus entry into 293T cells. Orange indicates impaired entry, white parental-like entry, and blue improved entry. The x’s indicate the parental amino acid, and light gray indicates mutations that were not reliably measured in our experiments. See https://dms-vep.org/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/cell_entry.html for an interactive version of this heatmap that allows zooming and mousing over points for details. Sites are numbered using the H3 scheme (see https://dms-vep.org/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/numbering.html for details). (B) HA structure (PDBs: 4KWM, 1JSO) colored by the average effect of amino-acid mutations at each site on cell entry (orange indicates sites where mutations tend to impair cell entry). (C) Effects of mutations on cell entry for different domains and regions of HA. The boxes show the median and interquartile range, and more negative values indicate mutations adversely affect cell entry. In the heatmap overlays in (A) “RBP” denotes the receptor-binding pocket and “PBCS” denotes the polybasic cleavage site. The data underlying this figure can be found in S2 Data.

To validate that the effects of HA mutations measured in the pseudovirus deep mutational scanning reflect actual influenza virion cell entry, we generated conditionally replicative influenza viruses that lack the essential PB1 polymerase gene [28] and derive their other nonsurface proteins from a lab-adapted strain [29], making them safe for study at Biosafety Level 2 (S2C Fig). The effects of mutations on cell entry measured in the pseudovirus deep mutational scanning were highly correlated (R = 0.9) with the titers of conditionally replicative influenza viruses carrying these same mutations (S2D Fig).

Effects of HA mutations on entry into cells expressing α2-6- versus α2-3-linked sialic acids

Avian influenza viruses use α2-3-linked sialic acids as their preferred receptors, but human-transmissible strains typically prefer α2-6-linked sialic acids [911]. Current clade 2.3.4.4b H5 viruses in animals have HAs that bind poorly to the α2-6-linked sialic acids that dominate the human airways, but bind well to α2-3-linked and other forms of sialic acids [3032]. To determine how mutations to HA affect its preference for α2-6-linked versus α2-3-linked receptors, we measured pseudovirus entry into 293 cells engineered to express primarily α2-3-linked or α2-6-linked sialic acids (hereafter referred to as α2–3 or α2–6 cells; see Methods for details on these cells) [33] by applying the workflow in S2A Fig separately to each cell line. While α2–6 cells still support limited infection by pseudoviruses with the unmutated H5 HA, titers on these cells are significantly lower than those on α2–3 cells because the parental H5 HA used in our deep mutational scanning is much more efficient at using α2-3-linked sialic acids (S3A Fig). Replicate measurements of mutational effects on entry made with the 2 different libraries were well correlated, again demonstrating the reproducibility of the measurements (S3B Fig).

Most HA mutations had similar effects on entry regardless of the cells used (S4 Fig) since most mutations impact cell entry by affecting HA folding or membrane-fusion function (Fig 2B), which are essential for entry into any cell. However, some mutations were more favorable for entry in α2–6 versus α2–3 cells (Figs 3 and S4, interactive plots at https://dms-vep.org/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/a26_usage.html). The HA mutations that specifically enhanced entry into α2–6 versus α2–3 cells were mostly at a handful of sites in the sialic-acid binding pocket (Fig 3). Multiple mutations to site Q226 strongly increased HA’s preference for α2-6- versus α2-3-linked sialic acids in our deep mutational scanning (Fig 3A). These mutations include Q226L (throughout this manuscript, we use mature H3 HA numbering; see https://dms-vep.org/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/numbering.html), which occurred in prior H3 and H2 HA pandemic viruses [10,34] and has previously been shown to enhance α2-6-linked sialic acid binding [1,3,3537]. Mutations at several other sites also markedly improved entry into α2–6 cells, including sites 137, 190, 193, 225, and 228 (Fig 3A and 3B). Changes to some of these sites have occurred in previous pandemics [10,34,38,39] or been shown to enhance α2-6-linked sialic-acid binding [3537,4043] (Fig 3A and 3B). Of the sites that strongly enhance entry into α2–6 cells, only a small number are accessible by single nucleotide change to the current 2.3.4.4b H5 HA gene sequence: These mutations include Q226L/R, E190Q/A, N224K/Y, G228A, and several others (Fig 3A).

Fig 3. Effects of mutations on entry into cells expressing α2-6- versus α2-6-linked sialic acids.

Fig 3

(A) Effects of mutations on pseudovirus entry into 293 cells expressing α2-6- or α2-3-linked sialic acids. Each point is a different amino-acid mutation, with shapes indicating whether the mutation is accessible by a single-nucleotide change to the natural HA gene sequence. Sites where more than one mutation has large differences between α2–3 and α2–6 cell entry are labeled. Note that only mutations with a net favorable effect on entry into α2–6 cells are shown. Note that these experiments were performed in cell-culture media that contained 10% fetal bovine serum, which may also impact the effects of some mutations due to decoy receptors present in the serum. (B) Structure of the HA near the sialic-acid binding pocket (PDBs: 4KWM, 1JSO) with sites colored according to the average increase in α2–6 usage caused by mutations at that site. See https://dms-vep.org/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/a26_usage.html for interactive plots including a version of A that includes all mutations (not just those with net positive α2–6 cell entry) and an interactive heatmap of the change in α2–6 versus α2–3 usage for all mutations. The data underlying this figure can be found in S2 Data.

Note that while our experiments identify HA mutations that improve entry into the α2–6 cells, it is possible that these cells may not fully represent the abundance or diversity of different glycan structures in the human airways [4446]. We also note that improved entry into the α2–6 cells could come from either enhanced binding to α2-6-linked sialic acids on the surface of those cells or reduced binding to decoy receptors including α2-3-linked sialic acids present in the serum in the cell-culture media used in our experiments. Additionally, in some cases, multiple mutations may be needed to fully switch the preference of HA to α2–6 linked sialic acids [47].

Effects of mutations on HA stability

Mutations that increase HA stability are associated with increased airborne transmissibility of influenza viruses [1,3,4853]. HA undergoes a similar irreversible conformational change in response to both acidic pH and elevated temperature [21], so the loss of viral infectivity at acidic pH is widely used as a measure of HA stability [51,54,55]. To identify mutations that affect HA’s stability, we incubated the pseudovirus libraries in progressively more acidic pH buffers and quantified the retention of infectivity as a measure of HA stability (Fig 4A).

Fig 4. Effects of mutations on stability.

Fig 4

(A) To measure how mutations affect HA stability, the pseudovirus library was incubated in increasingly acidic buffers and then used to infect cells. Stability was quantified as the resistance of each mutant to inactivation by acidic pH. (B) HA structure colored by the average increase in HA stability caused by mutations at each site, with blue indicating sites where mutations increase stability. (C) The average increase in HA stability caused by mutations at each site. See https://dms-vep.org/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/stability.html for an interactive heatmap of the effects of all mutations that includes options to show destabilizing as well as stabilizing mutations. The data underlying this panel can be found in S3 Data. (D) Validation of deep mutational scanning measurements by comparison to direct measurements of acid-induced inactivation of conditionally replicative influenza virions carrying the indicated mutations (see S5 Fig for details). The mutations validated here were chosen to span a wide range of effects on stability. The r value in the upper-left of each panel indicates the Pearson correlation. The data underlying this panel can be found in S4 Data.

Mutations at numerous sites increase HA stability (Fig 4B and 4C and interactive plot at https://dms-vep.org/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/stability.html). Although these stabilizing mutations are widely distributed in primary sequence, they are concentrated in 2 regions of HA’s three-dimensional structure: the stem’s alpha helices and the base of the head domain (Fig 4B). The spatial location of these mutations is consistent with the fact that acidic conditions or high temperature trigger an irreversible conformational change that involves rearrangement of the stem helices and displacement of the head domain [21,56]. Overall, the deep mutational scanning both identified previously known stabilizing mutations (e.g., Y17H, A19T, H24Q, E31K, H110Y, and T318I [1,2,14,50,57]) as well as many additional mutations that increase stability (Fig 4C).

We validated the deep mutational scanning stability measurements by generating conditionally replicative influenza virions (S2A Fig) carrying key mutations and quantifying their infectivity after acid treatment. The deep mutational scanning measurements correlated well with the stability of the influenza virions (Figs 4D and S5), validating the approach. We note that the map of stabilizing mutations could inform design of stabilized HA immunogens [58] for vaccines as well as inform viral surveillance.

Effects of mutations on escape from serum neutralization

HA is the major target for neutralizing antibodies to influenza, but the protein can acquire mutations that escape neutralization by antibodies elicited by older strains [59]. For potential pandemic influenza strains, WHO designates candidate vaccine viruses to be used to prepare for vaccine production in case of a human pandemic [15], but these viruses can become poorly matched to newer strains as HA evolves. Assessing the impact of HA mutations on serum neutralization can identify when viruses have gained escape mutations that might merit updates to candidate vaccine viruses. The HA used for our mutant libraries is a WHO clade 2.3.4.4b H5 candidate vaccine virus [15]. We used these libraries to measure how all HA mutations affect neutralization by polyclonal sera from mice or ferrets vaccinated or infected to elicit antibodies against clade 2.3.4.4b H5 HAs viruses (S6 Fig).

The mutations that most strongly escaped serum neutralization were on the head of HA in previously defined antigenic regions [60] (Fig 5A–5C and interactive plots at https://dms-vep.org/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/escape.html). While the strongest escape mutations from both the mouse and ferret sera were in antigenic regions A and B at the top of the HA head, the specific sites where mutations caused the greatest escape differ somewhat between the 2 species (Fig 5A–5C). There was also some animal-to-animal variation: For instance, most mouse sera targeted antigenic regions A and B, but the serum of 1 mouse targeted antigenic region E lower on the HA head (S7 Fig).

Fig 5. Effects of mutations on escape from serum neutralization.

Fig 5

(A) Total escape caused by all tolerated mutations at each HA site as measured in the deep mutational scanning, averaged across the sera of all animals from each species. The HA structure on the right shows the classically defined H3 antigenic regions [60]. See S6B Fig for details on the number of animals and immunization methods. See https://dms-vep.org/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/escape.html for interactive plots showing escape for each animal. The data underlying this panel can be found in S5 Data. (B) Average ferret sera escape overlaid on a surface representation of HA (PBD: 4KWM). (C) Same as in (B) but for mouse sera. (D) Correlation between escape for different HA mutants measured with deep mutational scanning (DMS) versus the change in neutralization measured using conventional neutralization assays with conditionally replicative influenza virus. The data underlying this panel can be found in S6 Data.

To validate the escape mutations measured by deep mutational scanning, we performed standard neutralization assays using the conditionally replicative influenza viruses. The effects of mutations measured using deep mutational scanning correlated well with those measured using influenza-based neutralization assays (Fig 5D). Therefore, even though pseudoviruses are generally more sensitive than actual influenza virions to neutralization (S8 Fig), the mutation antigenic effects measured in the pseudovirus deep mutational scanning are recapitulated in actual influenza virions. Notably, several escape mutations that we validated (e.g., R90K, S125bN, P128S, E255K) are present among 2.3.4.4b clade sequences, illustrating the ongoing antigenic evolution of H5 HA.

Phenotypic measurements inform sequence-based viral surveillance

Large numbers of H5 influenza sequences are collected during ongoing surveillance of outbreaks in various animals [6163]. One major goal of this surveillance is to identify viruses that acquire mutations that increase their risk to humans or alter their antigenicity in ways that necessitate updates to candidate vaccine viruses.

To use the large-scale phenotypic measurements described here to inform surveillance, we estimated the relevant HA phenotypes of clade 2.3.4.4b viruses as the simple sum of the phenotypic effects of their individual mutations. For instance, Fig 6A shows a phylogenetic tree of recent viruses colored by their estimated escape from neutralization from sera from ferrets infected by 2.3.4.4b clade influenza virus; see https://nextstrain.org/groups/moncla-lab/h5nx/h5-dms/clade-2344b?c=ferret_sera_escape_dms_value for an interactive Nextstrain [64] version of the same tree. This tree enables immediate identification of subclades with antigenic mutations with reduced neutralization, including clades defined by mutations P128S, A160T/S, and P162Q, which are present in some clusters of sequences from the ongoing dairy cattle outbreak in the United States. These and other recent HA mutations also reduce neutralization by mouse sera (S9A Fig).

Fig 6. Estimated HA phenotypes of recent 2.3.4.4b strains.

Fig 6

Phylogenetic trees of the HA gene of a subsample of H5 2.3.4.4b clade viruses collected from 2020 onwards. The trees are colored by (A) ferret serum escape and (B) HA stability. A phenotype is estimated for each sequence by summing the experimentally measured effects of all constituent HA mutations relative to the parental A/American Wigeon/South Carolina/USDA-000345-001/2021 strain used in the deep mutational scanning (indicated with a yellow “x” and arrow), only including mutations with positive effects in the sum. Some key mutations strongly affecting the displayed phenotypes are labeled on the tree, with the number in brackets indicating the effect of the mutation. See S9 Fig for additional phenotypes mapped onto the same tree, and https://nextstrain.org/groups/moncla-lab/h5nx/h5-dms/clade-2344b for an interactive Nextstrain tree showing the phenotypes mapped onto various sets of sequences. The tree file can be found in S1 Data. (C) The HAs for strains carrying mutations that caused strong escape from ferret sera neutralization in the deep mutational scanning were assayed for neutralization by the ferret sera in conditionally replicative influenza virus neutralization assays (S10A Fig). These scatter plots show the neutralization measured in the influenza virus assays (y-axis) versus the predicted escape calculated as the simple sum of the mutations in each HA relative to the strain used in the deep mutational scanning. The r values indicate the Pearson correlation. The data underlying this panel can be found in S7 Data. (D) HAs from selected strains with stability-increasing mutations were assayed for their infectivity at different pHs using conditionally replicative influenza viruses. The scatterplot shows retained influenza virus infectivity at pH 5.7 relative to deep mutational scanning measured stability, which was calculated as a sum of stability effects for all mutations in a strain relative to the strain used in the deep mutational scanning. The data underlying this panel can be found in S8 Data. Retained infectivity at other pHs is shown in S10C Fig. The full list of HA amino-acid mutations in each strain is in S10D Fig.

Similar estimates can be made for other HA phenotypes relevant to pandemic risk that were measured in our experiments. Some recent clade 2.3.4.4b HAs have acquired mutations that increase their stability, including groups of viruses sampled in 2023 with the R81G mutation (Fig 6B). Likewise, a few (so far isolated) sequences on the tree have acquired mutations that enhance their usage of α2-6-linked sialic acids, including canine and human isolates from China that carry the Q226L mutation (S9B Fig).

To test if the deep mutational scanning measurements can predict the phenotypes of HAs from strains carrying potentially important mutations, we generated conditionally replicative influenza viruses with HAs from recent clade 2.3.4.4b strains that our experiments indicate have mutations that affect sera neutralization and HA stability (Figs 6 and S9 and S10). Simply summing the effects of the individual mutations measured by pseudovirus deep mutational scanning accurately predicted the neutralization escape or stability of the actual influenza virions with HAs from different clade 2.3.4.4b strains (Figs 6C and 6D and S9C and S10A–S10C), despite some of these HAs having up to 12 amino-acid mutations relative to the strain used in the deep mutational scanning (S10D Fig). The fact that some clusters of HA sequences from the recent dairy cattle outbreak carry mutations (P128S, A160T, or P162Q mutations) that appreciably reduce neutralization by mouse and ferret sera elicited by current candidate vaccine viruses underscores the importance of monitoring for potential antigenic drift relative to prototype vaccines.

Of course, sequence-based estimates of phenotypes from our deep mutational scanning are only approximate for viruses with more than 1 HA mutation relative to the parental strain used in our experiments, as epistasis can cause more diverged strains to have phenotypes different from the simple sum of the effects of their mutations. However, the deep mutational scanning accurately predicted serum neutralization and stability of multiple strains carrying as many as 12 amino-acid mutations relative to the strain used in our experiments (Figs 6C and 6D and S9C and S10). This fact demonstrates that sequence-based phenotypic estimates informed by deep mutational scanning provide a method to rapidly assess newly observed viruses and prioritize them for more detailed experimental characterization, which remains important to validate findings and check for epistasis [65,66].

Discussion

The last few decades have seen tremendous advances in viral sequencing. At the same time, experiments have advanced our understanding of molecular phenotypes of viral proteins including HA that contribute to pandemic risk [1,3,9,67]. However, synthesizing these advances to make phenotypic assessments from sequence data remains challenging because only a small fraction of the mutations observed during viral surveillance have been experimentally characterized.

Here, we take a step towards overcoming this challenge by experimentally measuring how nearly all mutations to H5 HA affect key molecular phenotypes. These measurements are valuable from both basic and applied standpoints. From a basic standpoint, they illuminate sequence-function relationships—for instance, by comprehensively defining sites in HA’s stalk and head that contribute to its stability, and mutations that affect HA’s ability to mediate infection of cells that express α2-6- versus α2-3-linked sialic acids. From an applied standpoint, the measurements define which sites are constrained by the virus’s need to maintain HA’s essential cell entry function, thereby identifying targets for new approaches that design antibodies [17] and other therapeutics [16] to bind specific epitopes.

Importantly, our experiments are sufficiently comprehensive to inform phenotypic assessments of newly collected HA sequences. For example, it is known that HA’s stability and receptor specificity contribute to transmissibility [1,3,48], but actualizing this knowledge for surveillance has been hampered by an inability to assess how most HA mutations affect these phenotypes. By measuring the effect of nearly every HA mutation, our experiments enable immediate sequence-based estimation of stability and α2-6-linked sialic acid usage—thereby enabling rapid prioritization of viruses with HAs that merit further monitoring and experimental characterization [65,66].

Similarly, our measurements of how HA mutations affect serum antibody neutralization make it possible to immediately identify viruses that are mismatched to existing candidate vaccine viruses. For instance, WHO recently released their antigenic measurements [68,69] on some HA mutations in clade 2.3.4.4b viruses associated with an ongoing dairy cattle outbreak in the US [68]. Their results are completely consistent with our deep mutational scanning: The 2 HA mutations basal to all cattle viruses have only modest antigenic effects in ferrets, but another mutation in a subset of viruses (A160T in H3 numbering, A156T in H5 numbering) causes an appreciable drop in neutralization. However, our data enable assessment of the antigenic effects of all the recently observed mutations, for instance, showing that another mutation (P162Q in H3 numbering) found in a subset of cat viruses nested within the cattle viruses also causes substantial neutralization escape.

Our deep mutational scanning utilized pseudoviruses that encode only a single viral gene (HA) and so cannot undergo multicycle replication. Although we and others have previously performed deep mutational scanning using replicative influenza viruses with genes from Biosafety Level 2 strains [23,24,70], we abandoned that approach in favor of pseudoviruses to avoid biosafety concerns associated with mutating replicative viruses with HA from a potential pandemic strain [71]. We acknowledge that the use of pseudoviruses does not eliminate information hazards [72,73]; however, specific examples of mutations that promote airborne-transmissibility of H5N1 by increasing HA stability and α2-6-linked sialic-acid usage have already been publicly described [1,3], so safely measuring how other HA mutations affect these phenotypes benefits surveillance efforts without creating qualitatively new information risks.

There are several caveats when applying the measurements reported here to viral surveillance. First, HA is just one of several influenza proteins [4,68] where mutations affect pandemic risk. Second, the pseudoviruses we used do not fully capture the balance between HA and NA protein activities that contributes to influenza virus fitness [7476]. Third, the cell lines used in our experiments probably do not fully recapitulate the distribution of glycan receptors in the human airways. Fourth, we measured how mutations affected neutralization by mice and ferret sera, which may have different specificities than human sera (although our experiments could be easily applied to human sera if it becomes available). Finally, we measured the effects of individual mutations, but mutations may sometimes combine in nonadditive ways, particularly in HAs that are substantially diverged from the parental HA used in our experiments [7780]. Nonetheless, the experimental results reported here demonstrate that the pseudovirus deep mutational scanning can accurately predict the neutralization and stability of other relatively closely related clade 2.3.4.4b HAs.

Overall, our study provides the first comprehensive measurements of how HA mutations affect key molecular phenotypes that contribute to the pandemic risk posed by H5 influenza. Combining these measurements with rapid sequencing of ongoing H5 influenza outbreaks can improve monitoring of ongoing viral evolution, as well as inform the choice of candidate vaccine viruses, the targeting of designed therapeutics [16,17], and the prioritization of viral strains for additional experimental study [65,66].

Material and methods

Ethics statement

Murine experiments were approved by the Institutional Animal Care and Use Committees (IACUC) of the Wistar Institute and the University of Pennsylvania (protocol number 201116). Ferret experiments were approved by St. Jude Children’s Research Hospital Institutional Animal Care and Use Committee (protocol number 428) according to the guidelines established by the Institute of Laboratory Animal Resources, approved by the Governing Board of the US National Research Council, and executed by trained personnel in a US Department of Agriculture (USDA)-inspected Animal Biosafety Level 3+ animal facility according to regulations established by the Division of Agricultural Select Agents and Toxins at the USDA Animal and Plant Health Inspection Service (APHIS), as governed by the US Federal Select Agent Program (FSAP) regulations (7 CFR Part 331, 9 CFR Part 121.3, 42 CFR Part 73.3).

HA sequence numbering

Throughout we use mature H3 numbering of the HA sequence. See https://dms-vep.org/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/numbering.html for details and for a file that interconverts various numbering schemes.

Design of deep mutational scanning libraries

Deep mutational scanning libraries were designed in the background of A/American Wigeon/South Carolina/USDA-000345-001/2021 strain. The HA gene was codon optimized for use in the lentiviral backbone; see https://github.com/dms-vep/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/blob/main/library_design/pH2rU3_ForInd_H5_H5N1_American_Wigeon_genscript_T7_CMV_ZsGT2APurR.gb for the backbone plasmid containing the HA gene. The libraries were designed to have every possible amino acid mutation to HA and every other STOP codon in the first 20 positions of HA. Single mutant libraries were ordered from Twist Biosciences. Final quality control data for libraries produced by Twist is at https://github.com/dms-vep/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/blob/main/library_design/Q-248900_Final_QC_Report_for_twist_library.xlsx.

Deep mutational scanning library plasmid library production

Out of 567 codon sites in the library, 18 failed during the variant library production by Twist Biosciences. Of these sites, 8 were in the transmembrane domain and 10 were in the HA ectodomain. We decided to mutagenize the 10 failed ectodomain sites in-house and add them to the library to ensure complete mutagenesis of the ectodomain; we did not attempt to readd the failed sites in the transmembrane domain since mutations in that region are unlikely to affect the HA phenotypes of interest for this study. To add mutations at the missing ectodomain sites, forward and reverse primers with NNG/NNC codons were designed and ordered from IDT, sequences for these primers can be found at https://github.com/dms-vep/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/blob/main/library_design/mutagenesis_primers.csv. Forward and reverse primer pools were made by pooling primers at equimolar ratios to a final concentration of 5 μM. Linear HA template was produced by digesting the lentivirus backbone plasmid encoding HA (see above for plasmid map) with NotI and XbaI enzymes and gel purifying the digestion reaction. A PCR mutagenesis reaction was then performed as described previously [18] with a few changes, namely, 12 PCR cycles were used for the mutagenesis reaction and after the joining PCR reaction was purified, an additional Dpn digest was performed for 20 minutes in order to remove any potential wild-type template. Mutagenised HA fragments were then pooled together with the linear library pool produced by Twist Biosciences at a molar ratio proportional to the number of mutants in each pool so that each codon variant would have an equal representation in the final library. The pooled library was then split into 2 reactions (creating library-1 and library-2, which were then handled independently through the rest of the deep mutational scanning) and barcoded with primers containing a random 16-nucleotide sequence downstream of the stop codon as described previously [18]. To produce plasmid libraries, mCherry-containing lentivirus backbone (plasmid map at https://github.com/dms-vep/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/blob/main/library_design/3137_pH2rU3_ForInd_mCherry_CMV_ZsGT2APurR.gb) was digested with XbaI and MluI restriction enzymes and barcoded linear HA libraries were inserted into the lentivirus backbone using NEB HiFi assembly kit (NEB E5520S). Assembled plasmid libraries were then electroporated into electrocompetent 10b cells (NEB C3020K) and incubated overnight on ampicillin containing LB plates. Ten electroporation reactions were performed per library, producing >3.7 M CFUs per library in total. High colony numbers are required to have a large barcode diversity in the plasmids in order to prevent the same barcode becoming associated with multiple different HA variants during lentiviral recombination during subsequent steps in the production of the pseudovirus libraries. The colonies were scraped from the LB plates, and plasmid library pools were purified using a QIAGEN Plasmid Maxi Kit (Cat. no. 12162).

Production of cell-stored deep mutational scanning libraries

Cells storing the deep mutational scanning libraries as a single integrated HA-containing proviral genome per cell (S1A and S1B Fig) were produced as described previously [18] with a few changes. In brief, VSV-G pseudotyped viruses were produced by transfecting two 6-well plates of 293T cells with the lentiviral backbone containing the barcoded HA deep mutational scanning libraries (1 μg per well), lentiviral Gag/Pol, Tat, and Rev helper plasmids (BEI Resources: NR-52517, NR-52519, NR-52518, 0.25 ng per well each), a VSV-G expression plasmid (0.25 ng per well, plasmid map at https://github.com/dms-vep/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/blob/main/library_design/29_HDM_VSV_G.gb), and plasmid expressing the NA gene (1 μg per well, plasmid map at https://github.com/dms-vep/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/blob/main/library_design/3405_HDM_N1_native_H5N1_turkey_Indiana.gb). NA gene came from A/turkey/Indiana/22-003707-003/2022 strain; this strain also belongs to 2.3.4.4b clade and has the same HA sequence but differs in NA sequence by 2 amino acids relative to the NA for the strain used in this study. This NA was used because it provided high pseudovirus titers and NA from the A/American Wigeon/South Carolina/USDA-000345-001/2021 strain was never tested. Note that while VSV-G pseudoviruses do not require NA for virus release or entry, the produced virions will also have HA expressed from the lentiviral backbone on their surface, so NA expression is needed to prevent HA from binding the virions to the producer cell surface. This is particularly important for a mutant library, as different HA mutants will have different expression and sialic-acid binding properties, which could lead to a different tendency of different virions to be retained on the cell surface by HA if NA is not added at a high excess. The VSV-G pseudotyped viruses were then used to infect 293T-rtTA cells (using a specific clone we have previously found to yield good pseudovirus titers [18]) at an infection rate of 0.4% so that most transduced cells receive only a single integrated proviral genome. Transduced cells were then selected using puromycin leading to a population of cells in which each cell contains an integrated proviral genome encoding a single barcoded variant of HA (S1A and S1B Fig). These cells were then frozen in approximately 20 M cells per aliquot and stored long term in liquid nitrogen for further use.

Long-read PacBio sequencing for variant-barcode linkage

To link barcode sequences with mutations in HA (S1A Fig), we performed long-read PacBio sequencing of lentiviral genomes in virions produced from cell-stored libraries as described previously [18]. To do so, 90 million cell-stored library cells were plated in a 3-layer flask and next day transfected with lentiviral helper plasmids (26.4 μg each), NA expression plasmid (10.8 μg), and VSV-G expression plasmid (11.25 μg) to produce VSV-G pseudotyped virions. At 30 hours after transfection, the virus-containing cell supernatant was collected and concentrated 15-fold using a protein column concentrator (Pierce Protein Concentrator Cat. no. 88537). Around 15 million transcription units of the concentrated virus stock were then used to infect 293T cells, and nonintegrated reverse-transcribed lentiviral genomes were recovered by miniprepping the 293T cells as described previously [18]. Eluted genomes were then split into 2 PCR reactions each of which added a unique tag for detecting strand exchange, which would prevent correct mutation-barcode linkage. Aliquots from the 2 PCR reactions were then pooled together, and another round of PCR was performed. In each PCR round, we minimized the number of reaction cycles in order to limit strand exchange. Long read sequencing was then performed on PacBio Sequel IIe machine. The PacBio circular consensus sequences (CCSs) were then aligned to the target amplicon (https://github.com/dms-vep/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/blob/main/data/PacBio_amplicon.gb) using alignparse [82], and consensus sequences for the HA variant associated with each barcode were determined requiring at least 2 CCSs per barcode. The final barcode-variant lookup table can be found at https://github.com/dms-vep/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/blob/main/results/variants/codon_variants.csv. Library 1 had 40,458 variants and library 2 had 34,495 variants, with an average of 1.1 mutations per HA (S1C and S1D Fig).

Mutation effects on cell entry

Cell entry effects of each mutation in the library were determined as described previously [18] with some modifications (see also S2A Fig). HA pseudotyped library viruses were produced from 5-layer flasks in the presence of doxycycline (which induces HA expression from lentiviral backbone) and using 44 μg of each lentiviral plasmid and 18 μg of NA expression plasmid. Virus-containing supernatant was recovered 48 hours posttransfection and concentrated 7.5-fold using either protein column concentrator or Lenti-X concentrator (TakaraBio, Cat. no. 631232) according to manufacturer’s instructions. To measure effects of mutations on cell entry, 293T cells were infected with either libraries pseudotyped with VSV-G envelope (produced as described in the previous methods subsection) or HA pseudotyped libraries. For HA pseudotyped library infections 293T cells were plated in the presence of 2.5 μg/ml of amphotericin B a day before, which increases library virus titer by approximately 4-fold, and infections were performed in the presence of 1 μM oseltamivir in order to inhibit NA and prevent cleavage of receptors on target cell surface. For VSV-G pseudotyped library infections, approximately 15 million transcription units were used and for H5 pseudotyped library infections approximately 5 million transcription units were used. At 12 to 15 hours postinfection, cells were collected, nonintegrated lentiviral genomes were recovered using QIAprep Spin Miniprep Kit (Qiagen, cat. no. 27104), and amplicon libraries for barcode sequencing were prepared as described previously [8].

Cell entry scores for each variant were calculated using log enrichment ratio: log2 ([nvpost / nwtpost]/[nvpre / nwtpre]), where nvpost is the count of variant v in the H5-pseudotyped infection (postselection condition), nvpre is the count of variant v in the VSV-G-pseudotyped infection (preselection condition), and nwtpost and nwtpre are the counts for wild-type variants. Positive cell entry scores indicate that a variant is better at entering the cells compared to the unmutated parental HA, and negative scores indicate entry worse than the parental HA.

To calculate the mutation-level cell entry effects, a sigmoid global-epistasis [83] function was fitted to variant entry scores after truncating the values at a lower bound of −6, using the multi-dms software package [84]. The rationale for fitting the global epistasis models is that some of the variants have multiple HA mutations, and so the global epistasis model enables deconvolution of the mutation effects from these multiply mutated variants as well as just the single mutants. An interactive heatmap showing the effect of each mutation on cell entry is at https://dms-vep.org/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/cell_entry.html. For the mutation effects, values of zero mean no effect on cell entry, negative values mean impaired cell entry, and positive values mean improved cell entry. For the plots in this paper, we only show by default mutations observed in average of at least 2 barcoded variants per library and measured in both of the duplicate libraries.

Effects of mutations on α2-6- versus α2-3-linked sialic acid cell entry

To identify mutations that increase α2-6-linked sialic-acid receptor preference, we performed cell entry selections like the ones described above for 293T cells using cells that express either α2-3-linked or α2-6-linked sialic acids (see the Cell lines methods subsection below for the full description of these cell lines). For cell entry measurements on α2–3 cells, approximately 5 million transcription units of HA pseudotyped library virus were used, while for α2–6 cells, approximately 20 million transcription units of the library pseudovirus were used, with the titers (transcriptional units) determined on each cell line separately. The reason higher titers could be used for the α2–3 cells is because virus titers on α2–6 cells are significantly lower (S3A Fig). Cell entry scores for each mutation in the library were calculated the same way as for 293T cells except that VSV-G pseudotyped virus infection on α2–6 cells served as preselection condition for α2–3 cell entry, and VSV-G infection on α2–3 cells served as preselection condition for α2–6 cell entry. The reason for this change in preselection condition is because H5 HA pseudotypes just as effectively as VSV-G and so viruses pseudotyped with VSV-G will have H5 HA on their surface, which will be expressed from lentiviral backbone due to the presence of Tat during pseudovirus production. Because of this, variants of H5 HA with mutations that enhance α2-3- or α2-6-linked sialic acid usage would also lead to VSV-G pseudotyped virus enter those respective cells better, which would in turn lead to lower cell entry scores for such variants. Using unmatched pre- and postselection conditions should mitigate this entry bias, although in practice while entry scores for key sialic acid preference determining mutations do change when preselection and postselection conditions are not matched, the differences are fairly small. To calculate the increase in α2-6-linked sialic acid usage, we subtracted α2–3 cell entry mutation effects from α2–6 cell entry mutation effects, such that positive differences indicate better entry into the α2–6 cells. Interactive plots showing all entry scores for α2–3 cells are at https://dms-vep.org/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/htmls/293_SA23_entry_func_effects.html; cell entry scores for α2–6 cells are at https://dms-vep.org/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/htmls/293_SA26_entry_func_effects.html; and differences between α2–3 and α2–6 cell entry are interactively displayed at https://dms-vep.org/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/a26_usage.html.

Effects of mutations on HA stability

To measure the effects of mutations on HA stability, we followed the workflow outlined in (Fig 4A). Specifically, approximately 5 million transcription units of the pseudovirus library were incubated with D10 media or citrate-based acidic buffers. Buffers were prepared at 0.1 M by mixing together sodium citrate dihydrate and citric acid solutions and adjusting to pHs 5.7, 5.5, or 5.3 using hydrochloric acid or sodium hydroxide. Pseudovirus libraries were incubated for 1 hour at 37°C at 1:12 library to buffer volume ratio. After incubation, libraries were then concentrated using 100k Amicon spin columns (Milipore, UFC910008) by spinning for 15 minutes at 1,500g to approximately 1 ml volume and resuspended in 11 ml of PBS to exchange and neutralize acidic buffers. Viruses were spun down again for 20 minutes at 1,500g to reduce sample volume and exchange PBS for 2 ml of D10 media. Oseltamivir was added to the libraries at 1 μM and they were then used to infect 293T cells that were plated a day before in the presence of 2.5 μg/ml of amphotericin B. At 12 to 15 hours postinfection unintegrated, lentiviral genomes were recovered from cells using Qiagen Miniprep Kit. During the miniprep lysis step (P2 buffer), 0.003 ng of DNA standard was added to a final molar concentration of approximately 1% of recovered lentiviral DNA (this was calculated based on the number of expected nonintegrated lentiviral genomes recovered given the amount of virus used for infections). The DNA standard consisted of a plasmid encoding lentiviral backbone with a barcoded mCherry gene (plasmid map for nonneutralizable standard is at https://github.com/dms-vep/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/blob/main/library_design/3068_ForInd_mC_BCs_pool1.gb, and barcodes in this standards can be found at https://github.com/dms-vep/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/blob/main/data/neutralization_standard_barcodes.csv), and the purpose of this standard is to enable conversion of relative sequencing counts to absolute amount of each barcoded pseudovirus variant (relative to the standard) at each pH condition. Note that unlike for sera selections (described below) where we use VSV-G or RdPro pseudotyped viruses as nonneutralizable standard, here we choose to use DNA-based standard because VSV-G or RdPro may be affected by the pH treatment. Lentiviral genomes were then used to make amplicon libraries for Illumina sequencing as described previously [18].

The sequencing counts were converted to the fraction infectivity retained at each pH condition relative to D10 by using the counts of each variant relative to the DNA standard at each condition. The polyclonal [85] software (v6.11) was used to analyze mutation-level retained infectivity at each pH and to fit a pH-dependent neutralization curve while estimating the effect of each mutation on stability. An example notebook of this analysis can be found at https://dms-vep.org/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/notebooks/fit_escape_stability_Lib2-231002-pH.html. HA stability data can be viewed interactively in the context of the H5 HA protein structure using dms-viz [81] at https://dms-viz.github.io/v0/?data=https%3A%2F%2Fraw.githubusercontent.com%2Fdms-vep%2FFlu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS%2Fmain%2Fresults%2Fdms-viz%2Fdms-viz.json.

Effects of mutations on serum neutralization

To measure the effects of mutations on neutralization by polyclonal mouse or ferret sera, we followed the workflow in S6A Fig, which parallels previously described work [18,86]. Specifically, approximately 2.5 million transcription units of the library virus were mixed with nonneutralizable standard so that the final amount of DNA from the nonneutralized standard would be approximately 1% in the absence of serum selection. For mouse sera, VSV-G pseudotyped viruses were used as nonneutralizing standards, while for ferret sera, RdPro pseudotyped viruses were used. Both of these nonneutralizable standards and their generation have been described previously [86]. The reason different neutralizing standards were used for the 2 types of sera is because ferret sera nonspecifically neutralizes VSV-G virus but does not neutralize RdPro virus, while mouse sera does not neutralize either; as VSV-G pseudovirus is easier to produce, we opted to use it for mice sera instead of the RdPro. The choice of the nonneutralizable standard should not influence sera selection results as both pseudoviruses carry identical lentiviral genomes. At the infection stage, 1 μM of oseltamivir was also added. Library viruses were then incubated with 3 sera dilutions that were measured to reach IC99, IC99*4, and IC99*16 neutralization using luciferase-based pseudovirus neutralization assay for 45 minutes at 37°C. Strongly neutralizing dilutions of sera need to be used in these assays in order to select for only the variants that can escape polyclonal sera due to mutations they carry, and multiple sera dilutions are used to cover a greater dynamic range. After incubation with serum, 293T cells that were plated in the presence of 2.5 μg/ml of amphotericin B a day before were infected with library viruses. At 12 to 15 hours postinfection, cells were collected, nonintegrated lentiviral genomes were recovered using QIAprep Spin Miniprep Kit, and amplicon libraries for barcode sequencing were prepared as described previously [18].

The data were analyzed as described previously [18,86]. Briefly, the fraction infectivity retained at each serum concentration was calculated from the barcode counts of each pseudovirus variant relative to the nonneutralized standard. Then, the polyclonal [85] software (v6.11) was used to fit neutralization curves and estimates of mutation effects on neutralization to these variant-level fraction infectivity measurements. An example notebook for data fitting for mouse sera is at https://dms-vep.org/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/notebooks/avg_escape_antibody_escape_mouse-1-03.html and for ferret data is at https://dms-vep.org/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/notebooks/avg_escape_antibody_escape_ferret-7.html. Serum escape data can be viewed interactively in the context of the H5 HA protein structure using dms-viz [81] at https://dms-viz.github.io/v0/?data=https%3A%2F%2Fraw.githubusercontent.com%2Fdms-vep%2FFlu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS%2Fmain%2Fresults%2Fdms-viz%2Fdms-viz.json.

H5N1 lentiviral pseudovirus for neutralization assays

To determine sera neutralizing concentrations needed to perform the deep mutational scanning experiments to measure the effects of mutations on neutralization (described in the previous methods subsection), lentiviral pseudovirus neutralization assays were used. H5N1 pseudoviruses were produced produced by transfecting 8e5 293T cells plated in wells of a 6-well plate with luciferase and zsGreen coding lentivirus backbone (1 μg per well, plasmid map https://github.com/dms-vep/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/blob/main/library_design/2727_pHAGE6-wtCMV-Luc2-BrCr1-ZsGreen-W-1247.gb), lentiviral Gag/Pol, Tat, and Rev helper plasmids (0.25 μg per well each), NA expression plasmid (0.05 μg per well), and a H5 HA expression plasmid (0.25 μg per well, plasmid map https://github.com/dms-vep/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/blob/main/library_design/HDM_H5_American_Wigeon.gb). Cell supernatants were collected 48 hours post transfection, filtered through a surfactant-free cellulose acetate 0.45 μm syringe filter (Corning, Cat. No. 431220) and stored at −80°C until further use. Pseudovirus titers were determined by serially diluting virus stocks on 293T cells in the presence of 1 μM oseltamivir and 2.5 μg/ml amphotericin B and measuring luciferase signal 48 hours after infection using Bright-Glo Luciferase Assay System (Promega, E2610), as described previously [87].

To perform neutralization assays, 2.5e4 293T cells were plated per well in poly-L-lysine coated, black-walled, 96-well plates (Greiner 655930) in the presence of 2.5 μg/ml amphotericin B. The next day, sera were serially diluted in D10 media and incubated with H5N1 pseudovirus in the presence of 1 μM oseltamivir for 45 minutes at 37°C. After incubation, virus and sera dilutions were transferred onto 293T cells. At 48 hours after infection, Bright-Glo Luciferase Assay System (Promega, E2610) was used to measure virus neutralization. The neutcurve software [88] (https://jbloomlab.github.io/neutcurve/) was used to fit Hill curves and plot neutralization curves from which the appropriate IC99 values were estimated.

Production of conditionally replicative influenza viruses

Conditionally replicative influenza viruses with wild-type, mutant, or selected strain HA genes were generated as schematized in S2C Fig and described previously [89,90] with several modifications explained here. The native HA sequences (not codon optimized) lacking the polybasic cleavage site for all strains and variants were cloned into the bidirectional pHW2000 influenza reverse genetics plasmid [81] (plasmid maps for all strains can be found at https://github.com/dms-vep/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/tree/main/library_design/conditionally_replicative_virus_plasmids). The polybasic cleavage site was removed from the HA sequence to further enhance the safety of this system. The native NA sequence from A/turkey/Indiana/22-003707-003/2022 strain was also cloned into a pHW2000 plasmid (plasmid map https://github.com/dms-vep/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/blob/main/library_design/conditionally_replicative_virus_plasmids/3972_pHW_N1_TurkeyH5.gb). A mixture of 5e5 293T-PB1 cells [28] and 50,000 MDCK-SIAT1-PB1-TMPRSS2 cells [92] per well was plated into a 6-well plate. The next day, cells were transfected with plasmid mix consisting of A/WSN/1933 PB2, PA, NP, M, and NS reverse genetics plasmids [91], pHH-PB1flank-eGFP plasmid that consists of noncoding terminal regions from A/WSN/1933 PB1 segment and eGFP gene in place of PB1 coding sequence [28], pHW_HA_American-Wigeon_USDA-000345-001_H5N1_delPBCS plasmid and pHW_N1_TurkeyH5 plasmid. Next day, cell media was changed to Influenza growth media (see Cell lines methods subsection below) and virus supernatant was collected 48 hours after media change. Virus titers were determined by serially diluting virus stocks and infecting MDCK-SIAT1-PB1 cells in Neutralization Assay Medium (NAM) media. Flow cytometry was used to determine eGFP positive cell numbers, 16 hours postinfection, and Poisson distribution was used to determine virus titers. In cases where high virus titers were needed, rescued viruses were further passaged at low MOI (0.01) in MDCK-SIAT1-PB1-TMPRSS2 cells in NAM. For stability mutation validations, passaged viruses were further concentrated using 100k Amicon spin columns. To avoid any possibility of reassortment between the conditionally replicative influenza viruses with the H5N1 HA and NA and fully replication competent influenza, work with these conditionally replicative viruses was strictly segregated away from tissue-culture incubators and biosafety cabinets where fully replicative seasonal or lab-adapted influenza viruses were being studied.

Validation of stability enhancing mutations using conditionally replicative influenza virus

MDCK-SIAT1-PB1 cells were plated in NAM at 100 k per well in a 6-well dish, 4 hours prior to infection. Wild-type and mutant viruses were diluted to the target final MOI of 0.75 to 1 and incubated with citrate buffers at pHs 6.9, 5.7, 5.5, or 5.3 for 30 minutes at 37°C at a dilution of 4 μl virus to 96 μl of buffer. Citrate buffers were prepared at 0.1 M by mixing together sodium citrate dihydrate and citric acid solutions and adjusted to the required pH using hydrochloric acid or sodium hydroxide. pH-treated viruses were then brought up to a neutral pH by 3.9mL of NAM media. About 2 mL of the NAM-diluted virus was then added to MDCK-SIAT1-PB1 cells. At 16 to 20 hours postinfection, cells were collected for flow cytometry analysis and the percent positive eGFP cells were measured for each mutant. We conducted single replicates of each mutant and repeated these across 3 different days. The fractional infectivity was then calculated by taking the proportion of the percent positive green cells at pH 5.3 or pH 5.5 over the percent positive green cells at pH 6.9 (S5 Fig).

Neutralization assays using conditionally replicative influenza virus

To validate sera escape mutations, we performed neutralization assays using conditionally replicative virus as described previously [89]. In brief, 50 k MDCK-SIAT1-PB1 were plated into 96-well plates in NAM media 4 hours before infection. Sera were serially diluted in NAM media and mixed with wild-type or mutant conditionally replicative influenza viruses. The amount of virus to use was chosen such that the green fluorescence signal would change linearly with degree of sera neutralization. Virus and sera were incubated for 45 minutes at 37°C and preplated MDCK-SIAT1-PB1 were infected. Fluorescence signal was determined using Tecan M1000 plate reader, 16 hours postinfection. Fraction infectivity was determined relative to no serum controls and neutralization curves were plotted using neutcurve software [93].

Phylogenetic analysis

All available H5Nx HA sequences were downloaded from GISAID (gisaid.org); an acknowledgment list including accession numbers, originating labs, and submitting labs for all GISAID sequences used for phylogenetic analyses is provided in S13 Data. To visualize the DMS results in the context of H5Nx phylogenies, 3 trees were built: one that included a sample of historical H5Nx HA sequences (all clades), one that included clade 2.3.4.4b HA sequences, and that was subsetted to clade 2.3.4.4b sequences sampled recently in the US. All phylogenies were constructed with HA sequences only. For the H5Nx all clades tree, sequences were randomly subsampled to 2 sequences per host per country per clade per month. For the clade 2.3.4.4b trees, sequences were randomly subsampled to 4 sequences per country per month (all 2.3.4.4b), or 25 sequences per month (US). To maximize the ability to identify circulating sequences harboring putative adaptive mutations, all mammalian sequences were force-included in each build. Only HA sequences of at least 1650 nucleotides were included. Sequences were excluded if the isolate was environmental or laboratory-derived or if it was collected before 1996 (H5Nx), 2009 (clade 2.3.4.4b), or 2022 (US). Using the Nextstrain pipeline [64], sequences were aligned with MAFFT [94], divergence phylogenies were created using IQ-TREE 2 [95], and time-resolved phylogenies were generated with TreeTime [96] with ancestral state reconstruction of nucleotide and amino acid sequences. DMS phenotypes were then mapped onto trees by summing the effect values of HA substitutions relative to the DMS strain, A/American Wigeon/South Carolina/USDA-000345-001/2021, for both terminal tips and reconstructed internal nodes (code provided at https://github.com/moncla-lab/annotate-dms). For mouse and ferret sera escape, only positive effect values were summed, while for HA stability, cell entry, and α2–6 sialic acid usage, both positive and negative effect values were used.

Serum generation

For murine experiments, 10 female C57BL/6 mice (Charles River Laboratories) aged 6 to 8 weeks were primed intramuscularly with a combination of 5 μg clade 2.3.4.4b H5 rHA + AddaVax and boosted 28 days later with the same vaccine. For immunizations, equal volumes of recombinant protein and AddaVax were mixed according to manufacturer’s instructions. Blood samples were obtained by terminal bleeding 28 days after boost, and pooled sera were isolated by centrifugation using Z-Gel tubes (Sarsedt).

For ferret infections male ferrets (Triple F Farms, Sayre, Pennsylvania, USA) were used. Ferrets 10 and 19 were intranasally inoculated at 106 TCID50s, and terminal bleeds were collected 25 and 28 days postinfection, respectively. Ferret 7 was intramuscularly primed with inactivated whole virus and 14 days later challenged intranasally with 103 TCID50s. H5 strains used for ferret infections are indicated in S6 Fig. The ferret sera used in this study was all residual sera from prior experiments.

All sera were inactivated for 1 hour at 56°C to eliminate complement activity, and ferret sera were treated with receptor-destroying enzyme.

Cell lines and media

The HEK293 isogenic α2–3 cells and α2–6 cells were stably engineered as described previously [33,97]. α2–3 cells are 293 cells with combinatorial knockout of ST3GAL3, ST3GAL4, ST3GAL6, ST6GAL1, and ST6GAL2 genes and knock-in of the human ST3GAL4 gene. α2–6 cells are 293 cells with knockout of ST3GAL3, ST3GAL4, ST3GAL6, ST6GAL1, and ST6GAL2 genes and knock-in of the human ST6GAL1 gene. Note that both cell lines still express α2-3- and α2-6-linked sialic acids on O-linked glycans.

These cells and 293T cells were maintained D10 media (Dulbecco’s Modified Eagle Medium with 10% heat-inactivated fetal bovine serum, 2 mM l-glutamine, 100 U/mL penicillin, and 100 μg/mL streptomycin). D10 media was used for all experiments involving pseudovirus. MDCK-SIAT1-PB1 and MDCK-SIAT1-PB1-TMPRSS2 cells were passaged in D10 media and for assays described above plated in either Influenza growth media (Opti-MEM supplemented with 0.1% heat-inactivated FBS, 0.3% bovine serum albumin, 100 μg/mL of calcium chloride, 100 U/mL penicillin, and 100 μg/mL streptomycin), or Neutralization assay media (Medium-199 supplemented with 0.01% heat-inactivated FBS, 0.3% BSA, 100 U/mL penicillin, 100 μg/mL streptomycin, 100 μg/mL calcium chloride, and 25mM HEPES). Influenza growth media was used for all conditionally replicative virus rescue experiments, while Neutralization assay media was used for all experiments involving fluorescence readings because riboflavin present in Opti-MEM has high background fluorescence.

Supporting information

S1 Fig. H5 HA deep mutational scanning libraries.

(A) Schematic of lentiviral backbone used to produce the pseudoviruses. The HA gene is encoded in the lentiviral backbone followed by a 16-nucleotide barcode after the stop codon. Other notable elements include the 5’ LTR, an inducible TRE3GS promoter that drives HA expression, a constitutively expressed ZsGreen/puromycin cassette, and a repaired 3’ LTR that allows reactivation of integrated proviral genomes. The full sequence of each HA mutant is linked to the barcode by long-read PacBio sequencing, enabling subsequent experiments to simply sequence the short barcode to identify the full HA mutant. Note that this backbone has previously been described [18], and the full plasmid encoding mCherry rather than HA is available on AddGene as item 204579. (B) A 2-step process is used to create genotype-phenotype linked viruses pseudotyped with HA, as described previously for other viral entry proteins [18]. First, VSV-G pseudotyped lentiviral particles are generated by standard transfection, including a plasmid expressing the influenza NA gene to ensure virions do not remain attached to cells via the HA protein binding cell-surface sialic acid. These lentiviral particles are then used to infect 293T-rTTA cells at low multiplicity of infection such that infected cells typically just have 1 integrated provirus, and cells containing proviruses are then selected using puromycin. Because the lentiviral backbone contains a full 3’ LTR, the proviral genomes can be reactivated by transfection of the lentiviral helper plasmids and a NA expression plasmid to produce a library of genotype-phenotype linked HA-pseudotyped virions. For selections designed to measure HA cell entry, we also produce virions pseudotyped with VSV-G to use as a control comparison, with cell entry quantified by the relative infectivity of each HA variant in the presence versus absence of VSV-G (since the VSV-G pseudotyped particles do not require a functional HA for cell entry; see S2A Fig). (C) Number of targeted and actual HA amino-acid mutations found in each of the 2 replicate mutant libraries. The last column indicates the number of unique barcoded HAs in each library; this number exceeds the number of unique mutations as many mutations are found in multiple different barcoded HA variants. (D) Number of HA amino-acid mutations in each barcoded HA variant in each library. The data underlying this panel can be found in S9 Data.

(EPS)

pbio.3002916.s001.eps (4.4MB, eps)
S2 Fig. Deep mutational scanning for cell entry and validation using conditionally replicative influenza virions.

(A) The effects of mutations on cell entry were measured by generating the pseudovirus libraries with virions expressing either the mutant HAs or unmodified VSV-G on their surface (see S1B Fig). The ability of each barcoded variant to enter 293T cells was then compared between the VSV-G and HA pseudotyped libraries. Entry of the HA pseudotyped variants depends on the function of HA, whereas entry of the VSV-G pseudotyped variants is independent of HA function. Therefore, the effects of HA mutations on cell entry can be determined by comparing the frequency of each variant between the 2 conditions. (B) Correlation between mutational effects on cell entry in 293T cells as measured in each of the 2 independent replicate libraries. The Pearson correlation is indicated at top left. Throughout the rest of this paper, we report the average effect across the 2 library replicates. The data underlying this panel can be found in S2 Data. (C) For validation assays, we used conditionally replicative influenza virions that can be safely studied at Biosafety Level 2. The virions have the essential PB1 polymerase gene replaced by eGFP [28] and derive their non-HA/NA proteins from the lab-adapted A/WSN/1933 (H1N1) strain [29]. These viruses can be generated by reverse genetics and propagated in cells that constitutively express the PB1 protein but cannot undergo replication in standard cells as they do not express PB1. (D) Deep mutational scanning measurements of how the indicated mutations affect cell entry in 293T cells versus the titers of conditionally replicative influenza virions carrying the same mutations in MDCK-SIAT1-TMPRSS2-PB1 and 293T-PB1 cocultures. The titers of the conditionally replicative influenza virions represent those in the supernatant at 48 hours after generating the virions by reverse genetics. There are 2 points for each mutation for the conditionally replicative influenza virions because viruses were rescued twice from independent plasmid preparations. The mutations selected for validation were chosen to span a range of cell-entry effects. R indicates the Pearson correlation. The data underlying this panel can be found in S10 Data.

(EPS)

pbio.3002916.s002.eps (9.1MB, eps)
S3 Fig. H5 pseudovirus titers on cell lines expressing different sialic-acid receptors.

(A) Titers of the lentiviral particles pseudotyped with the unmutated H5 HA on standard 293 cells and the α2–3 or α2–6 293 cells that express α2-3-linked or α2-6-linked sialic acids [33]. The virions can infect all cells, but the titers are substantially lower on α2–6 cells, consistent with the fact that avian H5 HAs are better at utilizing α2-3-linked sialic acids. (B) Correlation between the effects of mutation on entry in α2–3 and α2–6 cells as measured with each of the 2 independent replicate pseudovirus libraries. The Pearson correlation is indicated at the top left. The measurements are noisier (more poorly correlated between replicates) for the α2–6 cells as there is worse entry into those cells for the parental library leading to worse sampling of most mutations in the deep mutational scanning as fewer virions are successfully able to enter these cells even in the absence of mutations. The data underlying this panel can be found in S2 Data. Throughout the rest of the paper, we report the average effect across the 2 library replicates.

(EPS)

S4 Fig. Correlations among effects of HA mutations on entry into different cell lines.

Correlations among the measured effects of mutations on entry into 293T, α2–3, and α2–6 cells. Note that the measurements for entry into α2–6 cells are more poorly correlated with the other cells for 2 reasons. First, there are some mutations that actually increase usage of α2-6-linked sialic acid as highlighted in Fig 3; these mutations are characterized by good overall entry into the α2–6 cells relative to other cells. But additionally, since the parental H5 HA used in our experiments is relatively poor at entering the α2–6 cells and mutation effects are relative to that parent, many mutations that are poor overall for cell entry nonetheless are less deleterious relative the parent (not as negative cell entry effect) on α2–6 cells compared to the other cells. In the scatter plots, R indicates the Pearson correlation. See https://dms-vep.org/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/a26_usage.html for more detailed interactive plots of the effects of mutations on entry into the different cells. The data underlying this figure can be found in S2 Data.

(EPS)

pbio.3002916.s004.eps (12.3MB, eps)
S5 Fig. Validation of stability enhancing HA mutations.

(A) Correlations between the effects of mutations on HA stability as measured by the deep mutational scanning for each of the 2 independent library replicates. Experiments 1 and 2 were performed on different dates. Only stabilizing mutations are shown. The Pearson correlation is shown in the upper left. Throughout the rest of this paper, we report the average effect across libraries and experiments. The data underlying this panel can be found in S3 Data. (B and C) Stability-enhancing mutations identified in deep mutational scanning were validated using the conditionally replicative influenza virions. (B) Conditionally replicative influenza virions carrying each of the indicated mutations were incubated at a buffer of the indicated pH for 30 minutes, diluted into a neutral pH buffer, and then used to infect target cells. The top row shows the percent of cells infected by each virus after treatment at each pH, and the bottom row shows the titer per cell (MOI) calculated using the Poisson distribution. (C) The infectivity remaining for each mutant at the indicated pHs relative to pH 6.9 (as calculated from the bottom row of panel (B) versus the stability effects of the mutation measured in the deep mutational scanning. The Pearson correlation is indicated in the upper-left of each plot. The data underlying panels B and C can be found in S4 Data.

(EPS)

pbio.3002916.s005.eps (3.5MB, eps)
S6 Fig. Measuring escape from sera neutralization.

(A) Workflow for measuring escape from polyclonal serum using the pseudovirus libraries. The libraries are combined with a nonneutralizable standard virus that is not affected by the serum antibodies (in this case, VSV-G pseudovirus was used for mice sera and RdPro pseudovirus for ferret sera) and then incubated with increasing concentrations of sera and infected into 293T cells. Neutralization is quantified by comparing the frequency of each viral mutant barcode to that of the nonneutralizable standard, as described previously [18]. (B) All mouse sera came from mice that were vaccinated and boosted with recombinant HA protein that exactly matched the parental strain used in the deep mutational scanning. Ferret sera came from animals that were infected with 2.3.4.4b clade viruses or infected and then boosted with matched inactivated virus. For 2 of the ferrets, the immunizing strain differed by a single amino-acid mutation from the parental strain used in the deep mutational scanning.

(EPS)

pbio.3002916.s006.eps (3.2MB, eps)
S7 Fig. Effects of mutations on escape from neutralization for sera from individual mice and ferrets.

Total escape caused by mutations at each site in HA for individual mice and ferret sera used in this study. There is some animal-to-animal variation even within species: For instance, the sera of most mice is primarily escaped by mutations in antigenic regions A and B, but the serum from mouse-1-01 is primarily escaped by mutations in antigenic region E. See https://dms-vep.org/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/escape.html for interactive plots. The data underlying this figure can be found in S5 Data.

(EPS)

pbio.3002916.s007.eps (29.7MB, eps)
S8 Fig. Neutralization of pseudoviruses versus conditionally replicative influenza viruses.

Comparison between mouse sera neutralization of A/AmericanWigeon/SouthCarolina/USDA-000345-001/2021 pseudovirus and conditionally replicative influenza virus with the same HA. Note that the pseudoviruses (pseudotyped lentiviral particles) are more sensitive to neutralization than the actual influenza virions. The data underlying this figure can be found in S11 Data.

(EPS)

pbio.3002916.s008.eps (863KB, eps)
S9 Fig. Mouse sera neutralization and α2-6-linked sialic acid binding increase for circulating 2.3.4.4b strains.

Trees similar to those in Fig 6 but showing different phenotypes, namely, (A) mouse sera escape and (B) increase in α2-6-linked sialic acid binding. (B) shows sequences starting from 2019, whereas (A) only shows sequences starting from 2020. The tree file can be found in S1 Data. (C) HAs for selected strains carrying mutations that caused strong escape from sera neutralization in the deep mutational scanning were assayed for neutralization by the mouse sera in influenza-based neutralization assays (S10B Fig). These scatter plots show the neutralization measured in the influenza virus assays (y-axis) versus the predicted escape calculated as the simple sum of the mutations in each HA relative to the strain used in the deep mutational scanning. The data underlying this figure can be found in S12 Data. The full list of mutations in each strain is provided in S10D Fig.

(EPS)

pbio.3002916.s009.eps (11.1MB, eps)
S10 Fig. Sera neutralization and stability of circulating 2.3.4.4b strains.

(A) Ferret and (B) mouse sera neutralization curves for 2.3.4.4b clade strains measured using conditionally replicative influenza virus. These are the neutralization curves that generated the data plotted in Figs 6C and S9C. The data underlying panels (A) and (B) can be found in S7 and S12 Datas, respectively. (C) same as Fig 6D but with infectivity measured at pH 5.3 and 5.5. (D) A list of HA amino-acid mutations (H3 numbering) in circulating 2.3.4.4b strains used for validation relative to the A/AmericanWigeon/SouthCarolina/USDA-000345-001/2021 strain used for the deep mutational scanning. The data underlying this figure can be found in S8 Data.

(EPS)

pbio.3002916.s010.eps (11.1MB, eps)
S1 Data. Tree file for H5Nx phylogenetic tree.

(JSON)

pbio.3002916.s011.json (7.8MB, json)
S2 Data. HA mutation effects on cell entry. Mutation functional effects measured on 293T, 293, α2–3, and α2–6 cells for 2 independent libraries.

(XLSX)

pbio.3002916.s012.xlsx (3.1MB, xlsx)
S3 Data. Mutation effects on HA stability. Mutation effects on pH-based HA stability measured for 2 independent libraries and repeated different dates.

(XLSX)

pbio.3002916.s013.xlsx (702.4KB, xlsx)
S4 Data. Raw values for conditionally replicative virus infectivity at different pHs.

(XLSX)

pbio.3002916.s014.xlsx (29.8KB, xlsx)
S5 Data. Effects of mutations to HA on sera escape. Serum escape values for 2 independent libraries measured for different animal sera.

(XLSX)

pbio.3002916.s015.xlsx (5.8MB, xlsx)
S6 Data. Mouse serum neutralization for conditionally replicative influenza virus.

Fraction infectivity values used to calculate mice sera neutralization values for different conditionally replicative influenza virus variants.

(CSV)

pbio.3002916.s016.csv (19.5KB, csv)
S7 Data. Ferret serum neutralization for different influenza strains.

Fraction infectivity values used to calculate ferret sera neutralization for conditionally replicative influenza virus with HA from different H5 strains.

(CSV)

pbio.3002916.s017.csv (11.4KB, csv)
S8 Data. Fraction infectivity retained for conditionally replicative influenza virus at different pHs.

Fraction infectivity retained at different pH values measured for conditionally replicative influenza virus with HA from different virus strains.

(CSV)

S9 Data. Table linking library barcodes with mutations in HA for 2 independent libraries.

(CSV)

pbio.3002916.s019.csv (3.7MB, csv)
S10 Data. Conditionally replicative virus titers and deep mutational scanning measured functional effects.

Conditionally replicative virus variant titers measured on MDCK-SIAT1-TMPRSS2-PB1 and 293T-PB1 cocultures compared to mutation cell entry effects measured using deep mutational scanning.

(CSV)

pbio.3002916.s020.csv (352B, csv)
S11 Data. Comparison between neutralization of conditionally replicative influenza virus and pseudovirus.

Fraction infectivity retained at different mouse serum concentrations for conditionally replicative influenza virus and H5 HA pseudovirus.

(CSV)

pbio.3002916.s021.csv (11KB, csv)
S12 Data. Mouse serum neutralization for different influenza strains.

Fraction infectivity values used to calculate mice sera neutralization for conditionally replicative influenza virus with HA from different H5 strains.

(CSV)

pbio.3002916.s022.csv (20.4KB, csv)
S13 Data. GISAID accession numbers used to build H5Nx phylogenetic tree.

(CSV)

pbio.3002916.s023.csv (763KB, csv)

Acknowledgments

We thank Henrik Clausen and Yoshiki Narimatsu from Copenhagen Center for Glycomics for sharing the α2–3 and α2–6 cells used in this study. We thank Charles Russell for helpful advice about HA stability. We thank Brendan Larsen for help with the homepage hosting the interactive plots. We thank all the scientists who contributed H5 HA sequences shown in the phylogenetic trees in this paper to Genbank or GISAID.

Abbreviations

APHIS

Animal and Plant Health Inspection Service

CCS

circular consensus sequence

FSAP

Federal Select Agent Program

HA

hemagglutinin

IACUC

Institutional Animal Care and Use Committees

NA

neuraminidase

NP

nucleoprotein

PB2

polymerase basic 2

USDA

US Department of Agriculture

Data Availability

The data reported in this paper are available in both interactive plots and spreadsheets of numerical values. See https://dms-vep.org/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS for a webpage that hosts interactive plots that allow easy visualization and querying of the data. We recommend readers initially examine the summary page (https://dms-vep.org/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/summary.html), and then query the additional more detailed plots provided for individual phenotypes as needed. Note that the plots provided for individual phenotypes on the homepage show the measurement for each individual replicate library when you mouse over the heatmaps, which can provide a useful way to assess the reproducibility associated with any specific measurement. See https://github.com/dms-vep/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/blob/main/results/summaries/phenotypes.csv for the numerical values of the measurements with our recommended filtering for only high-confidence measurements; the main interactive homepage mentioned above also contains links to additional CSV files with less filtered values and per-replicate measurements (although be sure you understand the filters before using those). The data can be viewed interactively in the context of the H5 HA protein structure using dms-viz at https://dms-viz.github.io/v0/?data=https%3A%2F%2Fraw.githubusercontent.com%2Fdms-vep%2FFlu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS%2Fmain%2Fresults%2Fdms-viz%2Fdms-viz.json See https://github.com/dms-vep/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS for a GitHub repository that contains all of the computer code used in the analysis. This repository has been archived on Zenodo under DOI: 10.5281/zenodo.13993815. The repository provides instructions on how to run the pipeline either by starting with raw fastq files, which can be downloaded from BioProject PRJNA1123200, or from already pre-build barcode count files. The results directory in the GitHub repository contains all the raw data values generated by the pipeline. The data used to generate the figures in this manuscript is also provided as Supporting Data. Most of the analysis was performed using dms-vep-pipeline-3 (https://github.com/dms-vep/dms-vep-pipeline-3), version 3.13.0. Note that rerunning the pipeline on different computers may give slightly different numbers due to platform-specific differences in floating point precision. See https://nextstrain.org/groups/moncla-lab/h5nx/h5-dms/usa-clade-2344 b for interactive Nextstrain trees with the deep mutational phenotypes used to color the nodes; you can use the dropdown boxes at left to select which phenotype to use to color the nodes (the Color By box) or what set of sequences to show (eg, all-clades, clade-2344b, or usa-clade-2344b). The parental sequence used for the deep mutational scanning is marked with an “x”, and the estimated phenotypes of other sequences are expected to become less accurate the more diverged those sequences are from this parental sequence. Code for annotating DMS data onto phylogenies is available at https://github.com/moncla-lab/annotate-dms.

Funding Statement

NIH/NIAID grant R01AI141707 (J.D.B.) NIH/NIAID grant R01AI165821 (J.D.B.) NIH/NIAID grant AI47029 (L.H.M.) NIH/NIAID contract 75N93021C00015 (S.E.H., J.D.B., and L.H.M.) Gift from Open Philanthropy (to N.P.K.) The Biotechnology and Biological Sciences Research Council via the Pirbright Institute’s Strategic Programme grants BBS/E/PI/230002A and BBS/E/PI/230002B] (T.P.P) The UK Medical Research Council/Department for Environment, Food and Rural Affairs FluTrailMap-One Health consortium grants MR/Y03368X/1 and BB/Y007298/1 (T.P.P, Y.L.) The UK Medical Research Council grant MR/R010757/1 (Y.L) NIH grant S10-OD-020069 (Fred Hutch Scientific Computing) NIH grant S10-OD-028685 (Fred Hutch Scientific Computing). Howard Hughes Medical Institute (J.D.B.) The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Imai M, Watanabe T, Hatta M, Das SC, Ozawa M, Shinya K, et al. Experimental adaptation of an influenza H5 HA confers respiratory droplet transmission to a reassortant H5 HA/H1N1 virus in ferrets. Nature. 2012. Jun;486(7403):420–428. doi: 10.1038/nature10831 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Reed ML, Bridges OA, Seiler P, Kim JK, Yen HL, Salomon R, et al. The pH of Activation of the Hemagglutinin Protein Regulates H5N1 Influenza Virus Pathogenicity and Transmissibility in Ducks. J Virol. 2010. Feb;84(3):1527–1535. doi: 10.1128/JVI.02069-09 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Herfst S, Schrauwen EJA, Linster M, Chutinimitkul S, de Wit E, Munster VJ, et al. Airborne Transmission of Influenza A/H5N1 Virus Between Ferrets. Science. 2012. Jun 22;336(6088):1534–41. doi: 10.1126/science.1213362 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Benjamin Mänz, Martin Schwemmle, Linda Brunotte. Adaptation of Avian Influenza A Virus Polymerase in Mammals To Overcome the Host Species Barrier. J Virol. 2013. Jul 1;87(13):7200–7209. doi: 10.1128/JVI.00980-13 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Long JS, Giotis ES, Moncorgé O, Frise R, Mistry B, James J, et al. Species difference in ANP32A underlies influenza A virus polymerase host restriction. Nature. 2016. Jan;529(7584):101–104. doi: 10.1038/nature16474 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Pinto RM, Bakshi S, Lytras S, Zakaria MK, Swingler S, Worrell JC, et al. BTN3A3 evasion promotes the zoonotic potential of influenza A viruses. Nature. 2023. Jul 1;619(7969):338–347. doi: 10.1038/s41586-023-06261-8 [DOI] [PubMed] [Google Scholar]
  • 7.Gabriel G, Klingel K, Otte A, Thiele S, Hudjetz B, Arman-Kalcek G, et al. Differential use of importin-α isoforms governs cell tropism and host adaptation of influenza virus. Nat Commun. 2011. Jan 18;2(1):156. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Götz V, Magar L, Dornfeld D, Giese S, Pohlmann A, Höper D, et al. Influenza A viruses escape from MxA restriction at the expense of efficient nuclear vRNP import. Sci Rep. 2016. Mar 18;6(1):23138. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Ayora-Talavera G, Shelton H, Scull MA, Ren J, Jones IM, Pickles RJ, et al. Mutations in H5N1 Influenza Virus Hemagglutinin that Confer Binding to Human Tracheal Airway Epithelium. PLoS ONE. 2009. Nov 18;4(11):e7836. doi: 10.1371/journal.pone.0007836 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Ha Y, Stevens DJ, Skehel JJ, Wiley DC. X-ray structure of the hemagglutinin of a potential H3 avian progenitor of the 1968 Hong Kong pandemic influenza virus☆. Virology. 2003. May 10;309(2):209–218. [DOI] [PubMed] [Google Scholar]
  • 11.Lin YP, Xiong X, Wharton SA, Martin SR, Coombs PJ, Vachieri SG, et al. Evolution of the receptor binding properties of the influenza A(H3N2) hemagglutinin. Proc Natl Acad Sci U S A. 2012. Dec 26;109(52):21474–21479. doi: 10.1073/pnas.1218841110 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Shinya K, Ebina M, Yamada S, Ono M, Kasai N, Kawaoka Y. Influenza virus receptors in the human airway. Nature. 2006. Mar;440(7083):435–436. [DOI] [PubMed] [Google Scholar]
  • 13.Linster M, van Boheemen S, de Graaf M, Schrauwen EJA, Lexmond P, Mänz B, et al. Identification, Characterization, and Natural Selection of Mutations Driving Airborne Transmission of A/H5N1 Virus. Cell. 2014. Apr 10;157(2):329–339. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Hanson A, Imai M, Hatta M, McBride R, Imai H, Taft A, et al. Identification of Stabilizing Mutations in an H5 Hemagglutinin Influenza Virus Protein. J Virol. 2016. Feb 26;90(6):2981–2992. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.WHO. A(H5N1)—Northern hemisphere 2024–2025 [Internet]. 2024 [cited 2024 Apr 22]. Available from: https://www.who.int/publications/m/item/a(h5n1)—northern-hemisphere-2024-2025.
  • 16.Fleishman SJ, Whitehead TA, Ekiert DC, Dreyfus C, Corn JE, Strauch EM, et al. Computational Design of Proteins Targeting the Conserved Stem Region of Influenza Hemagglutinin. Science. 2011. May 13;332(6031):816–821. doi: 10.1126/science.1202617 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Bennett NR, Watson JL, Ragotte RJ, Borst AJ, See DL, Weidle C, et al. Atomically accurate de novo design of single-domain antibodies [Internet]. bioRxiv [Preprint]. 2024. [cited 2024 May 16]. p. 2024.03.14.585103. Available from: https://www.biorxiv.org/content/10.1101/2024.03.14.585103v1. [Google Scholar]
  • 18.Dadonaite B, Crawford KHD, Radford CE, Farrell AG, Yu TC, Hannon WW, et al. A pseudovirus system enables deep mutational scanning of the full SARS-CoV-2 spike. Cell. 2023. Mar 16;186(6):1263–1278.e20. doi: 10.1016/j.cell.2023.02.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Kubota M, Takeuchi K, Watanabe S, Ohno S, Matsuoka R, Kohda D, et al. Trisaccharide containing α2,3-linked sialic acid is a receptor for mumps virus. Proc Natl Acad Sci U S A. 2016. Oct 11;113(41):11579–11584. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Wang Q, Wang Y, Yang S, Lin C, Aliyu L, Chen Y, et al. A Linkage-specific Sialic Acid Labeling Strategy Reveals Different Site-specific Glycosylation Patterns in SARS-CoV-2 Spike Protein Produced in CHO and HEK Cell Substrates. Front Chem. 2021. Sep 24;9:735558. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Carr CM, Kim PS. A spring-loaded mechanism for the conformational change of influenza hemagglutinin. Cell. 1993. May 21;73(4):823–32. doi: 10.1016/0092-8674(93)90260-w [DOI] [PubMed] [Google Scholar]
  • 22.Kirkpatrick E, Qiu X, Wilson PC, Bahl J, Krammer F. The influenza virus hemagglutinin head evolves faster than the stalk domain. Sci Rep. 2018. Jul 11;8(1):10432. doi: 10.1038/s41598-018-28706-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Thyagarajan B, Bloom JD. The inherent mutational tolerance and antigenic evolvability of influenza hemagglutinin. -, editor. eLife. 2014. Jul 8;3:e03300. doi: 10.7554/eLife.03300 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Wu NC, Young AP, Al-Mawsawi LQ, Olson CA, Feng J, Qi H, et al. High-throughput profiling of influenza A virus hemagglutinin gene at single-nucleotide resolution. Sci Rep. 2014. May 13;4(1):4942. doi: 10.1038/srep04942 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Heaton NS, Sachs D, Chen CJ, Hai R, Palese P. Genome-wide mutagenesis of influenza virus reveals unique plasticity of the hemagglutinin and NS1 proteins. Proc Natl Acad Sci U S A. 2013. Dec 10;110(50):20248–20253. doi: 10.1073/pnas.1320524110 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Wiley DC, Wilson IA, Skehel JJ. Structural identification of the antibody-binding sites of Hong Kong influenza haemagglutinin and their involvement in antigenic variation. Nature. 1981. Jan 1;289(5796):373–8. doi: 10.1038/289373a0 [DOI] [PubMed] [Google Scholar]
  • 27.Skehel JJ, Stevens DJ, Daniels RS, Douglas AR, Knossow M, Wilson IA, et al. A carbohydrate side chain on hemagglutinins of Hong Kong influenza viruses inhibits recognition by a monoclonal antibody. Proc Natl Acad Sci U S A. 1984;81(6):1779–1783. doi: 10.1073/pnas.81.6.1779 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Bloom JD, Gong LI, Baltimore D. Permissive Secondary Mutations Enable the Evolution of Influenza Oseltamivir Resistance. Science. 2010. Jun 4;328(5983):1272–5. doi: 10.1126/science.1187816 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Francis T, Moore AE. A STUDY OF THE NEUROTROPIC TENDENCY IN STRAINS OF THE VIRUS OF EPIDEMIC INFLUENZA. J Exp Med. 1940. Nov 30;72(6):717–728. doi: 10.1084/jem.72.6.717 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Ríos Carrasco M, Gröne A, van den Brand JMA, de Vries RP. The mammary glands of cows abundantly display receptors for circulating avian H5 viruses. J Virol. 2024. Oct 10;0(0):e01052–24. doi: 10.1128/jvi.01052-24 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Chopra P, Page CK, Shepard JD, Ray SD, Kandeil A, Jeevan T, et al. Receptor Binding Specificity of a Bovine A(H5N1) Influenza Virus [Internet]. bioRxiv [Preprint]. 2024. [cited 2024 Oct 14]. p. 2024.07.30.605893. Available from: https://www.biorxiv.org/content/10.1101/2024.07.30.605893v1. [Google Scholar]
  • 32.Santos JJS, Wang S, McBride R, Zhao Y, Paulson JC, Hensley SE. Bovine H5N1 influenza virus binds poorly to human-type sialic acid receptors [Internet]. bioRxiv [Preprint]. 2024. [cited 2024 Oct 14]. p. 2024.08.01.606177. Available from: https://www.biorxiv.org/content/10.1101/2024.08.01.606177v1. [Google Scholar]
  • 33.Narimatsu Y, Joshi HJ, Nason R, Van Coillie J, Karlsson R, Sun L, et al. An Atlas of Human Glycosylation Pathways Enables Display of the Human Glycome by Gene Engineered Cells. Mol Cell. 2019. Jul 25;75(2):394–407.e5. doi: 10.1016/j.molcel.2019.05.017 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Rui Xu, Ryan McBride, Paulson James C., Basler Christopher F., Wilson Ian A. Structure, Receptor Binding, and Antigenicity of Influenza Virus Hemagglutinins from the 1957 H2N2 Pandemic. J Virol. 2010. Feb 15;84(4):1715–1721. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Matrosovich M, Tuzikov A, Bovin N, Gambaryan A, Klimov A, Castrucci MR, et al. Early Alterations of the Receptor-Binding Properties of H1, H2, and H3 Avian Influenza Virus Hemagglutinins after Their Introduction into Mammals. J Virol. 2000. Sep;74(18):8502–8512. doi: 10.1128/jvi.74.18.8502-8512.2000 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Stevens J, Blixt O, Tumpey TM, Taubenberger JK, Paulson JC, Wilson IA. Structure and Receptor Specificity of the Hemagglutinin from an H5N1 Influenza Virus. Science. 2006. Apr 21;312(5772):404–410. doi: 10.1126/science.1124513 [DOI] [PubMed] [Google Scholar]
  • 37.Chutinimitkul S, van Riel D, Munster VJ, van den Brand JMA, Rimmelzwaan GF, Kuiken T, et al. In Vitro Assessment of Attachment Pattern and Replication Efficiency of H5N1 Influenza A Viruses with Altered Receptor Specificity. J Virol. 2010. Jul;84(13):6825–6833. doi: 10.1128/JVI.02737-09 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Qi Li, Kash John C., Dugan Vivien G., Wang Ruixue, Jin Guozhong, Cunningham Robert E., et al. Role of Sialic Acid Binding Specificity of the 1918 Influenza Virus Hemagglutinin Protein in Virulence and Pathogenesis for Mice. J Virol. 2009 Apr 15;83(8):3754–3761. [DOI] [PMC free article] [PubMed]
  • 39.de Vries Robert P., de Vries Erik, Martínez-Romero Carles, McBride Ryan, van Kuppeveld Frank J., Rottier Peter J. M., et al. Evolution of the Hemagglutinin Protein of the New Pandemic H1N1 Influenza Virus: Maintaining Optimal Receptor Binding by Compensatory Substitutions. J Virol. 2013. Dec 15;87(24):13868–13877. doi: 10.1128/JVI.01955-13 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Kikutani Y, Okamatsu M, Nishihara S, Takase-Yoden S, Hiono T, de Vries RP, et al. E190V substitution of H6 hemagglutinin is one of key factors for binding to sulfated sialylated glycan receptor and infection to chickens. Microbiol Immunol. 2020;64(4):304–312. doi: 10.1111/1348-0421.12773 [DOI] [PubMed] [Google Scholar]
  • 41.Peacock TP, Sealy JE, Harvey WT, Benton DJ, Reeve R, Iqbal M. Genetic Determinants of Receptor-Binding Preference and Zoonotic Potential of H9N2 Avian Influenza Viruses. J Virol. 2021. Feb 10;95(5):e01651–20. doi: 10.1128/JVI.01651-20 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Zhang Y, Zhang Q, Gao Y, He X, Kong H, Jiang Y, et al. Key Molecular Factors in Hemagglutinin and PB2 Contribute to Efficient Transmission of the 2009 H1N1 Pandemic Influenza Virus. J Virol. 2012 Sep;86(18):9666–9674. doi: 10.1128/JVI.00958-12 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Gambaryan A, Tuzikov A, Pazynina G, Bovin N, Balish A, Klimov A. Evolution of the receptor binding phenotype of influenza A (H5) viruses. Virology. 2006. Jan 20;344(2):432–8. doi: 10.1016/j.virol.2005.08.035 [DOI] [PubMed] [Google Scholar]
  • 44.Gulati S, Smith DF, Cummings RD, Couch RB, Griesemer SB, St. George K, et al. Human H3N2 Influenza Viruses Isolated from 1968 To 2012 Show Varying Preference for Receptor Substructures with No Apparent Consequences for Disease or Spread. PLoS ONE. 2013. Jun 21;8(6):e66325. doi: 10.1371/journal.pone.0066325 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Canales A, Sastre J, Orduña JM, Spruit CM, Pérez-Castells J, Domínguez G, et al. Revealing the Specificity of Human H1 Influenza A Viruses to Complex N-Glycans. JACS Au. 2023. Mar 27;3(3):868–878. doi: 10.1021/jacsau.2c00664 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Byrd-Leotis L, Jia N, Matsumoto Y, Lu D, Kawaoka Y, Steinhauer DA, et al. Sialylated and sulfated N-Glycans in MDCK and engineered MDCK cells for influenza virus studies. Sci Rep. 2022. Jul 26;12:12757. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Lei R, Liang W, Ouyang WO, Hernandez Garcia A, Kikuchi C, Wang S, et al. Epistasis mediates the evolution of the receptor binding mode in recent human H3N2 hemagglutinin. Nat Commun. 2024. Jun 18;15(1):5175. doi: 10.1038/s41467-024-49487-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Zaraket H, Bridges OA, Duan S, Baranovich T, Yoon SW, Reed ML, et al. Increased Acid Stability of the Hemagglutinin Protein Enhances H5N1 Influenza Virus Growth in the Upper Respiratory Tract but Is Insufficient for Transmission in Ferrets. J Virol. 2013. Sep;87(17):9911–9922. doi: 10.1128/JVI.01175-13 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Shelton H, Roberts KL, Molesti E, Temperton N, Barclay WS. Mutations in haemagglutinin that affect receptor binding and pH stability increase replication of a PR8 influenza virus with H5 HA in the upper respiratory tract of ferrets and may contribute to transmissibility. J Gen Virol. 2013. Jun;94(Pt 6):1220–1229. doi: 10.1099/vir.0.050526-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Russier M, Yang G, Rehg JE, Wong SS, Mostafa HH, Fabrizio TP, et al. Molecular requirements for a pandemic influenza virus: An acid-stable hemagglutinin protein. Proc Natl Acad Sci U S A. 2016. Feb 9;113(6):1636–1641. doi: 10.1073/pnas.1524384113 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Yang G, Ojha CR, Russell CJ. Relationship between hemagglutinin stability and influenza virus persistence after exposure to low pH or supraphysiological heating. PLoS Pathog. 2021. Sep 3;17(9):e1009910. doi: 10.1371/journal.ppat.1009910 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Scholtissek C. Stability of infectious influenza A viruses at low pH and at elevated temperature. Vaccine. 1985. Sep 1;3(3):215–218. doi: 10.1016/0264-410x(85)90109-4 [DOI] [PubMed] [Google Scholar]
  • 53.Galloway SE, Reed ML, Russell CJ, Steinhauer DA. Influenza HA Subtypes Demonstrate Divergent Phenotypes for Cleavage Activation and pH of Fusion: Implications for Host Range and Adaptation. PLoS Pathog. 2013. Feb 14;9(2):e1003151. doi: 10.1371/journal.ppat.1003151 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Poulson RL, Tompkins SM, Berghaus RD, Brown JD, Stallknecht DE. Environmental Stability of Swine and Human Pandemic Influenza Viruses in Water under Variable Conditions of Temperature, Salinity, and pH. Appl Environ Microbiol. 2016. Jun 13;82(13):3721–3726. doi: 10.1128/AEM.00133-16 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Russier M, Yang G, Briard B, Meliopoulos V, Cherry S, Kanneganti TD, et al. Hemagglutinin Stability Regulates H1N1 Influenza Virus Replication and Pathogenicity in Mice by Modulating Type I Interferon Responses in Dendritic Cells. J Virol. 2020. Jan 17;94(3):e01423–19. doi: 10.1128/JVI.01423-19 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Xu R, Wilson IA. Structural Characterization of an Early Fusion Intermediate of Influenza Virus Hemagglutinin ▿. J Virol. 2011. May;85(10):5172–5182. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Singanayagam A, Zambon M, Barclay WS. Influenza Virus with Increased pH of Hemagglutinin Activation Has Improved Replication in Cell Culture but at the Cost of Infectivity in Human Airway Epithelium. J Virol. 2019. Aug 13;93(17):e00058–19. doi: 10.1128/JVI.00058-19 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Milder FJ, Jongeneelen M, Ritschel T, Bouchier P, Bisschop IJM, de Man M, et al. Universal stabilization of the influenza hemagglutinin by structure-based redesign of the pH switch regions. Proc Natl Acad Sci U S A. 2022. Feb 8;119(6):e2115379119. doi: 10.1073/pnas.2115379119 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Kucharski AJ, Lessler J, Read JM, Zhu H, Jiang CQ, Guan Y, et al. Estimating the Life Course of Influenza A(H3N2) Antibody Responses from Cross-Sectional Data. PLoS Biol. 2015. Mar 3;13(3):e1002082. doi: 10.1371/journal.pbio.1002082 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Broecker F, Liu STH, Sun W, Krammer F, Simon V, Palese P. Immunodominance of Antigenic Site B in the Hemagglutinin of the Current H3N2 Influenza Virus in Humans and Mice. J Virol. 2018. Sep 26;92(20):e01100–18. doi: 10.1128/JVI.01100-18 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Nguyen TQ, Hutter C, Markin A, Thomas MN, Lantz K, Killian ML, et al. Emergence and interstate spread of highly pathogenic avian influenza A(H5N1) in dairy cattle [Internet]. bioRxiv [Preprint]. 2024. [cited 2024 May 2]. p. 2024.05.01.591751. Available from: https://www.biorxiv.org/content/10.1101/2024.05.01.591751v1. [Google Scholar]
  • 62.Agüero M, Monne I, Sánchez A, Zecchin B, Fusaro A, Ruano MJ, et al. Highly pathogenic avian influenza A(H5N1) virus infection in farmed minks, Spain, October 2022. Eurosurveillance. 2023. Jan 19;28(3):2300001. doi: 10.2807/1560-7917.ES.2023.28.3.2300001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Puryear W, Sawatzki K, Hill N, Foss A, Stone JJ, Doughty L, et al. Highly Pathogenic Avian Influenza A(H5N1) Virus Outbreak in New England Seals, United States. Emerg Infect Dis J. 2023. Apr [cited 2024 May 15];29(4). Available from: https://wwwnc.cdc.gov/eid/article/29/4/22-1538_article. doi: 10.3201/eid2904.221538 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Hadfield J, Megill C, Bell SM, Huddleston J, Potter B, Callender C, et al. Nextstrain: real-time tracking of pathogen evolution. Bioinformatics. 2018. Dec 1;34(23):4121–4123. doi: 10.1093/bioinformatics/bty407 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Restori KH, Septer KM, Field CJ, Patel DR, VanInsberghe D, Raghunathan V, et al. Risk assessment of a highly pathogenic H5N1 influenza virus from mink. Nat Commun. 2024. May 15;15(1):4112. doi: 10.1038/s41467-024-48475-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Kandeil A, Patton C, Jones JC, Jeevan T, Harrington WN, Trifkovic S, et al. Rapid evolution of A(H5N1) influenza viruses after intercontinental spread to North America. Nat Commun. 2023. May 29;14(1):3082. doi: 10.1038/s41467-023-38415-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Xu R, Ekiert DC, Krause JC, Hai R, Crowe JE, Wilson IA. Structural Basis of Preexisting Immunity to the 2009 H1N1 Pandemic Influenza Virus. Science. 2010. Apr 16;328(5976):357–60. doi: 10.1126/science.1186430 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Caserta LC, Frye EA, Butt SL, Laverack M, Nooruzzaman M, Covaleda LM, et al. Spillover of highly pathogenic avian influenza H5N1 virus to dairy cattle. Nature. 2024. Jul 25;1–3. doi: 10.1038/s41586-024-07849-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.WHO. Genetic and antigenic characteristics of clade 2.3.4.4b A(H5N1) viruses identified in dairy cattle in the United States of America [Internet]. 2024 [cited 2024 May 23]. Available from: https://www.who.int/publications/m/item/genetic-and-antigenic-characteristics-of-clade-2.3.4.4b-a(h5n1)-viruses-identified-in-dairy-cattle-in-the-united-states-of-america.
  • 70.Yuan Li, Sarah Arcos, Sabsay Kimberly R., te Velthuis Aartjan J. W., Lauring Adam S. Deep mutational scanning reveals the functional constraints and evolutionary potential of the influenza A virus PB1 protein. J Virol. 2023. Oct 26;97(11):e01329–23. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Lipsitch M, Bloom BR. Rethinking Biosafety in Research on Potential Pandemic Pathogens. MBio. 2012. Oct 9;3(5):e00360–12. doi: 10.1128/mBio.00360-12 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Esvelt KM. Inoculating science against potential pandemics and information hazards. PLoS Pathog. 2018. Oct 4;14(10):e1007286. doi: 10.1371/journal.ppat.1007286 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Lewis G, Millett P, Sandberg A, Snyder-Beattie A, Gronvall G. Information Hazards in Biotechnology. Risk Anal. 2019. May;39(5):975–981. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Gaymard A, Le Briand N, Frobert E, Lina B, Escuret V. Functional balance between neuraminidase and haemagglutinin in influenza viruses. Clin Microbiol Infect. 2016. Dec 1;22(12):975–983. doi: 10.1016/j.cmi.2016.07.007 [DOI] [PubMed] [Google Scholar]
  • 75.Guo Z, He Y, Benegal AN, Vahey MD. Neuraminidase activity modulates cellular co-infection during influenza A virus multicycle growth [Internet]. bioRxiv [Preprint]. 2022. [cited 2024 May 16]. p. 2022.09.20.508375. Available from: https://www.biorxiv.org/content/10.1101/2022.09.20.508375v1. [Google Scholar]
  • 76.Nataraj R, Chandra A, Kesavardhana S. Avian influenza virus neuraminidase stalk length and haemagglutinin glycosylation patterns reveal molecularly directed reassortment promoting the emergence of highly pathogenic clade 2.3.4.4b A (H5N1) viruses [Internet]. bioRxiv [Preprint].2024. [cited 2024 May 23]. p. 2024.05.22.595329. Available from: https://www.biorxiv.org/content/10.1101/2024.05.22.595329v1. [Google Scholar]
  • 77.Starr TN, Greaney AJ, Hannon WW, Loes AN, Hauser K, Dillen JR, et al. Shifting mutational constraints in the SARS-CoV-2 receptor-binding domain during viral evolution. Science. 2022. Jul 22;377(6604):420–424. doi: 10.1126/science.abo7896 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Wu NC, Thompson AJ, Xie J, Lin CW, Nycholat CM, Zhu X, et al. A complex epistatic network limits the mutational reversibility in the influenza hemagglutinin receptor-binding site. Nat Commun. 2018. Mar 28;9(1):1264. doi: 10.1038/s41467-018-03663-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Haddox HK, Dingens AS, Hilton SK, Overbaugh J, Bloom JD. Mapping mutational effects along the evolutionary landscape of HIV envelope. Chakraborty AK, editor. eLife. 2018. Mar 28;7:e34420. doi: 10.7554/eLife.34420 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Moulana A, Dupic T, Phillips AM, Chang J, Nieves S, Roffler AA, et al. Compensatory epistasis maintains ACE2 affinity in SARS-CoV-2 Omicron BA.1. Nat Commun. 2022;Nov 16;13(1):7011. doi: 10.1038/s41467-022-34506-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Hannon WW, Bloom JD. dms-viz: Structure-informed visualizations for deep mutational scanning and other mutation-based datasets. J Open Source Softw. 2024. Jul 17;9(99):6129. [Google Scholar]
  • 82.Crawford KHD, Bloom JD. alignparse: A Python package for parsing complex features from high-throughput long-read sequencing. J Open Source Softw. 2019;4(44):1915. doi: 10.21105/joss.01915 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Otwinowski J, McCandlish DM, Plotkin JB. Inferring the shape of global epistasis. Proc Natl Acad Sci U S A. 2018. Aug 7;115(32):E7550–E7558. doi: 10.1073/pnas.1804015115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Haddox HK, Galloway JG, Dadonaite B, Bloom JD, Matsen FA, DeWitt WS. Jointly modeling deep mutational scans identifies shifted mutational effects among SARS-CoV-2 spike homologs [Internet]. bioRxiv [Preprint]. 2023. [cited 2023 Oct 4]. p. 2023.07.31.551037. Available from: https://www.biorxiv.org/content/10.1101/2023.07.31.551037v1. [Google Scholar]
  • 85.Yu TC, Thornton ZT, Hannon WW, DeWitt WS, Radford CE, Matsen FA IV, et al. A biophysical model of viral escape from polyclonal antibodies. Virus Evol. 2022. Jul 1;8(2):veac110. doi: 10.1093/ve/veac110 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Dadonaite B, Brown J, McMahon TE, Farrell AG, Figgins MD, Asarnow D, et al. Spike deep mutational scanning helps predict success of SARS-CoV-2 clades. Nature. 2024. Jul;631(8021):617–626. doi: 10.1038/s41586-024-07636-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Crawford KHD, Eguia R, Dingens AS, Loes AN, Malone KD, Wolf CR, et al. Protocol and Reagents for Pseudotyping Lentiviral Particles with SARS-CoV-2 Spike Protein for Neutralization Assays. Viruses. 2020. May 6;12(5):513. doi: 10.3390/v12050513 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Loes AN, Tarabi RAL, Huddleston J, Touyon L, Wong SS, Cheng SMS, et al. High-throughput sequencing-based neutralization assay reveals how repeated vaccinations impact titers to recent human H1N1 influenza strains. J Virol. 2024. Sep 24;0(0):e00689–24. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Hooper KA, Bloom JD. A Mutant Influenza Virus That Uses an N1 Neuraminidase as the Receptor-Binding Protein. J Virol. 2013. Dec;87(23):12531–12540. doi: 10.1128/JVI.01889-13 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Doud MB, Lee JM, Bloom JD. How single mutations affect viral escape from broad and narrow antibodies to H1 influenza hemagglutinin. Nat Commun. 2018. Apr 11;9(1):1386. doi: 10.1038/s41467-018-03665-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91.Hoffmann E, Neumann G, Kawaoka Y, Hobom G, Webster RG. A DNA transfection system for generation of influenza A virus from eight plasmids. Proc Natl Acad Sci U S A. 2000. May 23;97(11):6108–6113. doi: 10.1073/pnas.100133697 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.Lee JM, Huddleston J, Doud MB, Hooper KA, Wu NC, Bedford T, et al. Deep mutational scanning of hemagglutinin helps predict evolutionary fates of human H3N2 influenza variants. Proc Natl Acad Sci U S A. 2018. Aug 28;115(35):E8276–E8285. doi: 10.1073/pnas.1806133115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.Loes AN, Tarabi RAL, Huddleston J, Touyon L, Wong SS, Cheng SMS, et al. High-throughput sequencing-based neutralization assay reveals how repeated vaccinations impact titers to recent human H1N1 influenza strains [Internet]. bioRxiv [Preprint].2024. [cited 2024 May 17]. p. 2024.03.08.584176. Available from: https://www.biorxiv.org/content/10.1101/2024.03.08.584176v1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.Katoh K, Misawa K, Kuma K, Miyata T. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 2002. Jul 15;30(14):3059–3066. doi: 10.1093/nar/gkf436 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95.Minh BQ, Schmidt HA, Chernomor O, Schrempf D, Woodhams MD, von Haeseler A, et al. IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era. Mol Biol Evol. 2020. May;37(5):1530–1534. doi: 10.1093/molbev/msaa015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96.Sagulenko P, Puller V, Neher RA. TreeTime: Maximum-likelihood phylodynamic analysis. Virus Evol. 2018. Jan 1;4(1):vex042. doi: 10.1093/ve/vex042 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Büll C, Nason R, Sun L, Van Coillie J, Madriz Sørensen D, Moons SJ, et al. Probing the binding specificities of human Siglecs by cell-based glycan arrays. Proc Natl Acad Sci U S A. 2021. Apr 27;118(17):e2026102118.S doi: 10.1073/pnas.2026102118 [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

Nonia Pariente, PhD

24 Oct 2024

Dear Jesse,

Thank you for submitting your manuscript entitled "Deep mutational scanning of H5 hemagglutinin to inform influenza virus surveillance" for consideration in PLOS Biology.

We have now had the time to assess your manuscript, your responses to the prior peer-review reports and the revisions made, as well as to discuss the file with an academic editor with relevant expertise. We consider that the revision has addressed the reviewer concerns well and we would like to publish your work.

However, before we can proceed, we need you to complete your submission in Editorial Manager by providing some metadata, and edit your manuscript so that it comply with our editorial policies (detailed below). To this end, please login to Editorial Manager where you will find the paper in the 'Submissions Needing Revisions' folder on your homepage. Please click 'Revise Submission' from the Action Links, complete (or edit) all additional questions in the submission questionnaire, and replace submission files, as needed.

a) Please confirm that the following financial statements are not missing the grant number: Gift from Open Philanthropy (to N.P.K.), UK Medical Research Council/Department for Environment (J.Y., Y.L., T.P.P), Food and Rural Affairs (Defra, UK) (J.Y., Y.L., T.P.P), and Howard Hughes Medical Institute (J.D.B.).

b) Thank you for providing the protocol number for the ferret experiments, please also include the protocol number for the murine experiments. Additionally, the Ethics statement needs to be a separate, independent (and the first) subheading in the Material & Methods section. You currently have it after “Serum generation”; please provide it separately and place it at the beginning of the Material & Methods section.

c) You may be aware of the PLOS Data Policy, which requires that all data be made available without restriction: http://journals.plos.org/plosbiology/s/data-availability. For more information, please also see this editorial: http://dx.doi.org/10.1371/journal.pbio.1001797

Please supply the numerical values either in the a supplementary file or as a permanent DOI’d deposition for the following figures:

Figure 2AC, 3A, 4CD, 5AD, 6CD, S1D, S2BD, S3AB, S4, S5ABC, S7, S8, S9C, S10ABC

NOTE: the numerical data provided should include all replicates AND the way in which the plotted mean and errors were derived (it should not present only the mean/average values).

e) Please provide the tree files for Fig. 1B, 6AB, S9AB

f) Please indicate in all relevant main and supplementary Figure legends where the data can be found, e.g. “The data underlying this Figure can be found in S1 Data” or “The data underlying this Figure can be found in https://doi.org/10.5281/zenodo.XXXXX

g) Please ensure that your Data Statement in the Editorial Manager submission system accurately describes where your data can be found and is in final format, as it will be published as written there.

h) Thank you for providing the code and additional information like plasmid maps on Github. However, per journal policy, we cannot accept sole deposition of code in GitHub, as this does not provide a DOI and can be silently changed after after publication. You can archive this version of your publicly available GitHub code and additional data in Zenodo. Once you do this, it will generate a DOI number, which you will need to provide in the Data Accessibility Statement (you are welcome to also provide the GitHub access information, as this may be useful for readers as well). See the process for doing this here: https://docs.github.com/en/repositories/archiving-a-github-repository/referencing-and-citing-content

Once your full submission is complete, your paper will undergo a series of checks by our journal Operations team and it will be sent to me. If the above editorial requests have been addressed, I will be able to issue an "accept in principle" decision and your manuscript will enter our production process.

To provide the metadata for your submission and address the request above, please Login to Editorial Manager (https://www.editorialmanager.com/pbiology) by next Monday 28 October. Please let me know if this process is likely to take longer.

With best wishes,

Nonia

Nonia Pariente, PhD,

Editor-in-Chief

PLOS Biology

npariente@plos.org

Decision Letter 1

Nonia Pariente, PhD

29 Oct 2024

Dear Jesse,

Thank you for the submission of your revised Research Article "Deep mutational scanning of H5 hemagglutinin to inform influenza virus surveillance" for publication in PLOS Biology. On behalf of my colleagues and the Academic Editor, Katherine Kedzierska, I am pleased to say that we can in principle accept your manuscript for publication, provided you address any remaining formatting and reporting issues. These will be detailed in an email you should receive within 2-3 business days from our colleagues in the journal operations team; no action is required from you until then. Please note that we will not be able to formally accept your manuscript and schedule it for publication until you have completed any requested changes.

Please take a minute to log into Editorial Manager at http://www.editorialmanager.com/pbiology/, click the "Update My Information" link at the top of the page, and update your user information to ensure an efficient production process.

PRESS

We frequently collaborate with press offices. If your institution or institutions have a press office, please notify them about your upcoming paper at this point, to enable them to help maximise its impact. If the press office is planning to promote your findings, we would be grateful if they could coordinate with biologypress@plos.org. If you have previously opted in to the early version process, we ask that you notify us immediately of any press plans so that we may opt out on your behalf.

We also ask that you take this opportunity to read our Embargo Policy regarding the discussion, promotion and media coverage of work that is yet to be published by PLOS. As your manuscript is not yet published, it is bound by the conditions of our Embargo Policy. Please be aware that this policy is in place both to ensure that any press coverage of your article is fully substantiated and to provide a direct link between such coverage and the published work. For full details of our Embargo Policy, please visit http://www.plos.org/about/media-inquiries/embargo-policy/.

Thank you again for choosing PLOS Biology for publication and supporting Open Access publishing. We look forward to publishing your study. 

Best,

Nonia

Nonia Pariente, PhD,

Editor-in-Chief

PLOS Biology

npariente@plos.org

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. H5 HA deep mutational scanning libraries.

    (A) Schematic of lentiviral backbone used to produce the pseudoviruses. The HA gene is encoded in the lentiviral backbone followed by a 16-nucleotide barcode after the stop codon. Other notable elements include the 5’ LTR, an inducible TRE3GS promoter that drives HA expression, a constitutively expressed ZsGreen/puromycin cassette, and a repaired 3’ LTR that allows reactivation of integrated proviral genomes. The full sequence of each HA mutant is linked to the barcode by long-read PacBio sequencing, enabling subsequent experiments to simply sequence the short barcode to identify the full HA mutant. Note that this backbone has previously been described [18], and the full plasmid encoding mCherry rather than HA is available on AddGene as item 204579. (B) A 2-step process is used to create genotype-phenotype linked viruses pseudotyped with HA, as described previously for other viral entry proteins [18]. First, VSV-G pseudotyped lentiviral particles are generated by standard transfection, including a plasmid expressing the influenza NA gene to ensure virions do not remain attached to cells via the HA protein binding cell-surface sialic acid. These lentiviral particles are then used to infect 293T-rTTA cells at low multiplicity of infection such that infected cells typically just have 1 integrated provirus, and cells containing proviruses are then selected using puromycin. Because the lentiviral backbone contains a full 3’ LTR, the proviral genomes can be reactivated by transfection of the lentiviral helper plasmids and a NA expression plasmid to produce a library of genotype-phenotype linked HA-pseudotyped virions. For selections designed to measure HA cell entry, we also produce virions pseudotyped with VSV-G to use as a control comparison, with cell entry quantified by the relative infectivity of each HA variant in the presence versus absence of VSV-G (since the VSV-G pseudotyped particles do not require a functional HA for cell entry; see S2A Fig). (C) Number of targeted and actual HA amino-acid mutations found in each of the 2 replicate mutant libraries. The last column indicates the number of unique barcoded HAs in each library; this number exceeds the number of unique mutations as many mutations are found in multiple different barcoded HA variants. (D) Number of HA amino-acid mutations in each barcoded HA variant in each library. The data underlying this panel can be found in S9 Data.

    (EPS)

    pbio.3002916.s001.eps (4.4MB, eps)
    S2 Fig. Deep mutational scanning for cell entry and validation using conditionally replicative influenza virions.

    (A) The effects of mutations on cell entry were measured by generating the pseudovirus libraries with virions expressing either the mutant HAs or unmodified VSV-G on their surface (see S1B Fig). The ability of each barcoded variant to enter 293T cells was then compared between the VSV-G and HA pseudotyped libraries. Entry of the HA pseudotyped variants depends on the function of HA, whereas entry of the VSV-G pseudotyped variants is independent of HA function. Therefore, the effects of HA mutations on cell entry can be determined by comparing the frequency of each variant between the 2 conditions. (B) Correlation between mutational effects on cell entry in 293T cells as measured in each of the 2 independent replicate libraries. The Pearson correlation is indicated at top left. Throughout the rest of this paper, we report the average effect across the 2 library replicates. The data underlying this panel can be found in S2 Data. (C) For validation assays, we used conditionally replicative influenza virions that can be safely studied at Biosafety Level 2. The virions have the essential PB1 polymerase gene replaced by eGFP [28] and derive their non-HA/NA proteins from the lab-adapted A/WSN/1933 (H1N1) strain [29]. These viruses can be generated by reverse genetics and propagated in cells that constitutively express the PB1 protein but cannot undergo replication in standard cells as they do not express PB1. (D) Deep mutational scanning measurements of how the indicated mutations affect cell entry in 293T cells versus the titers of conditionally replicative influenza virions carrying the same mutations in MDCK-SIAT1-TMPRSS2-PB1 and 293T-PB1 cocultures. The titers of the conditionally replicative influenza virions represent those in the supernatant at 48 hours after generating the virions by reverse genetics. There are 2 points for each mutation for the conditionally replicative influenza virions because viruses were rescued twice from independent plasmid preparations. The mutations selected for validation were chosen to span a range of cell-entry effects. R indicates the Pearson correlation. The data underlying this panel can be found in S10 Data.

    (EPS)

    pbio.3002916.s002.eps (9.1MB, eps)
    S3 Fig. H5 pseudovirus titers on cell lines expressing different sialic-acid receptors.

    (A) Titers of the lentiviral particles pseudotyped with the unmutated H5 HA on standard 293 cells and the α2–3 or α2–6 293 cells that express α2-3-linked or α2-6-linked sialic acids [33]. The virions can infect all cells, but the titers are substantially lower on α2–6 cells, consistent with the fact that avian H5 HAs are better at utilizing α2-3-linked sialic acids. (B) Correlation between the effects of mutation on entry in α2–3 and α2–6 cells as measured with each of the 2 independent replicate pseudovirus libraries. The Pearson correlation is indicated at the top left. The measurements are noisier (more poorly correlated between replicates) for the α2–6 cells as there is worse entry into those cells for the parental library leading to worse sampling of most mutations in the deep mutational scanning as fewer virions are successfully able to enter these cells even in the absence of mutations. The data underlying this panel can be found in S2 Data. Throughout the rest of the paper, we report the average effect across the 2 library replicates.

    (EPS)

    S4 Fig. Correlations among effects of HA mutations on entry into different cell lines.

    Correlations among the measured effects of mutations on entry into 293T, α2–3, and α2–6 cells. Note that the measurements for entry into α2–6 cells are more poorly correlated with the other cells for 2 reasons. First, there are some mutations that actually increase usage of α2-6-linked sialic acid as highlighted in Fig 3; these mutations are characterized by good overall entry into the α2–6 cells relative to other cells. But additionally, since the parental H5 HA used in our experiments is relatively poor at entering the α2–6 cells and mutation effects are relative to that parent, many mutations that are poor overall for cell entry nonetheless are less deleterious relative the parent (not as negative cell entry effect) on α2–6 cells compared to the other cells. In the scatter plots, R indicates the Pearson correlation. See https://dms-vep.org/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/a26_usage.html for more detailed interactive plots of the effects of mutations on entry into the different cells. The data underlying this figure can be found in S2 Data.

    (EPS)

    pbio.3002916.s004.eps (12.3MB, eps)
    S5 Fig. Validation of stability enhancing HA mutations.

    (A) Correlations between the effects of mutations on HA stability as measured by the deep mutational scanning for each of the 2 independent library replicates. Experiments 1 and 2 were performed on different dates. Only stabilizing mutations are shown. The Pearson correlation is shown in the upper left. Throughout the rest of this paper, we report the average effect across libraries and experiments. The data underlying this panel can be found in S3 Data. (B and C) Stability-enhancing mutations identified in deep mutational scanning were validated using the conditionally replicative influenza virions. (B) Conditionally replicative influenza virions carrying each of the indicated mutations were incubated at a buffer of the indicated pH for 30 minutes, diluted into a neutral pH buffer, and then used to infect target cells. The top row shows the percent of cells infected by each virus after treatment at each pH, and the bottom row shows the titer per cell (MOI) calculated using the Poisson distribution. (C) The infectivity remaining for each mutant at the indicated pHs relative to pH 6.9 (as calculated from the bottom row of panel (B) versus the stability effects of the mutation measured in the deep mutational scanning. The Pearson correlation is indicated in the upper-left of each plot. The data underlying panels B and C can be found in S4 Data.

    (EPS)

    pbio.3002916.s005.eps (3.5MB, eps)
    S6 Fig. Measuring escape from sera neutralization.

    (A) Workflow for measuring escape from polyclonal serum using the pseudovirus libraries. The libraries are combined with a nonneutralizable standard virus that is not affected by the serum antibodies (in this case, VSV-G pseudovirus was used for mice sera and RdPro pseudovirus for ferret sera) and then incubated with increasing concentrations of sera and infected into 293T cells. Neutralization is quantified by comparing the frequency of each viral mutant barcode to that of the nonneutralizable standard, as described previously [18]. (B) All mouse sera came from mice that were vaccinated and boosted with recombinant HA protein that exactly matched the parental strain used in the deep mutational scanning. Ferret sera came from animals that were infected with 2.3.4.4b clade viruses or infected and then boosted with matched inactivated virus. For 2 of the ferrets, the immunizing strain differed by a single amino-acid mutation from the parental strain used in the deep mutational scanning.

    (EPS)

    pbio.3002916.s006.eps (3.2MB, eps)
    S7 Fig. Effects of mutations on escape from neutralization for sera from individual mice and ferrets.

    Total escape caused by mutations at each site in HA for individual mice and ferret sera used in this study. There is some animal-to-animal variation even within species: For instance, the sera of most mice is primarily escaped by mutations in antigenic regions A and B, but the serum from mouse-1-01 is primarily escaped by mutations in antigenic region E. See https://dms-vep.org/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/escape.html for interactive plots. The data underlying this figure can be found in S5 Data.

    (EPS)

    pbio.3002916.s007.eps (29.7MB, eps)
    S8 Fig. Neutralization of pseudoviruses versus conditionally replicative influenza viruses.

    Comparison between mouse sera neutralization of A/AmericanWigeon/SouthCarolina/USDA-000345-001/2021 pseudovirus and conditionally replicative influenza virus with the same HA. Note that the pseudoviruses (pseudotyped lentiviral particles) are more sensitive to neutralization than the actual influenza virions. The data underlying this figure can be found in S11 Data.

    (EPS)

    pbio.3002916.s008.eps (863KB, eps)
    S9 Fig. Mouse sera neutralization and α2-6-linked sialic acid binding increase for circulating 2.3.4.4b strains.

    Trees similar to those in Fig 6 but showing different phenotypes, namely, (A) mouse sera escape and (B) increase in α2-6-linked sialic acid binding. (B) shows sequences starting from 2019, whereas (A) only shows sequences starting from 2020. The tree file can be found in S1 Data. (C) HAs for selected strains carrying mutations that caused strong escape from sera neutralization in the deep mutational scanning were assayed for neutralization by the mouse sera in influenza-based neutralization assays (S10B Fig). These scatter plots show the neutralization measured in the influenza virus assays (y-axis) versus the predicted escape calculated as the simple sum of the mutations in each HA relative to the strain used in the deep mutational scanning. The data underlying this figure can be found in S12 Data. The full list of mutations in each strain is provided in S10D Fig.

    (EPS)

    pbio.3002916.s009.eps (11.1MB, eps)
    S10 Fig. Sera neutralization and stability of circulating 2.3.4.4b strains.

    (A) Ferret and (B) mouse sera neutralization curves for 2.3.4.4b clade strains measured using conditionally replicative influenza virus. These are the neutralization curves that generated the data plotted in Figs 6C and S9C. The data underlying panels (A) and (B) can be found in S7 and S12 Datas, respectively. (C) same as Fig 6D but with infectivity measured at pH 5.3 and 5.5. (D) A list of HA amino-acid mutations (H3 numbering) in circulating 2.3.4.4b strains used for validation relative to the A/AmericanWigeon/SouthCarolina/USDA-000345-001/2021 strain used for the deep mutational scanning. The data underlying this figure can be found in S8 Data.

    (EPS)

    pbio.3002916.s010.eps (11.1MB, eps)
    S1 Data. Tree file for H5Nx phylogenetic tree.

    (JSON)

    pbio.3002916.s011.json (7.8MB, json)
    S2 Data. HA mutation effects on cell entry. Mutation functional effects measured on 293T, 293, α2–3, and α2–6 cells for 2 independent libraries.

    (XLSX)

    pbio.3002916.s012.xlsx (3.1MB, xlsx)
    S3 Data. Mutation effects on HA stability. Mutation effects on pH-based HA stability measured for 2 independent libraries and repeated different dates.

    (XLSX)

    pbio.3002916.s013.xlsx (702.4KB, xlsx)
    S4 Data. Raw values for conditionally replicative virus infectivity at different pHs.

    (XLSX)

    pbio.3002916.s014.xlsx (29.8KB, xlsx)
    S5 Data. Effects of mutations to HA on sera escape. Serum escape values for 2 independent libraries measured for different animal sera.

    (XLSX)

    pbio.3002916.s015.xlsx (5.8MB, xlsx)
    S6 Data. Mouse serum neutralization for conditionally replicative influenza virus.

    Fraction infectivity values used to calculate mice sera neutralization values for different conditionally replicative influenza virus variants.

    (CSV)

    pbio.3002916.s016.csv (19.5KB, csv)
    S7 Data. Ferret serum neutralization for different influenza strains.

    Fraction infectivity values used to calculate ferret sera neutralization for conditionally replicative influenza virus with HA from different H5 strains.

    (CSV)

    pbio.3002916.s017.csv (11.4KB, csv)
    S8 Data. Fraction infectivity retained for conditionally replicative influenza virus at different pHs.

    Fraction infectivity retained at different pH values measured for conditionally replicative influenza virus with HA from different virus strains.

    (CSV)

    S9 Data. Table linking library barcodes with mutations in HA for 2 independent libraries.

    (CSV)

    pbio.3002916.s019.csv (3.7MB, csv)
    S10 Data. Conditionally replicative virus titers and deep mutational scanning measured functional effects.

    Conditionally replicative virus variant titers measured on MDCK-SIAT1-TMPRSS2-PB1 and 293T-PB1 cocultures compared to mutation cell entry effects measured using deep mutational scanning.

    (CSV)

    pbio.3002916.s020.csv (352B, csv)
    S11 Data. Comparison between neutralization of conditionally replicative influenza virus and pseudovirus.

    Fraction infectivity retained at different mouse serum concentrations for conditionally replicative influenza virus and H5 HA pseudovirus.

    (CSV)

    pbio.3002916.s021.csv (11KB, csv)
    S12 Data. Mouse serum neutralization for different influenza strains.

    Fraction infectivity values used to calculate mice sera neutralization for conditionally replicative influenza virus with HA from different H5 strains.

    (CSV)

    pbio.3002916.s022.csv (20.4KB, csv)
    S13 Data. GISAID accession numbers used to build H5Nx phylogenetic tree.

    (CSV)

    pbio.3002916.s023.csv (763KB, csv)

    Data Availability Statement

    The data reported in this paper are available in both interactive plots and spreadsheets of numerical values. See https://dms-vep.org/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS for a webpage that hosts interactive plots that allow easy visualization and querying of the data. We recommend readers initially examine the summary page (https://dms-vep.org/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/summary.html), and then query the additional more detailed plots provided for individual phenotypes as needed. Note that the plots provided for individual phenotypes on the homepage show the measurement for each individual replicate library when you mouse over the heatmaps, which can provide a useful way to assess the reproducibility associated with any specific measurement. See https://github.com/dms-vep/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS/blob/main/results/summaries/phenotypes.csv for the numerical values of the measurements with our recommended filtering for only high-confidence measurements; the main interactive homepage mentioned above also contains links to additional CSV files with less filtered values and per-replicate measurements (although be sure you understand the filters before using those). The data can be viewed interactively in the context of the H5 HA protein structure using dms-viz at https://dms-viz.github.io/v0/?data=https%3A%2F%2Fraw.githubusercontent.com%2Fdms-vep%2FFlu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS%2Fmain%2Fresults%2Fdms-viz%2Fdms-viz.json See https://github.com/dms-vep/Flu_H5_American-Wigeon_South-Carolina_2021-H5N1_DMS for a GitHub repository that contains all of the computer code used in the analysis. This repository has been archived on Zenodo under DOI: 10.5281/zenodo.13993815. The repository provides instructions on how to run the pipeline either by starting with raw fastq files, which can be downloaded from BioProject PRJNA1123200, or from already pre-build barcode count files. The results directory in the GitHub repository contains all the raw data values generated by the pipeline. The data used to generate the figures in this manuscript is also provided as Supporting Data. Most of the analysis was performed using dms-vep-pipeline-3 (https://github.com/dms-vep/dms-vep-pipeline-3), version 3.13.0. Note that rerunning the pipeline on different computers may give slightly different numbers due to platform-specific differences in floating point precision. See https://nextstrain.org/groups/moncla-lab/h5nx/h5-dms/usa-clade-2344 b for interactive Nextstrain trees with the deep mutational phenotypes used to color the nodes; you can use the dropdown boxes at left to select which phenotype to use to color the nodes (the Color By box) or what set of sequences to show (eg, all-clades, clade-2344b, or usa-clade-2344b). The parental sequence used for the deep mutational scanning is marked with an “x”, and the estimated phenotypes of other sequences are expected to become less accurate the more diverged those sequences are from this parental sequence. Code for annotating DMS data onto phylogenies is available at https://github.com/moncla-lab/annotate-dms.


    Articles from PLOS Biology are provided here courtesy of PLOS

    RESOURCES