Skip to main content
eLife logoLink to eLife
. 2018 Sep 6;7:e38795. doi: 10.7554/eLife.38795

Enhanced ER proteostasis and temperature differentially impact the mutational tolerance of influenza hemagglutinin

Angela M Phillips 1, Michael B Doud 2,3, Luna O Gonzalez 4, Vincent L Butty 5, Yu-Shan Lin 6, Jesse D Bloom 2,3, Matthew D Shoulders 1,
Editors: Franz-Ulrich Hartl7, Wenhui Li8
PMCID: PMC6172027  PMID: 30188321

Abstract

We systematically and quantitatively evaluate whether endoplasmic reticulum (ER) proteostasis factors impact the mutational tolerance of secretory pathway proteins. We focus on influenza hemaggluttinin (HA), a viral membrane protein that folds in the host’s ER via a complex pathway. By integrating chemical methods to modulate ER proteostasis with deep mutational scanning to assess mutational tolerance, we discover that upregulation of ER proteostasis factors broadly enhances HA mutational tolerance across diverse structural elements. Remarkably, this proteostasis network-enhanced mutational tolerance occurs at the same sites where mutational tolerance is most reduced by propagation at fever-like temperature. These findings have important implications for influenza evolution, because influenza immune escape is contingent on HA possessing sufficient mutational tolerance to evade antibodies while maintaining the capacity to fold and function. More broadly, this work provides the first experimental evidence that ER proteostasis mechanisms define the mutational tolerance and, therefore, the evolution of secretory pathway proteins.

Research organism: Human, Virus

Introduction

Protein evolution is necessarily constrained by the inherent biophysical properties of polypeptide sequences (Wylie and Shakhnovich, 2011; Wyganowski et al., 2013; Bloom and Glassman, 2009; DePristo et al., 2005; Tokuriki and Tawfik, 2009a; Gong et al., 2013). Gene variants that encode proteins unable to properly fold are selected against (Figure 1A) (Powers et al., 2009), irrespective of any theoretically beneficial new function the encoded protein may otherwise have acquired (Wylie and Shakhnovich, 2011; Wyganowski et al., 2013; Bloom and Glassman, 2009; DePristo et al., 2005; Tokuriki and Tawfik, 2009a; Gong et al., 2013). Importantly, the consequences of amino acid substitutions for protein folding depend not just on inherent biophysical properties, but also on the environment in which the nascent chain attempts to fold. In cells, this environment is defined most importantly by the proteostasis network, which consists of a subcellular compartment-specific array of chaperones and quality control factors (Balch et al., 2008; Wong et al., 2018; Hartl et al., 2011; Shoulders et al., 2013). These proteostasis factors assist client proteins in navigating complex folding landscapes (Wyganowski et al., 2013; Tokuriki and Tawfik, 2009b), and are thus theoretically well-positioned to modulate the mutational landscape accessible to evolving proteins (Figure 1B).

Figure 1. The proteostasis boundary model for protein evolution.

Figure 1.

(A) The accessibility of protein evolutionary intermediates is defined by protein stability, folding rate, and misfolding rate. Protein variants inside the boundary are accessible (shaded in black), and those outside the boundary are inaccessible (shaded in red). Figure generated using modified Mathematica macro from Powers et al (Powers et al., 2009). (B) In a biophysically permissive environment, such as cells with elevated levels of proteostasis factors, the boundary may be shifted, making otherwise non-functional variants accessible.

The proteostasis network consists of ~1000 factors involved in client protein folding, modification, trafficking, and quality control. Yet, to date, virtually all studies experimentally examining evolutionary consequences of eukaryotic proteostasis network factors have focused on a single chaperone – HSP90 (Geller et al., 2012; Rohner et al., 2013; Queitsch et al., 2002; Cowen and Lindquist, 2005; Geller et al., 2018; Sangster et al., 2008; Geiler-Samerotte et al., 2016). These focused studies have shown that HSP90 can (1) buffer the phenotypic effects of standing genetic diversity (Rohner et al., 2013; Queitsch et al., 2002; Sangster et al., 2008; Geiler-Samerotte et al., 2016) and (2) potentiate the phenotypic effects of new mutations (Cowen and Lindquist, 2005; Geller et al., 2018; Geiler-Samerotte et al., 2016), perhaps by facilitating protein folding and minimizing aggregation (Geller et al., 2018). Looking beyond HSP90, we recently leveraged chemical biology tools to demonstrate that other cytosolic proteostasis factors can also modulate protein evolutionary trajectories and mutational tolerance (Phillips et al., 2017).

The potentially critical impact of the extensive endoplasmic reticulum (ER) proteostasis machinery on the mutational tolerance and evolution of secretory pathway client proteins has remained unexplored. This gap in knowledge is important because roughly 1/3 of the proteome folds in the ER (Wong et al., 2018). Moreover, thousands of mutations in ER client proteins cause hundreds of currently incurable diseases, ranging from neurodegenerative pathologies to the lysosomal storage disorders and the collagenopathies (Balch et al., 2008). Additionally, viral membrane proteins often fold in the ER and must rapidly evolve to escape host immune factors (Chen et al., 1995; Daniels et al., 2003; Doud et al., 2017). Thus, understanding how ER chaperones influence the ability of client proteins to tolerate mutations and adapt to changing environments is essential as we work to design effective therapies for these diseases.

Influenza hemagglutinin (HA) is a uniquely attractive model protein for systematically and quantitatively evaluating whether and how ER proteostasis mechanisms impact client protein mutational tolerance. The folding of HA is perhaps as well-delineated as for any other membrane protein, and HA interacts extensively with components of the host cell’s ER proteostasis network (Chen et al., 1995; Daniels et al., 2003; Sauter et al., 1992; Nakajima et al., 1986; Frabutt et al., 2018; Hurtley et al., 1989; Hebert et al., 1997; Ueda and Sugiura, 1984; Skehel and Wiley, 2000; Klein et al., 2018; Gamblin et al., 2004; Pankow et al., 2015). Co-translationally, HA is heavily N-glycosylated and engages the ER’s lectin chaperones, calnexin and calreticulin, which increase HA’s folding efficiency and prevent misfolding (Chen et al., 1995; Daniels et al., 2003; Hebert et al., 1997). During the folding process, HA also interacts with protein disulfide isomerases to form its six disulfide bonds, as well as with binding immunoglobulin protein (BiP) and glucose-regulated protein 94 (GRP94), which are the ER homologs of HSP70 and HSP90, respectively (Daniels et al., 2003; Hurtley et al., 1989). Together, these host ER chaperones and folding enzymes assist HA monomer folding and prevent premature trimerization, ensuring that HA reaches its functional conformation or otherwise is targeted for ER-associated degradation (Balch et al., 2008; Shoulders et al., 2013; Frabutt et al., 2018).

Amino acid substitutions that impair HA stability, folding, or trafficking reduce HA fitness (Bloom and Glassman, 2009; Hebert et al., 1997; Klein et al., 2018). ER proteostasis factors that assist HA folding and trafficking are thus poised to modulate HA mutational tolerance (Figure 1B). Because HA variant fitness is directly coupled to viral replication via HA’s role in host cell binding and fusion (Sauter et al., 1992; Skehel and Wiley, 2000), we believed that we could apply a deep mutational scanning strategy (Fowler and Fields, 2014) to quantitatively evaluate how ER proteostasis impacts the fitness of all viable single amino acid HA variants. Specifically, we aimed to quantify HA variant fitness by sequencing HA mutant viral libraries after selection in modified ER proteostasis environments, an approach that is substantially more sensitive and higher-throughput than the alternative functional assays employed for deep mutational scanning of non-viral secretory pathway clients (Fowler and Fields, 2014).

In summary, HA is an attractive model secretory protein because HA folding is well-characterized, HA interacts extensively with host ER proteostasis factors, and HA mutational tolerance can be comprehensively quantified by pairing deep mutational scanning with sequencing. Moreover, understanding factors that both constrain and enhance HA mutational tolerance has direct therapeutic relevance. HA is the primary target of neutralizing antibodies (Doud et al., 2017), and influenza immune escape is thus contingent on HA possessing sufficient mutational tolerance to acquire antibody resistance while still maintaining the capacity to fold and function. The potent impact of host chaperones on viral evolutionary trajectories has only recently been realized (Phillips et al., 2017), and it remains unknown whether, and to what degree, host proteostasis mechanisms impact viral protein mutational tolerance.

Below, we integrate deep mutational scanning with small molecule control of ER proteostasis environments to test the hypothesis that upregulating host ER proteostasis factors mitigates the fitness consequences of otherwise biophysically deleterious amino acid substitutions in HA (Figure 1B). We examine HA mutational tolerance in cells with basal versus upregulated levels of ER proteostasis factors, both at normal body temperature and at an elevated, fever-like temperature. We find that upregulation of the ER proteostasis machinery broadly enhances HA mutational tolerance, while increased temperature generally reduces HA mutational tolerance. Remarkably, the same HA sites where variants are most commonly temperature-sensitive, and therefore most likely to be biophysically problematic, are also the sites where variants benefit most from upregulation of the host’s ER proteostasis machinery. Thus, enhanced host ER proteostasis mechanisms and increased temperature modulate HA mutational tolerance in opposite directions, a finding with important implications for influenza evolution. More broadly, this work provides the first experimental evidence that the composition of the ER proteostasis network can profoundly impact the mutational tolerance and evolution of secretory pathway clients.

Results

Modulating ER proteostasis during influenza infection

The composition of the ER proteostasis network is regulated by a stress-responsive signaling pathway termed the unfolded protein response (UPR). Under ER stress conditions, the UPR is activated and the IRE1 and ATF6 signaling cascades upregulate distinct but overlapping sets of ER proteostasis network components via the XBP1s and ATF6f transcription factors, respectively (Shoulders et al., 2013). Here, we profiled HA mutational tolerance at both a normal (37˚C) and a fever-like (39˚C) temperature in distinctive host ER proteostasis environments. ER proteostasis was modulated either by inducing XBP1s alone or by inducing XBP1s in concert with ATF6f in a small molecule-mediated, stress-independent manner in HEK293ATF6f/XBP1s cells using previously characterized chemical genetic methods (Shoulders et al., 2013). In these cells, activation of tetracycline repressor-regulated XBP1s by the small molecule doxycycline induces the transcriptional response downstream of IRE1, remodeling the ER proteostasis environment by enhancing transcript levels of ~180 ER chaperones and quality control factors (Wong et al., 2018; Shoulders et al., 2013). Similarly, induction of destabilized domain-regulated ATF6f by the small molecule trimethoprim in HEK293ATF6f/XBP1s cells increases transcript expression of ~40 ER chaperones and quality control factors that are distinct from but partially overlapping with those regulated by XBP1s (Wong et al., 2018; Shoulders et al., 2013). Concomitant induction of ATF6f and XBP1s creates a third, distinct ER proteostasis environment, increasing expression of ~350 genes (Shoulders et al., 2013). We specifically assessed HA mutational tolerance in a basal ER proteostasis environment and in the two most significantly remodeled proteostasis environments, activating XBP1s alone or simultaneously activating XBP1s and ATF6f.

The media composition and treatment regimen for influenza propagation differed from the conditions used in the original characterization of XBP1s and ATF6f/XBP1s activation in these cells, which also was performed only at 37˚C (Shoulders et al., 2013). Hence, before quantifying HA mutational tolerance in these distinctive cellular settings, we first comprehensively characterized each environment in the absence of an influenza infection using RNA-seq. We examined changes in ER and cytosolic proteostasis factor transcript expression (Figure 2), as well as global transcriptional changes to assess any off-target transcriptional effects (Figure 2—figure supplement 1). Essentially all transcripts upregulated in the original characterization of these cells (Shoulders et al., 2013) were also upregulated in response to our modified treatment regimen (Figure 2—figure supplement 2). Likely owing to the increased sensitivity of RNA-Seq compared to whole-genome arrays (Marioni et al., 2008), we also observed upregulation of 201 transcripts upon XBP1s induction and 268 transcripts upon ATF6f/XBP1s induction that were not observed to be significantly altered in the previous analysis. Still, as expected (Shoulders et al., 2013), transcripts upregulated upon XBP1s and ATF6f/XBP1s induction largely corresponded to secretory pathway and ER stress response genes (Figure 2—figure supplement 1 and Figure 2—source data 2).

Figure 2. Induction of XBP1s and ATF6f/XBP1s is selective and does not globally induce stress responses.

(A) Heat map of selected transcript-level effects of upregulating ER folding machinery in HEK293ATF6f/XBP1s cells via XBP1s or ATF6f/XBP1s at 37˚C, relative to a basal environment at 37˚C. (B) Heat map of selected transcript-level effects of upregulating ER proteostasis machinery via XBP1s or ATF6f/XBP1s at 39˚C, relative to a basal environment at 39˚C. (C) Heat map of selected transcript-level effects of increasing the temperature from 37˚C to 39˚C in a basal ER proteostasis environment for 24 hr. Genes in A–C grouped by ER (top) or cytosolic (bottom) proteostasis networks, and also based on functional classifications (Shoulders et al., 2013). For (A–B), tetracycline repressor-regulated XBP1s was induced by treatment with 0.1 μg/mL doxycycline (24 hr); destabilized domain-regulated ATF6f was induced by treatment with 1 μM trimethoprim (24 hr). Figure 2—figure supplement 1. Full transcriptome analysis confirmed selectivity of XBP1s and ATF6f/XBP1s induction. Figure 2—figure supplement 2. Comparison of HEKATF6f/XBP1s RNA-Seq characterization to previous whole-genome array characterization. Figure 2—figure supplement 3. Methods to induce XBP1s and ATF6f/XBP1s did not cause cytotoxicity and were functional during influenza infection. Figure 2—source data 1. Complete RNAseq differential expression analysis. Figure 2—source data 2. Comparison of HEKATF6f/XBP1s characterization to previous characterization. Figure 2—source data 3. Fold-change of HSR and UPR targets during influenza infection.

Figure 2—source data 1. Complete RNAseq differential expression analysis.
DOI: 10.7554/eLife.38795.007
Figure 2—source data 2. Comparison of HEKATF6f/XBP1s characterization to previous characterization.
Genes are tabulated as upregulated in the current study (RNAseq), the previous study (Array) alone (Shoulders et al., 2013), or upregulated in both studies, as illustrated in Figure 2—figure supplement 2. For each gene subset, the corresponding GO group enrichment analysis is provided.
DOI: 10.7554/eLife.38795.008
Figure 2—source data 3. Fold change of HSR and UPR targets during influenza infection.
DOI: 10.7554/eLife.38795.009

Figure 2.

Figure 2—figure supplement 1. Full transcriptome analysis confirmed selectivity of XBP1s and ATF6f/XBP1s induction.

Figure 2—figure supplement 1.

Volcano plots show change in transcript expression upon activating XBP1s alone or with ATF6f at 37 (A–B) and 39˚C (C–D) in HEK293ATF6f/XBP1s cells. (E) Change in transcript expression upon increasing temperature in basal cells. In (A–E), transcripts displaying >2 fold change in expression with p-values<0.00001 are plotted in red, with select outliers labeled. Figure 2—source data 1. Complete RNAseq differential expression analysis.
Figure 2—figure supplement 2. Comparison of HEKATF6f/XBP1s RNA-Seq characterization to previous whole-genome array characterization.

Figure 2—figure supplement 2.

Number of upregulated genes in current study represented in red (as determined by RNA-Seq following 24 hr treatment with 0.1 μg/mL doxycycline and 1 μM trimethoprim in low serum media); number of upregulated genes observed in previous study represented in blue (as determined by whole-genome array following 12 hr treatment with 1 μg/mL doxycycline and 10 μM trimethoprim in complete media [Shoulders et al., 2013]). For this comparison, genes were considered upregulated if they increased in expression by more than 2-fold, using a FDR of 0.05. This filtering is more stringent than that reported in Shoulders et al (Shoulders et al., 2013), but was necessary to compare data generated from RNA-Seq and a whole genome array. Figure 2—source data 2. Comparison of HEKATF6f/XBP1s characterization to previous characterization.
Figure 2—figure supplement 3. Methods to induce XBP1s and ATF6f/XBP1s are not cytotoxic and are functional during influenza infection.

Figure 2—figure supplement 3.

(A) Induction of XBP1s or ATF6f/XBP1s and increased temperature (72 hr) does not alter metabolic activity in the absence of influenza, as measured by a resazurin assay. (B) Wild-type WSN/1933 influenza titers (determined by TCID50 assay) from infections in modulated cellular environments (MOI = 0.01; 48 hr post-infection). For A–B, the average of biological triplicates is plotted with error bars representing the SEM. (C) Fold change of heat shock response (HSR) and unfolded protein response (UPR) gene transcripts in modulated ER proteostasis environments with and without influenza (MOI = 1, 8 hr post-infection), relative to a mock-infected basal environment at 37˚C. UPR gene transcripts are designated as ATF6f or XBP1s targets based on prior studies (Shoulders et al., 2013). Positive controls for activating the UPR (treatment with the N-glycosylation inhibitor tunicamycin) and the heat shock response (HSR) (treatment with arsenite to induce oxidative stress) are shown. Figure 2—source data 3. Fold-change of HSR and UPR targets during influenza infection.

Specifically, we found that XBP1s and ATF6f/XBP1s activation in HEK293ATF6f/XBP1s cells selectively enhanced transcript expression of numerous ER proteostasis factors, including many known to interact with HA, such as calnexin (CANX) (Chen et al., 1995; Daniels et al., 2003; Hebert et al., 1997), calreticulin (CALR) (Chen et al., 1995; Daniels et al., 2003; Hebert et al., 1997), and members of the protein disulfide isomerase family (PDI) (Figure 2A, top) (Daniels et al., 2003). Concomitant induction of XBP1s and ATF6f revealed increased expression of ATF6f and ATF6f/XBP1s targets, including BiP (HSPA5) and GRP94 (HSP90B1), respectively (Figure 2A, top). Stress-independent activation of XBP1s or ATF6f/XBP1s did not significantly impact transcript levels of cytosolic protein folding factors at either 37˚C or 39˚C, highlighting the selectivity of this approach for modulating ER proteostasis (Figure 2A–B, bottom; Figure 2—figure supplement 1A–D). Increasing the host cell temperature to 39˚C induced significantly fewer transcriptional changes than did activating XBP1s or ATF6f/XBP1s. Increased temperature did result in mild induction of the heat shock response, as expected (illustrated by elevated HSP70 (HSPA1A) levels in Figure 2C and Figure 2—figure supplement 1E).

We next assessed whether induction of the host cell’s ER proteostasis network and increased temperature altered host cell viability or prevented influenza propagation. First, we activated XBP1s alone or with ATF6f at both 37˚C and 39˚C in the absence of influenza, and evaluated cell viability 72 hr later, corresponding to the duration of our intended drug pre-treatment and influenza infection. Consistent with previous studies (Shoulders et al., 2013), these perturbations were not cytotoxic (Figure 2—figure supplement 3A). Next, we evaluated whether influenza could propagate in these remodeled host cells. Briefly, we infected cells in each pre-activated selection environment with wild-type influenza A/WSN/1933 (the strain used throughout this work; hereafter referred to as influenza) and titered the viral supernatant 48 hr post-infection using a TCID50 assay. Influenza growth was moderately attenuated at increased temperature (Figure 2—figure supplement 3B), but was still sufficient for performing selections and evaluating HA mutational tolerance by sequencing.

Finally, we evaluated whether our methods for perturbing ER proteostasis were functional during an influenza infection cycle. Using qPCR, we found that XBP1s and ATF6f target genes were still inducible by our chemical genetic methods in the presence of influenza, with only slightly attenuated upregulation relative to a mock-infection (Figure 2—figure supplement 3C). This modest reduction in chaperone transcript expression was likely caused by global suppression of host transcription by influenza (Russell et al., 2018). However, overall our tools for modulating ER proteostasis factors maintained efficacy during the course of an influenza infection.

Deep mutational scanning of HA in modulated ER proteostasis environments

To systematically profile HA mutational tolerance in modulated ER proteostasis environments, we employed deep mutational scanning (DMS), a method that couples saturating mutagenesis with fitness measurements, thereby efficiently sampling amino acid sequence space (Doud et al., 2017; Fowler and Fields, 2014; Doud and Bloom, 2016). We used biological triplicate HA viral mutant libraries that were previously generated from independent HA plasmid libraries using cells with basal levels of proteostasis factors at 37˚C (Doud and Bloom, 2016). The resulting HA mutant viral libraries contained nearly all viable single amino acid HA variants (Doud and Bloom, 2016). Variant sensitivity to ER proteostasis mechanisms was tested by simultaneously competing variants within each viral library in host cells with modulated ER proteostasis environments. Each of these competitions was carried out at both 37˚C and 39˚C, to impose additional biophysical selection pressure on HA and to mimic elevated host body temperature during an influenza infection (Figure 3A).

Figure 3. Deep mutational scanning of HA in modulated ER proteostasis environments.

Figure 3.

(A) Scheme for performing deep mutational scanning, and (B) quantifying the differential selection (diffsel) for each mutant in the selection environments, which can be visualized on sequence logo plots. In A, XBP1s or ATF6f/XBP1s were induced at either 37˚C or 39˚C for 16 hr prior to the selection, which consisted of a 48 hr infection. In B, fWT and fmut denote the wild-type and mutant frequencies, respectively. (C) Percentage of amino acid space sampled by each replicate library prior to filtering, and the percentage sampled by all three replicate mutant libraries. Figure 3—source data 1. Correlation coefficients for unfiltered absolute site differential selection values between biological replicate selections.

Figure 3—source data 1. Correlation coefficients for unfiltered absolute site differential selection values between biological replicate selections.
N = 565.
DOI: 10.7554/eLife.38795.011

We designed these DMS experiments to fully sample the library diversity whilst minimizing co-infection and genetic hitchhiking. To achieve this goal, we infected cells in each pre-activated selection environment with 106 virions at an MOI of 0.01 infectious virion/cell. Following a 48 hr infection, or about four replication cycles, we harvested viral RNA and prepared subamplicon sequencing libraries for accurate variant frequency quantitation (Doud and Bloom, 2016). We then quantified the differential selection (diffsel) for each mutation as the logarithm of its enrichment in the selection condition relative to that of the mock-selection condition (e.g., Basal 39˚C relative to Basal 37˚C) (Doud et al., 2017; Bloom, 2015). Hence, the wild-type residue at each site has a diffsel of zero, variants with positive diffsel values are more fit in the selection condition than the mock condition relative to wild-type, and variants with negative diffsel values are less fit in the selection condition than the mock condition relative to wild-type. The diffsel for variants across HA can be visualized with sequence logo plots, where the size of the amino acid letter abbreviations is proportional to the magnitude of the differential selection for that variant (Figure 3B) (Doud et al., 2017).

To decipher selection from the inherent experimental noise of batch competitions, we filtered the DMS data to include only variants with robust differential selection across biological triplicates. Specifically, we limited our analyses to variants present in each pre-selection replicate viral library (Figure 3C—Triplicate mutant libraries) that exhibited differential selection in the same direction across biological triplicates. These criteria reduced experimental noise owing to differences in replicate library composition, as well as uneven sampling and biased PCR amplification of low frequency variants. This filtering predominantly excluded variants that were negligibly affected by the selection, which often had slightly positive diffsel values in one replicate versus slightly negative diffsel values in another replicate. These criteria also excluded strongly enriched or depleted variants that were not present in triplicate pre-selection libraries and, less frequently, strongly enriched or depleted variants that did not behave reproducibly across triplicate selections. Notably, the unfiltered biological replicate diffsel values were strongly correlated across HA sites, with correlation coefficients ranging from R = 0.67–0.78 (Figure 3—source data 1 and Source data 1).

The filtered DMS data revealed that many HA variants were substantially depleted upon increasing the temperature (Figure 4A; Figure 4—figure supplement 1). In contrast, many variants were moderately enriched upon induction of ER proteostasis machinery (Figure 4B–E; Figure 4—figure supplements 25). This result was consistent with our hypothesis, as we anticipated that increasing the temperature would create a biophysically restrictive environment, reducing the fitness of HA variants, whereas increasing levels of ER proteostasis factors would create a biophysically permissive environment, enhancing the fitness of HA variants (Figure 1B).

Figure 4. HA variants were depleted at increased temperature but enriched upon XBP1s or ATF6f/XBP1s induction.

Cropped sequence logo plots show differential selection on HA at positions where variants were depleted upon increased temperature but enriched upon XBP1s and ATF6f/XBP1s induction (based on sequential numbering of WSN HA). Size of amino acid abbreviation corresponds to magnitude of selection. Only variants behaving consistently across biological triplicates are plotted; all selections are plotted on the same scale. Figure 4—figure supplement 1. Full WSN HA sequence logo plot: Basal 39˚C vs. Basal 37˚C. Figure 4—figure supplement 2. Full WSN HA sequence logo plot: XBP1s 37˚C vs. Basal 37˚C. Figure 4—figure supplement 3. Full WSN HA sequence logo plot: XBP1s 39˚C vs. Basal 39˚C. Figure 4—figure supplement 4. Full WSN HA sequence logo plot: ATF6f/XBP1s 37˚C vs. Basal 37˚C. Figure 4—figure supplement 5. Full WSN HA sequence logo plot: ATF6f/XBP1s 39˚C vs. Basal 39˚C. Source data 1. Complete analysis of deep mutational scanning data.

Figure 4.

Figure 4—figure supplement 1. Full WSN HA sequence logo plot: Basal 39˚C vs. Basal 37˚C.

Figure 4—figure supplement 1.

Logo plot displays variants that behaved consistently across three biological replicates. Variants above the line (representative of selection on wild-type) were enriched at 39˚C relative to 37˚C in a basal environment; variants below the line were depleted at 39˚C. The wild-type WSN HA residue is shown below the corresponding logo, with the sites numbered below the wild-type sequence (based on sequential numbering of WSN HA). The size of the amino acid letter abbreviation is proportional to the diffsel for that amino acid variant, and all logo plots are plotted on the same scale.
Figure 4—figure supplement 2. Full WSN HA sequence logo plot: XBP1s 37˚C vs. Basal 37˚C.

Figure 4—figure supplement 2.

Logo plot displays variants that behaved consistently across three biological replicates. Variants above the line (representative of selection on wild-type) were enriched upon XBP1s induction at 37˚C; variants below the line were depleted. The wild-type WSN HA residue is shown below the corresponding logo, with the sites numbered below the wild-type sequence (based on sequential numbering of WSN HA). The size of the amino acid letter abbreviation is proportional to the diffsel for that amino acid variant, and all logo plots are plotted on the same scale.
Figure 4—figure supplement 3. Full WSN HA sequence logo plot: XBP1s 39˚C vs. Basal 39˚C.

Figure 4—figure supplement 3.

Logo plot displays variants that behaved consistently across three biological replicates. Variants above the line (representative of selection on wild-type) were enriched upon XBP1s induction at 39˚C; variants below the line were depleted. The wild-type WSN HA residue is shown below the corresponding logo, with the sites numbered below the wild-type sequence (based on sequential numbering of WSN HA). The size of the amino acid letter abbreviation is proportional to the diffsel for that amino acid variant, and all logo plots are plotted on the same scale.
Figure 4—figure supplement 4. Full WSN HA sequence logo plot: ATF6f/XBP1s 37˚C vs. Basal 37˚C.

Figure 4—figure supplement 4.

Logo plot displays variants that behaved consistently across three biological replicates. Variants above the line (representative of selection on wild-type) were enriched upon ATF6f/XBP1s induction at 37˚C; variants below the line were depleted. The wild-type WSN HA residue is shown below the corresponding logo, with the sites numbered below the wild-type sequence (based on sequential numbering of WSN HA). The size of the amino acid letter abbreviation is proportional to the diffsel for that amino acid variant, and all logo plots are plotted on the same scale.
Figure 4—figure supplement 5. Full WSN HA sequence logo plot: ATF6f/XBP1s 39˚C vs. Basal 39˚C.

Figure 4—figure supplement 5.

Logo plot displays variants that behaved consistently across three biological replicates. Variants above the line (representative of selection on wild-type) were enriched upon ATF6f/XBP1s induction at 39˚C; variants below the line were depleted. The wild-type WSN HA residue is shown below the corresponding logo, with the sites numbered below the wild-type sequence (based on sequential numbering of WSN HA). The size of the amino acid letter abbreviation is proportional to the diffsel for that amino acid variant, and all logo plots are plotted on the same scale.

To globally examine the relative enrichment of HA variants, we plotted HA mutant diffsel values and examined the deviation of the distribution mean from zero, which indicates no differential selection (Figure 5A). This analysis confirmed that increased temperature indeed generally reduced mutant HA viral growth, resulting in depletion (i.e., diffsel < 0) of 597 variants versus enrichment (i.e., diffsel > 0) of only 361 variants, relative to wild-type HA (Figure 5A). Variant depletion at elevated temperature could be explained by reduced variant folding rates and thus slower variant viral growth, or perhaps by increased degradation. Although we do not observe upregulation of ER-associated degradation factors at elevated temperature (Figure 2C and Figure 2—figure supplement 1E), it is certainly possible that HA variants engage degradation factors more at an elevated temperature.

Figure 5. ER proteostasis mechanisms and temperature divergently impact HA variant viral growth.

(A) Average differential selection (diffsel) for each HA variant; the black line designates the mean of the distribution. Significance of deviation of the mean from zero (no differential selection) was tested by a one-sample t-test; *** designates two-tailed p-values≤0.001. Variant diffsel values are staggered about the x-axis to minimize overlap. The number of variants with diffsel >0 and <0 is listed below each distribution. (B) Correlation for diffsel values for inducing XBP1s at 39˚C versus increasing the temperature in basal cells (N = 271). (C) Correlation for diffsel values for inducing ATF6f/XBP1s at 39˚C versus increasing the temperature in basal cells (N = 320). For (A–C), average diffsel values for variants behaving consistently across biological triplicates are plotted. For (B–C), Pearson correlation and p-value from an F-test are shown. Figure 5—figure supplement 1. Induction of ATF6f/XBP1s and XBP1s similarly impact HA variant viral growth. Figure 5—source data 1. Differential selection values for each selection, pre- and post-filtering.

Figure 5—source data 1. Differential selection values for each selection, pre- and post-filtering.
DOI: 10.7554/eLife.38795.020

Figure 5.

Figure 5—figure supplement 1. Induction of ATF6f/XBP1s and XBP1s similarly impact HA variant viral growth.

Figure 5—figure supplement 1.

Correlation for diffsel values for inducing XBP1s at 39˚C versus inducing ATF6f/XBP1s at 39˚C (N = 261). Average diffsel values for variants behaving consistently across biological triplicates are plotted; Pearson correlation and p-value from an F-test are shown.

In contrast, induction of XBP1s alone or with ATF6f generally enhanced mutant HA viral growth at both 37˚C and 39˚C, resulting in an overall enrichment of HA variants relative to wild-type HA (Figure 5A). The enhanced variant growth upon XBP1s induction alone was quite similar to that of simultaneous induction of XBP1s and ATF6f (Figure 5—figure supplement 1 and Figure 5—source data 1), and is thus either predominantly caused by XBP1s, or by XBP1s-regulated factors that are redundant with those regulated by ATF6f. In either case, variant enrichment upon induction of XBP1s or ATF6f/XBP1s is likely explained by upregulation of ER protein folding factors (Figure 2A–B), while the lesser amount of variant depletion observed upon induction of XBP1s or ATF6f/XBP1s may be attributed to upregulation of ER quality control and degradation factors (Figure 2A–B). Importantly, we note that the enrichment and depletion of HA variants may also be caused by indirect effects of inducing XBP1s or ATF6f/XBP1s and increasing temperature. For example, these perturbations could potentially impact the protein levels and activities of other host proteins that engage HA, or perhaps differentially modulate viral growth kinetics.

Regardless of their precise origins, the generally opposing effects of increased temperature and upregulated proteostasis machinery on mutant HA viral growth motivated examination of these effects on individual HA variants. To this end, we determined the correlation between the average diffsel values for increased temperature and upregulated ER proteostasis machinery for individual HA variants (Figure 5B–C). This analysis revealed significant negative correlation between these selection conditions, demonstrating that the HA variants most depleted upon increased temperature corresponded to those most enriched upon induction of ER proteostasis factors.

Examination of variants and sites impacted most by host proteostasis and temperature

The opposing fitness effects of proteostasis mechanisms and temperature in the DMS data suggested that temperature-sensitive HA variants were most likely to be rescued by ER proteostasis factors. The potential implications of these opposing selection forces on HA motivated us to more rigorously evaluate this behavior for individual HA variants. DMS batch competitions are inherently noisy, largely owing to uneven sampling of library variants (Doud and Bloom, 2016). Hence, we performed pairwise competitions between wild-type influenza and influenza encoding differentially selected variants identified in the DMS. These competitions were performed in cells with either basal or elevated levels of ER proteostasis factors, accessed by inducing XBP1s alone or in combination with ATF6f, and at 37˚C and 39˚C (Figure 6A). We completed these pairwise competitions in biological triplicate for nine variants that in the DMS exhibited the most divergent behavior upon induction of XBP1s or ATF6f/XBP1s compared to increased temperature (A90D, K136N, S153E, K169G, P175G, K176F, P228A, L456D, and L456R; based on sequential numbering of WSN HA sites). We also included three synonymous negative controls (P175Psyn, P228Psyn, L456Lsyn), for which we mutated the wild-type codon to a synonymous codon that was not differentially selected in the DMS batch competition.

Figure 6. Divergent fitness effects revealed by DMS assessed by pairwise competitions.

(A) Scheme for pairwise competition between wild-type HA and mutant HA influenza. Detailed description of competition conditions in Figure 6—figure supplement 1. (B) The difference between the average diffsel upon increased temperature (ΔT) and the average diffsel upon induction of XBP1s or ATF6f/XBP1s (ΔUPR) is plotted. This difference from the pairwise competitions is plotted on the y-axis, with error bars representing SEM for biological triplicates, and the corresponding difference from the DMS is plotted on the x-axis. Variants that validated (i.e., diffsel upon increased temperature was significantly different from that upon UPR induction in both DMS and pairwise competitions) are in black; variants that did not validate are in red; synonymous (neutral) validated negative controls are in gray. All non-synonymous variants are labeled. Figure 6—figure supplement 1. Individual pairwise competition diffsel values.

Figure 6.

Figure 6—figure supplement 1. Individual pairwise competition diffsel values.

Figure 6—figure supplement 1.

(A) Diffsel of validated variants in pairwise competition with wild-type influenza upon induction of XBP1s or ATF6f/XBP1s and increased temperature. Variants were considered ‘validated’ if the diffsel upon induction of XBP1s or ATF6f/XBP1s was significantly different from that upon increased temperature. (B) Diffsel of non-validated variants in pairwise competition. (C) Diffsel of synonymous mutants in pairwise competition. For (A–C), the average of biological triplicates is plotted with error bars corresponding to SEM. Asterisks comparing selection conditions represent significance evaluated by Student’s t-test, while asterisks comparing an individual selection condition to no selection (diffsel of zero) represent significance evaluated by a one-sample t-test. For all significance tests, *, **, and *** represent two-tailed p-values≤0.05, 0.01, and 0.001, respectively; ‘ns’ corresponds to a p-value>0.05. Induction of UPR conditions in green corresponds to ΔUPR in Figure 6; increased temperature conditions in blue correspond to ΔT in Figure 6.

Comparison of the differential selection upon increased temperature with that upon XBP1s or ATF6f/XBP1s induction in the pairwise competitions recapitulated the divergent behavior observed in DMS for several individual HA variants. In particular, of the nine non-synonymous variants tested, seven variants exhibited significantly different behavior upon increased temperature versus XBP1s or ATF6f/XBP1s induction (Figure 6B—black; Figure 6—figure supplement 1A). This high rate of reproducibility lends substantial credence to the overall DMS data set. We note that some discrepancy between the DMS and pairwise competitions (Figure 6B—red; Figure 6—figure supplement 1B) is expected given the difference in wild-type frequencies between the DMS and pairwise competitions and the inherent false discovery rate of high-throughput screens (Doud and Bloom, 2016). As in the DMS (Figure 4), the pairwise diffsel upon increased temperature was overall more substantial than that upon XBP1s or ATF6f/XBP1s induction, though XBP1s or ATF6f/XBP1s induction did significantly impact the growth of K136N, P228A, and L456D (Figure 6—figure supplement 1A—see asterisks above individual bars). Importantly, all of the synonymous variants displayed similarly neutral behavior in the pairwise and DMS competitions (Figure 6B—gray; Figure 6—figure supplement 1C), indicating that the opposing nature of these selection pressures is unlikely to reflect experimental noise.

Validation of the opposing effects of temperature and host ER proteostasis mechanisms on HA variant growth in the pairwise competitions prompted us to further examine the properties of HA sites most impacted by these selection environments. For HA variants present in each replicate mutant library (Figure 3C––Mutant libraries 1–3), we summed the mutation differential selection values at each site to calculate the net site differential selection (net site diffsel), a measure of site mutational tolerance, and averaged the net site diffsel values across the triplicate libraries (Source data 1). Consistent with the mutation diffsel values in Figure 5A, the net site diffsel values were negatively skewed upon increased temperature, indicating reduced mutational tolerance, but positively skewed upon induction of XBP1s, indicating enhanced mutational tolerance (Figure 7A). Next, we sorted sites based on secondary structure to determine whether sites most affected by temperature or XBP1s induction belonged to particular secondary structural elements. We found that sites distributed throughout all secondary structural elements had enhanced mutational tolerance upon XBP1s induction, and that sites in strands and loops had particularly reduced mutational tolerance upon increased temperature. Thus, these selection pressures broadly impact HA mutational tolerance (Figure 7A; secondary structure).

Figure 7. XBP1s induction and temperature impacted HA mutational tolerance at sites across HA with diverse functional and structural roles.

(A) Average net site diffsel (sum of mutation diffsel values at a given site) is plotted for increased temperature in a basal environment (blue) and upon induction of XBP1s at 39˚C (purple). Sites are sorted by secondary structure, functional/structural role, conservation in H1 sequences, and surface accessibility (buried residues are the 50% least surface-accessible sites). Select outliers are labeled. Means are represented by black lines and significance of deviation from zero (no selection) was determined by a one-sample t-test and is indicated above each distribution; *, **, and *** represent one-tailed p-values≤0.05, 0.01, and 0.001, respectively; ns corresponds to a p-value>0.05. (B) Average net site diffsel is plotted as a function of site conservation, where x = 0 corresponds to a completely conserved site among 989 non-redundant human H1 sequences (1918–2008) (Zhang et al., 2017). (C) Average net site diffsel is plotted as a function of site surface accessibility, where x = 0 corresponds to a buried residue (PDBID 1RVX [Gamblin et al., 2004]). Figure 7—figure supplement 1. ATF6f/XBP1s induction and temperature impacted HA mutational tolerance at sites across HA with diverse functional and structural roles. Figure 7—source data 1. Properties and differential selection for each site of HA.

Figure 7—source data 1. Properties and differential selection for each site of HA.
DOI: 10.7554/eLife.38795.025

Figure 7.

Figure 7—figure supplement 1. ATF6f/XBP1s induction and temperature impacted HA mutational tolerance at sites across HA with diverse functional and structural roles.

Figure 7—figure supplement 1.

(A) Average net site diffsel (sum of mutation diffsel values at a given site) is plotted for increased temperature in a basal environment (blue) and upon induction of XBP1s at 39˚C (pink). Sites are sorted by secondary structure, functional/structural role, conservation in H1 sequences, and surface accessibility (buried residues are the 50% least surface-accessible sites). Select outliers are labeled. Means are represented by black lines and significance of deviation from zero (no selection) was determined by a one-sample t-test and is indicated above each distribution; *, **, and *** represent one-tailed p-values≤0.05, 0.01, and 0.001, respectively; ns corresponds to a p-value>0.05. (B) Average net site diffsel is plotted as a function of site conservation, where x = 0 corresponds to a completely conserved site among 989 non-redundant human H1 sequences (1918–2008) (Zhang et al., 2017). (C) Average net site diffsel is plotted as a function of site surface accessibility, where x = 0 corresponds to a buried residue (PDBID 1RVX [Gamblin et al., 2004]). Figure 7—source data 1. Properties and differential selection for each site of HA.

We next plotted the net site diffsel values for sites essential for HA function and structure, to determine if these sites were impacted by temperature or ER proteostasis factors. Receptor binding sites had slightly, but significantly, enhanced mutational tolerance upon XBP1s induction, and were unaffected by increased temperature. In contrast, the mutational tolerance at antigenic sites was substantially impacted by both XBP1s induction and increased temperature (Figure 7A; functional/structural role). Moreover, sites in N-glycosylation sequons and cysteine residues involved in disulfide bonding, which engage glycoprotein chaperones and protein disulfide isomerases, respectively (Chen et al., 1995; Daniels et al., 2003; Hebert et al., 1997), had enhanced mutational tolerance upon XBP1s induction and reduced mutational tolerance upon increased temperature (Figure 7A; glyc. sequon/disulfide bond). Some of these sites—discussed in more detail below—were among the most affected by temperature and ER proteostasis factors.

Furthermore, we assessed whether ER proteostasis mechanisms and temperature differentially impact HA mutational tolerance at sites based on evolutionary conservation and surface accessibility. Specifically, we evaluated site conservation across 989 human H1 sequences from the Influenza Research Database (Zhang et al., 2017). We considered completely conserved sites as conserved and all other sites as variable, and plotted the net site diffsel values for conserved and variable sites (Figure 7A; conservation), as well as the site conservation as a function of the net site diffsel (Figure 7B). This analysis illustrated that temperature and XBP1s induction affected mutational tolerance at both variable and conserved sites (Figure 7A; conservation). However, the mutational tolerance at variable sites was more affected than at conserved sites, as indicated by the net site diffsel values for conserved sites clustering near zero, whereas the net site diffsel values for variable sites were more evenly distributed along the y-axis (Figure 7B). Still, the modest impact on conserved sites suggested that host temperature and proteostasis mechanisms could modulate mutational tolerance at essentially any site in HA, regardless of the operating structural and functional constraints. This conclusion is consistent with the distribution of net site diffsel values as a function of surface accessibility, which revealed that mutational tolerance was impacted by XBP1s induction and temperature at both buried and surface-exposed sites (Figure 7A; surface accessibility), though surface-exposed sites were often more affected (Figure 7C). We note that, similar to the mutation diffsel distributions (Figure 5A), the site diffsel distributions for ATF6f/XBP1s induction mirrored those for XBP1s induction (Figure 7—figure supplement 1).

Finally, we examined the structural location of HA sites most impacted by host proteostasis mechanisms and temperature. Mapping net site diffsel values onto the HA crystal structure revealed that these selection pressures affect the mutational tolerance of sites across HA (Figure 8; Figure 8—figure supplement 1 (ATF6f/XBP1s)). Many HA sites strongly impacted by host ER proteostasis factors and temperature were in the globular head domain (Figure 8A). For example, K170, P175, and K176 were among the most mutationally intolerant sites upon increased temperature and the most mutationally tolerant sites upon induction of XBP1s or ATF6f/XBP1s (labeled in Figure 7A and Figure 7—figure supplement 1A). These sites are on the surface of the head domain (Figure 8A), in a region where positively charged residues, such as K170 and K176, increase host receptor binding and hinder antibody escape (Hensley et al., 2009). Additionally, the P175 site was the site most affected by XBP1s and ATF6f/XBP1s activation (Figure 7A and Figure 7—figure supplement 1A). Although P175 is a trans-proline in an unstructured coil, and is therefore less impactful than if found in a region of defined HA secondary structure, its mutational tolerance upon XBP1s induction could still be mediated by prolyl isomerase activity regulated by the UPR (Shoulders et al., 2013). Furthermore, several sites in the stem domain were strongly impacted by host ER proteostasis factors and temperature. For instance, mutational tolerance at the N497 and T499 sites was considerably reduced upon XBP1s induction, ATF6f/XBP1s induction, and upon increased temperature (Figure 8 and Figure 8—figure supplement 1), likely because these residues constitute an N-linked glycosylation sequon. N-Linked glycosylation directs the co- and post-translational folding pathways of HA. Removal of this glycosylation sequon may reduce the interaction of HA with the glycoprotein chaperones calnexin and calreticulin, thereby hindering HA folding and trafficking (Daniels et al., 2003; Hebert et al., 1997). Altogether, host ER proteostasis factors and temperature impact the mutational tolerance at sites across the entire HA structure, irrespective of secondary structure and surface accessibility. Sites that are especially affected contain wild-type residues that likely mediate interactions between HA and host ER proteostasis factors.

Figure 8. Mutational tolerance at sites across the mature HA protein was impacted by XBP1s induction and increased temperature.

(A) Average net site diffsel values (which correspond to mutational tolerance) are mapped onto the HA crystal structure PDBID 1RVX (Gamblin et al., 2004), with select outliers from Figure 7 labeled. (B) Cartoon representation of A displaying secondary structural elements. In A–B, negative net site diffsel values are mapped in blue; positive net site diffsel values are mapped in red. Figure 8—figure supplement 1. Mutational tolerance at sites across the mature HA protein was impacted by ATF6f/XBP1s induction and increased temperature.

Figure 8.

Figure 8—figure supplement 1. Mutational tolerance at sites across the mature HA protein was impacted by ATF6f/XBP1s induction and increased temperature.

Figure 8—figure supplement 1.

(A) Average net site diffsel values (which correspond to mutational tolerance) are mapped onto the HA crystal structure PDBID 1RVX (Gamblin et al., 2004), with select outliers from Figure 7—figure supplement 1 labeled. (B) Cartoon representation of A displaying secondary structural elements. In A–B, negative net site diffsel values are mapped in blue; positive net site diffsel values are mapped in red.

Discussion

Our results demonstrate that host proteostasis capacity and host cell temperature critically impact mutational tolerance across the entire HA protein. Activation of the XBP1s arm of the UPR globally enhanced the mutational tolerance of HA, while increased host cell temperature globally reduced the mutational tolerance of HA. Notably, the variants that were most enriched upon XBP1s induction corresponded to those most depleted upon increased temperature. This anti-correlation indicates that temperature-sensitive variants, which are likely biophysically deleterious relative to wild-type HA, are most likely to be rescued by the ER proteostasis machinery.

These selection pressures exerted similar fitness effects across the entire HA protein, influencing sites that differ in their secondary structure, surface accessibility, conservation in natural H1 sequences, and functional and structural roles. This impact is in stark contrast to selection pressures such as immune escape or drug resistance, which typically select for variants at a few specific sites. Universally modulating HA mutational tolerance suggests that host temperature and ER proteostasis mechanisms can shift the distribution of fitness effects of mutations on HA, and therefore are likely to broadly influence the accessibility of HA evolutionary trajectories. Because we observe a particularly strong impact at antigenic sites in the HA head domain (Doud and Bloom, 2016), inhibition of ER proteostasis mechanisms may limit the accessibility of antibody escape variants. This therapeutic strategy is especially appealing because prior work has demonstrated that influenza infection can induce the unfolded protein response and that ER proteostasis mechanisms are essential for influenza propagation (Frabutt et al., 2018; Hassan et al., 2012).

Similar to previous studies on the role of HSP90 in the evolution of endogenous protein clients (Cowen and Lindquist, 2005), our observations may derive from direct interactions between host proteostasis components and HA, or as an indirect consequence of perturbing proteostasis. The negative correlation we observe between upregulating ER proteostasis factors and increasing temperature evidences the former, because variants with compromised fitness in a biophysically challenging environment are most impacted by the ER proteostasis machinery. Alternatively, ER proteostasis factors and increased temperature may affect the kinetics of infection or expression of third-party mediators that regulate HA function or fitness.

Irrespective of the precise mechanism, this study provides the first experimental evidence that ER proteostasis factors define the mutational tolerance of a secretory pathway client protein, a phenomenon that likely extends far beyond HA and viral evolution. Prior to this work, chaperones known to potentiate and buffer protein evolution were limited to cytosolic isoforms (Phillips et al., 2017), largely of HSP90 (Rohner et al., 2013; Queitsch et al., 2002; Cowen and Lindquist, 2005; Geller et al., 2018; Sangster et al., 2008; Geiler-Samerotte et al., 2016; Geller et al., 2007). Here, we show that this role extends to ER chaperones and quality control machinery involved in the folding and assembly of membrane and secretory proteins. We note that understanding the evolutionary constraints of membrane and secretory proteins is critical, as they comprise highly significant therapeutic targets, including collagen (DiChiara et al., 2016), G-protein coupled receptors (Laschet et al., 2018), antibodies (Wong et al., 2018), and viral membrane proteins (Hurtley et al., 1989). This work suggests that ER proteostasis mechanisms could be tuned to ameliorate protein-misfolding diseases or prevent the development of antiviral resistance. Further investigation into the impact of ER proteostasis factors on the mutational tolerance of additional secretory pathway client proteins, as well as non-client proteins, will elucidate the underlying molecular details and the pervasiveness of our findings.

Materials and methods

Key resources table.

Reagent type
(species) or resource
Designation Source or
reference
Additional information
strain, strain
background
(Influenza A virus)
Influenza A/WSN/1933 PMID 27271655
genetic reagent
(Influenza A virus)
Influenza A/WSN/1933
Mutant Library 1
PMID 27271655
genetic reagent
(Influenza A virus)
Influenza A/WSN/1933
Mutant Library 2
PMID 27271655
genetic reagent
(Influenza A virus)
Influenza A/WSN/1933
Mutant Library 3
PMID 27271655
cell line
(Homo sapiens)
HEK293ATF6f/XBP1s PMID 23583182
recombinant
DNA reagent
pHW181-PB2 PMID 10801978 plasmid
recombinant
DNA reagent
pHW182-PB1 PMID 10801978 plasmid
recombinant
DNA reagent
pHW183-PA PMID 10801978 plasmid
recombinant
DNA reagent
pHW184-HA PMID 10801978 plasmid
recombinant
DNA reagent
pHW185-NP PMID 10801978 plasmid
recombinant
DNA reagent
pHW186-NA PMID 10801978 plasmid
recombinant
DNA reagent
pHW187-M PMID 10801978 plasmid
recombinant
DNA reagent
pHW188-NS1 PMID 10801978 plasmid
software,
algorithm
dms_tools PMID 25990960

Plasmids

The following plasmids were used for generating A/WSN/1933 influenza virus: pHW181-PB2, pHW182-PB1, pHW183-PA, pHW184-HA, pHW185-NP, pHW186-NA, pHW187-M, and pHW188-NS1 (Hoffmann et al., 2000).

Cell culture

HEK293ATF6f/XBP1s cells were cultured at 37°C in a 5% CO2 atmosphere in DMEM (CellGro) supplemented with 10% fetal bovine serum (CellGro) and 1% penicillin/streptomycin/glutamine (CellGro). Cells were generated from parental HEK293 cells (ATCC CRL-1573; authenticated by STR-profiling) as previously described (Shoulders et al., 2013). Here, 1 μM TMP and 0.1 μg/mL doxycycline were used to activate ATF6 and XBP1s, respectively. All cell lines were periodically tested for mycoplasma using the MycoSensor PCR Assay Kit from Agilent (302109).

Influenza virus

All experiments were performed with influenza A/WSN/1933 (H1N1) HA viral mutant libraries (Doud and Bloom, 2016). All infections were performed in WSN media (OptiMEM-I; Thermo Fisher Scientific) supplemented with 0.5% heat-inactivated FBS, 0.3% BSA (Invitrogen), 100 U/mL of penicillin and 100 μg/mL of streptomycin (Bio Whittaker), and 100 μg/mL of CaCl2); swapping the inoculum with fresh WSN media 2 hr post-infection.

qPCR

HEK293ATF6f/XBP1s cells were seeded at 750,000 cells/well in a 12-well plate and treated with 0.01% DMSO, 0.1 μg/mL doxycycline (Sigma), or 0.1 μg/mL doxycycline and 1 μM TMP (Alfa Aesar) for 24 hr, or 100 μM arsenite for 2 hr as a positive control for heat shock response activation or 10 μg/mL tunicamycin for 6 hr as a positive control for unfolded protein response activation. To monitor chaperone levels during influenza infection, HEK293ATF6f/XBP1s cells were infected with influenza A/WSN/1933 at an MOI of 1 for 8 hr to mimic the environment of a cell infected with a single influenza virion as in the DMS. Cellular RNA was harvested using the Omega RNA Extraction kit with Homogenizer Columns. 1 μg RNA was used to prepare cDNA using random primers (total reaction volume = 20 μL; Applied Biosystems High-Capacity Reverse Transcription kit). The reverse transcription reaction was diluted to 80 μL with water, and 2 μL of each sample was used for qPCR with 2 × Sybr Green (Roche) and primers for human RPLP2 (housekeeping gene), HSP70, HSP40, HSP90, ERDJ4, GRP94, BIP, SEC24D, and influenza Matrix (primer sequences in Supplementary file 1). All gene transcript levels were normalized to that of RPLP2, and the fold-change in expression relative to DMSO-treated, mock-infected cells was calculated (Figure 2—figure supplement 3C; Figure 2—source data 3). For qPCR of influenza-infected cells, a standard curve was prepared with a pDZ plasmid backbone containing the Influenza PR8 M segment to determine influenza Matrix copy number, which was used as a positive control for productive infection (Figure 2—source data 3).

RNA-seq

For the HEK293ATF6f/XBP1s cell line characterization, HEK293ATF6f/XBP1s cells were seeded at 750,000 cells/well in a 12-well plate and treated with 0.01% DMSO, 0.1 μg/mL doxycycline, or 0.1 μg/mL doxycycline and 1 μM TMP for 24 hr, at either 37˚C or 39˚C. Cellular RNA was harvested using Qiagen RNeasy Plus Mini Kit with QIAshredder homogenization columns. RNA-Seq libraries were prepared using the Kapa mRNA HyperPrep RNA-seq library construction kit system and were sequenced on an Illumina HiSeq SE40.

RNA-seq analysis

Analyses were performed using tools and methods presented in Huang et al. (2009). Quality control: Reads were aligned against hg19 (Feb., 2009) using bwa mem v. 0.7.12-r1039 [RRID:SCR_010910] with flags –t 16 –f and mapping rates, fraction of multiply-mapping reads, number of unique 20-mers at the 5´ end of the reads, insert size distributions and fraction of ribosomal RNAs were calculated using bedtools v. 2.25.0 [RRID:SCR_006646] (Quinlan and Hall, 2010). In addition, each resulting bam file was randomly down-sampled to a million reads, which were aligned against hg19 and read density across genomic features were estimated for RNA-Seq-specific quality control metrics.

RNA-Seq mapping and quantitation: Reads were aligned against GRCh38/ENSEMBL 89 annotation using STAR v. 2.5.3a with the following flags -runThreadN 8 --runMode alignReads --outFilterType BySJout --outFilterMultimapNmax 20 --alignSJoverhangMin 8 --alignSJDBoverhangMin 1 --outFilterMismatchNmax 999 --alignIntronMin 10 --alignIntronMax 1000000 --alignMatesGapMax 1000000 --outSAMtype BAM SortedByCoordinate --quantMode TranscriptomeSAM with --genomeDir pointing to a 75nt-junction GRCh38 STAR suffix array (Dobin et al., 2013). Gene expression was quantitated using RSEM v. 1.3.0 [RRID:SCR_013027] with the following flags for all libraries: rsem-calculate-expression --calc-pme --alignments -p 8 --forward-prob 0 against an annotation matching the STAR SA reference (Li and Dewey, 2011). Posterior mean estimates (pme) of counts and estimated RPKM were retrieved.

Differential expression analysis: Treatments were compared against DMSO-treatment (Basal) either at 37˚C or 39˚C for each condition (Figure 2; Figure 2—figure supplement 1; Figure 2—source data 1). Briefly, differential expression was performed in the R statistical environment (R v. 3.4.0) using Bioconductor’s DESeq two package on the protein-coding genes only [RRID:SCR_000154] (Love et al., 2014). Dataset parameters were estimated using the estimateSizeFactors(), and estimateDispersions() functions; read counts across conditions were modeled based on a negative binomial distribution, and a Wald test was used to test for differential expression (nbinomWaldtest(), all packaged into the DESeq() function), using the treatment type as a contrast. Fold-changes and p-values were reported for each protein-coding gene. Heat maps were generated in Spotfire (Tibco), using a complete linkage function and the cosine correlation as distance metrics (Figure 2).

Comparison of upregulated genes in present study to previous study (Shoulders et al., 2013): This analysis was performed to compare genes upregulated upon XBP1s and ATF6f/XBP1s induction at 37˚C, relative to DMSO treatment at 37˚C. The analysis was restricted to genes with clear representation in both the whole-genome array (Shoulders et al., 2013) and the RNA-Seq annotation (n = 13058). Transcripts detected by the two methods were matched based on an Affymetrix probe ID for the whole-genome array and an ENSEMBL ID based on DAVID v6.8 for the RNA-Seq (Huang et al., 2009). Residual unmatched transcripts were matched based on NCBI IDs, and finally based on gene names (correcting for updates in official gene symbols between annotations). Both datasets were filtered applying a 2-fold change cutoff and an FDR of 0.05. The resulting overlap is presented in Figure 2—figure supplement 2. The genes for each subset of the Venn Diagram (Figure 2—figure supplement 2) are listed in Figure 2—source data 2. For each subgroup, ENSEMBL identities were retrieved for each gene and used in the DAVID online Gene Ontology environment (Huang et al., 2009) to call enriched functional categories against the background of 13058 genes jointly called across both RNA quantification methods (Figure 2—source data 2).

Cell growth assay

HEK293ATF6f/XBP1s cells were seeded at 750,000 cells/well in poly-D-lysine (Sigma)-coated (0.05 mg/mL in PBS, 37˚C for 15 min) 12-well plates and pre-treated with 0.01% DMSO, 0.1 μg/mL doxycycline, or 0.1 μg/mL doxycycline and 1 μM TMP, and placed at either 37˚C or 39˚C. To mimic the infection conditions, the cellular growth media was replaced with WSN media supplemented with 0.01% DMSO, 0.1 μg/mL doxycycline, or 0.1 μg/mL doxycycline and 1 μM TMP 16 hr post-treatment, returning plates to either 37˚C or 39˚C. 48 hr after the mock infection, the media was replaced with WSN media containing 50 μM resazurin sodium salt (Sigma). After 4 hr of incubation, 100 μL of media was used to quantify resorufin fluorescence (excitation 530 nm; emission 590 nm) using a Take-3 plate reader (BioTeK) (Figure 2—figure supplement 3A).

Influenza infection in modified host cell environments

HEK293ATF6f/XBP1s cells were seeded at 750,000 cells/well in poly-D-lysine-coated 12-well plates and treated with 0.01% DMSO, 0.1 μg/mL doxycycline, or 0.1 μg/mL doxycycline and 1 μM TMP, at either 37˚C or 39˚C, for 16 hr. To assess viral growth in each selection condition, pre-treated cells were infected with influenza A/WSN/1933 at an MOI of 0.01 virions/cell for 48 hr, at either 37˚C or 39˚C, to mimic DMS and pairwise competition conditions. Viral supernatant was harvested 48 hr post-infection and titered using a TCID50 assay (Figure 2—figure supplement 3B).

Deep mutational scanning

We employed three biological replicate viral libraries that were previously generated from three independently prepared HA mutant plasmid libraries (Doud and Bloom, 2016). HEK293ATF6f/XBP1s cells were plated in poly-D-lysine-coated 15 cm plates at a density of 50 × 106 cells per dish; 8 hr after plating, cells were treated with 0.01% DMSO, 0.1 μg/mL doxycycline, or 0.1 μg/mL doxycycline and 1 μM TMP for 16 hr, at either 37˚C or 39˚C. 16 hr after treatment, 1 × 106 Infectious virions (determined using a TCID50 assay) from each viral library were used to infect two 15 cm plates in each condition at an MOI of 0.01 virions/cell. In addition, one 15 cm plate at both 37˚C and 39˚C was either mock infected (negative control) or infected with wild-type virus. For infection, the cellular growth medium was replaced with WSN media containing a mutant virus library, wild-type virus, or no virus for mock infection. After 2 hr, the inoculum was replaced with fresh WSN media containing 0.01% DMSO, 0.1 μg/mL doxycycline, or 0.1 μg/mL doxycycline and 1 μM TMP. 48 hr post-infection, the viral supernatant was harvested, centrifuged at 1000 × g for 5 min to remove cell debris, and stored at –80°C. To extract viral RNA from the viral supernatant, the supernatant was thawed and virions were concentrated by ultracentrifugation, spinning at 25,000 rpm for 2 hr at 4˚C (SW 32 Ti swinging bucket rotor, Beckman Coulter). The supernatant was then decanted, and the virion pellet was resuspended in 140 μL QIAgen Viral RNA Buffer AVL. Viral RNA was then extracted using a QIAgen Viral RNA mini kit, per the manufacturer’s instructions, changing collection tubes at each step. Viral RNA was reverse transcribed using the AccuScript High Fidelity first strand cDNA synthesis kit (Agilent) using 5ʹ-WSN-HA and 3ʹ-WSN-HA primers (primer sequences in Supplementary file 1). At least 106 HA molecules were PCR-amplified for preparation of subamplicon sequencing libraries, as previously described (Doud and Bloom, 2016), to ensure sufficient sampling of viral library diversity. Briefly, this sequencing library preparation method appends unique, random barcodes and part of the Illumina adapter to HA subamplicon molecules. In a second round of PCR, the complexity of the uniquely barcoded subamplicons was controlled to be less than the sequencing depth, and the remainder of the Illumina adapter is appended (Doud and Bloom, 2016). The resulting libraries were sequenced on an Illumina HiSeq 2500 in rapid run mode with 2 × 250 bp paired-end reads.

Deep mutational scanning data analysis

The software dms_tools (http://jbloomlab.github.io/dms_tools/) (Bloom, 2015) was used to align reads to the Influenza A/WSN/1933 HA reference sequence, count amino acid variants across HA, and calculate the differential selection for each variant between two selection conditions, as previously described (Doud et al., 2017). Briefly, reads were trimmed, aligned to the Influenza A/WSN/1933 reference sequence, and read pairs were quality filtered by discarding read pairs with low-quality sites in the barcode region of either read, in addition to read pairs with more than 7.5% low quality sites. The remaining read pairs were sorted by barcode and barcodes with less than two read pairs were discarded. For each remaining barcode, mutation calls were only made when greater than 75% of reads concurred (Doud and Bloom, 2016; Bloom, 2015). For each sample, the numbers of read pairs, barcoded read pairs, and reads per barcode are reported in Source data 1 (see sample _summarystats.txt) and the sequencing data analyses are available at https://github.com/amphilli/HA_DMS_2018 (copy archived at https://github.com/elifesciences-publications/amphilli/HA_DMS_2018). For evaluating mutation differential selection, variants not present in each replicate starting library were removed from the analysis and were subsequently filtered for variants that incurred selection of the same sign across biological triplicates (Figure 5—source data 1). The mutation differential selection values for these filtered variants were visualized on sequence logo plots, made using dms_tools (Figure 4; Figure 4—figure supplements 15) (Doud et al., 2017; Bloom, 2015). Correlation between mutation differential selection values across selection conditions was determined using linear regression using Graph Pad Prism software, reporting Pearson correlation coefficients (Figure 5B–C; Figure 5—figure supplement 1). For each site in HA, the net site differential selection (net site diffsel) was calculated by summing the mutation differential selection values separately for each replicate starting library, excluding mutation differential selection values for variants that were not present in the respective starting library. These replicate net site diffsel values were then then averaged and sorted by HA secondary structure, relative surface accessibility (Gamblin et al., 2004; Joosten et al., 2011), receptor binding and antigenic sites (Doud and Bloom, 2016), N-glycosylation sequons and disulfide bond sites (Chen et al., 1995; Daniels et al., 2003; Hebert et al., 1997), and natural HA site conservation (Zhang et al., 2017) (Figure 7; Figure 7—figure supplement 1; Figure 7—source data 1). HA secondary structure and relative surface accessibility were determined using DSSP (Joosten et al., 2011; Kabsch and Sander, 1983) based on the HA crystal structure (PDBID 1RVX) (Gamblin et al., 2004). In Figure 7A, sites designated buried were the 50% least surface-accessible sites; sites designated exposed were the 50% most surface-accessible sites. Receptor binding and antigenic sites were classified as in Doud et al, and included sites nearby antigenic sites (Doud and Bloom, 2016). Natural HA site conservation was based on 989 non-redundant human H1 sequences preceding the 2009 H1N1 pandemic, accessed from the NIAID Influenza Research Database (accessed Jan. 10th, 2018) (Zhang et al., 2017). Site entropies were calculating using the Influenza Research Database SNP Analysis Tool (Zhang et al., 2017), and were normalized to a scale of 0–100, with a site entropy of 100 representing the most variable site in HA and 0 representing the most conserved site (Figure 7B). In Figure 7A, sites designated conserved had a site entropy of zero; all other sites were designated as variable. The net site differential selection values were mapped onto the HA crystal structure (PDBID 1RVX) (Gamblin et al., 2004) using PyMOL (DeLano, 2002) (Figure 8; Figure 8—figure supplement 1).

Data availability

FASTQ files for DMS sequencing are available in the Sequence Read Archive under accession number SRP149672. The deep mutational scanning data analyses are available at https://github.com/amphilli/HA_DMS_2018, and is also available in Source data 1. All mutation differential selection values from deep mutational scanning (pre- and post-filtering) are available in Figure 5—source data 1. The complete RNAseq data are available from GEO under accession number GSE115168.

Infectious viral titering via tissue culture infectious dose (TCID50) assay

10-Fold dilutions of each virus were prepared in quadruplicate in 96-well plates. 5,000 MDCK-SIAT1 cells were then added to each well and incubated at 37°C for 72 hr, after which the wells were scored for the presence of cytopathic effect. The dilutions of virus displaying cytopathic effect in the MDCK-SIAT1 cells were then used to calculate the TCID50/μL using https://github.com/jbloomlab/reedmuenchcalculator as described by Thyagarajan and Bloom (Thyagarajan and Bloom, 2014), where virions/μL = 0.69*TCID50/μL.

Reverse genetics pairwise viral competitions

HA mutant-encoding plasmids were prepared by introducing point mutations into the pHW184-HA plasmid using the QuikChange II XL Site-Directed Mutagenesis Kit (Agilent 200522). Mutant viruses were generated from plasmids by transfecting a co-culture of 2.5 × 104 MDCK-SIAT1 and 3 × 105 HEK 293 T cells, as previously described (Hoffmann et al., 2000), followed by titering using the TCID50 assay. For each competition, HEK293ATF6f/XBP1s cells were plated in poly-D-lysine-coated 24-well dishes at 3.5 × 105 cells/well and treated 8 hr later with 0.01% DMSO, 0.1 μg/mL doxycycline, or 0.1 μg/mL doxycycline and 1 μM TMP at either 37°C or 39°C. 24 hr after plating, cells were infected with a 1:1 mixture of wild-type and mutant viruses at an MOI of 0.01 virions/cell in biological triplicate under conditions identical to that of the deep mutational scanning experiment. After 2 hr, the inoculum was replaced with fresh WSN media supplied with 0.01% DMSO, 0.1 μg/mL doxycycline, or 0.1 μg/mL doxycycline and 1 μM TMP. 48 hr post-infection, infectious supernatant was harvested, centrifuged at 1000 × g for 5 min to remove cell debris, and stored at –80°C. Viral RNA was extracted from 140 μL infectious supernatant using the QIAamp Viral RNA Mini kit and at least 106 HA molecules were reverse transcribed using SuperScript III Reverse Transcriptase (Thermo Fisher Scientific) with 5ʹ-WSN-HA and 3ʹ-WSN-HA primers (Supplementary file 1). The dsDNA was purified twice using 0.9 × AMPure XP beads (Beckman Coulter) and quantified using a Quant-iT PicoGreen assay (Life Technologies). Illumina NexteraXT sequencing libraries were prepared using a Mosquito HTS liquid handler (TTP Labtech) and sequenced on an Illumina HiSeq with 50 bp single-end reads.

Pairwise competition sequencing data analysis

To overcome biases in read mapping from consecutive, multi-nucleotide sequence variants, mapping target sequences were generated for each Influenza A/WSN/1933 HA variant. Briefly, regions flanking the variant (given the experimental read length) would be identified, relevant groups of bases substituted with their engineered variants, and the rest of the sequence substituted with ‘N’ in order to maintain the sequence position of said variants, thus generating sets of ‘pseudo-haplotypes’. Sequencing reads were aligned to the wild-type Influenza A/WSN/1933 HA sequence and the set of pseudo-haplotypes using bwa mem (v. 0.7.12-r1039) (arXiv:1303.3997), with flag –t 16, and sorted and indexed bam files were generated using samtools (v 1.3) (Li et al., 2009). These bam files were processed using samtools mpileup with flags –excl-flags 2052, -d 30000000 and the same reference sequences used for mapping (Sievers et al., 2011). Allele counts at each position of the wild-type and pseudo-haplotype sequences were summed and reported along the wild-type Influenza A/WSN/1933 HA sequence for each sample for calculation of variant frequencies. The change in variant frequency upon selection was normalized to that of wild-type, thus providing diffsel values on the same scale as the DMS competition (Figure 6B; Figure 6—figure supplement 1).

Statistics

All experiments were performed in biological triplicate with replicates defined as independent experimental entireties (i.e., from plating the cells to acquiring the data). For deep mutational scanning, each biological replicate mutant viral library was prepared from independently generated mutant plasmid libraries, as previously reported (Doud and Bloom, 2016). Differential selection values from deep mutational scanning, as well as selection values from pairwise competitions, were tested for significance of deviation from zero (wild-type behavior), using a one-sample t-test in Graph Pad Prism (Figure 5A; Figure 6—figure supplement 1). Differential selection values from pairwise competitions were compared between selection conditions by a Student’s t-test (Figure 6—figure supplement 1). Correlation between diffsel values was determined by linear regression using Graph Pad Prism software, reporting Pearson correlation coefficients and p-values from F-tests, testing significance of slope deviating from zero (Figure 5B–C; Figure 5—figure supplement 1). Correlation between site diffsel values (N = 565) was determined by linear regression using Pandas Python package, reporting Pearson correlation coefficients (Figure 3—source data 1). For DMS mutational tolerance analyses, statistical significance of the deviation of the mean of the net site diffsel distribution from zero (wild-type behavior) was tested by a Student’s t-test; one-tailed p-values are reported, to assess whether the net site diffsel distributions for XBP1s and ATF6f/XBP1s induction are significantly greater than zero, and whether the net site diffsel distribution for increased temperature is significantly less than zero (Figure 7A; Figure 7—figure supplement 1; Figure 7—source data 1).

Acknowledgements

The authors would like to thank Christopher E R Richardson (MIT), Andrew S DiChiara (AbbVie), and Chet M Berman (MIT) for feedback on this manuscript.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Matthew D Shoulders, Email: mshoulde@mit.edu.

Franz-Ulrich Hartl, Max Planck Institute for Biochemistry, Germany.

Wenhui Li, National Institute of Biological Sciences, China.

Funding Information

This paper was supported by the following grants:

  • National Science Foundation Graduate Research Fellowship to Angela M Phillips.

  • National Cancer Institute Koch Institute Support (core) Grant P30-CA14051 to Vincent L Butty, Matthew D Shoulders.

  • National Institute of Environmental Health Sciences MIT Center for Environmental Health Sciences (core) Grant P30-ES002109 to Vincent L Butty, Matthew D Shoulders.

  • Tufts University to Yu-Shan Lin.

  • National Institutes of Health Al127893 to Jesse D Bloom.

  • Howard Hughes Medical Institute Faculty Scholars grant to Jesse D Bloom.

  • Simons Foundation Faculty Scholars grant to Jesse D Bloom.

  • National Science Foundation CAREER Award 1652390 to Matthew D Shoulders.

  • Richard and Susan Smith Family Foundation Award for Excellence in Biomedical Research to Matthew D Shoulders.

  • Massachusetts Institute of Technology to Matthew D Shoulders.

  • National Institutes of Health 1DP2GM119162 to Matthew D Shoulders.

Additional information

Competing interests

No competing interests declared.

Author contributions

Conceptualization, Formal analysis, Funding acquisition, Investigation, Visualization, Writing—original draft, Writing—review and editing.

Resources, Methodology, Writing—review and editing.

Formal analysis, Investigation, Writing—review and editing.

Data curation, Formal analysis, Writing—review and editing.

Resources, Formal analysis, Visualization, Writing—review and editing.

Resources, Software, Supervision, Methodology, Writing—review and editing.

Conceptualization, Supervision, Funding acquisition, Project administration, Writing—review and editing.

Additional files

Source data 1. Complete analysis of deep mutational scanning data.
elife-38795-data1.zip (22.8MB, zip)
DOI: 10.7554/eLife.38795.028
Supplementary file 1. Primer sequences for qPCR and HA sequencing.
elife-38795-supp1.docx (16KB, docx)
DOI: 10.7554/eLife.38795.029
Transparent reporting form
DOI: 10.7554/eLife.38795.030

Data availability

FASTQ files for DMS sequencing are available in the Sequence Read Archive under accession number SRP149672. The deep mutational scanning data analysis will be available upon publication at https://github.com/amphilli/HA_DMS_2018 (copy archived at https://github.com/elifesciences-publications/amphilli/HA_DMS_2018), and is also available in Source Data 1. All differential selection values from deep mutational scanning (pre- and post-filtering) are available in Figure 5-source data 1. The complete RNAseq data are available from GEO under accession number GSE115168.

The following datasets were generated:

Angela M Phillips, author; Michael B Doud, author; Luna O Gonzalez, author; Vincent L Butty, author; Yu-Shan Lin, author; Jesse D Bloom, author; Matthew D Shoulders, author. ER proteostasis and temperature differentially impact the mutational tolerance of influenza hemagglutinin. 2018 https://www.ncbi.nlm.nih.gov/sra/SRP149672 Publicly available at the NCBI Gene Expression Omnibus (accession no: SRP149672)

Angela M Phillips, author; Michael B Doud, author; Luna O Gonzalez, author; Vincent L Butty, author; Yu-Shan Lin, author; Jesse D Bloom, author; Matthew D Shoulders, author. ER proteostasis and temperature differentially impact the mutational tolerance of influenza hemagglutinin. 2018 https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE115168 Publicly available at the NCBI Gene Expression Omnibus (accession no: GSE115168)

References

  1. Balch WE, Morimoto RI, Dillin A, Kelly JW. Adapting proteostasis for disease intervention. Science. 2008;319:916–919. doi: 10.1126/science.1141448. [DOI] [PubMed] [Google Scholar]
  2. Bloom JD, Glassman MJ. Inferring stabilizing mutations from protein phylogenies: application to influenza hemagglutinin. PLoS Computational Biology. 2009;5:e1000349. doi: 10.1371/journal.pcbi.1000349. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Bloom JD. Software for the analysis and visualization of deep mutational scanning data. BMC Bioinformatics. 2015;16:168–184. doi: 10.1186/s12859-015-0590-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Chen W, Helenius J, Braakman I, Helenius A. Cotranslational folding and calnexin binding during glycoprotein synthesis. PNAS. 1995;92:6229–6233. doi: 10.1073/pnas.92.14.6229. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Cowen LE, Lindquist S. Hsp90 potentiates the rapid evolution of new traits: drug resistance in diverse fungi. Science. 2005;309:2185–2189. doi: 10.1126/science.1118370. [DOI] [PubMed] [Google Scholar]
  6. Daniels R, Kurowski B, Johnson AE, Hebert DN. N-linked glycans direct the cotranslational folding pathway of influenza hemagglutinin. Molecular Cell. 2003;11:79–90. doi: 10.1016/S1097-2765(02)00821-3. [DOI] [PubMed] [Google Scholar]
  7. DeLano WL. The PyMOL Molecular Graphics System 2002
  8. DePristo MA, Weinreich DM, Hartl DL. Missense meanderings in sequence space: a biophysical view of protein evolution. Nature Reviews Genetics. 2005;6:678–687. doi: 10.1038/nrg1672. [DOI] [PubMed] [Google Scholar]
  9. DiChiara AS, Taylor RJ, Wong MY, Doan ND, Rosario AM, Shoulders MD. Mapping and exploring the Collagen-I proteostasis network. ACS Chemical Biology. 2016;11:1408–1421. doi: 10.1021/acschembio.5b01083. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, Batut P, Chaisson M, Gingeras TR. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29:15–21. doi: 10.1093/bioinformatics/bts635. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Doud MB, Bloom JD. Accurate measurement of the effects of all amino-acid mutations on influenza hemagglutinin. Viruses. 2016;8:155. doi: 10.3390/v8060155. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Doud MB, Hensley SE, Bloom JD. Complete mapping of viral escape from neutralizing antibodies. PLoS Pathogens. 2017;13:e1006271. doi: 10.1371/journal.ppat.1006271. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Fowler DM, Fields S. Deep mutational scanning: a new style of protein science. Nature Methods. 2014;11:801–807. doi: 10.1038/nmeth.3027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Frabutt DA, Wang B, Riaz S, Schwartz RC, Zheng YH. Innate sensing of influenza A virus hemagglutinin glycoproteins by the host endoplasmic reticulum (ER) Stress pathway triggers a potent antiviral response via ER-Associated protein degradation. Journal of Virology. 2018;92:e01690-17. doi: 10.1128/JVI.01690-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Gamblin SJ, Haire LF, Russell RJ, Stevens DJ, Xiao B, Ha Y, Vasisht N, Steinhauer DA, Daniels RS, Elliot A, Wiley DC, Skehel JJ. The structure and receptor binding properties of the 1918 influenza hemagglutinin. Science. 2004;303:1838–1842. doi: 10.1126/science.1093155. [DOI] [PubMed] [Google Scholar]
  16. Geiler-Samerotte KA, Zhu YO, Goulet BE, Hall DW, Siegal ML. Selection transforms the landscape of genetic variation interacting with Hsp90. PLoS Biology. 2016;14:e2000465. doi: 10.1371/journal.pbio.2000465. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Geller R, Vignuzzi M, Andino R, Frydman J. Evolutionary constraints on chaperone-mediated folding provide an antiviral approach refractory to development of drug resistance. Genes & Development. 2007;21:195–205. doi: 10.1101/gad.1505307. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Geller R, Taguwa S, Frydman J. Broad action of Hsp90 as a host chaperone required for viral replication. Biochimica et biophysica acta. 2012;1823:698–706. doi: 10.1016/j.bbamcr.2011.11.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Geller R, Pechmann S, Acevedo A, Andino R, Frydman J. Hsp90 shapes protein and RNA evolution to balance trade-offs between protein stability and aggregation. Nature Communications. 2018;9:1781. doi: 10.1038/s41467-018-04203-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Gong LI, Suchard MA, Bloom JD. Stability-mediated epistasis constrains the evolution of an influenza protein. eLife. 2013;2:e00631. doi: 10.7554/eLife.00631. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Hartl FU, Bracher A, Hayer-Hartl M. Molecular chaperones in protein folding and proteostasis. Nature. 2011;475:324–332. doi: 10.1038/nature10317. [DOI] [PubMed] [Google Scholar]
  22. Hassan IH, Zhang MS, Powers LS, Shao JQ, Baltrusaitis J, Rutkowski DT, Legge K, Monick MM. Influenza A viral replication is blocked by inhibition of the inositol-requiring enzyme 1 (IRE1) stress pathway. Journal of Biological Chemistry. 2012;287:4679–4689. doi: 10.1074/jbc.M111.284695. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Hebert DN, Zhang JX, Chen W, Foellmer B, Helenius A. The number and location of glycans on influenza hemagglutinin determine folding and association with calnexin and calreticulin. The Journal of Cell Biology. 1997;139:613–623. doi: 10.1083/jcb.139.3.613. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Hensley SE, Das SR, Bailey AL, Schmidt LM, Hickman HD, Jayaraman A, Viswanathan K, Raman R, Sasisekharan R, Bennink JR, Yewdell JW. Hemagglutinin receptor binding avidity drives influenza A virus antigenic drift. Science. 2009;326:734–736. doi: 10.1126/science.1178258. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Hoffmann E, Neumann G, Kawaoka Y, Hobom G, Webster RG. A DNA transfection system for generation of influenza A virus from eight plasmids. PNAS. 2000;97:6108–6113. doi: 10.1073/pnas.100133697. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Huang daW, Sherman BT, Lempicki RA. Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Research. 2009;37:1–13. doi: 10.1093/nar/gkn923. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Hurtley SM, Bole DG, Hoover-Litty H, Helenius A, Copeland CS. Interactions of misfolded influenza virus hemagglutinin with binding protein (BiP) The Journal of Cell Biology. 1989;108:2117–2126. doi: 10.1083/jcb.108.6.2117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Joosten RP, te Beek TA, Krieger E, Hekkelman ML, Hooft RW, Schneider R, Sander C, Vriend G. A series of PDB related databases for everyday needs. Nucleic Acids Research. 2011;39:D411–D419. doi: 10.1093/nar/gkq1105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Kabsch W, Sander C. Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers. 1983;22:2577–2637. doi: 10.1002/bip.360221211. [DOI] [PubMed] [Google Scholar]
  30. Klein EY, Blumenkrantz D, Serohijos A, Shakhnovich E, Choi JM, Rodrigues JV, Smith BD, Lane AP, Feldman A, Pekosz A. Stability of the influenza virus hemagglutinin protein correlates with evolutionary dynamics. mSphere. 2018;3:e00554-17. doi: 10.1128/mSphereDirect.00554-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Laschet C, Dupuis N, Hanson J. The G protein-coupled receptors deorphanization landscape. Biochemical Pharmacology. 2018;153:62–74. doi: 10.1016/j.bcp.2018.02.016. [DOI] [PubMed] [Google Scholar]
  32. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, 1000 Genome Project Data Processing Subgroup The sequence alignment/Map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Li B, Dewey CN. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinformatics. 2011;12:323–338. doi: 10.1186/1471-2105-12-323. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biology. 2014;15:550–570. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Marioni JC, Mason CE, Mane SM, Stephens M, Gilad Y. RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays. Genome Research. 2008;18:1509–1517. doi: 10.1101/gr.079558.108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Nakajima S, Brown DJ, Ueda M, Nakajima K, Sugiura A, Pattnaik AK, Nayak DP. Identification of the defects in the hemagglutinin gene of two temperature-sensitive mutants of A/WSN/33 influenza virus. Virology. 1986;154:279–285. doi: 10.1016/0042-6822(86)90454-X. [DOI] [PubMed] [Google Scholar]
  37. Pankow S, Bamberger C, Calzolari D, Martínez-Bartolomé S, Lavallée-Adam M, Balch WE, Yates JR. ∆F508 CFTR interactome remodelling promotes rescue of cystic fibrosis. Nature. 2015;528:510–516. doi: 10.1038/nature15729. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Phillips AM, Gonzalez LO, Nekongo EE, Ponomarenko AI, McHugh SM, Butty VL, Levine SS, Lin YS, Mirny LA, Shoulders MD. Host proteostasis modulates influenza evolution. eLife. 2017;6:e28652. doi: 10.7554/eLife.28652. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Powers ET, Morimoto RI, Dillin A, Kelly JW, Balch WE. Biological and chemical approaches to diseases of proteostasis deficiency. Annual Review of Biochemistry. 2009;78:959–991. doi: 10.1146/annurev.biochem.052308.114844. [DOI] [PubMed] [Google Scholar]
  40. Queitsch C, Sangster TA, Lindquist S. Hsp90 as a capacitor of phenotypic variation. Nature. 2002;417:618–624. doi: 10.1038/nature749. [DOI] [PubMed] [Google Scholar]
  41. Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26:841–842. doi: 10.1093/bioinformatics/btq033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Rohner N, Jarosz DF, Kowalko JE, Yoshizawa M, Jeffery WR, Borowsky RL, Lindquist S, Tabin CJ. Cryptic variation in morphological evolution: HSP90 as a capacitor for loss of eyes in cavefish. Science. 2013;342:1372–1375. doi: 10.1126/science.1240276. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Russell AB, Trapnell C, Bloom JD. Extreme heterogeneity of influenza virus infection in single cells. eLife. 2018;7:e32303. doi: 10.7554/eLife.32303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Sangster TA, Salathia N, Lee HN, Watanabe E, Schellenberg K, Morneau K, Wang H, Undurraga S, Queitsch C, Lindquist S. HSP90-buffered genetic variation is common in Arabidopsis thaliana. PNAS. 2008;105:2969–2974. doi: 10.1073/pnas.0712210105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Sauter NK, Hanson JE, Glick GD, Brown JH, Crowther RL, Park SJ, Skehel JJ, Wiley DC. Binding of influenza virus hemagglutinin to analogs of its cell-surface receptor, sialic acid: analysis by proton nuclear magnetic resonance spectroscopy and X-ray crystallography. Biochemistry. 1992;31:9609–9621. doi: 10.1021/bi00155a013. [DOI] [PubMed] [Google Scholar]
  46. Shoulders MD, Ryno LM, Genereux JC, Moresco JJ, Tu PG, Wu C, Yates JR, Su AI, Kelly JW, Wiseman RL, Pg T, Cl W. Stress-independent activation of XBP1s and/or ATF6 reveals three functionally diverse ER proteostasis environments. Cell Reports. 2013;3:1279–1292. doi: 10.1016/j.celrep.2013.03.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Sievers F, Wilm A, Dineen D, Gibson TJ, Karplus K, Li W, Lopez R, McWilliam H, Remmert M, Söding J, Thompson JD, Higgins DG. Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Molecular Systems Biology. 2011;7:539. doi: 10.1038/msb.2011.75. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Skehel JJ, Wiley DC. Receptor binding and membrane fusion in virus entry: the influenza hemagglutinin. Annual Review of Biochemistry. 2000;69:531–569. doi: 10.1146/annurev.biochem.69.1.531. [DOI] [PubMed] [Google Scholar]
  49. Thyagarajan B, Bloom JD. The inherent mutational tolerance and antigenic evolvability of influenza hemagglutinin. eLife. 2014;3:e03300. doi: 10.7554/eLife.03300. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Tokuriki N, Tawfik DS. Stability effects of mutations and protein evolvability. Current Opinion in Structural Biology. 2009a;19:596–604. doi: 10.1016/j.sbi.2009.08.003. [DOI] [PubMed] [Google Scholar]
  51. Tokuriki N, Tawfik DS. Chaperonin overexpression promotes genetic variation and enzyme evolution. Nature. 2009b;459:668–673. doi: 10.1038/nature08009. [DOI] [PubMed] [Google Scholar]
  52. Ueda M, Sugiura A. Physiological characterization of influenza virus temperature-sensitive mutants defective in the haemagglutinin gene. Journal of General Virology. 1984;65:1889–1897. doi: 10.1099/0022-1317-65-11-1889. [DOI] [PubMed] [Google Scholar]
  53. Wong MY, DiChiara AS, Suen PH, Chen K, Doan ND, Shoulders MD. Adapting secretory proteostasis and function through the unfolded protein response. Current Topics in Microbiology and Immunology. 2018;414:1–25. doi: 10.1007/82_2017_56. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Wyganowski KT, Kaltenbach M, Tokuriki N. GroEL/ES buffering and compensatory mutations promote protein evolution by stabilizing folding intermediates. Journal of Molecular Biology. 2013;425:3403–3414. doi: 10.1016/j.jmb.2013.06.028. [DOI] [PubMed] [Google Scholar]
  55. Wylie CS, Shakhnovich EI. A biophysical protein folding model accounts for most mutational fitness effects in viruses. PNAS. 2011;108:9916–9921. doi: 10.1073/pnas.1017572108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Zhang Y, Aevermann BD, Anderson TK, Burke DF, Dauphin G, Gu Z, He S, Kumar S, Larsen CN, Lee AJ, Li X, Macken C, Mahaffey C, Pickett BE, Reardon B, Smith T, Stewart L, Suloway C, Sun G, Tong L, Vincent AL, Walters B, Zaremba S, Zhao H, Zhou L, Zmasek C, Klem EB, Scheuermann RH. Influenza research database: an integrated bioinformatics resource for influenza virus research. Nucleic Acids Research. 2017;45:D466–D474. doi: 10.1093/nar/gkw857. [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision letter

Editor: Franz-Ulrich Hartl1
Reviewed by: Raul Andino2, Jeffery W Kelly3

In the interests of transparency, eLife includes the editorial decision letter and accompanying author responses. A lightly edited version of the letter sent to the authors after peer review is shown, indicating the most substantive concerns; minor comments are not usually included.

Thank you for submitting your article "Enhanced ER proteostasis and temperature differentially impact the mutational tolerance of influenza hemagglutinin" for consideration by eLife. Your article has been reviewed by three peer reviewers, including Ulrich Hartl as the Reviewing Editor and Reviewer #1, and the evaluation has been overseen by Wenhui Li as the Senior Editor. The following individuals involved in review of your submission have agreed to reveal their identity: Raul Andino (Reviewer #2); Jeffery W Kelly (Reviewer #3).

The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.

Summary:

In this manuscript, Philips et al. explore the impact of ER proteostasis on the mutational and fitness landscape of a model substrate of the secretory pathway – influenza virus hemagglutinin (HA). The authors extend upon their (and others') previous work on host cell cytosolic proteostasis influence on viral sequence space.

The authors make use of a saturating library of single site HA mutants in a cellular assay of infection and replicative fitness of the mutant virions in comparison to the wild type HA in various conditions. Using targeted deep sequencing of HA, they were able to identify the spectrum of mutations which were either enriched or depleted (deep mutational scanning or DMS) under these conditions. They show that enhancing host cell ER proteostasis broadly increases mutational tolerance across the length of the model substrate HA and is likely to be relevant to shaping evolutionary trajectories of endogenous secretory pathway proteins. Importantly, they observe a striking rescue of high-temperature sensitive mutants by such an enhanced ER proteostatic state.

The data presented is comprehensive and extends our understanding of how compartment specific proteostasis mechanisms assist in shaping protein evolution.

Essential revisions:

1) A major issue with the authors’ central conclusion is that they did not examine other viral protein. If the proteostasis machinery has a direct effect on HA evolution it would be important to present evidence that other viral proteins are not affected, e.g. those that are not translocated into the ER. This point is important because it is possible that upregulation of the ER proteostasis pathway may result in pleiotropic and indirect effects on virus evolution. For example, ER proteostasis factors may affect expression of other factors that regulate HA function. Indeed, XBP1s regulates ~350 genes and ATF6f also affects a number of chaperones and quality control components. If other viral proteins are not analyzed, it is difficult to conclude that the observed effect is direct on client proteins.

It is suggested that the authors make a clear statement to the effect that their data cannot exclude the possibility that proteostasis effects not specific for the ER play a role.

2) Similarly, the increase on temperature (39°C) could modify factors and the kinetics of infection that indirectly can affect selection on HA variants. For example, have the authors considered the effect on translation and ER translocation rates?

This point could be addressed either experimentally or verbally.

3) In Figure 2—figure supplement 1, the authors examine the modified background conditions prior to conducting the DMS using RNAseq and claim consistency with their previous dataset. A visual comparison in the form of a Venn diagram (fold change based, at least, given that their previous dataset was from microarrays) can be presented to clarify the degree of overlap and outliers, if any.

4) Subsection “Modulating ER proteostasis during influenza infection”, third paragraph. The authors state that reduced growth did not affect the interpretation of the data. They should provide the average depth per sample and other related statistics either as a supplementary table or describe this in the methods. A replicate fidelity measure like a correlation coefficient or simple PCA would add to the validity of the dataset as a whole.

5) In the section describing the data filtering (subsection “Deep mutational scanning of HA in modulated ER proteostasis environments”, third paragraph), the reader would like to know the actual dropout percentages as per exclusion criteria (given numerically as a table).

6) In Figure 5A, the authors describe that at high temperature there is a gross reduction in the variant growth, consistent with the hypothesis of biophysical constraints imposed on them. Physiologically, this is likely due to a misfolding and degradation of the variants. Given that host cells do not upregulate components of ERAD at higher temperature (Figure 2 and Figure 2—figure supplement 1E) – is ERAD engaged more at such temperatures without burdening the overall proteostasis? Also, in conditions of UPR activation, at permissive and higher temperatures, the RNAseq data reveal highly upregulated ERAD components (in addition to other factors). Should the same variants not be subject to degradation? How many variants do not get enriched upon the ER proteostasis boost? The authors could explain their results in some more specific details.

7) Again related to Figure 5A, in the subsection “Deep mutational scanning of HA in modulated ER proteostasis environments”, fifth paragraph, describe the redundancy of the variants having a positive diffsel score between XBP1s and ATF6f/XBP1s conditions. Although the authors comment about the similarity, it would be interesting to see which sites overlap and which do not, possibly shedding light on subtle differences between the two modes of UPR activation.

8) Figure 7A describes the mutational tolerance in the XBP1s condition alone. What about the ATF6f/XBP1s condition?

9) In Figure 7B and 7C, the y-axis is unclear – is it a continuum from conserved to variable? Buried to exposed?

eLife. 2018 Sep 6;7:e38795. doi: 10.7554/eLife.38795.037

Author response


Essential revisions:

1) A major issue with the authors’ central conclusion is that they did not examine other viral protein. If the proteostasis machinery has a direct effect on HA evolution it would be important to present evidence that other viral proteins are not affected, e.g. those that are not translocated into the ER. This point is important because it is possible that upregulation of the ER proteostasis pathway may result in pleiotropic and indirect effects on virus evolution. For example, ER proteostasis factors may affect expression of other factors that regulate HA function. Indeed, XBP1s regulates ~350 genes and ATF6f also affects a number of chaperones and quality control components. If other viral proteins are not analyzed, it is difficult to conclude that the observed effect is direct on client proteins.

It is suggested that the authors make a clear statement to the effect that their data cannot exclude the possibility that proteostasis effects not specific for the ER play a role.

We agree with the reviewers that our data cannot exclude the possibility of secondary effects of ER proteostasis on HA mutational tolerance. In the original submission, we briefly addressed this possibility in the Discussion. We have now expanded that discussion to include the possible secondary effects of increased temperature (as requested in Essential revision #2, below), and also to explicitly state that future work is necessary to determine if the observed effects are specific to secretory pathway client proteins:

“Similar to previous studies on the role of HSP90 in the evolution of endogenous protein clients (Cowen and Lindquist, 2005), our observations may derive from direct interactions between host proteostasis components and HA, or as an indirect consequence of perturbing proteostasis. […] Alternatively, ER proteostasis factors and increased temperature may affect the kinetics of infection or expression of third-party mediators that regulate HA function or fitness.”

“Further investigation into the impact of ER proteostasis factors on the mutational tolerance of additional secretory pathway client proteins, as well as non-client proteins, will elucidate the underlying molecular details and the pervasiveness of our findings.”

2) Similarly, the increase on temperature (39°C) could modify factors and the kinetics of infection that indirectly can affect selection on HA variants. For example, have the authors considered the effect on translation and ER translocation rates?

This point could be addressed either experimentally or verbally.

We have addressed this point verbally (see above statement from Discussion).

3) In Figure 2—figure supplement 1, the authors examine the modified background conditions prior to conducting the DMS using RNAseq and claim consistency with their previous dataset. A visual comparison in the form of a Venn diagram (fold change based, at least, given that their previous dataset was from microarrays) can be presented to clarify the degree of overlap and outliers, if any.

We now added the requested comparative analysis for the induction of XBP1s alone or with ATF6f specifically at 37˚C, as only this temperature was tested in our previous dataset (Shoulders et al., 2013). Per the request, this comparison is now presented as a Venn diagram in Figure 2—figure supplement 2. We also expanded our discussion of this dataset comparison to address the degree of overlap and possible explanations for discrepancies (e.g. different treatment conditions and transcriptomic measurements), as follows:

“The media composition and treatment regimen for influenza propagation differed from the conditions used in the original characterization of XBP1s and ATF6f/XBP1s activation in these cells, which also was performed only at 37 ˚C (Shoulders et al., 2013). […] Still, as expected (Shoulders et al., 2013), transcripts upregulated upon XBP1s and ATF6f/XBP1s induction largely corresponded to secretory pathway and ER stress response genes (Figure 2—figure supplement 1 and Figure 2—source data 2).”

4) Subsection “Modulating ER proteostasis during influenza infection”, third paragraph. The authors state that reduced growth did not affect the interpretation of the data. They should provide the average depth per sample and other related statistics either as a supplementary table or describe this in the methods. A replicate fidelity measure like a correlation coefficient or simple PCA would add to the validity of the dataset as a whole.

We agree with the reviewer that these sequencing statistics are important for assessing the validity of the dataset. In the original submission, the number of read pairs, number of barcoded read pairs, and number of reads per barcode were reported for each sample in Source Data 1. We now realize that this information was not easily accessible, and revised the Materials and Methods to explicitly describe the sequencing library preparation and data processing and also specified the precise location of this information in Source Data 1. We also note that the samples with reduced viral growth were held to identical library preparation and sequencing data analyses standards, such that the diversity of each mutant viral library was sufficiently sampled and accurately quantified. The revised discussion follows below:

“At least 106 HA molecules were PCR-amplified for preparation of subamplicon sequencing libraries, as previously described (Russell, Trapnell and Bloom, 2018), to ensure sufficient sampling of viral library diversity. […] In a second round of PCR, the complexity of the uniquely barcoded subamplicons is controlled to be less than the sequencing depth, and the remainder of the Illumina adapter is appended (Russell, Trapnell and Bloom, 2018).”

“dms_tools (http://jbloomlab.github.io/dms_tools/) (Doud and Bloom, 2016) was used to align reads to the Influenza A/WSN/1933 HA reference sequence, count amino acid variants across HA, and calculate the differential selection for each variant between two selection conditions, as previously described (Doud, Hensley and Bloom, 2017). […] For each sample, the number of read pairs, barcoded read pairs, and reads per barcode are reported in Source Data 1 (see sample _summarystats.txt) and the complete sequencing data analyses are available at https://github.com/amphilli/HA_DMS_2018.”

Furthermore, per the reviewers’ suggestion, we evaluated the correlation between unfiltered site differential selection values for biological replicates, and have provided these correlation coefficients in Figure 3—source data 1. Additionally, we have added a statement to the Results to summarize the correlation between replicates:

“Notably, the unfiltered biological replicate diffsel values were strongly correlated across HA sites, with correlation coefficients ranging from R = 0.67–0.78 (Figure 3—source data 1 and Source Data 1).”

5) In the section describing the data filtering (subsection “Deep mutational scanning of HA in modulated ER proteostasis environments”, third paragraph), the reader would like to know the actual dropout percentages as per exclusion criteria (given numerically as a table).

There is now a table presented in Figure 3C summarizing the percentage of amino acid space sampled by each replicate mutant library, and also the percentage sampled for all three replicate libraries, and hence passing the filtering criteria. We note that, while calculating these numbers in revision, we noticed a mistake in our original filtering code that caused it to be falsely stringent for select phenylalanine variants and falsely relaxed for select glycine variants. After fixing this mistake and re-analyzing the data, the change in whether these variants passed our filters or not had no significant impact on the results or on any of our analyses.

6) In Figure 5A, the authors describe that at high temperature there is a gross reduction in the variant growth, consistent with the hypothesis of biophysical constraints imposed on them. Physiologically, this is likely due to a misfolding and degradation of the variants. Given that host cells do not upregulate components of ERAD at higher temperature (Figure 2 and Figure 2—figure supplement 1E) – is ERAD engaged more at such temperatures without burdening the overall proteostasis? Also, in conditions of UPR activation, at permissive and higher temperatures, the RNAseq data reveal highly upregulated ERAD components (in addition to other factors). Should the same variants not be subject to degradation? How many variants do not get enriched upon the ER proteostasis boost? The authors could explain their results in some more specific details.

Reduction in variant growth at 39˚C may be caused by enhanced ER-associated degradation (ERAD), as the reviewer suggests, or it may be caused by reduced variant folding rates (and thus slower variant viral growth). Though we do not observe upregulation of ERAD factors (Figure 2 and Figure 2—figure supplement 1E), our data do not speak to the activity of ERAD components at elevated temperature. As the reviewer suggests, it is certainly possible that ERAD components are engaged more at elevated temperature, resulting in the degradation of HA variants without inducing upregulation of ERAD factors.

Also as the reviewer suggests, UPR activation indeed upregulates ERAD components, in addition to ER chaperones and other proteostasis factors (Figure 2 and Figure 2—figure supplement 1A–D). Thus, variants could certainly be either assisted or degraded by induction of the UPR. The number of enriched and depleted variants is now noted in Figure 5A, though we emphasize that these numbers are all relative to how the wild-type HA performs, and thus positive and negative values do not reflect folding and degradation, respectively. We modified our Results section to elaborate on these points:

“This analysis confirmed that increased temperature indeed generally reduced mutant HA viral growth, resulting in depletion (i.e. diffsel < 0) of 597 variants versus enrichment (i.e. diffsel > 0) of only 361 variants, relative to wild-type HA (Figure 5A). […] For example, these perturbations could potentially impact the protein levels and activities of other host proteins that engage HA, or perhaps differentially modulate viral growth kinetics.”

7) Again related to Figure 5A, in the subsection “Deep mutational scanning of HA in modulated ER proteostasis environments”, fifth paragraph, describe the redundancy of the variants having a positive diffsel score between XBP1s and ATF6f/XBP1s conditions. Although the authors comment about the similarity, it would be interesting to see which sites overlap and which do not, possibly shedding light on subtle differences between the two modes of UPR activation.

We agree that this comparison is interesting and have added a correlation plot comparing activation of ATF6f/XBP1s and XBP1s as Figure 5—figure supplement 1. Outliers that are highly enriched or depleted in both conditions are labeled, and the full list of differential selection values for variants in these selection conditions is provided in Figure 5—source data 1. We note that few sites are strongly enriched in one condition and strongly depleted in the other. We also added references to Figure 5—figure supplement 1 and Figure 5—source data 1 to support our comments in the manuscript:

“The enhanced variant growth upon XBP1s induction alone was quite similar to that of simultaneous induction of XBP1s and ATF6f (Figure 5—figure supplement 1 and Figure 5—source data 1), and is thus either predominantly caused by XBP1s, or by XBP1s-regulated factors that are redundant with those regulated by ATF6f.”

8) Figure 7A describes the mutational tolerance in the XBP1s condition alone. What about the ATF6f/XBP1s condition?

Similar to the mutation differential selection distributions presented in Figure 5A, the site differential selection distributions (i.e. mutational tolerance distributions) for ATF6f/XBP1s are essentially identical to that of XBP1s alone, and were thus excluded from the original manuscript in an effort to simplify Figure 7. To address the reviewer’s question, we have now presented this analysis for ATF6f/XBP1s induction in Figure 7—figure supplement 1, and have commented on their similarity in the manuscript:

“We note that, similar to the mutation diffsel distributions (Figure 5A), the site diffsel distributions for ATF6f/XBP1s induction mirrored those for XBP1s induction (Figure 7—figure supplement 1).”

Furthermore, we also mapped the site diffsel values onto the HA crystal structure for ATF6f/XBP1s induction (Figure 8—figure supplement 1):

“Mapping net site diffsel values onto the HA crystal structure revealed that these selection pressures affect the mutational tolerance of sites across HA (Figure 8; Figure 8—figure supplement 1).”

9) In Figure 7B and 7C, the y-axis is unclear – is it a continuum from conserved to variable? Buried to exposed?

We agree that the y-axis in the original manuscript was unclear and have provided more detailed figure legends for Figures 7B–C to address this concern:

“(B) Net site diffsel is plotted as a function of site conservation, where x = 0 corresponds to a completely conserved site among 989 non-redundant human H1 sequences (1918–2008) (38). (C) Net site diffsel is plotted as a function of site surface accessibility, where x = 0 corresponds to a buried residue in the (PDBID 1RVX (32)).”

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    Figure 2—source data 1. Complete RNAseq differential expression analysis.
    DOI: 10.7554/eLife.38795.007
    Figure 2—source data 2. Comparison of HEKATF6f/XBP1s characterization to previous characterization.

    Genes are tabulated as upregulated in the current study (RNAseq), the previous study (Array) alone (Shoulders et al., 2013), or upregulated in both studies, as illustrated in Figure 2—figure supplement 2. For each gene subset, the corresponding GO group enrichment analysis is provided.

    DOI: 10.7554/eLife.38795.008
    Figure 2—source data 3. Fold change of HSR and UPR targets during influenza infection.
    DOI: 10.7554/eLife.38795.009
    Figure 3—source data 1. Correlation coefficients for unfiltered absolute site differential selection values between biological replicate selections.

    N = 565.

    DOI: 10.7554/eLife.38795.011
    Figure 5—source data 1. Differential selection values for each selection, pre- and post-filtering.
    DOI: 10.7554/eLife.38795.020
    Figure 7—source data 1. Properties and differential selection for each site of HA.
    DOI: 10.7554/eLife.38795.025
    Source data 1. Complete analysis of deep mutational scanning data.
    elife-38795-data1.zip (22.8MB, zip)
    DOI: 10.7554/eLife.38795.028
    Supplementary file 1. Primer sequences for qPCR and HA sequencing.
    elife-38795-supp1.docx (16KB, docx)
    DOI: 10.7554/eLife.38795.029
    Transparent reporting form
    DOI: 10.7554/eLife.38795.030

    Data Availability Statement

    FASTQ files for DMS sequencing are available in the Sequence Read Archive under accession number SRP149672. The deep mutational scanning data analysis will be available upon publication at https://github.com/amphilli/HA_DMS_2018 (copy archived at https://github.com/elifesciences-publications/amphilli/HA_DMS_2018), and is also available in Source Data 1. All differential selection values from deep mutational scanning (pre- and post-filtering) are available in Figure 5-source data 1. The complete RNAseq data are available from GEO under accession number GSE115168.

    The following datasets were generated:

    Angela M Phillips, author; Michael B Doud, author; Luna O Gonzalez, author; Vincent L Butty, author; Yu-Shan Lin, author; Jesse D Bloom, author; Matthew D Shoulders, author. ER proteostasis and temperature differentially impact the mutational tolerance of influenza hemagglutinin. 2018 https://www.ncbi.nlm.nih.gov/sra/SRP149672 Publicly available at the NCBI Gene Expression Omnibus (accession no: SRP149672)

    Angela M Phillips, author; Michael B Doud, author; Luna O Gonzalez, author; Vincent L Butty, author; Yu-Shan Lin, author; Jesse D Bloom, author; Matthew D Shoulders, author. ER proteostasis and temperature differentially impact the mutational tolerance of influenza hemagglutinin. 2018 https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE115168 Publicly available at the NCBI Gene Expression Omnibus (accession no: GSE115168)


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES