Skip to main content
PLOS Pathogens logoLink to PLOS Pathogens
. 2020 Jul 2;16(7):e1008671. doi: 10.1371/journal.ppat.1008671

Single cell heterogeneity in influenza A virus gene expression shapes the innate antiviral response to infection

Jiayi Sun 1,#, J Cristobal Vera 1,2,#, Jenny Drnevich 3, Yen Ting Lin 4, Ruian Ke 5, Christopher B Brooke 1,2,*
Editor: Adam S Lauring6
PMCID: PMC7363107  PMID: 32614923

Abstract

Viral infection outcomes are governed by the complex and dynamic interplay between the infecting virus population and the host response. It is increasingly clear that both viral and host cell populations are highly heterogeneous, but little is known about how this heterogeneity influences infection dynamics or viral pathogenicity. To dissect the interactions between influenza A virus (IAV) and host cell heterogeneity, we examined the combined host and viral transcriptomes of thousands of individual cells, each infected with a single IAV virion. We observed complex patterns of viral gene expression and the existence of multiple distinct host transcriptional responses to infection at the single cell level. We show that human H1N1 and H3N2 strains differ significantly in patterns of both viral and host anti-viral gene transcriptional heterogeneity at the single cell level. Our analyses also reveal that semi-infectious particles that fail to express the viral NS can play a dominant role in triggering the innate anti-viral response to infection. Altogether, these data reveal how patterns of viral population heterogeneity can serve as a major determinant of antiviral gene activation.

Author summary

The combination of the enormous diversity of viral populations, underlying heterogeneity within host cells, and random chance mean that no two infected cells look exactly the same. The role that single cell heterogeneity plays in determining infection outcomes remains very poorly understood. Here, we describe an approach that allowed us to quantify both viral and host gene expression in thousands of influenza A virus infected cells. We observed an enormous degree of heterogeneity in both viral and host gene expression between individual infected cells. We found that virions that fail to express the viral NS gene can be major drivers of host antiviral immune activation. Comparison of antiviral gene expression patterns between H1N1 and H3N2 infection revealed surprising differences in single cell patterns of innate immune activation. Altogether, these studies identify patterns of single cell heterogeneity in both host and viral gene expression as a significant driver of infection outcomes.

Introduction

RNA virus populations typically contain an enormous amount of sequence diversity due to an absence of virally encoded proofreading activity [1]. In some cases, this genetic diversity can significantly influence infection outcomes [24]. In addition, influenza A viruses (IAV) also exhibit substantial heterogeneity in the gene expression patterns of individual virions [5]. Most IAV virions are only capable of expressing variable, incomplete subsets of viral genes [6]. The production and gene expression patterns of these semi-infectious particles (SIPs) can vary significantly between IAV strains [7]. Altogether, the high degree of diversity within IAV populations means that patterns of viral gene expression can vary significantly between individual infected cells.

Numerous studies have leveraged recent advances in single cell analysis methods to assess the extent of cellular heterogeneity present during infection by a variety of viruses, including IAV [817]. These studies connect to a larger body of work that has begun to explore single cell heterogeneity within different host cell populations and tissues [1820]. The extent to which IAV population diversity influences patterns of single cell heterogeneity and the ways in which these patterns shape broader infection dynamics and outcomes remain poorly understood.

Here, we query the combined viral and host transcriptomes from thousands of individual cells, each infected by a single virion. This high-resolution dissection of viral and host gene expression patterns reveals that the transcriptional responses of individual infected cells can be highly divergent, resulting from the interplay between underlying cellular heterogeneity and viral population diversity. Thus, the host response to IAV infection consists of a heterogenous assemblage of highly variable single cell responses. This approach also reveals critical differences in the interactions of H1N1 and H3N2 viruses with the host antiviral machinery at the single cell level and implicates heterogeneity in NS segment expression as a major driver of the innate antiviral response to IAV infection. Altogether, these results establish the interplay between viral and host cell heterogeneity as a critical determinant of cellular infection outcomes.

Results

Generation of viral and host transcriptional data from thousands of singly infected cells

To assess the effects of viral population heterogeneity on the host response to infection, we examined the combined viral and host transcriptional profiles from thousands of single infected cells. To focus on the effects of viral heterogeneity, we wanted to remove the variability that could arise from variation in cellular MOI [21]. To ensure that the vast majority of infected cells were each infected with a single virion, we infected A549 cells with either the 2009 H1N1 pandemic strain A/California/07/2009 (Cal07), or the seasonal human H3N2 strain A/Perth/16/2009 (Perth09) at an MOI of 0.01, and blocked secondary spread in the culture through the addition of NH4Cl [22]. This resulted in only a tiny fraction of cells being infected, so to obtain sufficient numbers of infected cells for our analysis, we enriched infected cells by sorting based on surface expression of HA and/or M2 (Fig 1A). This approach had the added benefit of allowing us to sort HA-M2- bystander cells from the same culture to assess the paracrine effects of infection. In parallel, we sorted a group of mock-infected cells as a control.

Fig 1. Generation of viral and host transcriptional data for thousands of singly infected cells.

Fig 1

(A) Schematic depicting our strategy for generating single cell RNAseq libraries from thousands of cells infected at low MOI. In brief, we infect A549 cells with Cal07 or Perth09 at MOI = 0.01 to ensure that infected cells are infected with a single virion. We then block secondary spread with NH4Cl treatment to make sure infection timing is uniform across all infected cells. Finally, we sort “infected” and “bystander” cells based on surface expression of HA and/or M2 and immediately generate single cell RNAseq libraries from these sorted cell populations using the 10X Chromium device. In parallel, mock cells are sorted and used as uninfected controls. (B) tSNE dimensionality reduction plot showing the extent of overlap between 3 indicated cell populations from Perth09 experiment clustered based on transcriptional similarity.

We used the 10X Genomics Chromium platform to generate oligo-dT-primed single cell RNAseq libraries for mock, infected (HA+ and/or M2+), and bystander (HA-M2-) cells and sequenced them on an Illumina Novaseq. We demultiplexed the reads and mapped them to a customized hybrid reference containing both human and influenza sequences/annotation. Following quality control filtering, we had high quality combined viral and host transcriptomes from thousands of single virion-infected cells, as well as uninfected bystander and mock cells. Clustering of these three libraries by transcriptional similarity revealed that infected and uninfected cells largely clustered independently, as would be expected (Fig 1B, S1 Fig). For Cal07 (but not Perth09), mock and bystander cells clustered independently, but it is possible this is due to a batch effects, as the mock and bystander cells came from different culture vessels.

Although we sorted infected cells based on the presence of viral markers, it is possible that some uninfected cells were included within the infected cell libraries due to a combination of cell sorting errors, library index hopping, or cross-contamination with mRNA released from dying cells. To identify the truly-infected cells within the infected libraries, we examined the distributions of percentage of viral counts within each cell (S2 Fig). For cells in infected libraries of both Cal07 and Perth09, we observed clear bimodal distributions in the percentage of total viral counts. We assumed that cells within the lower peak (percentages of viral reads: < 1% for Cal07 and < 3% for Perth09) were actually uninfected. We calculated kernel density estimates on these distributions and used the first local minima to generate cutoff thresholds to differentiate truly-infected cells from uninfected cells (S2A and S2C Fig). Cells from infected libraries that fell below these thresholds clustered with the bystander cells, suggesting that these cells were truly uninfected (S2B and S2D Fig). We used this approach in subsequent analyses to define truly-infected cells; however, it is possible that a small fraction of the cells we call as uninfected by these criterion are also truly infected yet exhibit very low levels of viral gene expression.

Enormous single cell heterogeneity in viral gene expression patterns

We first asked whether we observed the same degree of heterogeneity in viral gene expression between infected cells that has been reported previously [6,10,12]. We calculated the fraction of all transcripts within each cell that were viral in origin (Fig 2A). Similar to prior studies, total viral transcript levels ranged enormously between individual cells: from <1% up to ~90%. Notably, this heterogeneity arose under conditions where both viral input (1 virion/cell) and infection timing were largely equivalent across all cells.

Fig 2. Enormous heterogeneity in viral gene expression patterns.

Fig 2

(A) Distributions of Cal07 and Perth09-infected A549 cells, binned by the fraction of total cellular poly(A) RNA that is viral in origin. (B) Plots show the fraction of total poly(A) RNA per cell that maps to the indicated viral gene segment of Cal07. Each dot represents a single cell, cells with no detectable reads mapping to the indicated segment were arbitrarily assigned a value of 0.001 to show up on the log10 scale. (C) Same figure as (B) for Perth09. (D) R2 correlation values plotted as heat map for all pairwise comparisons of Cal07 viral transcripts within infected cells positive for all viral genes. (E) Same figure as (D) for Perth09. (F) Distribution of normalized Cal07-PB1-derived reads per cell (orange) compared with a Poisson distribution of equal mean (blue line) on a log-log scale. (G) Distributions of top 10 most abundant host transcripts (left panel) and Cal07 viral gene expression (right panel) normalized by the 70th quantile on a log-log scale.

Interestingly, the distributions for Cal07 and Perth09 were quite different, as Perth09-infected cells tended to have a much higher fraction of viral transcript. This could be a reflection of differences in gene expression kinetics, or of differences in the relative capacities of the two viruses to exploit the host cell machinery necessary for viral gene expression. This pattern of extreme cell-to-cell heterogeneity was also observed for individual viral transcripts, again consistent with previous reports (Fig 2B and 2C)[12]. We examined the degree to which individual pairs of viral genes were correlated with each other within infected cells that expressed all viral genes (S3 and S4 Figs). For Cal07, we found that most pairs of viral transcripts were fairly well correlated, with one obvious exception: the NS segment (Fig 2D). Expression levels of the individual viral genes were generally less well correlated during Perth09 infection, compared with Cal07, and expression of both the M and NS segments by Perth09 was especially poorly correlated with the other viral genes (Fig 2E).

We also found that the expression distributions for individual viral genes were highly over-dispersed compared with a Poisson distribution (Fig 2F). The tails of the expression distributions for several viral genes were an order of magnitude longer than those of the top 10 most abundant host transcripts (Fig 2G), emphasizing that viral gene expression is substantially noisier than host gene expression.

Significant heterogeneity in the host transcriptional response to infection

We next asked whether the observed variation in viral gene expression levels was associated with variation in host cell transcription. To test this, we first clustered all infected cells based on their host transcription patterns. We identified multiple distinct transcriptional response groups to infection by either Cal07 or Perth09, demonstrating that there is not one standard transcriptional response to IAV infection and that a single cell type can simultaneously generate multiple distinct responses to the same virus population (Fig 3A and 3D). It should be noted that these cluster definitions are not absolute and that substantial heterogeneity in the normalized expression levels of individual host genes existed within individual clusters.

Fig 3. Significant heterogeneity in the host transcriptional response to infection.

Fig 3

(A) tSNE dimensionality reduction plot showing all Cal07-infected A549 cells clustered based on similarity of host transcription patterns. (B) Same tSNE plot of Cal07-infected cells shown in (A) with each cell colored by the percentage of total cellular poly(A) RNA that is viral in origin. (C) Same tSNE plot of Cal07-infected cells shown in (A) with each cell colored by predicted cell cycle stage, as determined by the Scran package. (D) Same figure as (A) for Perth09. (E) Same figure as (B) for Perth09. (F) Same figure as (C) for Perth09. (G) tSNE dimensionality reduction plot showing mock A549 cells from Cal07 experiment clustered based on similarity of host transcriptional patterns. (H) Principle coordinate axes (PCoA) plot comparing the multivariate dispersions for mock (black) and Cal07-infected (red) A549 cells. The first two axes (PCoA 1 and PCoA 2) in the multivariate homogeneity of group dispersions analysis are used.

We next asked whether the cell clustering patterns were correlated with levels of viral gene expression and/or cell cycle status. For Cal07, we did observe a relationship between overall viral gene expression levels and cluster structure, as the majority of cells with high levels of viral gene expression were concentrated in cluster 5 (Fig 3B). The clustering patterns of Cal07-infected cells also appeared to be partially influenced by cell cycle status, with cells in G2M phase disproportionately falling into cluster 0 (Fig 3C). While it is not surprising that cell cycle status would contribute to transcriptional heterogeneity during infection, these data highlight how little is known about how cell cycle status may influence the cellular response to infection. In contrast, we did not observe any clear relationships between viral gene expression levels, cell cycle status, and clustering pattern for Perth09-infected cells (Fig 3E and 3F).

We also observed the existence of multiple distinct clusters within mock cells (Fig 3G), raising the question of whether the observed heterogeneity in infected cells was simply a reflection of the intrinsic heterogeneity of the cell population prior to infection. In other words, does infection actually increase the overall heterogeneity in host gene expression patterns beyond that seen in mock cells? To quantify and compare heterogeneity in overall host transcription between the two cell populations, we calculated the multivariate homogeneity of dispersions for mock and infected cells using their host gene expression profiles (Fig 3H)[23]. We found that infection significantly increased the overall single cell heterogeneity in host cell transcription patterns (p<10−15 by pairwise t test). Altogether, our data are consistent with a model in which the interaction between viral population heterogeneity and pre-existing host cell heterogeneity gives rise to multiple distinct transcriptional responses at the single cell level.

Substantial heterogeneity in expression patterns of critical determinants of IAV infection outcome, including IFNs and ISGs

To explore the specific transcriptional differences that distinguished different clusters of infected cells, we performed single cell differential gene expression analysis which allowed us to compare the expression of host transcripts between different clusters [24,25]. This approach revealed that the induction of numerous host genes known or likely to be involved in shaping IAV infection outcome varied significantly between clusters (Fig 4A and 4C). Most strikingly, for Perth09 (but not Cal07), expression of type III IFNs and several ISGs were heavily concentrated within a single cluster, cluster 5 (Fig 4C). Beyond obvious factors like the IFNs and ISGS, both viruses, exhibited highly heterogeneous expression patterns of numerous other host genes likely to influence infection outcomes. For example, per-cell expression levels of NEAT1, a long non-coding RNA (lncRNA) involved in inflammasome formation, regulation of cytokine and chemokine expression, and nuclear paraspeckle formation [2631], varied significantly between infected cell clusters for both viruses (Fig 4B and 4D).

Fig 4. Substantial virus strain-specific variation in the expression patterns of critical determinants of IAV infection outcome.

Fig 4

(A) Heat map showing differential expression of the top 10 characteristic host genes for each cluster of Cal07-infected cells (from Fig 3A). Individual cells are each represented by a column, grouped by cluster, with individual rows representing relative expression of the top 10 specific host transcripts most significantly (lowest p values) associated with each cluster. (B) Comparison of normalized per cell counts of NEAT1 between clusters of Cal07-infected cells shown in (A). (C) Same heat map as (A) for each cluster from Perth09-infected cells (From Fig 3D), with the top 10 genes defining cluster 5 highlighted. (D) Comparison of normalized per cell counts of NEAT1 between clusters of Perth09-infected cells shown in (C).

Hundreds of other host genes exhibited similarly heterogeneous patterns of expression between individual infected cells (S1 Table, S2 Table), though it must be pointed out that these measurements can be skewed somewhat by variation in viral RNA levels. Altogether, these data clearly demonstrate how numerous drivers of infection outcomes (not just IFNs) may be primarily expressed by limited subsets of infected cells.

Many infected cells have undetectable levels of one or more viral transcripts

The vast majority of IAV virions fail to express one or more viral genes, resulting in the expression of variable, incomplete subsets of viral gene products under low MOI conditions [57,32]. We asked whether this variation in functional viral gene content within individual cells contributes to the observed heterogeneity in host gene transcription. To assess the presence or absence of each individual viral gene within infected cells while avoiding false positives due to RNA cross-contamination, we used the same approach that we used to determine infection status of cells in infected libraries. Just as with overall viral gene expression, nearly all individual viral genes exhibited clear bimodal patterns in the distributions of viral gene percentage within each cell on a log scale, suggestive of clear separations between positive and negative populations and allowing us to set cutoff thresholds based on the first local minima (S5A and S5B Fig). The only exceptions were PB2 and PA of Cal07. For these two genes that did not exhibit clear bimodal distribution, we set the cutoff thresholds at -1.5 (log10 scale) based on the distributions of polymerase gene expression in Perth09 and uninfected cells within the infected library of Cal07. It must be noted that while these cutoff thresholds represent our best effort to define the expression statuses of individual viral genes, they may not be perfectly accurate.

Using these cutoffs, we found that the fractions of infected cells that failed to express detectable transcripts ranged from ~35% to ~5% for the individual viral gene segments (Fig 5A and 5C). It is highly likely that our method of enriching for infected cells by sorting based on high level HA and/or M2 expression significantly biased these results. Regardless, we still observed substantial numbers of infected cells that fail to express detectable levels of individual viral transcripts, with roughly half of all infected cells failing to express detectable levels of at least one viral gene (Fig 5B and 5D). We were surprised to see so many cells that lacked detectable levels of the polymerase transcripts because we expected that sorting infected cells based on surface protein expression would bias cell collection against cells that could not synthesize new polymerase complexes. One important caveat with our approach is that we assessed transcript levels at 16 hpi and thus could have missed transient early expression of viral genes.

Fig 5. Many infected cells have undetectable levels of one or more viral transcripts.

Fig 5

(A) The percentage of all Cal07-infected cells called as positive for the indicated gene segments (B) All Cal07-infected cells binned by the total number of viral gene segments that were called positive, with the actual numbers of cells in each group detailed above. (C) Same figure as (A) for Perth09. (D) Same figure as (B) for Perth09.

The absence of individual viral genes has a significant effect on the host transcriptional response to infection

We next asked how the expression status of individual viral genes influences the overall host response to infection to both Cal07 and Perth09. We grouped all infected cells into positive and negative populations based on the expression of each individual viral gene segment and compared host transcript expression between the two infected cell populations. For each viral gene, we generated a list of host transcripts that were reported as significantly different between positive and negative cells by both NBID and MAST tests (S3S16 Tables). This approach allowed us to tease out the effects of individual viral gene segments from the more general effects of infection.

We identified hundreds of host genes with expression levels that varied significantly based on the expression status of individual viral genes (S6A and S6B Fig). Closer examination revealed that many of these hits were found for multiple viral genes, suggesting that they may correlate with overall viral gene expression levels or co-expression of multiple viral gene segments. When we focused on the host genes that only exhibited differential expression in association with the expression status of a single viral gene segment, we found that most were associated with the PA and NS segments (Fig 6A and 6D).

Fig 6. Dissection of the effects of individual viral gene expression on the host transcriptional response to infection.

Fig 6

(A) The number of host transcripts for which expression levels significantly differ depending on whether the indicated Cal07 gene segment is present or not, according to both MAST and NBID (host genes that are differentially regulated by the expression status of more than one viral segment are excluded). (B) tSNE plot of all Cal07-infected A549 cells colored based on whether NS segment-derived transcripts are detected (Cyan) or not detected (Salmon). (C) Percentages of NS negative and NS positive Cal07-infected A549 cells that have detectable levels of SLFN5. (D) Same figure as (A) for Perth09. (E) Same figure as (B) for Perth09. (F) Percentages of NS negative and NS positive Perth09-infected A549 cells that have detectable levels of the indicated host transcripts.

NS segment expression status is a major determinant of IFN and ISG induction by Perth09, but not Cal07

We focused on the NS segment, as the NS-encoded NS1 protein plays a well-established, multi-functional role in manipulating the host cell environment and the anti-viral response and could thus serve as a positive control for our approach [3336]. We found that approximately 4% of Cal07-infected cells and 9% of Perth09-infected cells had background or undetectable levels of NS segment transcript, and most of them came from cluster 5 with high viral transcript level which was in consistent with low correlation value between NS and other segments (Fig 6B and 6E). For Cal07, we observed over 300 host genes whose expression was uniquely influenced by NS. Notably, the expression of specific IFNs or ISGs was not significantly affected by NS expression status during Cal07 infection. In contrast, we did observe a significant reduction in SLFN5 expression frequency in infected cells that failed to express NS compared with those that did express NS (Fig 6C). SLFN5 is an interferon-stimulated gene (ISG) shown to negatively regulate STAT1-dependent anti-viral gene transcription [37]. These results both validate the utility of our approach and identify the suppressor of antiviral gene transcription SLFN5 as a novel host target of the NS gene segment during Cal07 infection.

In contrast with Cal07 infection, NS segment expression status for Perth09 was significantly correlated with expression levels of numerous IFNs, ISGs, and other innate immune factors (Fig 6F). For example, the type III IFNs IFNL1 and IFNL2 were expressed by ~20% of Perth09-infected cells lacking detectable NS expression, but both were only observed in ~2% of infected cells that did express NS. We also observed similar NS-dependent expression patterns for several ISGs (e.g. IFIT1, IFIT2, IFIT3, ISG15, ZC3HAV1, and OAS1), as well as the neutrophil-recruiting chemokine CXCL1. Thus, for Perth09 but not Cal07, single cell heterogeneity in NS segment expression status is a major determinant of the activation of the innate antiviral response.

H1N1 and H3N2 strains can differ significantly in single cell patterns of IFN and ISG transcription

Finally, we more broadly compared the activation of host innate anti-viral gene expression at the single cell level between Cal07 and Perth09. In Cal07-infected cells, expression of both type I and III IFNs were extremely rare, and only IFNL1 passed our quality control filter (detected in more than 4 cells) (Fig 7A) [13,38]. For type I IFN (but probably not type III IFN), this could partially be a function of the relatively late timepoint that we examined, as IFNβ expression typically peaks earlier during infection [39]. Similarly, expression of multiple ISGs was also very rare in Cal07-infected cells, indicating a near-complete failure to initiate an innate anti-viral response. In bystander cells, both IFN and ISG expressions were also minimal, suggesting that the inhibition of IFN induction by Cal07 was sufficient to largely prevent paracrine ISG activation (Fig 7B).

Fig 7. H1N1 and H3N2 strains differ significantly in single cell patterns of IFN and ISG transcription.

Fig 7

(A) tSNE plots of all Cal07-infected A549 cells with cells that have detectable level of IFNL1 colored in red. (B) Histograms comparing distributions of per-cell UMI counts in log10 scale for the indicated host transcripts across the three libraries (mock, bystander, and infected) for Cal07 experiment. Cells with zero count excluded to avoid dominating the y-scale. Vertical lines indicate cutoff thresholds of 3 UMI counts, with percentages of all cells (including cells with zero count) in each library above the threshold shown on right. (C) Same figure as (A) for Perth09. (D) Same figure as (B) for Perth09.

In Perth09-infected cells, IFNL1 (type III IFN) transcription was roughly 20-fold more frequent compared with Cal07, indicating a significant difference in the ability of the host to initiate an IFN response to infection by these two strains (Fig 7C). Consistent with this, some (but not all) ISGs were clearly expressed more frequently in bystander and Perth09-infected cells compared with Cal07, suggesting that Perth09 is less able to prevent paracrine activation (Fig 7D). Altogether, these data reveal clear differences in single cell patterns of IFN and ISG transcription between the human H1N1 and H3N2 strains tested.

Discussion

The host response to infection arises from the combined responses of many individual cells, both infected and uninfected. To understand the factors that govern host responses at the tissue and organismal levels, it is critical to define patterns of variation in single cell infection responses. Here, we demonstrate that IAV infection gives rise to a heterogeneous collection of divergent transcriptional responses at the single cell level. Notably, this heterogeneity occurred within a single cell type, under conditions where per-cell viral input and infection timing were normalized. Single cell responses in vivo may be significantly more variable. By comparing patterns of viral gene expression at the single cell level between two distinct strains of human IAV, we demonstrate how viral population heterogeneity can be a major driver of innate immune activation, in a virus strain-specific manner. These data establish a clear role for viral heterogeneity in modulating the host response to infection and highlight the power of single cell approaches to reveal new determinants of viral infection outcomes.

Variation in gene expression patterns between individual infected cells appears to arise from a combination of (1) genetic and genomic variation between individual virions, (2) underlying heterogeneity within host cells, and (3) stochastic variation in infection processes. In line with previous reports of significant genetic and phenotypic heterogeneity within A459 and other transformed cell line populations, we observed substantial transcriptional heterogeneity between uninfected A549 cells that could not be simply explained by cell cycle status [4042]. As a result, virions that enter different cells establish infection within differing cellular environments that may be more or less supportive and primed for differing transcriptional responses to infection. This effect is likely enhanced by the stochastic variation inherent in different viral life cycle stages [43].

Layered on top of this intrinsic cellular heterogeneity and stochasticity is the genetic and genomic heterogeneity characteristic of IAV populations [5]. We and others have previously demonstrated that well-characterized viral modulators of host cell function such as NS1 and PA-X are not ubiquitously expressed during infection [6,12]. Here, we show that this variability in viral gene expression between individual infected cells has clear consequences for the host cell transcriptional response. It is clear that future efforts to understand the role of critical viral proteins such as NS1 in shaping infection outcome will have to account for heterogeneity in single cell expression patterns.

The comparison of Cal07 and Perth09 revealed clear commonalities and differences in single cell expression patterns between the two viruses that may have relevance for understanding differences in the apparent pathogenicity of different human H1N1 and H3N2 viruses. The most obvious difference was seen in the activation of IFN and ISG transcription. In addition, while NS segment expression status significantly influenced the host response to both viruses, only Perth09 showed a clear relationship between NS levels and the activation of the host antiviral machinery. This finding is in line with previous studies that showed that (a) different human IAV strains differ in bulk induction of IFNβ, and (b) the 2009 pandemic H1N1 NS1 protein was less effective at suppressing transcription of human IFNs and ISGs compared with NS1 genes from other human IAV isolates [44,45]. While de novo NS segment transcription appeared to be largely dispensable for blocking IFN and ISG upregulation by Cal07 (at least under the conditions examined in this study), overall expression frequencies for select IFNs and ISGs were actually lower during Cal07 infection compared with Perth09, indicating that Cal07 is more effective at suppressing or evading IFN activation in A549 cells. Yet, it remains an open question how overall viral gene expression level might influence the host innate antiviral response, given the clear differences we observed between the two viruses. Overall, our data make clear that there is still a lot that we do not understand about the interplay between viral gene expression patterns, viral genotype, and the host antiviral response.

In this study, we took steps to eliminate two other sources of viral heterogeneity that are likely common during natural IAV infection. The first is presence of defective interfering particles (DIPs), which we minimized by generating our virus stocks under low MOI conditions, and subsequently verified (S7 Fig). DIPs appear to be common within IAV populations, even in humans, and can have complicated effects on both viral and host gene expression [13,4651]. We would predict that the presence of DIPs would further increase overall heterogeneity between individual cell responses to the virus. It is worth noting that we could not reliably detect the presence of DIP-associated transcripts within our data due to the unavoidably incomplete coverage of the viral gene segments generated by our short-read 3’-enriched sequencing approach. The second likely source of additional heterogeneity that we excluded from this study is variability in the cellular MOI. We and others have demonstrated that cellular co-infection can be common in vivo [7,5254]. This suggests that the number of viral genomes entering individual cells is likely quite variable. We recently showed that this variability in cellular MOI can have distinct phenotypic consequences, both for viral replication dynamics and for IFN induction [21]. Altogether, it appears likely that the heterogeneity that we describe here underrepresents what would be observed in vivo.

Our results extend our previous results and those of other groups in establishing the enormous amount of heterogeneity in viral gene expression that occurs at the single cell level, even under experimental conditions designed to minimize sources of variability [6,10,12,43,55]. Critical phenotypes such as viral load dynamics, transmissibility, and pathogenicity must emerge from the collective output of heterogeneous populations of infected cells. This raises questions of how selection may act upon patterns of viral heterogeneity to alter these emergent phenotypes and the extent to which these heterogeneity patterns are under viral genetic control. These questions are especially pertinent for segmented viruses like IAV but are relevant across diverse virus families.

Altogether, our results help establish the importance of considering the roles of viral and host cell heterogeneity in influencing the pathogenesis of viral infections. Similar to the way that viral populations are now viewed, our data clearly establishes that the host response to infection should be seen as a heterogeneous assemblage of single cell responses that collectively give rise to the bulk phenotypes that are generally measured. This creates the potential for complex interactions between responding cell subsets and raises the question of how such a heterogeneous system is effectively regulated. It also raises the possibility that viral and host response dynamics may be disproportionately driven by small subsets of cells that are obscured during bulk analysis. Dissection of these diverse constituents is likely to reveal new mechanisms that govern the pathogenesis of influenza virus infection.

Methods

Plasmids

The A/California/04/09 and A/Perth/16/2009 reverse genetics plasmids were generous gifts from Drs. Jonathan Yewdell and Seema Lakdawala, respectively. Plasmids encoding A/California/07/09 were generated by introducing A660G and A335G substitutions into HA and NP respectively, to match the amino acid sequences of A/California/07/09 HA and NP (NCBI accession numbers CY121680 and CY121683).

Cells

Madin-Darby canine kidney (MDCK) and human embryonic kidney HEK293T (293T) cells were maintained in Gibco’s minimal essential medium with GlutaMax (Life Technologies). Human lung epithelial A549 cells were maintained in Gibco’s F-12 medium (Life Technologies). MDCK and A549 cells were obtained from Dr. Jonathan Yewdell; 293T cells were obtained from Dr. Joanna Shisler. All media were supplemented with 8.3% fetal bovine serum (Seradigm). Cells were grown at 37°C and 5% CO2.

Viruses

Recombinant A/California/07/09 (Cal07) and A/Perth/16/2009 (Perth09) viruses were rescued via the 8-plasmid reverse genetics approach. For the rescue of both viruses, sub-confluent 293T cells were co-transfected with 500 ng of the following plasmids: pDZ::PB2, pDZ::PB1, pDZ::PA, pDZ::HA, pDZ::NP, pDZ::NA, pDZ::M, and pDZ::NS, using JetPrime (Polyplus) according to the manufacturer’s instructions. Plaque isolates derived from rescue supernatants were amplified into seed stocks in MDCK cells. Working stocks were generated by infecting MDCK cells at an MOI of 0.0001 TCID50/cell with seed stock and collecting and clarifying supernatants at 48 hpi. All viral growth was carried out in MEM with 1 μg/ml trypsin treated with l-(tosylamido-2-phenyl) ethyl chloromethyl ketone (TPCK-treated trypsin; Worthington), 1 mM HEPES, and 100 μg/ml gentamicin. The titers of the virus stocks were determined via standard 50% tissue culture infectious dose (TCID50) assay.

Viral infection and cell sorting for single cell RNAseq

Confluent A549 cell monolayers in 3 T-25 flasks were infected with Cal07 (or Perth09) at MOI of 0.01 TCID50/cell for 1 h. At 1 hpi, monolayers were washed with phosphate-buffered saline (PBS) and incubated in serum-containing F-12 medium. At 3 hpi, the medium was changed to MEM with 50 mM HEPES and 20 mM NH4Cl to block viral spread. At 16 hpi, monolayers were trypsinized and combined into single-cell suspension and washed with PBS. Cal07-infected cells were stained with Alexa Fluor 488-conjugated mouse anti-HA monoclonal antibody (mAb) EM4-CO4 (gift from Dr. Patrick Wilson) and Alexa Fluor 647-conjugated mouse anti-M2 mAb O19 (gift from Dr. Jonathan Yewdell). Perth09-infected cells were first stained with human anti-HA stem antibody FI6 (Gift from Dr. Adrian McDermott) and then stained with Alexa Fluor 488-conjugated donkey anti-human IgG (Jackson ImmunoResearch). After staining, cells were washed with PBS twice, and single live cells were sorted as “infected” or “bystander” populations based on the expression of HA and M2 on a BD FACS ARIA II sorter. Importantly, uninfected A549 cells from a separate flask were also trypsinized, stained, and sorted as “mock” population which served as a negative control.

Single cell RNAseq cDNA library generation

Sorted cell samples were counted and checked for viability on a BD20 cell counter (BIO-RAD) before they were diluted to equivalent concentrations with an intended capture of 4000 cells/sample. Each individual sample was used to generate individually barcoded cDNA libraries using the 10x Chromium Single Cell 3’ platform (Pleasanton, CA) following the manufacturer’s protocol. The Chromium instrument separates single cells into Gel Bead Emulsions (GEMs) that facilitate the addition of cell-specific barcodes to all cDNAs generated during oligo-dT-primed reverse transcription. The experiment with Cal07 used V2 reagent and the experiment with Perth09 used V3 reagent (all steps followed the manufacturer’s protocol).

Illumina Library preparation and sequencing

Following ds-cDNA synthesis, individually barcoded libraries compatible with the Illumina chemistry were constructed. The libraries were sequenced on an Illumina NovaSeq 6000 using S4 flowcell for the experiment with Cal07 and S3 flowcell for the experiment with Perth09. Raw data can be found on the NCBI Gene Expression Omnibus under the GEO accession number GSE143167.

Single cell RNAseq analysis

The three Cal07-associated 10x Chromium Single Cell v2 libraries (Infected, Bystander, and Mock) were demultiplexed and reference mapped using Cell Ranger Count (version 2.2) for alignment to a combined human+virus reference (human: hg38, version 1.2.0; Cal07: Genbank Accessions: CY121680-CY121687), and then combined using Cell Ranger Aggr. The three Perth09-associated 10x Chromium Single Cell v3 libraries (Infected, Bystander, and Mock) were processed and combined using Cell Ranger (version 3.1) for alignment to a combined human+virus reference (human: hg38, version 1.2.0; Perth09: Genbank Accessions: KJ609203.1- KJ609210.1). The resulting raw count matrix for each virus set was imported into an R pipeline using SimpleSingleCell [56], where it was filtered for empty droplets [57]. Cell cycle status was determined next by running the Cyclone tool in the scran R package [56]. Additional filtering was then performed on low feature cells (i.e. droplets removed if < 400 features/cell), low expressing features (i.e. features removed if < 4 cells/feature), and potential doublets (droplets removed if cell UMI counts > 2-fold of the median raw count number of host genes in cells of 3 libraries combined).

Overall cellular infection status (infected/uninfected) and individual virus gene presence/absence was determined by examining the distributions of percentage of viral counts within each cell on the log10 scale to magnify the differences at the low end. Thresholds for calling cells “not infected” overall and for individual viral gene presence/absence were set by calculating kernel density estimates on the distributions and finding the first local minima. For genes without clear bi-modal distributions (PB2 and PA of Cal07) the threshold was set to -1.5 (log10 scale), which was a consistent minimum in other low-expression viral genes and extremely close to the maximum value of PB2 and PA in uninfected cells within the Cal07 infected library. The filtered and annotated matrix was then imported into a Seurat pipeline for additional analysis and visualization [58], including normalization using the SCTransform method (all 3 libraries together for the combined tSNE but then each library separately for individual tSNE) [59], differential gene expression analysis using the MAST [25] and NBID tools [24], graph-based clustering of the cells [60], and PCA/tSNE dimensional reduction visualization.

Differential Gene Expression Analysis (DGE) for the Seurat clusters were performed by first sub-setting for ‘Infected’ status cells and then testing for each cluster versus all other clusters. The DGE gene list for each cluster consisted of genes with FDR < 0.01 in MAST test results generated from 10x Genomics Cell Ranger raw count matrix output. The DGE for missing individual virus genes were performed by first sub-setting the count matrices to contain only ‘Infected’ status cells and then using the individual virus gene status factors to test for cells with ‘Present’ versus ‘Absent’. The DGE gene list for each viral segment was produced by intersecting genes with FDR < 0.01 in both MAST and NBID test results to minimize false positives.

All code used for single cell analysis, along with associated documentation, is available from: https://github.com/BROOKELAB/SingleCell

Multivariate homogeneity of groups dispersions analysis

In the analyses, each host gene represents a variable/coordinate and thus each cell can be seen as a point in the multivariate space. Multivariate homogeneity of groups dispersion analysis [23] was performed on log transformed host gene expressions of the two group of cells, i.e. mock cells and infected cells, using the betadisper function in the R package (http://www.R-project.org/) vegan [61]. Distances between the points (i.e. cells) and their respective group centroid in the principal coordinates were then used to test homogeneity of variances and calculate the p-value. Note that similar analyses were performed on linear host gene expression values and results remain the same, i.e. overall host gene expressions are significantly more heterogenous than those in mock cells.

DVG detection in viral stocks

Viral RNA was extracted from 140 ul of each viral stock tested using the QIAamp viral RNA kit (Qiagen) and eluted in 60 ul distilled water. For cDNA reactions, 3 ul of RNA was mixed with 1 ul (2 uM) MBTUni-12 primer (5-ACGCGTGATCAGCAAAAGCAGG-3), 1 ul (10 mM) dNTPs, and 8 ul distilled water. The mixture was incubated for 5 min at 65°C and then placed on ice for 2 min. Subsequently, the mixture was removed from ice and the following were added: 1 ul SuperScript III (SSIII) RT (Invitrogen), 4 ul first-strand buffer, 1 ul DTT, and 1 ul RNase-in (Invitrogen). The reaction mixture was incubated at 45°C for 50 min, followed by a 15 min incubation at 70°C for inactivation. cDNA product (5 ul) was mixed with the following for PCR amplification: 2.5 ul (10 uM) MBTUni-12 primer, 2.5 ul (10 uM) MBTUni-13 primer (5-ACGCGTGATCAGTAGAAACAAGG-3), 0.5 ul Phusion polymerase (NEB), 10 ul high-fidelity buffer, 1 ul (10 mM) dNTPs, and 28.5 ul distilled water. The PCR protocol used was 98°C (30 s) followed by 25 cycles of 98°C (10 s), 57°C (30 s), 72°C (90 s), and a terminal extension of 72°C (5 min). PCR products were run on 1% agarose gel and visualized on a BIO-RAD Gel Doc Universal Hood II Molecular Imager.

Supporting information

S1 Fig. Viral and host transcriptional similarity in cells from Cal07 infection.

tSNE dimensionality reduction plot showing the extent of overlap between 3 indicated cell populations from Cal07 experiment clustered based on transcriptional similarity.

(TIFF)

S2 Fig. Determination of cutoff thresholds used to determine infection status.

(A) Histograms comparing distributions of total viral mRNA percentages in log10 scale across the three libraries (mock, bystander, and infected) for Cal07 experiment. Vertical dash line indicates cutoff threshold determined by calculating kernel density estimates on the distribution of infected library and finding the first local minima. (B) Cells from each library clustered as in S1 Fig are shown separately, with uninfected cells (below threshold) colored in cyan and infected cells (above threshold) colored in salmon. (C) Same figure as (A) for Perth09. (D) Same figure as (B) for Perth09.

(TIFF)

S3 Fig. All pairwise Cal07 viral gene correlation plots.

Normalized per cell copy numbers for the indicated Cal07 genes plotted against each other. Data only show infected cells that are positive for all viral gene segments.

(TIFF)

S4 Fig. All pairwise Perth09 viral gene correlation plots.

Normalized per cell copy numbers for the indicated Perth09 genes plotted against each other. Data only show infected cells that are positive for all viral gene segments.

(TIFF)

S5 Fig. Determination of cutoff thresholds used to determine or presence/absence of individual viral gene segments.

(A) Histograms show the percentages of mRNA molecules derived from each Cal07 gene segment in log10 scale. Vertical dash lines indicate cutoff thresholds determined by calculating kernel density estimates on the distributions and finding the first local minima. For PB2 and PA that do not have clear bi-modal distributions, the threshold was set to -1.5 (log10 scale), which was a consistent minimum in other low-expression viral genes and extremely close to the maximum value of PB2 and PA in uninfected cells within the infected library. (B) Same figure as (A) for Perth09.

(TIFF)

S6 Fig. Quantification of all differentially expressed host genes correlated with presence/absence of individual viral gene segments.

(A) The number of host transcripts for which expression levels significantly differ depending on whether the indicated Cal07 gene segment is present or not, according to both MAST and NBID (host genes that are differentially regulated by the expression status of more than one viral segment are included). (B) Same figure as (A) for Perth09.

(TIFF)

S7 Fig. Comparison on DVGs content between validated viral stocks with low DVGs and viral stocks used in scRNAseq experiments.

PCR products following 8-segment whole-genome amplification from viral cDNAs of the pre-verified viral stocks (Cal07 LD and Perth09 LD have been shown to have minimum DVGs using NGS sequencing in articles published by the lab) and viral stocks used in scRNAseq experiments (Cal07 SC and Perth09 SC) are visualized on 1% agarose gel.

(TIFF)

S1 Table. All host genes differentially expressed between Seurat clusters of Cal07 infected cells.

(TXT)

S2 Table. All host genes differentially expressed between Seurat clusters of Perth09 infected cells.

(TXT)

S3 Table. Combined DGE list based on Cal07 PB2 expression status within infected cells.

(TXT)

S4 Table. Combined DGE list based on Cal07 PB1 expression status within infected cells.

(TXT)

S5 Table. Combined DGE list based on Cal07 PA expression status within infected cells.

(TXT)

S6 Table. Combined DGE list based on Cal07 NP expression status within infected cells.

(TXT)

S7 Table. Combined DGE list based on Cal07 NA expression status within infected cells.

(TXT)

S8 Table. Combined DGE list based on Cal07 M expression status within infected cells.

(TXT)

S9 Table. Combined DGE list based on Cal07 NS expression status within infected cells.

(TXT)

S10 Table. Combined DGE list based on Perth09 PB2 expression status within infected cells.

(TXT)

S11 Table. Combined DGE list based on Perth09 PB1 expression status within infected cells.

(TXT)

S12 Table. Combined DGE list based on Perth09 PA expression status within infected cells.

(TXT)

S13 Table. Combined DGE list based on Perth09 HA expression status within infected cells.

(TXT)

S14 Table. Combined DGE list based on Perth09 NP expression status within infected cells.

(TXT)

S15 Table. Combined DGE list based on Perth09 NA expression status within infected cells.

(TXT)

S16 Table. Combined DGE list based on Perth09 NS expression status within infected cells.

(TXT)

Acknowledgments

We are grateful to Dr. Alvaro Hernandez and Ms. Chris Wright of the DNA Services Lab within the Roy J. Carver Biotechnology Center for expert assistance in experimental planning and the preparation and sequencing of single cell RNAseq libraries. We also would like to thank Dr. Chris Fields of HPCBio, also within the Roy J. Carver Biotechnology Center, for assistance in troubleshooting analysis.

Data Availability

All next generation sequencing data is available from the NCBI GEO via accession number GSE143167. All other relevant data are within the manuscript and its Supporting Information files.

Funding Statement

This work was partially supported by a National Institute of Allergy and Infectious Disease grant 1R01AI139246-01A1 (C.B.B.), the Defense Advanced Research Projects Agency INTERCEPT program through contracts R-00676-19-0 (R.K.) and W911NF-17-2-0034 (C.B.B.), and a gift from the Roy J. Carver Charitable Trust 17-4905 (C.B.B.). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Andino R, Domingo E. Viral quasispecies. Virology. 2015;479–480: 46–51. 10.1016/j.virol.2015.03.022 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Bordería AV, Isakov O, Moratorio G, Henningsson R, Agüera-González S, Organtini L, et al. Group Selection and Contribution of Minority Variants during Virus Adaptation Determines Virus Fitness and Phenotype. PLoS Pathog. 2015;11: e1004838 10.1371/journal.ppat.1004838 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Domingo-Calap P, Segredo-Otero E, Durán-Moreno M, Sanjuán R. Social evolution of innate immunity evasion in a virus. Nat Microbiol. 2019;4: 1006–1013. 10.1038/s41564-019-0379-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Xiao Y, Dolan PT, Goldstein EF, Li M, Farkov M, Brodsky L, et al. Poliovirus intrahost evolution is required to overcome tissue-specific innate immune responses. Nat Commun. 2017;8: 375 10.1038/s41467-017-00354-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Brooke CB. Population Diversity and Collective Interactions during Influenza Virus Infection. J Virol. 2017;91 10.1128/JVI.01164-17 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Brooke CB, Ince WL, Wrammert J, Ahmed R, Wilson PC, Bennink JR, et al. Most influenza a virions fail to express at least one essential viral protein. J Virol. 2013;87: 3155–3162. 10.1128/JVI.02284-12 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Brooke CB, Ince WL, Wei J, Bennink JR, Yewdell JW. Influenza A virus nucleoprotein selectively decreases neuraminidase gene-segment packaging while enhancing viral fitness and transmissibility. Proc Natl Acad Sci U S A. 2014;111: 16854–16859. 10.1073/pnas.1415396111 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Drayman N, Patel P, Vistain L, Tay S. HSV-1 single cell analysis reveals anti-viral and developmental programs activation in distinct sub-populations. eLife. 2019;8 10.7554/eLife.46339 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Guo F, Li S, Caglar MU, Mao Z, Liu W, Woodman A, et al. Single-Cell Virology: On-Chip Investigation of Viral Infection Dynamics. Cell Rep. 2017;21: 1692–1704. 10.1016/j.celrep.2017.10.051 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Heldt FS, Kupke SY, Dorl S, Reichl U, Frensing T. Single-cell analysis and stochastic modelling unveil large cell-to-cell variability in influenza A virus infection. Nat Commun. 2015;6: 8938 10.1038/ncomms9938 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Ramos I, Smith G, Ruf-Zamojski F, Martinez-Romero C, Fribourg M, Carbajal EA, et al. Innate immune response to influenza virus at single-cell resolution in human epithelial cells revealed paracrine induction of interferon lambda 1. J Virol. 2019. 10.1128/JVI.00559-19 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Russell AB, Trapnell C, Bloom JD. Extreme heterogeneity of influenza virus infection in single cells. eLife. 2018;7 10.7554/eLife.32303 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Russell AB, Elshina E, Kowalsky JR, Te Velthuis AJW, Bloom JD. Single-cell virus sequencing of influenza infections that trigger innate immunity. J Virol. 2019. 10.1128/JVI.00500-19 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Schulte MB, Andino R. Single-cell analysis uncovers extensive biological noise in poliovirus replication. J Virol. 2014;88: 6205–6212. 10.1128/JVI.03539-13 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Steuerman Y, Cohen M, Peshes-Yaloz N, Valadarsky L, Cohn O, David E, et al. Dissection of Influenza Infection In Vivo by Single-Cell RNA Sequencing. Cell Syst. 2018;6: 679–691.e4. 10.1016/j.cels.2018.05.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Zanini F, Pu S-Y, Bekerman E, Einav S, Quake SR. Single-cell transcriptional dynamics of flavivirus infection. eLife. 2018;7 10.7554/eLife.32942 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Wang C, Forst CV, Chou T, Geber A, Wang M, Hamou W, et al. Cell-to-Cell Variation in Defective Virus Expression and Effects on Host Responses during Influenza Virus Infection. Denison MR, editor. mBio. 2020;11: e02880–19, /mbio/11/1/mBio.02880-19.atom. 10.1128/mBio.02880-19 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Lawson DA, Kessenbrock K, Davis RT, Pervolarakis N, Werb Z. Tumour heterogeneity and metastasis at single-cell resolution. Nat Cell Biol. 2018;20: 1349–1360. 10.1038/s41556-018-0236-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Papalexi E, Satija R. Single-cell RNA sequencing to explore immune cell heterogeneity. Nat Rev Immunol. 2018;18: 35–45. 10.1038/nri.2017.76 [DOI] [PubMed] [Google Scholar]
  • 20.Stuart T, Satija R. Integrative single-cell analysis. Nat Rev Genet. 2019;20: 257–272. 10.1038/s41576-019-0093-7 [DOI] [PubMed] [Google Scholar]
  • 21.Martin BE, Harris JD, Sun J, Koelle K, Brooke CB. Cellular co-infection increases the speed and efficiency of influenza A virus production and shapes the interferon response to infection. bioRxiv. 2019. [cited 3 Sep 2019]. 10.1101/752329 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Martin K, Helenius A. Nuclear transport of influenza virus ribonucleoproteins: the viral matrix protein (M1) promotes export and inhibits import. Cell. 1991;67: 117–130. 10.1016/0092-8674(91)90576-k [DOI] [PubMed] [Google Scholar]
  • 23.Anderson MJ. Distance-based tests for homogeneity of multivariate dispersions. Biometrics. 2006;62: 245–253. 10.1111/j.1541-0420.2005.00440.x [DOI] [PubMed] [Google Scholar]
  • 24.Chen W, Li Y, Easton J, Finkelstein D, Wu G, Chen X. UMI-count modeling and differential expression analysis for single-cell RNA sequencing. Genome Biol. 2018;19: 70 10.1186/s13059-018-1438-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Finak G, McDavid A, Yajima M, Deng J, Gersuk V, Shalek AK, et al. MAST: a flexible statistical framework for assessing transcriptional changes and characterizing heterogeneity in single-cell RNA sequencing data. Genome Biol. 2015;16 10.1186/s13059-015-0844-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Beyleveld G, Chin DJ, Moreno Del Olmo E, Carter J, Najera I, Cillóniz C, et al. Nucleolar Relocalization of RBM14 by Influenza A Virus NS1 Protein. Duprex WP, editor. mSphere. 2018;3 10.1128/mSphereDirect.00549-18 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Imamura K, Imamachi N, Akizuki G, Kumakura M, Kawaguchi A, Nagata K, et al. Long Noncoding RNA NEAT1-Dependent SFPQ Relocation from Promoter Region to Paraspeckle Mediates IL8 Expression upon Immune Stimuli. Mol Cell. 2014;53: 393–406. 10.1016/j.molcel.2014.01.009 [DOI] [PubMed] [Google Scholar]
  • 28.Landeras-Bueno S, Ortín J. Regulation of influenza virus infection by long non-coding RNAs. Virus Res. 2016;212: 78–84. 10.1016/j.virusres.2015.08.008 [DOI] [PubMed] [Google Scholar]
  • 29.Sasaki YTF, Ideue T, Sano M, Mituyama T, Hirose T. MENε/β noncoding RNAs are essential for structural integrity of nuclear paraspeckles. Proc Natl Acad Sci. 2009;106: 2525–2530. 10.1073/pnas.0807899106 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Zhang F, Wu L, Qian J, Qu B, Xia S, La T, et al. Identification of the long noncoding RNA NEAT1 as a novel inflammatory regulator acting through MAPK pathway in human lupus. J Autoimmun. 2016;75: 96–104. 10.1016/j.jaut.2016.07.012 [DOI] [PubMed] [Google Scholar]
  • 31.Zhang P, Cao L, Zhou R, Yang X, Wu M. The lncRNA Neat1 promotes activation of inflammasomes in macrophages. Nat Commun. 2019;10 10.1038/s41467-019-09482-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Diefenbacher M, Sun J, Brooke CB. The parts are greater than the whole: the role of semi-infectious particles in influenza A virus biology. Curr Opin Virol. 2018;33: 42–46. 10.1016/j.coviro.2018.07.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Ayllon J, García-Sastre A. The NS1 protein: a multitasking virulence factor. Curr Top Microbiol Immunol. 2015;386: 73–107. 10.1007/82_2014_400 [DOI] [PubMed] [Google Scholar]
  • 34.Hale BG, Randall RE, Ortín J, Jackson D. The multifunctional NS1 protein of influenza A viruses. J Gen Virol. 2008;89: 2359–2376. 10.1099/vir.0.2008/004606-0 [DOI] [PubMed] [Google Scholar]
  • 35.Krug RM. Functions of the influenza A virus NS1 protein in antiviral defense. Curr Opin Virol. 2015;12: 1–6. 10.1016/j.coviro.2015.01.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Marc D. Influenza virus non-structural protein NS1: interferon antagonism and beyond. J Gen Virol. 2014;95: 2594–2611. 10.1099/vir.0.069542-0 [DOI] [PubMed] [Google Scholar]
  • 37.Arslan AD, Sassano A, Saleiro D, Lisowski P, Kosciuczuk EM, Fischietti M, et al. Human SLFN5 is a transcriptional co-repressor of STAT1-mediated interferon responses and promotes the malignant phenotype in glioblastoma. Oncogene. 2017;36: 6006–6019. 10.1038/onc.2017.205 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Killip MJ, Jackson D, Pérez-Cidoncha M, Fodor E, Randall RE. Single-cell studies of IFN-β promoter activation by wild-type and NS1-defective influenza A viruses. J Gen Virol. 2017;98: 357–363. 10.1099/jgv.0.000687 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Osterlund P, Strengell M, Sarin LP, Poranen MM, Fagerlund R, Melen K, et al. Incoming Influenza A Virus Evades Early Host Recognition, while Influenza B Virus Induces Interferon Expression Directly upon Entry. J Virol. 2012;86: 11183–11193. 10.1128/JVI.01050-12 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Ben-David U, Siranosian B, Ha G, Tang H, Oren Y, Hinohara K, et al. Genetic and transcriptional evolution alters cancer cell line drug response. Nature. 2018;560: 325–330. 10.1038/s41586-018-0409-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Croce MV, Colussi AG, Price MR, Segal-Eiras A. Identification and characterization of different subpopulations in a human lung adenocarcinoma cell line (A549). Pathol Oncol Res POR. 1999;5: 197–204. 10.1053/paor.1999.0212 [DOI] [PubMed] [Google Scholar]
  • 42.Watanabe N, Dickinson DA, Krzywanski DM, Iles KE, Zhang H, Venglarik CJ, et al. A549 subclones demonstrate heterogeneity in toxicological sensitivity and antioxidant profile. Am J Physiol Lung Cell Mol Physiol. 2002;283: L726–736. 10.1152/ajplung.00025.2002 [DOI] [PubMed] [Google Scholar]
  • 43.Dou D, Hernández-Neuta I, Wang H, Östbye H, Qian X, Thiele S, et al. Analysis of IAV Replication and Co-infection Dynamics by a Versatile RNA Viral Genome Labeling Method. Cell Rep. 2017;20: 251–263. 10.1016/j.celrep.2017.06.021 [DOI] [PubMed] [Google Scholar]
  • 44.Hale BG, Steel J, Medina RA, Manicassamy B, Ye J, Hickman D, et al. Inefficient control of host gene expression by the 2009 pandemic H1N1 influenza A virus NS1 protein. J Virol. 2010;84: 6909–6922. 10.1128/JVI.00081-10 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Hayman A, Comely S, Lackenby A, Murphy S, McCauley J, Goodbourn S, et al. Variation in the ability of human influenza A viruses to induce and inhibit the IFN-β pathway. Virology. 2006;347: 52–64. 10.1016/j.virol.2005.11.024 [DOI] [PubMed] [Google Scholar]
  • 46.Alnaji FG, Holmes JR, Rendon G, Vera JC, Fields CJ, Martin BE, et al. Sequencing Framework for the Sensitive Detection and Precise Mapping of Defective Interfering Particle-Associated Deletions across Influenza A and B Viruses. Schultz-Cherry S, editor. J Virol. 2019;93 10.1128/JVI.00354-19 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Martin MA, Kaul D, Tan GS, Woods CW, Koelle K. The Dynamics of Influenza A H3N2 Defective Viral Genomes from a Human Challenge Study. bioRxiv. 2019. [cited 29 Oct 2019]. 10.1101/814673 [DOI] [Google Scholar]
  • 48.Nayak DP, Chambers TM, Akkina RK. Defective-interfering (DI) RNAs of influenza viruses: origin, structure, expression, and interference. Curr Top Microbiol Immunol. 1985;114: 103–151. 10.1007/978-3-642-70227-3_3 [DOI] [PubMed] [Google Scholar]
  • 49.Saira K, Lin X, DePasse JV, Halpin R, Twaddle A, Stockwell T, et al. Sequence analysis of in vivo defective interfering-like RNA of influenza A H1N1 pandemic virus. J Virol. 2013;87: 8064–8074. 10.1128/JVI.00240-13 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Vasilijevic J, Zamarreño N, Oliveros JC, Rodriguez-Frandsen A, Gómez G, Rodriguez G, et al. Reduced accumulation of defective viral genomes contributes to severe outcome in influenza virus infected patients. Thomas PG, editor. PLOS Pathog. 2017;13: e1006650 10.1371/journal.ppat.1006650 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Vignuzzi M, López CB. Defective viral genomes are key drivers of the virus-host interaction. Nat Microbiol. 2019. 10.1038/s41564-019-0465-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Fukuyama S, Katsura H, Zhao D, Ozawa M, Ando T, Shoemaker JE, et al. Multi-spectral fluorescent reporter influenza viruses (Color-flu) as powerful tools for in vivo studies. Nat Commun. 2015;6: 6600 10.1038/ncomms7600 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Jacobs NT, Onuoha NO, Antia A, Steel J, Antia R, Lowen AC. Incomplete influenza A virus genomes occur frequently but are readily complemented during localized viral spread. Nat Commun. 2019;10: 3526 10.1038/s41467-019-11428-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Marshall N, Priyamvada L, Ende Z, Steel J, Lowen AC. Influenza Virus Reassortment Occurs with High Frequency in the Absence of Segment Mismatch. Basler CF, editor. PLoS Pathog. 2013;9: e1003421 10.1371/journal.ppat.1003421 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Sjaastad LE, Fay EJ, Fiege JK, Macchietto MG, Stone IA, Markman MW, et al. Distinct antiviral signatures revealed by the magnitude and round of influenza virus replication in vivo. Proc Natl Acad Sci. 2018;115: 9610–9615. 10.1073/pnas.1807516115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Lun ATL, McCarthy DJ, Marioni JC. A step-by-step workflow for low-level analysis of single-cell RNA-seq data with Bioconductor. F1000Research. 2016;5: 2122 10.12688/f1000research.9501.2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Lun ATL, Riesenfeld S, Andrews T, Dao TP, Gomes T, participants in the 1st Human Cell Atlas Jamboree, et al. EmptyDrops: distinguishing cells from empty droplets in droplet-based single-cell RNA sequencing data. Genome Biol. 2019;20: 63 10.1186/s13059-019-1662-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Stuart T, Butler A, Hoffman P, Hafemeister C, Papalexi E, Mauck WM, et al. Comprehensive Integration of Single-Cell Data. Cell. 2019;177: 1888–1902.e21. 10.1016/j.cell.2019.05.031 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Hafemeister C, Satija R. Normalization and variance stabilization of single-cell RNA-seq data using regularized negative binomial regression. Genome Biol. 2019;20: 296 10.1186/s13059-019-1874-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Macosko EZ, Basu A, Satija R, Nemesh J, Shekhar K, Goldman M, et al. Highly Parallel Genome-wide Expression Profiling of Individual Cells Using Nanoliter Droplets. Cell. 2015;161: 1202–1214. 10.1016/j.cell.2015.05.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Oksanen J, Blanchet FG, Friendly M, Kindt R, Legendre P, McGlinn D, et al. vegan: Community Ecology Package. Available: https://github.com/vegandevs/vegan

Decision Letter 0

Andrew Pekosz, Adam S Lauring

21 Jan 2020

Dear Dr Brooke,

Thank you very much for submitting your manuscript "A common pattern of influenza A virus gene expression heterogeneity governs the innate antiviral response to infection" for consideration at PLOS Pathogens. As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. In light of the reviews (below this email), we would like to invite the resubmission of a significantly-revised version that takes into account the reviewers' comments.

Thank you for submitting to PLoS Pathogens. I am sorry that this submission took so long.

All three reviewers very much appreciated the significant amount of data and work that went into this manuscript. They appreciated the ways in which this goes beyond other single cell studies of influenza virus.

All three reviewers raised points regarding the data related to NS, and suggested additional experiments that would solidify these findings. Given these concerns were common to all three, some sort of validation would be necessary here prior to publication. Several reasonable suggestions were made.

Reviewer 1 also had concerns about batch effects. This can be addressed by additional experimentation if feasible, or by caveat if not.

While some of the shortcomings mentioned by reviewer 3 regarding additional time points, cell lines, and strains would provide additional information, they would most likely not have a major impact on the conclusions reached from the current data. These are important points, however, and should be clearly discussed as limitations of the study in the discussion section.

We cannot make any decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript is also likely to be sent to reviewers for further evaluation.

When you are ready to resubmit, please upload the following:

[1] A letter containing a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).

Important additional instructions are given below your reviewer comments.

Please prepare and submit your revised manuscript within 60 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. Please note that revised manuscripts received after the 60-day due date may require evaluation and peer review similar to newly submitted manuscripts.

Thank you again for your submission. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Adam S. Lauring

Associate Editor

PLOS Pathogens

Andrew Pekosz

Section Editor

PLOS Pathogens

Kasturi Haldar

Editor-in-Chief

PLOS Pathogens

orcid.org/0000-0001-5065-158X

Michael Malim

Editor-in-Chief

PLOS Pathogens

orcid.org/0000-0002-7699-2064

***********************

Thank you for submitting to PLoS Pathogens. I am sorry that this submission took so long.

All three reviewers very much appreciated the significant amount of data and work that went into this manuscript. They appreciated the ways in which this goes beyond other single cell studies of influenza virus.

All three reviewers raised points regarding the data related to NS, and suggested additional experiments that would solidify these findings. Given these concerns were common to all three, some sort of validation would be necessary here prior to publication. Several reasonable suggestions were made.

Reviewer 1 also had concerns about batch effects. This can be addressed by additional experimentation if feasible, or by caveat if not.

While some of the shortcomings mentioned by reviewer 3 regarding additional time points, cell lines, and strains would provide additional information, they would most likely not have a major impact on the conclusions reached from the current data. These are important points, however, and should be clearly discussed as limitations of the study in the discussion section.

Reviewer's Responses to Questions

Part I - Summary

Please use this section to discuss strengths/weaknesses of study, novelty/significance, general execution and scholarship.

Reviewer #1: This manuscript from Vera et al. describes two single-cell sequencing datasets comprising infections by exemplar strains representing the circulating H1N1 and H3N2 viruses. The data are of considerable interest and utility to the field of influenza research, and the datasets are relatively high-quality and represent significantly rich resources for further work. Indeed, while the authors enrichment for influenza positive cells doubtless introduces some bias (an unavoidable fact in any perturbation), this permitted them probably the deepest and most comprehensive infectious dataset to date. The authors initial assessments of their datasets are also of interest. Of particular interest is their finding that the NS segment exhibits differential dynamics from the other seven flu segments, however I do believe this finding requires more support than is currently present in this manuscript. The authors also find a difference in the innate immune response to H3N2 and H1N1 influenzas and provide data suggesting differential reliance upon the NS segment (likely NS1) for suppression of an innate immune response.

With one considerable reservation, I believe this work represents a high-quality contribution to the field of influenza research and worthy of publication.

Reviewer #2: This manuscript by Vera et al examines the heterogeneity of antiviral responses to two strains of influenza virus using single cell RNAseq. They employ a clever flow cytometry pre sorting strategy to ensure that only 1 virion per cell. This allows them to examine a very large number of single infected cells, providing a degree of resolution which wouldn't be possible with conventional approaches. They discover significant cell-cell variability in virus replication levels, virus genes present, and host response. They further go on to directly compare two different strains H1N1 and H3N2 and uncover differences in their ability to induce interferon. This manuscript is well written, experiments are well done and the findings will be of great interest to readers in the field. However, there are some experiment that could significantly add to this study and extend it even further beyond what has been described previously. I also have some minor concerns/comments.

Reviewer #3: In this manuscript “A common pattern of influenza A virus gene expression heterogeneity governs the innate antiviral response to infection” by Vera et al., the authors examined viral and host transcriptomic changes during influenza A/California/07/2009 H1N1 and A/Perth/16/2009 H3N2 infection in A549 cells at the single cell level. Authors found different patterns of viral gene expression that resulted in multiple distinct host transcriptional responses to influenza viral infection. Moreover, the authors observed that differences in viral NS segment gene expression were responsible for shaping the host cell response to infection. Authors also observed that the pattern of host anti-viral gene transcriptional heterogeneity at the single cell level was different in cells infected with influenza H1N1 and H3N2 seasonal viruses.

Overall, this is an interesting manuscript for influenza virologists that provide a significant amount of information on viral and host transcriptomic responses, at the individual cell level, in A549 cells infected with seasonal H1N1 and H3N2 influenza virus. There are, however, several general concerns, mainly with the novelty and experimental approach, that reduce the overall enthusiasm for the manuscript.

**********

Part II – Major Issues: Key Experiments Required for Acceptance

Please use this section to detail the key new experiments or modifications of existing experiments that should be absolutely required to validate study conclusions.

Generally, there should be no more than 3 such required experiments or major modifications for a "Major Revision" recommendation. If more than 3 experiments are necessary to validate the study conclusions, then you are encouraged to recommend "Reject".

Reviewer #1: Major Comments

1) The authors use an orthogonal (ie sequenced separately albeit derived from same biological sample) bystander dataset to precondition their influenza thresholds. While I definitely appreciate the authors attempt to set an empirical threshold, the underlying approach they have taken has two considerable caveats that might seriously impact this body of work:

a. The first is the known influence of batch effects on single-cell datasets, an effect for which there has been considerable computational efforts to address. Indeed, the structure described by the tSNE in Figure 1B is likely as consistent with batch effects as it is with differential underlying biology. This would impact differential expression measurements between flu+ and flu- datasets, but, more critically in my mind, could lead to incorrectly derived thresholds for presence/absence of an influenza gene.

b. The second is that any lysis (or other means or RNA release) that occurs post-sorting, either in the microfluidics or even while the cells are sitting after sorting, would not be captured in the bystander dataset. This would also lead to incorrectly derived thresholds.

Truthfully, I am not sure there is a way to “fix” this problem without repeating these experiments. To be clear, this is a problem which is present in many similar, published, datasets. I do not think it is a strong enough issue to warrant repeating these experiments to include some truly internal error correction control, however I do believe the authors could change the text to be much more upfront about these considerations. As written, my concern is the authors claim this as a much stronger error-correction method than I believe is warranted and so these issues might propagate in future studies.

2) Somewhat following from my first concern, I am a bit troubled that some of the NS segment dynamics could be explained by an “incorrect” threshold. In Figure 5D there appear to be two populations, “NS-low” and “NS-high”, that each exhibit parallel behavior with respect to their relationship with the rest of the flu segments. If NS-low is really NS null, and the increasing flu counts is more a function of capture efficiency (or reverse-transcription efficiency) than any underlying biological difference, then it is possible that the authors are merely describing segment absence, a phenomenon already described in great detail by this group. That this problem might be restricted to, or more severe in, the NS segment would be somewhat expected, as it would likely track with relative expression (ie more expression leading the greater extracellular contamination), and NS and M are the most highly expressed segments. The absence of this effect in M could then be explained by the deliberate sorting on M2. I think if the authors want to make this point, which is of interest and is aberrant with respect to other measurements, then an orthogonal means of measurement must be employed. I think the most straightforward would be to measure either NEP or NS1 and some other flu protein by flow cytometry and see if this population can be detected. Alternatively, if there are no reasonable antibodies against NS1/NEP (there are for WSN, however I admit ignorance for the cross-reactivity with circulating H1N1 and H3N2), the authors could try and use reporter strains (bearing fluorescent proteins) in the NS segment instead. Admittedly the absence of the phenotype in those populations could also be due to perturbations caused by generation of a reporter virus, but the authors should at least make the attempt. If neither of these experiments is possible, it would be ideal to state as such in the text and the authors MUST at least address this reasonable alternative hypothesis in a manner which is clear and upfront and obvious to the reader.

Reviewer #2: The finding that Slfn5 is inversely correlated with NS is extremely interesting. Can the authors perform follow up analysis using NS1 deficient viruses to show they get the predicted phenotype in bulk A549 cells. Additionally can they use Slfn5 ko to demonstrate a consequence for this observation?

Reviewer #3: 1) One of the concerns with this manuscript is the novelty. As indicated by the authors in several parts of the document, similar studies to those described in this manuscript have been previously conducted and reported in the literature by different research groups (e.g., Extreme heterogeneity of influenza virus infection in single cells; Single-cell analysis and stochastic modelling unveil large cell-to-cell variability in influenza A virus infection; Innate immune response to influenza virus at single-cell resolution in human epithelial cells revealed paracrine induction of interferon lambda 1; among others). Thus, it is unclear the new findings in this manuscript. At least the authors should clearly outline how the studies in this manuscript differ from those in previously published literature.

2) Another concern is that the results are based on the infection of a single cell type (A549) at a single time post-infection (16 hours). It is possible, and most likely, that different results will be obtained using a different cell line and different times post-infection.

3) Experiments to evaluate changes in viral and/or host transcriptomic responses at the single cell level from natural influenza human infections would be more relevant than the use of A549 cells. Alternatively, the use of more relevant primary airway culture systems rather than a cancer cell line would increase the relevance of this study. Moreover, looking at different times post-infection, rather than a single time point, to assess the dynamic changes in viral and host transcriptomic responses during viral infection will probably increase the significance of this study.

4) Another concern is the lack of synchronization of viral infections. It is possible that both viral and host transcriptomic differences are due to differences in the replication step of the virus in different cells. Synchronizing viral infections would allow to rule out this possibility.

5) The authors indicate that changes in viral and host transcriptomics were obtained from single virion-infected cells based on the use of low multiplicity of infection (MOI; 0.01). However, this has not been clearly shown. Likewise, the authors indicate that viral stocks lack defective interfering particles (DIPs) since they were generated under experimental conditions that minimize the presence of DIPs (e.g. low MOI). However, the presence/absence of DIPs in viral stocks have not been clearly evaluated and could be responsible, at least in part, of the differences in the viral and/or host transcriptomic responses.

6) To clearly demonstrate the role of the viral NS1 in the differences of host transcriptomic responses, the authors should consider the possibility of comparing results from wild-type (WT) and NS1-deficient H1N1 and/or H3N2 virus infected cells.

7) The authors observed differences in both viral and host transcriptomic responses between H1N1 and H3N2 seasonal viruses. It is possible, and most likely, that differences in viral replication/transcription dynamics of these two viruses are responsible of these disparities.

8) Likewise, the authors suggest that differences in antiviral transcriptomic responses between H1N1 and H3N2 seasonal viruses could be attributed to differences in the NS1 proteins. However, it is unclear if the authors have compared the ability of the NS1 proteins from H1N1 and H3N2 seasonal viruses to inhibit innate immune responses and host gene expression; and whether or not these differences are responsible for those seen in the host transcriptomic responses in infected cells. Moreover, to clearly demonstrate that the NS1 protein and no other viral proteins are responsible for the differences in host transcriptomic responses between H1N1 and H3N2 seasonal viruses, the authors should generate recombinant viruses with the same genetic backbone that only differ in the viral NS1 protein.

9) As indicated in the manuscript, influenza PA-X has been described to inhibit host gene expression during influenza viral infection. It is unclear if the differences observed between the two seasonal influenza H1N1 and H3N2 viruses could be explained based on the ability of their PA-X proteins to induce cellular host shut off.

10) Some of the figures were difficult to assess due to the low quality, mainly Figures 2, 3, 6, and 7.

**********

Part III – Minor Issues: Editorial and Data Presentation Modifications

Please use this section for editorial suggestions as well as relatively minor modifications of existing data that would enhance clarity.

Reviewer #1: 1) Line 67. The authors might have a comment to say about other datasets, but I find it relatively vague. I would appreciate either directly stating the “experimental design choices that complicated analysis” to inform the rest of the field, or leaving this sentence out entirely. This work stands on its own, but if the authors wish to make a comment to directly improve the field they should probably be explicit in their recommendations.

2) Line 90. This is an incorrect calculation as known from the senior author’s own work. While calculations of SIP frequency vary, the lower bounds are still around 10:1 SIP-non-SIP particles. If the authors are using an MOI of 0.01, the effective particle dose would then be at least 0.1, which emphatically would not lead to greater than 99.99% of infected cells experiencing only one virion. The MOI the authors use is certainly low enough for their experimental design and I would be perfectly happy with them removing the incorrect calculation from the text.

3) A bit on the same theme as the major comments above I am afraid. If the thresholds the authors set are biologically meaningful, the absence of the polymerase segment would be predicted to lead to less influenza transcripts observed, do the authors note this? Similarly, we might expect absence of polymerase segments to drive similar (not necessarily identical) transcriptional programs in the host, can you comment on this?

4) Lines 111-113 are a bit strong, there is probably still some non-zero rate of coinfection (particularly given the incorrect assumptions of particle ratios discussed above), and whether a 1 hour infection window is “tight” probably depends on who you ask. Slightly more equivocal language would be appreciated.

5) Line 229. This is written in a misleading fashion and needs to be fixed. Russell et al. did describe a statistical association with NS absence and induction of interferon, even though it was non-deterministic. The way I read this sentence is that there was no association found (given that you are claiming your results “echo” this previous finding). This does not detract from the authors’ observations, as they performed their experiment with a different strain under different conditions, so if they could simply fix the text to be a bit less misleading that would be perfect.

6) Lines 240-241. The use of R2 here, and the language used, doesn’t appear to reflect what the authors observe. It looks as if there are two NS populations (as stated above, NS-low and NS-high) both of which appear to exhibit positive correlations with other influenza segments. This is stated a bit more clearly in lines 257-259, but I feel would lead to considerable misunderstanding as written. By conflating two subpopulations into a single R2 measurement there is the dubious conclusion that there is no positive correlation. If the authors could clearly state as such, and even calculate the R2 for each population (even some arbitrary threshold would be fine, say 102 copies of NS1) they would likely much better capture the dynamics of their observations.

7) Lines 347-361. While not the exact strain used here, a previous study (Hayman, A. et al. Variation in the ability of human influenza A viruses to induce and inhibit the

IFN-β pathway. Virology 347, 52–64 (2006).) Found a similar increased interferon induction by a few circulating H3N2 strains. I think probably worth citing and a quick mention.

Reviewer #2: -What percent of cells that are HA-M2- at 16 hrs become either HA or M2+ at later time points? This might help understand some of the minor limitations of the flow based pre screen and would help add to the body of literature on the kinetics of IAV protein and mRNA expression heterogeneity at single cell levels.

-What is the rationale for only using top 10 host genes for Fig 2 D-E? Shouldn’t the top 100 be used given the range of virus levels?

-Can the authors provide information one the transcriptional responses to the extreme ends of the virus replication levels (1-90%)? Could this be a stronger driver than presence/absence of specific influenza genes?

-Another potential explanation for lack of polymerase still having surface HA and M2 is production through primary transcription alone, which could be mentioned.

-Can the authors speculate as to why PB2 is the most frequently missing segment? And what consequences this might have for reassortment etc

Reviewer #3: None

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

Figure Files:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Data Requirements:

Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example see here on PLOS Biology: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5.

Reproducibility:

To enhance the reproducibility of your results, PLOS recommends that you deposit laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see http://journals.plos.org/plospathogens/s/submission-guidelines#loc-materials-and-methods

Decision Letter 1

Adam S Lauring

1 Jun 2020

Dear Dr Brooke,

We are pleased to inform you that your manuscript 'Single cell heterogeneity in influenza A virus gene expression shapes the innate antiviral response to infection' has been provisionally accepted for publication in PLOS Pathogens.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Pathogens.

Best regards,

Adam S. Lauring

Section Editor

PLOS Pathogens

Andrew Pekosz

Section Editor

PLOS Pathogens

Kasturi Haldar

Editor-in-Chief

PLOS Pathogens

orcid.org/0000-0001-5065-158X

Michael Malim

Editor-in-Chief

PLOS Pathogens

orcid.org/0000-0002-7699-2064

***********************************************************

This is a nice piece of work. Thank you for submitting to PLOS Pathogens and for your efforts to address the reviewers' comments.

Reviewer Comments (if any, and for reference):

Reviewer's Responses to Questions

Part I - Summary

Please use this section to discuss strengths/weaknesses of study, novelty/significance, general execution and scholarship.

Reviewer #1: The authors have appropriately addressed my remaining misgivings. i see no reason why the publication of this manuscript should be delayed further, as it represents a solid dataset and reasonable analysis that is of general interest to the audience of PLoS Pathogens.

Reviewer #2: In this revised manuscript by Sun and colleagues address the key critiques of all three reviewers, resulting in a significantly strengthened manuscript. They have added substantial new analysis and clarified several key points. This is a very rigorous assessment of the replication heterogeneity after influenza infections and uses several clever experimental and computational approaches. They are able to overcome several key caveats of other studies allowing for a highly refined assessment of infection heterogeneity. Overall the manuscript is well written and the experiments are well designed and executed. The biological findings and approaches presented will be of significant interest to readers in the field.

**********

Part II – Major Issues: Key Experiments Required for Acceptance

Please use this section to detail the key new experiments or modifications of existing experiments that should be

absolutely

required to validate study conclusions.

Generally, there should be no more than 3 such required experiments or major modifications for a "Major Revision" recommendation. If more than 3 experiments are necessary to validate the study conclusions, then you are encouraged to recommend "Reject".

Reviewer #1: (No Response)

Reviewer #2: (No Response)

**********

Part III – Minor Issues: Editorial and Data Presentation Modifications

Please use this section for editorial suggestions as well as relatively minor modifications of existing data that would enhance clarity.

Reviewer #1: (No Response)

Reviewer #2: (No Response)

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review?

For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Acceptance letter

Adam S Lauring

22 Jun 2020

Dear Dr Brooke,

We are delighted to inform you that your manuscript, "Single cell heterogeneity in influenza A virus gene expression shapes the innate antiviral response to infection," has been formally accepted for publication in PLOS Pathogens.

We have now passed your article onto the PLOS Production Department who will complete the rest of the pre-publication process. All authors will receive a confirmation email upon publication.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any scientific or type-setting errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript. Note: Proofs for Front Matter articles (Pearls, Reviews, Opinions, etc...) are generated on a different schedule and may not be made available as quickly.

Soon after your final files are uploaded, the early version of your manuscript, if you opted to have an early version of your article, will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting open-access publishing; we are looking forward to publishing your work in PLOS Pathogens.

Best regards,

Kasturi Haldar

Editor-in-Chief

PLOS Pathogens

orcid.org/0000-0001-5065-158X

Michael Malim

Editor-in-Chief

PLOS Pathogens

orcid.org/0000-0002-7699-2064

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. Viral and host transcriptional similarity in cells from Cal07 infection.

    tSNE dimensionality reduction plot showing the extent of overlap between 3 indicated cell populations from Cal07 experiment clustered based on transcriptional similarity.

    (TIFF)

    S2 Fig. Determination of cutoff thresholds used to determine infection status.

    (A) Histograms comparing distributions of total viral mRNA percentages in log10 scale across the three libraries (mock, bystander, and infected) for Cal07 experiment. Vertical dash line indicates cutoff threshold determined by calculating kernel density estimates on the distribution of infected library and finding the first local minima. (B) Cells from each library clustered as in S1 Fig are shown separately, with uninfected cells (below threshold) colored in cyan and infected cells (above threshold) colored in salmon. (C) Same figure as (A) for Perth09. (D) Same figure as (B) for Perth09.

    (TIFF)

    S3 Fig. All pairwise Cal07 viral gene correlation plots.

    Normalized per cell copy numbers for the indicated Cal07 genes plotted against each other. Data only show infected cells that are positive for all viral gene segments.

    (TIFF)

    S4 Fig. All pairwise Perth09 viral gene correlation plots.

    Normalized per cell copy numbers for the indicated Perth09 genes plotted against each other. Data only show infected cells that are positive for all viral gene segments.

    (TIFF)

    S5 Fig. Determination of cutoff thresholds used to determine or presence/absence of individual viral gene segments.

    (A) Histograms show the percentages of mRNA molecules derived from each Cal07 gene segment in log10 scale. Vertical dash lines indicate cutoff thresholds determined by calculating kernel density estimates on the distributions and finding the first local minima. For PB2 and PA that do not have clear bi-modal distributions, the threshold was set to -1.5 (log10 scale), which was a consistent minimum in other low-expression viral genes and extremely close to the maximum value of PB2 and PA in uninfected cells within the infected library. (B) Same figure as (A) for Perth09.

    (TIFF)

    S6 Fig. Quantification of all differentially expressed host genes correlated with presence/absence of individual viral gene segments.

    (A) The number of host transcripts for which expression levels significantly differ depending on whether the indicated Cal07 gene segment is present or not, according to both MAST and NBID (host genes that are differentially regulated by the expression status of more than one viral segment are included). (B) Same figure as (A) for Perth09.

    (TIFF)

    S7 Fig. Comparison on DVGs content between validated viral stocks with low DVGs and viral stocks used in scRNAseq experiments.

    PCR products following 8-segment whole-genome amplification from viral cDNAs of the pre-verified viral stocks (Cal07 LD and Perth09 LD have been shown to have minimum DVGs using NGS sequencing in articles published by the lab) and viral stocks used in scRNAseq experiments (Cal07 SC and Perth09 SC) are visualized on 1% agarose gel.

    (TIFF)

    S1 Table. All host genes differentially expressed between Seurat clusters of Cal07 infected cells.

    (TXT)

    S2 Table. All host genes differentially expressed between Seurat clusters of Perth09 infected cells.

    (TXT)

    S3 Table. Combined DGE list based on Cal07 PB2 expression status within infected cells.

    (TXT)

    S4 Table. Combined DGE list based on Cal07 PB1 expression status within infected cells.

    (TXT)

    S5 Table. Combined DGE list based on Cal07 PA expression status within infected cells.

    (TXT)

    S6 Table. Combined DGE list based on Cal07 NP expression status within infected cells.

    (TXT)

    S7 Table. Combined DGE list based on Cal07 NA expression status within infected cells.

    (TXT)

    S8 Table. Combined DGE list based on Cal07 M expression status within infected cells.

    (TXT)

    S9 Table. Combined DGE list based on Cal07 NS expression status within infected cells.

    (TXT)

    S10 Table. Combined DGE list based on Perth09 PB2 expression status within infected cells.

    (TXT)

    S11 Table. Combined DGE list based on Perth09 PB1 expression status within infected cells.

    (TXT)

    S12 Table. Combined DGE list based on Perth09 PA expression status within infected cells.

    (TXT)

    S13 Table. Combined DGE list based on Perth09 HA expression status within infected cells.

    (TXT)

    S14 Table. Combined DGE list based on Perth09 NP expression status within infected cells.

    (TXT)

    S15 Table. Combined DGE list based on Perth09 NA expression status within infected cells.

    (TXT)

    S16 Table. Combined DGE list based on Perth09 NS expression status within infected cells.

    (TXT)

    Data Availability Statement

    All next generation sequencing data is available from the NCBI GEO via accession number GSE143167. All other relevant data are within the manuscript and its Supporting Information files.


    Articles from PLoS Pathogens are provided here courtesy of PLOS

    RESOURCES