Skip to main content
Virus Evolution logoLink to Virus Evolution
. 2023 Aug 26;9(2):vead056. doi: 10.1093/ve/vead056

Evolution of transient RNA structure–RNA polymerase interactions in respiratory RNA virus genomes

Charlotte V Rigby 1,2,3, Kimberly R Sabsay 4,5, Karishma Bisht 6, Dirk Eggink 7,, Hamid Jalal 8, Aartjan J W te Velthuis 9,10,*,
PMCID: PMC10492445  PMID: 37692892

Abstract

RNA viruses are important human pathogens that cause seasonal epidemics and occasional pandemics. Examples are influenza A viruses (IAV) and coronaviruses (CoV). When emerging IAV and CoV spill over to humans, they adapt to evade immune responses and optimize their replication and spread in human cells. In IAV, adaptation occurs in all viral proteins, including the viral ribonucleoprotein (RNP) complex. RNPs consist of a copy of the viral RNA polymerase, a double-helical coil of nucleoprotein, and one of the eight segments of the IAV RNA genome. The RNA segments and their transcripts are partially structured to coordinate the packaging of the viral genome and modulate viral mRNA translation. In addition, RNA structures can affect the efficiency of viral RNA synthesis and the activation of host innate immune response. Here, we investigated if RNA structures that modulate IAV replication processivity, so-called template loops (t-loops), vary during the adaptation of pandemic and emerging IAV to humans. Using cell culture-based replication assays and in silico sequence analyses, we find that the sensitivity of the IAV H3N2 RNA polymerase to t-loops increased between isolates from 1968 and 2017, whereas the total free energy of t-loops in the IAV H3N2 genome was reduced. This reduction is particularly prominent in the PB1 gene. In H1N1 IAV, we find two separate reductions in t-loop free energy, one following the 1918 pandemic and one following the 2009 pandemic. No destabilization of t-loops is observed in the influenza B virus genome, whereas analysis of SARS-CoV-2 isolates reveals destabilization of viral RNA structures. Overall, we propose that a loss of free energy in the RNA genome of emerging respiratory RNA viruses may contribute to the adaption of these viruses to the human population.

Keywords: coronavirus, RNA structure, influenza A virus, RNA polymerase, t-loop, adaptation

Introduction

RNA viruses, such as influenza A viruses (IAV) or coronaviruses (CoV), are important pathogens that cause mild to severe respiratory disease in humans. RNA virus spread among humans occurs in seasonal epidemics, outbreaks of emerging viruses, or occasional pandemics (Alvarez-Munoz et al. 2021). Seasonal epidemics are caused by viruses that are largely adapted to humans and to which humans have pre-existing immunity, such as the seasonal IAV strains H3N2 and H1N1 or CoV strains NL63 and 229E. In contrast, outbreaks and pandemics are caused by emerging RNA viruses to which humans lack pre-existing immunity (Taubenberger and Morens 2010). Emerging viruses spill over from an animal reservoir to humans, and typically have poor human-to-human transmission. Examples of these are the IAV H5N1 and H7N9 strains. Pandemic viruses can efficiently spread from human-to-human due to adaptations that they acquired via mutation and natural selection, or through reassortment or recombination with a human-adapted virus. In recent history, IAV subtypes H1N1, H2N2, and H3N2 have caused pandemics in 1918, 1957, 1968, and 2009, while the CoV severe acute respiratory CoV 2 (SARS-CoV-2) caused the COVID-19 pandemic (Zhu et al. 2020).

Analysis of IAV strains has shown that human-adaptive mutations occur in nearly all viral proteins, but in particular in the IAV surface glycoproteins hemagglutinin (HA) and neuraminidase (NA); the proteins involved in viral replication polymerase basic 1 and 2 (PB1 and PB2), polymerase acidic (PA), and the nucleoprotein (NP); and the protein involved in modulating host immune responses, non-structural 1 (NS1) (Bouvier and Palese 2008; Miotto et al. 2010; Selman et al. 2012; Forbes et al. 2013; Jiménez-Alberto et al. 2013; Mänz, Schwemmle, and Brunotte 2013; Czudai-Matwich et al. 2014; Elderfield et al. 2014; Di Lella, Herrmann, and Mair 2016; Wen et al. 2018; Lin et al. 2019; Chang et al. 2020; Thompson and Paulson 2021). During the SARS-CoV-2 pandemic, adaptive mutations have been observed primarily in the spike (S) glycoprotein, and potentially important mutations have been reported for the nucleocapsid protein, envelope protein, and non-structural proteins 2, 3, 4, 10, 12, and 13 (Mou et al. 2021; Rochman et al. 2021; Harari et al. 2022; Johnson et al. 2022; Kistler, Huddleston, and Bedford 2022; Sonnleitner et al. 2022; Abbasian et al. 2023). Research indicates that mutation in the glycoproteins improve host receptor binding, virus transmission, pH stability, and host immune evasion, whereas mutations in the viral replication complex improve viral RNA synthesis (Laha et al. 2020; Weisblum et al. 2020; Gupta et al. 2021; McCarthy et al. 2021; Verkhivker et al. 2021; Yang et al. 2021; Kistler, Huddleston, and Bedford 2022; Kudriavtsev et al. 2022; Liu et al. 2022). In addition to mutations in the viral proteins, nucleotide changes and deletions have been observed in IAV and SARS-CoV-2 genome RNA structures located in untranslated regions (UTRs), indicating that changes at the RNA structure level may also play a role in the adaptation of RNA viruses (Priore et al. 2015; Ryder et al. 2021).

Secondary RNA structures in RNA virus genomes play critical roles in IAV and CoV infection cycles and perform functions in packaging, translation, and viral replication. In addition, RNA structures can be the target of therapeutics (Piasecka et al. 2020). Secondary RNA structures can form between neighbouring sequences, as well as involve long-range interactions, such as duplex formation between the 5′ and 3′ UTRs of various viruses (Furuse and Oshitani 2011; Baranovskaya et al. 2019). Disruption of some RNA structures affects RNA virus growth, including packaging signals; however, several IAV RNA structures have a large degree of variability among strains (Fujii et al. 2009; Hutchinson et al. 2009; Huston et al. 2021). Presently, our understanding of the role of RNA secondary structures as well as mutations in these structures during RNA virus replication and adaptation is limited, but it seems likely that RNA structures that negatively impact viral replication in a new host may be modified or removed in time.

In this study, we aimed to gain insight into the evolution of template loops (t-loop), a transient RNA secondary structure that folds effectively ‘around’ the viral RNA polymerase (Fig. 1A,B) and affects IAV replication efficiency on short viral RNA templates (French et al. 2022). In the t-loop model, we envision the RNA polymerase as an object on a tablecloth: when the enzyme synthesizes RNA, each translocation of the RNA polymerase changes the local RNA template ‘landscape’. Some structures may be (partially) perturbed, while others may form as their folding energy becomes favourable in the absence of competing structures. To gain insight into this relative or transient RNA structure landscape around the RNA polymerase on short templates, we used a sliding-window algorithm to calculate the total free energy (ΔG) for RNA structures upstream, downstream, and around the RNA polymerase along an RNA sequence and confirmed our predictions with biochemical assays (French et al. 2022). For full-length viral genome (segments), the effect of t-loops cannot be studied yet due to the lack of appropriate biochemical assays. However, we hypothesized that if strong t-loops are present in the viral genome and they affect the efficiency of viral RNA synthesis, they would be destabilized over time to improve viral fitness. These destabilizations would appear as synonymous nucleotide changes in primary sequence data. We expected that these putative t-loop destabilizations would occur in emerging IAV strains, because they would benefit most from improvements in viral replication. In addition, we hypothesized that after t-loop destabilization, continued adaption of the viral replication complex in a t-loop-less genome would make the viral RNA polymerase sensitive to the re-introduction of t-loops. Hence, we expected human-adapted IAV to be more sensitive to t-loop structures than recently emerged/less human-adapted viruses, and that reassortment events between human- and avian-adapted IAV would impose strong selections on strong t-loops, which would be apparent from sequence data.

Figure 1.

Figure 1.

Interferon beta (IFN-β) promoter activity in an RNP reconstitution assay. (A) Model of the elongation complex of the IAV RNA polymerase, in which the 3′ end is bound by the B-site on the side of the RNA polymerase. Arrows indicate movement of template and product RNA through the RNA polymerase. (B) Model of the elongation complex of the IAV RNA polymerase, in which the 3′ end base-pairs with a sequence located upstream of the RNA polymerase. The formation of such a loop in the template leads to reduced RNA polymerase activity and innate immune activation when the loop forms in the first half of the template. (C) Fold change in IFN-β promoter activation following the replication and transcription of an mvRNA containing no t-loop in the first half of the template (NP71.1) and an mvRNA containing a t-loop in the first half of the template (NP71.11). Fold change in IFN-β promoter activation was normalized to the IAV WSN H1N1 control for each template. Bottom panels show western blot analysis of NP and PB1 expression for the analysed IAV H3N2 replication complexes. Tubulin was used as loading control.

To test the above hypotheses, we performed cell culture assays as well as analyses of IAV, influenza B virus (IBV), and SARS-CoV-2 sequence data obtained through surveillance. We observe that the sensitivity of H3N2 IAV RNA polymerases to t-loops increased between 1968 and 2017. At the same time, the ΔG of t-loops in the IAV H3N2 genome linearly increased, and thus the overall t-loop stability in the genome reduced. Reductions in t-loop stability are also observed in H1N1 IAV following the 1918 and 2009 pandemics. No destabilization of t-loops is observed in the IBV genome, which has been circulating in humans for decades. However, an analysis of SARS-CoV-2 isolates does reveal destabilization of t-loops. Overall, we propose that the ΔG of RNA structures in the RNA genome of emerging respiratory RNA viruses may play a role in the adaption of these viruses to humans.

Results

T-loops differentially affect IAV H3N2 RNA polymerase activity and IFN induction

We previously identified t-loops as RNA elements that can reduce the replication efficiency of H1N1 and H5N1 IAV RNA polymerases on 71-nt long RNA templates called mini-viral RNAs (mvRNAs) (French et al. 2022). In our previous observations, mvRNAs containing a t-loop in the first half of the template were more likely to trigger interferon (IFN) promoter activity than mvRNAs without a t-loop in the first half of the template (French et al. 2022). So far, the sensitivity of the RNA polymerase of emerging and adapted IAV viruses to t-loops has not been directly compared. The IAV H3N2 genome has been continuously circulating in humans for 55 years, making the replication complexes of H3N2 IAVs ideal for performing a direct comparison between an emerged and adapted IAV strain. To perform the comparison, we transfected plasmids expressing the PB1, PB2, PA, and NP proteins of the A/WSN/1933 (abbreviated as WSN), A/Netherlands/1968 (abbreviated as H3N2-1968), and A/Netherlands/2017 (abbreviated as H3N2-2017) RNA polymerases into human embryonic kidney 293T (HEK293T) cells along with an IFN beta (IFN-β)-promoter-driven luciferase reporter plasmid and a TK-promoter-driven Renilla luciferase control plasmid. In addition, we provided a cellular RNA polymerase I-driven plasmid expressing a 71-nt long template without a t-loop in the first half of the template (NP71.1) and a 71-nt long template containing a t-loop in the first half of the template (NP71.11). These templates were extensively tested previously and used to show that the IFN-β promoter activity is indicative of poor RNA polymerase activity (French et al. 2022). Western blot analysis showed efficient expression of the viral proteins (Fig. 1). Measurement of the IFN-β promoter activity showed no difference in the ability of the WSN and H3N2-1968 RNA polymerase to induce IFN-β promoter activity. However, the H3N2-2017 RNA polymerase induced an almost eightfold higher IFN-β promoter activity (Fig. 1), suggesting that it had a higher sensitivity to the t-loop structure in the template than the WSN and H3N2-1968 RNA polymerase.

Initial analysis of IAV H3N2 t-loop stability

The above result suggested that the RNA polymerase of a recent IAV H3N2 isolate is more sensitive to t-loops than an IAV H3N2 isolate from 1968. We hypothesized that any t-loops in the 1968 IAV H3N2 genome would be absent in the 2017 IAV H3N2 genome if they negatively affected viral replication. To explore this, we focussed on an initial set of genome sequences of IAV H3N2 isolates from 1968, 1972, 1982, 1993, 2003, 2008, 2014, and 2017. These isolates capture the full IAV H3N2 tree (Fig. 2A) and did not show a particular trend in the variation of the GC content per genome over time (Fig. 2B).

Figure 2.

Figure 2.

Phylogeny and GC content of IAV H3N2 genomes between 1968 and 2017. (A) Phylogenetic analysis of IAV H3N2 based on the PB2 nucleotide sequences of isolates from 1968, 1972, 1982, 1993, 2003, 2008, 2014, and 2017. Bar represents degree of nucleotide change. (B) GC content (%) for the genomes shown in Fig. 2A.

Next, we analysed the putative RNA structure content in the IAV H3N2 genome. To this end, we modified the t-loop analysis that we described previously (French et al. 2022) to account for NP binding up and downstream of the IAV RNA polymerase during replication and RNP assembly (Fig. 3A,B). Specifically, we used a sliding-window analysis, in which a sequence covered by the 20-nt footprint of the IAV RNA polymerase is blocked from participating in secondary RNA structure formation (Kouba, Drncová, and Cusack 2019; Wandzik et al. 2020). We also assumed that NP has a footprint of 24 nt and that the separation of the template RNA from the downstream or upstream NP in the RNP makes 24 nt available for base pairing (Fig. 3B) (Ortega et al. 2000). With both an upstream and downstream NP removed, t-loop as well as any other secondary RNA structure formation can be calculated between the two 24-nt stretches of single-stranded RNA (Fig. 3B). The sliding window was started at the 3′ end of the viral genome and moved in 1-nt intervals along the length of the template RNA.

Figure 3.

Figure 3.

Model for RNP assembly during replication and t-loop formation. (A) Model of IAV replication and RNP assembly following the recruitment of a second RNA polymerase and newly synthesized NP molecules. (B) Model of t-loop formation in an IAV RNP that is the basis of this manuscript. The template is shown in orange, with the footprint of the RNA polymerase indicated as a grey arrow. Base pairing between the up- and downstream sequences is illustrated as concentric lines. The footprint of the NP molecules, 24 nt, is also indicated. (C) Analysis of the t-loop ΔG in the IAV H3N2 genome segments 1–3. Analysis was performed on the IAV H3N2 genomes of Fig. 2 and the ΔG sorted from negative to positive on the 1968 H3N2 values. See Table S1 for unsorted data. Scale bar indicates t-loop ΔG in kcal/mol. (D) Total t-loop ΔG in IAV genome segments for the IAV H3N2 genomes of Fig. 2. See Table S1 for data.

Before discussing our findings, we note a few caveats to our approach. Research has shown that the placement of NP along the viral genome is heterogeneous and that various secondary RNA structures are present in IAV genome segments in virio (Le Sage et al. 2018; Dadonaite et al. 2019). NP dissociation downstream or upstream of an existing RNA structure may make additional nucleotides available for t-loop formation. However, the stability of such a secondary RNA structure is likely such that it will not spontaneously unwind and contribute to t-loop formation. In addition, if spontaneous unwinding were to occur, any nucleotides placed further away from the RNA polymerase template channels are unlikely to form a tight t-loop around the RNA polymerase and will thus unlikely contribute to the modulation of RNA polymerase processivity. For simplicity, we have excluded the above two possibilities from our analysis, but point them out because we appreciate that the (transient) RNA structure landscape is likely more complex than we can currently model.

Analysis of the first three IAV H3N2 genome segments showed a location-specific loss of, as well as an increase in t-loop stability across the segments in both the cRNA and vRNA sense (Fig. 3C; Table S1). We also noticed that positive ΔG values were present and increased in some locations. Together, these observations suggest that the stability of t-loops in the IAV H3N2 genome varies over time and per location. To get a better sense of the impact of changes in t-loop stability across each segment in time, we calculated the total t-loop ΔG per segment, per IAV H3N2 isolate. To avoid introducing biases (e.g. varying number of data points, excluding parts of some segments for some isolates, etc.), we included both the positive and negative ΔG values in this analysis. Interestingly, we observed that of the three RNA polymerase encodings segments, in particular, the PB1 vRNA segment showed a trend corresponding with t-loop destabilization (Fig. 3D), while other segments showed only a modest variation in time. This trend in PB1 was also present if we analysed the negative t-loop ΔG values only (Fig. S1).

To investigate whether the observed trends were generated by chance, we simulated mutations in the IAV H3N2 isolate from 1968, with frequencies of 3.2 × 10−3 and 2.5 × 10−4 per nucleotide per year as the upper and lower bounds of the whole genome mutation rate (Yang et al. 2014, Pauly, Procario, and Lauring 2017). We performed the mutation simulations for the PB1 and PB2 segments for three consecutive decades, while maintaining the codon sequence. Next, we calculated the t-loop ΔG for the in silico variants in both the cRNA as vRNA sense. As shown in Fig. S2, the t-loop ΔG of the PB1 segment deviated from the in silico variants, trending towards less negative ΔG values, whereas the PB2 t-loop ΔG values overlapped with the in silico variants. However, the observed patterns were not significantly different for this small sample size.

The IAV H3N2 shows a progressive loss of t-loop stability in PB1 segment

We next extended our dataset using 50 full-length IAV H3N2 sequences for each of the above years (Table S2). We first analysed the total minimum free energy (MFE; which attempts to describe large RNA structures and structures forming up and downstream of the RNA polymerase) in segment 1 (PB2), segment 2 (PB1), and segment 3 (PA) as well as the total t-loop stability per IAV genome segment. In line with our initial analysis above, we observed clear segment-specific trends (Fig. 4). The total MFE followed a progressive increase in MFE stability (decrease in ΔG) for the PB2 segment, whereas the PB1 and PA segments demonstrated a loss of MFE stability (Fig. 4). In contrast, the t-loop stability was reduced for all three RNA polymerase segments across the 55 years analysed. Compared with the in silico variants (Fig. S2), the PB1 change in ΔG was significantly different (P < 0.001). The steepest increase in t-loop ΔG was observed for the PB1 segment between 1968 and 2008. An apparent plateau in PB2 and PB1 t-loop stability followed 2008, but additional (future) years would need to be analysed to confirm the start of a new trend. For simplicity, we fit the trends with a linear regression (P < 0.0001). In general, both the vRNA and cRNA sense showed similar directions in their increase or decrease in MFE or t-loop ΔG. However, the direction in MFE and t-loop ΔG change was not consistent, as was particularly evident in the PB2 segment (Fig. 4).

Figure 4.

Figure 4.

Total MFE and t-loop free energy analysis for IAV H3N2 isolates between 1968 and 2017. Analysis of the predicted MFE and t-loop content of the cRNA (orange) and vRNA (black) is shown. For each year, the first 50 full-length, unique sequences identified (where available) were analysed and plotted. Data points were fit with linear regression. Dotted line indicates 95 per cent confidence interval.

Changes in t-loop stability occur independently of codon mutation

Other RNA elements (e.g. packaging signals) may contribute or drive the changes in t-loop stability observed in Figs. 3 and 4, but presently it is unclear where they are located and how much each putative RNA element contributes to the IAV infection cycle. However, it is well established that IAVs proteins adapt through amino acid mutation, including the polymerase subunits (Elshina and Te Velthuis 2021), and thus that codon changes occur in the various genome segments. To investigate whether the observed changes in t-loop ΔG stability across the IAV H3N2 genome segments were the result of codon mutation, we plotted the change in ΔG against the amino acid changes (Fig. 5A) for the PB2 (14 changes), PB1 (13 changes), and PA (21 changes) segments (Fig. 5B). Quantification of the number of t-loop ΔG changes that co-occurred with amino acid changes showed that approximately 69 per cent of t-loop ΔG changes occurred independently of changes at the codon level in segment 2 (Table 1). For segments 1 and 3, 86–81 per cent of t-loop ΔG changes occurred independently of changes at the codon level (Table 1). Taking into account the number of codons encoding the three RNA polymerase subunits (i.e. 2232) and the number to t-loop ΔG changes we observe relative to the 1968 H3N2 isolate, a chi-squared analysis yields P = 0. Overall, this suggests that the t-loop ΔG changes occur independently of non-synonymous mutation.

Figure 5.

Figure 5.

Co-occurrence of t-loop structure stability and amino acids changes. The IAV RNA polymerase genes accumulate codon changes that alter the protein amino acid sequence of the RNA polymerase subunits. These amino acid changes alter protein function and RNA polymerase activity. To investigate if the observed trends in t-loop destabilization were driven by changes in the codons, we plotted the t-loop stability for the codons (annotated vertically with dotted boundaries) that change for each of the RNA polymerase genes over time and 17 surrounding nucleotides. The colour scale shown next to each graph indicates the t-loop ΔG in kcal/mol. Codon changes are indicated with a solid vertical line. A further vertical line is shown as an additional change occurred. Quantification of the co-occurrence of codon and t-loop stability changes is shown in Table 1.

Table 1.

Coincidence of t-loop stability changes and codon mutation.

Codon mutation co-occurrence
Segment Yes # (%) No # (%) Total # of codons Total # of t-loop changesa
Segment 1 (PB2) 2 (14.3%) 12 (85.7%) 759 1697
Segment 2 (PB1) 4 (30.8%) 9 (69.2%) 757 1649
Segment 3 (PA) 4 (19%) 17 (81%) 716 1600
a

t-loop change is defined as any change in the average ΔG over the years after 1968 relative to 1968.

The IAV H1N1 genome shows a loss of secondary structure destabilization

The above data showed that the putative t-loops in the H3N2 PB1 segment were destabilized more quickly than the t-loops in the H3N2 PA and PB2 segments. We wondered if the different origin of the PB1 segment, which had been introduced into IAV H3N2 through a reassortment event with an IAV from the avian reservoir prior to 1968, was driving this increased rate in transient RNA structure destabilization (Wendel et al. 2015). To explore the possibility that adaptation of an avian origin genome segment to the human host was contributing to the change in t-loop stability in the PB1 segment, we analysed the genome sequences of IAV H1N1 isolates (Fig. 6).

Figure 6.

Figure 6.

Total MFE and t-loop free energy analysis for IAV H1N1 isolates between 1918 and 2021. Analysis of the cRNA (orange) and vRNA (black) is shown. For each year, the first 50 full-length sequences identified were analysed and plotted. Dotted vertical line indicates start of 2009 H1N1 pandemic.

In recent memory, H1N1 IAV has been associated with two pandemics in which one or more genome segments were of avian IAV origin (Scholtissek et al. 1978; Webster et al. 1992). In 1918, an avian IAV spilled over into humans and gradually adapted, until it stopped circulating in humans at the start of the 1957 H2N2 pandemic. The same partially adapted H1N1 strain then remerged in 1977 and continued to circulate as seasonal IAV. In 2009, a triple-reassorted pandemic virus emerged, which contained PB2 and PA segments from an avian IAV and a PB1 segment from a human-adapted H3N2 IAV (Taubenberger and Morens 2009; Zeng et al. 2011).

Analysis of the t-loop stability in the three longest H1N1 IAV genome segments revealed a gradual loss of overall t-loop stability in all three segments between 1918 and 1957. Between 1977 and 2009, the t-loop stability remained relatively unchanged for the PB1 segment, whereas the PA segment continued to show some destabilization. The t-loop stability in the PB2 segment was more diverse, and particularly widespread from 1998 to 2009. In 2009, a shift in t-loop stability occurred in the PA segment, in line with our expectations for the introduction of an avian-adapted gene segment through reassortment. At the same time, the H3N2-derived PB1 segment displayed a rapid reduction in t-loop stability, whereas the avian-adapted PB2 segment showed only a modest reduction in t-loop stability. Analysis of the overall MFE ΔG in the H1N1 genome segments showed that introduction of a new segment also caused a shift in MFE ΔG. In contrast to the t-loop ΔG, no trend in MFE ΔG change was observed following the reassortment event. In line with our analysis of the H3N2 genome, this suggests that t-loop adaptation occurs independently of changes in other RNA structures in the genome. Overall, the above analysis suggests that IAV reassortment is followed by changes in t-loop stability, and that these changes can occur in both avian-adapted as well as human-adapted IAV genome segments following reassortment.

The influenza B virus genome does not show secondary structure destabilization

Having observed the relatively rapid change in IAV H3N2 and H1N1 secondary structure stability following IAV reassortment, we analyzed the genome of the IBV (Fig. 7). Since current IBV strains have been co-circulating among humans and not been associated with spill over events from a non-mammalian animal reservoir, we expected no (rapid) change in t-loop ΔG over time. Analysis of the t-loop stability indeed showed no trend that was indicative of stabilization or destabilization. The lack of a trend was both evident for both the vRNA and the cRNA sense of the three segments investigated (Fig. 7).

Figure 7.

Figure 7.

Total MFE and t-loop free energy analysis for IBV. Analysis of the cRNA (orange) and vRNA (black) is shown. For each year, the first 50 full-length sequences were analysed and plotted.

In contrast to the t-loop analysis, analysis of the MFE showed a more complicated picture for the first three segments of the IBV genome (Fig. 7). For both the PB2 and PB1 segments, we noted a bifurcation in the MFE stability around 1990. This bifurcation, which reduced and increased stability, was particularly prominent for the vRNA sense, and likely linked to the two IBV lineages that have been co-circulating since the 1980s. The MFE of the PA segment was more broadly distributed and did not show a pattern. Overall, these analyses indicate that the t-loop stability in the IBV genome segments is different from the IAV genome segments. It is tempting to speculate that this difference between IAV and IBV is caused by the absence of spill over of IBV strains from a non-mammalian animal reservoir.

The SARS-CoV-2 genome shows a loss of secondary structure destabilization

In 2019, SARS-CoV-2 emerged as the causative agent of the COVID-19 pandemic (Wu et al., 2020). Over the course of the pandemic, point mutations and deletions occurred and accumulated in the SARS-CoV-2 genome. Some of these mutations have been linked to amino acid changes that improved host receptor binding, while others caused deletions of known secondary structures. However, many synonymous mutations and less obvious deletions have not yet been linked to improved viral functions, although it has been observed that some variants display increased replication in tissue culture, suggesting that these mutations and deletions may contribute to viral replication efficiency (Pyke et al. 2021). Based on our observations with IAV, we hypothesized that t-loop-like structures may be changing in the SARS-CoV-2 genome.

To investigate the above question, we first estimated the footprint of the SARS-CoV-2 replication and transcription complex (RTC). To this end, we aligned motif C of the cryo-EM structure of the SARS-CoV-2 RTC (PDB 7CYQ) with motif C of the cryo-EM structure of the IAV RNA polymerase (PDB 6T0V) in Pymol and estimated the route that template RNA would take through the polymerase. We concluded that 20–22 nt of the template would be covered by the RTC during RNA synthesis (Wandzik et al. 2020; Yan et al. 2021). Our initial analyses showed that the above footprint parameter range did not affect the trends for the calculated MFE ΔG and the t-loop ΔG. We therefore set the footprint to the lower of these values for the data presented below, in order to collect more t-loop datapoints per genome.

We next analysed the original SARS-CoV-2 isolate and several SARS-CoV-2 variants of concern using our computational model. For each variant, we analysed the first 50, full-length, high-quality sequences published. We found that both in the positive and negative sense of the full-length genome, the MFE ΔG and t-loop ΔG trended towards a less negative ΔG, and thus less stable RNA structure content, over the course of the COVID-19 pandemic (Fig. 8). The trends could be fit with linear regression (P ≤ 0.0001).

Figure 8.

Figure 8.

Total MFE and t-loop free energy analysis for SARS-CoV-2. Analysis of the positive sense genomic RNA (black; purple bars indicate mean) and negative sense RNA (orange; green bars indicate mean) is shown. For each year, the first 50 full-length sequences were analysed and plotted.

Unlike IAV strains, SARS-CoV-2 has only been circulating among humans for a short time. To investigate whether the observed trends were generated by chance, we simulated mutations in the reference SARS-CoV-2 genome, with frequencies of 1.8 × 10−3 and 3.95 × 10−4 per nucleotide per year as the upper and lower bounds of the whole genome mutation rate (Li et al. 2020; Shen, Zhang, and He 2021). We performed the mutation simulations for 3 consecutive years, while maintaining the codon sequence, and next calculated the t-loop ΔG for the in silico variants. As shown in Fig. S3, the t-loop ΔG for the in silico variants was strikingly less negative than the actual SARS-CoV-2 variants, suggesting that the observed changes in the analysed SARS-CoV-2 variants had not likely arisen by chance, but also that the changes in the SARS-CoV-2 genome are more constrained than assumed during in silico mutation. This result contrasted the in silico mutation of the IAV segments.

Discussion

Respiratory RNA viruses evolve as they circulate among humans. It is well understood how mutations in viral proteins affect enzyme function and protein–protein interactions. In addition, mutations in long-range RNA secondary structures may affect RNA genome packaging or genome circularization during positive sense RNA virus replication. Currently, little is still known about the role of transient RNA secondary structures, or t-loops, which were proposed to only form when the viral replication complex copies the viral genome and modulates the local RNA secondary structure landscape (French et al. 2022).

We here considered the role of the IAV RNA polymerase and NP footprint to compute putative t-loops in the IAV genome segments PB2, PB1, and PA. We observed that the transient RNA structure landscape starts to evolve when emerged IAV strains circulate in humans. Interestingly, we also observed that the RNA polymerase of a 2017 H3N2 IAV isolate was more sensitive to t-loops than the RNA polymerase of a 1968 H3N2 IAV isolate, which suggests that the transient RNA structure landscape and t-loop sensitivity of the RNA polymerase become (partly) matched. We observed a similar trend in the evolution of SARS-CoV-2, which recently emerged as a zoonotic virus from an unknown animal reservoir. Such a trend was not evident in the IBV genome, which has circulated in humans without the introduction of non-mammalian adapted sequences for decades. Possible drivers for the observed trends in IAV and SARS-CoV-2 include differences in cellular environment between hosts, including differences in the temperature at the site of replication (Priore, Moss, and Turner 2012).

A bias in mutations towards bases not involved in base pair formation has been previously observed (Simmonds and Griffin 2020). It is possible that the change in the transient RNA structure landscape provides a fitness advantage to emerging respiratory RNA viruses, either by increasing the efficiency of viral replication or modulating the volume of aberrant viral products formed during viral RNA synthesis. In turn, the increased efficiency of viral replication could limit the detection of viral RNA by host pathogen receptors, such as RIG-I and MDA5, and contribute less to the severity to clinical disease. In addition, a reduction in RNA secondary structure (more equivalent to the MFE data in our analysis) has been proposed to reduce recruitment of human proteins, especially those found in stress granules (Vandelli et al. 2020).

Our observations were almost unilaterally based on in silico models that were derived from experiments performed on short IAV templates (French et al. 2022). In vitro assays will need to be performed for the SARS-CoV-2 RNA polymerase to confirm that this enzyme is indeed sensitive to the structures studied here. In addition, cell-based tools will need to be developed to confirm the existence of t-loops in IAV and SARS-CoV-2 genomes and test our findings. Our in silico model also has putative limitations, because it does not account for secondary RNA structures that reside between NP molecules or structures that may be stabilized by RNA–protein interactions in a cellular environment. Such RNA–protein interactions may modulate viral replication processivity in addition to t-loops. Our model also fails to account for the possibility of multiple RNA conformations, instead relying on a prediction of the most thermodynamically stable structure. Future work will need to be performed, if possible, to map such potential elements and their variation among viral strains and test the hypotheses that our study proposes. Future analyses can also focus on developing new statistical frameworks for the rate of stabilization vs destabilization of RNA elements in viral genomes.

Even with the above caveats in mind, our results suggest that changes in transient secondary structure stability are occurring as an emerging respiratory RNA virus adapts to humans. This implies that these elements affect viral replication and that a reduction in stability provides a fitness benefit for replication in human hosts. Removing elements from the viral genome that affect processivity and result in early termination or recombination may increase the efficiency of viral replication. Along the same lines, an increase in stability should provide a fitness advantage in non-mammalian hosts, should a human-adapted virus spill back into a non-mammalian host. Together, these results advance our insight into respiratory RNA virus evolution, and we believe that they yield new testable hypotheses that could help strengthen our framework for studying respiratory RNA virus replication and adaptation.

Material and methods

Cells and plasmids

Human embryonic kidney 293T (HEK293T) cells were originally sourced from ITCC and routinely checked for mycoplasma infection. HEK293T cells were maintained in DMEM containing 10 per cent FBS, glutamate, pyruvate, and high glucose (Gibco). WSN pcDNA3-based protein expression plasmids and pPol I-based viral template RNA expression plasmids were described previously (Vreede, Jung, and Brownlee 2004; Deng et al. 2005; Te Velthuis et al. 2018). The open-reading frames of the H3N2-2017 and H3N2-1968 PB2, PB1, PA, and NP genes were cloned into the pPPI4 expression vector using Gibson cloning. The firefly luciferase reporter plasmid under the control of the IFN-β promoter [pIFΔ(−116)lucter] and the transfection control plasmids constitutively expressing Renilla luciferase (pTK-Renilla) were described previously (Te Velthuis et al. 2018).

Transfections and IFN reporter assays

Transfections to perform RNP reconstitutions and IFN-β reporter assays in HEK293T cells were essentially performed as described previously (Te Velthuis et al. 2018). RNP reconstitutions were carried out in a 24-well format by transfecting 250 ng of the pcDNA or pPPI4 plasmids encoding PB1, PB2, PA, NP, and 250 ng of pPolI plasmid encoding a mvRNA template under the control of the pPolI promoter. HEK293T cells were additionally transfected with 100 ng of plasmid expressing firefly luciferase from the IFN-beta promoter and 10 ng of plasmid expressing Renilla luciferase. Transfections were performed with lipofectamine 2000 (Invitrogen) to the manufacturer’s specifications. Twenty-four hours later, the medium was aspirated, cells were washed with 1 ml PBS and split into two fractions. Cells were pelleted at 2,500 rpm using a benchtop centrifuge. The pellets from one fraction were resuspended in 30 μl of 1x Laemmli buffer for western blotting and the other fraction was resuspended in 50 μl PBS for the IFN-β promoter activity assay. The assay was done in duplicate using 25 μl of cell suspension in PBS per well in a white 96-well plate format. Next, 25 μl of DualGlo reagent (Promega) was added per well, samples were incubated at RT for 10 min (in dark), and the firefly luciferase readings were taken using a Synergy LX Multimode Microplate Reader (Biotek). Twenty-five μl of Stop-Glo reagent/well was added next, plate was incubated for 10 min at RT (in dark), and the Renilla luciferase readings were taken. Firefly luciferase values were normalized by the Renilla luciferase values. Data analysis was performed in Prism 9.5.0 (Graphpad).

Antibodies and Western blotting

IAV proteins were detected using rabbit polyclonal antibodies anti-PB1 (GTX125923, GeneTex), and anti-NP (GTX125989, GeneTex) diluted 1:2000 in blocking buffer (PBS, 5 per cent bovine serum albumin Research Products International (RPI), 0.1 per cent Tween 20 (RPI)). Cellular proteins were detected using the rat monoclonal antibody anti-tubulin (MCA77G, Bio-Rad) diluted 1:3000 in blocking buffer. Secondary antibodies IRDye 800 donkey anti-rabbit (926-32213, LI-COR) and IRDye 680 goat anti-rat (926-68076, LI-COR) were used to detect Western signals with a LI-COR Odyssey scanner.

IAV H3N2 and H1N1, and IBV sequence acquisition and processing

Initially, our analysis was performed on a limited, representative set of H3N2 sequences. To expand this set, 50 full-length IAV H3N2 sequences per year were acquired from EpiFlu (gisaid.org) for 1968–2014, and 2017 was downloaded from NCBI (see Table S2). Fifty full-length IAV H1N1 per year were acquired from NCBI for 1918–2021. The acquired sequences had >90 per cent full gene segment length and <1 per cent ambiguous nucleotide residues in all gene segments. The 11–12 nt-long conserved IAV promoter sequences were added manually if they had been trimmed and/or were missing in the downloaded sequences. Incomplete sequences that could not be manually updated were removed.

SARS-CoV-2 variant sequence acquisition and processing

Sequences were acquired from EpiCoV (gisaid.org) are listed in Table S2. Briefly, sequences were required to fit the following criteria: high coverage (<1%  N and <0.05%  unique amino acid mutations), human isolate, complete date record, and considered a complete sequence. Sequences were then sorted by sample collection date and the first 50 complete sequences selected for downstream analysis. This approach allowed us to only include the first 50 sequences of a new variant of concern, limiting putative sequence drift, incorrect variant calling, and recombination between variants. A limitation of our data selection procedure was that it overrepresented the geographic regions in which variants of concern emerged.

After selection, sequences were aligned using EMBL-EBLI Clustal Omega (Madeira et al. 2022). We noted substantial sequence variation at the 3′ and 5′ ends of the SARS-CoV-2 genome, which was likely caused by differences in amplicon design or genome assembly among labs. Prior to further analysis, we trimmed the genome sequences to nucleotide positions 602 and 27,552 of the SARS-CoV-2 reference sequence (NC_045512.2). The resulting sequences were approximately 26,935 nt in length, with variation stemming from insertions or deletions.

In silico mutation of the SARS-CoV genome

The SARS-CoV-2 reference genome (NC_045512.2) was mutated at frequencies of 1.8 × 10−3 and 3.95 × 10−4 per nucleotide per year (Li et al. 2020; Shen, Zhang, and He 2021). These frequencies were used as upper and lower estimations for SARS-CoV-2s whole genome mutation rate. IAV H3N2 PB2 and PB1 genome segments were mutated at frequencies 3.2 × 10−3 and 2.5 × 10−4 to represent upper and lower bounds of estimated mutation rates (Yang et al. 2014; Pauly, Procario, and Lauring 2017). Mutations were performed in python 3.10.0 using the Mutation-Simulator package (Kühl, Stich, and Ries 2021).

In silico analysis of t-loop and other secondary RNA structures

Our t-loop analysis was essentially performed as described previously (French et al. 2022), but with minor modifications to account for NP binding up and downstream of the IAV RNA polymerase during replication and RNP assembly (Fig. 2). Specifically, we used a sliding-window analysis, in which a sequence covered by the 20-nt footprint of the IAV RNA polymerase is blocked from participating in secondary RNA structure formation. Our in silico analysis starts with the RNA polymerase binding to the 3′-end of a viral genome (segment), which places residue 16 within the 20-nt footprint in the active site (Kouba, Drncová, and Cusack 2019; Wandzik et al. 2020). We next assume that NP has a footprint of 24 nt and that the separation of the template RNA from the downstream or upstream NP in the RNP makes 24 nt available for base pairing (Fig. 2B) (Ortega et al. 2000). With both an upstream and downstream NP are removed, t-loop as well as any other secondary RNA structure formation can be calculated between the two 24-nt stretches.

The above sliding window was started at the 3′-end of the viral genome and moved in 1-nt intervals along the length of the template RNA. For each window, the RNA sequence upstream of the RNA polymerase footprint, downstream of the RNA polymerase footprint, and between the upstream and downstream sequences was analysed and the stability ΔG of the RNA structures calculated using the ViennaRNA package v.2.5 within Python 3.9 (Lorenz et al. 2011). Locations were recorded based on the position of the active site (i.e. starting at position 16 with ΔG > 0, because no template had emerged from the RNA polymerase yet). Next, the total ΔG that the RNA polymerase encounters while copying the template was calculated.

Consensus sequence generation and GC content analysis

H3N2 consensus sequences for PB2, PB1, and PA were generated using EMBOSS Cons and GC content was calculated using Bio.SeqUtils package from BioPython (Cock et al. 2009; Madeira et al. 2022).

Supplementary Material

vead056_Supp

Acknowledgements

The authors would like to thank Dr Michael Oade, Rene Vigeveno, and Sarah van Leeuwen for discussions and reagents. Portions of the work reported in this paper were performed using the Princeton Research Computing resources at Princeton University, which is a consortium of groups led by the Princeton Institute for Computational Science and Engineering (PICSciE) and Office of Information Technology’s Research Computing. CVR was supported by a studentship from Public Health England. K.R.S. was supported by NIH Grant Nos. R01 GM140032 and R01 AI170520. K.B. was supported by NIH Grant No. DP2 AI175474. A.T.V. was supported by NIH Grant Nos. R21 AI147172, DP2 AI175474, and R01 AI170520, and Wellcome Trust and Royal Society Grant No. 206579/17/Z.

Contributor Information

Charlotte V Rigby, Lewis Thomas Laboratory, Department of Molecular Biology, Princeton University, Washington Road, Princeton, NJ 08544, USA; Department of Pathology, Addenbrooke’s Hospital, University of Cambridge, Hills Road, Cambridge CB2 2QQ, UK; Addenbrooke’s Hospital, Public Health England, Hills Road, Cambridge CB2 2QQ, UK.

Kimberly R Sabsay, Lewis Thomas Laboratory, Department of Molecular Biology, Princeton University, Washington Road, Princeton, NJ 08544, USA; Carl Icahn Laboratory, Lewis-Sigler Institute, Princeton University, South Drive, Princeton, NJ 08544, USA.

Karishma Bisht, Lewis Thomas Laboratory, Department of Molecular Biology, Princeton University, Washington Road, Princeton, NJ 08544, USA.

Dirk Eggink, Department of Medical Microbiology, Amsterdam UMC, Meibergdreef 9, Amsterdam 1105 AZ, The Netherlands.

Hamid Jalal, Addenbrooke’s Hospital, Public Health England, Hills Road, Cambridge CB2 2QQ, UK.

Aartjan J W te Velthuis, Lewis Thomas Laboratory, Department of Molecular Biology, Princeton University, Washington Road, Princeton, NJ 08544, USA; Center for Infectious Disease Control, National Institute for Public Health and the Environment (RIVM), Antonie van Leeuwenhoeklaan 9, Bilthoven 3721 MA, the Netherlands.

Supplementary data

Supplementary data is available at Virus Evolution online.

Conflict of interest:

None declared.

References

  1. Abbasian M. H. et al. (2023) ‘Global Landscape of SARS-CoV-2 Mutations and Conserved Regions’, Journal of Translational Medicine, 21: 152. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Alvarez-Munoz S. et al. (2021) ‘Key Factors that Enable the Pandemic Potential of RNA Viruses and Inter-Species Transmission: A Systematic Review’, Viruses, 13: 537. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Baranovskaya I. et al. (2019) ‘Changes in RNA Secondary Structure Affect NS1 Protein Expression during Early Stage Influenza Virus Infection’, Virology Journal 16: 162. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Bouvier N. M., and Palese P. (2008) ‘The Biology of Influenza Viruses’, Vaccine, 26: D49–53. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Chang P. et al. (2020) ‘Immune Escape Adaptive Mutations in the H7N9 Avian Influenza Hemagglutinin Protein Increase Virus Replication Fitness and Decrease Pandemic Potential’, Journal of Virology, 94: e00216–20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Cock P. J. A. et al. (2009) ‘Biopython: Freely Available Python Tools for Computational Molecular Biology and Bioinformatics’, Bioinformatics, 25: 1422–3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Czudai-Matwich V. et al. (2014) ‘PB2 Mutations D701N and S714R Promote Adaptation of an Influenza H5N1 Virus to a Mammalian Host’, Journal of Virology, 88: 8735–42. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Dadonaite B. et al. (2019) ‘The Structure of the Influenza A Virus Genome’, Nature Microbiology, 4: 1781–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Deng T. et al. (2005) ‘In Vitro Assembly of PB2 with a PB1-PA Dimer Supports a New Model of Assembly of Influenza A Virus Polymerase Subunits into a Functional Trimeric Complex’, Journal of Virology, 79: 8669–74. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Di Lella S., Herrmann A., and Mair C. M. (2016) ‘Modulation of the pH Stability of Influenza Virus Hemagglutinin: A Host Cell Adaptation Strategy’, Biophysical Journal, 110: 2293–301. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Elderfield R. A. et al. (2014) ‘Accumulation of Human-Adapting Mutations during Circulation of A(H1N1)pdm09 Influenza Virus in Humans in the United Kingdom’, Journal of Virology, 88: 13269–83. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Elshina E., and Te Velthuis A. J. W. (2021) ‘The Influenza Virus RNA Polymerase as an Innate Immune Agonist and Antagonist’, Cellular and Molecular Life Sciences, 78: 7237–56. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Forbes N. et al. (2013) ‘Identification of Adaptive Mutations in the Influenza A Virus Non-Structural 1 Gene that Increase Cytoplasmic Localization and Differentially Regulate Host Gene Expression’, PLOS ONE, 8: e84673. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. French H. et al. (2022) ‘Transient RNA Structures Cause Aberrant Influenza Virus Replication and Innate Immune Activation’, Science Advances, 8: 8655–8655. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Fujii K. et al. (2009) ‘Incorporation of Influenza A Virus Genome Segments Does Not Absolutely Require Wild-type Sequences’, Journal of General Virology, 90: 1734–40. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Furuse Y., and Oshitani H. (2011) ‘Evolution of the Influenza A Virus Untranslated Regions’, Infection, Genetics and Evolution, 11: 1150–4. [DOI] [PubMed] [Google Scholar]
  17. Gupta D. et al. (2021) ‘Structural and Functional Insights into the Spike Protein Mutations of Emerging SARS-CoV-2 Variants’, Cellular and Molecular Life Sciences, 78: 7967–89. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Harari S. et al. (2022) ‘Drivers of Adaptive Evolution during Chronic SARS-CoV-2 Infections’, Nature Medicine, 28: 1501–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Huston N. C. et al. (2021) ‘Comprehensive in Vivo Secondary Structure of the SARS-CoV-2 Genome Reveals Novel Regulatory Motifs and Mechanisms’, Molecular Cell, 81: 584–598.e5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Hutchinson E. C. et al. (2009) ‘Characterisation of Influenza A Viruses with Mutations in Segment 5 Packaging Signals’, Vaccine, 27: 6270–5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Jiménez-Alberto A. et al. (2013) ‘Analysis of Adaptation Mutants in the Hemagglutinin of the Influenza A(H1N1)pdm09 Virus’, PLOS ONE, 8: e70005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Johnson B. A. et al. (2022) ‘Nucleocapsid Mutations in SARS-CoV-2 Augment Replication and Pathogenesis’, PLOS Pathogens, 18: e1010627. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Kistler K. E., Huddleston J., and Bedford T. (2022) ‘Rapid and Parallel Adaptive Mutations in Spike S1 Drive Clade Success in SARS-CoV-2’, Cell Host & Microbe, 30: 545–555.e4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Kouba T., Drncová P., and Cusack S. (2019) ‘Structural Snapshots of Actively Transcribing Influenza Polymerase’, Nature Structural & Molecular Biology, 26: 460–70. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Kudriavtsev A. V. et al. (2022) ‘Immune Escape Associated with RBD Omicron Mutations and SARS-CoV-2 Evolution Dynamics’, Viruses, 14: 1603. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Kühl M. A., Stich B., and Ries D. C. (2021) ‘Mutation-Simulator: Fine-grained Simulation of Random Mutations in Any Genome’, Bioinformatics, 37: 568–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Laha S. et al. (2020) ‘Characterizations of SARS-CoV-2 Mutational Profile, Spike Protein Stability and Viral Transmission’, Infection, Genetics and Evolution, 85: 104445. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Le Sage V. et al. (2018) ‘Non-Uniform and Non-Random Binding of Nucleoprotein to Influenza A and B Viral RNA’, Viruses, 10: 522. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Li X. et al. (2020) ‘Transmission Dynamics and Evolutionary History of 2019-nCoV’, Journal of Medical Virology, 92: 501–11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Lin R.-W. et al. (2019) ‘Naturally Occurring Mutations in PB1 Affect Influenza A Virus Replication Fidelity, Virulence, and Adaptability’, Journal of Biomedical Science, 26: 55. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Liu Y. et al. (2022) ‘The N501Y Spike Substitution Enhances SARS-CoV-2 Infection and Transmission’, Nature, 602: 294–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Lorenz R. et al. (2011) ‘ViennaRNA Package 2.0’, Algorithms for Molecular Biology, 6: 26–26. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Madeira F. et al. (2022) ‘Search and Sequence Analysis Tools Services from EMBL-EBI in 2022’, Nucleic Acids Research 50: W276–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Mänz B., Schwemmle M., and Brunotte L. (2013) ‘Adaptation of Avian Influenza A Virus Polymerase in Mammals to Overcome the Host Species Barrier’, Journal of Virology, 87: 7200–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. McCarthy K. R. et al. (2021) ‘Recurrent Deletions in the SARS-CoV-2 Spike Glycoprotein Drive Antibody Escape’, Science, 371: 1139–42. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Miotto O. et al. (2010) ‘Complete-Proteome Mapping of Human Influenza A Adaptive Mutations: Implications for Human Transmissibility of Zoonotic Strains’, PLOS ONE, 5: e9025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Mou K. et al. (2021) ‘Emerging Mutations in Envelope Protein of SARS-CoV-2 and Their Effect on Thermodynamic Properties’, Informatics in Medicine Unlocked, 25: 100675. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Ortega J. et al. (2000) ‘Ultrastructural and Functional Analyses of Recombinant Influenza Virus Ribonucleoproteins Suggest Dimerization of Nucleoprotein during Virus Amplification’, Journal of Virology, 74: 156–63. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Pauly M. D., Procario M. C., and Lauring A. S. (2017) ‘A Novel Twelve Class Fluctuation Test Reveals Higher than Expected Mutation Rates for Influenza A Viruses’, eLife, 6, e26437. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Piasecka J. et al. (2020) ‘RNA Secondary Structure Motifs of the Influenza A Virus as Targets for siRNA-Mediated RNA Interference’, Molecular Therapy—Nucleic Acids, 19: 627–42. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Priore S. F. et al. (2015) ‘The Influenza A PB1-F2 and N40 Start Codons are Contained within an RNA Pseudoknot’, Biochemistry, 54: 3413–5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Priore S. F., Moss W. N., and Turner D. H. (2012) ‘Influenza A Virus Coding Regions Exhibit Host-Specific Global Ordered RNA Structure’, PLOS ONE, 7: e35989. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Pyke A. T. et al. (2021) ‘Replication Kinetics of B.1.351 and B.1.1.7 SARS-CoV-2 Variants of Concern Including Assessment of a B.1.1.7 Mutant Carrying a Defective ORF7a Gene’, Viruses, 13: 1087–1087. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Rochman N. D. et al. (2021) ‘Ongoing Global and Regional Adaptive Evolution of SARS-CoV-2’, Proceedings of the National Academy of Sciences, 118: e2104241118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Ryder S. P. et al. (2021) ‘Analysis of Emerging Variants in Structured Regions of the SARS-CoV-2 Genome’, Evolutionary Bioinformatics, 17: 11769343211014168. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Scholtissek C. et al. (1978) ‘On the Origin of the Human Influenza Virus Subtypes H2N2 and H3N2’, Virology, 87: 13–20. [DOI] [PubMed] [Google Scholar]
  47. Selman M. et al. (2012) ‘Adaptive Mutation in Influenza A Virus Non-structural Gene is Linked to Host Switching and Induces a Novel Protein by Alternative Splicing’, Emerging Microbes & Infections, 1: e42. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Shen S., Zhang Z., and He F. (2021) ‘The Phylogenetic Relationship within SARS-CoV-2s: An Expanding Basal Clade’, Molecular Phylogenetics and Evolution, 157: 107017–107017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Simmonds P., and Griffin D. E. (2020) ‘Pervasive RNA Secondary Structure in the Genomes of SARS-CoV-2 and Other Coronaviruses’, mBio, 11: 1–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Sonnleitner S. T. et al. (2022) ‘Cumulative SARS-CoV-2 Mutations and Corresponding Changes in Immunity in an Immunocompromised Patient Indicate Viral Evolution within the Host’, Nature Communications, 13: 2560. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Taubenberger J. K., and Morens D. M. (2009) ‘Pandemic Influenza—Including a Risk Assessment of H5N1’, Revue Scientifique Technique Int off Epizoot, 28: 187–202. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Taubenberger JK, Morens DM. (2010) ‘Influenza: The Once and Future Pandemic’, Public Health Reports, 125: 16–26. [PMC free article] [PubMed] [Google Scholar]
  53. Te Velthuis A. J. W. et al. (2018) ‘Mini Viral RNAs Act as Innate Immune Agonists during Influenza Virus Infection’, Nature Microbiology, 3: 1234–42. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Thompson A. J., and Paulson J. C. (2021) ‘Adaptation of Influenza Viruses to Human Airway Receptors’, The Journal of Biological Chemistry, 296: 100017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Vandelli A. et al. (2020) ‘Structural Analysis of SARS-CoV-2 Genome and Predictions of the Human Interactome’, Nucleic Acids Research, 48: 11270–83. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Verkhivker G. M. et al. (2021) ‘Comparative Perturbation-Based Modeling of the SARS-CoV-2 Spike Protein Binding with Host Receptor and Neutralizing Antibodies: Structurally Adaptable Allosteric Communication Hotspots Define Spike Sites Targeted by Global Circulating Mutations’, Biochemistry, 60: 1459–84. [DOI] [PubMed] [Google Scholar]
  57. Vreede F. T., Jung T. E., and Brownlee G. G. (2004) ‘Model Suggesting that Replication of Influenza Virus is Regulated by Stabilization of Replicative Intermediates’, Journal of Virology, 78: 9568–72. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Wandzik J. M. et al. (2020) ‘A Structure-Based Model for the Complete Transcription Cycle of Influenza Polymerase’, Cell, 181: 877–893.e21. [DOI] [PubMed] [Google Scholar]
  59. Webster R. G. et al. (1992) ‘Evolution and Ecology of Influenza A Viruses’, Microbiological Reviews, 56: 152–79. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Weisblum Y. et al. (2020) ‘Escape from Neutralizing Antibodies by SARS-CoV-2 Spike Protein Variants’, eLife, 9: e61312. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Wen L. et al. (2018) ‘Large-scale Sequence Analysis Reveals Novel Human-adaptive Markers in PB2 Segment of Seasonal Influenza A Viruses’, Emerging Microbes & Infections, 7: 1–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Wendel I. et al. (2015) ‘The Avian-Origin PB1 Gene Segment Facilitated Replication and Transmissibility of the H3N2/1968 Pandemic Influenza Virus’, Journal of Virology, 89: 4170–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Yan L. et al. (2021) ‘Cryo-EM Structure of an Extended SARS-CoV-2 Replication and Transcription Complex Reveals an Intermediate State in Cap Synthesis’, Cell, 184: 184–193.e10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Yang J.-R. et al. (2014) ‘Newly Emerging Mutations in the Matrix Genes of the Human Influenza A(H1N1)pdm09 and A(H3N2) Viruses Reduce the Detection Sensitivity of Real-Time Reverse Transcription-PCR’, Journal of Clinical Microbiology, 52: 76–82. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Yang Z. et al. (2021) ‘SARS-CoV-2 Variants Increase Kinetic Stability of Open Spike Conformations as an Evolutionary Strategy’, mBio, 13: e0322721. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Zeng H. et al. (2011) ‘The 2009 Pandemic H1N1 and Triple-Reassortant Swine H1N1 Influenza Viruses Replicate Efficiently but Elicit an Attenuated Inflammatory Response in Polarized Human Bronchial Epithelial Cells’, Journal of Virology, 85: 686–96. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Zhu N. et al. (2020) ‘A Novel Coronavirus from Patients with Pneumonia in China, 2019’, New England Journal of Medicine, 382: 727–33. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

vead056_Supp

Articles from Virus Evolution are provided here courtesy of Oxford University Press

RESOURCES