Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2015 Feb 2;112(7):2227–2232. doi: 10.1073/pnas.1420812112

Revealing the density of encoded functions in a viral RNA

Nikesh Patel a, Eric C Dykeman b,c, Robert H A Coutts d,1, George P Lomonossoff e, David J Rowlands a, Simon E V Phillips f, Neil Ranson a, Reidun Twarock b,c, Roman Tuma a, Peter G Stockley a,2
PMCID: PMC4343168  PMID: 25646435

Significance

Single-stranded RNA viruses self-assemble protective protein containers around their cognate genomes rapidly and efficiently at low concentrations. RNA encapsidation in vivo occurs preferentially with the cognate genome, in contrast to many in vitro reassembly experiments. We describe in molecular detail how this specificity and efficiency is accomplished using multiple contacts between coat proteins and dispersed packaging signals in the viral genome. The sequences and relative positioning of the packaging signals are important for this mechanism, creating a strong evolutionary constraint. Packaging signals overlap untranslated and coding regions ensuring assembly is in competition with other functions of the genome. Disrupting these contacts has deleterious consequences for capsid assembly identifying a novel antiviral drug target.

Keywords: virus assembly, single-molecule fluorescence correlation spectroscopy, satellite tobacco necrosis virus, packaging signal

Abstract

We present direct experimental evidence that assembly of a single-stranded RNA virus occurs via a packaging signal-mediated mechanism. We show that the sequences of coat protein recognition motifs within multiple, dispersed, putative RNA packaging signals, as well as their relative spacing within a genomic fragment, act collectively to influence the fidelity and yield of capsid self-assembly in vitro. These experiments confirm that the selective advantages for viral yield and encapsidation specificity, predicted from previous modeling of packaging signal-mediated assembly, are found in Nature. Regions of the genome that act as packaging signals also function in translational and transcriptional enhancement, as well as directly coding for the coat protein, highlighting the density of encoded functions within the viral RNA. Assembly and gene expression are therefore direct molecular competitors for different functional folds of the same RNA sequence. The strongest packaging signal in the test fragment, encodes a region of the coat protein that undergoes a conformational change upon contact with packaging signals. A similar phenomenon occurs in other RNA viruses for which packaging signals are known. These contacts hint at an even deeper density of encoded functions in viral RNA, which if confirmed, would have profound consequences for the evolution of this class of pathogens.


Almost all natural nucleic acid sequences appear in the form of RNA polynucleotides at some point in their existence. The concept of “one RNA, one function” (e.g., for tRNA or mRNA) is increasingly seen as far too simplistic with RNAs known to alter their secondary/tertiary structures in regulated ways to control the appearance of distinct functional states (1). This multifunctionality is also true of the genomes of single-stranded (ss) RNA viruses, where refolding to promote replication, translation and control of gene expression are well known (2). Here we describe additional layers of previously unsuspected functions within the genome of the satellite plant virus, Satellite Tobacco Necrosis Virus (STNV). Aspects of these functions are potential drug targets for this class of viruses, which include major human pathogens and a significant fraction of emerging viral diseases (3, 4). Virus-like particles (VLPs) are also increasingly being used both for vaccines and for delivery of imaging, diagnostic, or therapeutic agents (5, 6). Thus, understanding the detailed molecular mechanisms used by these viruses for assembly is of widespread interest.

Previously we showed, using single molecule fluorescence correlation spectroscopy (smFCS), that at low (nanomolar) concentrations the coat proteins (CPs) of STNV and bacteriophage MS2 package their cognate genomes preferentially (7). Cognate packaging in vitro is accompanied by a characteristic drop in the hydrodynamic radius (Rh) of the viral RNA, making it smaller than its capsid as a subset of CPs bind. We have shown that this behavior is consistent with an assembly mechanism (8) controlled by the interaction of multiple CPs with degenerate RNA sequence/structure sites we term packaging signals (PSs), distributed across the genomes. PSs are bound sequence-specifically by their cognate CPs, increasing the yield, rate and/or fidelity of assembly. PS-mediated assembly has many potential advantages for the virus, including solving a viral version of the Levinthal Paradox (9). For the T = 3 MS2, there is a known assembly origin, the 19-nt-long coat protein gene translational operator (TR) stem-loop (10). TR functions as an allosteric effector, switching the dimeric viral capsomere between the two quasi-conformers required to build the capsid (11). This mechanism requires 60 TR-like sites within the genomic RNA for CP contacts (12). These defined RNA–CP contacts require the RNA in every particle be in a very similar conformation, and an asymmetric tomographic reconstruction is consistent with this view (13).

For STNV, there were no known PSs, so we used RNA SELEX combined with a theoretical analysis protocol we designed for the identification of PSs from SELEX data, to isolate preferred sequences for CP binding. This dual approach identified an aptamer whose sequence matches a region of the cognate genome (14). Both the aptamer (B3) and genomic sites (B3-like, PS3, Fig. 1 A and B) are predicted to fold into stem-loops. An X-ray structure of VLPs reassembled around B3 shows that the presence of multiple copies of a preferred RNA sequence increases the ordering of the N-terminal region of the STNV CP. The N-terminal helix extends by roughly one turn to include amino acid residue 8, in contrast to the virion or mRNA containing VLP, in which the electron density shows that the polypeptide is disordered before residue 12. These data suggest that B3 acts like a PS (15) (Fig. 1).

Fig. 1.

Fig. 1.

The components of the STNV system. (A) (Upper Left) Ribbon diagram of the STNV CP subunit structure seen in virions [Protein Data Bank (PDB) ID 4V4M] with the disordered N-terminal amino acid sequence shown dashed, with the sequence of the first 25 amino acids below. When this assembles around the B3 aptamer, sequence and secondary structure shown in Upper Right (nucleotide identities with the STNV-1 genome in red), it forms a T = 1 capsid (Lower Right, PDB 3S4G), in which there is additional ordering of the CP N-terminal region (magenta helical turn) in response to binding the preferred RNA sequence. The virion is stabilized by multiple Ca2+ ions that bind around the differing symmetry axes (shown as color-coded dots). (B) The sequence and putative secondary structure of the 5′ genomic fragment encompassing five putative PSs, named in order 5′ to 3′ as PS1-5, respectively. Red nucleotides indicate the potential CP recognition motif (AXXA) and are the sites mutated to U to create variant fragments. The green G is a nonviral nucleotide added for ease of dye-labeling. (C) Map of the STNV1 genome showing the positions of these putative packaging signals (red) with respect to the CP gene.

SmFCS confirms that B3 binds STNV CP with high affinity (low nM) to form a capsomere, probably a CP trimer with at least one B3 bound, that can then be assembled into T = 1 capsids in response to increasing concentrations of CP. This reaction is highly sequence-specific, because oligos containing B3 variant in the recognition motif fail to assemble under identical conditions. Despite the catalytic role of B3, assembly is highly dependent on electrostatic interactions: The VLPs formed are sensitive to both the ionic strength and the removal of structural calcium ions. Using a 5′ genomic fragment (127 nt long) encompassing the B3-like PS3 and two additional putative PSs on either side, we have confirmed that CP-induced reduction in Rh occurs as a consequence of binding PS3. The five PSs work collectively to promote successful capsid assembly and their spacing within the fragment is also important. These functions are encoded within a region heavily implicated in enhancing translation and viral RNA replication (1619). In addition, PS3 is located within the CP region that encodes the amino acid sequence undergoing conformational change in response to PS binding. If similar contacts occur more widely for RNA viruses there may be even deeper embedding of function in viral RNAs.

Sequence-Specificity of Assembly

To study in vitro reassembly of STNV VLPs at low CP concentrations, we used fluorescently labeled oligonucleotides (see Fig. 1 A and B and Table S1 for details). This method enabled smFCS assembly assays at nanomolar RNA concentrations. Initially we monitored CP titrations into labeled B3 (1 nM) under reassembly conditions. This experiment reveals formation of a high affinity RNA–CP complex that begins to form at 1 nM CP and is fully formed by 5 nM CP. It has an apparent Rh of ∼5 nm (Fig. 2B), consistent with it being a CP trimeric capsomere based on the X-ray structure of the B3 VLP. Photon counting statistics (SI Materials and Methods) were used to estimate the affinity of the CP–B3 interaction, yielding a Kd of 0.8 ± 0.1 nM, and photon burst statistics suggest that the complex formed most likely contains a single B3 RNA (Fig. S1). Once formed, this complex is stable for extended periods (Fig. 2C) and indifferent to the addition of extra CP monomer until a threshold concentration ∼50 nM is reached. Thereafter assembly of T = 1 capsids occurs slowly in a CP concentration-dependent process that remains incomplete below 1 µM CP. These VLPs are also stable over extended periods, indifferent to the addition of more CP and protect the labeled RNA from RNase digestion (Fig. S2). This second phase of assembly must be driven by CP–CP interactions between the complex nucleated by the initial RNA–CP contact and incoming CPs. Note, at much higher concentrations (µM) the STNV CP is a monomer with no tendency to self-assemble in the absence of RNA (15). The slow kinetics of capsid assembly from the B3 capsomere suggests that the CPs must overcome a barrier to complex formation which we previously speculated was electrostatic, i.e., that the RNA reduces unfavorable interactions between the positively charged N-terminal regions of CP subunits (Fig. 1A).

Fig. 2.

Fig. 2.

RNA dependence and sequence-specificity of STNV assembly. (A) STNV CP was titrated into 1 nM of 5′ Alexa Fluor-488–labeled B3 (Upper) or B34U RNA (Lower) (see inset cartoons) in assembly buffer and the resulting hydrodynamic radius monitored using smFCS over time. Dotted red lines indicate titration points with the labels showing the final monomer CP concentrations reached. FFT smoothed data (thick lines) are shown for clarity, with horizontal gray dashed lines showing the sizes of capsid, B3, and capsomere. For B3 the curves for the differing CP concentrations postformation of the capsomere are color coded and displaced slightly along the time axis. Samples at various points in the titrations were analyzed by TEM (Fig. S3). (B) Size distributions of the species in solution for capsomere (magenta) and B3 containing VLP (green) together with the STNV CP + B34U (black) containing species in plot A. Gaussian peak fits are shown by thick lines, with vertical gray dashed lines showing expected sizes. (C) 1 nM B3 containing capsomeres (magenta) and VLPs (green) were assembled using 4 and 1,000 nM STNV CP, respectively, and their Rh measured over 2 h. The results are compared with similar measurements of 300 nM mRNA VLP (cyan) labeled on the CP with Alexa Fluor-488.

To see whether the RNA effect is purely electrostatic or sequence-specific we set up an assembly titration with B34U, a sequence variant of B3 of the same length predicted to form a B3-like stem-loop, but with the CP recognition motif, -ACAA-, in the loop replaced by -UUUU-. RNA SELEX suggests that the minimal motif consists of a base-paired stem topped by a loop containing -AXXA-, where X is any nucleotide (14). The result (Fig. 2A, Lower) shows very clearly the importance of this sequence for CP recognition. At ∼twofold higher CP concentration than for B3, the variant shows an increased Rh suggesting formation of the capsomere. However, no further assembly occurs until the CP concentration reaches 3 µM, when there is a sharp increase in Rh followed by a steady decline, consistent with formation of unstable nonspecific aggregates rather than VLPs, and the RNA remains accessible to RNase (Fig. S2). The initial B34U-CP complex is not a viable on-pathway assembly intermediate despite the RNA being of the same length and therefore having the same negative charge. This result shows that capsomere assembly is sequence-specific and that only correctly formed species are intermediates in VLP formation. Because B34U RNA forms a complex with CP, sequence recognition is likely to involve binding of the base-paired stem, which is then orientated correctly for further assembly by sequence-specific contacts with the loop motif. B34U is able to make the first of these contacts, but not the latter. Transmission electron microscopy (TEM) images (Fig. S3A) and Rh distribution plots (Fig. 2B) support these interpretations. In conclusion, aptamer B3 has the properties expected of a PS.

Natural PSs Act Cooperatively

To determine whether assembly with the genomic STNV PSs occurs similarly, and explore potential cooperative effects of having multiple PSs on the same RNA fragment, we carried out CP titrations using a dye-labeled 127 nt RNA. This fragment starts at the 5′ end of the STNV-1 genome (Fig. 1 B and C) and encompasses five putative PSs, namely sequences that can be folded into stable stem-loops displaying -AXXA- motifs (14). Aptamer B3 has the highest sequence identity with nucleotides 59–72, which is the middle PS on this fragment. We term this fragment PS1-5. We compared reassembly with identical RNA concentrations (10 nM) of PS1-5 and B3 alone (Fig. 3A). B3 forms the capsomere at low concentration, ∼5 nM and begins to assemble VLPs at 200 nM, although the kinetics of capsid formation are relatively slow and remain incomplete (>4,000 s). On titration to 3,000 nM CP, assembly goes slowly to completion. In contrast, PS1-5 appears to start binding CP only at 120 nM, resulting in a drop of its Rh by 20–30%, reminiscent of the behavior of the full-length genome (7). At 200 nM it rapidly assembles (∼250 s) into intact capsids that are nuclease resistant (Fig. S2), suggesting that the multiple PSs promote cooperative formation of the CP–CP contacts found in the capsid.

Fig. 3.

Fig. 3.

Packaging signals function cooperatively. (A) CP was titrated into 10 nM Alexa Fluor-488–labeled PS1-5 (black) or B3 (dashed - blue). Details as in legend Fig. 2A. Titration points are color coded for each RNA. (B) CP titration into 10 nM of Alexa Fluor-488–labeled PS1-5 and its sequence variants (X), shown as cartoons (Left). (C) Color-coded size distributions of species present in the titrations shown in A and B (substrate key, Left). Data shown are taken from the titration of 200 nM CP onwards in A and B. For PS1-5 and its variants this is 4,000–6,000 s, and for B3 from 2,000 to 8,000 s. (D) Expanded section of B showing the effects of CP addition on each variant. (E) CP titration into spacing variants (see cartoon; Left) and their size distribution plots (Right). Data shown are from 0 to 5,000 s.

This difference in behavior between B3 and PS1-5 could be due to the differences in the sequences/structures of their PSs, or simply because of the length of the PS1-5 fragment. To explore this behavior by the longer fragment, we examined the properties of variant fragments in which all of the PSs were mutated by conversion of 4 nt in their loop regions to uridines, or mutating just PS3, or ablating all of the PSs except PS3 (Fig. 3B). The all mutant PS1-5 binds CP at 120 nM like the wild-type fragment, but this leads to an increase in its Rh, not a decrease. It does not form T = 1 capsids or become nuclease resistant, even when the CP concentration is raised to 3 µM (Fig. 3 B and D). TEMs suggest that the complexes produced are aggregates rather than partially formed capsids (Fig. S4A). This outcome confirms that the loop sequence of PS3 at least is important for CP recognition. When only the putative flanking PSs (1 and 2, and 4 and 5) are mutated, binding again occurs at 120 nM CP and the Rh decreases, showing that this effect is due to initial binding at PS3. However, subsequent assembly is very slow and formation of complete capsids is only achieved at 3,000 nM CP. The result confirms that PS sites other than PS3 play important roles in making capsid assembly efficient, and supports the idea that multiple PSs act cooperatively. When only PS3 is mutated in the PS1-5 fragment, there is no decrease in Rh, confirming that CP binding to PS3 triggers this event, and assembly does not occur until 200 nM CP and does not result in formation of capsids (Fig. 3 B and D).

A hierarchy of PS CP affinities is an essential requirement of the PS-mediated assembly model (9) and is consistent with the sequence/structure variations in PSs PS1-5 on the genomic fragment. PS3 is the highest affinity site and the other PSs contribute to assembly efficiency but cannot trigger VLP formation alone. The putative STNV PS sites were identified by looking at the short range folding potential of genomic RNA, specifying a stem-loop containing the -AXXA- motif (14), and all such sites with a negative free energy of formation were identified as putative PSs. This simple approach appears successful. Of course, PSs with small folding free energies may only form active conformations in response to CP binding.

Another predicted property of PSs is that their relative locations within the genome is such that they facilitate the CP–CP interactions seen in the capsid, implying that their relative separation in the RNA sequence, which determines their spatial locations in three-dimensions, should be important. To test this, we produced three additional variants of PS1-5 in which 10-nt spacers, chosen in each case to avoid altering the potential secondary structures, were added either 5′ to PS3, i.e., between PS2 and PS3, or 3′ to it, between PS3 and PS4, or at both of these sites. These RNAs are designated 5′ or 3′ spacer PS1-5 and 5′ and 3′ spacer PS1-5, respectively. The results of their titrations with CP are shown in Fig. 3E. All three support fairly rapid assembly beyond 120 nM CP, but without an initial Rh decrease. The resulting smFCS traces are very noisy and their distributions suggest that these species form mixtures with few correctly assembled capsids. TEMs (Fig. S4B) suggest that although some T = 1 capsids are formed by these RNAs, there are many partially formed and malformed species present as well. RNase treatment leads to large changes in these Rh distributions (Fig. S2), confirming that the natural spacing of the five PS sites in the 5′ genomic fragment is important for accurate and efficient capsid assembly. The lack of Rh decreases in fragments with altered PS spacing but containing wild-type PS3 sequences suggests a more complex explanation of the phenomenon in the PS1-5 fragment. CP binding to B34U shows that capsomere-like species can form with high affinity for stem-loops lacking the loop recognition motif. The PS1-5 variants with a wild-type PS3 could allow CPs interacting at that site to contact the neighboring RNA stem-loops, each equivalent to B34U. Although these complexes are not efficient at ensuring cooperative capsid assembly, they could explain the initial decrease in Rh. The spacer variants suggest that both neighboring stem-loops must be contacted to cause this effect, emphasizing the importance of PS positioning in the genome.

Roles of Electrostatics in Assembly and Stability

The sequence-specificity of assembly initiation revealed by the experiments with B3, PS1-5, and their variants, is consistent with PS-mediated assembly, but not with expectations that electrostatic interactions are the principal driving forces behind assembly in this class of viruses (2025). The PS fragments and the variant sites, with the exception of the insert variants, are of the same length, have the same net charge and are predicted to share common secondary structures. Under ensemble assembly conditions (CP ≥ 10 µM) B34U triggers T = 1 capsid formation, although much less efficiently than B3 (15), i.e., the sequence-specific RNA discrimination is dramatically reduced at higher concentration, and reassembly can be driven almost exclusively by electrostatic interactions (26). In contrast, the outcomes of single-molecule in vitro reassembly, carried out at nanomolar concentrations, are more sensitive to sequence-specific effects and may reflect more accurately conditions in vivo. To examine the effects of charge neutralization on assembly, we repeated the B3 titration at higher RNA concentration (10 nM vs. 1 nM, Fig. S3B). More RNA increases the proportion of T = 1 capsid and larger intermediates present at lower CP concentrations, but assembly is still relatively slow and requires CP > 1 µM to be complete. It seems therefore that increasing the numbers of phosphodiesters lowers the barrier to assembly, consistent with it being electrostatic in nature.

As a test of this idea, we compared the salt stability of VLPs assembled at either concentration of B3 or with PS1-5, with that of the VLP from Escherichia coli, which packages a modified mRNA encoding the CP gene (Fig. 4A and Fig. S5A). Addition of NaCl to the reassembled particles leads to rapid disassembly of both the 1 and 10 nM B3 VLPs, which are fully disassembled by ∼150 and ∼400 mM, respectively, whereas the PS1-5 and mRNA VLPs remain stable up to 700 mM. STNV virions contain structural calcium ions that bind to three distinct sites located at the particle symmetry axes (Fig. 1A). Note, not all these sites are occupied in the crystal structure of the B3 VLP (15). Because these should contribute to electrostatic stability, we repeated the salt titration in the presence of 5 mM EDTA (Fig. 4A and Fig. S5B). Both B3 VLPs are fully disassembled under these conditions before addition of extra salt, whereas the PS1-5 VLP partially disassembles at 50 mM NaCl, with only the mRNA VLP remaining stable throughout the titration range. These results confirm that particle stability is strongly dependent on formation of favorable, relatively local, electrostatic contacts, following a sequence-specific assembly initiation. Clearly, the presence of longer RNAs encompassing multiple PS sites stabilizes the CP–CP contacts relative to the VLPs formed with the short RNA.

Fig. 4.

Fig. 4.

The roles of electrostatic interactions in assembly of capsomere and capsid. (A) Salt titration into mRNA VLPs (magenta), B3 VLPs, assembled at 1:1,000 nM (red) or 10:1,000 nM (blue) RNA:CP, and PS1-5 VLPs assembled at 10:1,000 nM (black). Rh was measured over 10 min at each NaCl concentration. The average and SE of the Rh’s are shown. The same experiments, performed in the presence of 5 mM EDTA, are shown as dotted lines. (B) Schematic of relative free energy barriers associated with formation of the capsomere and T = 1 capsid.

The results described above are consistent with a two-stage assembly model following a free energy path similar to that shown in Fig. 4B. Capsomere formation is rapid, high-affinity, and spontaneous, but for the RNA–CP complex to be on-pathway to capsid assembly, the PS stem-loop must be orientated precisely via sequence-specific interactions with the loop motif. In longer RNAs, this initial event is coupled to a decrease in the size of the viral RNA driven by CP binding to correctly placed additional PSs, a vital step in preparing it for specific encapsidation (7, 8). On-pathway capsomeres with short RNAs are prevented from self-assembly to capsid by an electrostatic activation barrier created by the repulsion of clustering six positively charged amino acid side chains into the small volume around the particle threefold helices (Fig. 1A). This inhibitory effect can be overcome using more RNA, either in the form of higher concentrations of single PSs or as natural fragments encompassing multiple, appropriately positioned PS sites. The latter are more effective at lowering the activation barrier, allowing assembly to be cooperative once it initiates. Note, the cooperativity for the complete genomic fragment, with up to 30 PSs, would be even more pronounced than seen here for the 5′ fragment with only five such sites.

The Relationship of the Viral RNA to its Coat Protein

By definition there is a close relationship between the genomic RNA of ssRNA viruses and their coat proteins; one is encoded by the other. For satellite viruses, this is heightened by the fact that the only “viral” RNA encapsidated is an extended mRNA encoding the CP subunit. The polymerase responsible for replication is supplied by the helper virus, Tobacco Necrosis Virus. The STNV genome therefore encompasses at least two functions: It is a substrate for both replication and translation. Previous studies have shown that sequences within the 5′ and 3′ untranslated regions (UTRs) are responsible for making both these processes efficient via complementary base pairing between the UTR residues 24–55 and 979–1009 (27) within the RNA. The site in the 5′UTR overlaps the PS containing region. It is therefore clear that replication/translation and virion assembly are distinct processes brought about by differing RNA conformations.

A global secondary structure of the entire genome calculated using Mfold suggests that only PS1 in the 5′ fragment used here would be formed spontaneously (14). However, if the RNA is folded allowing only for short-range interactions, i.e., in 50-nt sections, then all of the predicted PSs can be formed. The data with the 5′ fragment suggests that STNV packaging occurs on newly replicating RNAs as they emerge from the polymerase. The PS-mediated assembly mechanism also implies that there is a defined spatial relationship between the protein shell of the capsid and the PSs within the genomic RNA. Because ssRNA viruses use RNA-dependent RNA polymerases that lack proof reading functions, the maintenance of such PS sites is a significant challenge. In other systems it has been shown that the coding variants within a viral quasi-species are likely to increase the misincorporation of noncognate RNAs, leading to the suggestion that ssRNA can be horizontal vectors for gene transfer (28, 29). It is thus likely that strategies have evolved that favor conservation of PSs in viral evolution, thus enhancing packaging specificity.

Discussion

There are different ways in which ssRNA viruses may counter the challenge of enhancing packaging of their cognate genomic RNAs against the background of cellular competitor RNAs. One solution could be the physical separation of viral components from cellular factors, but the evidence of significant misincorporation even in viruses believed to operate this way (29) suggests that other mechanisms are likely to have selective advantages. An alternative possibility is that packaging of cognate genomes is the result of specificity between viral genomes and their cognate CPs. Such a mechanism would mirror a phenomenon occurring more widely, in particular in early events in the development of the RNA–protein world: a bias toward interactions between encoded proteins and their mRNAs, even when those proteins are not thought to function via RNA binding (3032).

We speculate that a similar phenomenon may give rise to PS conservation, and thus ensure packaging specificity, in STNV. In particular, the molecular details of the PS3–CP interaction illustrate how this can be achieved. PS3 is located at nt60 in the STNV-1 genome, and forms part of the codon for amino acid residue 10 in the CP, or 9 neglecting the N-terminal methionine (Fig. 1A). Thus, the strongest PS forms a section of the gene encoding the amino acids in the CP that alter their conformation upon PS binding (Fig. 5). This match may of course be coincidental, but a statistical analysis (SI Materials and Methods) suggests that it would only occur ∼9% of the time by chance. PS3 is also one of only three PS sites, with PS11 and PS12, conserved in all three known STNV strains (Fig. S6). Testing this idea experimentally would be challenging, so we compared STNV with the two other ssRNA viruses for which we have detailed CP–RNA structural data, namely MS2 and TMV (33, 34) (Fig. S7). The former assembles via a mechanism mediated by multiple PSs, whereas the latter is a helical virus that assembles from a unique PS-RNA contact. There are four known PSs within the MS2 CP gene. One of them encodes a 6-aa peptide that plays a significant role in recognizing the strongest PS in that genome, TR (Fig. 5B and Fig. S7 A and C). The probability that this match occurs by chance in that case is 3.1% (SI Materials and Methods). In Group 2 Tobacco Mosaic Viruses, the origin of assembly, a PS, encodes one of the two peptides forming the RNA-binding site in the CP (3537), suggesting similar relationships occur even in a helical virus (Fig. S7D). The occurrence of this phenomenon in a number of viruses suggests that PS encoding features of the protein sites they are in contact with could be an unappreciated aspect of viral CP–genome interactions.

Fig. 5.

Fig. 5.

PSs encode critical features of viral CPs involved in PS RNA recognition. (A) View of the contacts between the ordered portion of the B3 RNA (cf. PS3) and the PS3-encoded amino acids (space filling representation) in the B3 VLP structure (PDB ID 4BCU). (B) The MS2 coat protein dimer in A/B conformation in complex with the highest affinity genomic PS, the TR stem-loop (34). The amino acids in the A (blue) and B (green) chains that are encoded by one of the four PSs within the CP gene (Fig. S7) are shown in space-filling representation.

STNV CP shares a common fold with many other ssRNA viruses, most of which are more complex, having quasi-equivalent capsids, in part defined by the conformations of their positively charged N-terminal regions that often contain arginine-rich motifs, ARMs (38). Removal of this region in STNV prevents assembly (39), whereas in other viruses it promotes RNA-independent assembly of T = 1 capsids (40). Here we have used smFCS assays to probe the roles of RNA in STNV assembly at nanomolar concentrations, showing that it follows a two-stage PS-mediated mechanism, thus preventing assembly around noncognate RNAs. Even with relatively short PS fragments we have recreated assembly features seen with the intact genome, showing that CP recognition of degenerate PSs, and the latter’s relative spacing along the genomic RNA, both contribute to efficient cooperative capsid assembly (9, 41). These assays also validate the use of RNA SELEX against viral CPs, combined with theoretical analysis techniques pioneered by us, as a route for identifying PSs (15).

It is tempting to speculate that, because of the multiple selective advantages for viruses assembling via such mechanisms, viruses with similar CP folds will also assemble by similar routes (14). The requirement to encode PSs, and perhaps the peptide functions they encode, introduces a previously unsuspected constraint on viral evolution, especially given the density of functions located within the same RNA sequences. Disrupting this assembly mechanism would therefore be a novel antiviral strategy. Previously, we showed that hydrodynamic collapse of viral genomes is a signature of cognate encapsidation, and has many similarities to the assembly of the ribosome. The roles of the multiple genomic PSs revealed here also have similarities to protein sequestration by noncoding (nc) RNAs, “RNA sponges” (42), where proteins bind sequentially, specifically, and cooperatively to stem-loops, leading to protection of a ncRNA from degradation. Assembly of ssRNA viruses may therefore have more in common with cellular RNA biology than previously suspected.

Materials and Methods

STNV CP was prepared by dissociation of recombinant VLPs produced in E. coli (15). RNAs were either chemically synthesized with a dye attached to the 5′ end, (DNA technology A/S, Denmark) or transcribed in the presence of 5′-amino-GMP which was subsequently chemically coupled to a dye using an amine reactive Alexa Fluor 488 SDP ester (Invitrogen). SmFCS assays and data analysis were performed as described (8).

Supplementary Material

Supplementary File
pnas.201420812SI.pdf (2.9MB, pdf)

Acknowledgments

We thank Prof. Arwen Pearson for access to the X-ray density and model of the B3 VLP and for helpful comments, and Prof. Cheng Kao and his colleagues at Indiana University for access to their unpublished CLIP-SEQ data on MS2. We thank the University of Leeds for its support of the Single Molecule Facility within the Astbury Centre. This work was supported by UK Biotechnology and Biological Science Research Council Grants BB/J00667X/1, BB/L022095/1, BB/L021803/1, BB/J004596/1 and the Wellcome Trust (089311/Z/09/Z and 090932/Z/09/Z). R. Twarock acknowledges funding via a Royal Society Leverhulme Trust Senior Research fellowship (LT130088), and E.C.D. acknowledges funding via an Early Career Leverhulme Trust fellowship (ECF-2013-019).

Footnotes

The authors declare no conflict of interest.

This article is a PNAS Direct Submission. J.E.J. is a guest editor invited by the Editorial Board.

This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1420812112/-/DCSupplemental.

References

  • 1.Woodson SA. RNA folding pathways and the self-assembly of ribosomes. Acc Chem Res. 2011;44(12):1312–1319. doi: 10.1021/ar2000474. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Simon AE, Gehrke L. RNA conformational changes in the life cycles of RNA viruses, viroids, and virus-associated RNAs. Biochim Biophys Acta. 2009;1789(9-10):571–583. doi: 10.1016/j.bbagrm.2009.05.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Schneemann A. The structural and functional role of RNA in icosahedral virus assembly. Annu Rev Microbiol. 2006;60:51–67. doi: 10.1146/annurev.micro.60.080805.142304. [DOI] [PubMed] [Google Scholar]
  • 4.Enserink M. Infectious diseases. Crippling virus set to conquer Western Hemisphere. Science. 2014;344(6185):678–679. doi: 10.1126/science.344.6185.678. [DOI] [PubMed] [Google Scholar]
  • 5.Lehtinen M, Dillner J. Clinical trials of human papillomavirus vaccines and beyond. Nature reviews. Clin Oncol. 2013;10(7):400–410. doi: 10.1038/nrclinonc.2013.84. [DOI] [PubMed] [Google Scholar]
  • 6.Yan R, Hallam A, Stockley PG, Boyes J. Oncogene dependency and the potential of targeted RNAi-based anti-cancer therapy. Biochem J. 2014;461(1):1–13. doi: 10.1042/BJ20140173. [DOI] [PubMed] [Google Scholar]
  • 7.Borodavka A, Tuma R, Stockley PG. Evidence that viral RNAs have evolved for efficient, two-stage packaging. Proc Natl Acad Sci USA. 2012;109(39):15769–15774. doi: 10.1073/pnas.1204357109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Borodavka A, Tuma R, Stockley PG. A two-stage mechanism of viral RNA compaction revealed by single molecule fluorescence. RNA Biol. 2013;10(4):481–489. doi: 10.4161/rna.23838. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Dykeman EC, Stockley PG, Twarock R. Solving a Levinthal’s paradox for virus assembly identifies a unique antiviral strategy. Proc Natl Acad Sci USA. 2014;111(14):5361–5366. doi: 10.1073/pnas.1319479111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Ling CM, Hung PP, Overby LR. Specificity in self-assembly of bacteriophages Q beta and MS2. Biochemistry. 1969;8(11):4464–4469. doi: 10.1021/bi00839a036. [DOI] [PubMed] [Google Scholar]
  • 11.Stockley PG, et al. A simple, RNA-mediated allosteric switch controls the pathway to formation of a T=3 viral capsid. J Mol Biol. 2007;369(2):541–552. doi: 10.1016/j.jmb.2007.03.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Dykeman EC, Stockley PG, Twarock R. Packaging signals in two single-stranded RNA viruses imply a conserved assembly mechanism and geometry of the packaged genome. J Mol Biol. 2013;425(17):3235–3249. doi: 10.1016/j.jmb.2013.06.005. [DOI] [PubMed] [Google Scholar]
  • 13.Dent KC, et al. The asymmetric structure of an icosahedral virus bound to its receptor suggests a mechanism for genome release. Structure. 2013;21(7):1225–1234. doi: 10.1016/j.str.2013.05.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Bunka DH, et al. Degenerate RNA packaging signals in the genome of Satellite Tobacco Necrosis Virus: Implications for the assembly of a T=1 capsid. J Mol Biol. 2011;413(1):51–65. doi: 10.1016/j.jmb.2011.07.063. [DOI] [PubMed] [Google Scholar]
  • 15.Ford RJ, et al. Sequence-specific, RNA-protein interactions overcome electrostatic barriers preventing assembly of satellite tobacco necrosis virus coat protein. J Mol Biol. 2013;425(6):1050–1064. doi: 10.1016/j.jmb.2013.01.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.van Lipzig R, et al. The 5′ and 3′ extremities of the satellite tobacco necrosis virus translational enhancer domain contribute differentially to stimulation of translation. RNA. 2002;8(2):229–236. doi: 10.1017/s1355838202018071. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Bringloe DH, Gultyaev AP, Pelpel M, Pleij CW, Coutts RH. The nucleotide sequence of satellite tobacco necrosis virus strain C and helper-assisted replication of wild-type and mutant clones of the virus. J Gen Virol. 1998;79(Pt 6):1539–1546. doi: 10.1099/0022-1317-79-6-1539. [DOI] [PubMed] [Google Scholar]
  • 18.Bringloe DH, Pleij CW, Coutts RH. Mutation analysis of cis-elements in the 3′- and 5′-untranslated regions of satellite tobacco necrosis virus strain C RNA. Virology. 1999;264(1):76–84. doi: 10.1006/viro.1999.9994. [DOI] [PubMed] [Google Scholar]
  • 19.Meulewaeter F, Danthinne X, Van Montagu M, Cornelissen M. 5′- and 3′-sequences of satellite tobacco necrosis virus RNA promoting translation in tobacco. Plant J. 1998;14(2):169–176. doi: 10.1046/j.1365-313x.1998.00104.x. [DOI] [PubMed] [Google Scholar]
  • 20.Rudnick J, Bruinsma R. Icosahedral packing of RNA viral genomes. Phys Rev Lett. 2005;94(3):038101. doi: 10.1103/PhysRevLett.94.038101. [DOI] [PubMed] [Google Scholar]
  • 21.van der Schoot P, Bruinsma R. Electrostatics and the assembly of an RNA virus. Phys Rev E Stat Nonlin Soft Matter Phys. 2005;71(6 Pt 1):061928. doi: 10.1103/PhysRevE.71.061928. [DOI] [PubMed] [Google Scholar]
  • 22.Belyi VA, Muthukumar M. Electrostatic origin of the genome packing in viruses. Proc Natl Acad Sci USA. 2006;103(46):17174–17178. doi: 10.1073/pnas.0608311103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Balint R, Cohen SS. The incorporation of radiolabeled polyamines and methionine into turnip yellow mosaic virus in protoplasts from infected plants. Virology. 1985;144(1):181–193. doi: 10.1016/0042-6822(85)90316-2. [DOI] [PubMed] [Google Scholar]
  • 24.Bruinsma RF. Physics of RNA and viral assembly. Eur Phys J E Soft Matter. 2006;19(3):303–310. doi: 10.1140/epje/i2005-10071-1. [DOI] [PubMed] [Google Scholar]
  • 25.Yoffe AM, et al. Predicting the sizes of large RNA molecules. Proc Natl Acad Sci USA. 2008;105(42):16153–16158. doi: 10.1073/pnas.0808089105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Garmann RF, et al. Role of electrostatics in the assembly pathway of a single-stranded RNA virus. J Virol. 2014;88(18):10472–10479. doi: 10.1128/JVI.01044-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Kaempfer R, van Emmelo J, Fiers W. Specific binding of eukaryotic initiation factor 2 to satellite tobacco necrosis virus RNA at a 5′-terminal sequence comprising the ribosome binding site. Proc Natl Acad Sci USA. 1981;78(3):1542–1546. doi: 10.1073/pnas.78.3.1542. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Routh A, Domitrovic T, Johnson JE. Host RNAs, including transposons, are encapsidated by a eukaryotic single-stranded RNA virus. Proc Natl Acad Sci USA. 2012;109(6):1907–1912. doi: 10.1073/pnas.1116168109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Routh A, Domitrovic T, Johnson JE. Packaging host RNAs in small RNA viruses: An inevitable consequence of an error-prone polymerase? Cell Cycle. 2012;11(20):3713–3714. doi: 10.4161/cc.22112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Polyansky AA, Zagrovic B. Evidence of direct complementary interactions between messenger RNAs and their cognate proteins. Nucleic Acids Res. 2013;41(18):8434–8443. doi: 10.1093/nar/gkt618. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Polyansky AA, Hlevnjak M, Zagrovic B. Proteome-wide analysis reveals clues of complementary interactions between mRNAs and their cognate proteins as the physicochemical foundation of the genetic code. RNA Biol. 2013;10(8):1248–1254. doi: 10.4161/rna.25977. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Hlevnjak M, Polyansky AA, Zagrovic B. Sequence signatures of direct complementarity between mRNAs and cognate proteins on multiple levels. Nucleic Acids Res. 2012;40(18):8874–8882. doi: 10.1093/nar/gks679. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Namba K, Pattanayek R, Stubbs G. Visualization of protein-nucleic acid interactions in a virus. Refined structure of intact tobacco mosaic virus at 2.9 A resolution by X-ray fiber diffraction. J Mol Biol. 1989;208(2):307–325. doi: 10.1016/0022-2836(89)90391-4. [DOI] [PubMed] [Google Scholar]
  • 34.Valegård K, Murray JB, Stockley PG, Stonehouse NJ, Liljas L. Crystal structure of an RNA bacteriophage coat protein-operator complex. Nature. 1994;371(6498):623–626. doi: 10.1038/371623a0. [DOI] [PubMed] [Google Scholar]
  • 35.Fukuda M, Meshi T, Okada Y, Otsuki Y, Takebe I. Correlation between particle multiplicity and location on virion RNA of the assembly initiation site for viruses of the tobacco mosaic virus group. Proc Natl Acad Sci USA. 1981;78(7):4231–4235. doi: 10.1073/pnas.78.7.4231. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Guilley H, Jonard G, Richards KE, Hirth L. Sequence of a specifically encapsidated RNA fragment originating from the tobacco-mosaic-virus coat-protein cistron. Eur J Biochem. 1975;54(1):135–144. doi: 10.1111/j.1432-1033.1975.tb04122.x. [DOI] [PubMed] [Google Scholar]
  • 37.Takamatsu N, Ohno T, Meshi T, Okada Y. Molecular cloning and nucleotide sequence of the 30K and the coat protein cistron of TMV (tomato strain) genome. Nucleic Acids Res. 1983;11(11):3767–3778. doi: 10.1093/nar/11.11.3767. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Rao AL. Genome packaging by spherical plant RNA viruses. Annu Rev Phytopathol. 2006;44:61–87. doi: 10.1146/annurev.phyto.44.070505.143334. [DOI] [PubMed] [Google Scholar]
  • 39.Lane SW, et al. Construction and crystal structure of recombinant STNV capsids. J Mol Biol. 2011;413(1):41–50. doi: 10.1016/j.jmb.2011.07.062. [DOI] [PubMed] [Google Scholar]
  • 40.Sorger PK, Stockley PG, Harrison SC. Structure and assembly of turnip crinkle virus. II. Mechanism of reassembly in vitro. J Mol Biol. 1986;191(4):639–658. doi: 10.1016/0022-2836(86)90451-1. [DOI] [PubMed] [Google Scholar]
  • 41.Stockley PG, et al. Packaging signals in single-stranded RNA viruses: Nature’s alternative to a purely electrostatic assembly mechanism. J Biol Phys. 2013;39(2):277–287. doi: 10.1007/s10867-013-9313-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Duss O, et al. Structural basis of the non-coding RNA RsmZ acting as a protein sponge. Nature. 2014;509(7502):588–592. doi: 10.1038/nature13271. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary File
pnas.201420812SI.pdf (2.9MB, pdf)

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES