Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2011 Oct 24;108(46):E1111-E1119. doi: 10.1073/pnas.1102932108

Ribosomal frameshifting into an overlapping gene in the 2B-encoding region of the cardiovirus genome

Gary Loughran a,1, Andrew E Firth b,1,2, John F Atkins a,c
PMCID: PMC3219106  PMID: 22025686

Abstract

The genus Cardiovirus (family Picornaviridae) currently comprises the species Encephalomyocarditis virus (EMCV) and Theilovirus. Cardioviruses have a positive-sense, single-stranded RNA genome that encodes a large polyprotein (L-1ABCD-2ABC-3ABCD) that is cleaved to produce approximately 12 mature proteins. We report on a conserved ORF that overlaps the 2B-encoding sequence of EMCV in the +2 reading frame. The ORF is translated as a 128–129 amino acid transframe fusion (2B*) with the N-terminal 11–12 amino acids of 2B, via ribosomal frameshifting at a conserved GGUUUUY motif. Mutations that knock out expression of 2B* result in a small-plaque phenotype. Curiously, although theilovirus sequences lack a long ORF in the +2 frame at this genomic location, they maintain a conserved GGUUUUU motif just downstream of the 2A-2B junction, and a highly localized peak in conservation at polyprotein-frame synonymous sites suggests that theiloviruses also utilize frameshifting here, albeit into a very short +2-frame ORF. Unlike previous cases of programmed -1 frameshifting, here frameshifting is modulated by virus infection, thus suggesting a novel regulatory role for frameshifting in these viruses.

Keywords: picornavirus, translation, genetic recoding, Theiler’s murine encephalomyelitis virus, Saffold virus


The genus Cardiovirus belongs to the Picornaviridae family of positive-sense single-stranded RNA viruses. Currently, two cardiovirus species are recognized—Encephalomyocarditis virus (EMCV) and Theilovirus. The latter species encompasses a number of divergent clades and isolates, including Theiler’s murine encephalomyelitis virus (TMEV), Rat theilovirus (herein referred to as RTV), Vilyuisk virus (VHEV) and Saffold virus (SAFV). The genome comprises a single RNA molecule of approximately 8 kb that contains a single long open reading frame (ORF), which is translated as a polyprotein and cleaved by the virus-encoded 3C protease (13). Like many picornaviruses, cardioviruses use an unusual proteolysis-independent but ribosome-dependent mechanism, termed “StopGo” or “stop-carry on,” to separate the L-1ABCD-2A segment of the polyprotein from the 2BC-3ABCD segment during polyprotein translation (47). StopGo is mediated by the amino acid motif D(V/I)ExNPG|P (where the “|” represents the junction between 2A and 2B), which, together with upstream amino acids that have a strong alpha-helical propensity, prevents formation of a peptide bond between Asn-Pro-Gly and Pro but allows the continuation of translation with up to near-100% efficiency (713). Recently, SAFV has been shown to be a common and widespread human virus—causing infection mainly in early childhood—although the pathogenicity and clinical significance of SAFV infection is currently unclear (14, 15).

Many viruses harbor sequences that induce a portion of translating ribosomes to shift -1 nt and continue translating in the new reading frame (16, 17). The eukaryotic -1 frameshift site typically consists of a “slippery” heptanucleotide fitting the consensus motif X XXY YYZ, where XXX represents any three identical nucleotides; YYY represents AAA or UUU; Z represents A, C, or U; and spaces separate zero-frame codons (18). In the tandem slippage model, the P-site anticodon re-pairs from XXY to XXX, whereas the A-site anticodon re-pairs from YYZ to YYY, thus allowing for perfect re-pairing except at the wobble position (19). Because the codon:anticodon duplex in the P site is not monitored so strictly as that in the A site, certain deviations from the canonical XXX of the slippery site are tolerated, including UCC in some members of the Japanese encephalitis serogroup of flaviviruses, GGU in Bean leafroll luteovirus, GUU in Equine arteritis arterivirus, and GGA in Culex flavivirus (2024). The relative efficiencies of a large selection of slippery heptanucleotides have been tested extensively in reticulocyte lysates (18). The efficiency of frameshifting depends on the identity of the slippery site nucleotides but is typically < 1% in the absence of additional stimulatory elements. Thus, most known instances of eukaryotic -1 frameshifting are stimulated (typically to a level of between 1% and 50%, depending on the particular system) by the presence of a 3′ stable RNA secondary structure, such as a pseudoknot or stem-loop, that is separated from the slippery heptanucleotide by a “spacer” region of 5–9 nt (16, 17). Structures of this type are thought to be located at the mRNA unwinding site of the mRNA entrance channel when their stimulatory effect is exerted, with at least part of the relevant consequence being a ribosomal pause. However, another class of 3′ frameshift stimulators appear not to exert their effect via intra-mRNA structure but, instead, are likely to involve direct interaction between the mRNA and rRNA or other translation components (25, 26).

Until recently, most known cases of phylogenetically conserved eukaryotic -1 ribosomal frameshifting involved frameshifting between two long ORFs. The vast majority of known cases came from viral genomes, and very often the frameshift ORF encodes the viral polymerase that is required in lower quantities than the products of the upstream ORF (27). One reason for this bias is simply that long ORFs are easily detected by sequence analysis. With the recent massive increase in sequence data available for many viruses, powerful comparative computational analyses have revealed a number of cases where the frameshift ORF is relatively, and sometimes very, short (2830). Where the function of frameshifting is simply to produce a truncated form of a protein, the frameshift ORF may be almost nonexistent (e.g., a frameshift site followed immediately by a stop codon), as in the dnaX gene of Escherichia coli and related bacteria (reviewed in ref. 31). Here, we describe a short ORF accessed via -1 ribosomal frameshifting in the cardioviruses. Frameshifting occurs at a conserved G GUU UUY motif just downstream of the StopGo cassette and, in EMCV, results in the translation of a 128–129 amino acid transframe fusion protein, 2B*.

Results

Computational Analysis Reveals a Conserved Overlapping ORF in EMCV.

The polyprotein coding sequences from the 21 full- or near full-length EMCV sequences available in GenBank were extracted, translated, aligned, and back-translated to a nucleotide sequence alignment. Next, the alignment was analyzed for conservation at polyprotein-frame synonymous sites, as described in ref. 32. Briefly, beginning with pairwise sequence comparisons, conservation at synonymous sites (only) was evaluated by comparing the observed number of base substitutions with the number expected under a neutral evolution model. The procedure takes into account whether synonymous site codons are 1-, 2-, 3-, 4-, or 6-fold degenerate and the differing probabilities of transitions and transversions. Statistics were then summed over a phylogenetic tree as described in ref. 33, and averaged over a sliding window.

The analysis revealed a striking and highly statistically significant (p ∼ 10-32) increase in synonymous site conservation in a region covering approximately 120 codons beginning shortly downstream of the junction between the 2A and 2B coding regions (Fig. 1A). Within this region, the mean synonymous substitution rate was only approximately 27% of the genomic average (Fig. 1A). Such peaks in synonymous site conservation are generally indicative of functionally important overlapping elements—either coding or noncoding. In this case, however, an inspection of the positions of stop codons in the +1 and +2 reading frames relative to the polyprotein reading frame, in all 21 sequences, revealed a conserved absence of stop codons in the +2 reading frame in a region that corresponds precisely to the region of enhanced conservation (Fig. 1A), thus suggesting an overlapping coding sequence in the +2 reading frame as a possible explanation for the enhanced conservation at polyprotein-frame synonymous sites. One other explanation for extended conservation could be recombination, a phenomenon that is relatively common in the Picornaviridae, though analysis with a variety of recombination detection algorithms in the RDP3 package failed to reveal any putative recombination events that could explain the conservation in this region (35, 36).

Fig. 1.

Fig. 1.

Conservation at polyprotein-frame synonymous sites in three cardiovirus clades. (A) Twenty-one EMCV sequences, including mengovirus. (B) Twelve TMEV and RTV sequences. (C) Twenty-two SAFV sequences. Panel 1 shows a map of the EMCV coding region, along with the overlapping gene 2B*, the site of frameshifting, and the site of StopGo cotranslational separation at the 2A-2B junction. Panels 2–3, 6–7, and 10–11 show the positions of stop codons (blue triangles) in the +1 and +2 reading frames relative to the polyprotein reading frame, and alignment gaps (green rectangles), in all the aligned sequences. Note the conserved absence of stop codons in the +2 frame in EMCV in the 2B* region (panel 3) and in the +1 frame in TMEV/RTV in the L* region (panel 6). Panels 4–5, 8–9, and 12–13 depict the conservation at polyprotein-frame synonymous sites in a 15-codon sliding window. Panels 4, 8, and 12 depict the probability that the degree of conservation within a given window could be obtained under a null model of neutral evolution at synonymous sites, whereas panels 5, 9, and 13 depict the absolute amount of conservation as represented by the ratio of the observed number of substitutions within a given window to the number expected under the null model. The range of features detected by this type of analysis depends partly on the sliding window size: 15 codons is optimal for features with a width of 15 codons and was used to illustrate the very localized conservation peak in the TMEV/RTV and SAFV alignments. In EMCV, the conservation in the 2B* region is even more pronounced if one uses a larger window size—indeed, the total conservation summed over the 2B* ORF has a corresponding p-value of approximately 10-32. Conversely, features that cover significantly fewer than 15 codons may not be apparent in this analysis. Note that, due to codon usage, the lengths of “random” (i.e., noncoding) ORFs tend to be longer in the +1 frame than in the +2 frame (34).

At the 5′ end of the conserved region there is a conserved G GUU UUY motif (where Y represents either a C or a U), which we inferred to be the site of frameshifting (Fig. 2A). This heptanucleotide has been implicated previously as the slippery sequence for -1 ribosomal frameshifting in certain luteoviruses, such as Bean leafroll and Soybean dwarf (22). Moreover, although the G at position one is conserved, the corresponding polyprotein-frame codon is not conserved, with CCG (Pro), CAG (Gln), and AAG (Lys) all being represented in different EMCV sequences, thus providing further support for the functional importance of the G GUU UUY motif. Using alignment RNA-folding software such as alidot and pseudoknot-predicting software such as pknotsRG, we were unable to predict a convincing 3′ RNA secondary structure at a canonical distance (i.e., 5–9 nt) for stimulating efficient ribosomal frameshifting at the G GUU UUY motif (37, 38). However, we were also unable to definitively rule out such a structure, because long-range base pairings and/or structures involving complex tertiary interactions can be difficult to detect based on computational analysis alone (39, 40).

Fig. 2.

Fig. 2.

Representative nucleotide and peptide sequences. (A) Nucleotide sequences flanking the proposed site of frameshifting (confirmed in EMCV) in representative cardiovirus sequences (GenBank accession numbers shown at left). The codons that encode NPG|P of the StopGo cassette are highlighted in pale blue. The slippery heptanucleotide where -1 frameshifting occurs is highlighted in pale orange. The stop codon that terminates the very short +2-frame ORF in TMEV and SAFV is indicated in red (in EMCV, the +2-frame ORF is much longer; see Fig. 1A). (B) Amino acid sequence of the transframe protein 2B* in mengovirus DQ294633. The two antigens used for Abs anti-N and anti-C are highlighted in yellow. Peptide sequences detected by mass spectrometric analysis of 2B* purified from infected cells are indicated with underscores (anti-N IP) and overscores (anti-C IP). The amino acids, VF, corresponding to the site of frameshifting are indicated in bold. (C) Amino acid sequences of the much shorter predicted transframe peptide in representative TMEV and SAFV sequences.

A number of other peaks in synonymous site conservation were detected, including one corresponding to the cis-active RNA element (CRE) (41) that plays a role in replication, as well as several in the region of the genome that encodes 3D, the viral polymerase (Fig. 1A). In fact, such conservation peaks are common in the 5′- and/or 3′-terminal regions of many viral polyprotein coding sequences and often correspond to overlapping noncoding elements, such as RNA secondary structures involved in replication and transcription, translation enhancement, and/or packaging (42). In contrast to these elements, the conservation peak in the 2B region of EMCV extends to a greater degree over a greater region.

Computational Analysis of Theiloviruses.

Next, we applied the same analyses to other members of the Cardiovirus genus. Phylogenetically, VHEV clusters with TMEV, whereas SAFV forms another clade that is more closely related to the TMEV–VHEV clade than to EMCV (14, 43). The divergent RTV clade clusters ambiguously with either SAFV or TMEV, depending on the genomic region analyzed, but we chose to analyze RTV alongside TMEV because of the common presence of the L* ORF (Fig. 1B). When we analyzed the conservation at synonymous sites and the positions of +2-frame stop codons in alignments of either TMEV/RTV (Fig. 1B; 12 sequences) or SAFV (Fig. 1C; 22 sequences) we still observed a striking peak in synonymous site conservation just downstream of the 2A-2B junction. However, the extent of the peak was greatly reduced compared to that seen in EMCV. Moreover, the extent of the corresponding +2-frame ORF was similarly reduced—covering just 8 codons in the majority of sequences (Fig. 2A). Nonetheless, an inspection of the sequence data revealed a conserved G GUU UUU motif at the 5′ end of the conserved region and, again, the functionality of this motif was further supported by conservation of the G at position one despite both UCG (Ser) and UUG (Leu) codons being present at that position (recall that there are six possible codons for each of Ser and Leu, yet only three of the 12 codons end in G; Fig. 2A). Analysis of the 3′ sequence with RNA-folding algorithms again failed to reveal a 3′ RNA structure at the canonical (5–9 nt) distance for stimulation of -1 frameshifting. However, enhanced conservation at polyprotein-frame synonymous sites extended for approximately 5 codons 3′ of the +2-frame stop codon, suggesting that the 3′ nucleotides may nonetheless be involved in the stimulation of frameshifting.

Besides the conservation peak in the sequence encoding 2B, our analysis also recovered the known overlapping gene in the 5′-terminal region of the TMEV/RTV polyprotein coding sequence that encodes L*, a protein essential for the progression of TMEV infection to persistence (Fig. 1B) (4446) and—as for EMCV—the CRE, and several conservation peaks in the 3′ region of the genome.

There is currently only one VHEV sequence in GenBank with coverage of the 2B region, and because this sequence covers only part of the polyprotein, it was not used in the synonymous site conservation analysis. However, it maintains the G GUU UUU motif and an 8-codon +2-frame ORF, as observed for TMEV.

Conservation of a More Distal 3′ RNA Structure.

Although conserved RNA structures were not found at the canonical spacing (5–9 nt) for stimulating ribosomal frameshifting at the G GUU UUY motif, alignment folding did uncover a potential structure at a larger spacing (13–14 nt) from the slippery site (Fig. 3). Because of very high nucleotide conservation in this region, the structure—although conserved—has no support from compensatory substitutions within the EMCV alignment. However, a similar structure was also predicted for TMEV, RTV, and SAFV, where it is supported by an A-U to U-A substitution between TMEV and RTV, besides many differences in the primary nucleotide sequence between these viruses and EMCV (Fig. 3). Indeed, this structure was one of just six EMCV–TMEV phylogenetically conserved potential RNA structures predicted within the cardiovirus coding region in a computational survey of the Picornaviridae (47). The structure bears a resemblance—possibly coincidental—to the cardiovirus CRE, a structure essential for VPg uridylylation in all picornaviruses (48) (Fig. 3). Using the EMCV and TMEV CREs defined in ref. 41 as seeds, alignment folding predicts very similar core CRE structures in EMCV, TMEV, RTV, and SAFV: a 6-bp stem, 1-nt 3′ bulge, a potential 2-bp stem-extension, and a 16-nt loop. The predicted stems are supported by a number of compensatory substitutions, but certain nucleotides in the loop are completely conserved, most notably the 5′-most 6 nt, A AAC AC. In fact AAAC loop motifs are a widespread feature of Picornaviridae CREs, and this motif is also present in the loop of the EMCV shift-site 3′-proximal stem-loop structure (48) (Fig. 3).

Fig. 3.

Fig. 3.

RNA structures predicted from alignment folding. The G GUU UUY frameshift site in EMCV and corresponding predicted frameshift site in TMEV are boxed. A potential lower extension to the stem in EMCV is indicated by underscores, but this extension is not completely conserved throughout the EMCV alignment. The EMCV CRE is also shown, with the 5′-most loop nucleotides, AAACAC, that are conserved throughout the cardioviruses highlighted in bold. Similar nucleotides in the shift-site 3′-proximal stem-loops are also highlighted in bold. Numbers represent genomic coordinates of selected nucleotides within the sequences indicated by GenBank accession numbers.

Immunoblotting Demonstrates Expression of 2B*.

We chose to test for expression of the predicted transframe protein 2B* in mengovirus (a strain of EMCV) rather than TMEV because the length of 2B* in mengovirus (129 codons) is more amenable to experimental analysis. Two separate antibodies (Abs) were raised against predicted antigens within mengovirus 2B*: one against the amino-terminal 12 aa (anti-N) and a second against the carboxy-terminal 14 aa (anti-C). The amino-terminal 12 aa of 2B* are encoded by the zero-frame sequence 5′ of, and including, the G GUU UUU motif so that anti-N, but not anti-C, was expected to react with both 2B and 2B* (Fig. 2B). Additional Abs were also obtained specifically for 2B but proved ineffective.

Western blots with anti-N detected virus-specific proteins migrating at approximately 13 and 16 kDa in lysates from BHK-21 cells infected for 5 h with wild-type (WT) mengovirus (MOI 10; Fig. 4A). To determine whether these products might correspond to 2B and/or 2B*, separate in vitro translation reactions were programmed with constructs expressing 2B and 2B*. Although 2B* has a predicted molecular mass of 14.2 kDa (assuming -1 frameshifting at the G GUU UUU motif), in vitro expressed 2B* migrates at a position corresponding to approximately 16 kDa (Fig. 4A). Meanwhile, in vitro expressed 2B, which has a predicted molecular mass of 16.5 kDa, migrates at a position corresponding to approximately 13 kDa (Fig. 4A). Thus, we identified the more slowly migrating product as 2B*. Interestingly, in infected cells, 2B* was much more readily detected with anti-N than 2B (Fig. 4A). Possible reasons for this include the unknown WT frameshifting efficiency, differences in 2B and 2B* stability, or possible 3C protease cleavage between phylogenetically conserved Gln and Gly residues just 3 aa downstream of the shift site (conserved also in TMEV, RTV, and SAFV), which could remove the N-terminal antigen from a portion of 2B. Although the anti-N Western blots of the 2B and 2B* in vitro translations suggested otherwise, it is also possible that 2B suffered from a reduced transfer efficiency or reduced Ab affinity (e.g., due to the differing amino acid sequences immediately adjacent to the N-terminal epitope) relative to 2B*.

Fig. 4.

Fig. 4.

Immunodetection of 2B* and 2B. (A) 2B/2B* anti-N Western blot (WB) of protein lysates from BHK-21 cells either mock-infected (mock) or infected (MOI 10) with WT mengovirus (vMC0) for 5 h. As controls for the apparent molecular masses of 2B and 2B*, in vitro translated (IVT) 2B and 2B*—as well as a negative control (Co) with no input RNA added—were also immunoblotted. The right-hand panel shows radiolabeled in vitro translated 2B and 2B*. (B) Western blots of anti-N and anti-C immunoprecipitates prepared from mock- or vMC0-infected (MOI 1.0) BHK-21 cells as indicated by − or +. Cell lysates were prepared 8 h p.i. and incubated with IgG beads ± either anti-N or anti-C as indicated. Control cell lysates removed before the addition of beads or Abs are shown in the far right lanes marked “lysates.” The 2B/2B* N-terminal and 2B* C-terminal antigens are indicated in Fig. 2B.

Western blots with anti-C also detected a virus-specific product in lysates from virus-infected cells that migrates at approximately 16 kDa but, as expected, did not detect a product migrating at approximately 13 kDa (Fig. 4B). To determine whether the approximately 16-kDa product detected by both anti-N and anti-C is the same product, immunocomplexes were prepared from BHK-21 cells infected with WT mengovirus (MOI 1.0) for 8 h. Infected cells were lysed in RIPA buffer to dissociate proteins that could coimmunoprecipitate with the approximately 16-kDa product. As shown in Fig. 4B, the approximately 16-kDa protein immunoprecipitated by anti-N was detected by anti-C, and similarly the approximately 16-kDa protein immunoprecipitated by anti-C was detected by anti-N. This strongly suggests that the antigens recognized by anti-N and anti-C are on the same protein.

Mass Spectrometric Confirmation of 2B* Expression and the Site of Frameshifting.

To further establish that the approximately 16-kDa protein detected by both anti-N and anti-C is in fact the predicted 2B*, we resolved by SDS-PAGE anti-N and anti-C immunoprecipitates prepared from both mengovirus-infected and mock-infected BHK-21 cells, followed by total protein staining with Coomassie blue and excision of gel slices containing a product migrating at approximately 16 kDa that was present only in those immunoprecipitates prepared from virus-infected cells. Gel slices were digested with trypsin and the resulting peptides analyzed by liquid chromatography–tandem mass spectrometry (LC/MS/MS) [with Fourier transform–ion cyclotron resonance (FT-ICR)]. Peptides corresponding to 75% and 60% of the predicted 2B* sequence were identified, respectively, from the anti-N and anti-C immunoprecipitates (Fig. 2B). Three different sized peptides from both anti-N and anti-C immunoprecipitates spanned the predicted frameshift site. These data confirm that a portion of ribosomes translate 2B* via -1 frameshifting on the G GUU UUU motif during translation of the mengovirus transcript (raw data in Fig. S1).

Frameshifting Is Stimulated by Virus Infection and Requires at Least 50 nt 3′ of the Slip Site.

Eukaryotic -1 frameshifting is usually stimulated by a stable 3′ RNA secondary structure beginning 5–9 nt 3′ of the shift site. However, as mentioned previously, no such conserved structures were predicted, but we did identify a potential structure beginning 13–14 nt 3′. The frameshifting efficiency and possible role of sequences 3′ of the slip site were investigated using the dual luciferase reporter assay (49, 50). WT (G GUU UUU) and mutant (A GUG UUU) slip-site sequences, as well as up to 125 nt 3′, were cloned between the two luciferase genes in vector pDluc so that the downstream firefly luciferase gene was in the -1 frame relative to the upstream renilla luciferase gene. Each frameshift construct had a corresponding in-frame control (IFC) construct in which the firefly gene was in the same frame as the renilla gene. Frameshifting efficiencies were determined by comparing the ratio of firefly to renilla enzymatic activities in parallel cell cultures transfected with either the frameshift or IFC construct.

Constructs were transfected into BHK-21 cells 18 h before either mock-infection or infection with WT mengovirus (MOI 2.0). Frameshifting efficiencies of approximately 3.5% and 6.6% were observed in virus-infected cells transfected with constructs containing the WT shift site and, respectively, 50 or 125 nt 3′ (Fig. 5). In contrast, the same constructs promoted only low levels (< 1%) of frameshifting in the absence of virus infection (Fig. 5). When the shift site was mutated, frameshifting was reduced to background levels. The potential RNA structure in EMCV ends 48 nt 3′ of the G GUU UUY motif (Fig. 3) and, consistent with this structure having a stimulatory effect on frameshifting, constructs with 50 nt of 3′ sequence stimulated frameshifting in virus-infected cells to a level 3-fold higher than constructs with just 45 nt of 3′ sequence (Fig. 5). Targeted mutations that disrupted the potential structure also reduced frameshifting to low levels; however, mutations designed to restore the potential stem but with reversed base pairings did not restore frameshifting to WT levels, suggesting that, if the structure does indeed stimulate frameshifting, then both the structure and primary sequence elements within the structure are important for this effect (Fig. S2). Because of the potential for additional stimulatory factors, and the effect of virus infection, it is not clear at this stage whether these values are closely representative of the actual level of frameshifting in the virus itself.

Fig. 5.

Fig. 5.

Extent of the 3′ sequence that contributes to frameshift stimulation. Dual luciferase assay showing frameshifting efficiencies following transfection of WT and slip-site mutated (SS) pDluc constructs with 45, 50, or 125 nt 3′ of the slip site. Relevant IFCs were also transfected in parallel. Eighteen hours later, transfectants were either mock-infected or infected with vMC0 (MOI 2.0) for 7 h. Cells were then lysed and luciferase activities measured. Error bars represent the standard deviation of four independent experiments, in each of which each construct was transfected in quadruplicate.

Inactivating 2B* Results in a Small-Plaque Phenotype.

To investigate whether 2B* is an essential protein, we introduced mutations into an infectious clone of mengovirus. Three mutants were generated that are synonymous with respect to the polyprotein reading frame but that are unable to express full-length 2B*: (i) a G GUU UUU to A GUG UUU mutation to prevent frameshifting at the slip site (SS), (ii) a CCG AAC GAC to CCU AAU GAC mutation at nt positions +48 and +51 from the shift site to introduce two consecutive +2 frame premature termination codons just 3′ of the slip site (PTC1), and (iii) a CUA UCG GAU to CUU AGU GAU mutation at nt positions +153 - 156 from the shift site to introduce an alternative set of +2 frame stop codons (PTC2). Although StopGo separation is not known to be affected by downstream elements, to avoid this as a potential confounding factor we tested whether any of the mutants might affect StopGo. Parts of the mengovirus genome (specifically, from the start of 1D to the end of 2B) from WT, SS, PTC1, and PTC2 viruses were cloned into a plasmid suitable for cell-free transcription and translation (Fig. 6A). As a control for impaired StopGo, a construct in which the critical NPG|P motif was mutated to NPLV was also prepared. Similar to WT, failure of StopGo separation occurred at just a very low level in the SS, PTC1, and PTC2 mutants (Fig. 6B).

Fig. 6.

Fig. 6.

Analysis of 2B* knockout mutants. (A) Schematic representation of the mengovirus region expressed by in vitro translation to test whether the SS, PTC1 or PTC2 mutations affect StopGo separation. (B) SDS-PAGE of radiolabeled in vitro translation reactions. LV has a mutation in the StopGo region (NPG|P to NPLV) that completely inhibits StopGo separation of 2B from 1D-2A. SS, PTC1, and PTC2 have the same mutations as the mutant mengoviruses of the same names (see main text). IFC is an in-frame control in which the WT slip site sequence has been mutated from G GTT TTT to A GTG TTT T so that, instead of 2B, a protein identical to transframe 2B* is translated. (C) Mean plaque diameter determined by measuring 100 plaques formed by standard plaque assay after infection with each WT or mutated mengovirus as indicated. Error bars indicate standard deviations. Inset: anti-N and anti-C Western blots of anti-N immunocomplexes on lysates prepared from BHK-21 cells infected with either WT mengovirus (MOI 1.0) for 8 h or each indicated mutated mengovirus (MOI 1.0) for 22 h. (D) Pulse labeling of vMC0, vMC0-SS, vMC0-PTC1, vMC0-PTC2 (MOI 10), and mock-infected BHK-21 cells at 8 and 22 h p.i.. The positions of the major mengovirus proteins are shown to the left. Arrowheads indicate positions of host proteins that are absent or decreased in cells infected with vMC0 when compared to mock-infected cells. There were no labeled proteins in the WT lane at 22 h p.i. because cells infected with vMC0 for 22 h had already undergone CPE.

RNA transcribed from WT, SS, PTC1, and PTC2 infectious clones was transfected into BHK-21 cells to generate virus stocks. Although end-point titers were similar between all viruses, plaque sizes were significantly reduced when either the slip site was mutated or premature termination codons were introduced into the 2B* reading frame (Fig. 6C). We also consistently observed a delayed cytopathic effect (CPE) when BHK-21 cells were infected with any of the three mutant viruses [up to approximately 24 h postinfection (p.i.) at MOI 1.0] compared to WT virus infections (CPE within approximately 10 h p.i. at MOI 1.0). Host cell shutoff was also delayed in the mutants, with many host cell proteins expressed at higher levels after 8 h of mutant virus infection than after 8 h of infection with WT virus (arrowheads in Fig. 6D). This may be due to slower virus multiplication within the cell, rather than a specific role for 2B* in shutoff.

Immunoprecipitation (IP) experiments using lysates from BHK-21 cells infected with WT mengovirus for 8 h or the three mutant viruses for 22 h detected the approximately 16-kDa product (2B*) for WT virus but failed to detect any specific product from the mutant viruses when probed with either anti-N or anti-C (Fig. 6C). Immunoblotting with anti-N detected the approximately 16-kDa product only for WT virus, whereas an approximately 13-kDa product (2B) was detected for WT virus and all three mutants (Fig. 7A). In principle, anti-N should have detected C-terminally truncated versions of 2B* for the PTC1 and PTC2 mutants. These products have expected sizes of approximately 3.0 kDa and 7.1 kDa, respectively, making detection difficult. It is also possible that such truncated products are rapidly degraded. Nonetheless, anti-N Western blots for the PTC2 mutants revealed a faint specific product migrating at a suitable position for the approximately 7.1-kDa truncated 2B* (Fig. 7A).

Fig. 7.

Fig. 7.

Partial reversion of the mutated slip site restores 2B* expression and plaque size. (A) Anti-N Western blot of lysates prepared from BHK-21 cells infected for 8 h (MOI approximately 10) with either WT or the mutant mengoviruses (SS, PTC1, and PTC2) as indicated. P1 indicates the first passage from a virus stock, whereas P10 and P20, respectively, indicate 10 and 20 serial passages of a virus stock. A higher exposure is shown for a region below 10 kDa, showing a PTC2-specific product that may represent the approximately 7.1-kDa truncated 2B*. (B) Mean plaque diameter determined by measuring 100 plaques formed by standard plaque assay after infection with WT mengovirus, or slip-site mutated mengovirus that had been serially passaged once (P1) or eight times (P8) as indicated. Error bars indicate standard deviations.

Although clearly detected with Abs, we were unable to unambiguously detect 2B* via pulse labeling (Fig. 6D); 2B* may migrate very close to 2A (predicted molecular mass of approximately 16.7 kDa) or a host cell protein, and/or it may be present at relatively low abundance. Meanwhile, the truncated 2B* products expected to be produced by the PTC1 and PTC2 mutants lack any methionines and so cannot incorporate the 35S label and, in any case, based on the immunoblotting, are not readily detectable. Nonetheless, one effect of frameshifting may be apparent in Fig. 6D, in that the ratio of the downstream-encoded products (2BC-3ABCD) to those encoded upstream of the shift site (L-1ABCD-2A) appears to be greater in SS than in WT (compare, for example, the relative intensities of the bands corresponding to 1C and 3C in SS and WT in Fig. 6D). Notwithstanding potential confounding effects (e.g., polyprotein processing and turnover), this difference may represent the portion of ribosomes that are removed from the transcript within the region encoding 2B and 2B* in WT virus but not in the shift-site mutant. A discrepancy in the molar ratio between the upstream and downstream products has been noted previously (51, 52)—though only by comparison with the shift-site mutant may one discriminate potential ribosomal drop-off during StopGo from ribosomal “removal” via the frameshifting.

Given that 2B* mutants have an attenuated phenotype in cell culture, one might expect there to be selection for reversion to WT sequence. To test this, the SS, PTC1, and PTC2 mutants were serially passaged in cell culture, and progeny from each passage were sequenced. The shift-site mutant (SS) partially reverted (A GUG UUU to A GUU UUU) from passage 3 in one experiment and from passage 8 in another, but was not observed to fully revert to WT G GUU UUU by passage 20. Indeed (see the Introduction), the G at position 4 in SS is expected to have a much greater inhibitory effect on frameshifting than the A at position 1, and the partial revertant A GUU UUU is likely to allow some frameshifting (cf. ref. 18). In fact, 2B* expression is detectable via Western blotting using lysates prepared from cells infected with the partial revertant (Fig. 7A). Furthermore, the small-plaque phenotype observed for SS (Fig. 6C) is almost completely reversed in the partial revertant (Fig. 7B). Failure to revert A to G at position 1, despite all WT isolates containing a G at position 1, may be a consequence of selection in cell culture as opposed to in vivo. The PTC1 and PTC2 mutants, which involve two and four nucleotide changes, respectively, were not observed to revert. This is not surprising because (unlike the shift-site mutant) single nucleotide changes are not expected to even partially improve fitness for these mutants.

Discussion

We have demonstrated the existence of programmed ribosomal frameshifting and a 128–129 aa transframe gene, 2B*, in EMCV. Evidence includes (i) a 117-codon overlapping ORF conserved in all available EMCV sequences, (ii) a statistically highly significant enhancement in the conservation at polyprotein-frame synonymous sites coinciding with the position of the ORF, (iii) a well-defined and conserved translation mechanism via programmed ribosomal frameshifting at the 5′ end of the ORF, (iv) a product of the expected size was specifically detected by two separate Abs raised against N- and C-terminal peptides, (v) mass spectrometry of this product purified from infected cells confirmed expression of 2B* and the predicted site of frameshifting, and (vi) three separate 2B* knockout mutants, each differing from WT virus by just 2–4 mutations synonymous in the polyprotein frame, exhibited an attenuated small-plaque phenotype.

Interestingly, a frameshift stimulatory RNA structure was not apparent at the canonical distance of 5–9 nt 3′ of the shift site. The reporter assays with 45 and with 50 nt 3′ of the shift site show the importance of the longer sequence for efficient frameshifting. Fifty nucleotides, but not 45 nt, is adequate to allow formation of the potential stem-loop structure identified in EMCV, TMEV, and SAFV (Fig. 3). As the 5′ end of this potential structure is 13–14 nt 3′ of the shift site, if it forms it would be at the leading edge of the mRNA entrance tunnel at the time the shift occurs. There are other candidates for a frameshift stimulator at this position (albeit for +1 not -1 frameshifting), as well as evidence for 3′ stimulators that do not act via intra-mRNA base pairing (25, 26, 53). The predicted 3′ RNA structure bears some resemblance to the cardiovirus CRE element that binds 3CD (41, 48) (Fig. 3). If the shift-site 3′-proximal stem-loop also binds 3CD, then it is plausible that the RNA-protein complex could act as an unusual stimulator for frameshifting (with implications for the distance between the shift site and the stimulator), and this would also explain why efficient frameshifting was only observed in virus-infected cells. Such a mechanism has a precedent in an artificial system where eukaryotic -1 frameshifting stimulated by a 3′-adjacent iron-responsive element (IRE) stem-loop structure was shown to be greatly enhanced by stimulation of IRE-binding proteins (54). However, this is just one of many possible explanations: virus infection could also conceivably modulate frameshifting efficiency via other virus or host trans-acting protein factors, changes in tRNA populations, ribosome modifications, and other changes in cellular environment and/or architecture (cf. refs. 5557). It is also possible that non-Watson–Crick intra-mRNA interactions extend the structure into the spacer region in such a way that allows it to affect frameshifting in a manner more analogous to typical frameshift-stimulating structures (albeit still, unusually, apparently dependent on virus infection). Establishing the nature and mode of action of the 3′ stimulator, as well as the influence of virus-infection, is, however, outside the scope of this report.

Although we have not tested the functionality of the proposed site of frameshifting in SAFV and TMEV, it seems probable from the computational analysis that they, likewise, utilize programmed frameshifting at this location. Unlike EMCV, however, frameshifting in TMEV and SAFV would result in just a 14–15 aa transframe peptide. Although such a small peptide may be functional in itself, one could also speculate an alternative function for frameshifting in TMEV and SAFV. One possibility might be that frameshifting occurs only when StopGo separation fails to occur, thus resulting in a larger protein comprising 2A fused to a 14–15 aa C-terminal transframe peptide. An alternative explanation might be that frameshifting acts as a ribosome sink to reduce the number of ribosomes translating the 3′-encoded replicase proteins, in particular the polymerase 3D, freeing up more ribosomes to translate the 5′-encoded structural proteins; though, to be effective, such a sink would require significant levels of frameshifting (e.g., 50%). Interestingly, StopGo has also been proposed as a candidate for a ribosome sink that might be regulated during the course of virus infection (7), though a possible advantage of frameshifting over StopGo as a ribosome sink is that frameshifting and termination at an out-of-frame stop codon cleanly removes ribosomes from the message, whereas discontinuation of translation at the StopGo cassette may result in a stalled ribosome. A third explanation may be that the frameshifting plays some sort of regulatory role in the viral lifecycle (cf. ref. 39). This is an attractive hypothesis given that the efficiency of frameshifting in EMCV appears to be modulated by viral infection.

For whatever benefit, however, it seems that programmed frameshifting evolved at this location in the ancestral cardiovirus. Then in EMCV, but not TMEV or SAFV, the ancestral frameshift site was co-opted (perhaps while also retaining the original function) as a suitable site to begin the evolution of a long overlapping gene via incremental elongation by substitutions of intervening +2-frame stop codons with sense codons and selection on the encoded amino acid sequence (cf. ref. 34). This finding adds to a small number of known cases of phylogenetically conserved overlapping genes that are internal to longer coding sequences and accessed via programmed ribosomal frameshifting. Other examples include potyvirus PIPO, alphavirus TF, and flavivirus NS1′ (21, 28, 29). Such overlapping genes are difficult to identify and are often overlooked. However, it is important to be aware of these genes as early as possible. Undetected overlapping genes are a source of confusion and confound mutagenic analysis of the genes they overlap, in this case 2B. Furthermore, only once it has been identified can the functions of an overlapping gene be investigated in their own right.

Materials and Methods

Computational Analysis.

Cardiovirus nucleotide sequences in Genbank with complete or near-complete coverage of the polyprotein coding sequence were identified by applying National Center for Biotechnology Information tblastn (58) to the polyprotein sequences derived from GenBank cardiovirus reference sequences NC_001479, NC_001366, NC_009448, and NC_010810. The following sequences were retrieved. EMCV: DQ464062, HM641897, DQ464063, FJ604852, FJ604853, FJ897755, EU780148, EU780149, DQ517424, AF356822, M81861, X00463, DQ288856, X74312, AY296731, M22457, M22458, M37588, X87335, DQ294633, L22089, and DQ835185. TMEV: DQ401688, M16020, EU718732, EU723238, M20562, X56019, M20301, EU718733, and HQ652539. RTV: EU542581, EU815052, and AB090161. SAFV: AM922293, EU376394, GU595289, EU681177, EU681176, FN999911, FR682076, GU943518, EF165067, FJ463615, FJ463616, FJ463617, HM181996, EU681178, EU681179, HM181997, HM181998, HM181999, FM207487, GU943513, GU943514, and JF813004. X00463, which is nearly identical to M81861 but has six single-nucleotide deletions that cause local changes in reading frame in the 1A/1B coding region, was omitted from the analysis. Within each clade, the polyprotein-encoding sequences were extracted, translated, aligned, and back-translated to produce nucleotide sequence alignments using EMBOSS and Clustal (59, 60). Besides the full- or near full-length sequences above, one additional partial sequence with coverage of the 2B region was retrieved (namely, EU723237, which was the only VHEV sequence in Genbank with coverage of 2B).

Antibodies.

Polyclonal Abs to two predicted antigens within 2B* were prepared by GenScript. Anti-N was raised against peptide sequence PFTFKPRQRPVFC (N-terminal 12 aa shared by 2B and 2B* + “C”). Anti-C was raised against peptide sequence CRDHKPDKPVRRNSS (“C” + C-terminal 14 aa of 2B*). Rabbits were injected with one of the two peptides, and Abs were affinity-purified from immune sera. A third Ab was raised against peptide sequence CAWENVKGTLNNPEF (“C” + aa 66–79 of 2B) but proved to be ineffective.

Viruses and Cell Culture.

BHK-21 cells (ATCC) were maintained in DMEM supplemented with 10% FBS, 1 mM L-glutamine, and antibiotics. Virus vMC0 is a recombinant mengovirus that is identical to GenBank accession number DQ294633 except in the 5′UTR where vMC0 has the poly-C tract deleted (61). For virus infection, BHK-21 cells were washed with PBS and then overlaid with vMC0 diluted in PBS/1% FBS at the multiplicities of infection (MOI) indicated throughout the text. After adsorption (60 min at 37 °C) with gentle agitation, virus was removed and cells overlaid with DMEM/2% FBS for the times p.i. indicated.

Immunodetection and IP.

Infected cells were lysed in RIPA buffer plus protease inhibitors on ice for 20 min. Cell debris were removed from cell lysates by centrifugation at 15,000  × g at 4 °C for 15 min. Proteins were resolved by 15% Bis/Tris-PAGE (MES buffer) and transferred to nitrocellulose membranes (Protran) by electrotransfer in a tris/glycine buffer with 20% methanol at 1 mA/cm2 (constant amps) for 55 min. Membranes were incubated at 4 °C overnight in 2% milk with a 1∶1000 dilution of anti-N or a 1∶500 dilution of anti-C. Immunoreactive bands were detected on membranes after incubation with appropriate fluorescently labeled secondary Abs using a LI-COR Odyssey®Infrared Imaging Scanner (LI-COR Biosciences). For IP, cell lysates were incubated with 20 μL of protein G Agarose beads plus anti-N or anti-C (3 μg of each Ab) overnight at 4 °C with gentle rocking. The beads were washed (3×) with ice-cold RIPA buffer and then removed from the beads by boiling for 5 min in 20 μL of 2× SDS-PAGE sample buffer for electrophoresis and either stained with Coomassie blue (for gel excision and subsequent mass spectroscopy analysis) or transferred to nitrocellulose membranes for Western blot analysis.

Mass Spectrometry.

Coomassie blue stained approximately 16-kDa products were excised from the gel and subjected to in-gel trypsin digestion. LC/MS/MS data were acquired using an LTQ-FT hybrid mass spectrometer (ThermoElectron Corporation). Peptide molecular masses were measured by FT-ICR. Peptide sequencing was performed by collision-induced dissociation in the linear ion trap of the LTQ-FT instrument. Digest samples were introduced by nanoLC (Eksigent, Inc.) with nanoelectrospray ionization (ThermoElectron Corporation). NanoLC was performed using a homemade C18 nanobore column [75-μm i.d. × 10 cm; Atlantis C18 (Waters Corporation); 3-μm particle size]. Peptides were eluted from a 50-min linear gradient run from 4% acetonitrile (with 0.1% formic acid) to 70% acetonitrile (with 0.1% formic acid). Peptides were identified using the MASCOT search engine.

Recombinant Constructions.

Full-length recombinant mengovirus genomes containing mutations in the slip site (SS), premature termination codons 1 (PTC1), and premature termination codons 2 (PTC2) were engineered by standard two-step PCR using primers 1–8 in Table S1, along with pMC0 as template. Amplicons were cloned BglII/AflII into pMC0 using standard cloning procedures and all clones were confirmed by sequencing. WT, slip-site mutated (SS), and IFC dual luciferase constructs were generated by two-step PCR with primers 3, 4, and 9–15 in Table S1, again using pMC0 as template. PCR products were cloned XhoI/BamHI into XhoI/BglII digested dual luciferase vector pDluc using standard cloning procedures, and all clones were confirmed by sequencing.

Dual Luciferase Assays.

BHK-21 cells were transfected in quadruplicate with 25 ng of each plasmid as indicated using Lipofectamine 2000 reagent (Invitrogen), using the 1-d protocol in which suspended cells (2 × 104) are added directly to the DNA complexes in half-area 96-well plates. Transfected cells were incubated at 37 °C for 18 h before virus infection as described above for the indicated times and MOI. Luciferase activities were determined using the Dual Luciferase Stop & Glo® Reporter Assay System (Promega). Relative light units were measured on a Veritas Microplate Luminometer with two injectors (Turner Biosystems). Transfected cells were washed once with PBS and then lysed in 12.6 μL of 1× passive lysis buffer, and light emission was measured following injection of 25 μL of either renilla or firefly luciferase substrate. Firefly luciferase activity was calculated relative to the activity of renilla luciferase, and frameshifting efficiencies were determined by comparing the ratio of firefly to renilla enzymatic activities in parallel cell cultures transfected with either the test construct or an IFC.

In Vitro Transcription and Transfection.

Plasmids were linearized with BamHI and transcribed with T7 RNA polymerase (Ambion) for 3 h at 37 °C as recommended by the manufacturer. RNA integrity was examined by electrophoresis, and RNA was transfected into BHK-21 cells using DMRIE-C (Invitrogen) according to the manufacturer’s recommendations.

Cell-Free Translation.

We chose not to use rabbit reticulocyte lysates for cell-free translation because of potential distortions in the apparent molecular masses of small proteins caused by large amounts of endogenous β-globin. In addition, Western blotting with rabbit reticulocyte lysates subsequently probed with Abs raised in rabbit often results in high background. Therefore, cell-free translations were performed in human lysates (Thermo Scientific; Human In vitro Protein Expression kit; catalog no. 88855). RNA transcribed from constructs expressing 2B or 2B* (see SI Materials and Methods for the construction of these plasmids) were used to program in vitro translation reactions with [35S] methionine according to the manufacturer’s instructions. Translation reactions were then analyzed by SDS-PAGE, and gels were either dried and exposed to a phosphorscreen or else Western blotted with anti-N as described above.

Plaque Assays.

Virus titers were quantified by plaque assay. Twenty-four hours after transfection of each infectious RNA, cells were subjected to three cycles of freezing and thawing followed by centrifugation at 1,800 × g to pellet cell debris. Supernatants were collected then aliquotted and stored at -80 °C. BHK-21 cells were infected with 10-fold dilutions of each supernatant for 1 h at 37 °C. The virus inoculums were then removed and cells were overlayed with 1.0% agarose containing MEM. After 48 h of incubation, cells were fixed with 2.5% formalin, and plaques were visualized with crystal violet.

Metabolic Labeling.

BHK-21 cells at 90–100% confluency in 35-mm petri dishes were infected with WT or mutant mengovirus at MOI 10. After 60 min of adsorption at 37 °C, the virus inoculums were removed by aspiration. The cells were washed once with methionine-free DMEM and incubated in 0.25 ml of methionine-free DMEM. The cells were labeled with [35S] methionine (50 μCi/mL) at 37 °C for 30 min. Cells were then washed once in PBS and lysed by being suspended in 0.2 mL of sample buffer and heated at 95 °C for 5 min. Lysates were analyzed by SDS-PAGE (15%). After electrophoresis, the gels were processed for fluorography with Amplify™ (Amersham).

Supplementary Material

Supporting Information

Acknowledgments.

We thank Chad Nelson for performing the mass spectrometry and analysis. We are also very grateful to Martina Scallan for facilitating use of the virus laboratory. We thank Mike Howard for providing vector pDluc and Robert Fujinami for providing a copy of the mengovirus cDNA (pMC0) developed by Ann Palmenberg. We also thank Ann Palmenberg for providing other materials and for stimulating discussions. This work was supported by Science Foundation Ireland Grant 08/IN.1/B1889 (to J.F.A.) and Wellcome Trust Grant 088789 (to A.E.F.). J.F.A. was personally supported by US National Institutes of Health Grant R01 GM079523.

Footnotes

The authors declare no conflict of interest.

This article is a PNAS Direct Submission.

See Author Summary on page 18579.

This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1102932108/-/DCSupplemental.

References

  • 1.Palmenberg AC, et al. The nucleotide and deduced amino acid sequences of the encephalomyocarditis viral polyprotein coding region. Nucleic Acids Res. 1984;12:2969–2985. doi: 10.1093/nar/12.6.2969. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Jackson RJ. A detailed kinetic analysis of the in vitro synthesis and processing of encephalomyocarditis virus products. Virology. 1986;149:114–127. doi: 10.1016/0042-6822(86)90092-9. [DOI] [PubMed] [Google Scholar]
  • 3.Palmenberg AC. Proteolytic processing of picornaviral polyprotein. Annu Rev Microbiol. 1990;44:603–623. doi: 10.1146/annurev.mi.44.100190.003131. [DOI] [PubMed] [Google Scholar]
  • 4.Batson S, Rundell K. Proteolysis at the 2A/2B junction in Theiler’s murine encephalomyelitis virus. Virology. 1991;181:764–767. doi: 10.1016/0042-6822(91)90914-w. [DOI] [PubMed] [Google Scholar]
  • 5.Palmenberg AC, et al. Proteolytic processing of the cardioviral P2 region: Primary 2A/2B cleavage in clone-derived precursors. Virology. 1992;190:754–762. doi: 10.1016/0042-6822(92)90913-a. [DOI] [PubMed] [Google Scholar]
  • 6.Atkins JF, et al. A case for “StopGo”: Reprogramming translation to augment codon meaning of GGN by promoting unconventional termination (Stop) after addition of glycine and then allowing continued translation (Go) RNA. 2007;13:803–810. doi: 10.1261/rna.487907. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Brown JD, Ryan MD. Ribosome “skipping”: “Stop-Carry On” or “StopGo” translation. In: Atkins JF, Gesteland RF, editors. Recoding: Expansion of Decoding Rules Enriches Gene Expression. Heidelberg: Springer; 2010. pp. 101–122. [Google Scholar]
  • 8.Ryan MD, Drew J. Foot-and-mouth disease virus 2A oligopeptide mediated cleavage of an artificial polyprotein. EMBO J. 1994;13:928–933. doi: 10.1002/j.1460-2075.1994.tb06337.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Hahn H, Palmenberg AC. Mutational analysis of the encephalomyocarditis virus primary cleavage. J Virol. 1996;70:6870–6875. doi: 10.1128/jvi.70.10.6870-6875.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Hahn H, Palmenberg AC. Deletion mapping of the encephalomyocarditis virus primary cleavage site. J Virol. 2001;75:7215–7218. doi: 10.1128/JVI.75.15.7215-7218.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Donnelly ML, Gani D, Flint M, Monaghan S, Ryan MD. The cleavage activities of aphthovirus and cardiovirus 2A proteins. J Gen Virol. 1997;78:13–21. doi: 10.1099/0022-1317-78-1-13. [DOI] [PubMed] [Google Scholar]
  • 12.Donnelly ML, et al. Analysis of the aphthovirus 2A/2B polyprotein ‘cleavage’ mechanism indicates not a proteolytic reaction, but a novel translational effect: A putative ribosomal ‘skip’. J Gen Virol. 2001;82:1013–1025. doi: 10.1099/0022-1317-82-5-1013. [DOI] [PubMed] [Google Scholar]
  • 13.Doronina VA, et al. Site-specific release of nascent chains from ribosomes at a sense codon. Mol Cell Biol. 2008;28:4227–4239. doi: 10.1128/MCB.00421-08. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Blinkova O, et al. Cardioviruses are genetically diverse and cause common enteric infections in South Asian children. J Virol. 2009;83:4631–4641. doi: 10.1128/JVI.02085-08. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Zoll J, et al. Saffold virus, a human Theiler’s-like cardiovirus, is ubiquitous and causes infection early in life. PLoS Pathog. 2009;5:e1000416. doi: 10.1371/journal.ppat.1000416. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Brierley I, Gilbert RJC, Pennell S. Pseudoknot-dependent programmed -1 ribosomal frameshifting: Structures, mechanisms and models. In: Atkins JF, Gesteland RF, editors. Recoding: Expansion of Decoding Rules Enriches Gene Expression. Heidelberg: Springer; 2010. pp. 149–174. [Google Scholar]
  • 17.Miller WA, Giedroc DP. Ribosomal frameshifting in decoding plant viral RNAs. In: Atkins JF, Gesteland RF, editors. Recoding: Expansion of Decoding Rules Enriches Gene Expression. Heidelberg: Springer; 2010. pp. 193–220. [Google Scholar]
  • 18.Brierley I, Jenner AJ, Inglis SC. Mutational analysis of the “slippery-sequence” component of a coronavirus ribosomal frameshifting signal. J Mol Biol. 1992;227:463–479. doi: 10.1016/0022-2836(92)90901-U. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Jacks T, Madhani HD, Masiarz FR, Varmus HE. Signals for ribosomal frameshifting in the Rous sarcoma virus gag-pol region. Cell. 1988;55:447–458. doi: 10.1016/0092-8674(88)90031-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Ogle JM, Ramakrishnan V. Structural insights into translational fidelity. Annu Rev Biochem. 2005;74:129–177. doi: 10.1146/annurev.biochem.74.061903.155440. [DOI] [PubMed] [Google Scholar]
  • 21.Balmori Melian E, et al. NS1′ of flaviviruses in the Japanese encephalitis serogroup is a product of ribosomal frameshifting and plays a role in viral neuro-invasiveness. J Virol. 2010;84:1641–1647. doi: 10.1128/JVI.01979-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Domier LL, McCoppin NK, Larsen RC, D’Arcy CJ. Nucleotide sequence shows that Bean leafroll virus has a Luteovirus-like genome organization. J Gen Virol. 2002;83:1791–1798. doi: 10.1099/0022-1317-83-7-1791. [DOI] [PubMed] [Google Scholar]
  • 23.den Boon JA, et al. Equine arteritis virus is not a togavirus but belongs to the coronavirus-like superfamily. J Virol. 1991;65:2910–2920. doi: 10.1128/jvi.65.6.2910-2920.1991. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Firth AE, Blitvich BJ, Wills NM, Miller CL, Atkins JF. Evidence for ribosomal frameshifting and a novel overlapping gene in the genomes of insect-specific flaviviruses. Virology. 2010;399:153–166. doi: 10.1016/j.virol.2009.12.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Guarraia C, Norris L, Raman A, Farabaugh PJ. Saturation mutagenesis of a +1 programmed frameshift-inducing mRNA sequence derived from a yeast retrotransposon. RNA. 2007;13:1940–1947. doi: 10.1261/rna.735107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Chung BY, Firth AE, Atkins JF. Frameshifting in alphaviruses: A diversity of 3′ stimulatory structures. J Mol Biol. 2010;397:448–456. doi: 10.1016/j.jmb.2010.01.044. [DOI] [PubMed] [Google Scholar]
  • 27.Bekaert M, Rousset JP. An extended signal involved in eukaryotic -1 frameshifting operates through modification of the E site tRNA. Mol Cell. 2005;17:61–68. doi: 10.1016/j.molcel.2004.12.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Chung BY, Miller WA, Atkins JF, Firth AE. An overlapping essential gene in the Potyviridae. Proc Natl Acad Sci USA. 2008;105:5897–5902. doi: 10.1073/pnas.0800468105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Firth AE, Chung BY, Fleeton MN, Atkins JF. Discovery of frameshifting in Alphavirus 6K resolves a 20-year enigma. Virol J. 2008;5:108. doi: 10.1186/1743-422X-5-108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Firth AE, Atkins JF. A conserved predicted pseudoknot in the NS2A-encoding sequence of West Nile and Japanese encephalitis flaviviruses suggests NS1′ may derive from ribosomal frameshifting. Virol J. 2009;6:14. doi: 10.1186/1743-422X-6-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Baranov PV, Gesteland RF, Atkins JF. Recoding: Translational bifurcations in gene expression. Gene. 2002;286:187–201. doi: 10.1016/s0378-1119(02)00423-7. [DOI] [PubMed] [Google Scholar]
  • 32.Firth AE, Wills NM, Gesteland RF, Atkins JF. Stimulation of stop codon readthrough: Frequent presence of an extended 3′ RNA structural element. Nucleic Acids Res. 2011;39:6679–6691. doi: 10.1093/nar/gkr224. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Firth AE, Brown CM. Detecting overlapping coding sequences in virus genomes. BMC Bioinformatics. 2006;7:75. doi: 10.1186/1471-2105-7-75. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Belshaw R, Pybus OG, Rambaut A. The evolution of genome compression and genomic novelty in RNA viruses. Genome Res. 2007;17:1496–1504. doi: 10.1101/gr.6305707. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Simmonds P. Recombination and selection in the evolution of picornaviruses and other Mammalian positive-stranded RNA viruses. J Virol. 2006;80:11124–11140. doi: 10.1128/JVI.01076-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Martin DP, et al. RDP3: A flexible and fast computer program for analyzing recombination. Bioinformatics. 2010;26:2462–2463. doi: 10.1093/bioinformatics/btq467. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Hofacker IL. Vienna RNA secondary structure server. Nucleic Acids Res. 2003;31:3429–3431. doi: 10.1093/nar/gkg599. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Reeder J, Steffen P, Giegerich R. pknotsRG: RNA pseudoknot folding including near-optimal structures and sliding windows. Nucleic Acids Res. 2007;35:W320–324. doi: 10.1093/nar/gkm258. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Barry JK, Miller WA. A –1 ribosomal frameshift element that requires base pairing across four kilobases suggests a mechanism of regulating ribosome and replicase traffic on a viral RNA. Proc Natl Acad Sci USA. 2002;99:11133–11138. doi: 10.1073/pnas.162223099. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Su L, Chen L, Egli M, Berger JM, Rich A. Minor groove RNA triplex in the crystal structure of a ribosomal frameshifting viral pseudoknot. Nat Struct Biol. 1999;6:285–292. doi: 10.1038/6722. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Lobert PE, Escriou N, Ruelle J, Michiels T. A coding RNA sequence acts as a replication signal in cardioviruses. Proc Natl Acad Sci USA. 1999;96:11560–11565. doi: 10.1073/pnas.96.20.11560. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Simmonds P, et al. Bioinformatic and functional analysis of RNA secondary structure elements among different genera of human and animal caliciviruses. Nucleic Acids Res. 2008;36:2530–2546. doi: 10.1093/nar/gkn096. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Liang Z, Kumar AS, Jones MS, Knowles NJ, Lipton HL. Phylogenetic analysis of the species Theilovirus: Emerging murine and human pathogens. J Virol. 2008;82:11545–11554. doi: 10.1128/JVI.01160-08. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Kong WP, Roos RP. Alternative translation initiation site in the DA strain of Theiler’s murine encephalomyelitis virus. J Virol. 1991;65:3395–3399. doi: 10.1128/jvi.65.6.3395-3399.1991. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.van Eyll O, Michiels T. Non-AUG-initiated internal translation of the L* protein of Theiler’s virus and importance of this protein for viral persistence. J Virol. 2002;76:10665–10673. doi: 10.1128/JVI.76.21.10665-10673.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Stavrou S, et al. Theiler’s murine encephalomyelitis virus L* amino acid position 93 is important for virus persistence and virus-induced demyelination. J Virol. 2010;84:1348–1354. doi: 10.1128/JVI.01585-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Witwer C, Rauscher S, Hofacker IL, Stadler PF. Conserved RNA secondary structures in Picornaviridae genomes. Nucleic Acids Res. 2001;29:5079–5089. doi: 10.1093/nar/29.24.5079. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Steil BP, Barton DJ. Cis-active RNA elements (CREs) and picornavirus RNA replication. Virus Res. 2009;139:240–252. doi: 10.1016/j.virusres.2008.07.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Grentzmann G, Ingram JA, Kelly PJ, Gesteland RF, Atkins JF. A dual-luciferase reporter system for studying recoding signals. RNA. 1998;4:479–486. [PMC free article] [PubMed] [Google Scholar]
  • 50.Fixsen SM, Howard MT. Processive selenocysteine incorporation during synthesis of eukaryotic selenoproteins. J Mol Biol. 2010;399:385–396. doi: 10.1016/j.jmb.2010.04.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Paucha E, Seehafer J, Colter JS. Synthesis of viral-specific polypeptides in Mengo virus-infected L cells: Evidence for asymmetric translation of the viral genome. Virology. 1974;61:315–326. doi: 10.1016/0042-6822(74)90269-4. [DOI] [PubMed] [Google Scholar]
  • 52.Lucas-Lenard J. Cleavage of mengovirus polyproteins in vivo. J Virol. 1974;14:261–269. doi: 10.1128/jvi.14.2.261-269.1974. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Ivanov IP, Atkins JF. Ribosomal frameshifting in decoding antizyme mRNAs from yeast and protists to humans: Close to 300 cases reveal remarkable diversity despite underlying conservation. Nucleic Acids Res. 2007;35:1842–1858. doi: 10.1093/nar/gkm035. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Kollmus H, Hentze MW, Hauser H. Regulated ribosomal frameshifting by an RNA-protein interaction. RNA. 1996;2:316–323. [PMC free article] [PubMed] [Google Scholar]
  • 55.Hatfield D, et al. Chromatographic analysis of the aminoacyl-tRNAs which are required for translation of codons at and around the ribosomal frameshift sites of HIV, HTLV-1, and BLV. Virology. 1989;173:736–742. doi: 10.1016/0042-6822(89)90589-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Goff SP. Genetic reprogramming by retroviruses: Enhanced suppression of translational termination. Cell Cycle. 2004;3:123–125. [PubMed] [Google Scholar]
  • 57.Kobayashi Y, Zhuang J, Peltz S, Dougherty J. Identification of a cellular factor that modulates HIV-1 programmed ribosomal frameshifting. J Biol Chem. 2010;285:19776–19784. doi: 10.1074/jbc.M109.085621. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
  • 59.Rice P, Longden I, Bleasby A. EMBOSS: The European Molecular Biology Open Software Suite. Trends Genet. 2000;16:276–277. doi: 10.1016/s0168-9525(00)02024-2. [DOI] [PubMed] [Google Scholar]
  • 60.Larkin MA, et al. Clustal W and Clustal X version 2.0. Bioinformatics. 2007;23:2947–2948. doi: 10.1093/bioinformatics/btm404. [DOI] [PubMed] [Google Scholar]
  • 61.Duke GM, Osorio JE, Palmenberg AC. Attenuation of Mengo virus through genetic engineering of the 5′ noncoding poly(C) tract. Nature. 1990;343:474–476. doi: 10.1038/343474a0. [DOI] [PubMed] [Google Scholar]
Proc Natl Acad Sci U S A. 2011 Nov 15;108(46):18579-18580.

Author Summary

AUTHOR SUMMARY

Proteins are essential for all known life, and the sequences of their constituent amino acids are specified by the nucleotide sequences of their template messenger RNAs. Cellular organelles called ribosomes travel along messenger RNAs, reading consecutive groups of three nucleotides called triplets, or codons, and translating each triplet as an amino acid according to the “genetic code,” until they reach a “stop” codon. However, in response to signals in some messenger RNAs, a proportion of ribosomes shift into a different triplet reading frame (a phenomenon termed programmed ribosomal frameshifting), which leads to a different sequence of amino acids, and thus a different protein. We have identified in a virus genome a previously overlooked case of ribosomal frameshifting that exhibits some surprising differences from previously studied examples.

Programmed ribosomal frameshifting (generally by -1 nucleotide) is used by many viruses, including HIV, SARS coronavirus, and Japanese encephalitis virus. Such viruses have very compact genomes, and ribosomal frameshifting provides an economical mechanism for controlling gene expression and, by establishing overlapping genes, increasing the number of proteins that may be produced from a small genome. Thus, ribosomal frameshifting plays a crucial role in the biology and virulence of many viruses (1). The degree to which the genes of humans and other higher organisms utilize frameshifting remains uncertain, though recent work suggests that, in addition to a few well-characterized cases, frameshifting may be used on a much broader scale than previously appreciated (2).

Nearly all verified cases of programmed -1 frameshifting involve two long open reading frames (ORFs, a string of codons uninterrupted by stop codons), where frameshifting at or near the end of one ORF results in translation of a following, or downstream, ORF as a “transframe” fusion with the first ORF. Often, the two alternative protein products are required in a fixed ratio, and frameshifting provides an economical mechanism for achieving this. Although it has been suggested that, in certain cases, the efficiency of frameshifting may vary in a controlled fashion over the course of viral infection, there is currently little evidence for such an effect. This is in stark contrast to a handful of well-characterized cases of +1 frameshifting (mechanistically a very different process) in mammals, fungi, and bacteria, where frameshifting can play a fundamental role in the regulation of gene expression (1).

We performed a comparative computational analysis of encephalomyocarditis virus (EMCV) sequences. EMCV is a member of the Cardiovirus genus, in the Picornaviridae family, whose genomic RNA has been very extensively used to investigate translation. Using this approach, we identified a case of -1 frameshifting where the frameshift ORF is relatively short and, unusually, it is completely contained within the non-frameshift ORF as an “overlapping gene” (Fig. P1). Cardioviruses have an RNA genome that contains a single long ORF that is translated as a “polyprotein” (L-1ABCD-2ABC-3ABCD) and subsequently cleaved to produce 12 separate proteins (3). Like many similar viruses, cardioviruses use an unusual ribosome-dependent mechanism, termed “StopGo” or “stop-carry on,” to separate the L-1ABCD-2A segment of the polyprotein from the 2BC-3ABCD segment as the polyprotein is generated (4). Frameshifting just downstream of the 2A|2B junction results in the production of a transframe protein termed 2B* (Fig. P1). Production of the 2B* protein, and the computationally predicted site of frameshifting, were verified with standard techniques (immunoblotting, immunoprecipitation, and mass spectrometry) using extracts from virus-infected cells. Mutations that abrogated expression of 2B* resulted in a weakened virus that grew more slowly than the normal virus.

Fig. P1.

Fig. P1.

Map of the EMCV genome, showing the polyprotein L-1ABCD-2ABC-3ABCD, the overlapping gene 2B*, and the site of programmed ribosomal frameshifting.

Although the identification of a “hidden” gene in a virus as well-studied as EMCV is an interesting finding, the observation (below) that the frameshifting is dependent on virus infection is of broader significance. To investigate the elements required for frameshifting in EMCV, we cloned the shift site and surrounding sequences and placed them between two genes commonly used in genetic analysis: renilla luciferase and firefly luciferase, genes used as markers because of their luminescent properties. This placement made expression of the firefly luciferase dependent on frameshifting (5). To calculate frameshifting efficiencies, luciferase activities were compared to corresponding controls in which the firefly luciferase is in-frame with renilla luciferase. When these constructs of combined genes were introduced into cells, some of which were subsequently infected with EMCV, efficient frameshifting was only observed in those cells that were infected. This result suggests that the virus produces a factor or modifies the cellular environment to stimulate frameshifting at the EMCV frameshift site. This contrasts with previously studied -1 frameshift cases and highlights the possibility that the frameshifting efficiency may be modulated over the course of viral infection.

This unusual case of frameshifting provides another example of the remarkable diversity of strategies that RNA viruses use to control the expression of their genes. Further study of the unique features may provide new insights into the mechanics of protein synthesis. Moreover, the discovery highlights the potential for “hidden” frameshift ORFs within the genes of humans and other nonviral organisms, and the techniques that might be employed to identify such transframe genes.

Footnotes

The authors declare no conflict of interest.

This article is a PNAS Direct Submission.

See full research article on page E1111 of www.pnas.org.

Cite this Author Summary as: PNAS 10.1073/pnas.1102932108.

References

  • 1.Atkins JF, Gesteland RF, editors. Recoding: Expansion of Decoding Rules Enriches Gene Expression. Heidelberg: Springer; 2010. [Google Scholar]
  • 2.Belew AT, Advani VM, Dinman JD. Endogenous ribosomal frameshift signals operate as mRNA destabilizing elements through at least two molecular pathways in yeast. Nucleic Acids Res. 2011;39:2799–2808. doi: 10.1093/nar/gkq1220. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Palmenberg AC. Proteolytic processing of picornaviral polyprotein. Annu Rev Microbiol. 1990;44:603–623. doi: 10.1146/annurev.mi.44.100190.003131. [DOI] [PubMed] [Google Scholar]
  • 4.Donnelly ML, et al. Analysis of the aphthovirus 2A/2B polyprotein ‘cleavage’ mechanism indicates not a proteolytic reaction, but a novel translational effect: a putative ribosomal ‘skip’. J Gen Virol. 2001;82:1013–1025. doi: 10.1099/0022-1317-82-5-1013. [DOI] [PubMed] [Google Scholar]
  • 5.Grentzmann G, Ingram JA, Kelly PJ, Gesteland RF, Atkins JF. A dual-luciferase reporter system for studying recoding signals. RNA. 1998;4:479–486. [PMC free article] [PubMed] [Google Scholar]

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES