Abstract
The SARS-CoV-2 frameshifting element (FSE), a highly conserved mRNA region required for correct translation of viral polyproteins, defines an excellent therapeutic target against Covid-19. As discovered by our prior graph-theory analysis with SHAPE experiments, the FSE adopts a heterogeneous, length-dependent conformational landscape consisting of an assumed 3-stem H-type pseudoknot (graph motif 3_6), and two alternative motifs (3_3 and 3_5). Here, for the first time, we build and simulate, by microsecond molecular dynamics, 30 models for all three motifs plus motif-stabilizing mutants at different lengths. Our 3_6 pseudoknot systems, which agree with experimental structures, reveal interconvertible L and linear conformations likely related to ribosomal pausing and frameshifting. The 3_6 mutant inhibits this transformation and could hamper frameshifting. Our 3_3 systems exhibit length-dependent stem interactions that point to a potential transition pathway connecting the three motifs during ribosomal elongation. Together, our observations provide new insights into frameshifting mechanisms and anti-viral strategies.
Subject terms: Computational biophysics, Molecular modelling, Viral infection
Modeling the SARS-CoV-2 frameshifting RNA element (FSE), a drug target against Covid-19, the authors analyze the dynamics of three FSE conformations at different lengths to propose a conformational transition pathway and describe anti-viral strategies.
Introduction
In less than three years, COVID-19 through its novel infectious agent SARS-CoV-2 has already caused more than 566 million infections and 6 million deaths worldwide. Although the development of multiple vaccines has provided hope for a post-pandemic world, new virus variants with higher infectivity and increased ability to evade the immune system require us to maintain vigilance. Thus, the identification of novel anti-viral therapeutic targets and development of drugs against them remains a priority.
The single stranded SARS-CoV-2 RNA genome of 29,891 nucleotides includes two overlapping and frame shifted open reading frames ORF1a and 1b, which encode for viral polyproteins that begin the viral protein production. To correctly translate both polypeptides, the virus utilizes programmed –1 ribosomal frameshifting (–1 PRF) to stall and backtrack the ribosome by one nucleotide to bypass the stop codon near the start site of ORF1b.
First discovered in the Rous sarcoma virus in 19851, the –1 PRF stalling of the ribosome is associated with a small (<100-nt) RNA frameshifting element2. SARS-CoV-2 similarly employs such a frameshifting element (FSE) located at the ORF1a/1b junction. This FSE consists of a 7-nt slippery site and a downstream 77-nt stimulatory region, which typically folds into an H-type pseudoknot (Fig. 1). The functional importance and high conservation of the FSE make it a promising candidate for anti-viral drugs and gene therapy; for example, in the latest Omicron variant, there are 31 new mutations in the spike gene region with respect to the previous variants, but no changes in the FSE (Supplementary Fig. 1)3–6. Whether frameshifting is orchestrated by the FSE acting as a road blocker or through more complex conformational switches remains unknown7–13. Hence, exploring the secondary (2D) and tertiary (3D) structural dynamics of the FSE during translation is essential for both untangling the frameshifting mechanism and developing anti-viral strategies.
Unlike the stem-loop structure for HIV-1 FSE14 or the 2-stem pseudoknot for IBV FSE15, the assumed structure for SARS-CoV-2 FSE is a 3-stem H-type pseudoknot, where the Stem 1 loop binds the 3 end to form Stem 2, and Stem 3 lies between them (Fig. 1). This motif has been reported by chemical probing, Cryo-EM, NMR, crystallography3,16–22, and molecular dynamics (MD)23–25. Using our coarse-grained RNA-As-Graphs (RAG) representation as dual graphs26–29, we assign this pseudoknot motif dual graph 3_6 (Fig. 1)3,24. RAG translates double-stranded stems to vertices and single-stranded loops to edges. We use RAG to identify key RNA motifs, design novel RNA motifs from building blocks, and perform inverse folding to transform one RNA motif into another30–35. Recent applications of RAG explored the FSE conformational landscape, including RNA mutations to alter the FSE motif3,24.
Further studies of the SARS-CoV-2 FSE have revealed a complex conformational landscape, with alternative pseudoknots3,19,36,37 as well as unknotted structures3,18,19,38–41 (see3 for a detailed comparison). In particular, our prior modeling and SHAPE chemical reactivity experiments reveal an alternative 3-stem H-type pseudoknot where the Stem 1 loop binds with the 5 end to form a different Stem 2 (3_3 dual graph), and a 3-way junction where the 5 and 3 ends pair (3_5 dual graph)3. The three motifs (3_6, 3_3, and 3_5) have common Stems 1 and 3 (though stem lengths vary) but competing Stem 2 (see Fig. 1).
Moreover, our graph and SHAPE studies have emphasized the length dependence of the FSE motifs: for short lengths such as 77-nt without the slippery site, the 3_6 pseudoknot is the dominant motif, and the 3_5 junction is minor; for longer lengths such as 87-nt and 144-nt, conformations containing the 3_6 pseudoknot become minor, while those containing the 3_3 pseudoknot become dominant3. We calculated these FSE length/motif landscapes using partition functions of 2D structures predicted with SHAPE reactivity restraints, where we term a particular conformation 3_6, 3_3, or 3_5 according to the central 77-nt FSE fold motif3. As in other positive-sense RNA viruses42–44, structural transitions among these three (and other possible) motifs likely exist and play an important role in frameshifting.
Here we employ several computational structure prediction programs to build candidate FSE 3D models and analyze microsecond MD trajectories of the three motifs at three lengths: 77, 87, and 144-nt (Fig. 1). We consider the 19 wildtype trajectories studied here as pieces of the heterogeneous FSE landscape during ribosomal translation. We also study our motif-strengthening mutants (experimentally validated in3) that stabilize each motif over the others (11 trajectories). All starting models and 30 MD trajectories are analyzed with respect to known SHAPE data and available experimental structures.
From the 19 wildtype and 11 mutant FSE trajectories, we identify critical structural features and motions. For the 3_6 pseudoknot, we capture both the L and the linear shapes observed by Cryo-EM and crystallography studies19,20,22,45, with pseudoknot-stabilizing hydrogen bonds. A threaded ring conformation and a structural switch between the L and linear shapes may play a role in ribosomal pausing and frameshifting. Importantly, we can suppress this transition in our sextuple mutant. From the alternative motifs, especially the 3_3 pseudoknot, we find length-dependent stem interactions that suggest a potential FSE transition pathway during ribosomal translation. All these mechanistic insights help suggest frameshifting mechanisms and open new avenues for anti-viral therapy. Namely, small molecules or gene editing mutations in these key regions could hamper frameshifting: 3_6 threading (3 helix end of Stem 1), structural switch (Stem 2/3 junction), and pseudoknot-stabilizing interactions (hydrogen-bonded triplets near Stem 2).
Results
MD model validation and selection
Using five 3D prediction programs, we create 26 initial models compatible with our 3 wildtype FSE motifs3 (3_6, 3_3, and 3_5) at 3 relevant lengths (77, 87, and 144-nt), as listed in Table 1. An initial check for consistent 2D structures with SHAPE data left 23 viable candidates (Supplementary Table 1), which were then subjected to microsecond MD simulations (see “Methods”). MD trajectory convergence and structure validation tests excluded four more cases, with 19 viable trajectories remaining (numbered by superscripts in Table 1). See detailed analyses in SI (Supplementary Figs. 2-8, and Supplementary Tables 2-4).
Table 1.
All FSE systems considered (37) and validated (30) | ||||||
---|---|---|---|---|---|---|
Length | WT 3_6 | WT 3_3 | WT 3_5 | M 3_6 | M 3_3 | M 3_5 |
77-nt | *R1, S2, *I3, V4 | R, S10, *I11, V12 | R16, S17, *I18, *V19 | R, *S, I, V | *I | R, S, *I, V |
87-nt | *R5, S, *I6, V | R, S, *I13, V | N/A | N/A | N/A | N/A |
144-nt | R7, I8, F9 | R, I14, F15 | N/A | *R, I | N/A | N/A |
The 19 convergent and validated wildtype MD trajectories and the 11 mutant trajectories are numbered as indicated by the superscript. Representative trajectories for the Results section are marked with asterisks.
We consider all these trajectories as parts of the heterogeneous FSE conformational landscape, relevant with length-dependent ribosomal interactions. To simplify our presentation below, we select representative systems (marked with asterisks in Table 1) based on multi-trajectory clustering (Fig. 2) and structure validation (Supplementary Table 3). This results in L-shape trajectories 1 and 5 and linear shape trajectories 3 and 6 for 3_6, trajectories 11 and 13 for 3_3, and compact shape trajectory 18 and elongated shape trajectory 19 for 3_5.
For the motif-strengthening mutants, we only consider trajectories from prediction programs that correspond to our 11 validated wildtype systems, and representative systems are chosen based on Stem 2 lengths (Table 1, see details in Supplementary Tables 5-6 and Supplementary Figs. 9-14).
Overview of the three fold motifs
In the following sections, we focus on these representative systems (Table 1). For the 3_6 pseudoknot, we highlight a ring formed by Stem 1 strand and the junctions, and relate the 5 end threading that may impact ribosomal pausing to the L shape model (Fig. 3). We identify hydrogen-bond networks that stabilize the pseudoknot complex (Fig. 3), and compare our models to the experimental structures (Fig. 4)19,20,22,45.
For the alternative 3_3 pseudoknot and 3_5 junction, we discuss length-dependent flanking stem or triplet formation (3_3) and the Stem 2/3 interactions (3_5) that provide insights into FSE transitions (Fig. 5).
Inherent motions and motif-strengthening mutants are discussed in Figs. 6 and 7. Notably, a key structural switch between the L and linear shapes for 3_6 that may send frameshifting signals to the ribosome is suppressed in our mutants.
The combined insights suggest target regions for small-molecule binding and CRISPR gene-editing, as well as a transition pathway connecting the three motifs during ribosomal translation (Fig. 8).
L-shaped and linear 3_6 conformations
In two recent Cryo-EM structures, the 3_6 pseudoknot exhibits an L shape with a bent Stem 3 from the co-axial plane of Stems 1 and 2 (Fig. 3)19,20. Moreover, a ring forms by linking the 3 strand of Stem 1, and the Stem 1/3 and 2/3 junctions, with the 5 end threading through the ring hole, which may hamper ribosomal unwinding19,46.
Here, we observe this threaded L shape in our 77 and 87-nt 3_6 models (Fig. 3, Supplementary Table 4, and Supplementary Fig. 8), stabilized by hydrogen-bond networks. At 77-nt, the Stem 1 loop and the Stem 2/3 junction form a quadruplet to seal the ring top. At 87-nt, another triplet forms between the threaded 5 end and Stem 1 at the ring bottom. Moreover, the flexible 5 end folds into a small helix, which is also seen in our SHAPE probing3 and in the Cryo-EM structure19.
In contrast to this L shape, we also capture linear shape models observed by crystallography22,45 at all FSE lengths, where the 3 stems stack vertically (Fig. 3). A similar ring forms, but the hole is narrower and the 5 end prefers being non-threaded by winding around the structure. Other than ring-stabilizing hydrogen bonds, interactions between the 5 end and Stem 3 help stiffen the junction to maintain the linear shape.
Although 5 end threading is also captured in our linear models and the crystal structure (PDB: 7MLX)22, it is more prevalent in the L shape (Supplementary Table 4). Such threading likely forms before ring closure, as experiments for the 3_6 pseudoknot suggest that Stem 1 forms first and Stem 2 last during FSE folding46. Therefore, it is likely that Stem 3 bending (L shape) is associated with threading to avoid steric clashes.
In both the L and the linear shapes, multiple hydrogen bonds act to stabilize the 3_6 pseudoknot complex (Fig. 3). In the 77-nt L shape, the Stem 1/2 and 2/3 junction residues interact to form a short triplex, which is further extended by binding with Stem 2. This triplex stabilizes the loose junctions and links the 3 end tightly near the Stem 1 loop to maintain the pseudoknot.
In the 77-nt linear shape, Stems 1 and 2 are longer than those in the L shape, but similar triplets are found at the Stem 2 junctions (Fig. 3). These triplets are also seen in the crystal structures22,45, indicating their importance in stabilizing the pseudoknot. Another triplet forms between Stem 2 and the 3 end, anchoring the flexible 3 end.
Comparisons to experimental structures
Comparing our L shape models to the two Cryo-EM structures19,20, we see global structural similarity in Stems 1 and 2 stacking (Fig. 4). The 88-nt Cryo-EM structure (PDB: 6XRZ, resolution 6.9 Å)19 has a wider ring than our 87-nt model, possibly due to a more bent Stem 3. Its 5 end helix is shifted and shorter, and is further away from Stem 3. The 77-nt Cryo-EM mRNA-ribosome complex (PDB: 7O7Z, resolution 5-7 Å)20 has slightly longer Stems 1 and 3 than our 77-nt model, and its 5 end is pulled by the ribosome.
The two crystal structures (PDB: 7LYJ and 7MLX, resolution 2.1 Å)22,45 align well with our 77-nt linear shape model (3-4 Å RMSD). Stem 3 loop is more stretched in the crystal structures, probably because associated residues were mutated to avoid dimerization. Evidence of threading exists in the Roman et al. crystal structure (PDB: 7MLX) but not in the Jones et al. structure (PDB: 7LYJ). Consistent with our comments above, a wider ring hole accompanies the threading to avoid steric clashes.
Overall, our independently developed yet well aligned 3_6 MD structures provide credibility for the following alternative structure modeling.
FSE transitions suggested by length-dependent 3_3 pseudoknot interactions
In our SHAPE experiments, the dominant motif in the FSE landscape shifts from 3_6 to 3_3 pseudoknot when the sequence length increases from 77-nt to 87 and 144-nt3. This alternative 3_3 pseudoknot contains a different Stem 2 formed by the Stem 1 loop and the 5 end. At 77-nt, Stem 2 is short with 3 base pairs; at 87 and 144-nt, upstream residues form 2 additional base pairs for Stem 2, and also a flanking stem SF with the 3 end to further seal the conformation (Fig. 5, more details in Supplementary Figs. 8 and 15).
Consistently, a clear jump occurs for 3_3 Stem 2 hydrogen bond number, when the length increases from 77 to 87-nt, resulting in a stronger Stem 2 of 3_3 than that of 3_6 (Supplementary Fig. 16). A similar trend is observed from the stem interaction energy (Supplementary Fig. 17).
These length-dependent interactions suggest potential motif transitions during ribosomal translation and RNA refolding. Indeed, our 77-nt 3_3 model is identified as an intermediate structure, in which the 3 end residues U74 and U75 form two triplets with 3_3 Stem 2 (Fig. 5). In 3_6, the same 3 end residues pair with Stem 1 loop (A20) to form Stem 2; in 3_5, they pair with the 5 end (G2 and G1) to form Stem 2. Hence, Stem 2 interactions in all three motifs co-exist in our 77-nt 3_3 model, potentially making this state a starting conformation for a transition from 3_3 to 3_6 or 3_5.
For the 87-nt 3_3 systems, the flanking stem SF formed by the 5 and 3 ends blocks Stem 2 of 3_6 and 3_5, and the hydrogen bonding between residue U86 and the Stem 3 base pair C72-G49 keeps the 3 end away from Stem 2 (Fig. 5). In our 144-nt models, additional stems form to avoid the mixed Stem 2 triplets (Supplementary Figs. 8 and 15). Hence, all these interactions, especially stem SF, must be unwound by the ribosome before the 3 end is free to form Stem 2 of 3_6 or 3_5.
Together, these insights suggest the following transition pathway during ribosomal elongation: when the ribosome is far away from the FSE 5 end, the flanking stem SF favors 3_3, but as the ribosome moves to occlude the slippery site, SF is unwound to allow formation of alternative Stem 2, and a transition to 3_6 or 3_5 occurs.
Elongated and compact 3-way 3_5 shapes
The 3_5 3-way junction is a minor motif seen in our SHAPE experiments at 77-nt3, where the 5 and 3 ends base pair to form Stem 2. In our MD simulations, we capture both an elongated and a compact 3_5 conformations (Fig. 5).
In the elongated model, Stems 1 and 2 are co-axially stacked, and Stem 3 is longer. A hydrogen-bond network is formed by 5 residues at the 3-way junction to stabilize the helical arrangements, so that Stem 2 would remain around Stem 3 and not interact with Stem 1 loop to form alternatives.
In the compact model, Stems 1 and 3 are stacked, and the Stem 3 loop bends towards the groove of Stem 2. Similar hydrogen bonds at the junction secure the Stem 2 orientation away from Stem 1. These hydrogen bonds must be broken to allow transition to another motif.
Dominant motions in 3_6: L to linear shape transitions
Using principal component analysis (PCA), we find that the linear 3_6 models are rather stable, while the L models switch between the two shapes, via bending of Stem 3 (Fig. 6, Supplementary Fig. 18). As the two Cryo-EM structures both capture the L shape and the two crystal structures both exhibit the linear shape, we speculate that the FSE is highly dynamic and may switch between these conformations during frameshifting.
Meanwhile, the pseudoknot complex (Stems 1 and 2) and the threaded ring conformation are maintained throughout this motion, so does the ring-holding triplet at bottom.
Consistent with the above motions, we see a peak in the 3_6 root mean square fluctuations (RMSF) in the Stem 3 loop region for all lengths (Fig. 6). The RMSF, the average number of hydrogen bonds (H-bond), and the interaction energies all indicate that Stem 1 is the strongest, followed by Stem 3, and lastly Stem 2 (Supplementary Figs. 16 and 17).
Stretching/bending motions in 3_3
The 3_3 pseudoknot’s dominant motion is a combined contraction and stretching caused by the bending of 3 end and Stem 3 loop (Fig. 6, Supplementary Fig. 18). In this motion, Stems 1 and 2, especially triplets that contain interactions from all three Stem 2 (purple and red residues in Fig. 6), are intact and move in unison. That these triplets are not transient suggests that this may be a stable intermediate during the FSE translation pathway directed by the elongating ribosome.
Comparing to 3_6, we see a higher RMSF peak value in the 3_3 Stem 3 loop region, and more fluctuations in 3_3 Stem 1 region due to the pseudoknot bending.
Stem 3 twisting in 3_5
For the 3_5 junction, both the elongated and the compact models experience bending of Stem 1 loop and twisting of Stem 3 (Fig. 6, Supplementary Fig. 18). As a result, the structure becomes more compact, and Stem 2 is closer to Stem 3. All the hydrogen bonds that lock the Stem 2 orientation (Fig. 5) are maintained, so that a stable 3_5 motif remains. Peak RMSF in the loop regions, and low values in the 5 and 3 ends are notable.
Overall, all three conformations have stable Stem 1, flexible Stem 3 loop, and relatively stable Stem 2 regions. The triplets and hydrogen bonds are mostly maintained throughout the simulations, and this helps stabilize key features such as the ring of 3_6 and the combined Stem 2 interactions in 3_3.
Minimal mutations to stabilize the 3_6 linear shape
Our predicted mutations confirmed by SHAPE probing in our previous study were designed to suppress conformational transitions and stabilize specific motifs over all alternatives, for the 77 and 144-nt 3_6 pseudoknot, 77-nt 3_3 pseudoknot, and 77-nt 3_5 junction (Table 1)3,24. Our dynamics analyses below of these mutants compared to the wildtype trajectories help interrogate the mechanisms and consequences of structural stability.
The 6 mutations in the 77-nt 3_6 pseudoknot-strengthening mutant (PSM) include 4 mutations ([G18A, C19A, C68A, A69C]) that lengthen Stem 2 by up to 4 base pairs (Table 2) and 2 mutations at the 5 end to exclude alternative 3_3 and 3_5 Stem 2. Comparing the mutant with longest Stem 2 (9 base pairs) to its corresponding wildtype model, we observe a dramatic transformation from L shape (wildtype) to linear shape (Fig. 7). Indeed, all 3_6 mutant systems adopt this linear shape (Supplementary Table 6, Supplementary Fig. 14), and the structural switch between the two shapes has been suppressed in most systems (Supplementary Fig. 19).
Table 2.
Trajectory | WT S2 | Mutant S2 | Base pairs involving mutations | Structural features |
---|---|---|---|---|
77-nt 3_6 PSM [G3U, U4A, G18A, C19A, C68A, A69C] | ||||
1 | 4 | 4 | — | Threaded, linear |
2 | 4 | 9 | A18-U76, A19-U75, G25-C69, U26-A68 | Threaded, linear |
3 | 7 | 7 | A19-U75, G25-C69 | Non-threaded, linear |
4 | 4 | 8 | A18-U76, A19-U75, G25-C69, U26-A68 | Non-threaded, linear |
144-nt 3_6 PSM [G40U, U41A, G55A, C56A, C105A, A106C, C137A] | ||||
7 | 4 | 5 | G62-C106 | Non-threaded, linear |
8 | 5 | 4 | A56-U112 | Non-threaded, linear |
77-nt 3_3 PSM [U4C, G71A, G72U] | ||||
11 | 3 | 7 | C4-G21 | No Stem 2 triplets |
77-nt 3_5 Mutant [G72C, U74C] | ||||
16 | 3 | 7 | G1-C74, G3-C72 | Elongated |
17 | 3 | 7 | G1-C74, G3-C72 | T shape |
18 | 4 | 7 | G1-C74, G3-C72 | Elongated |
19 | 3 | 6 | G3-C72 | Elongated |
For each mutant, the mutations, the trajectories as numbered in Table 1, the wildtype and mutant Stem 2 lengths, the newly formed Stem 2 base pairs involving the mutated residues, and the structural features are listed.
For the 144-nt 3_6 PSM, one additional mutation in the downstream region suppresses formation of competing stems3. The central 3_6 pseudoknot region aligns well between the wildtype and mutant systems, both adopting the linear shape (Fig. 7). The major difference occurs in the upstream region: in the wildtype, upstream and downstream stems form on the same side of the central 3_6 pseudoknot; in the mutant, they are on different sides, due to our [G40U, U41A] mutations. From PCA, we see a relatively stable central 3_6 pseudoknot, while quite flexible upstream and downstream stems in the mutant (Supplementary Fig. 19).
As both our 77 and 144-nt 3_6 mutants adopt linear conformations, we hypothesize that this may be a more stable conformation, by separating the 5 and 3 ends further away from each other to avoid alternative 3_3 and 3_5 Stem 2.
3_3 mutant to eliminate alternative Stem 2 interactions
In our 77-nt 3_3 PSM, a large increase of Stem 2 length from 3 to 7 base pairs is induced by mere three mutations [U4C, G71A, G72U] (Table 2, Supplementary Fig. 14). The first mutation enhances the 3_3 Stem 2 and the others avoid alternative 3_6 and 3_5 motifs. The main structural changes are a vertical 5 end between the Stem 1 loop and helix instead of staying horizontal below, compact Stems 1 and 2, and elimination of triplets formed by the 3 end with Stem 2 (Fig. 7). Hence, our mutations stabilize the 3_3 motif. The dominant motion occurs in the Stem 3 region (Supplementary Fig. 19).
Elongated 3_5 mutant
Our 77-nt 3_5 mutant with only 2 mutations [G72C, U74C] also enjoys a considerable enhancement of Stem 2 from 3-4 base pairs to 6-7 (Table 2). Though Stem 2 remains close to Stem 3, the loop region of Stem 3 now stretches out, leading to an elongated conformation (Fig. 7). Moreover, the co-axial stacking changes from Stems 1 and 3 in the wildtype to Stems 1 and 2 in the mutant. Indeed, all four 3_5 mutant models adopt a stacking of Stems 1 and 2, and all resemble the elongated conformation except one (Supplementary Fig. 14). The dominant motion is bending of Stem 1 and 3 loops (Supplementary Fig. 19).
Overall, our enhanced Stem 2 in the three mutants leads to dramatic structural changes, especially for the 77-nt 3_6 and 3_5 systems. PCA analysis reveals stabilization of the linear shape in 3_6 PSM. For the 77-nt 3_3 mutant, triplets associated with possible structural transitions are eliminated.
Discussion
From our 30 molecular dynamics trajectory ensemble of SARS-CoV-2 FSE systems, we can begin to piece together aspects of the complex conformational landscape. In particular, our simulations extend beyond 3D structure models for the prevalent 3_6 pseudoknot in the literature3,16–21,23,24,37,47, by providing the first viable 3D models for the alternative 3_3 pseudoknot and 3_5 junction, as well as the motif-strengthening mutants3. Moreover, guided by our prior SHAPE probing, our analyses of the 3 FSE motifs at different lengths help examine the conformational changes that might occur during ribosomal translation.
Our 3_6 models exhibit two distinct conformations: a threaded L shape that resembles the Cryo-EM structures19,20, and a non-threaded linear shape that aligns well with the crystal structures22,45 (Figs. 3 and 4). An interconversion between the L and linear shapes can be achieved by bending of Stem 3, as revealed by our PCA motion analysis (Fig. 6). Moreover, our mutants stabilize the linear shape and suppress the interconversion (Fig. 7). Importantly, our 3_3 models show length-dependent stem interactions: an intermediate structure containing base triplets from all three alternate Stem 2 at 77-nt, while a classic 3_3 stabilized by flanking stem SF at 87 and 144-nt (Fig. 5). Our 3_3 triple mutant successfully lengthens Stem 2 and eliminates these triplets.
The combined insights suggest the following FSE structural transition pathway relevant to ribosomal translation (Fig. 8). When the ribosome is far away from the FSE region, the dominant conformation is a 3_3 with stem SF. As the ribosome approaches and occludes the slippery site, stem SF is unwound, and the 3 end moves to the 3_3 Stem 2 region to form triplets, initiating the structural transition to 3_6 or 3_5. In this step, the ring starts to close at the Stem 1/3 and 2/3 junctions, and the 5 end can either thread through the ring hole or wind around Stem 3. When the ribosome further elongates, the 5 end becomes completely occluded, and only 3_6 remains viable. If the 5 end is threaded, to avoid steric clashes, Stem 3 would bend to widen the ring hole; if non-threaded, the 5 end would interact with Stem 3 to stiffen the junction and hold the linear shape (Fig. 3).
Such length-varying considerations of RNA are relevant to ribosomal translation and co-transcription in general, where the RNA can fold into different transient structures to accomplish various functions48,49. For SARS-CoV-2, this FSE transition pathway may be associated with regulatory functions. The timescale at which the transitions occur depends on the scale of conformational rearrangements. Base pairing or tertiary structure changes occur on microsecond to second range. Major interconversions between secondary structures occur on millisecond and longer50. Given that the ribosome pauses ~2.8s between translocations51, this time allows for the structural switches and transitions discussed here to occur. Enhanced sampling simulations are required to probe such transitions.
The biophysical insights from our work also suggest three general therapeutic approaches using small-molecule binding and CRISPR gene-editing (Fig. 8). The first anti-viral strategy is to alter the 3_6 pseudoknot plasticity. By mutating residues that form the pseudoknot-stabilizing hydrogen bonds (Stem 1/2 and 2/3 junctions, Fig. 3), we can further strengthen or destroy the pseudoknot. Since conformational plasticity has a large impact on frameshifting efficiency9, this should interrupt the frameshifting process. Indeed, Bhatt et al. achieve a significant reduction in frameshifting efficiency by mutating these junctions20. In our prior SHAPE probing, mutations in this region modify the conformational landscape to 100% 3_63. Both studies underscore the sensitivity of the 3_6 pseudoknot and its associated frameshifting to these junctions, which define good targets for CRISPR gene-editing.
The second approach is to strengthen the 5 end threading in the 3_6 ring conformation. A higher unfolding force is required when threading exists46, so strengthening the threading may increase the mechanical barrier for translation. Recently, two alkaloids (emetine and cephaeline) predicted to bind the threading initiation site were found to inhibit SARS-CoV-2 viral replication52. Hence, the 3 helix end of Stem 1, which we find to seal the ring and initiate threading, defines a target binding region to impede ribosomal translation (Fig. 8).
The third approach is to target the 3_6 pseudoknot structural switch between the L and linear shapes. In the mRNA-ribosome Cryo-EM structure captured during translation20, the L shape 3_6 wedges at the mRNA entry channel and resists unwinding by the helicase, which generates tension on the upstream mRNA20. This structural switch might then enhance fluctuations of this tension and send frameshifting signals to the ribosome. When switching from the L to linear shape, residues in the Stem 2/3 junction are exposed (Fig. 6); small molecules like MTDB10,53 can thus block the switch and hamper frameshifting. Another option is to deploy our 3_6 mutant, which assures a stabilized linear shape (Fig. 7).
In sum, by analyzing the hydrogen bonding interactions and motions of different 3_6 systems, we offer three strategic anti-viral targeting regions: the 3 helix end of Stem 1 and Stem 1/2 and 2/3 junction residues (Fig. 8). Although several drugs/small molecules have been shown to inhibit SARS-CoV-2 frameshifting, including MTDB17,53,54, alkaloids52, Merafloxacin55, Ivacaftor, and Huperzine A56, they are mainly found by high-throughput drug screening, so the underlying inhibition mechanism is unexplained and, in some cases, the binding regions are unknown. Our targeting regions above emerged from mechanistic considerations.
Of course, like every computational approach, there are inherent limitations and approximations in the modeling and simulations performed here. The SHAPE experiments used to deduce the 3 conformers cannot directly reveal the 2D structure, but only provide nucleotide reactivities as restraints for guiding the 2D structure predictions. Our MD trajectories are started from predicted 3D models, but each model is anchored to SHAPE, Cryo-EM, and/or crystallographic data, rendering the results credible (see Fig. 4 and SI). Though microsecond simulations are near the state-of-the-art, each trajectory provides only a local sampling of the complex, multidimensional thermodynamic space. Enhanced sampling simulations could be used to further unravel the complex rugged landscape of the FSE. While we used simple ionic environments to avoid biases, a more extensive investigation with other ions like Mg2+ and K+ is warranted. Nevertheless, taken together, the 30 trajectories here contribute to an emerging view of the FSE conformational landscape as the ribosome elongates.
Our methods and analyses also extend to other viruses and diseases more broadly. mRNA-based therapeutics have already demonstrated success in vaccines and drugs to treat viral diseases and cancers, with the advantage of fast production and flexible design57–59. Highly-conserved functional RNAs like the frameshifting element define good drug targets. With our continuously evolving computational and experimental toolkit for investigating RNA systems at increasing complexity, biophysical approaches will continue to contribute to disease diagnostics and treatment.
Methods
RAG notation and mutations
In our RNA-As-Graphs (RAG) framework, RNA secondary structures containing pseudoknots are represented as dual graphs26. Each stem (≥2 base pairs) denotes a vertex, and every single strand or loop is an edge (hairpins are self-loops; 1-nt bulges, internal loops with two 1-nt strands, and dangling ends are ignored). Every non-isomorphic dual graph is assigned an identifier V_n, where V is the vertex number and n is a unique motif identifier. Our dual graph library consists of over 100,000 unique dual graphs for 2-9 vertices29.
To design RNAs with minimal mutations that make the FSE fold in silico onto a target dual graph, we developed our inverse folding program for dual graphs Dual-RAG-IF24,35. For manually selected mutation regions and a target 2D structure, Dual-RAG-IF uses a genetic algorithm to generate a pool of candidate RNA sequences with mutations. These candidates are screened by 2D prediction programs to ensure the correct graph folding, and are optimized for minimal mutations. Detailed design of the mutants is described in3,24.
FSE lengths and conformations
We model the FSE structure at three sequence lengths: 77-nt without the 7-nt slippery site, 87-nt with the slippery site plus 3 additional residues at the end, and 144-nt with the slippery site plus 30 additional residues at each end. We perform MD simulations for all three conformations for the 77-nt FSE. (Even though the 3_3 pseudoknot was not observed at this length, we study it for comparison with other lengths.) For 87 and 144-nt, we model the 3_6 and the 3_3 conformations, with additional stems formed by the upstream and downstream nucleotides (Fig. 1).
Besides wildtype FSEs, we also model four motif-strengthening mutants predicted previously3,24: 77-nt 3_6 PSM with 6 mutations [G3U, U4A, G18A, C19A, C68A, A69C], 144-nt 3_6 PSM with an additional mutation C137A, 77-nt 3_3 PSM with 3 mutations [U4C, G71A, G72U], and 77-nt 3_5 mutant with 2 mutations [G72C, U74C].
2D and 3D FSE structures
The 2D structure of the wildtype 77-nt 3_6 pseudoknot is predicted by PKNOTS60, and all other 2D conformations are modeled by ShapeKnots from RNAstructure package with SHAPE reactivities incorporated3,61.
Corresponding 3D structures are predicted, with the sequences and the 2D structures as input using RNAComposer62, Vfold3D63, SimRNA64, and iFoldRNA65 for 77 and 87-nt, and RNAComposer, iFoldRNA, and Farfar266 for 144-nt, as SimRNA and Vfold3D failed to produce models for this length (see Table 1). For 3D structure prediction programs that give multiple structures as output, the first structure that retained the correct motif is selected as the initial 3D model.
Initial 3D model validation
We extract 2D structures of the initial predicted models using 3DNA-DSSR67. If the central 77-nt FSE region does not fold into the correct motif (3_6, 3_3, or 3_5), the model is rejected. We also calculate the Hamming distance between each model’s 2D structure and the input (SHAPE) 2D structure. Models with Hamming distances >10 are rejected.
Molecular dynamics details
We use Gromacs 2020.3 and 2020.468, with the Amber OL3 forcefield69. The systems are solvated in the cubic box with TIP3P water model, with a buffer of 10 Å from the RNA molecule70. The systems are first neutralized with sodium ions and set to a 0.1M NaCl bulk concentration with additional Na+ and Cl− ions. The systems are energy minimized via steepest descent and equilibrated under NVT (300 K) and NPT (1 bar and 300 K) ensembles for 100 ps each. Simulations are run with a timestep of 2 fs and a SHAKE-like LINCS algorithm71 with constraints on all atom bonds. The Particle Mesh Ewald method72 is used to treat long-range electrostatics. Production runs are performed for 1 ~ 1.5 μs under NPT to ensure stable RMSD. Structures from the last 500 ns of each simulation are used for analysis.
Clustering is performed on frames every 200 ps for RNA non-H backbone atoms, using the Gromos clustering method with 2, 2.5, 3, and 3.5 Å cutoffs. The largest cluster center structures (cutoff of 2.5 Å for 77-nt and 87-nt systems or 3.5 Å for 144-nt systems) are extracted from MD simulations to show and analyze in Results and Supplementary Information. The cutoffs are chosen to ensure that all simulations for the same dual graph topology produce a feasible number of clusters with outlier structures excluded. See Supplementary Fig. 7 for more details.
PCA motion analysis (on structures every 250 ps), whole system density, calculations of Rg, RMSF, interaction energy (sum of short-term Lennard-Jones and Coulomb interactions) between the two strands within each stem, and the number of hydrogen bonds in each stem are performed via Gromacs 2020.368. The 2D structures are extracted using 3DNA-DSSR67. To compare different MD trajectories of 77-nt 3_6 or 3_5, multi-trajectory clustering is conducted using structures extracted from the last 500 ns simulations by R package bio3d73.
All microsecond MD simulations were conducted on the Prince or Greene supercomputer clusters at the New York University High Performance Computing facilities. Each compute node in the Prince cluster is equipped with two Intel Xeon E5-2690v4 2.6 GHz CPUs (Broadwell, 14 cores/socket, 28 cores/node) and 125 GB memory. Each simulation is performed with seven to eight dedicated nodes (i.e., 196-224 cores), so the simulations complete in 7-10 days. Each compute node in the Greene clusters is equipped with two Intel Xeon Platinum 8268 24C 205W 2.9 GHz CPUs with 48 cores/node and 192 GB memory. Each simulation is performed with 30 nodes using 32 cores each, so that the simulations complete in 2-4 days.
MD trajectory validation
To monitor convergence of the MD trajectories, the system density is examined for NPT ensemble. Next, all simulations are examined for a steady state RMSD74 plateau, and those with large RMSD fluctuations are extended until reaching a steady state over the last 500 ns. To further evaluate structural convergence, time series of Rg and eRMSD are calculated via Gromacs68 and Barnaba75, respectively. The evolution of hydrogen bonds over MD simulations is also used to assess structural stability every 100 ns. The cumulative number of hydrogen bonds is plotted against residue number. It adopts a mountain-like shape, where the number increases as new hydrogen bonds form in the 5 strand of the stem and decreases in the 3 strand.
All stable MD trajectories are further validated for 2D and 3D structures. For wildtype systems, we monitor graph motifs, 2D structures, and 3D clashes. Clashscores are calculated as the number of serious steric clashes per 1000 atoms using MolProbity76. Models having wrong motifs are rejected, and those having Hamming distances >10 or clashscores >5 are tagged. In addition, all the 3_6 models are aligned to the available 4 experimental structures using PyMol77align, with RMSD calculated. To validate the mutant systems, we check both the graph motifs and the 2D structures.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Supplementary information
Acknowledgements
We thank Shereef Elmetwaly for technical assistance and David Ackerman, Stratos Efstathiadis, and Shenglong Wang from the NYU High-Performance Computing facilities for providing our group dedicated resources to perform this work.
We gratefully acknowledge funding from the National Science Foundation RAPID Award 2030377 from the Division of Mathematical Science and the Division of Chemistry, National Science Foundation Award DMS-2151777 from the Division of Mathematical Sciences, National Institutes of Health R35GM122562 Award from the National Institute of General Medical Sciences, and Philip-Morris International to T. Schlick.
Author contributions
T.S. conceived the project and supervised the study. Q.Z. predicted FSE 2D structures, S.J. predicted initial 3D models, S.J. and S.Y. performed molecular dynamics simulations, and S.Y. and Q.Z. validated 3D models and MD trajectories. All authors analyzed the data and wrote the manuscript, and Q.Z. and S.Y. prepared the figures.
Peer review
Peer review information
Nature Communications thanks Karissa Sanbonmatsu and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Data availability
Complete data are present in the paper and/or associated Supplementary Materials. Additional information including initial 3D prediction models and validated trajectory cluster centers are shared in Zenodo (10.5281/zenodo.6625172).
Experimental RNA structures analysed in this paper are available in the Protein Data Bank under accession codes 6XRZ, 7O7Z, 7LYJ, and 7MLX.
Code availability
The codes used to discover motif-strengthening mutations are available in the GitHub Schlicklab repository (https://github.com/Schlicklab/Dual-RAG-IF).
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Shuting Yan, Qiyao Zhu.
Supplementary information
The online version contains supplementary material available at 10.1038/s41467-022-31353-w.
References
- 1.Jacks T, Varmus H. Expression of the rous sarcoma virus pol gene by ribosomal frameshifting. Science. 1985;230:1237–1242. doi: 10.1126/science.2416054. [DOI] [PubMed] [Google Scholar]
- 2.Brierley I, et al. An efficient ribosomal frame-shifting signal in the polymerase-encoding region of the coronavirus IBV. EMBO J. 1987;6:3779–3785. doi: 10.1002/j.1460-2075.1987.tb02713.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Schlick T, et al. To knot or not to knot: Multiple conformations of the SARS-CoV-2 frameshifting RNA element. J. Amer. Chem. Soc. 2021;143:11404–11422. doi: 10.1021/jacs.1c03003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Kelly J, Woodside M, Dinman J. Programmed − 1 ribosomal frameshifting in coronaviruses: A therapeutic target. Virology. 2021;554:75–82. doi: 10.1016/j.virol.2020.12.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Dinman J, Ruiz-Echevarria M, Czaplinski K, Peltz S. Peptidyl-transferase inhibitors have antiviral properties by altering programmed − 1 ribosomal frameshifting efficiencies: Development of model systems. Proc. Nat. Acad. Sci., USA. 1997;94:6606–6611. doi: 10.1073/pnas.94.13.6606. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Kinzy T, et al. New targets for antivirals: The ribosomal A-site and the factors that interact with it. Virology. 2002;300:60–70. doi: 10.1006/viro.2002.1567. [DOI] [PubMed] [Google Scholar]
- 7.Lopinski J, Dinman J, Bruenn J. Kinetics of ribosomal pausing during programmed − 1 translational frameshifting. Mol. Cell. Biol. 2000;20:1095–1103. doi: 10.1128/MCB.20.4.1095-1103.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Namy O, Moran S, Stuart D, Gilbert R, Brierley I. A mechanical explanation of rna pseudoknot function in programmed ribosomal frameshifting. Nature. 2006;441:244–247. doi: 10.1038/nature04735. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Ritchie D, Foster D, Woodside M. Programmed − 1 frameshifting efficiency correlates with RNA pseudoknot conformational plasticity, not resistance to mechanical unfolding. Proc. Nat. Acad. Sci., USA. 2012;109:16167–16172. doi: 10.1073/pnas.1204114109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Ritchie D, Soong J, Sikkema W, Woodside M. Anti-frameshifting ligand reduces the conformational plasticity of the SARS virus pseudoknot. J. Amer. Chem. Soc. 2014;136:2196–2199. doi: 10.1021/ja410344b. [DOI] [PubMed] [Google Scholar]
- 11.Kim H, et al. A frameshifting stimulatory stem loop destabilizes the hybrid state and impedes ribosomal translocation. Proc. Nat. Acad. Sci. USA. 2014;111:5538–5543. doi: 10.1073/pnas.1403457111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Chen J, et al. Dynamic pathways of − 1 translational frameshifting. Nature. 2014;512:328–332. doi: 10.1038/nature13428. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Caliskan N, Katunin V, Belardinelli R, Peske F, Rodnina M. Programmed − 1 frameshifting by kinetic partitioning during impeded translocation. Cell. 2014;157:1619–1631. doi: 10.1016/j.cell.2014.04.041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Parkin N, Chamorro M, Varmus H. Human immunodeficiency virus type 1 gag-pol frameshifting is dependent on downstream mRNA secondary structure: demonstration by expression in vivo. Virol. J. 1992;66:5147–5151. doi: 10.1128/jvi.66.8.5147-5151.1992. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Brierley I, Digard P, Inglis S. Characterization of an efficient coronavirus ribosomal frameshifting signal: Requirement for an RNA pseudoknot. Cell. 1989;57:537–547. doi: 10.1016/0092-8674(89)90124-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Wacker A, et al. Secondary structure determination of conserved SARS-CoV-2 RNA elements by NMR spectroscopy. Nucleic Acids Res. 2020;48:12415–12435. doi: 10.1093/nar/gkaa1013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Kelly J, et al. Structural and functional conservation of the programmed − 1 ribosomal frameshift signal of SARS coronavirus 2 (SARS-CoV-2) J. Biol. Chem. 2020;295:10741–10748. doi: 10.1074/jbc.AC120.013449. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Lan T, et al. Secondary structural ensembles of the Sars-Cov-2 RNA genome in infected cells. Nat. Commun. 2022;13:1128–1128. doi: 10.1038/s41467-022-28603-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Zhang K, Zheludev I, Hagey R, Haslecker R, et al. Cryo-EM and antisense targeting of the 28-kDa frameshift stimulation element from the SARS-CoV-2 RNA genome. Nat. Struct. Mol. Biol. 2021;28:747–754. doi: 10.1038/s41594-021-00653-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Bhatt P, et al. Structural basis of ribosomal frameshifting during translation of the SARS-CoV-2 RNA genome. Science. 2021;372:1306–1313. doi: 10.1126/science.abf3546. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Ziv O, et al. The short- and long-range RNA-RNA interactome of SARS-CoV-2. Mol. Cell. 2020;80:1067–1077.e5. doi: 10.1016/j.molcel.2020.11.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Roman C, Lewicka A, Koirala D, Li N, Piccirilli J. The SARS-CoV-2 programmed − 1 ribosomal frameshifting element crystal structure solved to 2.09 Å using chaperone-assisted RNA crystallography. ACS Chem. Biol. 2021;16:1469–1481. doi: 10.1021/acschembio.1c00324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Omar S, et al. Modeling the structure of the frameshift-stimulatory pseudoknot in SARS-CoV-2 reveals multiple possible conformers. PLOS Comput. Biol. 2021;17:e1008603. doi: 10.1371/journal.pcbi.1008603. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Schlick T, Zhu Q, Jain S, Yan S. Structure-altering mutations of the SARS-CoV-2 frameshifting RNA element. Biophys. J. 2021;120:1040–1053. doi: 10.1016/j.bpj.2020.10.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Rangan R, et al. De novo 3D models of SARS-CoV-2 RNA elements from consensus experimental secondary structures. Nucleic Acids Res. 2021;49:3092–3108. doi: 10.1093/nar/gkab119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Gan H, et al. RAG: RNA-As-Graphs database–concepts, analysis, and features. Bioinformatics. 2004;20:1285–1291. doi: 10.1093/bioinformatics/bth084. [DOI] [PubMed] [Google Scholar]
- 27.Zahran M, Bayrak C, Elmetwaly S, Schlick T. RAG-3D: a search tool for RNA 3D substructures. Nucleic Acids Res. 2015;43:9474–9488. doi: 10.1093/nar/gkv823. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Baba N, Elmetwaly S, Kim N, Schlick T. Predicting large RNA-like topologies by a knowledge-based clustering approach. J. Mol. Biol. 2016;428:811–821. doi: 10.1016/j.jmb.2015.10.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Jain S, Saju S, Petingi L, Schlick T. An extended dual graph library and partitioning algorithm applicable to pseudoknotted rna structures. Methods. 2019;162:74–84. doi: 10.1016/j.ymeth.2019.03.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Jain S, Bayrak C, Petingi L, Schlick T. Dual graph partitioning highlights a small group of pseudoknot-containing RNA submotifs. Genes. 2018;9:371. doi: 10.3390/genes9080371. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Schlick T. Adventures with RNA graphs. Methods. 2018;143:16–33. doi: 10.1016/j.ymeth.2018.03.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Jain S, Schlick T. F-RAG: Generating atomic models from RNA graphs using fragment assembly. J. Mol. Biol. 2017;429:3587–3605. doi: 10.1016/j.jmb.2017.09.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Jain S, Laederach A, Ramos S, Schlick T. A pipeline for computational design of novel RNA-like topologies. Nucleic Acids Res. 2018;46:7040–7051. doi: 10.1093/nar/gky524. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Zhu Q, Schlick T. A Fiedler vector scoring approach for novel RNA motif selection. J. Phys. Chem. 2021;125:1144–1155. doi: 10.1021/acs.jpcb.0c10685. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Jain S, Tao Y, Schlick T. Inverse folding with RNA-as-graphs produces a large pool of candidate sequences with target topologies. J. Struct. Biol. 2020;209:107438. doi: 10.1016/j.jsb.2019.107438. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Huston N, et al. Comprehensive in vivo secondary structure of the SARS-CoV-2 genome reveals novel regulatory motifs and mechanisms. Mol. Cell. 2021;81:584–598.e5. doi: 10.1016/j.molcel.2020.12.041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Trinity, L., Wark, I., Lansing, L., Jabbari, H. & Stege, U. Shapify: Pathways to SARS-CoV-2 frameshifting pseudoknot. Research Square, doi: 10.21203/rs.3.rs-1370718/v1, preprint posted March 2022 .
- 38.Manfredonia I, et al. Genome-wide mapping of SARS-CoV-2 RNA structures identifies therapeutically-relevant elements. Nucleic Acids Res. 2020;48:12436–12452. doi: 10.1093/nar/gkaa1053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Ahmed F, et al. A comprehensive analysis of cis-acting RNA elements in the SARS-CoV-2 genome by a bioinformatics approach. Front. Genet. 2020;11:1385. doi: 10.3389/fgene.2020.572702. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Andrews R, et al. A map of the SARS-CoV-2 RNA structurome. NAR Genom. Bioinform. 2021;3:lqab043. doi: 10.1093/nargab/lqab043. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Iserman C, et al. Genomic RNA elements drive phase separation of the SARS-CoV-2 nucleocapsid. Mol. Cell. 2020;80:1078–1091. doi: 10.1016/j.molcel.2020.11.041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Kuhlmann M, Chattopadhyay M, Stupina V, Gao F, Simon A. An RNA element that facilitates programmed ribosomal readthrough in Turnip Crinkle Virus adopts multiple conformations. Virol. J. 2016;90:8575–8591. doi: 10.1128/JVI.01129-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Moomau C, Musalgaonkar S, Khan Y, Jones J, Dinman J. Structural and functional characterization of programmed ribosomal frameshift signals in West Nile virus strains reveals high structural plasticity among cis-acting RNA elements. J. Biol. Chem. 2016;291:15788–15795. doi: 10.1074/jbc.M116.735613. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Houck-Loomis B, Durney M, Salguero C, et al. An equilibrium-dependent retroviral mRNA switch regulates translational recoding. Nature. 2011;480:561–564. doi: 10.1038/nature10657. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Jones C, Ferre-D’amare A. Crystal structure of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) frameshifting pseudoknot. RNA. 2022;28:239–249. doi: 10.1261/rna.078825.121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Neupane K, Zhao M, Lyons A, et al. Structural dynamics of single SARS-CoV-2 pseudoknot molecules reveal topologically distinct conformers. Nat. Commun. 2021;12:4749. doi: 10.1038/s41467-021-25085-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Rangan R, et al. RNA genome conservation and secondary structure in SARS-CoV-2 and SARS-related viruses: a first look. RNA. 2020;26:937–959. doi: 10.1261/rna.076141.120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Lai D, Proctor J, Meyer I. On the importance of cotranscriptional RNA structure formation. RNA. 2013;19:1461–1473. doi: 10.1261/rna.037390.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Feng S, et al. Alternate rRNA secondary structures as regulators of translation. Nat. Struct. Mol. Biol. 2011;18:169–176. doi: 10.1038/nsmb.1962. [DOI] [PubMed] [Google Scholar]
- 50.Mustoe A, Brooks C, Al-Hashimi H. Hierarchy of RNA Functional Dynamics. Annu. Rev. Biochem. 2014;83:441–466. doi: 10.1146/annurev-biochem-060713-035524. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Wen J, et al. Following translation by single ribosomes one codon at a time. Nature. 2008;452:598–603. doi: 10.1038/nature06716. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Ren P, Shang W, Yin W, et al. A multi-targeting drug design strategy for identifying potent anti-SARS-CoV-2 inhibitors. Acta Pharmacol. Sin. 2022;43:483–493. doi: 10.1038/s41401-021-00668-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Park S, Kim Y, Park H. Identification of RNA pseudoknot-binding ligand that inhibits the − 1 ribosomal frameshifting of SARS-coronavirus by structure-based virtual screening. J. Amer. Chem. Soc. 2011;133:10094–10100. doi: 10.1021/ja1098325. [DOI] [PubMed] [Google Scholar]
- 54.Neupane K, et al. Anti-frameshifting ligand active against SARS coronavirus-2 is resistant to natural mutations of the frameshift-stimulatory pseudoknot. J. Mol. Biol. 2020;432:5843–5847. doi: 10.1016/j.jmb.2020.09.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Sun Y, et al. Restriction of SARS-CoV-2 replication by targeting programmed − 1 ribosomal frameshifting. Proc. Natl. Acad. Sci. USA. 2021;118:e2023051118. doi: 10.1073/pnas.2023051118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Chen Y, et al. A drug screening toolkit based on the − 1 ribosomal frameshifting of SARS-CoV-2. Heliyon. 2020;6:e04793. doi: 10.1016/j.heliyon.2020.e04793. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Desterro J, Bak-Gordon P, Carmo-Fonseca M. Targeting mRNA processing as an anticancer strategy. Nat. Rev. Drug Discov. 2020;19:112–129. doi: 10.1038/s41573-019-0042-3. [DOI] [PubMed] [Google Scholar]
- 58.Fiedler K, Lazzaro S, Lutz J, Rauch S, Heidenreich R. mRNA cancer vaccines. Recent Results Cancer Res. 2016;209:61–85. doi: 10.1007/978-3-319-42934-2_5. [DOI] [PubMed] [Google Scholar]
- 59.Sahin U, Karikó K, Töreci Ö. mRNA-based therapeutics — developing a new class of drugs. Nat. Rev. Drug Discov. 2014;13:759–780. doi: 10.1038/nrd4278. [DOI] [PubMed] [Google Scholar]
- 60.Rivas E, Eddy S. A dynamic programming algorithm for RNA structure prediction including pseudoknots. J. Mol. Biol. 1999;285:2053–2068. doi: 10.1006/jmbi.1998.2436. [DOI] [PubMed] [Google Scholar]
- 61.Hajdin C, et al. Accurate SHAPE-directed RNA secondary structure modeling, including pseudoknots. Proc. Natl. Acad. Sci. USA. 2013;110:5498–5503. doi: 10.1073/pnas.1219988110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Biesiada M, Purzycka K, Szachniuk M, Blazewicz J, Adamiak R. Automated RNA 3D Structure Prediction with RNAComposer. Methods Mol. Biol. 2016;1490:199–215. doi: 10.1007/978-1-4939-6433-8_13. [DOI] [PubMed] [Google Scholar]
- 63.Xu X, Chen S. Hierarchical assembly of RNA three-dimensional structures based on loop templates. J. Phys. Chem. B. 2018;122:5327–5335. doi: 10.1021/acs.jpcb.7b10102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Boniecki M, et al. SimRNA: a coarse-grained method for RNA folding simulations and 3D structure prediction. Nucleic Acids Res. 2016;44:e63–e63. doi: 10.1093/nar/gkv1479. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Krokhotin A, Houlihan K, Dokholyan N. iFoldRNA v2: folding RNA with constraints. Bioinformatics. 2015;31:2891–2893. doi: 10.1093/bioinformatics/btv221. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Watkins A, Rangan R, Das R. FARFAR2: improved de novo rosetta prediction of complex global RNA folds. Structure. 2020;28:963–976.e6. doi: 10.1016/j.str.2020.05.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Lu X, Bussemaker H, Olson W. DSSR: an integrated software tool for dissecting the spatial structure of RNA. Nucleic Acids Res. 2015;43:e142–e142. doi: 10.1093/nar/gkv541. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Abraham M, Murtola T, Schulz R, et al. Gromacs: High performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX. 2015;1-2:19–25. doi: 10.1016/j.softx.2015.06.001. [DOI] [Google Scholar]
- 69.Zgarbová M, et al. Refinement of the Cornell et al. nucleic acids force field based on reference quantum chemical calculations of glycosidic torsion profiles. J. Chem. 2011;7:2886–2902. doi: 10.1021/ct200162x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Jorgensen W, Chandrasekhar J, Madura J, Impey R, Klein M. Comparison of simple potential functions for simulating liquid water. J. Chem. Phys. 1983;79:926–935. doi: 10.1063/1.445869. [DOI] [Google Scholar]
- 71.Hess B, Bekker H, Berendsen H, Fraaije J. Lincs: A linear constraint solver for molecular simulations. J. Comput. Chem. 1997;18:1463–1472. doi: 10.1002/(SICI)1096-987X(199709)18:12<1463::AID-JCC4>3.0.CO;2-H. [DOI] [Google Scholar]
- 72.Essmann U, et al. A smooth particle mesh ewald method. J. Chem. Phys. 1995;103:8577–8593. doi: 10.1063/1.470117. [DOI] [Google Scholar]
- 73.Grant B, Rodrigues A, ElSawy K, McCammon J, Caves L. Bio3d: an R package for the comparative analysis of protein structures. Bioinformatics. 2006;22:2695–2696. doi: 10.1093/bioinformatics/btl461. [DOI] [PubMed] [Google Scholar]
- 74.Bottaro S, Di Palma F, Bussi G. The role of nucleobase interactions in RNA structure and dynamics. Nucleic Acids Res. 2014;42:13306–13314. doi: 10.1093/nar/gku972. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Bottaro S, et al. Barnaba: software for analysis of nucleic acid structures and trajectories. RNA. 2019;25:219–231. doi: 10.1261/rna.067678.118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Williams C, Headd J, Moriarty N, Prisant M, et al. Molprobity: More and better reference data for improved all-atom structure validation. Protein Sci. 2018;27:293–315. doi: 10.1002/pro.3330. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Schrödinger, LLC. The PyMOL molecular graphics system, version 1.8 (2015) .
- 78.Brierley I, Pennell S, Gilbert R. Viral RNA pseudoknots: versatile motifs in gene expression and replication. Nat. Rev. Microbiol. 2007;5:598–610. doi: 10.1038/nrmicro1704. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Complete data are present in the paper and/or associated Supplementary Materials. Additional information including initial 3D prediction models and validated trajectory cluster centers are shared in Zenodo (10.5281/zenodo.6625172).
Experimental RNA structures analysed in this paper are available in the Protein Data Bank under accession codes 6XRZ, 7O7Z, 7LYJ, and 7MLX.
The codes used to discover motif-strengthening mutations are available in the GitHub Schlicklab repository (https://github.com/Schlicklab/Dual-RAG-IF).