Abstract
Viroids, the smallest know plant pathogens, consist solely of a circular, single-stranded, non-coding RNA. Thus for all of their biological functions, like replication, processing, and transport, they have to present sequence or structural features to exploit host proteins. Viroid binding protein 1 (Virp1) is indispensable for replication of pospiviroids, the largest genus of viroids, in a host plant as well as in protoplasts. Virp1 is known to bind at two sites in the terminal right (TR) domain of pospiviroids; each site consists of a purine- (R-) and a pyrimidine- (Y-)rich motif that are partially base-paired to each other. Here we model the important structural features of the domain and show that it contains an internal loop of two Y · Y cis Watson-Crick/Watson-Crick (cWW) pairs, an asymmetric internal loop including a cWW and a trans Watson/Hoogsteen pair, and a thermodynamically quite stable hairpin loop with several stacking interactions. These features are discussed in connection to the known biological functions of the TR domain.
Introduction
Viroids, the smallest RNA pathogens known, consist of a circular, non-coding, single-stranded RNA (ssRNA) with a length of about 250–400 nucleotides depending on the viroid species1, 2. They are replicated in their plant hosts in an RNA-to-RNA rolling circle mechanism using either an asymmetric or a symmetric pathway3. In either pathway, replication includes a processing step of cleaving oligomeric replication intermediates to molecules of unit length. This step proceeds either by a viroid-internal ribozyme or by proteinaceous nucleases of the host. Based on this feature, viroids are classified into two families4: members of Pospiviroidae possess a thermodynamically stable rod-like secondary structure (Fig. 1a), replicate in the nucleus, and do not self-cleave; members of Avsunviroidae replicate in the chloroplast and self-cleave via a hammerhead ribozyme. The families are named after their respective type members potato spindle tuber viroid (PSTVd) and avocado sun blotch viroid. Pospiviroidae are divided into the genera Pospi-, Hostu-, Cocad-, Apsca-, and Coleviroids, based on sequence identity and biological properties4. All information for replication by cellular enzymes is included in structural features of the circular viroid RNA, which is abundant in the plant, and the less abundant oligomeric linear ssRNA replication intermediates of (+) and (−) polarity.
Only a few proteins or enzymes involved in the replication cycle of viroids are known. For example, members of Pospiviroidae and Avsunviroidae are transcribed by DNA-dependent RNA polymerase II and chloroplastic nuclear-encoded polymerase, respectively5, 6. Note that both enzymes are redirected to use ssRNA as template. Ligation of unit-length monomers to circles is supported by DNA ligase in case of PSTVd7 and by chloroplastic plant tRNA ligase for members of the Avsunviroidae family8, respectively.
Virp1 (Viroid binding protein 1) was isolated by screening a tomato cDNA expression library with PSTVd RNA9. The protein binds specifically (+) sense RNA of Pospi- and Hostuviroid members, but not of Apscaviroid and Avsunviroidae members10. Using electrophoretic mobility shift assays (EMSA) and the three-hybrid interaction system, the PSTVd region binding to the protein was determined: this is the terminal right (TR) domain of the rod like structure11 (Fig. 1a,b). Two binding sites, named RY motifs (Fig. 1), composed of the sequence elements 5′ ACAGG and 5′ CCUUCUC were found. In the RY motif close to the terminal right hairpin loop, the 3′ part of the purine-rich R sequence is paired to the 5′ part of the pyrimidine-rich Y sequence12; the same pairing for the inner RY motif is only present in suboptimal foldings. The binding affinity of Virp1 was about fivefold stronger to the terminal RY motif than to the inner motif. Furthermore, Virp1 was shown to be indispensable for replication of PSTVd and citrus exocortis viroid (CEVd), a further Pospiviroid, in Nicotiana benthamiana host plants as well as in protoplasts13. Similarly, the nuclear import of the satellite RNA of cucumber mosaic virus was shown to depend on Virp114. Furthermore, the transcription factor TFIIIA binds to the TR in the region G167–U176 15.
Virp1 (see Supplementary Fig. S1) is a member of the bromodomain(s) and extraterminal domain (BET) protein family including a nuclear localization signal. BET proteins are known to be involved in chromatin biology and transcriptional regulation16, 17. The bromodomain recognizes acetyl-lysine residues in histones and other proteins18, 19. The ET domain consists of three regions; the conserved N-terminal ET (NET) region has an acidic patch that may interact with other proteins or nucleic acids20. The RNA binding domain of Virp1 has been localized to the C-terminal half of the protein9, 12.
The knowledge on the three-dimensional structure of viroids is sparse. At least for PSTVd the presence of a tertiary structure (in the sense of tRNA-L-conformation) was excluded with certainty21. The left terminal part of the TL domain was shown by nuclear magnetic resonance (NMR) spectroscopy to be rod-like but not bifurcated22; that is, this study supported secondary structure predictions but delivered no three-dimensional details. Loop 6 (Fig. 1a) has the sequence 5′ G36AC 3′/5′ C322GA 3′ flanked by a U:G wobble pair and a cis Watson-Crick/Watson-Crick (cWW) G:C pair; according to structural modelling, the loop motif is a set of three stacked non-WC pairs23. This motif mediates trafficking from palisade to spongy mesophyll in N. benthamiana. Loop 7 (Fig. 1a) is required to traffic from nonvascular into the vascular tissue phloem to initiate systemic infection. According to modelling, the loop nucleotides U43 · C317 are a water-inserted cWW base pair flanked by canonical cWW base pairs24. Loop 15 of PSTVd (Fig. 1a) is similar to loop E (sarcin/ricin motif) of eukaryotic 5 S rRNA25–27. A detailed analysis of this loop, mainly based on isostericity matrices, allowed Zhong et al.28 to design disruptive as well as compensatory mutations that are critical in rolling-circle replication29–31.
We created three-dimensional RNA models of the terminal right domain of several circular viroids, using a variety of recently published web services, programs and databases, for example FARFAR/Rosetta32–34, RNA Bricks35, FRAbase (RNA FRAgments search engine & dataBASE)36, based on sequence and two-dimensional structural similarity to known three-dimensional RNA structures. These programs were selected for use mainly based on their complementarity (assembly of 3-nt fragments plus refinement versus search for similar structural elements using sequence and structure patterns) from a palette of available tools37–42. The predicted models encompassed three loops: a symmetrical basepaired 2×2 loop, an asymmetrical internal loop, and a hairpin loop both stabilized by hydrogen bonding and stacking interactions. On the one hand this predicted structure should provide an extraordinary platform for protein and/or nucleic acid binding in trans, on the other this model should facilitate further experimental and structural studies of the TR interaction with Virp1 and TFIIIA.
Results
Secondary structure prediction of TR domain
To get an overview on available TR sequences from members of Pospiviroid and their secondary structures, we assembled alignments of full-length sequences for each species of Pospiviroid and extracted the TR domain. For example, all unique sequences of PSTVd were aligned and a consensus structure was predicted; Fig. 1b details the TR domain of the PSTVd consensus structure. The TR stem-loop consisted of six well-defined helices but the sequences of loops more distant from the hairpin loop were difficult to align, especially if sequences of further Pospiviroid members were included. This is of concern only to the Virp1 binding site of lower affinity (nts 149–153/202–206), while the binding site of higher affinity (nts 170–174/183–187) is located in the more easily alignable region of the terminal stem-loop (nts 161–198; Fig. 1c). We will concentrate in the following on this terminal stem-loop that consists of three helices separated by two internal loops (IL1 and IL2) and the hairpin loop (HP). The final alignment of terminal stem-loop sequences (see Supplementary Fig. S2), on which the consensus structure (Fig. 1c) and the structure logo (Fig. 2) are based, contained 95 different sequences of pospiviroids.
The nucleotides in the two helices close to the HP are well conserved in sequence (marked by b and c in Fig. 2) but show some covariation marked by green and blue background in Fig. 1c. The sequence of the terminal HP loop is 5′ UUUC in most pospiviroids, with the exception of CEVd and IrVd that have 5′ CUCGW (W is A or U). The sequence of IL1 is 5′ GA/GC in CEVd and 5′ CW/UU in PCFVd, PSTVd, and most CLVd. The sequence of IL2 is 5′ C/UCC, conserved in most pospiviroids. This degree of conservation in sequence as well as in secondary structure of the TR stem-loop gave some confidence as a basis for modelling.
Results from Rosetta modelling
We predicted models of IL1 (Fig. 3), IL2 (Fig. 4), and HP (Fig. 5) within the TR stem-loop (Fig. 1c) with the Rosetta web server (ROSIE/FARFAR/RNA De Novo). Fragment assembly for the full TR stem-loop showed only a limited convergence; with shorter structural elements, like the single loops, 1,000 generated structures and 10,000 MC cycles were usually sufficient to generate reasonable best-scoring structures. As input to the web server, we used the respective parts of the consensus structure (in bracket-dot notation) but sequences from individual viroid variants (for examples see Fig. 3, top row).
Internal loop 1 (IL1)
The IL1 loop typical for PSTVd is a symmetric internal loop of 2×2 nucleotides (10× CC/UU; Fig. 1b; Fig. 3a, top row). Other loop nucleotides also occur frequently (Fig. 2): 15× CU/UU (Fig. 3c); 2× UU/UU; a 1×1 loop of C/U followed by an A:U pair (13×; Fig. 3b); a G:C pair followed by 1×1 loop of A/G is typical for CEVd (19×). The loop is closed by G:C cWW pairs but also other closing pairs appear quite often. The loop nucleotides form standard cWW pairs in all models of these sequence variants (Fig. 3, middle row). Thus other cWW pairs should be able to substitute: U · C, G · A, A · G, and A · A are isosterically close to C · U; G · U, A · C, and C · C are isosterically close to U · U43, 44. These isosteric pairs, however, do not occur in viroids. Only a restricted set of pair combinations are able to retain the same backbone geometry according to RNAredesign45, 46; this is obvious from the logos shown in Fig. 3 bottom. For example, CC/UU might be replaced by UU/CC or CU/UU by UU/UC. Thus, the viroid-specific combinations of nucleotides might be either essential for interaction with Virp1, or the reduced stability of IL1—in comparison to an IL1 composed of purines with their increased stacking—is critical.
The stability of such tandem “mismatches” is partially known47, 48: the tandem U · U stabilizes a helix when flanked by G · C pairs, as in case of viroids, but is less stable than symmetric tandem G · U or A · G pairs. The pyrimidine base-pairs led to a contraction of the sugar phosphate backbone of the opposing strands in comparison to standard cWW pairs; this might be of importance for protein binding.
Internal loop 2 (IL2)
Predicted IL2 loop nucleotides were highly connected by a complex network of base/base hydrogen bonds and stacking interactions (Fig. 4a). An example of these interactions is shown in Fig. 4b: C187 · A172:U185 form a triple pair consisting of a cWW A:U and a trans Watson/Sugar edge (tWS) C · A.
According to RNAredesign, U186 and C187 are critical for the backbone geometry of the loop (Fig. 4c). The cWW pair C171 · C188 is replaced by a U · U in 22% of structures, which is the only near-isosteric pair to C · C44.
Hairpin loop (HP)
The most common HP loop is UUUC closed by an U:A pair (Fig. 5a); the HP loop of CEVd and IrVd is CUCGW closed by a G:C pair (see Supplementary Fig. S2). The predicted HP loop structures were characterized by internal interactions of loop nucleotides. For the most common HP loop, Rosetta predicted a conformation with a trans Sugar/Hoogsteen (tSH) pair U177:C180 next to the loop-closing cWW U:A pair and stacking of the two last loop nucleotides (Fig. 5a). The exceptional HP loop of a TCDVd (Fig. 5b) was very similar with a tSH pair U177:C180 next to the loop-closing A:U pair and stacking of the two last loop nucleotides. For the HP loop of CEVd, Rosetta predicted a quite similar structure, despite the five loop nucleotides instead of the tetra-loops in most other cases: the fifth nucleotide of the loop was bulged out (see Supplementary Fig. S3b).
RNAredesign suggested quite different sequences for all loop models shown in Fig. 5 and S3: the optimized sequences contained more purines than the natural sequences, which might point on the one hand to increased thermodynamic stability of the optimized sequences due to increased stacking interactions, and on the other to functional constraints of the natural sequences. Such constraints might be an adaptation to stability requirements and restrictions induced by binding to other molecules.
Structural similarities in RNABricks and FRAbase
The Rosetta models, as depicted above, are based on assembly of short fragments from known structures. These fragments might include nucleotides that interact with additional ligands like a protein or another RNA. That is, “unusual” conformations of nucleotides in the final models might be due to absent ligands. In contrast, if a larger fragment of the target is found in a known (template) structure including ligands, one might get insight on the basis of such “unusual” conformations and requirements for interactions. So we searched for structural elements similar to the terminal stem-loop in RNABricks35 and FRAbase36.
IL1
An internal loop with a 5′ CU/5′ UU motif, frequent in pospiviroids, is described as a non-canonical tandem base-pair (i. e. two cWW pairs) in the 3′-UTR of poliovirus-like enteroviruses49. Remarkably, this loop is conserved but replacement of the C · U and U · U pairs by a C · G and U · A pairs, respectively, had no effect on virus viability and growth in cell culture. This is in contrast with viroids: replacement of IL1 by two A · U pairs diminished replication efficiency and abolished systemic infection50 (see below and Supplementary Table S4).
The intron 1 of human cellular nucleic acid-binding protein (also named zinc finger protein 9) contains a (CCUG)n repeat; expansion of this repeat causes myotonic dystrophy type 251. The repeat folds into a stem-loop with 5′ CU/5′ UC tandem cWW pairs flanked by GC pairs that binds and inactivates a splicing regulator52. This symmetric tandem mismatch, however, occurs only in CLVd JF446928.
IL2
RNA Bricks found an internal loop with some similarity to IL2 in PDB structures of 16S ribosomal RNA; for an example see Fig. 6a. The internal loop has the sequence C934/1381UCC1383 with a 3′ closing A:U pair as in PSTVd (Fig. 4) but with a 5′ G:C pair. C934 is bulged out from the motif and makes a tWW pair with A938:U1345 (see motif IL_40845.1 in the RNA 3D Motif Atlas)53. C1382 and C1383 pair with C936 and A935 in cWS conformation, respectively. U1381 pairs with G1379 in cHS conformation as annotated by x3dna-dssr; rnaview strangely determines this interaction as a cSW pair. A protein contact seems not relevant for this motif. The sequence of this loop is highly conserved according to an alignment of bacterial 16S ribosomal RNAs54; a corresponding logo is shown in Fig. 6b. The most variant loop nucleotide obviously is C1383 (Fig. 6b and c) that is replaced by an U in ~3% of sequences; this replacement did not occur in IL2.
HP
Hairpin loops with the four nucleotides and closing base pair common in PSTVd’s HP (Fig. 5a) are present in the histone mRNA stem-loop (PDB entries 1JU755 and 1KKS56), which binds to the stem-loop binding protein (SLBP) and the histone mRNA 3′-exonuclease 1 (Exo) for 3′ end processing. These hairpin structures—without corresponding proteins—were determined by NMR spectroscopy. An overlay of both hairpin loops is shown in Fig. 7a; for this overlay, the terminal C1:G8 pairs were matched in chimera. Note the stackings of U2, U3 and U4, and of U5 on A7 in 1JU7 (red), while U3, U5 and A7 stack in 1KKS (cyan). In 1JU7 the C6 is bulged out to the major groove, whereas in 1KKS the C6 is bulged out to the minor groove. According to FRAbase, further PDB entries (1ZBH, Cheng & Patel, unpublished; 4L8R and 4QOZ)57 are similar to HP; each of these contains two of the histone stem-loops bound by protein (for example see Fig. 7b). These loop structures are very close, but not identical to 1JU7 and 1KKS, respectively, due to the interactions with protein, like stacking of Tyr144(Exo)/U3 and His195(Exo)/C6 and numerous hydrogen bondings between protein side chains and RNA backbone and bases. The sequence of the histone mRNA stem-loop is highly conserved (Fig. 7c,d), which might be forced by interactions with the proteins. The most variable position is C6 (24% A, 68% C, 1% G, 7% U) as in HP (C189; Fig. 2). The differences between the sequence logos of the histone mRNA stem-loop and HP might be due to additional requirements of HP like stability in the (−) strand, but might also point to a clear biological difference (see Discussion).
We further searched for HP-loop variants from other viroids. The PDB entry 4V7E58, a model of the Triticum aestivum 80S ribosome based on cryo-electron microscopy with resolution of <5 Å, contains in the 28S rRNA a hairpin loop with sequence 767UUCU770 closed by a C:G pair instead of the A:U pair in a TCDVd (ID GQ169709). The first and last loop nucleotide stabilize the loop by a cWW interaction, and U769 and U770 stack on each other (see Supplementary Fig. S5); both interactions might be independent from the loop-closing pair. The PDB entries 3J3V and 3J3W59, which contain structures of the Bacillus subtilis 50S ribosome subunit based on cryo-electron microscopy and usage of ribosome structures of Escherichia coli and Thermus thermophilus as templates, show in the 23S rRNA a hairpin loop, similar to that of CEVd (see Supplementary Fig. S3b), that is involved in a kissing loop interaction (Fig. 8). Most interactions in the rRNA loop are due to the kissing interactions; however, the last loop nucleotide, in the rRNA as well as in the predicted CEVd loop, makes a cSH interaction with the 3′ nucleotide of the loop-closing basepair.
Synthetic mutants in the TR domain
A synthetic mutant of HP and its progeny
Hammond60 described a mutant PSTVd construct R+ (with the replacement UU177 → GAA in HP; see Supplementary Table S2) that was not infectious as transcribed RNA in contrast to the wild-type (WT) variant. After Agrobacterium tumefaciens-mediated inoculation of tomato plants with a corresponding plasmid containing a R+ cDNA insertion in (+) orientation, viroid progeny with secondary mutations appeared in galls and roots, and in 10% of infected plants also in leaves. The only variant (1 in Supplementary Table S2) appearing in leaves had, in comparison to WT, an inversion of the loop closing pair from U176:A181 in the WT to A:U, similar to the already mentioned TCDVd variant (Fig. 5b), and an additional U in the HP loop. The original variant R+, which was never recovered, and a further evolved variant (2 in see Supplementary Table S2) might be less viable due to a possible structural rearrangement influencing IL2 and the RY motif and thus did not point to critical loop nucleotides. The loop-closing pair and the loop size, however, seemed uncritical for replication and trafficking.
Binding affinity and infectivity of IL2 mutants
Gozmanova et al.12 analyzed the binding of a Virp1 fragment to mutants of stem-loops, which were similar to the hairpin depicted in Fig. 1b. That is, their TR fragment contained both RY motifs, but only the terminal motif (as in Fig. 1c) was mutated. Infectivity of longer-than-unit-length viroid transcripts and relative binding affinity of stem-loops are given in Supplementary Table S3. From these data and further biophysical experiments, the authors concluded that the binding to the terminal RY motif was about fivefold stronger than to the internal motif.
Rosetta modelling of the synthetic IL2 mutants, which include single nucleotide replacements or a reverse complement of the loop sequence, predicted that these mutants possess at least one non-WC interaction different from but also additional to WT. Both modified interaction types might lead to an increased stability, rigidity and/or bending of the mutant loops, explaining the reduced binding of mutants to Virp1.
Inhibition of replication and trafficking by mutations in the TR domain
Zhong et al.50 analyzed PSTVd loop motifs with respect to replication in protoplasts and to (replication plus) trafficking in plants. A replacement of the cWW pairs of IL1 by standard A · U WC pairs reduced replication efficiency to 29% of WT and abolished systemic infection. This experimental result fits to the findings from Rosetta modelling: either the lower thermodynamic stability of the WT loop is critical for replication competence, or the cWW loop pairs—in comparison to a loop without pairs—“protect” against endoribonucleases, or the backbone contraction in the WT loop is critical for protein binding. A replacement of the IL2 loop by three standard WC pairs reduced replication efficiency to 46% of WT and abolished systemic infection. A mutation of UU176 → AA, which opens at least the HP-closing U · A pair, reduced replication efficiency to 53% of WT and abolished systemic infection. An additional mutation, which occurred in one plant, restored the loop-closing pair by an A · U pair that allowed this PSTVd mutant for systemic infection. This analysis points to a lower importance of loops in the TR domain for replication than loops in the TL and central domain but shows that these are most critical for trafficking50.
Discussion
In the following we will discuss selected features from the predictions and compare several results from Rosetta modelling with findings from the structure databases.
Relevance of the two binding sites in TR
Figure 1b only shows the terminal Virp1 binding site with pairing of the R to Y motif, whereas the second, internal binding site is masked by different pairings. A structure with both Virp1 binding sites12, is inferior to the one shown here by about 10 kJ/mol (mean ΔΔG 37°C of 234 PSTVd variants) or a ratio of both structure ensembles (with two versus one perfect binding site) of about 50. This fits roughly to the different binding of Virp1 to both sites12. However, only five of the 234 PSTVd variants miss the internal RY motif (ACs HQ452399–401, HQ452411, HQ452412)61, which points to the in vivo importance of a twofold RY motif.
IL1 is closed by two cWW pairs
The majority of IL1 is a 2 × 2 C · U forming cWW pairs; alternatives are a single cWW plus a standard WC pair (for example 5′ CA/UU 3′, 5′ GA/GC 3′). These interactions fit to low BzCN reactivity of IL162. A further “loop” stabilization by two A · U pairs with standard A-RNA geometry is prohibitive for viroid replication and abolishes trafficking50.
Base-pairing in the asymmetric IL2 loop
According to the Rosetta model, only U186 is bulged out while the three other nucleotides are paired. In contrast, the C171 is bulged out in the 16S rRNA loop and makes contact to a distant base pair. In both cases, the logos for IL2 (Fig. 4c) and for the rRNA loop (Fig. 6b and c) do not point to compatible mutations of each motif. This fits to the higher degree of sequence conservation of IL2 than of IL1.
The HP structure allows for interaction despite its stability
According to DeJong et al.55, the “stacking of loop nucleotides [in the histone mRNA stem-loop that is quite similar to HP] has the net effect of extending the helix by an additional base pair step”, which accounts for the relatively high thermodynamic stability of HP. However, the mRNA hairpin still allows for interaction with an RNA binding protein (Fig. 7) or for a kissing-loop interaction (Fig. 8).
In our opinion, however, the similarity between HP and histone mRNA stem-loop should not be taken as a hint towards biologically relevant interactions of viroid RNA with SLBP and Exo: (i) sequences similar to SLBP are not found by BLAST in higher plants; (ii) sequences similar to Exo are present in plant genomes, but these miss the SAP domain responsible for RNA binding; and (iii) histone 3′ end processing is believed to be restricted to metazoa and green algae63, 64. Thus the detected similarity might be a pure consequence of the recurrence of evolutionary unrelated RNA motifs65.
This work was inspired by the numerous tools for 3D RNA structure prediction that all promise to be manageable by non-experts. Indeed, our results point to an overall TR structure with a nice combination of rigidity and flexibility allowing the TR for binding of proteins. We have, however, not found a reason for the proposed TFIIIA binding to the TR region. Taken together, the predicted structure is encouraging for further exploration by mutational analysis and experimental structure determination.
Methods
Viroid sequences and consensus structures
For each of the 10 species of genus Pospiviroid 4 a single sequence was used to search with nucleotide BLAST for somewhat similar sequences in GenBank. For sequences from each species, a preliminary alignment was produced with MAFFT/G-INS-i66. According to these alignments, partial sequences were removed and the remaining sequences were adjusted for same strand orientation and similar start/end points. After a second alignment, redundant sequences were removed; that is, sequences from each genus had to differ from each other by at least one mutation. The final alignments were optimized in ConStruct67 at 37 °C, excluding lonely base pairs, and drawn with R2R68. An example is the consensus structure of PSTVd (Fig. 1a) which was based on 234 unique sequences. From the PSTVd alignment, the TR domain was cut out, redundant sequences were removed, aligned with MAFFT/X-INS-i69, and optimized in ConStruct (Fig. 1b). From all individual alignments, the range of the terminal stem-loop of three helices was cut out, pooled, redundant sequences were removed, and aligned with MAFFT/X-INS-i. This alignment is shown in Supplementary Fig. S2, the corresponding R2R drawing in Fig. 1c.
Rosetta modelling
We used the Rosetta web server (ROSIE (Rosetta Online Server that Includes Everyone)/FARFAR (Fragment Assembly of RNA with Full Atom Refinement)/RNA De Novo) to predict 3D structures33. FARFAR34 assembles RNA fragments with length ≤3 nucleotides known from x-ray structures; these fragment sequences match partial sequences of the target RNA. The assembly is a Monte Carlo (MC) process guided by a knowledge-based energy function. Resulting structure models are then refined in an all-atom potential that is thought to discriminate native-like from non-native conformations. We used the following options: full run; variation of bond lengths and angles; optimization after fragment assembly; allowance for bulges; usage of updated (2012) force field; 1,000 structures; 10,000 MC cycles. From the predicted 3D structure(s), we extracted information on types of base pairs and further interactions by rnaview70 and x3dna-dssr71.
RNAredesign45 samples the possible sequence space that stabilizes a given 3D structure with a fixed structure backbone.
Database searches
The databases RNABricks35 and FRAbase36 store short RNA motifs extracted from experimentally determined RNA 3D structures. Both databases allow a user to search for sequences similar to a query. Additionally, RNABricks includes a 3D search engine; as input we used 3D models predicted by ROSIE.
Further software
chimera v1.10.172, PyMOL (https://www.pymol.org/) with nuccyl (http://www.biosci.ki.se/groups/ljo/software/nuccyl.html), and RasMol v2.7.573 were used to visualize 3D structures.
Electronic supplementary material
Acknowledgements
I thank Prof. Mina Tsagris (Dep. Biology, Voutes University Campus, Heraklion, Crete, Greece) who initiated the project.
Competing Interests
The authors declare that they have no competing interests.
Footnotes
Electronic supplementary material
Supplementary information accompanies this paper at doi:10.1038/s41598-017-00764-x
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Palukaitis P. What has been happening with viroids? Virus Genes. 2014;49:175–184. doi: 10.1007/s11262-014-1110-8. [DOI] [PubMed] [Google Scholar]
- 2.Navarro B, et al. Viroids: How to infect a host and cause disease without encoding proteins. Biochimie. 2012;94:1474–1480. doi: 10.1016/j.biochi.2012.02.020. [DOI] [PubMed] [Google Scholar]
- 3.Flores R, et al. Rolling-circle replication of viroids, viroid-like satellite RNAs and hepatitis delta virus: variations on a theme. RNA Biol. 2011;8:200–206. doi: 10.4161/rna.8.2.14238. [DOI] [PubMed] [Google Scholar]
- 4.Di Serio F, et al. Current status of viroid taxonomy. Arch. Virol. 2014;159:3467–3478. doi: 10.1007/s00705-014-2200-6. [DOI] [PubMed] [Google Scholar]
- 5.Schindler I-M, Mühlbach H-P. Involvement of nuclear DNA-dependent RNA polymerases in potato spindle tuber viroid replication: a reevaluation. Plant Sci. 1992;84:221–229. doi: 10.1016/0168-9452(92)90138-C. [DOI] [Google Scholar]
- 6.Navarro J, Vera A, Flores R. A chloroplastic RNA polymerase resistant to tagetitoxin is involved in replication of avocado sunblotch viroid. Virology. 2000;268:218–225. doi: 10.1006/viro.1999.0161. [DOI] [PubMed] [Google Scholar]
- 7.Nohales M-Á, Flores R, Darós J. Viroid RNA redirects host DNA ligase 1 to act as an RNA ligase. P. Natl. Acad. Sci. USA. 2012;109:13805–13810. doi: 10.1073/pnas.1206187109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Nohales M-Á, Molina-Serrano D, Flores R, Darós J. Involvement of the chloroplastic isoform of tRNA ligase in the replication of viroids belonging to the family Avsunviroidae. J. Virol. 2012;86:8269–8276. doi: 10.1128/JVI.00629-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Martínez de Alba A, Sägesser R, Tabler M, Tsagris M. A bromodomain-containing protein from tomato specifically binds potato spindle tuber viroid RNA in vitro and in vivo. J. Virol. 2003;77:9685–9694. doi: 10.1128/JVI.77.17.9685-9694.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Martínez de Alba, A. Isolation and characterisation of viroid-binding proteins. Thesis, Universidad del Pais Vasco, Bilbao, Spain (2000).
- 11.Maniataki E, Martínez de Alba A, Sägesser R, Tabler M, Tsagris M. Viroid RNA systemic spread may depend on the interaction of a 71-nucleotide bulged hairpin with the host protein VirP1. RNA. 2003;9:346–354. doi: 10.1261/rna.2162203. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Gozmanova M, Denti M, Minkov I, Tsagris M, Tabler M. Characterization of the RNA motif responsible for the specific interaction of potato spindle tuber viroid RNA (PSTVd) and the tomato protein Virp1. Nucleic Acids Res. 2003;31:5534–5543. doi: 10.1093/nar/gkg777. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Kalantidis K, et al. Virp1 is a host protein with a major role in Potato spindle tuber viroid infection in Nicotiana plants. J. Virol. 2007;81:12872–12880. doi: 10.1128/JVI.00974-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Chaturvedi S, Kalantidis K, Rao A. A bromodomain-containing host protein mediates the nuclear importation of a satellite RNA of Cucumber mosaic virus. J. Virol. 2013;88:1890–1896. doi: 10.1128/JVI.03082-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Wang Y, et al. A land plant-specific transcription factor directly enhances transcription of a pathogenic noncoding RNA template by DNA-dependent RNA polymerase II. Plant Cell. 2016;28:1094–1107. doi: 10.1105/tpc.16.00100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Wu S, Chiang C. The double bromodomain-containing chromatin adaptor Brd4 and transcriptional regulation. J. Biol. Chem. 2007;282:13141–13145. doi: 10.1074/jbc.R700001200. [DOI] [PubMed] [Google Scholar]
- 17.Sanchez R, Zhou M. The role of human bromodomains in chromatin biology and gene transcription. Curr. Opin. Drug Discov. Devel. 2009;12:659–665. [PMC free article] [PubMed] [Google Scholar]
- 18.Mujtaba S, Zeng L, Zhou M. Structure and acetyl-lysine recognition of the bromodomain. Oncogene. 2007;26:5521–5527. doi: 10.1038/sj.onc.1210618. [DOI] [PubMed] [Google Scholar]
- 19.Rao R, Thelen J, Miernyk J. In silico analysis of protein Lys-Nε-acetylation in plants. Front Plant Sci. 2014;5:381. doi: 10.3389/fpls.2014.00381. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Lin Y, et al. Solution structure of the extraterminal domain of the bromodomain-containing protein BRD4. Protein Sci. 2008;17:2174–2179. doi: 10.1110/ps.037580.108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Riesner D, Kaper J, Randles J. Stiffness of viroids and viroid-like RNA in solution. Nucleic Acids Res. 1982;10:5587–5598. doi: 10.1093/nar/10.18.5587. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Dingley A, Steger G, Esters B, Riesner D, Grzesiek S. Structural characterization of the 69 nucleotide potato spindle tuber viroid left-terminal domain by NMR and thermodynamic analysis. J. Mol. Biol. 2003;334:751–767. doi: 10.1016/j.jmb.2003.10.015. [DOI] [PubMed] [Google Scholar]
- 23.Takeda R, Petrov A, Leontis N, Ding B. A three-dimensional RNA motif in Potato spindle tuber viroid mediates trafficking from palisade mesophyll to spongy mesophyll in Nicotiana benthamiana. Plant Cell. 2011;23:258–272. doi: 10.1105/tpc.110.081414. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Zhong X, Tao X, Stombaugh J, Leontis N, Ding B. Tertiary structure and function of an RNA motif required for plant vascular entry to initiate systemic trafficking. EMBO J. 2007;26:3836–3846. doi: 10.1038/sj.emboj.7601812. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Branch A, Benenfeld B, Robertson H. Ultraviolet light-induced crosslinking reveals a unique region of local tertiary structure in potato spindle tuber viroid and HeLa 5 S RNA. P. Natl. Acad. Sci. USA. 1985;82:6590–6594. doi: 10.1073/pnas.82.19.6590. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Wang Y, Zhong X, Itaya A, Ding B. Evidence for the existence of the loop E motif of Potato spindle tuber viroid in vivo. J. Virol. 2007;81:2074–2077. doi: 10.1128/JVI.01781-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Eiras M, Kitajima E, Flores R, Darós J. Existence in vivo of the loop E motif in potato spindle tuber viroid RNA. Arch. Virol. 2007;152:1389–1393. doi: 10.1007/s00705-007-0952-y. [DOI] [PubMed] [Google Scholar]
- 28.Zhong X, et al. Tertiary structural and functional analyses of a viroid RNA motif by isostericity matrix and mutagenesis reveal its essential role in replication. J. Virol. 2006;80:8566–8581. doi: 10.1128/JVI.00837-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Baumstark T, Schröder A, Riesner D. Viroid processing: switch from cleavage to ligation is driven by a change from a tetraloop to a loop E conformation. EMBO J. 1997;16:599–610. doi: 10.1093/emboj/16.3.599. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Schrader O, Baumstark T, Riesner D. A mini-RNA containing the tetraloop, wobble-pair and loop E motifs of the central conserved region of potato spindle tuber viroid is processed into a minicircle. Nucleic Acids Res. 2003;31:988–998. doi: 10.1093/nar/gkg193. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Gas M, Hernández C, Flores R, Daròs J. Processing of nuclear viroids in vivo: an interplay between RNA conformations. PLoS Pathog. 2007;3:e182. doi: 10.1371/journal.ppat.0030182. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Kladwang W, VanLang C, Cordero P, Das R. A two-dimensional mutate-and-map strategy for non-coding RNA structure. Nat. Chem. 2011;3:954–962. doi: 10.1038/nchem.1176. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Lyskov S, et al. Serverification of molecular modeling applications: the Rosetta Online Server that Includes Everyone (ROSIE) PLoS One. 2013;8:e63906. doi: 10.1371/journal.pone.0063906. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Das R, Karanicolas J, Baker D. Atomic accuracy in predicting and designing noncanonical RNA structure. Nat. Methods. 2010;7:291–294. doi: 10.1038/nmeth.1433. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Chojnowski G, Walen T, Bujnicki J. RNA Bricks–a database of RNA 3D motifs and their interactions. Nucleic Acids Res. 2013;42:D123–31. doi: 10.1093/nar/gkt1084. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Popenda M, et al. RNA FRABASE 2.0: an advanced web-accessible database with the capacity to search the three-dimensional fragments within RNA structures. BMC Bioinformatics. 2010;11:231. doi: 10.1186/1471-2105-11-231. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Miao, Z. et al. RNA-Puzzles Round III: 3D RNA structure prediction of five riboswitches and one ribozyme. RNA, doi:10.1261/rna.060368.116 (2017). [DOI] [PMC free article] [PubMed]
- 38.Miao Z, et al. RNA-Puzzles Round II: assessment of RNA structure prediction programs applied to three large RNA structures. RNA. 2015;21:1066–1084. doi: 10.1261/rna.049502.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Cruz J, et al. RNA-Puzzles: a CASP-like evaluation of RNA three-dimensional structure prediction. RNA. 2012;18:610–625. doi: 10.1261/rna.031054.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Dawson W, Bujnicki J. Computational modeling of RNA 3D structures and interactions. Curr. Opin. Struct. Biol. 2015;37:22–28. doi: 10.1016/j.sbi.2015.11.007. [DOI] [PubMed] [Google Scholar]
- 41.Magnus M, et al. Computational modeling of RNA 3D structures, with the aid of experimental restraints. RNA Biol. 2014;11:522–536. doi: 10.4161/rna.28826. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Laing C, Schlick T. Computational approaches to 3D modeling of RNA. J Phys Condens Matter. 2010;22:283101. doi: 10.1088/0953-8984/22/28/283101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Leontis N, Stombaugh J, Westhof E. The non-Watson-Crick base pairs and their associated isostericity matrices. Nucleic Acids Res. 2002;30:3497–3531. doi: 10.1093/nar/gkf481. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Stombaugh J, Zirbel C, Westhof E, Leontis N. Frequency and isostericity of RNA base pairs. Nucleic Acids Res. 2009;37:2294–2312. doi: 10.1093/nar/gkp011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Yesselman J, Das R. RNA-Redesign: a web server for fixed-backbone 3D design of RNA. Nucleic Acids Res. 2015;43:W498–501. doi: 10.1093/nar/gkv465. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Gorodkin J, Heyer L, Brunak S, Stormo G. Displaying the information contents of structural RNA alignments: the structure logos. Comp. Appl. Biosci./Bioinformatics. 1997;13:583–586. doi: 10.1093/bioinformatics/13.6.583. [DOI] [PubMed] [Google Scholar]
- 47.Wu M, McDowell J, Turner D. A periodic table of symmetric tandem mismatches in RNA. Biochemistry. 1995;34:3204–3211. doi: 10.1021/bi00010a009. [DOI] [PubMed] [Google Scholar]
- 48.Xia T, McDowell J, Turner D. Thermodynamics of nonsymmetric tandem mismatches adjacent to G · C base pairs in RNA. Biochemistry. 1997;36:12486–12497. doi: 10.1021/bi971069v. [DOI] [PubMed] [Google Scholar]
- 49.Lescrinier E, et al. Structure of the pyrimidine-rich internal loop in the poliovirus 3′-UTR: the importance of maintaining pseudo-2-fold symmetry in RNA helices containing two adjacent non-canonical base-pairs. J. Mol. Biol. 2003;331:759–769. doi: 10.1016/S0022-2836(03)00787-3. [DOI] [PubMed] [Google Scholar]
- 50.Zhong X, Archual A, Amin A, Ding B. A genomic map of viroid RNA motifs critical for replication and systemic trafficking. Plant Cell. 2008;20:35–47. doi: 10.1105/tpc.107.056606. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Liquori C, et al. Myotonic dystrophy type 2 caused by a CCTG expansion in intron 1 of ZNF9. Science. 2001;293:864–867. doi: 10.1126/science.1062125. [DOI] [PubMed] [Google Scholar]
- 52.Childs-Disney J, et al. Structure of the myotonic dystrophy type 2 RNA and designed small molecules that reduce toxicity. ACS Chem. Biol. 2013;9:538–550. doi: 10.1021/cb4007387. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Zirbel C, et al. Identifying novel sequence variants of RNA 3D motifs. Nucleic Acids Res. 2015;43:7504–7520. doi: 10.1093/nar/gkv651. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Cannone J, et al. The comparative RNA web (CRW) site: an online database of comparative sequence and structure information for ribosomal, intron, and other RNAs. BMC Bioinformatics. 2002;3:2. doi: 10.1186/1471-2105-3-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.DeJong E, Marzluff W, Nikonowicz E. NMR structure and dynamics of the RNA-binding site for the histone mRNA stem-loop binding protein. RNA. 2002;8:83–96. doi: 10.1017/S1355838202013869. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Zanier K, et al. Structure of the histone mRNA hairpin required for cell cycle regulation of histone gene expression. RNA. 2002;8:29–46. doi: 10.1017/S1355838202014061. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Tan D, Marzluff W, Dominski Z, Tong L. Structure of histone mRNA stem-loop, human stem-loop binding protein, and 3′hExo ternary complex. Science. 2013;339:318–321. doi: 10.1126/science.1228705. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Gogala M, et al. Structures of the Sec61 complex engaged in nascent peptide translocation or membrane insertion. Nature. 2014;506:107–110. doi: 10.1038/nature12950. [DOI] [PubMed] [Google Scholar]
- 59.Li N, et al. Cryo-EM structures of the late-stage assembly intermediates of the bacterial 50S ribosomal subunit. Nucleic Acids Res. 2013;41:7073–7083. doi: 10.1093/nar/gkt423. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Hammond R. Agrobacterium-mediated inoculation of PSTVd cDNAs onto tomato reveals the biological effect of apparently lethal mutations. Virology. 1994;201:36–45. doi: 10.1006/viro.1994.1263. [DOI] [PubMed] [Google Scholar]
- 61.Luigi M, Luison D, Tomassoli L, Faggioli F. Natural spread and molecular analysis of pospiviroids infecting ornamentals in Italy. J. Plant Path. 2011;93:491–495. [Google Scholar]
- 62.Giguère T, Adkar-Purushothama C, Perreault J. Comprehensive secondary structure elucidation of four genera of the family Pospiviroidae. PLoS One. 2014;9:e98655. doi: 10.1371/journal.pone.0098655. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Dávila López M, Samuelsson T. Early evolution of histone mRNA 3′ end processing. RNA. 2007;14:1–10. doi: 10.1261/rna.782308. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Marzluff W, Wagner E, Duronio R. Metabolism and regulation of canonical histone mRNAs: life without a poly(A) tail. Nat. Rev. Genet. 2008;9:843–854. doi: 10.1038/nrg2438. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Lescoute A, Leontis N, Massire C, Westhof E. Recurrent structural RNA motifs, Isostericity Matrices and sequence alignments. Nucleic Acids Res. 2005;33:2395–2409. doi: 10.1093/nar/gki535. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Katoh K, Kuma K, Miyata T, Toh H. Improvement in the accuracy of multiple sequence alignment program MAFFT. Genome Inform. Ser. 2005;16:22–33. [PubMed] [Google Scholar]
- 67.Wilm A, Linnenbrink K, Steger G. ConStruct: Improved construction of RNA consensus structures. BMC Bioinformatics. 2008;9:219. doi: 10.1186/1471-2105-9-219. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Weinberg Z, Breaker R. R2R – software to speed the depiction of aesthetic consensus RNA secondary structures. BMC Bioinformatics. 2011;12:3. doi: 10.1186/1471-2105-12-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Katoh K, Toh H. Improved accuracy of multiple ncRNA alignment by incorporating structural information into a MAFFT-based framework. BMC Bioinformatics. 2008;9:212. doi: 10.1186/1471-2105-9-212. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Yang H, et al. Tools for the automatic identification and classification of RNA base pairs. Nucleic Acids Res. 2003;31:3450–3460. doi: 10.1093/nar/gkg529. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Lu X, Olson W. 3DNA: a versatile, integrated software system for the analysis, rebuilding and visualization of three-dimensional nucleic-acid structures. Nat. Protoc. 2008;3:1213–1227. doi: 10.1038/nprot.2008.104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Pettersen E, et al. UCSF Chimera–a visualization system for exploratory research and analysis. J. Comput. Chem. 2004;25:1605–1612. doi: 10.1002/jcc.20084. [DOI] [PubMed] [Google Scholar]
- 73.Bernstein H. Recent changes to RasMol, recombining the variants. Trends Biochem. Sci. 2000;25:453–455. doi: 10.1016/S0968-0004(00)01606-6. [DOI] [PubMed] [Google Scholar]
- 74.Keese P, Symons R. Domains in viroids: Evidence of intermolecular RNA rearrangement and their contribution to viroid evolution. P. Natl. Acad. Sci. USA. 1985;82:4582–4586. doi: 10.1073/pnas.82.14.4582. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.López-Carrasco, A. & Flores, R. Dissecting the secondary structure of the circular RNA of a nuclear viroid in vivo: a “naked” rod-like conformation similar but not identical to that observed in vitro. RNA Biol. doi:10.1080/15476286.2016.1223005 (2016). [DOI] [PMC free article] [PubMed]
- 76.Gast F, Kempe D, Spieker R, Sänger H. Secondary structure probing of potato spindle tuber viroid (PSTVd) and sequence comparison with other small pathogenic RNA replicons provides evidence for central non-canonical base-pairs, large A-rich loops, and a terminal branch. J. Mol. Biol. 1996;262:652–670. doi: 10.1006/jmbi.1996.0543. [DOI] [PubMed] [Google Scholar]
- 77.Leontis N, Westhof E. Geometric nomenclature and classification of RNA base pairs. RNA. 2001;7:499–512. doi: 10.1017/S1355838201002515. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Carter A, et al. Functional insights from the structure of the 30S ribosomal subunit and its interactions with antibiotics. Nature. 2000;407:340–348. doi: 10.1038/35030019. [DOI] [PubMed] [Google Scholar]
- 79.Burge S, et al. Rfam 11.0: 10 years of RNA families. Nucleic Acids Res. 2012;41:D226–32. doi: 10.1093/nar/gks1005. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.