Abstract
In this review, we describe the creation of the Nucleic Acid Database (NDB) at Rutgers University and how it became a testbed for the current infrastructure of the RCSB Protein Data Bank. We describe some of the special features of the NDB and how it has been used to enable research. Plans for the next phase as the Nucleic Acid Knowledgebase (NAKB) are summarized.
Keywords: nucleic acid structures, nucleic acid conformation, biological structure database, DNA, RNA, validation standards
1. Introduction
The first single crystal structures of nucleic acids were determined in the 1970s, almost twenty years after the model of the DNA double helix based on fiber data was published [1,2]. Short fragments of RNA yielded the first atomic-level views of the double helix and demonstrated conformational flexibility [3,4,5]. These structures were archived as small molecules in the Cambridge Crystallographic Database (CSD) [6]. The structure of tRNA, determined in 1974 [7,8,9], showed that RNA can fold into a compact structure and demonstrated the importance of tertiary interactions. As DNA synthesis became possible, structures of the DNA double helix with predefined sequences were determined. The first structures were left-handed Z-form DNA fragments [10], and in 1981, the first single crystal structure of a full turn of B-form DNA was published [11]. The tRNA structures and larger nucleic acid fragments were archived in the Protein Data Bank (PDB [12]). By 1990, there were nearly 100 publicly released nucleic acid structures, thus allowing analyses of sequence-dependent features, hydration patterns, and ligand interactions.
During the late 1970s and 1980s, several faculty members in the Chemistry Department at Rutgers University focused their research on nucleic acids. Ken Breslauer worked on the macroscopic properties of nucleic acids using calorimetric approaches [13,14,15,16]; these works, seminal for the understanding of thermodynamics of DNA, have continued to this day [17,18,19,20]. Roger Jones developed new methods to synthesize DNA [21]. Jerry Manning developed the counterion condensation theory to understand DNA folding [22], and continued this work in collaboration with the Breslauer group [23]. Wilma Olson performed detailed analyses of the structure of DNA [24]. During that period, Helen Berman carried out nucleic acid crystallography research at the Institute for Cancer Research in Philadelphia and had close interactions with the Rutgers group. In 1989, she joined the Chemistry faculty at Rutgers.
The setting at Rutgers was ideal for collaborative studies using both experimental and computational approaches to investigate nucleic acid structure. It was necessary to have a resource that contained the structural information which resided in the CSD, in the PDB, or in the laboratories of individual researchers to facilitate these efforts. In collaboration with David Beveridge, with whom Berman was collaborating on computational analyses of nucleic acid hydration, Olson and Berman proposed to create the Nucleic Acid Database (NDB). In the early 1990s, funding was received from the National Science Foundation to establish “A Comprehensive Database of the Three-Dimensional Structures of Nucleic Acids”. The goal was to create a searchable database that would integrate information from several sources and make a variety of reports, thus enabling research on nucleic acid structure.
2. Development of the Nucleic Acid Database
The first step in the development of the NDB was to collect and curate the structural data [25]. Coordinates were accessed from the CSD and the PDB. Each structure and experiment were carefully reviewed to create appropriate annotations beyond what was available from each resource. Rather than working directly with the flat files maintained by the PDB, the NDB imported the parsed data files into a relational database management system (DBMS). Sybase [26] was chosen as the DBMS in large part because it was being used by Genbank [27,28]. A query system called NDBquery was put into place. In the early years, distribution was accomplished via FTP and a system called Gopher [29]. By 1995, a web server was set up, which generated a modest amount of activity to access and analyze the 350 structures represented in the NDB. The NDB was actively involved in the development of mmCIF, whose data model is compatible with a relational DBMS. By 1996, mmCIF [30] became the master format for the NDB. The software that was developed and the experience gained using this data representation set the stage for the management of the Protein Data Bank using mmCIF as the master format by the Research Collaboratory for Structural Bioinformatics (RCSB) beginning in 1998.
The NDB also became a driver for the creation of geometrical standards for nucleic acid structures. Careful analysis of high-resolution structures from CSD permitted the calculation of standard reference bond distances and angles for the bases, sugars, and phosphates of nucleic acids [31,32]. Using these values, Parkinson et al. [33] created new parameters that enabled improved refinement of nucleic acid-containing crystal structures against their experimental data. Those standards were widely used. In 1998, the NDB helped organize a conference whose outcome was the standard coordinate frame definition for nucleic acid bases [34]. This standard became widely adopted by researchers studying nucleic acid base morphology.
3. Features of the NDB
In addition to facilitating access to primary data for nucleic acid structures, the NDB provides tables of derived features, such as classifications of base pairing topologies [35], backbone torsion angles, and conformational and base pair classifications [36,37].
The NDB also offers different types of data visualization and presentation. The most important is the NDB Atlas page (Figure 1), which gives summary information about the structure, visualizations of the crystal asymmetric unit, the biological unit, unit cells, and for RNA structures; it provides a view that combines the secondary and tertiary structural features. Links to other resources are also provided.
The functionality of the NDB and its query engine was first and foremost driven by research projects on the nucleic acid structural and computational biologists. Careful attention was given to the quality and uniformity of the metadata so that it would be possible to use Boolean logic to create queries; individual questions could be made into logical constructs joined by logical AND, OR, and NOT. This requirement represented a challenge for building a robust system of precisely defined terms incorporated into a formal computer-readable language; mmCIF was that dictionary.
The NDB website was designed so that the user could select structures with features of interest and then use those structures for further analysis, e.g., through the creation of detailed tabular or graphical reports. Soon after the first functional version of the NDB was available, we started to use its potential to study the geometrical features of nucleic acids. The original NDB reporting capability allowed the user to obtain tabular reports of various properties of the selected nucleic acid structures from basic information about the publication or refinement parameters and graphical reports of selected geometric features such as bond distances (Figure 2) or torsion angles (Figure 3). Once funding for the NDB became limited in the 2000s, it was not possible to maintain these reporting capabilities.
4. Research Enabled by the NDB
The NDB has been used by many researchers to analyze the structures of nucleic acids. There are over 1100 citations to the original NDB article. The type of research enabled by the NDB includes DNA conformational analyses [39], DNA structure prediction [40], RNA structure prediction [41], analyses of protein-nucleic acid interactions [42,43], and the creation of new specialty databases [44]. In our research, we have used the NDB to study a variety of aspects of nucleic acids. For example, we surveyed A, B, and Z-form double helical DNA structures and used Fourier averaging to determine hydration patterns, e.g., for DNA nitrogenous bases [45]. Both base and later phosphate studies showed sequence and conformation-dependent water position preferences (Figure 4).
The growing volume of available crystal structures with ever growing sequence variability also led us to ask whether conformational properties of various DNA and RNA forms could be better characterized. This task posed new challenges to NDB querying and reporting capabilities. Specific subsets of structures were selected based on sequence, function, or structural features using SQL queries; their properties were reported as text or graphs (Figure 5). Ultimately, we were able to sharpen conformational definitions for established subtypes of A-B-Z forms (Figure 6) [46].
The growing number of nucleic acid structures and the appearance of new forms such as quadruplexes and large-folded RNAs demonstrated the plasticity of nucleic acid molecules. It became clear that the conformational space of nucleic acids is extremely complex and that capturing it would require a concerted understanding of base pairing motifs and the backbone structural variability.
Early analyses showed that backbone conformational variability was fundamentally influenced by flexibility around the O3′–P–O5′ phosphodiester bonds that connected adjacent nucleotide residues, described by torsion angles ζ and α [47]. Our multidimensional statistical analysis, therefore, focused on dinucleotide fragments analyzed in torsion space, taking full advantage of the availability of the NDB and PDB.
In the 2000s, research conducted by several groups concentrated on analysis of RNA backbone flexibility culminated in an RNA Consortium consensus set of dinucleotide conformers [48]. The effort was later complemented by an analogous set of DNA conformers [49] and, ultimately, a comprehensive classification system for dinucleotide fragments covering both DNA and RNA [50]. This classification algorithm provides an automated structural ranking of dinucleotide fragments at two levels of detail: fully geometrical classification into dinucleotide conformational classes (NtC) and a more human-accessible structural alphabet (CANA). The assignment of the CANA and NtC classes makes it possible to study the structural propensities of dinucleotide sequences. For example, analysis of DNA in transcription factors and in histone core particle complexes showed important trends of protein interactions with specific bending associated NtC classes (Figure 7) [49].
NtC assignments have also inspired development of a new validation tool linking the global geometry criterion (closeness of fit to the nearest NtC class) and the quality of fit into electron density (Figure 8) [50]. It offers a simple information-rich graphical representation of the overall quality of nucleic acid structure in the form of a 2D graph.
In an additional effort to understand, classify, and validate nucleic acids, we have developed a procedure similar to Ramachandran analysis for proteins, making use of eta (η) and theta (θ) virtual torsion angles (pseudotorsions) [53,54]. Measured (η,θ) pairs define backbone conformations for each central residue within a trinucleotide. Plots are designed to quickly reveal rare conformations that may need extra checking (Figure 9). A web server was recently set up to investigate the utility of this approach for RNA structures determined using cryoEM (ptp.emdataresource.org) (accessed on 30 March 2022).
5. Current State of Nucleic Acid Structural Biology
When the NDB was established in the early 1990s, most of the nucleic acid structures were small fragments with the exception of tRNA. There were a few structures of protein-nucleic acid complexes, limited to virus capsids with viral genomic RNA or DNA and transcription factors bound to duplex DNA. Molecular machines, such as the ribosome, were yet to be determined. In contrast, there are now more than 14,000 nucleic acid-containing structures in the PDB and NDB (Figure 10). A notable trend is the recent increase in the use of electron microscopy (EM) for structure determination. Protein/DNA complexes are the most abundant, followed by protein/RNA, DNA-only, and RNA-only. In addition to the increase in the number of structures, the structures are very diverse, as shown in Figure 11.
These structures have significantly expanded our knowledge of structure/function relationships and raised the potential of new knowledge from systematic analyses of structure collections. Many different databases and tools have been created to enable specialized analyses of nucleic acid structures. Some have focused on DNA [67], some on RNA [68,69,70,71,72], and some on the interactions between proteins and nucleic acids [73,74]. A systematic long-term analysis of dinucleotides led to a unified RNA + DNA automated classification system [50] available at DNATCO (dnatco.datmos.org) (accessed on 30 March 2022). The NDB (ndbserver.rutgers.edu) (accessed on 30 March 2022) is unique in that all nucleic acid structures and their complexes are contained in a single resource.
6. Going Forward
The NDB is maintained to the extent that new structures and manually curated annotations are added each week, but there is little significant development since its last full funding in 2003. Even so, thousands of users from the Americas, Asia, Europe, and other locations continue to make multiple visits to the NDB website each month. The most heavily visited pages are Advanced Search and DNA and RNA galleries.
In 2018, the collaborative group of scientists managing both the NDB (at Rutgers) and RNAhub services (at Bowling Green State University) proposed to create the Nucleic Acid Knowledge Base (NAKB), with the goal of integrating information already in the NDB with additional sequence, structure, function, and interaction-based annotations for all major classes of NA-containing 3D structures. This new service, which will ultimately replace the NDB, is currently under construction. The NAKB aims to enable users to quickly find and download all structures and metadata relevant to their search topic, whether broad or focused, based on the NDB’s internal curation scheme, computationally generated annotations, and/or external database references for DNA, RNA, mixed NA, and for NA-binding enzymatic, regulatory, and structural proteins. All NA-containing structures in the PDB will be indexed, including structures obtained using Electron Microscopy. The NAKB will be updated weekly.
The NDB has employed manual expert curation collected over three decades to identify major NA secondary structure features (duplex, triplex, and quadruplex) and high-level classifications (e.g., ribosomal RNA or telomeric DNA), as well as interactions with ligands (e.g., minor groove binding) and protein classification [37]. Integrated computationally created annotations have included bond distance, angle, and torsion geometries, base and base-pair morphologies, as well as RNA 3D motifs, interactions (base pair types and parameters, base-to-backbone, and base stacking interactions), and RNA equivalence (3D structure similarity) classes [75,76].
New NAKB content will include equivalence class calculations for all nucleic acid molecule types (RNA, DNA, hybrid nucleic acids), enabling more accurate retrieval for closely related NA structures, analogous to the way that UniProt identifier mapping has improved search capabilities for related proteins in PDB [77]. Computationally derived annotations produced by DSSR software [78], including secondary structure features, sugar pucker type, and pseudo-torsion angles, will be added.
New search capabilities will be developed for specific classes of chemical modifications of nucleotides; nucleic acid 3D structure motifs by their common names, for example, G-Quadruplex, R-loop, Holliday Junction, Sarcin–Ricin, Kink-turn; ribosome functional states, e.g., full or single subunit, translational state, and numbers and positions of bound tRNAs; and deeper classification levels for selected proteins such as transcription factors.
The NAKB website will employ a modern web infrastructure with flexible data representation viewable on phones and tablets as well as desktop computers. For each NA-containing 3D structure, an atlas page will provide a summary overview of annotations as well as access to 1D, 2D, and 3D visualizations, external analysis tools, and file downloads. Mappings to external database links will initially include: PDB, Uniprot, RNACentral, Rfam. External analysis tools will include DNATCO and DNAproDB. Some of the reporting functions that were available in the original NDB so that the types of conformational analysis described earlier will be reenabled.
Acknowledgments
We are grateful to our collaborators for their many contributions over NDB’s three decades of existence.
Author Contributions
Conceptualization, H.M.B.; writing—original draft, review, editing, H.M.B., C.L.L. and B.S. All authors have read and agreed to the published version of the manuscript.
Funding
The early funding of the NDB by the National Science Foundation is gratefully acknowledged. Funding for B.S. is from program Inter excellence of Ministry of Education, Youth, and Sports of the Czech Republic, grant number LTAUSA18197, and institutional support to the Institute of Biotechnology of the Czech Academy of Sciences, grant number RVO 86652036. Funding for C.L.L. is from the National Institutes of Health General Medical Sciences, grant numbers R01 GM079429 and R01 GM085238.
Conflicts of Interest
The authors declare no conflict of interest.
Footnotes
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Watson J.D., Crick F.H.C. A structure for deoxyribose nucleic acid. Nature. 1953;171:737–738. doi: 10.1038/171737a0. [DOI] [PubMed] [Google Scholar]
- 2.Franklin R.E., Gosling R.G. Molecular configuration in sodium thymonucleate. Nature. 1953;171:740–741. doi: 10.1038/171740a0. [DOI] [PubMed] [Google Scholar]
- 3.Sussman J.L., Seeman N.C., Kim S.-H., Berman H.M. The crystal structure of a naturally occurring dinucleotide phosphate uridylyl 3′,5′-adenosine phosphate. models for RNA chain folding. J. Mol. Biol. 1972;66:403–421. doi: 10.1016/0022-2836(72)90423-8. [DOI] [PubMed] [Google Scholar]
- 4.Seeman N.C., Rosenberg J.M., Suddath F.L., Kim J.J.P., Rich A. RNA double helical fragments at atomic resolution: I. The crystal and molecular structure of sodium adenylyl-3′-5′-uridine hexahydrate. J. Mol. Biol. 1976;104:109–144. doi: 10.1016/0022-2836(76)90005-X. [DOI] [PubMed] [Google Scholar]
- 5.Rosenberg J.M., Seeman N.C., Day R.O., Rich A. RNA double helical fragments at atomic resolution: II. The structure of sodium guanylyl-3′,5′-cytidine nonhydrate. J. Mol. Biol. 1976;104:145–167. doi: 10.1016/0022-2836(76)90006-1. [DOI] [PubMed] [Google Scholar]
- 6.Allen F.H., Bellard S., Brice M.D., Cartright B.A., Doubleday A., Higgs H., Hummelink T., Hummelink-Peters B.G., Kennard O., Motherwell W.D.S., et al. The Cambridge Crystallographic Data Centre: Computer-based search, retrieval, analysis and display of information. Acta Crystallogr. 1979;B35:2331–2339. doi: 10.1107/S0567740879009249. [DOI] [Google Scholar]
- 7.Robertus J.D., Ladner J.E., Finch J.T., Rhodes D., Brown R.S., Clark B.F.C., Klug A. Structure of yeast phenylalanine tRNA at 3 A resolution. Nature. 1974;250:546–551. doi: 10.1038/250546a0. [DOI] [PubMed] [Google Scholar]
- 8.Suddath F., Quigley G., McPherson A., Sneden D., Kim J., Kim S., Rich A. Three-dimensional structure of yeast phenylalanine transfer RNA at 3.0 Ångstroms resolution. Nature. 1974;248:20–24. doi: 10.1038/248020a0. [DOI] [PubMed] [Google Scholar]
- 9.Quigley G.J., Seeman N.C., Wang A.H., Suddath F.L., Rich A. Yeast phenylalanine transfer RNA: Atomic coordinates and torsion angles. Nucleic Acids Res. 1975;2:2329–2341. doi: 10.1093/nar/2.12.2329. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Wang A.H.-J., Quigley G.J., Kolpak F.J., Crawford J.L., van Boom J.H., van der Marel G.A., Rich A. Molecular structure of a left-handed double helical DNA fragment at atomic resolution. Nature. 1979;282:680–686. doi: 10.1038/282680a0. [DOI] [PubMed] [Google Scholar]
- 11.Drew H.R., Wing R.M., Takano T., Broka C., Tanaka S., Itakura K., Dickerson R.E. Structure of a B-DNA dodecamer: Conformation and dynamics. Proc. Natl. Acad. Sci. USA. 1981;78:2179–2183. doi: 10.1073/pnas.78.4.2179. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Bernstein F.C., Koetzle T.F., Williams G.J., Meyer E.F., Jr., Brice M.D., Rodgers J.R., Kennard O., Shimanouchi T., Tasumi M. The Protein Data Bank: A computer-based archival file for macromolecular structures. J. Mol. Biol. 1977;112:535–542. doi: 10.1016/S0022-2836(77)80200-3. [DOI] [PubMed] [Google Scholar]
- 13.Breslauer K.J. A calorimetric determination of enthalpies and heat capacities of protonation. J. Chem. Thermodyn. 1979;11:527–530. doi: 10.1016/0021-9614(79)90090-9. [DOI] [Google Scholar]
- 14.Breslauer K.J. Methods for Obtaining Thermodynamic Data on Oligonucleotide Transitions. In: Hinz H., editor. Thermodynamic Data for Biochemistry and Biotechnology. Springer; New York, NY, USA: 1986. pp. 402–427. [Google Scholar]
- 15.Breslauer K.J. A thermodynamic perspective of DNA Bending. Curr. Biol. 1991;1:416–422. doi: 10.1016/0959-440X(91)90041-Q. [DOI] [Google Scholar]
- 16.Breslauer K.J. Extracting Thermodynamic Data From Equilibrium Melting Curves for Oligonucleotide Order-Disorder Transitions. In: Agrawal S., editor. Methods in Molecular Biology, Vol. 26: Protocols for Oligonucleotide Conjugates. Humana Press; Totowa, NJ, USA: 1994. pp. 347–372. Chapter 14. [DOI] [PubMed] [Google Scholar]
- 17.Chalikian T.V., Breslauer K.J. Thermodynamic analysis of biomolecules: A volumetric approach. Curr. Opin. Struct. Biol. 1998;8:657–664. doi: 10.1016/S0959-440X(98)80159-0. [DOI] [PubMed] [Google Scholar]
- 18.Chalikian T.V., Volker J., Plum G.E., Breslauer K.J. A more unified picture for the thermodynamics of nucleic acid duplex melting: A characterization by calorimetric and volumetric techniques. Proc. Natl. Acad. Sci. USA. 1999;96:7853–7858. doi: 10.1073/pnas.96.14.7853. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Klump H.H., Volker J., Breslauer K.J. Energy mapping of the genetic code and genomic domains: Implications for code evolution and molecular Darwinism. Q. Rev. Biophys. 2020;53:e11. doi: 10.1017/S0033583520000098. [DOI] [PubMed] [Google Scholar]
- 20.Volker J., Klump H.H., Breslauer K.J. DNA metastability and biological regulation: Conformational dynamics of metastable omega-DNA bulge loops. J. Am. Chem. Soc. 2007;129:5272–5280. doi: 10.1021/ja070258q. [DOI] [PubMed] [Google Scholar]
- 21.Jones R. Preparation of protected deoxyribonucleosides. In: Gait M.J., editor. Oligonucleotide Synthesis, a Practical Approach. IRL Press; Washington, DC, USA: 1984. pp. 22–34. [Google Scholar]
- 22.Manning G. The molecular theory of polyelectrolyte solutions with applications to the electrostatic properties of polynucleotides. Q. Rev. Biophys. 1978;11:179–246. doi: 10.1017/S0033583500002031. [DOI] [PubMed] [Google Scholar]
- 23.Volker J., Klump H.H., Manning G.S., Breslauer K.J. Counterion association with native and denatured nucleic acids: An experimental approach. J. Mol. Biol. 2001;310:1011–1025. doi: 10.1006/jmbi.2001.4841. [DOI] [PubMed] [Google Scholar]
- 24.Erie D., Sinha N., Olson W., Jones R., Breslauer K. A dumbbell-shaped, double-hairpin structure of DNA: A thermodynamic investigation. Biochemistry. 1987;26:7150–7159. doi: 10.1021/bi00396a042. [DOI] [PubMed] [Google Scholar]
- 25.Berman H.M., Olson W.K., Beveridge D.L., Westbrook J., Gelbin A., Demeny T., Hsieh S.H., Srinivasan A.R., Schneider B. The nucleic acid database. A comprehensive relational database of three-dimensional structures of nucleic acids. Biophys. J. 1992;63:751–759. doi: 10.1016/S0006-3495(92)81649-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Kitakami H., Tateno Y., Gojobori T. Toward unification of taxonomy databases in a distributed computer environment. Proc. Int. Conf. Intell. Syst. Mol. Biol. 1994;2:227–235. [PubMed] [Google Scholar]
- 27.Bilofsky H.S., Burks C., Fickett J.W., Goad W.B., Lewitter F.I., Rindone W.P., Swindell C.D., Tung C.S. The GenBank genetic sequence databank. Nucleic Acids Res. 1986;14:1861–1863. doi: 10.1093/nar/14.1.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Benson D.A., Karsch-Mizrachi I., Lipman D.J., Ostell J., Rapp B.A., Wheeler D.L. GenBank. Nucleic Acids Res. 2000;28:15–18. doi: 10.1093/nar/28.1.15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Parker M. Biological data access through Gopher. Trends Biochem. Sci. 1993;18:485–486. doi: 10.1016/S0968-0004(10)80001-5. [DOI] [PubMed] [Google Scholar]
- 30.Fitzgerald P.M.D., Westbrook J.D., Bourne P.E., McMahon B., Watenpaugh K.D., Berman H.M. 4.5 Macromolecular dictionary (mmCIF) In: Hall S.R., McMahon B., editors. International Tables for Crystallography G. Definition and Exchange of Crystallographic Data. Springer; Dordrecht, The Netherlands: 2005. pp. 295–443. [Google Scholar]
- 31.Clowney L., Jain S.C., Srinivasan A.R., Westbrook J., Olson W.K., Berman H.M. Geometric Parameters in Nucleic Acids: Nitrogenous Bases. J. Am. Chem. Soc. 1996;118:509–518. doi: 10.1021/ja952883d. [DOI] [Google Scholar]
- 32.Gelbin A., Schneider B., Clowney L., Hsieh S.-H., Olson W.K., Berman H.M. Geometric parameters in nucleic acids: Sugar and phosphate constituents. J. Am. Chem. Soc. 1996;118:519–528. doi: 10.1021/ja9528846. [DOI] [Google Scholar]
- 33.Parkinson G., Vojtechovsky J., Clowney L., Brunger A.T., Berman H.M. New parameters for the refinement of nucleic acid-containing structures. Pt 1Acta Cryst. D Biol. Cryst. 1996;52:57–64. doi: 10.1107/S0907444995011115. [DOI] [PubMed] [Google Scholar]
- 34.Olson W.K., Bansal M., Burley S.K., Dickerson R.E., Gerstein M., Harvey S.C., Heinemann U., Lu X.J., Neidle S., Shakked Z., et al. A standard reference frame for the description of nucleic acid base-pair geometry. J. Mol. Biol. 2001;313:229–237. doi: 10.1006/jmbi.2001.4987. [DOI] [PubMed] [Google Scholar]
- 35.Leontis N.B., Westhof E. Geometric nomenclature and classification of RNA base pairs. RNA. 2001;7:499–512. doi: 10.1017/S1355838201002515. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Schneider B., de la Cruz J., Feng Z., Chen L., Dutta S., Persikova I., Westbrook J., Yang H., Young J., Zardecki C., et al. The Nucleic Acid Database. In: Gu J., Bourne P.E., editors. Structural Bioinformatics. 2nd ed. Wiley-Blackwell; Hoboken, NJ, USA: 2009. pp. 305–319. [Google Scholar]
- 37.Coimbatore Narayanan B., Westbrook J., Ghosh S., Petrov A.I., Sweeney B., Zirbel C.L., Leontis N.B., Berman H.M. The Nucleic Acid Database: New features and capabilities. Nucleic Acids Res. 2014;42:D114–D122. doi: 10.1093/nar/gkt980. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Srinivasan A.R., Olson W.K. Yeast tRNAPhe conformation wheels: A novel probe of the monoclinic and orthorhombic models. Nucleic Acids Res. 1980;8:2307–2329. doi: 10.1093/nar/8.10.2307. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Dans P.D., Perez A., Faustino I., Lavery R., Orozco M. Exploring polymorphisms in B-DNA helical conformations. Nucleic Acids Res. 2012;40:10668–10678. doi: 10.1093/nar/gks884. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Gupta A., Kulkarni M., Mukherjee A. Accurate prediction of B-form/A-form DNA conformation propensity from primary sequence: A machine learning and free energy handshake. Patterns. 2021;2:100329. doi: 10.1016/j.patter.2021.100329. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Bayrak C.S., Kim N., Schlick T. Using sequence signatures and kink-turn motifs in knowledge-based statistical potentials for RNA structure prediction. Nucleic Acids Res. 2017;45:5414–5422. doi: 10.1093/nar/gkx045. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Corsi F., Lavery R., Laine E., Carbone A. Multiple protein-DNA interfaces unravelled by evolutionary information, physico-chemical and geometrical properties. PLoS Comput. Biol. 2020;16:e1007624. doi: 10.1371/journal.pcbi.1007624. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Srivastava A., Ahmad S., Gromiha M.M. Deciphering RNA-Recognition Patterns of Intrinsically Disordered Proteins. Int. J. Mol. Sci. 2018;19:1595. doi: 10.3390/ijms19061595. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Sagendorf J.M., Markarian N., Berman H.M., Rohs R. DNAproDB: An expanded database and web-based tool for structural analysis of DNA-protein complexes. Nucleic Acids Res. 2020;48:D277–D287. doi: 10.1093/nar/gkz889. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Schneider B., Berman H.M. Hydration of the DNA bases is local. Biophys. J. 1995;69:2661–2669. doi: 10.1016/S0006-3495(95)80136-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Schneider B., Neidle S., Berman H.M. Conformations of the sugar-phosphate backbone in helical DNA crystal structures. Biopolymers. 1997;42:113–124. doi: 10.1002/(SICI)1097-0282(199707)42:1<113::AID-BIP10>3.0.CO;2-O. [DOI] [PubMed] [Google Scholar]
- 47.Kim S.-H., Berman H.M., Seeman N.C., Newton M.D. Seven basic conformations of nucleic acid structural units. Acta Crystallogr. Sect. B. 1973;29:703–710. doi: 10.1107/S0567740873003201. [DOI] [Google Scholar]
- 48.Richardson J.S., Schneider B., Murray L.W., Kapral G.J., Immormino R.M., Headd J.J., Richardson D.C., Ham D., Hershkovits E., Williams L.D., et al. RNA backbone: Consensus all-angle conformers and modular string nomenclature (an RNA Ontology Consortium contribution) RNA. 2008;14:465–481. doi: 10.1261/rna.657708. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Schneider B., Bozikova P., Cech P., Svozil D., Cerny J. A DNA Structural Alphabet Distinguishes Structural Features of DNA Bound to Regulatory Proteins and in the Nucleosome Core Particle. Genes. 2017;8:278. doi: 10.3390/genes8100278. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Cerny J., Bozikova P., Svoboda J., Schneider B. A unified dinucleotide alphabet describing both RNA and DNA structures. Nucleic Acids Res. 2020;48:6367–6381. doi: 10.1093/nar/gkaa383. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Gouge J., Satia K., Guthertz N., Widya M., Thompson A.J., Cousin P., Dergai O., Hernandez N., Vannini A. Redox Signaling by the RNA Polymerase III TFIIB-Related Factor Brf2. Cell. 2015;163:1375–1387. doi: 10.1016/j.cell.2015.11.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Frouws T.D., Duda S.C., Richmond T.J. X-ray structure of the MMTV-A nucleosome core. Proc. Natl. Acad. Sci. USA. 2016;113:1214–1219. doi: 10.1073/pnas.1524607113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Olson W.K. Configurational statistics of polynucleotide chains. An updated virtual bond model to treat effects of base stacking. Macromolecules. 1980;13:721–728. doi: 10.1021/ma60075a045. [DOI] [Google Scholar]
- 54.Wadley L.M., Keating K.S., Duarte C.M., Pyle A.M. Evaluating and learning from RNA pseudotorsional space: Quantitative validation of a reduced representation for RNA structure. J. Mol. Biol. 2007;372:942–957. doi: 10.1016/j.jmb.2007.06.058. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Vander Zanden C.M., Rowe R.K., Broad A.J., Robertson A.B., Ho P.S. Effect of Hydroxymethylcytosine on the Structure and Stability of Holliday Junctions. Biochemistry. 2016;55:5781–5789. doi: 10.1021/acs.biochem.6b00801. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Wang Y., Patel D.J. Solution structure of a parallel-stranded G-quadruplex DNA. J. Mol. Biol. 1993;234:1171–1183. doi: 10.1006/jmbi.1993.1668. [DOI] [PubMed] [Google Scholar]
- 57.Davey C.A., Sargent D.F., Luger K., Maeder A.W., Richmond T.J. Solvent mediated interactions in the structure of the nucleosome core particle at 1.9 a resolution. J. Mol. Biol. 2002;319:1097–1113. doi: 10.1016/S0022-2836(02)00386-8. [DOI] [PubMed] [Google Scholar]
- 58.Lawson C.L., Carey J. Tandem binding in crystals of a trp repressor/operator half-site complex. Nature. 1993;366:178–182. doi: 10.1038/366178a0. [DOI] [PubMed] [Google Scholar]
- 59.Weaver T.M., Cortez L.M., Khoang T.H., Washington M.T., Agarwal P.K., Freudenthal B.D. Visualizing Rev1 catalyze protein-template DNA synthesis. Proc. Natl. Acad. Sci. USA. 2020;117:25494–25504. doi: 10.1073/pnas.2010484117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Jang S.B., Hung L.W., Chi Y.I., Holbrook E.L., Carter R.J., Holbrook S.R. Structure of an RNA internal loop consisting of tandem C-A+ base pairs. Biochemistry. 1998;37:11726–11731. doi: 10.1021/bi980758j. [DOI] [PubMed] [Google Scholar]
- 61.Chan C.W., Badong D., Rajan R., Mondragon A. Crystal structures of an unmodified bacterial tRNA reveal intrinsic structural flexibility and plasticity as general properties of unbound tRNAs. RNA. 2020;26:278–289. doi: 10.1261/rna.073478.119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Huang L., Wang J., Watkins A.M., Das R., Lilley D.M.J. Structure and ligand binding of the glutamine-II riboswitch. Nucleic Acids Res. 2019;47:7666–7675. doi: 10.1093/nar/gkz539. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Noeske J., Wasserman M.R., Terry D.S., Altman R.B., Blanchard S.C., Cate J.H. High-resolution structure of the Escherichia coli ribosome. Nat. Struct. Mol. Biol. 2015;22:336–341. doi: 10.1038/nsmb.2994. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Li X., Liu S., Zhang L., Issaian A., Hill R.C., Espinosa S., Shi S., Cui Y., Kappel K., Das R., et al. A unified mechanism for intron and exon definition and back-splicing. Nature. 2019;573:375–380. doi: 10.1038/s41586-019-1523-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Landeras-Bueno S., Wasserman H., Oliveira G., VanAernum Z.L., Busch F., Salie Z.L., Wysocki V.H., Andersen K., Saphire E.O. Cellular mRNA triggers structural transformation of Ebola virus matrix protein VP40 to its essential regulatory form. Cell Rep. 2021;35:108986. doi: 10.1016/j.celrep.2021.108986. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Lu X.J. DSSR-enabled innovative schematics of 3D nucleic acid structures with PyMOL. Nucleic Acids Res. 2020;48:e74. doi: 10.1093/nar/gkaa426. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Schneider B., Bozikova P., Necasova I., Cech P., Svozil D., Cerny J. A DNA structural alphabet provides new insight into DNA flexibility. Pt 1Acta Cryst. D Struct. Biol. 2018;74:52–64. doi: 10.1107/S2059798318000050. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Appasamy S.D., Hamdani H.Y., Ramlan E.I., Firdaus-Raih M. InterRNA: A database of base interactions in RNA structures. Nucleic Acids Res. 2016;44:D266–D471. doi: 10.1093/nar/gkv1186. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Boccaletto P., Machnicka M.A., Purta E., Piatkowski P., Baginski B., Wirecki T.K., de Crecy-Lagard V., Ross R., Limbach P.A., Kotter A., et al. MODOMICS: A database of RNA modification pathways. 2017 update. Nucleic Acids Res. 2018;46:D303–D307. doi: 10.1093/nar/gkx1030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Chojnowski G., Walen T., Bujnicki J.M. RNA Bricks—A database of RNA 3D motifs and their interactions. Nucleic Acids Res. 2014;42:D123–D131. doi: 10.1093/nar/gkt1084. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Zok T., Antczak M., Zurkowski M., Popenda M., Blazewicz J., Adamiak R.W., Szachniuk M. RNApdbee 2.0: Multifunctional tool for RNA structure annotation. Nucleic Acids Res. 2018;46:W30–W35. doi: 10.1093/nar/gky314. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.The RNAcentral Consortium. Petrov A.I., Kay S.J.E., Kalvari I., Howe K.L., Gray K.A., Bruford E.A., Kersey P.J., Cochrane G., Finn R.D. RNAcentral: A comprehensive database of non-coding RNA sequences. Nucleic Acids Res. 2017;45:D128–D134. doi: 10.1093/nar/gkw1008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Bernier C.R., Petrov A.S., Waterbury C.C., Jett J., Li F., Freil L.E., Xiong X., Wang L., Migliozzi B.L., Hershkovits E., et al. RiboVision suite for visualization and analysis of ribosomes. Faraday Discuss. 2014;169:195–207. doi: 10.1039/C3FD00126A. [DOI] [PubMed] [Google Scholar]
- 74.Sagendorf J.M., Berman H.M., Rohs R. DNAproDB: An interactive tool for structural analysis of DNA-protein complexes. Nucleic Acids Res. 2017;45:W89–W97. doi: 10.1093/nar/gkx272. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Petrov A.I., Zirbel C.L., Leontis N.B. Automated classification of RNA 3D motifs and the RNA 3D Motif Atlas. RNA. 2013;19:1327–1340. doi: 10.1261/rna.039438.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Zirbel C., Leontis N. Nonredundant 3D Structure Datasets for RNA Knowledge Extraction and Benchmarking. In: Leontis N., Westhof E., editors. RNA 3D Structure Analysis and Prediction. Volume 27. Springer; Berlin/Heidelberg, Germany: 2012. pp. 281–298. [Google Scholar]
- 77.Dana J.M., Gutmanas A., Tyagi N., Qi G., O’Donovan C., Martin M., Velankar S. SIFTS: Updated Structure Integration with Function, Taxonomy and Sequences resource allows 40-fold increase in coverage of structure-based annotations for proteins. Nucleic Acids Res. 2019;47:D482–D489. doi: 10.1093/nar/gky1114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Lu X.J., Bussemaker H.J., Olson W.K. DSSR: An integrated software tool for dissecting the spatial structure of RNA. Nucleic Acids Res. 2015;43:e142. doi: 10.1093/nar/gkv716. [DOI] [PMC free article] [PubMed] [Google Scholar]