ABSTRACT
Accurate biomolecular structure prediction enables the prediction of mutational effects, the speculation of function based on predicted structural homology, the analysis of ligand binding modes, experimental model building, and many other applications. Such algorithms to predict essential functional and structural features remain out of reach for biomolecular complexes containing nucleic acids. Here, we report a quantitative and qualitative evaluation of nucleic acid structures for the CASP16 blind prediction challenge by 12 of the experimental groups who provided nucleic acid targets. Blind predictions accurately model secondary structure and some aspects of tertiary structure, including reasonable global folds for some complex RNAs; however, predictions often lack accuracy in the regions of highest functional importance. All models have inaccuracies in non‐canonical regions where, for example, the nucleic‐acid backbone bends, deviating from an A‐form helix geometry, or a base forms a non‐standard hydrogen bond (not a Watson‐Crick base pair). These bends and non‐canonical interactions are integral to forming functionally important regions such as RNA enzymatic active sites. Additionally, the modeling of conserved and functional interfaces between nucleic acids and ligands, proteins, or other nucleic acids remains poor. For some targets, the experimental structures may not represent the only structure the biomolecular complex occupies in solution or in its functional life cycle, posing a future challenge for the community.
Keywords: CASP, community‐wide experiment, functionally relevant structure prediction accuracy, nucleic acid structure prediction, nucleic‐acid‐protein complexes, RNA folding
1. Introduction
Structural biologists have realized the immense benefits of accurate computational structure prediction methods now available for proteins [1], but the accuracy of computational nucleic acid structure prediction lags far behind [2, 3]. The experimental determination of structures plays an integral role in the field's journey toward accurate structure prediction. The structures determined experimentally increase the amount of data from which prediction algorithms can learn and enable blind structure prediction challenges like the Critical Assessment of Techniques for Structure Prediction (CASP) [3] and RNA Puzzles [4]. Additionally, studies on nucleic acid biology and structures identify the features of nucleic acid structure that are important to predict accurately for functional or biophysical reasons, guiding the goals of structure prediction. In this manuscript, as in CASP15 [5], structure providers for the CASP16 experiment analyze important structural and functional features of their nucleic acid‐containing structures and analyze how accurately modelers predicted these features.
In the second iteration of the nucleic acid category of CASP (CASP16, 2024), 37 RNA, 1 DNA, 11 RNA‐multimer, 16 nucleic‐acid‐protein complexes, and 6 nucleic‐acid ligand complexes were provided as modeling targets by 22 structure determination groups from 10 countries. All targets were released for prediction from May to July 2024. Among these, 12 targets were solved by x‐ray crystallography, 29 by cryogenic electron microscopy (cryo‐EM), and 1 by nuclear magnetic resonance (NMR). All target providers were invited to contribute to this study, resulting in 12 sections highlighting 18 of the targets (Table 1). The numerical evaluation of CASP16 models is available at the Prediction Center website (see Data Availability Statement). A general evaluation of predicted accuracy over all targets is provided elsewhere in this issue [3, 6].
TABLE 1.
CASP16 RNA targets included in this study.
| Target | Name | PDB | Length (nt) | Stoich. | Method | Res. (Å) | Potential templates | Comments |
|---|---|---|---|---|---|---|---|---|
| R1205 | Exoribonuclease‐resistant RNA (xrRNA) from plant‐infecting ST9a | 9CFN | 59 | R1 | x‐ray | 2.9 | 6D3P, 7JJU | Only residues 1–55 were resolved. |
| R1209 | HIV‐1 RRE SLII | 9C2K | 68 | R1 | x‐ray | 2.42 | — | Crystallized in complex with a Fab as a crystallization chaperone. |
| R1296 | HIV‐1 RRE SLII G34U | 9C75 | 68 | R1 | x‐ray | 3.0 | — | Crystallized in complex with a Fab as a crystallization chaperone. |
|
M1293 R1293 |
Translation enhancer motif | — | 82 (227, 215 aa) | R1 P1 P′1 | x‐ray | 2.52 | 8SH5 | Crystallized in complex with a Fab as a crystallization chaperone. |
| R1261 | ZTP riboswitch with ZMP | 9BZC | 89 | R1 | x‐ray | 2.52 | 6WZS, 6WZR, 4ZNP | |
| R1262 | ZTP riboswitch with AICA | 9BZ1 | 89 | R1 | x‐ray | 2.80 | 6WZS, 6WZR, 4ZNP | |
| R1263 | ZTP riboswitch with m‐1‐pyridinyl AICA | 8VQV | 64 | R1 | x‐ray | 2.43 | 4XW7, 4XWF | Ligand pocket is between two symmetric copies of the RNA. |
| R1264 | ZTP riboswitch with AICA derivative | 8VVJ | 64 | R1 | x‐ray | 2.15 | 4XW7, 4XWF | Ligand pocket is between two symmetric copies of the RNA. |
| R1288 | SAMURI | 9FN2 | 58 | R1 | x‐ray | 2.9 | — | Ligand SAH present in the structure; RNA was crystallized with SAM, resulting in transfer of methyl group to N3 of A52, that is, structure contains m3A (=3‐methyladenosine) and SAH as bound ligand. |
| D1273 | RKEC1 DNA aptamer bound to dopamine | 9HIO | 27 | D1 | NMR | N/A | — | Ligand present. Ensemble of 15 structures. |
| R1241 | Group II intron | 9QTJ | 481 | R1 | Cryo‐EM | 3.8 (3.3–4.1) | 4FAQ, 7UIN, 8FLI, 5G2Y, 7D0F, 4R0D | |
| R1286 | Lactobacillus salivarius (Lsa) ROOL ncRNA | 9J6Y | 526 | R1 | Cryo‐EM | 3.1 (2.7–6.7) | — | R1 stoichiometry was set for CASP16 prediction but was R6 in experimental structure. |
| R1283 v1, v2, v3 | Enterococcus faecalis (Efa) ROOL ncRNA |
9ISV, 9J3R, 9J3T |
580 | R1, R4, R8 | Cryo‐EM |
3.1 (2.5–8.2), 4.7 (4.1–14.9), 3.8 (3.4–6.7) |
— | |
| R1285o | OLE RNA | 9MCW | 577 | R2 | Cryo‐EM | 2.9 (2.4–6.1) | — | Only the 5′ domain was resolved. |
| R1281o | 2′ FY RNA 6HBC‐dimer | — | 720 | R2 | Cryo‐EM | 9.6 (8.5–15.9) | 7PTK, 7PTL | RNA was modified with 2′ F on the pyrimidines but this could not be annotated in fasta format. |
| M1282 | Spike‐RBD + 2′ FY RNA anti‐spike‐aptamer | — | 51 (+ 197 aa) | R1 P1 | Cryo‐EM | 3.3 (2.0–48.8) | — | RNA was modified with 2′ F on the pyrimidines but this could not be annotated in fasta format. |
| M1239 v1, v2 | SPβ LSI‐RDF synaptic complex | — | 26, 24, 33, 35 (620 aa) | D2 D′2 D″2 D‴2 P4 | Cryo‐EM | 4.23 (3.8–15.0) | 1ZR4, 3BVP, 4KIS, 5U96, 5UAE, 5UDO | No template is available for the full complex. |
| M1276 | Cryptic DNA binding protein UDE in complex with ssDNA | — | 33 | D1 P1 | x‐ray | 1.25 | — |
Here, we detail the interactions of functional importance in these CASP targets. These regions are often non‐canonical in nature, deviating from the conventional A‐form helix or Watson‐Crick (WC) base pairs. We observe that these functionally important, but non‐canonical regions are particularly challenging for CASP16 predictors to model accurately. Outside of these functionally important regions, top groups predicted the secondary structure and often the global fold with high accuracy, suggesting that attention can now be focused on improving prediction accuracy of the functionally important regions we outline here.
Predictors modeled NA‐ligand interactions with known templates accurately, but could not predict novel interactions. We additionally observe very limited success in predictions of NA‐NA interactions, a new prediction category. In biological systems, NAs often function as part of NA‐protein complexes, and we reveal that, unlike NA‐NA interactions, even when groups accurately predicted the protein binding site for the nucleic acid, the nucleic acid pose, and in particular the NA‐protein chemical interactions, were not predicted accurately. Predictions of interfaces NA‐ligand, NA‐NA, and NA‐protein are of vital importance in understanding biological machines, revealing these new prediction categories as key areas for improvement.
The best performing of the 65 predicting groups in CASP16 use a wide range of methods, including fragment assembly, physics‐based simulation, and deep learning, to sample a large amount of structures. A key attribute of the top groups was their manual selection of models using expert knowledge. Automated algorithms, like AlphaFold3, perform significantly worse overall [3]. Methods often rely heavily on templates or prior structural information, which we observe can manifest as over‐reliance on canonical motifs or previous structures. This reliance will need to be reduced and supplemented with other information, including evolutionary data, to overcome current limitations in predicting functionally important nucleic acid interactions.
2. Results
2.1. Exoribonuclease‐Resistant Viral RNA (CASP: R1205, PDB: 9CFN) Provided by Jeanine Gezelle and Anna‐Lena Steckelberg
Exoribonuclease‐resistant RNAs (xrRNAs) are specialized viral RNA structures that block degradation by cellular 5′‐3′ exoribonucleases at specific sites within the viral RNA genome [7]. This resistance results in the accumulation of viral subgenomic RNAs, which play critical roles in immune modulation and viral replication [8]. Initially identified in human‐pathogenic flaviviruses, such as Dengue virus and Zika virus, xrRNAs have since been found across a wide range of RNA virus families infecting both animals and plants [9, 10, 11, 12]. Despite a complete lack of sequence similarity, xrRNAs from distantly related virus families share a conserved core motif: a central ring encircling the 5′ end of the RNA [7, 10, 13]. This ring functions as a molecular brace, effectively countering the unidirectional unwinding forces of cellular exoribonucleases to protect the RNA from degradation.
CASP16 target R1205 (PDB: 9CFN) represents the first high‐resolution structure of a plant‐virus xrRNA in its nuclease‐resistant conformation [14]. The structure, derived from the plant‐infecting ST9a virus, was solved by x‐ray crystallography to 2.9 Å resolution. The crystallization construct consisted of 59 nucleotides, with nucleotides 56–59 excluded from the final model due to insufficient electron density at the 3′ end. The RNA adopts a compact fold composed of a stem loop (SL) of coaxially stacked helices P1 and P2, with a 9‐base pair pseudoknot (PK) between the apical loop and the 3′ end of the RNA (Figure 1A,B). The PK causes the 3′ end to wrap around the 5′ end, forming the characteristic protective ring topology previously observed in xrRNAs from other viral families (Figure 1B). A highly conserved network of non‐canonical interactions orients the helices and anchors the ring within the xrRNA scaffold (Figure 1A,B). Most notably, a hairpin loop, L2B, protrudes from the apical loop L2 to cap the base of the PK. The backbone of L2B resembles a U‐turn RNA tetraloop, but it is closed by a reverse Hoogsteen U‐A base pair and a reverse WC base pair involving flipped‐out A31 of P2. L2B is positioned at an acute angle next to P2, behind the protective ring, and engages in multiple non‐canonical tertiary interactions with the J1/2 and J1/3 regions of the RNA. These include (a) A5 stacking between A28 and A29 and interacting with the sugar edge of G26, (b) C6 forming a long‐range base pair with G27, and (c) two conserved base triple interactions, A42‐A5‐G26 and G43‐A29‐U25, that position the 3′ end of the protective ring. While R1205 represents the first structure of a plant‐virus xrRNA in its active PK conformation, two previously solved structures captured related xrRNA variants in a SL conformation (PDB: 6D3P, 7JJU) [10, 15]. In these cases, the intramolecular PK was replaced by equivalent trans interactions between two adjacent RNA molecules, forming a “domain‐swapped” dimer. These earlier structures could have informed the CASP16 predictions.
FIGURE 1.

Predictions of the ST9a Xrn1‐resistant RNA (CASP: R1205, PDB: 9CFN). (A) Secondary structure diagram of ST9a xrRNA crystallization construct. Non–Watson–Crick base pairs are in Leontis–Westhof annotation and the Xrn1 stop site is depicted by the arrow. Non‐modeled nucleotides are shown as lowercase letters. (B) Ribbon diagram of the ST9a xrRNA crystal structure. Colors match (D). Arrows in all panels indicate strand directionality 5′‐3′. (C) 3D ribbon diagram of the RNApolis group CASP16 prediction with leading metrics, colors to match (A). (D) Left: 3D ribbon diagram of Vfold prediction TS481_3, colors to match (A). Right: Overlay of the PK and J1/2‐L2B region of the Vfold model (dark) and the experimental crystal structure (PDB: 9CFN, light/tints); RMSD = 1.537. All 3D structure analysis was performed in PyMOL. (E) Backbone depiction of the GromihaLab prediction, colors to match (A) and (B), arrows in the 5′‐3′ direction. (F, G) 3D ribbon diagrams of CASP16 predictions with leading metrics from (F) AF3‐server and (G) GuangzhouRNA‐meta, colors to match (A). Note that the RNA backbones in (D–F) form knots. (H) Secondary structure depictions of the P1 and P2 regions in the crystal structure (left) and in CASP16 predictions (right).
Many models correctly predicted some aspects of the global fold of R1205—including helices P1 and P2, the PK, and the 3′ end wrapping around the 5′ end—yet all failed to accurately predict crucial non‐canonical tertiary interactions (Figure 1C–G). The model with the lowest RMSD (7.46 Å) was TS063_4, submitted by the RNApolis group. Surprisingly, TS063_4 and related models from RNApolis (all with an RMSD < 10 Å) were the only structures to correctly predict the L2B hairpin positioned behind the protective ring at the base of the PK (Figure 1C). However, key non‐canonical interactions of L2B, including the conserved Hoogsteen base pair and base‐triple interactions, were missing. Furthermore, the models predicted a shorter 6‐base pair PK, highlighting the challenge of folding a structure that accommodates both P1‐P2 and the full 9‐base pair PK. In the crystal structure, this is achieved by a sharp bend in the L2 backbone, along with two adenosines that interact with the minor groove of the PK (Figure 1B). These unusual A‐minor interactions in cis are not found in any other structures currently available in the PDB and were not predicted by any CASP16 models.
The best model according to most CASP metrics was V‐fold TS481_3 (GTD‐TS 0.43, local Distance Difference Test (lDDT) 0.499) (Figure 1D). This model was the only one to successfully predict a subset of the non‐canonical L2B interactions, including the A‐U Hoogsteen base pair, A5 stacking between A28 and A29, and the interaction of A5 with the sugar edge of G26 (Figure 1D, right). However, while these local interactions were correctly predicted, their position within the global structure was incorrect. Specifically, in this model, L2B is positioned in front of the protective ring. This orientation facilitates the accommodation of both P1‐P2 and an 8‐base pair PK in the same structure, but it introduces a remarkable folding problem: the resulting RNA structure contains two intertwined helices that form a genuine knot. To form this knot, the 3′ end would need to pass through a closed ring in the RNA, a highly improbable folding path. Notably, we found that a majority of high‐scoring models adopted similar knot topologies.
While the xrRNA crystal structure exhibits numerous sharp backbone bends and non‐canonical interactions (Figure 1A,B), most CASP16 models rely predominantly on canonical A‐form helices. This is particularly evident in the AlphaFold3 models, as well as those from GromihaLab, KiharaLab, and the GuangzhouRNA group (Figure 1E–G). Although these models scored reasonably well in CASP metrics, they all failed to capture key non‐canonical interactions. Instead, all created a knot in the RNA backbone, similar to the Vfold model (Figure 1E–G). Beyond missing critical tertiary interactions, most models also mispredicted elements of the secondary structure. This is especially evident in P2, where a register shift was introduced due to a failure to position C6 and/or A31 in bulged regions engaged in long‐range tertiary interactions (Figure 1H).
In summary, we found that an overreliance on canonical A‐form RNA helices limited the accuracy of most CASP16 models. Despite two related RNA structures being available in the PDB [10, 15], and L2B‐like motifs identified in ribosomal RNAs [10], only one model accurately predicted the L2B tertiary interactions (Figure 1D). Additionally, predictions often misinterpreted nucleotides involved in long‐range tertiary interactions as part of extended helices (Figure 1H). To improve prediction accuracy, more diverse training sets are needed, including additional high‐resolution structures of RNA featuring non‐canonical interactions. Most remarkably, to maximize base‐pairing interactions, many models predicted implausible structures that introduced genuine knots in the RNA backbone (Figure 1D–G). Considering folding intermediates in RNA structure predictions could help avoid such issues and better reflect biologically relevant structures.
2.2. HIV‐1 Rev Response Element Stem‐Loop II (CASP: R1209 and R1296, PDB: 9C2K and 9C75) Provided by Manju Ojha and Deepak Koirala
The Rev. Response Element (RRE) is about 350‐nucleotide‐long cis‐acting RNA domain found in the env coding region of the HIV‐1 genome [16, 17]. RRE plays a crucial role in HIV‐1 replication by interacting with the viral‐encoded protein Rev to form an oligomeric complex that facilitates the nuclear export of intron‐containing viral RNAs during the late phase of viral infection [18, 19, 20]. This ribonucleoprotein complex has been considered a promising therapeutic target for developing drugs against HIV infections [21, 22]; however, the mechanism underlying this crucial virological process remains poorly understood, mainly due to the lack of high‐resolution structural information on RRE and RRE‐Rev complexes. The proposed secondary structures of HIV‐1 RRE consist of four stem‐loop structures II‐V budding from a basal stem I [17, 23]. Previous studies based on small‐angle x‐ray scattering (SAXS) and atomic force microscopy (AFM) proposed an unusual A‐shaped tertiary structure of the intact RRE [24, 25], but detailed structural features were missing due to the low resolution of those methods. Recently, Tipo et al. [26] solved the 2.85 Å crystal structure of the stem‐loop II (SLII) using a t‐RNA scaffold approach (CASP: R1203, PDB: 8UO6) and found that two molecules within the crystallographic unit fold into two distinct conformations: “open” and “closed” [26].
Using a Fab‐assisted crystallography approach [27, 28], we crystallized and determined the structure of a 68‐nucleotide SLII construct (CASP: R1209, PDB: 9C2K, 2.42 Å resolution), where a Fab binding sequence replaced the IIc loop [29]. Our SLII crystal structure revealed a unique three‐way junction (3WJ) architecture, where the base IIa stem bifurcates into the IIb and IIc stem‐loop structures, with extensive interactions between the junction nucleotides [29]. Interestingly, while the structures of individual domains IIa, IIb, and IIc were similar to that determined by Tipo et al. [26], our structure showed a completely different arrangement within the 3WJ, resulting in a different 3D architecture of the RNA. We also solved the crystal structure of the SLII G34U mutant (CASP: R1296, PDB: 9C75, 3.0 Å resolution), which adopted a different conformation than its non‐mutant form, where the junction nucleotides reregistered to form a simpler 3WJ [29]. Notably, the non‐mutant form appeared more compact than the G34U mutant. Our functional studies have shown that the compact non‐mutant fold represents a functional conformation, and the extended G34U mutant induces loss of nuclear export function [29]. Due to the highly plastic nature of the SLII structure and its relation to functional consequences, understanding the multiple possible folds of SLII RNA and how it performs with RNA 3D structure prediction was quite interesting; therefore, crystallized sequences of both non‐mutant and G34U mutant were submitted to the CASP16 for blind predictions.
CASP16 predictors proposed 213 and 230 models of SLII (R1209) and SLIIG34U (R1296), respectively. For SLII (R1209), the predicted models deviated greatly from the crystal structure, with the top five models only obtaining global RMSDs ranging from 7.8 to 9.4 Å (Figure 2A–E). However, with a few minor deviations, the top five predictions accurately modeled the individual subdomains of the crystal structure: IIa (RMSD = 0.856 ± 0.213 Å; mean ± SD), IIb (RMSD = 2.899 ± 0.284 Å; Figure 2F), and IIc (RMSD = 0.786 ± 0.230 Å; Figure 2G). The predicted models of IIb (Figure 2F) showed higher variability than the predictions of IIc (Figure 2G) across all five models, likely since IIb is more dynamic than IIc, as indicated by high crystallographic B factors for region IIb. These comparisons, therefore, show that the significant discrepancies between the crystal structure and predicted models originate from the 3WJ region that defines the relative orientations of these stem‐loop structures. Even for the top model, R1209TS006_2, predicted by RNA_Dojo, with the lowest global RMSD (7.81 Å), base pairing and tertiary interaction patterns within the 3WJ are quite different from those observed in the crystal structure. Within the 3WJ, although the prediction of unpaired U35‐A38 nucleotides matched the crystal structure, unlike the flipped‐out G33 and G34 in the crystal, these nucleotides form non‐canonical interactions with G12 and G10 in the predicted model. Similarly, in the predicted model, the IIa‐IIb junction (J ab) G9 makes a non‐canonical pairing with IIc‐IIa junction (J ca) U64, unlike in the crystal structure, where it makes a tertiary interaction with C32 and U64 remains unpaired. Additionally, G13 is base paired with C32 in the predicted model but forms a non‐canonical G13•A31 pair in the crystal structure. For all top five models, A31 has been predicted to be flipped out as a bulge. Finally, other than model R1209TS006_2, which agreed with the crystal structure, all other top four models showed J ca A63 and U64 base pairing with G40 and C41, respectively.
FIGURE 2.

Comparison of top‐five predicted models with HIV‐1 RRE SLII and SLIIG34U mutant crystal structures (CASP: R1209 and R1296, PDB: 9C2K and 9C75). (A–E) The top‐five predicted models for SLII (R1209, colored) scored according to the CASP16 global RMSDs and superimposed with the corresponding crystal structure (light blue). The alignment of the IIb stem‐loops (F) and IIc stem‐loops (G) from the same top‐five models with the corresponding regions of the crystal structure. The IIb and IIc stem‐loop models are colored differently, but corresponding crystal structures are colored the same for facile comparisons. The RMSDs shown in (F) and (G) represent all‐atom alignment in PyMol. (H–L) The top‐five predicted models for SLIIG34U (R1296, colored differently) scored according to the CASP16 global RMSDs and superimposed with the corresponding crystal structure (blue).
Compared to SLII (R1209), the predicted models for SLIIG34U (R1296) aligned better with the corresponding crystal structure, with the top‐five global RMSDs ranging from 3.08 to 8.23 Å (Figure 2H–L). Similar to R1209, all top‐five predicted models modeled individual subdomains with high accuracy: IIa (RMSD = 0.771 ± 0.075 Å), IIb (RMSD = 1.339 ± 0.158 Å), and IIc (RMSD = 0.566 ± 0.220 Å), suggesting that the variations between the crystal structure and predicted models reflect the discrepancies within the 3WJ region. Nevertheless, unlike R1209, one of the models, R1296TS481_3 predicted by Vfold, is highly similar to the crystal structure with a global RMSD of 3.08 Å. Vfold even correctly predicted two non‐canonical base pairings, G10•U35 and G11•U34, while other models predicted U35 as an unpaired nucleotide. The only noticeable difference within the 3WJ is the wobble pairing between the junction IIb‐IIc (J bc) G40 and J ca U64 in the model, which was not found in the crystal structure, even though the position and helical stacking of these nucleotides are similar. Overall, perhaps because the 3WJ of SLIIG34U is relatively simple with no evident tertiary interactions compared to those in non‐mutant SLII, the blind predictions performed better for the mutant sequence.
Our experimental SLII and SLIIG34U structures captured in crystals demonstrate how a point mutation in an RNA structure can drastically alter its overall conformation. However, solving such conformationally diverse structures is often infeasible due to the challenges associated with RNA structural determination. Therefore, the high similarity of the predicted model and SLIIG34U highlights the value of computational approaches for understanding 3D folding and the complex conformational landscape that can emerge from an RNA sequence in solution. However, such RNA structural plasticity complicates the prediction of functionally relevant 3D structures. In these two cases (R1209 and R1296), whereas the predicted structures of subdomains for most models align closely with the corresponding crystal structures, the relative positioning of these subdomains seems to be the key factor determining the divergence between the experimental and predicted results. Crystal structures only provide a static 3D reference for comparisons with predicted models; thus, it remains unclear whether the deviations between the crystal structure and predicted models (in‐solution or near‐native state) originate from the relative orientations of subdomains within the RNA structure or whether the crystal structure itself is limited to a specific fold due to the stabilization by lattice contacts. For instance, the crystal structure of SLII showed significant crystal contacts between the symmetry‐mate nucleotides, whereas SLIIG34U exhibited no apparent crystal contacts near 3WJ. While blind prediction of RNA 3D structures has improved significantly in recent years, acquiring more high‐resolution experimental structures using various methods (x‐ray, NMR and cryo‐EM) enriches the RNA structural database, which is used to improve computational prediction algorithms.
2.3. 3′ Cap‐Independent Translation Enhancer (3′ CITE) From Thin Paspalum Asymptomatic Virus (CASP: R1293/M1293) Provided by Anna Lewicka, Yoshita Srivastava, and Joseph A. Piccirilli
In eukaryotes, processed messenger RNA (mRNA) is covalently modified with a 5′ N7‐methylguanine triphosphate cap and 3′ poly(A) tail, which serve to identify the mRNA by the eukaryotic translation initiation factors [30]. Recognition of the 5′ cap structure by eukaryotic initiation factor 4E (eIF4E) represents a hallmark feature in the initiation of protein translation.
Some RNAs that lack a 5′‐cap have evolved mechanisms to circumvent this canonical mechanism. For example, positive strand RNA viruses that replicate in the cytoplasm cannot access the cellular capping machinery present in the nucleus. To enable the virus to exploit the host's translation machinery, the 5′ cap is functionally replaced with an RNA sequence that folds into a structured element [31]. Recently discovered cap‐independent translation enhancers (CITEs) have been found in plant viruses. These structured RNA translation elements located within the 3′ UTR usually span 100–200 nt in length and encompass a single domain. Their complex tertiary structures are capable of binding host factors that facilitate the translation of viral genes [32].
There are seven classes of CITEs characterized based on their RNA structures and functional identity [33]. One of the classes, named PTE (PMV or PEMV‐like translational enhancer) binds to eIF4E using a T‐shaped structured RNA element, which contains a G‐rich region (G domain, J 2/3) that exhibits apparent complementarity to the C‐rich region (C domain, J4/5) located in a 3WJ between P4 and P5 (Figure 3A,B) [34, 35]. The crystal structure of the PTE from Pea enation mosaic virus 2 (PEMV2) RNA in complex with a structural chaperone, Fab BL3–6, revealed that the G‐rich and C‐rich regions interact through a complex network of long‐range interactions [34]. The long‐range interactions between the G domain, C domain, and the J5/3 form an unusual architecture that involves parallel strand WC base pairing within a unique stacked four‐layer scaffold that projects a single G of the G domain away from the core fold into solvent, where it is poised to engage in eIF4E binding (Figure 3A–C) [35].
FIGURE 3.

Prediction accuracy for RNA CITE element (CASP: R1293/M1293). (A) Crystal structure of TPAV in complex with Fab BL3‐6. (B) Secondary structure diagram of TPAV RNA (C) G and C domains of TPAV (D) From left to right: Superposition of Vfold model (R1293TS481_1, blue) on TPAV RNA (red), Superposition of RNApolis model 2 (R1293TS063_2, green) on TPAV RNA, Superposition of GeneSilico model (R1293TS338_1, cyan) on TPAV RNA. (E) From left to right: Superposition of paired G and C domains of Vfold model (R1293TS481_1) on TPAV, Superposition of paired G and C domains of RNApolis model (R1293TS063_2) on TPAV, Superposition of paired G and C domains of GeneSilico model (R1293TS338_1) on TPAV.
Recently, we have crystallized and solved the structure of another PTE class homolog from thin paspalum asymptomatic virus (TPAV). The TPAV CITE differs slightly from PEMV2 in its G and C domains. To obtain diffraction‐quality crystals, the solvent‐exposed RNA loop L1 was mutated to the Fab‐binding sequence (GAAACAC).
We submitted the structure of this Fab‐RNA complex to CASP16 for structure prediction and received over 200 models for the RNA and over 150 models for the RNA‐Fab hybrid. We analyzed the models by RMSD and global metrics such as Global Distance Test‐Total Score (GDT‐TS). We chose to further analyze the models with less than 10 Å RMSD. These models also had better lDDT and GDT‐TS scores. We noticed that most models within the specified RMSD cutoff were able to predict the overall fold of the RNA reasonably well. Additionally, we also looked at the INF (Interaction Network Fidelity) scores and noticed that many models correctly predict the secondary structure of the RNA. With the structure of homologous PEMV2 being available, we anticipated that prediction efforts might utilize it as a modeling template. Some of the models that stood out for overall agreement with the crystal structure were from RNApolis (R1293TS063_2), Vfold (R1293TS481_1) and GeneSilico groups (R1293TS338_1) (Figure 3D).
One of the major challenges for modeling was the interaction between the G domain in the asymmetric bulge between P2 and P3 and the C domain between P4 and P5. The G and C domains in our TPAV structure make long‐range WC interactions to form a conformation suitable for eIF4E binding. In the crystal structure, the G domain makes a bend at position 17G, which changes the direction of RNA. This abrupt turn flips the G18 nucleobase out of the stacked core towards the solvent. The backbone reversal due to the acute turn at G18 places G19 and A20 below G17 and C16, respectively, where they make interactions with C49 and U61, the single J5/3 nucleotide. Upon inspecting the models closely, we found that some of the best models based on global metrics deviated from the crystal structure in the paired interactions between the G and C domains. RNApolis and Vfold do a relatively better job than GeneSilico at predicting the paired G and C domains with the regional RMSD < 2 Å. A bottleneck in modeling these long‐range interactions seems to be in modeling bends and turns in the RNA sugar backbone. The model from GeneSilico tries to constrain the G‐C domain to form a more commonly observed anti‐parallel helix structure. A specific difference between the predicted models and the crystal structure is the conformation of the G domain's first residue (G15). In the crystal structure, this residue inserts into the pocket created between the P2‐P3 stack on one side and the interacting G and C domains on the other side. The nucleobase of G15 is almost perpendicular to the plane of the A13. U71 base pair. G15 forms phosphate‐ribose zipper interactions with A20, similar to A18 and A23 in PEMV2, but with an additional interaction between G15:N2 and A20:O3′. In the recently reported structure of the PTE from saguaro cactus virus [28] the first residue of the G domain adopts a similar orientation as observed here for TPAV. In contrast, the predictions for the TPAV element model this residue as being flipped out from the core into solvent. The difference between the crystal structure and the predictions may result from modeling the structure against the published PEMV2 structure, where the corresponding nucleotide is flipped out of the core.
A general observation from the predictions was that most models do a good job describing base‐paired regions but predict bulge and junction regions less reliably. This shortcoming shows up in the R1293TS481_1 modeled by Vfold, in which A6 in the core changes the angle between P1 and P2 (Figure 3B). This outcome probably reflects model bias toward maintaining a strictly A form helical structure in RNA.
We also analyzed the predictions for the RNA‐protein complex, focusing on the interface region. We noticed that most models with a high global QS metric were able to accurately predict the RNA‐protein interfaces. This outcome is anticipated as the RNA‐protein interface is well established in the literature due to the Fab and its antigenic loop serving as a portable crystallization module. RNApolis, which predicted one of the best models for the RNA component, also predicts the RNA‐protein interface quite accurately with an RMSD of 1 Å and a global QS score of 0.96.
2.4. ZTP Riboswitches Bound to Synthetic and Natural Ligands (CASP: R1261, R1262, R1263, and R1264, PDB: 9BZC, 9BZ1, 8VQV, and 8VVJ) Provided by Christopher P. Jones
The targets R1261, R1262, R1263, and R1264 are co‐crystal structures of ZTP (5‐aminoimidazole‐4‐carboxamide riboside 5′‐triphosphate) riboswitches bound to various ligands. ZTP riboswitches often regulate genes associated with purine biosynthesis and folate metabolism [36] by sensing the levels of ZMP (5‐aminoimidazole‐4‐carboxamide riboside 5′‐phosphate) and ZTP. Their conserved folds are characterized by four paired elements that include one pseudoknot. However, variations on this conserved fold include the insertion of additional paired elements.
R1261 and R1262 contain novel structures of the Clostridium beijerinckii ZTP riboswitch co‐crystallized with ZMP and AICA, respectively. R1261 and R1262 differ in that R1261 was treated with CsCl (for phasing), which may explain an alternative loop conformation present near the Cs+ binding site. The C. beijerinckii ZTP riboswitch contains an additional paired element inserted between the two conserved sub‐domains [37], and the RNA is relatively modular compared to other ZTP riboswitches, making it an attractive specimen for folding studies [38, 39]. As these structures are related to those of previously determined ZTP riboswitches, their prediction should be low‐hanging fruit. Indeed, predictions for this RNA were generally acceptable with 72% and 77% of predictions achieving TM‐score greater than 0.45 for R1261 and R1262, respectively. However, predictors were not generally able to achieve atomic accuracy (TM‐score > 0.70): only 4 models (1.8%) for R1261 and 1 (0.5%) for R1262 achieved a TM‐score greater than 0.70. The RNA‐ligand interaction was predicted accurately, as expected because of the availability of a template: 45% and 31% of predictions for R1261 and R1262, respectively, achieved a lDDT for the RNA‐ligand interface greater than 0.75.
R1263 and R1264 are co‐crystal structures of two synthetic ligands with the previously solved Schaalia odontolytica ZTP riboswitch [40]. These synthetic ligands were developed as part of an ongoing investigation of the types of ligands that bind and activate the ZTP riboswitch in vitro and in vivo [41]. R1263 contains the best synthetic ligand, m‐1‐pyridinyl‐AICA, determined at a higher resolution than previously achieved [41, 42] and should be easily predicted. 33% and 26% of the models submitted for R1263 and R1264, respectively, were of high quality (TM‐score > 0.70), more than for C. beijerinckii ZTP riboswitch, likely because of the better RNA template. R1264 contains a novel ligand (1,8‐napthyridinyl‐3‐AICA) whose bulkier aromatic ring system led the RNA to crystallize under alternative conditions and in a different RNA conformation [42]. Residue C52 is observed adjacent to m‐1‐pyridinyl‐AICA and must move to accommodate the bulkier ligand. Predictors were unable to predict this change in R1264 accurately, and no group achieved a local lDDT for C52 greater than 0.70 for R1264. However, a third of the models predicted the conformation of C52 in R1263, which matched the template. 37% and 33% of models achieved lDDT for the RNA‐ligand interface greater than 0.75. Predicting this type of conformational change, which is perhaps warranted given the size of the ligand, would greatly enable structure‐guided design at the stage of ligand design and synthesis. This is particularly relevant owing to the difficulty in crystallizing bulkier ligands and the twinning apparent in these crystals.
2.5. SAM Utilizing Alkyltransferase Ribozyme SAMURI (CASP: R1288, PDB: 9FN2) Provided by Hsuan‐Ai Chen and Claudia Höbartner
Most natural ribozymes catalyze the cleavage or formation of phosphodiester bonds, while synthetic ribozymes afford a broader spectrum of chemical reactions [43, 44, 45]. Recently, in vitro selected ribozymes that catalyze site‐specific RNA modifications have been attracting increasing interest for their purported roles in an RNA world, and for their practical utility as tools in RNA biology. Specific examples of RNA‐modifying ribozymes include the first methyltransferase ribozyme (MTR1) [46], the engineered preQ1 riboswitch‐ribozyme [47], and two S‐adenosylmethionine (SAM)‐based ribozymes: SMRZ1 [48] and SAMURI [49]. These ribozymes install methyl (and, in some cases, other alkyl) groups at defined positions within a target RNA.
Analyzing the structure of SAMURI is especially intriguing in comparison to natural SAM‐binding riboswitches, which control gene expression in response to SAM or S‐adenosylhomocysteine (SAH) [50]. While SAMURI readily transfers the methyl (or alkyl) group from SAM (or analogous selenomethionine cofactors) to N3 of a specific adenosine in the substrate RNA (Figure 4A), natural SAM riboswitches actively suppress RNA (self‐)methylation. We solved the crystal structure of SAMURI in the post‐catalytic state containing 3‐methyladenosine (m3A) and SAH [51]. The structure revealed how SAMURI positions the reactants in an orientation poised for an SN2‐like substitution reaction and, thus, adds a new example to the versatile structures RNAs can adopt to achieve distinct functions.
FIGURE 4.

Prediction accuracy of SAMURI‐SAM complex (CASP: R1288, PDB: 9FN2). (A) Secondary structure of SAMURI and the reaction scheme. Three A‐form helices (P1‐P3) are shown in bold lines and the catalytic core (J1/2 and J2/3) in colored letters. Colored dashed lines represent the WC base pairs according to the crystal structure. (B) Crystal structure of SAMURI‐SAM complex (PDB: 9FN2). Recognition patterns of each layer are shown in the boxes: cofactor layer in blue, reaction layer in magenta, stabilization layer in green, and bottom layer in yellow. (C) The model R1288LG338_4 successfully predicted the coaxial folding of P2 and P3. A partially correct base pairing interaction in the cofactor layer is shown in the box. (D) The overall structure R1288LG262_4 and its SAH binding mode. (E) The predicted SAH binding mode (left box) and bottom layer (right box) from the model R1288LG055_1.
SAMURI adopts a 3WJ structure (Figure 4A), with the two substrate binding arms, P1 and P3, and the internal stem, P2, forming three canonical A‐form helices. The catalytic core features a four‐layered architecture composed of the cofactor layer, the reaction layer, the stabilization layer, and the bottom layer (Figure 4B). A base triple in the bottom layer extends the π‐π stacking of the coaxial P2 to P3 helices, which assists cofactor binding. A characteristic feature is the kink between P1 and P3 that docks the target adenosine (A52) in the reaction layer by H‐bonding of its WC and Hoogsteen edges, while exposing the minor groove edge to facilitate the reaction at N3.
This collection of key structural features of SAMURI was not recapitulated in the predictions. None of the R1288 predicted models within RMSD < 10 Å managed to model the active site and ligand interactions correctly. The lDDT scored 0–0.16 for all models, which is lower than the general standard of a low‐quality model (lDDT > 0.3). This result reflects the challenge to predict the active site and ligand binding mode of synthetic functional RNAs.
Since the lDDT values are too low, we inspected the models with root mean square deviation (RMSD) within 10 Å (best score ~4 Å), with the ultimate goal of solving an x‐ray structure by molecular replacement using a predicted model (inspired by a similar approach by CASP15 evaluators [5]). We defined a manual scoring scheme to scrutinize the structural details of the overall fold with three helices, the four layers, the kink, the SAH binding, and magnesium ion locations. As expected, all models predicted the three paired regions (P1, P2, and P3) correctly as double helices, albeit distinct in shape and orientation. Laudably, the models R1288LG338_4 and R1288LG272_1 successfully predicted the coaxial stacking of P2–P3 (Figure 4C), even though the catalytic core was miscalculated in both cases. Despite several canonical base pairs in the catalytic core (Figure 4A, dashed lines), it was quite difficult for the algorithms to find the correct partners. The best predictions fell in the bottom layer, where half of the models correctly inserted G30 in a base triplet. The models R1288LG055_1 and R1288LG262_1 captured this interaction similar to the crystal structure (Figure 4E). Several of the models correctly found the base pairs U12‐A35 in the stabilization layer and C11‐G36 in the reaction layer. However, the correct partners for the experimentally observed triplet and quartet interactions were not identified. The sharp turn for accurately locating A52 in the backbone was another major hurdle. A few models (R1288LG272_1, R1288LG091_1, and R1288LG338_4) showed a slight turn but too shallow to form the reaction triplet (Figure 4C), while other models located A52 in a smooth transition from P1 to P3 (Figure 4D).
The location and binding mode of SAH were challenging to predict. While most models placed SAH near the catalytic core, neither the orientation nor the binding mode was correctly calculated to reflect the G9 in syn conformation paired with the sugar edge of U37, which positions U37 to anchor SAH through H‐bonding (Figure 4B) and stacking between G51 and A52. Only the model R1288LG338_4 spotted a G9‐U37 interaction but predicted it as a classical wobble pair (Figure 4C). Several other models placed G10 in a wobble pair with U37 instead of the experimentally observed WC pair with C34 (Figure 4D,E), or predicted a G10‐A35 purine‐purine base pair in place of the stabilization quartet. The model R1288LG262_4 placed the SAH in a splayed conformation, with the base tucked in the core and the methionine tail pointing towards the opening space in the P1 direction (Figure 4D), while model R1288GL055_1 found a solution with SAH base‐paired to A7 and the homocysteine tail close to A52 (Figure 4E). In the crystal structure, two magnesium ions were found near the kink and the nucleobase of SAH (Figure 4B). The location of magnesium ions was not successfully predicted because both kink and SAH were incorrectly placed.
The best models according to manual scoring are R1288LG338_4, R1288LG272_1, and R1288LG091_1, although none reached more than 4 out of 10 points. Surprisingly, the rankings based on either lDDT or RMSD do not reflect the result of our assessment. Therefore, the top three candidates of each ranking (lDDT, RMSD, and manual evaluation) were chosen to perform molecular replacement in Phenix [52]. Unfortunately, but not unexpectedly, none of these models generated a correct solution. The best translation function Z‐score was 7, achieved by R1288LG338_4, which ranked high in our evaluation but was modest by lDDT and RMSD.
To summarize, the prediction of SAMURI structure in the product state with bound SAH was not successful, as reflected by the low lDDT score and medium to high RMSD values. Several models predicted the bottom triplet correctly and spotted two additional core base pairs accurately. However, none of the predicted models brought A52 and the ligand into an orientation that would reflect the experimentally determined alkylation site. In general, structure predictions are still very challenging for synthetic RNAs that do not have any natural references and for which no analogous structures have previously been reported. Thus, more experimental structures are still needed to enrich the training repertoire.
2.6. Dopamine‐Bound DNA Aptamer RKEC1 (CASP: D1273, PDB: 9HIO) Provided by Eric Largy, Philip E. Johnson, and Cameron D. Mackereth
Aptamers are oligonucleotides that have been selected to bind a desired ligand, and many have been created to recognize small molecules, primarily for use in biosensors. Despite the increasingly popular development of small molecule‐binding DNA aptamers, very little atomic information has been acquired on their ligand‐aptamer complexes. This lack of molecular details thus limits our knowledge of binding mechanisms and hinders aptamer optimization. Related to CASP target D1273, a dopamine‐binding aptamer was developed in the context of creating field‐effect transistor‐based sensors that rely upon ligand‐induced DNA conformational changes [53]. Truncation of the duplex formed by the 5′ and 3′ ends did not significantly perturb binding characteristics [54, 55], and to create RKEC1 we have removed an additional three nucleotides at the 5′ end to further reduce the aptamer size and improve affinity.
Using NMR spectroscopy methods, we found that the dopamine‐bound RKEC1 aptamer consists of a highly atypical DNA fold (Table 1; Figure 5A,B). The most notable feature of the DNA architecture is the absence of canonical duplex regions. In total, there are only two WC base pairs (C14:G19 and C16:G23, numbered as in the CASP target sequence), and these are not adjacent in the structure. Despite the lack of canonical base pairs, there remain 11 additional base pairs formed by RKEC1 (solid lines in Figure 5A). These fall into categories more often present in RNA structures such as reverse WC (T2:A25, T9:A17, and C10:G18), reverse wobble (T1:G24), and several homomeric pairs (G3:G26, A5:A25, G6:G24, G7:G23, G7:G24, G11:G19 and G11:G21). There are also six additional base pairs connected by a single hydrogen bond (dotted lines in Figure 5A). A final complexity in the structure arises from numerous places in which bases participate in two nucleotide planes, such that there are few instances of true base triples even though three DNA strands form several of the structural elements.
FIGURE 5.

Molecular details of dopamine‐bound RKEC1 aptamer (CASP: D1273, PDB: 9HIO). (A) Schematic of the dopamine‐bound RKEC1 aptamer. The only two WC base pairs are highlighted in gray. Additional base pairs with at least two hydrogen‐bonds are shown as solid lines between nucleotides. Dotted lines indicate bases joined by a single hydrogen bond. (B) Representative cartoon model for the structure closest to the ensemble average of 15 calculated structures. (C, D) Illustration of the interactions the aptamer makes with dopamine due to stacking (C) and via hydrogen bonds (D).
Given the unusual nature of the RKEC1 DNA fold, it presents a difficult target. Predictions for the DNA in the complex included 107 models submitted to CASP from 22 different groups, of which 6 were obtained from servers. Despite the small size of the DNA at 27 nucleotides, none of the predicted models had an RMSD less than 10 Å, with model RMSD ranging from 10.16 to 27.79 Å. In practice, however, the significant deviation of the predicted models from the experimental models meant that there were no meaningful stretches of structural similarity. Nevertheless, we were able to identify some broadly correct elements in the predictions, such as triple‐helical segments in some models and instances of bases that are shared between planes. When inspected in detail, however, the strand polarities differ from the experimental structure and bring together different bases. We had expected the atypical base‐pairing to be challenging, but approximately 70% of the 1254 base pairs in all models are formed by non‐canonical base pair arrangements (i.e., not WC). It is therefore possible that problems related to accurate secondary structure prediction were more inhibitory than the diversity of base pairing in RKEC1.
As there was dopamine present in the complex, ligand predictions were also submitted. In this case, there were 8 groups that provided a total of 34 models. In the experimental structure, the dopamine ligand is positioned such that it stacks between bases T22 and G23 (Figure 5C). Hydrogen bonds connect the dopamine hydroxyls to T8 and T9, and the amine group by sugar and base atoms from A17 and G18 near the phosphate backbone (Figure 5D). In the ligand predictions, the previously mentioned high deviation from the experimental DNA conformation resulted in predictions that were too distant from the experimental binding site to enable meaningful binding pose comparisons. In the absence of strict similarity, we decided to look for common types of contacts in the predictions, and we noted some success in modeling the stacking interaction between dopamine and bases, but poor prediction of any hydrogen bonding. For example, the five models from LG055 capture the correct idea of dopamine stacking between bases (but now placed between A4 and G24) and position the dopamine amino group close to the phosphate backbone (between G3 and A4). In LG262, the three models also show a stacked dopamine between two bases (C10 and G21) but no other significant contacts. There are also several models with dopamine only stacked on one side (one model in LG262, four models in LG338, and four models in LG294) and interestingly, dopamine is also stacked on one side even with single‐stranded DNA (LG464).
Given the unusual DNA fold in the dopamine‐RKEC1 complex, we anticipated that CASP target D1273 would be a challenging case. In terms of DNA structures in the PDB, there is an overrepresentation of duplex structures. The only other major contribution of DNA structures in the PDB is G‐quadruplexes. To date, there are only a small number of structures with higher DNA complexity, notably within some of the previously determined DNA aptamer structures. This deficiency in the availability of diverse folds likely plays a role in the lack of predictions with significant similarity to the experimental structure. It is also the case that for D1273 the dopamine ligand serves an integral part of the DNA fold: stacking between two bases and participating in hydrogen bonds. Prediction of just the DNA might therefore have also been difficult in the absence of the ligand.
2.7. Oceanobacillus iheyensis Group IIC Intron (CASP: R1241) Provided by Shekhar Jadhav, Michela Nigro, and Marco Marcia
Group II introns are bacterial and organellar self‐splicing ribozymes that constitute the evolutionary ancestors of the eukaryotic spliceosomal small nuclear RNAs. Group II introns self‐excise from precursor mRNA transcripts in a two‐step reaction that is identical in terms of molecular mechanism and chemistry to the reaction catalyzed by the spliceosome [56]. In the first step of splicing, group II introns cleave their 5′‐exon either through a hydrolytic or a transesterification mechanism, and in the second step of splicing, the excised 5′‐exon ligates to the 3′‐exon, producing a mature mRNA and an exon‐free intron [57, 58]. The free intron is still enzymatically active and can retrotranspose into genomic DNA target sites, with important biotechnological and medical implications in gene editing. To execute such complex reactions, group II introns assemble their six structural domains (D1 to D6, Figure 6A) into a well‐ordered tertiary structure that harbors an RNA‐based active site adjuvated by a metal‐ion cluster cofactor in its core (Figure 6B) [56]. This active site can be specifically and selectively targeted by small molecules with potential applications as antifungals [61]. Resolving the molecular architecture of group II introns accurately at near‐atomic resolution and elucidating their precise mechanism of action is therefore crucial to understanding the fundamental reaction of splicing, to support the design of antifungal drugs, and to guide the engineering of novel RNA nanomachines for gene editing.
FIGURE 6.

O. iheyensis group II intron structural analysis (CASP: R1241). (A) Secondary structure of O. iheyensis group IIC intron. D1‐D6 indicate structural domains 1 to 6, respectively. The previously unresolved regions in D2 and D6 are colored in blue and Indian red, respectively. The tertiary contacts are indicated with Greek letters. (B) Experimental tertiary structure of O. iheyensis group IIC intron determined in this work. The newly resolved η‐η′ and π‐π′ interactions between D6 and D2 are indicated by Greek letters. (C) Superposition of our experimental model (gray) over the R1241TS338_1 prediction (orange, representative of the 4 structures with accurate fold and D2–D6 contacts). The inset at the bottom displays a zoom into the D2‐D6 interaction. (D) Superposition of our experimental model (gray) over the R1241TS006_4 prediction (green, representative of the 101 structures with accurate fold but inaccurate D2–D6 contacts). The inset displays a zoom into the D2–D6 interaction. (E) Superposition of our experimental model (gray) over the R1241TS169_1 prediction (blue, representative of the 75 inaccurate predictions) All images of RNA 3D structures and secondary structure map were generated in the ChimeraX and VARNA, respectively [59, 60].
So far, experimental structures are available for domains D1–D5 of the so‐called group IIC intron from the bacterium Oceanobacillus iheyensis , which splices through the hydrolytic mechanism, and for D1–D6 of homologous group IIA and IIB introns from other bacteria or algal chloroplasts, which splice through the transesterification mechanism [56]. The structure of D6 from the O. iheyensis group IIC intron is instead still undetermined. As a result, it remains unclear how D6—which harbors key functional elements such as a branch‐point adenosine and the 3′‐splice junction—interacts with the rest of the ribozyme in introns that splice through the hydrolytic mechanism. We determined the D1–D6 structure of the O. iheyensis group IIC intron by cryo‐EM in the resolution range between 3.3–4.1 Å (3.8 Å average resolution, Figure 6B), and provided its sequence as a target for the CASP16 competition.
Thirty‐six groups submitted a total of 180 predictions for R1241. Here, we discuss the predicted models in comparison to our experimental cryo‐EM structure with respect to the following parameters: (i) accuracy in the prediction of the secondary structure; (ii) accuracy in the prediction of the overall tertiary fold, as judged by the all‐atom root mean square deviation (RMSD) between predicted and experimental models; and (iii) accuracy in the prediction of specific tertiary contacts, with a focus on the previously undetermined contacts between D6 and the rest of the molecule (known from biochemical characterization as the η‐η′ and π‐π′ interactions between D6 and D2, Figure 6B) [56].
Remarkably, four predictions display accurate models in terms of secondary and tertiary structural organization and of D2‐D6 interactions (Figure 6C). These predictions have an RMSD of 2.5–3.2 Å with respect to the experimental model. Among these models, three were predicted by the Vfold group and one was predicted by the GeneSilico group. Vfold uses the Protein Data Bank (PDB) server‐based structure templates to produce an initial ab initio model, which is then refined by all atom energy minimization [62]. Its success on our target can thus possibly be explained by the existence of 12 group IIA/B intron structures homologous to our target in the PDB (41%–56% sequence identity) and displaying the D2‐D6 interaction. Compared to Vfold, GeneSilico uses coarse‐grained RNA representations and Monte Carlo sampling to explore the RNA conformational space and derive the most energetically favorable model [63]. This approach seems to be the most unbiased and accurate on our target.
One hundred one further predictions display an accurate secondary structure and overall tertiary fold, with RMSD 5.4–18.8 Å with respect to the experimental structure (Figure 6D). However, these predictions fail to accurately visualize the novel D6 η‐η′ and π‐π′ interactions. Among these structures, 21 structures display no interaction at all between D6 and the rest of the ribozyme, and 80 structures display tertiary interactions between D2 and D6 different from the experimentally validated η‐η′ and π‐π′ (Figure 6D). A characteristic feature of the predictions that fail to reproduce the experimental D2‐D6 interactions is that they model inaccurately the characteristic 3‐way junction of D2.
Finally, 75 predictions display inaccurate tertiary structure models, with RMSD values of 8.4–106.4 Å with respect to the experimental model (Figure 6E). Most of these predictions crucially fail in correctly reproducing the experimental secondary structure of the target. Alternatively, predictions that reproduce the correct secondary structure fail in modeling the characteristic five‐way junction of D1, which is the folding scaffold that dictates the orientation of all peripheral elements with respect to the core [64].
In summary, our analysis reveals that key factors of success in RNA structure modeling are the accurate predictions of RNA secondary structure motifs (stem loops), multi‐way junctions, and long‐range tertiary contacts. A current limitation is that prediction algorithms do not currently predict metal and ligand binding sites. This limitation is particularly severe for our target, which harbors a functional metal cluster in the active site and additionally requires tens of rigidly bound ions for structural stability [65]. We suggest that future generations of RNA structure modeling algorithms include ion cofactors as intrinsic tertiary structure elements of the targets to improve RNA structure predictions [66].
2.8. Rumen‐Originating, Ornate, Large (ROOL) RNA Hexamer (CASP: R1286, PDB: 9J6Y) and Octamer (CASP: R1283v1‐v3, PDB: 9ISV, 9J3R and 9J3T) Provided by Liu Wang, Jiahao Xie, and Zhaoming Su
The Rumen‐Originating, Ornate, Large (ROOL) RNA, one of the largest and most structurally complex non‐coding RNAs, was first identified through comparative analysis of cow rumen metagenome data [67]. Another independent study revealed its presence in the megaplasmid of Lactobacillus salivarius (Lsa), where its abundance can exceed that of 16S ribosomal RNA under specific conditions [68]. ROOL RNA frequently localizes in prophage regions, purified phage particles, non‐prophage regions, and tRNA loci [67]. The biological functions of ROOL RNAs remain unclear. Recent studies have resolved the cryo‐EM structures of ROOL RNAs from Enterococcus faecalis (Efa) as 3.1 Å monomer (R1283v1), 4.7 Å tetramer (R1283v2), and 3.8 Å octamer (R1283v3), from Lsa as 3.1 Å hexamer (R1286), and from environmental bacteria (env‐120) as 3.1 Å octamer (Table 1, Figure 7A,B) [69, 70]. Another two ROOL RNA sequences from Lsa forming a hexamer (CASP: R1252) and environmental bacteria (env‐209) forming octamers (CASP: R1253) with two alternative conformations have also been submitted for CASP16 [70]. Except for the two Lsa sequences, the other sequences share less than 50% similarity. However, they all adopt a conserved architecture comprising 16 base‐paired regions (denoted P1–P16, Figure 7C), five multiway junctions, and intricate interaction networks involving base‐pairings, minor groove interactions, and magnesium ion coordination.
FIGURE 7.

Evaluation of the ROOL RNA structure predictions (CASP: R1286 and R1283v1‐v3, PDB: 9J6Y, 9ISV, 9J3R, and 9J3T). (A‐B) Cryo‐EM structure of Efa ROOL octamer (R1283) and Lsa ROOL hexamer (R1286). (C) Secondary structure of Efa ROOL monomer, with flexible regions P9‐P12, P13, and P15 in transparency. Intramolecular interactions are highlighted by lines and boxes. The unresolved region L13 is annotated with a dotted line. (D) Average RMSD and lDDT of ROOL monomers, of R0252, R0253, R1283v1, and R1286. (E) Representative predicted model of Lsa ROOL. (F) Cryo‐EM structure of Efa ROOL monomer with regions resolved only in oligomeric states annotated in transparency. (G) Predicted model of Efa ROOL by AF3_server with consistent secondary structure. (H–J) Comparison of the correctly predicted multiway junctions of (H) 5WJ1, (I) 4WJ, and (J) 3WJ1 in R1283v1TS304_3 and the cryo‐EM structure. (K–M) Superposition of the correctly predicted intramolecular interactions of (K) P2‐L5 TL/TLR, (L) L13‐P15 PK, (M) L6‐P8 minor‐groove interaction in R1283v1TS304_3 and the cryo‐EM structure. (N‐O) Comparison of the inaccurately predicted multiway junctions of (N) 3WJ2 and (O) 5WJ2 in R1283v1TS304_3 and the cryo‐EM structure. (P) Superposition of the inaccurately predicted J9/12‐L13 PK and that in the cryo‐EM structure. (Q‐T) Predicted homo‐oligomers of ROOL RNAs forming (Q) ring‐like structure (R1283v3TS272_3o), (R) bundle‐like structure (R1283v3TS272_1o), (S) RNA origami‐like structure (R1252TS294_1o) and (T) unusual asymmetric structure (R1252TS304_4o).
Predictions for the four ROOL RNA monomer structures are generally inaccurate, as evidenced by average RMSDs worse than 40 Å and average lDDTs below 0.5 (Figure 7D). Although comparative genomics analysis has shown a highly conserved secondary structure [67], the majority of predictions have not incorporated sequence conservation and covariation into their predicted 3D structures, and therefore are not discussed further (an example in Figure 7E). The high flexibility of ROOL RNA monomer could be one reason for the poor prediction results, as demonstrated by the missing density in the Efa ROOL monomer structure that is only resolved in oligomeric states (Figure 7F). While none of the prediction results resemble the experimental structures in R0252, R0253, and R1286, an interesting result was observed for the R1283v1 monomer prediction, in which the AlphaFold 3 model R1283v1TS304_3 submitted by the AF3‐server exhibits a highly accurate secondary structure and a reasonable overall fold compared to R1283v1 (GDT‐TS = 0.15, TM‐score = 0.365, lDDT = 0.619) (Figure 7G).
This AF3‐server predicted model consists of P1–P16 and contains all five multiway junctions in line with the conserved secondary structure and contains most tertiary interactions as observed in the cryo‐EM structure (Figure 7H–P). The first five‐way junction (5WJ1) consisting of P1–P3, P7, and P16, and the four‐way junction (4WJ) consisting of P3–P6, closely resemble the cryo‐EM structure (Figure 7H,I), whereas the predicted P12 in the first 3WJ (3WJ1) containing P8, P9, and P12, is translocated relative to the cryo‐EM structure (Figure 7J). Intriguingly, the AF3‐server result accurately predicts two intramolecular tertiary interactions, P2‐L5 tetraloop/tetraloop receptor and L13‐P15 pseudoknot (PK) (Figure 7K,L) and closely resembles the L6‐P8 minor‐groove interaction present in the cryo‐EM structure (Figure 7M). The predicted 3WJ2 and 5WJ2 do not agree with the cryo‐EM structure; the prediction swapped P10 and P11 in 3WJ2 (Figure 7N), swapped P13 and P15 in 5WJ2, and incorrectly predicted P13 base pairs (Figure 7O). The last intramolecular PK L13‐P15 is not predicted (Figure 7P). The AF3‐server likely achieved this reasonable prediction of such a complex RNA structure by taking the sequence conservation and covariation into account during structure prediction. Perhaps, the AF3‐server prediction relied on models that were sampled using automatically generated multiple sequence alignments in which more sequences were included for R1283 resulting in a more complete covariation analysis than other ROOL sequences [3]. All other predictions of Efa and other ROOL RNAs did not yield similar structures.
Homo‐oligomeric protein structures have been extensively characterized. In contrast, there are far fewer examples of oligomeric RNA structures, and RNA homodimers are most commonly reported [71, 72, 73]. In fact, there are only four homo‐multimer structures from two RNA sequences in the PDB consisting of more than two protomers, including the φ29 prohead RNA forming either four‐ or five‐membered ring architectures, and the Chili RNA aptamer tetramer [74, 75, 76]. This scarcity of RNA oligomer structural information has compromised the prediction accuracy of RNA oligomers in CASP16. We speculate that most oligomer predictions rely on existing RNA and protein homo‐oligomer templates in the ROOL hexamer or octamer prediction tasks (R1252o, R1253v1, R1253v2, and R1283v3o). These predictions include ring‐like structures resembling the φ29 prohead RNA or pore‐forming protein oligomers, bundle‐like structures that resemble membrane protein bundles, and some synthetic RNA origami (Figure 7Q–S). While most predictions for ROOL oligomers are approximately symmetrical, we also observe unusual cases of asymmetric oligomeric assembly (Figure 7T).
2.9. Ornate Large Extremophilic (OLE) RNA Dimer (CASP: R1285o, PDB: 9MCW) Provided by Rachael C. Kretsch, Yuan Wu, Rhiju Das, Wah Chiu
The Ornate Large Extremophilic (OLE) RNA family was identified as a large, non‐coding, highly structured RNA family in gram‐positive bacteria in extreme environments by covariation analysis [77]. Prior to CASP16, OLE RNA was known to form an RNA‐protein complex which localizes to the cell membrane [78] and was implicated in many environmental sensing pathways [79]. However, the biochemical mechanism of action remains unknown [80].
The 5′ domain of OLE RNA from Clostridium acetobutylicum forms a well‐ordered RNA‐only dimer which resolves to 2.9 Å using single particle cryo‐EM [70] (Table 1, Figure 8A). Each monomer has five helices (P4‐P8) which lay parallel to each other such that the loops of the helices point into the dimer interface. These loops form two dimer interfaces labeled B1 and B2. The final helix (P9) crosses this stack perpendicularly, eventually pointing into the dimer interface and forming the final dimer interface, B3. Two independently solved cryo‐EM structures of the OLE RNA dimer agree closely; supporting the use of this molecule as a target for CASP16 prediction [69, 70]. The stoichiometry was given as a dimer to predictors.
FIGURE 8.

Predictions of the OLE RNA dimer (CASP: R1285o, PDB: 9MCW). (A) The cryo‐EM structure of the OLE RNA dimer, PDB: 9MCW, is displayed in ribbon format. The two protein binding sites and three dimer interfaces are labeled and the nucleotides involved are displayed. (B–G) Models of interest are displayed. In (D–G) the model is displayed in color and overlaid on cryo‐EM structure PDB 9MCW in gray. Only PDB 9MCW is displayed in (E) because no group predicted the interface.
No homologous structures were known, posing a challenge for prediction. However, there is a large multiple sequence alignment and biochemical information in the literature. According to the CASP16 metrics, R1285TS028_4o from the NKRNA‐s server had the most accurate fold (GDT‐TS = 23.2, TM‐score = 0.682, lDDT = 0.572); R1285TS189_4o from LCBio had the most accurate dimer interface (ICS = 0.444, IPS = 0.532, ilDDT = 0.297) (Figure 8B,C). Although these models predicted the global fold, the predictions were inaccurate in important structural and functional regions: dimer interfaces and protein binding domains.
The majority of models predicted no interface correctly (as assessed by interactions labeled in the structure by rna_motif [81]); at most, one of the three dimer interfaces was predicted correctly in each model. Interface B1, a set of four non‐canonical A‐A base pairs between the loops of P4 of both chains, proved the most challenging interface for predictors. Although some models predicted these loops in close proximity (Figure 8B,C), none predicted the intermolecular base pairing between the loops (Figure 8D). The second interface, B2, is a set of base‐pairing and stacking interactions between the loops of P5 and P7 of one chain and the loop of P6 of the other chain. Model R1285TS286_4o from CSSB_experimental modeled the interaction between the loops of P5 and P6 as the only intermolecular interactions, but it did not predict the additional involvement of the loop of P7 (Figure 8E). Twenty models predicted B3 correctly. This interaction is the simplest interface made of a kissing loop with four G‐C base pairs (this region was low resolution and hence not assessed in the official CASP16 assessment). Only six models predicted a B2‐like interface, and none of these models predicted completely accurate base interactions. The dimer interface could play an important role in scaffolding the OLE RNA‐protein complex or as a switch in the OLE RNPs response to external stimuli. Therefore, it is important to improve the accuracy of RNA–RNA interface predictions to foster a better understanding of these biological systems.
Within each monomer, there are two protein binding sites that appear preorganized to expose the known protein binding sequences, but in both sites, the RNA secondary structure differs from that hypothesized in the literature [82]. First, RpsU is known to bind a CAA motif in P6 that was previously thought to be partially occluded in a stem. Contrary to this, the cryo‐EM structure shows that this motif is solvent exposed, potentially allowing for protein docking without RNA structural rearrangement. The majority (61%) of models incorrectly predict at least two of these three nucleotides are paired; although a few models did predict this site relatively accurately (Figure 8F). Finally, OapC was known to bind a kink turn formed immediately upstream P5. It was previously hypothesized that this region base‐paired with the region immediately downstream of P5; however, the cryo‐EM structure shows that three G‐A non‐canonical base‐pairs were formed with the region between P6 and P7 instead. 75% of the models predicted the correct three G‐A non‐canonical base‐pairs, with many additionally obtaining excellent accuracy for the kink turn (Figure 8G). This observation suggests that predictors were able to use the sequence or evolutionary data to deduce the residues involved in the kink turn.
The analysis of OLE RNA predictions was encouraging, as the global fold was successfully predicted despite the absence of homologous RNA; however, critical interactions were still inaccurately predicted. In particular, predictors were not able to predict the dimer interface, modeling at most one of the three interfaces. In fact, many of the models predicted only one or no RNA–RNA interaction between the chains, underpredicting the complexity of the interface. It was also evident that predictors could only successfully predict the simplest type of interface, canonical base‐pairing, with few or no correct interactions predicted for the more complex interfaces. Hence, structure predictors should aim to improve RNA–RNA interface prediction, which would have implications for understanding the broader OLE family of RNAs and aid in the identification of other RNA families that form similarly complex interfaces.
2.10. RNA Origami Dimer Structure and Protein‐Binding Aptamer With 2′‐F Pyrimidine Modifications (CASP: R1281o and M1282) Provided by Emil L. Kristoffersen, Nikolaj H. Zwergius, and Ebbe S. Andersen
Production of nucleic acids using chemically modified building blocks has proven essential for medicine and biotechnology. Indeed, modified nucleic acids, including natural as well as unnatural modifications of the canonical nucleotides [83], are used in mRNA vaccines [84], RNAi [85] and gene editing [86]. For these reasons, it is becoming increasingly important to predict the structure and effect of modified nucleic acids.
2′‐F modified RNA is a widely used modification that greatly improves polymer properties including resistance to nuclease degradation and reduced recognition by the innate immune system. While the highly electronegative 2′‐F modification increases base pairing and stacking stability, it may also alter the interactions and the structures that can be formed [87]. A hybrid of 2′‐F pyrimidines (FY) and natural purines (here termed FY RNA) is often used, since it increases the stability of canonical basepairs evenly and therefore is not expected to significantly change the folding landscape. However, this assumption only applies to canonical basepairs and not to tertiary motifs. FY RNA has greatly increased serum stability and can be efficiently produced using in vitro transcription with a mutant RNA polymerase that can incorporate 2′‐F nucleotides [88]. However, as we shall see, even FY RNA alters folding relative to unmodified RNA.
We have produced and investigated FY RNA structures by in vitro transcription and cryogenic electron microscopy (cryo‐EM). For CASP16 we submitted the sequences of (1) a FY RNA origami design (R1281) which was noted to have A2‐stoichiometry and (2) a FY RNA protein‐binding aptamer and its binding partner (the RBD domain of the SARS‐CoV2 spike protein) (M1282). Both targets had been investigated with cryo‐EM revealing distinct 3D structures for the modified FY RNA and the unmodified RNA. Our structural analysis allowed determination of the strand‐path and general folding for the origami (GSFSC resolution ~10 Å) and precise structural determination of the aptamer‐protein complex (GSFSC resolution ~3 Å).
Importantly, the CASP target submission platform uses FASTA format and does not allow specification of modified nucleotides or amino acids. Therefore, we noted in “additional information” that the RNA sequence had 2′‐F modified pyrimidines. But in the end, all predictions were submitted as unmodified RNA.
2.11. FY RNA Origami Dimer (R1281o, Stoichiometry A2)
We analyzed a 6‐helix origami structure of FY RNA. The unmodified RNA sequence was previously investigated [89]; was a target in CASP15; and is in the protein data bank (PDB: 7PTK (young) and 7PTL (mature)). It was originally designed to form a bundle‐like structure, with two RNA helices closing the bundle by forming a “clasp” held together by kissing loop interactions. Interestingly, the RNA origami had a slow maturation process, going from a metastable young conformation to a later‐forming mature conformation [89]. Surprisingly, we found that the FY RNA version of the origami dimerized. Despite relatively poor resolution (~10 Å) we were able to confidently trace the backbone and build a general structure of the dimeric origami (Figure 9A). The model shows that the molecule rearranges so that two helices in each monomer form two intermolecular kissing loops instead of the intramolecular kissing loop observed in the RNA version.
FIGURE 9.

Prediction of FY RNA dimeric origami and FY RNA anti‐spike aptamer (CASP: R1281o and M1282). (A) Experimental determined structure of the dimerized FY RNA 6 helix bundle with a clasp origami. (B) Examples of predictions classified into class I to IV based on manual assessment. (C) Experimental determined structure of the FY RNA aptamer (green) binding to the RBD domain (blue cartoon with transparent surface representation of electrostatics). (D) Left: Examples of predictions where the aptamer binds different regions of the RBD domain (aligned according to the RBD domain). Right: The aptamers alone.
CASP16 predictors provided 95 predictions of this homodimer. We accessed all predictions and classified them into four classes based on manual assessment (I‐IV) (Figure 9B). Class I (18%) consisted of models similar to the published RNA origami bundle (resembling the young structure) but where the clasp formed an intermolecular kissing loop. At first glance, Class I (exemplified by TS183_2o) looked promising; however, a closer look revealed that the intermolecular kissing loops were not correct. Class II (17%, exemplified by TS448_3o) consisted of predictions where two bundle‐like structures (resembling the mature RNA bundle) were positioned next to each other in different ways, without forming any base pairing interactions between the monomers. Class III (7%, exemplified by TS481_1o) consisted of a split‐open origami that dimerizes with all its internal kissing loops. Such a structure might be interesting to design in the future but did not resemble the experimental structure. Class IV (exemplified by TS286_5o) was the most populated class (57%) representing somewhat collapsed structures looking very different from the experimental origami structures.
In summary, none of the predictions captured the rearrangement upon FY modification of the RNA origami that caused it to dimerize. Importantly, the predictions were submitted as unmodified RNA. However, only the FY RNA version of the bundle dimerizes. Indeed, the ability to specify modifications and use them in the predictions will be crucial for future prediction competitions, since modifications are widespread in natural RNAs and will be increasingly used in RNA medicine and nanotechnology.
2.12. FY RNA Aptamer—Protein Complex (M1282)
A FY RNA aptamer binding to the SARS‐CoV 2 Spike protein had previously been reported [90]; but the aptamer structure and precise binding interactions were unknown. We investigated its structure with cryo‐EM, revealing the structural basis of how the aptamer binds to the Receptor Binding Domain (RBD) of the Spike protein (Figure 9C). Importantly, an (unmodified) RNA version of the aptamer does not bind the Spike protein at all [90].
CASP16 predictors submitted 31 predictions of the aptamer–RBD complex. We first assessed all predictions in terms of binding location (Figure 9D, left) and then the aptamer structure (Figure 9D, right). 56% of the predicted aptamers bound to the positively charged “back” of the RBD domain, 8% bound the “front” of the RBD domain and 36% of the aptamers were predicted to bind to the “top” of the RBD domain, (see Figure 9C, for position of back, front and top). The top position is the region where the aptamer binds according to the experimental data. However, despite locating the correct regions of the RBD, these predictions did not identify the specific interactions between the nucleic acid molecule and the protein. Next, we aligned all the aptamers (discarding the RBD domain from the models) (Figure 9D, right). This alignment revealed that most (but not all see eg. TS262_1) predictions identified the correct canonical base pairing pattern but did not predict the precise 3D conformations.
In summary, the canonical base pairing patterns of the aptamer were often predicted correctly; however, important tertiary interactions were overlooked. Due to the missed tertiary interactions, and/or other factors, the correct binding pose was not predicted well. This was the case even when the aptamer was docked at the correct area of the RBD. Again, the predictions were done for unmodified RNA (not FY RNA), emphasizing the importance of predicting chemically modified biomolecules in the future.
2.13. Cryo‐EM Structure of SPβ LSI‐RDF Synaptic Complex From the Bacillus Phage SPβ (CASP: M1239v1 and M1239v2, PDB: 9DXD (Pre‐Rotation State)). Provided by Heewhan Shin and Phoebe A. Rice
Large serine integrase (LSI) and recombination directionality factor (RDF) pairs are encoded by many temperate phages. Upon infection, the LSI catalyzes the unidirectional integration of the phage DNA into that of its bacterial host, where it can passively “hitchhike” for many generations. When the phage needs to escape that host and find a new one, it expresses the cognate RDF protein that binds to the integrase, altering its preferred reaction direction and triggering excision of phage DNA out of the host genome.
LSI/RDF pairs are of interest as fundamentally intriguing molecular machines and as biotechnology tools. In addition to high site‐specificity and unidirectionality in DNA recombination, LSI/RDF systems display several advantages over other commonly used genome editing tools [91, 92]. For example, LSIs require much shorter DNA binding sites than integrases from the tyrosine recombinase family. Compared to CRISPR‐RNA guided systems, not only are the LSIs intrinsically designed to operate multiplexed insertion/deletion/inversion of kilobase‐sized DNA payloads, but also no DNA repair is needed after the recombination reaction because all broken bonds are relegated.
LSIs have a modular structure with an N‐terminal catalytic domain followed by two DNA binding domains, with a coiled coil (CC) inserted into the 2nd DNA binding domain (DBD2) via flexible hinges. They bind as dimers to their cognate DNA sites, but recombination requires formation of a tetrameric complex that synapses the two substrate DNAs. The substrate DNA sites for integration are termed attB (~40 bp; from the bacterial host genome) and attP (~50 bp; from phage DNA) [91, 93, 94]. The products of integration are two sites (“attL” and “attR”) that each contain one half of attB and one half of attP. attL and attR are the substrates for the excision reaction, which regenerates attB and attP. The critical difference among these sites, which dictates whether inactive dimers or active tetramers can be formed, is the placement of the DBD2‐binding motif, which is 4–5 bp further from the center in attP sites vs. attB sites. While the catalytic domain is central to both dimers and tetramers, additional interactions between the tips of the CCs, which move with DBD2, form “handshake” interactions that stabilize active tetramers before recombination but also lock product dimers to prevent reverse reactions.
We were particularly focused on understanding the mechanistic basis behind how LSIs and their cognate RDFs control the directionality of DNA recombination. Previous biochemical and structural studies described the importance of the CC motif and the functional difference between attB and attP sites [95]; yet there were no available structures of intact LSI‐DNA synaptic complexes, nor were there models showing how the RDF controls the CC's trajectory. Our recent series of structures fill these gaps [96].
We thought that the synaptic complexes would be interesting CASP targets due to their conformational complexity (Figure 10A, left). First, while the four LSI molecules are chemically identical, their relative conformational arrangements differ based on which DNA half sites (attB‐ vs. attP‐derived) they are bound to, creating structural asymmetry; second, this asymmetry positions the tips of the CC‐motifs to make tetramer‐stabilizing inter‐dimer interactions (only between subunits bound to different types of half‐sites); third, we captured the complex in the recombination intermediate state where each catalytic Ser22 residue is covalently linked to the 5′ end of the phosphate at the cleavage site. Finally, the catalytic domains can switch between inactive dimer and active tetramer conformations. Here we analyze the CASP predictions specifically for the complex of phage SPβ integrase, with its cognate RDF, synapsing two copies of attL—essentially, the complex poised to undertake the excision reaction.
FIGURE 10.

Experimental versus predicted structure of SPβ LSI‐RDF‐DNA complex versus predicted structure (CASP: M1239v1, PDB: 9DXD). (A) Cryo‐EM structure of target M1239, the SPβ LSI‐RDF synaptic complex in the pre‐rotation state, (left) showing four distinct domains: An N‐terminal catalytic domain (CAT, green), DNA binding domains 1 and 2 (DBD1 and DBD2, blue and yellow respectively), a coiled‐coil motif subdomain (CC, orange), and a recombination directionality factor (RDF, cyan). The best CASP prediction model (M1239v1TS294_1, right) is shown on the right. Superimposed CC dimers from the experimental structure (dark orange) and the predicted structure (light orange) are aligned according to the CC‐motif (residues 453–464), indicated by the asterisks (middle). (B) A superimposed half‐complex of the experimental and predicted structures highlighting atomic distance (≥ 7Å) between the protein frameworks (Cα) marked by red arrows, from the experimental to the predicted structure (transparency = 0.5).
The overall scaffold of the synaptic complex consists of two attL‐bound, chemically identical LSI‐RDF dimers largely stabilized via hydrophobic interfaces among the N‐terminal catalytic domains. In the center of this tetramer is a flat and hydrophobic platform where subunit rotation between left and right halves of the synaptic complex takes place during DNA recombination. Overall, this central tetrameric scaffold is very similar to that seen in 2005 for an otherwise quite different “small” serine recombinase [97]. In comparing 41 predicted models (M1239 category) to the SPβ LSI‐RDF synaptic complex, 35 out of 41 entries predicted the overall tetrameric scaffold similar to the reference structure.
Overall, some but not all CASP models successfully predicted that there would be two different spacings of the two C‐terminal DNA binding domains along the DNA. Most if not all successfully docked the DNA binding motifs with respect to the minor or major grooves of the DNA (Figure 10A right). However, the placement of the individual DNA binding domains did not correlate with the DNA sequence.
Nine out of the 35 entries with tetrameric cores were predicted to have CC‐motifs engaged in a handshake interaction, and these entries are divided into two distinct modes: intra‐ and inter‐dimer interactions (inter‐ is the correct configuration for this particular complex). Specifically, four entries (M1239v1TS033_1, M1239v1TS091_1, M1239v1TS110_1, and M1239v1TS167_1) adopted an intra‐dimer handshake between CC‐motifs. This interaction may be relevant to a post‐DNA strand exchange conformation but is incorrect for this intermediate‐state tetramer. Five entries (M1239v1TS231_1, M1239v1TS262_1, M1239v2TS262_1, M1239v1TS294_1, and M1239v2TS489_1) showed inter‐dimer handshake interactions stabilizing synapsis between two DNA substrates, with the latter four of those resembling the overall scaffold seen experimentally, and with the entry from the Kihara lab (Figure 10A right) being the closest. However, none of the models matched the experimentally seen binding angle between the two CC‐motifs (Figure 10A), perhaps because they also failed to bend the attP half sites towards the core tetramer rather than away, as is seen in the attB half sites and in previously determined structures of distantly related small serine recombinases. The predicted attP bend may, however, reflect a pre‐ or post‐cleavage state that is yet to be experimentally resolved. Finally, none of the entries correctly modeled all four copies of the phosphoserine linkage between Ser22 and the 5′ phosphate of the DNA.
In conclusion, numerous entries successfully demonstrated that predicting protein structural asymmetry from chemically identical molecules within a synaptic complex was possible; some of the models were, overall, remarkably similar to the experimental structure. However, sequence specificity in protein‐DNA binding as well as atomic details for modeling the bending, compression, or stretching of DNA under tension—particularly generated from the covalently attached protein‐DNA complex and from constraints in the CC‐motif interactions—were significantly different from the experimental structure.
2.14. Cryptic DNA‐Binding Protein UDE From Cerapachys biroi in Complex With DNA (CASP: M1276) Provided by Reinhard Albrecht, Yimin Hu and Marcus D. Hartmann
Due to a long‐standing interest in uracil‐binding proteins [98], several years ago, our lab began investigating UDE, a protein implicated in the development of pupating insects (holometabola) in the late larval stages as a uracil‐DNA degrading factor. First identified in Drosophila, this protein is primarily found in holometabola [99], but also in a range of plant‐pathogenic fungi. Canonical UDE is an all‐helical protein with either two or three domains, comprising one or two copies of an N‐terminal three‐helix bundle domain and a C‐terminal domain consisting of six helices. In several species, it is also found as part of larger multi‐domain proteins. Over the years, we have solved crystal structures of several two‐domain UDE constructs from different species, including the yellow fever mosquito Aedes aegypti (AaUDE; AAEL003864), which we had already entered as a target for CASP12. At that time, UDE constituted a new protein fold with no homologous structures available, and none of the participating groups was able to predict the entire UDE structure [100]. Today, however, very accurate UDE models are available in the AlphaFold database for many species, even though we have not yet released our experimental UDE structures.
After having solved the UDE structure, the most interesting aspect remaining was its DNA binding mode. To this end, we confirmed single‐ (ss) and double‐stranded (ds) DNA binding activity for all constructs we could obtain for several species, as long as these constructs included at least one of the two possible N‐terminal three‐helix bundle domains. We found that this interaction was seemingly independent of the DNA sequence; for some of the constructs, we were also able to obtain co‐crystal structures with DNA fragments. Here, the highest resolution structure was obtained for a two‐domain UDE construct from Cerapachys biroi (CbUDE, A0A026WM60), the clonal raider ant, in complex with a ssDNA fragment, which we submitted as a hybrid prediction target in CASP16.
As anticipated from the binding experiments, the crystal structure revealed that the binding is mediated via the DNA backbone, without prominent interactions between the bases and the protein, and with an interface that mostly involves the N‐terminal domain. Of the 33 nucleotides of the ssDNA fragment used in the crystallization trial, only a stretch of about 8 nucleotides is well resolved in the electron density. In the 5′ to 3′ direction, the first four of them are bound along the interface between the N‐ and C‐terminal domains in a somewhat linear fashion, while the remaining four nucleotides are bound along the tip of the N‐terminal domain (Figure 11A). In both segments, the bases protrude with a spacing resembling that in dsDNA. However, most interestingly, the two segments are separated by the intercalation of an arginine side chain located in the middle of the second helix of the N‐terminal domain. This intercalation leads to a kink in the overall DNA trajectory and the geometry of the phosphodiester bond between the two segments. Although the functional and mechanistic details of UDE still remain to be uncovered, we expect this conserved arginine and its intercalation mode to play a key role.
FIGURE 11.

The crystal structure of CbUDE in complex with ssDNA compared with the best first models submitted by the CASP16 predictors (CASP: M1276). (A) Crystal structure of CbUDE bound to ssDNA. (B) Superposition of the crystal structure and the top eight first models as described in the text, including (C) Bhattacharya (TS369_1), (D) AF3‐server (TS304_1), (E) Huang‐HUST (TS091_1), (F) MIEnsembles‐Server (TS110_1), and (G) MULTICOM_human (TS345_1), as well as MULTICOM (TS051_1), NKRNA‐s (TS028_1), and Zheng (TS462_1).
While most of the groups participating in the hybrid prediction category in CASP16 did a good job of predicting the CbUDE protein structure itself, only a few managed to predict relevant features of the DNA recognition mode. As judged by the first model submitted and in descending order, the best predictions are from Bhattacharya (TS369), AF3‐server (TS304), Huang‐HUST (TS091), MIEnsembles‐Server (TS110), followed by identical models from MULTICOM_human (TS345) and MULTICOM (TS051), and essentially identical models from NKRNA‐s (TS028) and Zheng (TS462), with Interface Contact Scores (ICS) and Interface Contacts Recall (ICR) scores above 0.5. When extending the evaluation to all five models submitted, elofsson (TS241) is also found in this leading cohort with a model identical to the first model of the AF3‐server. While all these groups managed to predict the binding interface along the tip of the N‐terminal domain, none of them predicted the intercalation of the arginine and the resultant kink in the DNA chain (Figure 11B). Rather, the DNA is modeled as a helical extension to the 3′ fragment bound to the N‐terminal domain, clashing with the arginine, and consequently, none of the models predicted the interface responsible for the binding of the 5′ segment. In conclusion, while several groups were able to predict a part of the protein‐nucleic acid interaction site in CbUDE‐ssDNA, further progress is needed to predict presumably functionally relevant details.
3. Conclusions
With the advent of accurate protein structure prediction algorithms, the nucleic acid community is eager for similar advances. Such algorithms should aim to predict the biologically, chemically, and biophysically important aspects of nucleic‐acid‐containing structures. Through 12 examples, we have outlined vital features of nucleic acid structures, exposing areas for improvement in prediction accuracy. Critically, the detailed analysis by experimentalists reveals inaccuracies that CASP metrics may conceal, such as the enzymatically important η‐η′ and π‐π′ interactions of the group IIC intron and the binding site of SAMURI.
In most targets, secondary structure was predicted accurately, even for large targets with no homologous structures such as ROOL and OLE. However, multiple experimentalists note an overreliance on simple helical secondary structure units. For example, many important backbone kinks, turns, or bends were not predicted accurately, including the L2 bend in xrRNA, G‐C domain in 3′ CITE, the target adenosine in SAMURI, the DNA near the intercalating arginine in UDE, and the DNA under tension in the SPβ LSI‐RDF synaptic complex.
Further, non‐canonical interactions were underpredicted or poorly predicted in many targets such as xrRNA, RRE SLII, SAMURI, and the DNA aptamer. Some non‐canonical motifs and backbone kinks were predicted correctly, such as the kink turn in OLE; predictors were able to predict the correct nucleotides involved in the OLE kink‐turn despite literature hypotheses involving different nucleotides. Additionally, there is some indication that evolutionary information may have assisted the prediction of tertiary interactions such as those in ROOL, even though global fold remained out of reach. Determining the global fold, or the arrangement of helices, remains a challenge more generally. For example, the orientation of the three helices of RRE SLII were predicted well when the junction was mutated to be simple (in target R1296) but very poorly when there were non‐canonical interactions at the junction (target R1209). These non‐canonical interactions play a vital role in forming functionally important regions in nucleic acid structures, including dictating the orientation of helices at junctions, and hence incorrect prediction can lead to drastic error in global fold.
We additionally emphasize the importance of co‐predicting the nucleic‐acid structure with essential ligands because non‐canonical interactions, including those with the ligand, were important for forming ligand binding sites. For example, dopamine intercalates between two bases in the DNA aptamer; the ZTP riboswitch changed conformation when a bulkier ligand was bound, and ions are vital for active site stability in group IIC intron. Such ligands are intrinsic structural elements and should be considered when modeling the nucleic acid structure. However, as noted for the DNA aptamer, many ligand predictions lacked basic chemical interactions integral to nucleic‐acid‐ligand interactions such as stacking and hydrogen bonding. This exposes nucleic‐acid‐ligand interactions as an area for improvement.
The principles considered in nucleic‐acid‐ligand interactions extend to RNA–RNA interactions and nucleic‐acid‐protein interactions. In both cases, interfaces were generally underpredicted or predicted inaccurately. RNA multimer predictions were of poor quality, despite inter‐molecular RNA–RNA contacts consisting of the same building blocks, such as base pairs, as RNA tertiary structure. This suggests that knowledge of RNA tertiary structure was not directly transferable to RNA–RNA interface prediction. The novelty of RNA homo‐multimerization as a prediction challenge was evidenced by predictors submitting asymmetric assemblies for ROOL, inaccurate or underpredicted interfaces for OLE, and dimers with no interface for the 2′‐F RNA origami. With the lack of previous examples, this poor performance is not surprising. Hopefully, the discovery of these complex RNA‐multimeric structures will result in increased interface prediction accuracy in future challenges.
While there are more data on nucleic‐acid‐protein complexes, their predictions remain challenging except for known Fab‐RNA interactions such as those used to engineer a crystallization construct for 3′ CITE. Generally, the location of the interface on the protein was predicted accurately, but the interactions, such as the intercalation of an arginine of UDE into ssDNA, were not predicted accurately. For example, while the DNA helix was docked in the correct location of the SPβ LSI‐RDF synaptic complex, all predictions placed the incorrect section of the DNA helix in the binding pocket. Finally, the majority of predictions placed the RNA aptamer in the incorrect location of the SARS‐CoV 2 spike protein and those that did place it correctly, did not predict interactions accurately.
The inaccuracy of the SARS‐CoV 2 spike protein‐RNA aptamer interface may be due to the 2′‐F modification of the RNA which predictors did not appear to consider. This modification also induced a dimerization of an RNA origami which does not dimerize without modification, highlighting the importance of considering nucleotide modifications. In the future, the CASP target definition format might be expanded to provide a machine‐readable field or format for nucleotide modifications.
We note that predictions that deviate from the experimental structure may be valid biologically‐relevant structures. The experimental structure presented here‐in do not represent the only ground‐truth structures—they represent the structure(s) of the molecule under specific experimental conditions and time points. For example, predictions for the SPβ LSI‐RDF synaptic complex could represent different states along the catalytic cycle of recombination or predictions of the RRE SLII could represent different states in the solution ensemble of structures it occupies that were not crystallized. This offers challenges and opportunities for the structure prediction community including how to consider experimental conditions in structure prediction and how to assess the accuracy of both the predictions and experimental structures, perhaps by systematic comparison to independently collected evolutionary and experimental data. In the future, experimentalists could be asked to contribute data containing information about the ensemble of structures, such as particle stacks of cryo‐EM data, to tackle these challenges.
Overall, we observe accurate predictions of nucleic acid secondary structure and, in some cases, global folds; but we warn against over‐reliance on canonical structural motifs, in particular in functionally important regions. We urge improvement in the prediction of backbone kinks and non‐canonical base‐pairing. The prediction of nucleic acid interactions with proteins, ligands, and other nucleic acids should remain emphasized in future CASPs to drive progress in this new frontier for structure prediction. Due to the effort of experimentalists across the world, CASP16 presented nucleic acid structures with novel folds, novel interactions, and even novel prediction categories, demonstrating that there remain underexplored aspects of nucleic acid structure only unveiled by future experimental structure determination.
Author Contributions
Rachael C. Kretsch: formal analysis, writing – review and editing, writing – original draft, conceptualization. Reinhard Albrecht: formal analysis, writing – original draft. Ebbe S. Andersen: writing – original draft, formal analysis. Hsuan‐Ai Chen: writing – original draft, formal analysis. Wah Chiu: writing – original draft, formal analysis. Rhiju Das: formal analysis, writing – original draft. Jeanine G. Gezelle: formal analysis, writing – original draft. Marcus D. Hartmann: formal analysis, writing – original draft. Claudia Höbartner: writing – original draft, formal analysis. Yimin Hu: writing – original draft, formal analysis. Shekhar Jadhav: writing – original draft, formal analysis. Philip E. Johnson: formal analysis, writing – original draft. Christopher P. Jones: formal analysis, writing – original draft. Deepak Koirala: formal analysis, writing – original draft. Emil L. Kristoffersen: writing – original draft, formal analysis. Eric Largy: writing – original draft, formal analysis. Anna Lewicka: formal analysis, writing – original draft. Cameron D. Mackereth: formal analysis, writing – original draft. Marco Marcia: formal analysis, writing – original draft. Michela Nigro: writing – original draft, formal analysis. Manju Ojha: formal analysis, writing – original draft. Joseph A. Piccirilli: writing – original draft, formal analysis. Phoebe A. Rice: writing – original draft, formal analysis. Heewhan Shin: formal analysis, writing – original draft. Anna‐Lena Steckelberg: formal analysis, writing – original draft. Zhaoming Su: writing – original draft, formal analysis. Yoshita Srivastava: formal analysis, writing – original draft. Liu Wang: writing – original draft, formal analysis. Yuan Wu: formal analysis, writing – original draft. Jiahao Xie: writing – original draft, formal analysis. Nikolaj H. Zwergius: writing – original draft, formal analysis. John Moult: conceptualization, supervision, project administration, writing – review and editing. Andriy Kryshtafovych: conceptualization, supervision, project administration, writing – review and editing.
Conflicts of Interest
The authors declare no conflicts of interest.
Acknowledgments
We acknowledge the following sources of funding and support: NIH/NIGMS (R01GM100482 to A.K., R35GM149336 to J.A.P., Northeastern Collaborative Access Team beamlines P30 GM124165); National Institutes of Health grants (R01GM079429 to W.C., R35GM122579 to R.D., R35GM150778 to A‐L.S, T32 grant GM066706 to M.O., transition award K22HL139920‐01 to C.P.J., intramural program of the National Heart, Lung and Blood Institute); National Institutes of Health Common Fund Transformative High‐Resolution Cryo‐Electron Microscopy program (U24GM129541); Center for Structural Biology of HIV‐1 RNA (CRNA) Collaborative Development Pilot Grant Program (NIH NIAID1U54AI170660, Sub‐award SUBK00019302) to D.K.; the National Science Foundation (2330652 to W.C. and R.D., GRFP DGE‐2036197 to J.G.G.); Bio‐X Bowes Graduate Student Fellowship (R.C.K.); the Howard Hughes Medical Institute (to R.D.); National Key Research and Development Program of China (2022YFC2303700 to Z.S.); Natural Science Foundation of China (32222040 to Z.S.); Swedish National Research Council (VR, 2024‐04107 to M.M.); European Commission (HORIZON‐MSCA‐2023‐DN‐01) action (project: TargetRNA, n. 101168667 to M.M.); France Canada Research Fund (to C.D.M. and P.E.J.); Natural Sciences and Engineering Research Council of Canada (to P.E.J.); EMBION Cryo‐EM Facility at iNANO, Aarhus University; Novo Nordisk Foundation (NNF21OC0070452 to E.S.A); the Villum Foundation (VIL71957 E.L.K); Eiger 16 M detector on 24‐ID‐E is funded by a NIH‐ORIP HEI grant (S10OD021527); a U.S. Department of Energy (DOE) Office of Science User Facility operated for the DOE Office of Science by Argonne National Laboratory under Contract No. 10.13039/100000015; Deutsche Forschungsgemeinschaft (project no 463143961 and Gottfried Wilhelm Leibniz Programme to C.H.). This article is subject to HHMI's Open Access to Publications policy. HHMI lab heads have previously granted a nonexclusive CC BY 4.0 license to the public and a sublicensable license to HHMI in their research articles. Pursuant to those licenses, the author‐accepted manuscript of this article can be made freely available under a CC BY 4.0 license immediately upon publication.
Kretsch R. C., Albrecht R., Andersen E. S., et al., “Functional Relevance of CASP16 Nucleic Acid Predictions as Evaluated by Structure Providers,” Proteins: Structure, Function, and Bioinformatics 94, no. 1 (2026): 51–78, 10.1002/prot.70043.
Funding: This work was supported by NIH/NIGMS (R01GM100482 to A.K., R35GM149336 to J.A.P., Northeastern Collaborative Access Team beamlines P30 GM124165); National Institutes of Health grants (R01GM079429 to W.C., R35GM122579 to R.D., R35GM150778 to A.‐L.S., T32 grant GM066706 to M.O., transition award K22HL139920‐01 to C.P.J., intramural program of the National Heart, Lung and Blood Institute); National Institutes of Health Common Fund Transformative High‐Resolution Cryo‐Electron Microscopy Program (U24GM129541); Center for Structural Biology of HIV‐1 RNA (CRNA) Collaborative Development Pilot Grant Program (NIH NIAID1U54AI170660, Sub‐award SUBK00019302) to D.K.; the National Science Foundation (2330652 to W.C. and R.D., GRFP DGE‐2036197 to J.G.G.); Bio‐X Bowes Graduate Student Fellowship (R.C.K.); the Howard Hughes Medical Institute (to R.D.); National Key Research and Development Program of China (2022YFC2303700 to Z.S.); Natural Science Foundation of China (32222040 to Z.S.); Swedish National Research Council (VR, 2024‐04107 to M.M.); European Commission (HORIZON‐MSCA‐2023‐DN‐01) action (project: TargetRNA, n. 101168667 to M.M.); France Canada Research Fund (to C.D.M. and P.E.J.); Natural Sciences and Engineering Research Council of Canada (to P.E.J.); EMBION Cryo‐EM Facility at iNANO, Aarhus University; Novo Nordisk Foundation (NNF21OC0070452 to E.S.A); the Villum Foundation (VIL71957 E.L.K); Eiger 16 M detector on 24‐ID‐E is funded by a NIH‐ORIP HEI grant (S10OD021527); a U.S. Department of Energy (DOE) Office of Science User Facility operated for the DOE Office of Science by Argonne National Laboratory under Contract No. 10.13039/100000015; Deutsche Forschungsgemeinschaft (project no 463143961 and Gottfried Wilhelm Leibniz Programme to C.H.).
Contributor Information
John Moult, Email: jmoult@tunc.org.
Andriy Kryshtafovych, Email: akryshtafovych@ucdavis.edu.
Data Availability Statement
The data that support the findings of this study are openly available at the CASP website, the scores can be found at: https://predictioncenter.org/casp16/results.cgi?tr_type=rna, https://predictioncenter.org/casp16/results.cgi?tr_type=rna_multi, https://predictioncenter.org/casp16/results.cgi?tr_type=hybrid. The predicted models can be found at: https://predictioncenter.org/download_area/CASP16/predictions/RNA/, https://predictioncenter.org/download_area/CASP16/predictions/oligo/, https://predictioncenter.org/download_area/CASP16/predictions/hybrid/.
References
- 1. Akdel M., Pires D. E. V., Pardo E. P., et al., “A Structural Biology Community Assessment of AlphaFold2 Applications,” Nature Structural & Molecular Biology 29 (2022): 1056–1067. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Das R., Kretsch R. C., Simpkin A. J., et al., “Assessment of Three‐Dimensional RNA Structure Prediction in CASP15,” Proteins 91 (2023): 1747–1770. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Kretsch R. C., Hummer A. M., He S., et al., “Assessment of Nucleic Acid Structure Prediction in CASP16,” bioRxiv 2025.05.06.652459, 2025, 10.1101/2025.05.06.652459. [DOI] [PubMed]
- 4. Bu F., Adam Y., Adamiak R. W., et al., “RNA‐Puzzles Round V: Blind Predictions of 23 RNA Structures,” Nature Methods 22 (2025): 399–411. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Kretsch R. C., Andersen E. S., Bujnicki J. M., et al., “RNA Target Highlights in CASP15: Evaluation of Predicted Models by Structure Providers,” Proteins 91 (2023): 1600–1615. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Westhof E., Sun H., Bu F., and Miao Z., The RNA‐Puzzles Assessments of RNA‐Only Targets in CASP16 (Authorea Inc, 2025), 10.22541/au.174717729.93103064/v1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Vicens Q. and Kieft J. S., “Shared Properties and Singularities of Exoribonuclease‐Resistant RNAs in Viruses,” Computational and Structural Biotechnology Journal 19 (2021): 4373–4380. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Slonchak A. and Khromykh A. A., “Subgenomic Flaviviral RNAs: What Do We Know After the First Decade of Research,” Antiviral Research 159 (2018): 13–25. [DOI] [PubMed] [Google Scholar]
- 9. Pijlman G. P., Funk A., Kondratieva N., et al., “A Highly Structured, Nuclease‐Resistant, Noncoding RNA Produced by Flaviviruses Is Required for Pathogenicity,” Cell Host & Microbe 4 (2008): 579–591. [DOI] [PubMed] [Google Scholar]
- 10. Steckelberg A.‐L., Akiyama B. M., Costantino D. A., Sit T. L., Nix J. C., and Kieft J. S., “A Folded Viral Noncoding RNA Blocks Host Cell Exoribonucleases Through a Conformationally Dynamic RNA Structure,” Proceedings of the National Academy of Sciences of The United States of America 115 (2018): 6404–6409. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Steckelberg A.‐L., Vicens Q., and Kieft J. S., “Exoribonuclease‐Resistant RNAs Exist Within Both Coding and Noncoding Subgenomic RNAs,” MBio 9 (2018): e02461‐18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Szucs M. J., Nichols P. J., Jones R. A., Vicens Q., and Kieft J. S., “A New Subclass of Exoribonuclease‐Resistant RNA Found in Multiple Genera of Flaviviridae,” MBio 11 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Akiyama B. M., Laurence H. M., Massey A. R., et al., “Zika Virus Produces Noncoding RNAs Using a Multi‐Pseudoknot Structure That Confounds a Cellular Exonuclease,” Science 354 (2016): 1148–1152. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Gezelle J. G., Korn S. M., McDonald J. T., et al., “A Conserved Viral RNA Fold Enables Nuclease Resistance Across Kingdoms of Life.” Nucleic Acids Resistance 53, no. 16 (2025), 10.1093/nar/gkaf840. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Steckelberg A.‐L., Vicens Q., Costantino D. A., Nix J. C., and Kieft J. S., “The Crystal Structure of a Polerovirus Exoribonuclease‐Resistant RNA Shows How Diverse Sequences Are Integrated Into a Conserved Fold,” RNA 26 (2020): 1767–1776. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Malim M. H., Tiley L. S., McCarn D. F., Rusche J. R., Hauber J., and Cullen B. R., “HIV‐1 Structural Gene Expression Requires Binding of the Rev Trans‐Activator to Its RNA Target Sequence,” Cell 60 (1990): 675–683. [DOI] [PubMed] [Google Scholar]
- 17. Watts J. M., Dang K. K., Gorelick R. J., et al., “Architecture and Secondary Structure of an Entire HIV‐1 RNA Genome,” Nature 460 (2009): 711–716. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Malim M. H., Hauber J., Le S. Y., Maizel J. V., and Cullen B. R., “The HIV‐1 Rev Trans‐Activator Acts Through a Structured Target Sequence to Activate Nuclear Export of Unspliced Viral mRNA,” Nature 338 (1989): 254–257. [DOI] [PubMed] [Google Scholar]
- 19. Malim M. H. and Cullen B. R., “HIV‐1 Structural Gene Expression Requires the Binding of Multiple Rev Monomers to the Viral RRE: Implications for HIV‐1 Latency,” Cell 65 (1991): 241–248. [DOI] [PubMed] [Google Scholar]
- 20. Cullen B. R., “Nuclear RNA Export Pathways,” Molecular and Cellular Biology 20 (2000): 4181–4187. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Zapp M. L., Stern S., and Green M. R., “Small Molecules That Selectively Block RNA Binding of HIV‐1 Rev Protein Inhibit Rev Function and Viral Production,” Cell 74 (1993): 969–978. [DOI] [PubMed] [Google Scholar]
- 22. Jin Y. and Cowan J. A., “Targeted Cleavage of HIV Rev Response Element RNA by Metallopeptide Complexes,” Journal of the American Chemical Society 128 (2006): 410–411. [DOI] [PubMed] [Google Scholar]
- 23. Huang X. J., Hope T. J., Bond B. L., McDonald D., Grahl K., and Parslow T. G., “Minimal Rev‐Response Element for Type 1 Human Immunodeficiency Virus,” Journal of Virology 65 (1991): 2131–2134. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Fang X., Wang J., O'Carroll I. P., et al., “An Unusual Topological Structure of the HIV‐1 Rev Response Element,” Cell 155 (2013): 594–605. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Degenhardt M. F. S., Degenhardt H. F., Bhandari Y. R., et al., “Determining Structures of RNA Conformers Using AFM and Deep Neural Networks,” Nature 637 (2025): 1234–1243. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Tipo J., Gottipati K., Slaton M., Gonzalez‐Gutierrez G., and Choi K. H., “Structure of HIV‐1 RRE Stem‐Loop II Identifies Two Conformational States of the High‐Affinity Rev Binding Site,” Nature Communications 15 (2024): 4198. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Banna H. A., Das N. K., Ojha M., and Koirala D., “Advances in Chaperone‐Assisted RNA Crystallography Using Synthetic Antibodies,” BBA Advances 4 (2023): 100101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Ojha M., Vogt J., Das N. K., et al., “Structure of Saguaro Cactus Virus 3′ Translational Enhancer Mimics 5′ Cap for eIF4E Binding,” Proceedings of the National Academy of Sciences of The United States of America 121 (2024): e2313677121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Ojha M., Hudson L., Photenhauer A., et al., “Mutation‐Driven RRE Stem‐Loop II Conformational Change Induces HIV‐1 Nuclear Export Dysfunction,” Nucleic Acids Res 53, no. 12 (2025): gkaf583, 10.1093/nar/gkaf583. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Browning K. S. and Bailey‐Serres J., “Mechanism of Cytoplasmic mRNA Translation,” Arabidopsis Book 13 (2015): e0176. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Jaafar Z. A. and Kieft J. S., “Viral RNA Structure‐Based Strategies to Manipulate Translation,” Nature Reviews. Microbiology 17 (2019): 110–123. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Martinez‐Salas E., Francisco‐Velilla R., Fernandez‐Chamorro J., and Embarek A. M., “Insights Into Structural and Mechanistic Features of Viral IRES Elements,” Frontiers in Microbiology 8 (2017): 2629. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Nicholson B. L. and White K. A., “3′ Cap‐Independent Translation Enhancers of Positive‐Strand RNA Plant Viruses,” Current Opinion in Virology 1 (2011): 373–380. [DOI] [PubMed] [Google Scholar]
- 34. Kraft J. J., Peterson M. S., Cho S. K., et al., “The 3′ Untranslated Region of a Plant Viral RNA Directs Efficient Cap‐Independent Translation in Plant and Mammalian Systems,” Pathogens 8, no. 1 (2019): 28. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Lewicka A., Roman C., Jones S., Disare M., Rice P. A., and Piccirilli J. A., “Crystal Structure of a Cap‐Independent Translation Enhancer RNA,” Nucleic Acids Research 51 (2023): 8891–8907. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Kim P. B., Nelson J. W., and Breaker R. R., “An Ancient Riboswitch Class in Bacteria Regulates Purine Biosynthesis and One‐Carbon Metabolism,” Molecular Cell 57 (2015): 317–328. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Jones C. P. and Ferré‐D'Amaré A. R., “Recognition of the Bacterial Alarmone ZMP Through Long‐Distance Association of Two RNA Subdomains,” Nature Structural & Molecular Biology 22 (2015): 679–685. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Strobel E. J., Cheng L., Berman K. E., Carlson P. D., and Lucks J. B., “A Ligand‐Gated Strand Displacement Mechanism for ZTP Riboswitch Transcription Control,” Nature Chemical Biology 15 (2019): 1067–1076. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Hua B., Jones C. P., Mitra J., et al., “Real‐Time Monitoring of Single ZTP Riboswitches Reveals a Complex and Kinetically Controlled Decision Landscape,” Nature Communications 11 (2020): 4531. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Trausch J. J., Marcano‐Velázquez J. G., Matyjasik M. M., and Batey R. T., “Metal Ion‐Mediated Nucleobase Recognition by the ZTP Riboswitch,” Chemistry & Biology 22 (2015): 829–837. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Tran B., Pichling P., Tenney L., et al., “Parallel Discovery Strategies Provide a Basis for Riboswitch Ligand Design,” Cell Chemical Biology 27 (2020): 1241–1249.e4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Fullenkamp C. R., Mehdi S., Jones C. P., et al., “Machine Learning‐Augmented Molecular Dynamics Simulations (MD) Reveal Insights Into the Disconnect Between Affinity and Activation of ZTP Riboswitch Ligands,” Angewandte Chemie (International Ed. in English) 64 (2025): e202505971. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Wilson T. J. and Lilley D. M. J., “The Potential Versatility of RNA Catalysis,” Wiley Interdisciplinary Reviews: RNA 12 (2021): e1651. [DOI] [PubMed] [Google Scholar]
- 44. Panchapakesan S. S. S. and Breaker R. R., “The Case of the Missing Allosteric Ribozymes,” Nature Chemical Biology 17 (2021): 375–382. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Micura R. and Höbartner C., “Fundamental Studies of Functional Nucleic Acids: Aptamers, Riboswitches, Ribozymes and DNAzymes,” Chemical Society Reviews 49 (2020): 7331–7353. [DOI] [PubMed] [Google Scholar]
- 46. Scheitl C. P. M., Ghaem Maghami M., Lenz A.‐K., and Höbartner C., “Site‐Specific RNA Methylation by a Methyltransferase Ribozyme,” Nature 587 (2020): 663–667. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. Flemmich L., Heel S., Moreno S., Breuker K., and Micura R., “A Natural Riboswitch Scaffold With Self‐Methylation Activity,” Nature Communications 12 (2021): 3877. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48. Jiang H., Gao Y., Zhang L., Chen D., Gan J., and Murchie A. I. H., “The Identification and Characterization of a Selected SAM‐Dependent Methyltransferase Ribozyme That Is Present in Natural Sequences,” Nature Catalysis 4 (2021): 872–881. [Google Scholar]
- 49. Okuda T., Lenz A.‐K., Seitz F., Vogel J., and Höbartner C., “A SAM Analogue‐Utilizing Ribozyme for Site‐Specific RNA Alkylation in Living Cells,” Nature Chemistry 15 (2023): 1523–1531. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. Batey R. T., “Recognition of S‐Adenosylmethionine by Riboswitches,” RNA 2 (2011): 299–311. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51. Chen H.‐A., Okuda T., Lenz A. K., Scheitl C. P. M., Schindelin H., and Höbartner C., “Structure and Catalytic Activity of the SAM‐Utilizing Ribozyme SAMURI,” Nature Chemical Biology (2025), 10.1038/s41589-024-01808-w. [DOI] [PubMed] [Google Scholar]
- 52. Liebschner D., Afonine P. V., Baker M. L., et al., “Macromolecular Structure Determination Using X‐Rays, Neutrons and Electrons: Recent Developments in Phenix,” Acta Crystallographica Section D, Structural Biology 75 (2019): 861–877. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53. Nakatsuka N., Yang K. A., Abendroth J. M., et al., “Aptamer‐Field‐Effect Transistors Overcome Debye Length Limitations for Small‐Molecule Sensing,” Science 362 (2018): 319–324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54. Liu X., Hou Y., Chen S., and Liu J., “Controlling Dopamine Binding by the New Aptamer for a FRET‐Based Biosensor,” Biosensors & Bioelectronics 173 (2021): 112798. [DOI] [PubMed] [Google Scholar]
- 55. Kaiyum Y. A., Hoi Pui Chao E., Dhar L., et al., “Ligand‐Induced Folding in a Dopamine‐Binding DNA Aptamer,” Chembiochem 25 (2024): e202400493. [DOI] [PubMed] [Google Scholar]
- 56. Chillón I. and Marcia M., “Self‐Splicing GroupIIIntrons,” in Ribozymes (Wiley, 2021), 143–167, 10.1002/9783527814527.ch6. [DOI] [Google Scholar]
- 57. Marcia M. and Pyle A. M., “Visualizing Group II Intron Catalysis Through the Stages of Splicing,” Cell 151 (2012): 497–507. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58. Manigrasso J., Chillón I., Genna V., et al., “Visualizing Group II Intron Dynamics Between the First and Second Steps of Splicing,” Nature Communications 11 (2020): 2837. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59. Darty K., Denise A., and Ponty Y., “VARNA: Interactive Drawing and Editing of the RNA Secondary Structure,” Bioinformatics 25 (2009): 1974–1975. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60. Meng E. C., Goddard T. D., Pettersen E. F., et al., “UCSF ChimeraX: Tools for Structure Building and Analysis,” Protein Science 32 (2023): e4792. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61. Silvestri I., Manigrasso J., Andreani A., et al., “Targeting the Conserved Active Site of Splicing Machines With Specific and Selective Small Molecule Modulators,” Nature Communications 15 (2024): 4980. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62. Xu X., Zhao P., and Chen S.‐J., “Vfold: A Web Server for RNA Structure and Folding Thermodynamics Prediction,” PLoS One 9 (2014): e107504. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63. Moafinejad S. N., de Aquino B. R. H., Boniecki M. J., et al., “SimRNAweb v2.0: A Web Server for RNA Folding Simulations and 3D Structure Modeling, With Optional Restraints and Enhanced Analysis of Folding Trajectories,” Nucleic Acids Research 52 (2024): W368–W373. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64. Zhao C., Rajashankar K. R., Marcia M., and Pyle A. M., “Crystal Structure of Group II Intron Domain 1 Reveals a Template for RNA Assembly,” Nature Chemical Biology 11 (2015): 967–972. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65. Marcia M. and Pyle A. M., “Principles of Ion Recognition in RNA: Insights From the Group II Intron Structures,” RNA 20 (2014): 516–527. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66. Manigrasso J., Marcia M., and De Vivo M., “Computer‐Aided Design of RNA‐Targeted Small Molecules: A Growing Need in Drug Discovery,” Chem 7 (2021): 2965–2988. [Google Scholar]
- 67. Weinberg Z., Lünse C. E., Corbino K. A., et al., “Detection of 224 Candidate Structured RNAs by Comparative Analysis of Specific Subsets of Intergenic Regions,” Nucleic Acids Research 45 (2017): 10811–10823. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68. Cousin F. J., Lynch D. B., Chuat V., et al., “A Long and Abundant Non‐Coding RNA in Lactobacillus salivarius ,” Microbial Genomics 3 (2017): e000126. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69. Wang L., Xie J., Gong T., et al., “Cryo‐EM Reveals Mechanisms of Natural RNA Multivalency,” Science 388 (2025): 545–550. [DOI] [PubMed] [Google Scholar]
- 70. Kretsch R. C., Wu Y., Shabalina S. A., et al., “Naturally Ornate RNA‐Only Complexes Revealed by Cryo‐EM,” Nature 643 (2025): 1135–1142. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71. Jones C. P. and Ferré‐D'Amaré A. R., “RNA Quaternary Structure and Global Symmetry,” Trends in Biochemical Sciences 40 (2015): 211–220. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72. Schweke H., Pacesa M., Levin T., et al., “An Atlas of Protein Homo‐Oligomerization Across Domains of Life,” Cell 187 (2024): 999–1010.e15. [DOI] [PubMed] [Google Scholar]
- 73. Bou‐Nader C. and Zhang J., “Structural Insights Into RNA Dimerization: Motifs, Interfaces and Functions,” Molecules 25 (2020): 2881. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74. Ding F., Lu C., Zhao W., et al., “Structure and Assembly of the Essential RNA Ring Component of a Viral DNA Packaging Motor,” Proceedings of the National Academy of Sciences of The United States of America 108 (2011): 7357–7362. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75. Mao H., Saha M., Reyes‐Aldrete E., et al., “Structural and Molecular Basis for Coordination in a Viral DNA Packaging Motor,” Cell Reports 14 (2016): 2017–2029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76. Mieczkowski M., Steinmetzger C., Bessi I., et al., “Large Stokes Shift Fluorescence Activation in an RNA Aptamer by Intermolecular Proton Transfer to Guanine,” Nature Communications 12 (2021): 3549. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77. Puerta‐Fernandez E., Barrick J. E., Roth A., and Breaker R. R., “Identification of a Large Noncoding RNA in Extremophilic Eubacteria,” Proceedings of the National Academy of Sciences of The United States of America 103 (2006): 19490–19495. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78. Block K. F., Puerta‐Fernandez E., Wallace J. G., and Breaker R. R., “Association of OLE RNA With Bacterial Membranes via an RNA‐Protein Interaction: OLE RNA Membrane Localization,” Molecular Microbiology 79 (2011): 21–34. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79. Breaker R. R., Harris K. A., Lyon S. E., Wencker F. D. R., and Fernando C. M., “Evidence That OLE RNA Is a Component of a Major Stress‐Responsive Ribonucleoprotein Particle in Extremophilic Bacteria,” Molecular Microbiology 120 (2023): 324–340. [DOI] [PubMed] [Google Scholar]
- 80. Lyon S. E., Wencker F. D. R., Fernando C. M., Harris K. A., and Breaker R. R., “Disruption of the Bacterial OLE RNP Complex Impairs Growth on Alternative Carbon Sources,” PNAS Nexus 3 (2024): gae075. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81. Das R. and Watkins A. M., “RiboDraw: Semiautomated Two‐Dimensional Drawing of RNA Tertiary Structure Diagrams,” NAR Genomics and Bioinformatics 3 (2021): lqab091. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82. Fernando C. M. and Breaker R. R., “Bioinformatic Prediction of Proteins Relevant to Functions of the Bacterial OLE Ribonucleoprotein Complex,” mSphere 9 (2024): e0015924. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83. Duffy K., Arangundy‐Franklin S., and Holliger P., “Modified Nucleic Acids: Replication, Evolution, and Next‐Generation Therapeutics,” BMC Biology 18 (2020): 112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84. Morais P., Adachi H., and Yu Y.‐T., “The Critical Contribution of Pseudouridine to mRNA COVID‐19 Vaccines,” Frontiers in Cell and Development Biology 9 (2021): 789427. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85. Das R. S., Datta D., Brown C. R., et al., “Expanding Conjugate Space of RNAi Therapeutics: Ligand at the 3′ End of the Antisense Strand Achieves Uncompromised In Vivo Potency and Efficacy and Reveals Interactions With the Argonaute‐2 PAZ Domain,” Journal of Medicinal Chemistry 68 (2025): 4397–4409. [DOI] [PubMed] [Google Scholar]
- 86. Sakovina L., Vokhtantsev I., Vorobyeva M., Vorobyev P., and Novopashina D., “Improving Stability and Specificity of CRISPR/Cas9 System by Selective Modification of Guide RNAs With 2′‐Fluoro and Locked Nucleic Acid Nucleotides,” International Journal of Molecular Sciences 23 (2022): 13460. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87. Pallan P. S., Greene E. M., Jicman P. A., et al., “Unexpected Origins of the Enhanced Pairing Affinity of 2′‐Fluoro‐Modified RNA,” Nucleic Acids Research 39 (2011): 3482–3495. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88. Padilla R. and Sousa R. A., “Y639F/H784A T7 RNA Polymerase Double Mutant Displays Superior Properties for Synthesizing RNAs With Non‐Canonical NTPs,” Nucleic Acids Research 30 (2002): e138. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89. McRae E. K. S., Rasmussen H. Ø., Liu J., et al., “Structure, Folding and Flexibility of Co‐Transcriptional RNA Origami,” Nature Nanotechnology 18 (2023): 808–817. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90. Valero J., Civit L., Dupont D. M., et al., “A Serum‐Stable RNA Aptamer Specific for SARS‐CoV‐2 Neutralizes Viral Entry,” Proceedings of the National Academy of Sciences of The United States of America 118 (2021): e2112942118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91. Smith M. C. M. and Thorpe H. M., “Diversity in the Serine Recombinases: Diversity in the Serine Recombinases,” Molecular Microbiology 44 (2002): 299–307. [DOI] [PubMed] [Google Scholar]
- 92. Fanton A., Bartie L. J., Martins J. Q., et al., “Site‐Specific DNA Insertion Into the Human Genome With Engineered Recombinases,” bioRxiv 2024.2024.11.01.621560, 2024, 10.1101/2024.11.01.621560. [DOI] [PubMed]
- 93. Khaleel T., Younger E., McEwan A. R., Varghese A. S., and Smith M. C. M., “A Phage Protein That Binds φC31 Integrase to Switch Its Directionality: Excision by φC31 Integrase,” Molecular Microbiology 80 (2011): 1450–1463. [DOI] [PubMed] [Google Scholar]
- 94. Ghosh P., Wasil L. R., and Hatfull G. F., “Control of Phage Bxb1 Excision by a Novel Recombination Directionality Factor,” PLoS Biology 4 (2006): e186. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95. Rutherford K. and Van Duyne G. D., “The Ins and Outs of Serine Integrase Site‐Specific Recombination,” Current Opinion in Structural Biology 24 (2014): 125–131. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 96. Shin H., Pigli Y., Reyes T. P., et al., “Structural Basis of Directionality Control in Large Serine Integrases,” bioRxiv 2025.01.03.631226, 2025, 10.1101/2025.01.03.631226. [DOI]
- 97. Li W., Kamtekar S., Xiong Y., Sarkis G. J., Grindley N. D., and Steitz T. A., “Structure of a Synaptic Gammadelta Resolvase Tetramer Covalently Linked to Two Cleaved DNAs,” Science 309 (2005): 1210–1215. [DOI] [PubMed] [Google Scholar]
- 98. Hartmann M. D., Boichenko I., Coles M., Zanini F., Lupas A. N., and Hernandez Alvarez B., “Thalidomide Mimics Uridine Binding to an Aromatic Cage in Cereblon,” Journal of Structural Biology 188 (2014): 225–232. [DOI] [PubMed] [Google Scholar]
- 99. Békési A., Pukáncsik M., Muha V., et al., “A Novel Fruitfly Protein Under Developmental Control Degrades Uracil‐DNA,” Biochemical and Biophysical Research Communications 355 (2007): 643–648. [DOI] [PubMed] [Google Scholar]
- 100. Kryshtafovych A., Albrecht R., Baslé A., et al., “Target Highlights From the First Post‐PSI CASP Experiment (CASP12, May‐August 2016),” Proteins 86, no. Suppl 1 (2018): 27–50. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The data that support the findings of this study are openly available at the CASP website, the scores can be found at: https://predictioncenter.org/casp16/results.cgi?tr_type=rna, https://predictioncenter.org/casp16/results.cgi?tr_type=rna_multi, https://predictioncenter.org/casp16/results.cgi?tr_type=hybrid. The predicted models can be found at: https://predictioncenter.org/download_area/CASP16/predictions/RNA/, https://predictioncenter.org/download_area/CASP16/predictions/oligo/, https://predictioncenter.org/download_area/CASP16/predictions/hybrid/.
