Abstract
Computational RNA design tasks are often posed as inverse problems, where sequences are designed based on adopting a single desired secondary structure without considering 3D geometry and conformational diversity. We introduce gRNAde, a geometric RNA design pipeline operating on 3D RNA backbones to design sequences that explicitly account for structure and dynamics. Under the hood, gRNAde is a multi-state Graph Neural Network that generates candidate RNA sequences conditioned on one or more 3D backbone structures where the identities of the bases are unknown. On a single-state fixed backbone re-design benchmark of 14 RNA structures from the PDB identified by Das et al. [2010], gRNAde obtains higher native sequence recovery rates (56% on average) compared to Rosetta (45% on average), taking under a second to produce designs compared to the reported hours for Rosetta. We further demonstrate the utility of gRNAde on a new benchmark of multi-state design for structurally flexible RNAs, as well as zero-shot ranking of mutational fitness landscapes in a retrospective analysis of a recent RNA polymerase ribozyme structure.
1. Introduction
Why RNA design?
Historical efforts in computational drug discovery have focussed on designing small molecule or protein-based medicines that either treat symptoms or counter the end stages of disease processes. In recent years, there is a growing interest in designing new RNA-based therapeutics that intervene earlier in disease processes to cut off disease-causing information flow in the cell [Damase et al., 2021, Zhu et al., 2022]. Notable examples of RNA molecules at the forefront of biotechnology today include mRNA vaccines [Metkar et al., 2024] and CRISPR-based genomic medicine [Doudna and Charpentier, 2014]. Of particular interest for structure-based design are ribozymes and riboswitches in the untranslated regions of mRNAs [Mandal and Breaker, 2004, Leppek et al., 2018]. In addition to coding for proteins (such as the spike protein in the Covid vaccine), naturally occurring mRNAs contain riboswitches that are responsible for cell-state dependent protein expression of the mRNA. Riboswitches act by ‘switching’ their 3D structure from an unbound conformation to a bound one in the presence of specific metabolites or small molecules. Rational design of riboswitches will enable translation to be dependent on the presence or absence of partner molecules, essentially acting as ‘on-off’ switches for highly targeted mRNA therapies in the future [Felletti et al., 2016, Mustafina et al., 2019, Mohsen et al., 2023].
Challenges of RNA modelling.
Despite the promises of RNA therapeutics, proteins have instead been the primary focus in the 3D biomolecular modelling community. Availability of a large number of protein structures from the PDB combined with advances in deep learning for structured data [Bronstein et al., 2021, Duval et al., 2023] have revolutionized protein 3D structure prediction [Jumper et al., 2021] and rational design [Dauparas et al., 2022, Watson et al., 2023]. Applications of deep learning for computational RNA design are underexplored compared to proteins due to paucity of 3D structural data [Schneider et al., 2023]. Most tools for RNA design primarily focus on secondary structure without considering 3D geometry [Churkin et al., 2018] and use non-learnt algorithms for aligning 3D RNA fragments [Han et al., 2017, Yesselman et al., 2019], which can be restrictive due to the hand-crafted nature of the heuristics used.
In addition to limited 3D data for training deep learning models, the key technical challenge is that RNA is more dynamic than proteins. The same RNA can adopt multiple distinct conformational states to create and regulate complex biological functions [Ganser et al., 2019, Hoetzel and Suess, 2022, Ken et al., 2023]. Computational RNA design pipelines must account for both the 3D geometric structure and conformational flexibility of RNA to engineer new biological functions.
Our contributions.
This paper introduces gRNAde, a geometric deep learning-based pipeline for RNA inverse design conditioned on 3D structure, analogous to ProteinMPNN for proteins [Dauparas et al., 2022]. As illustrated in Figure 1, gRNAde generates candidate RNA sequences conditioned on one or more backbone 3D conformations, enabling both single- and multi-state fixed-backbone sequence design. The model is trained on RNA structures from the PDB at 4.0Å or better resolution (12K 3D structures from 4.2K unique RNAs) [Adamczyk et al., 2022], ranging from short RNAs such as riboswitches, aptamers and ribozymes to larger ribosomal RNAs.
We demonstrate the utility of gRNAde for the following design scenarios:
Improved performance and speed over Rosetta. We compare gRNAde to Rosetta [Leman et al., 2020], the state-of-the-art physically based tool for 3D RNA inverse design, for single-state fixed backbone design of 14 RNA structures of interest from the PDB identified by Das et al. [2010]. We obtain higher native sequence recovery rates with gRNAde (56% on average) compared to Rosetta (45% on average). Additionally, gRNAde is significantly faster than Rosetta for inference; e.g. sampling 100+ designs in 1 second for an RNA of 60 nucleotides on an A100 GPU, compared to the reported hours for Rosetta.
Enables multi-state RNA design, which was previously not possible with Rosetta. gRNAde with multi-state GNNs improves sequence recovery over an equivalent single-state model on a benchmark of structurally flexible RNAs, especially for surface nucleotides which undergo positional or secondary structural changes.
Zero-shot learning of RNA fitness landscape. In a retrospective analysis of mutational fitness landscape data for an RNA polymerase ribozyme [McRae et al., 2024], we show how gRNAde’s perplexity, the likelihood of a sequence folding into a backbone structure, can be used to rank mutants based on fitness in a zero-shot/unsupervised manner and outperforms random mutagenesis for improving fitness over the wild type in low throughput scenarios.
2. The gRNAde pipeline
2.1. The 3D RNA inverse folding problem
Figure 1 illustrates the RNA inverse folding problem: the task of designing new RNA sequences conditioned on a structural backbone. Given the 3D coordinates of a backbone structure, machine learning models must generate sequences that are likely to fold into that shape. The underlying assumption behind inverse folding (and rational biomolecule design) is that structure determines function [Huang et al., 2016]. To the best of our knowledge, gRNAde is the first explicitly multi-state inverse folding pipeline, allowing users to design sequences for backbone conformational ensembles (a set of 3D backbone structures) as opposed to a single structure. Our multi-state design framework aims to better capture RNA conformational dynamics which is often important for functionality in structured RNAs [Ken et al., 2023].
2.2. RNA conformational ensembles as geometric multi-graphs
Featurization.
The input to gRNAde is an RNA to be re-designed. For instance, this could be a set of PDB files with 3D backbone structures for the given RNA (a conformational ensemble) and the corresponding sequence of n nucleotides. As shown in Figure 2, gRNAde builds a geometric graph representation for each input structure:
We start with a 3-bead coarse-grained representation of the RNA backbone, retaining the coordinates for P, C4’, N1 (pyrimidine) or N9 (purine) for each nucleotide [Dawson et al., 2016]. This ‘pseudotorsional’ representation describes RNA backbones completely in most cases while reducing the size of the torsional space to prevent overfitting [Wadley et al., 2007].
Each nucleotide is assigned a node in the geometric graph with the 3D coordinate corresponding to the centroid of the 3 bead atoms. Random Gaussian noise with standard deviation 0.1Å is added to coordinates during training to prevent overfitting on crystallisation artifacts, following Dauparas et al. [2022]. Each node is connected by edges to its 32 nearest neighbours as measured by the pairwise distance in 3D space, .
Nodes are initialized with geometric features analogous to the featurization used in protein inverse folding [Ingraham et al., 2019, Jing et al., 2020]: (a) forward and reverse unit vectors along the backbone from the 5’ end to the 3’ end, and ); and (b) unit vectors, distances, angles, and torsions from each C4’ to the corresponding P and N1/N9.
Edge features for each edge from node to are initialized as: (a) the unit vector from the source to destination node, ; (b) the distance in 3D space, , encoded by 32 radial basis functions; and (c) the distance along the backbone, , encoded by 32 sinusoidal positional encodings.
Multi-graph representation.
As described in the previous section, given a set of (conformer) structures in the input conformational ensemble, each RNA backbone is featurized as a separate geometric graph with the scalar features , vector features , and an adjacency matrix. For clear presentation and without loss of generality, we omit edge features and use to denote scalar/vector feature channels.
The input to gRNAde is thus a set of geometric graphs which is merged into what we term a ‘multi-graph’ representation of the conformational ensemble, , by stacking the set of scalar features into one tensor along a new axis for the set size . Similarly, the set of vector features is stacked into one tensor . Lastly, the set of adjacency matrices are merged via a union into one single joint adjacency matrix .
2.3. Multi-state GNN for representation learning on conformational ensembles
The gRNAde model, illustrated in Appendix Figure 13, processes one or more RNA backbone graphs via a multi-state GNN encoder which is equivariant to 3D roto-translation of coordinates as well as to the ordering of conformers, followed by conformer order-invariant pooling and sequence decoding. We describe each component in the following sections.
Multi-state GNN encoder.
When representing conformational ensembles as a multi-graph, each node feature tensor contains three axes: (#nodes, #conformations, feature channels). We perform message passing on the multi-graph adjacency to independently process each conformer, while maintaining permutation equivariance of the updated feature tensors along both the first (#nodes) and second (#conformations) axes. This works by operating on only the feature channels axis and generalising the PyTorch Geometric [Fey and Lenssen, 2019] message passing class to account for the extra conformations axis; see Appendix Figure 14 and the pseudocode for details.
We use multiple rotation-equivariant GVP-GNN [Jing et al., 2020] layers to update scalar features and vector features for each node :
(1) |
(2) |
where MSG, UPD are Geometric Vector Perceptrons, a generalization of MLPs to take tuples of scalar and vector features as input and apply -equivariant non-linear updates. The overall GNN encoder is -equivariant due to the use of reflection-sensitive input features (dihedral angles) combined with -equivariant GVP-GNN layers.
Our multi-state GNN encoder is easy to implement in any message passing framework and can be used as a plug-and-play extension for any geometric GNN pipeline to incorporate the multi-state inductive bias. It serves as an elegant alternative to batching all the conformations, which we found required major alterations to message passing and pooling depending on downstream tasks.
Conformation order-invariant pooling.
The final encoder representations in gRNAde account for multi-state information while being invariant to the permutation of the conformational ensemble. To achieve this, we perform a Deep Set pooling [Zaheer et al., 2017] over the conformations axis after the final encoder layer to reduce and to and :
(3) |
A simple sum or average pooling does not introduce any new learnable parameters to the pipeline and is flexible to handle a variable number of conformations, enabling both single-state and multi-state design with the same model.
Sequence decoding and loss function.
We feed the final encoder representations after pooling, , to autoregressive GVP-GNN decoder layers to predict the probability of the four possible base identities (A, G, C, U) for each node/nucleotide. Decoding proceeds according to the RNA sequence order from the 5’ end to 3’ end. gRNAde is trained in a self-supervised manner by minimising a cross-entropy loss (with label smoothing value of 0.05) between the predicted probability distribution and the ground truth identity for each base. During training, we use autoregressive teacher forcing [Williams and Zipser, 1989] where the ground truth base identity is fed as input to the decoder at each step, encouraging the model to stay close to the ground-truth sequence.
Sampling.
When using gRNAde for inference and designing new sequences, we iteratively sample the base identity for a given nucleotide from the predicted conditional probability distribution, given the partially designed sequence up until that nucleotide/decoding step. We can modulate the smoothness or sharpness of the probability distribution by using a temperature parameter. At lower temperatures, for instance ≤1.0, we expect higher native sequence recovery and lower diversity in gRNAde’s designs. At higher temperatures, the model produces more diverse designs by sampling from a smoothed probability distribution. We can also consider unordered decoding [Dauparas et al., 2022] and masking or logit biasing during sampling, depending on the design scenario at hand. This enables gRNAde to perform partial re-design of RNA sequences, retaining specified nucleotide identities while designing the rest of the sequence. Similar approaches for functional protein design have been shown to be successful in the wet lab [Sumida et al., 2024].
2.4. Evaluation metrics for designed sequences
In principle, inverse folding models can be sampled from to obtain a large number of designed sequences for a given backbone structure. Thus, in-silico metrics to determine which sequences are useful and which ones to prioritise in wet lab experiments are a critical part of the overall pipeline. We currently use the following metrics to evaluate gRNAde’s designs, visualised in Figure 3:
Native sequence recovery, which is the average percentage of native (ground truth) nucleotides correctly recovered in the sampled sequences. Recovery is the most widely used metric for biomolecule inverse design [Dauparas et al., 2022] but can be misleading in the case of RNAs where alternative nucleotide base pairings can form the same structural patterns.
Secondary structure self-consistency score, where we ‘forward fold’ the sampled sequences using a secondary structure prediction tool (we used EternaFold [Wayment-Steele et al., 2022]) and measure the average Matthew’s Correlation Coefficient (MCC) to the groundtruth secondary structure, represented as a binary adjacency matrix. MCC values range between −1 and +1, where +1 represents a perfect prediction, 0 an average random prediction and −1 an inverse prediction. This measures how well the designs recover base pairing patterns.
Tertiary structure self-consistency scores, where we ‘forward fold’ the sampled sequences using a 3D structure prediction tool (we used RhoFold [Shen et al., 2022]) and compute the average RMSD, TM-score and GDT_TS to the groundtruth C4’ coordinates to measure how well the designs recover global structural similarity and 3D conformations.
Perplexity, which can be thought of as the average number of bases that the model is selecting from for each nucleotide. Formally, perplexity is the average exponential of the negative log-likelihood of the sampled sequences. A perfect model would have perplexity of 1, while a perplexity of 4 means that the model is making random predictions (the model outputs a uniform probability over 4 possible bases). Perplexity does not require a ground truth structure to calculate, and can also be used for ranking sequences as it is the model’s estimate of the compatibility of a sequence with the input backbone structure.
Significance and limitations.
Self-consistency metrics, termed ‘designability’ (eg. scRMSD≤2Å), as well as perplexity have been found to correlate with experimental success in protein design [Watson et al., 2023]. While precise designability thresholds are yet to be established for RNA, pairs of structures with TM-score≥0.45 or GDT_TS≥0.5 are known to correspond to roughly the same fold [Zhang et al., 2022]. Another major limitation for in-silico evaluation of 3D RNA design compared to proteins is the relatively worse state of structure prediction tools [Schneider et al., 2023].
3. Experimental Setup
3D RNA structure dataset.
We create a machine learning-ready dataset for RNA inverse design using RNASolo [Adamczyk et al., 2022], a novel repository of RNA 3D structures extracted from solo RNAs, protein-RNA complexes, and DNA-RNA hybrids in the PDB. We used structures at resolution ≤4.0Å resulting in 4,223 unique RNA sequences for which a total of 12,011 structures are available (RNASolo date cutoff: 31 October 2023). Dataset statistics are available in Appendix Figure 15, illustrating the diversity of our dataset in terms of sequence length, number of structures per sequence, as well as structural variations among conformations per sequence.
Structural clustering.
In order to ensure that we evaluate gRNAde’s generalization ability to novel RNAs, we cluster the 4,223 unique RNAs into groups based on structural similarity. We use US-align [Zhang et al., 2022] with a similarity threshold of TM-score >0.45 for clustering, and ensure that we train, validate and test gRNAde on structurally dissimilar clusters (see next paragraph). We also provide utilities for clustering based on sequence homology using CD-HIT [Fu et al., 2012], which leads to splits containing biologically dissimilar clusters of RNAs.
Splits to evaluate generalization.
After clustering, we split the RNAs into training (~4000 samples), validation and test sets (100 samples each) to evaluate two different design scenarios:
Single-state split. This split is used to fairly evaluate gRNAde for single-state design on a set of RNA structures of interest from the PDB identified by Das et al. [2010], which mainly includes riboswitches, aptamers, and ribozymes. We identify the structural clusters belonging to the RNAs identified in Das et al. [2010] and add all the RNAs in these clusters to the test set (100 samples). The remaining clusters are randomly added to the training and validation splits.
Multi-state split. This split is used to test gRNAde’s ability to design RNA with multiple distinct conformational states. We order the structural clusters based on median intra-sequence RMSD among available structures within the cluster1. The top 100 samples from clusters with the highest median intra-sequence RMSD are added to the test set. The next 100 samples are added to the validation set and all remaining samples are used for training.
Validation and test samples come from clusters with at most 5 unique sequences, in order to ensure diversity. Any samples that were not assigned clusters are directly appended to the training set. We also directly add very large RNAs (> 1000 nts) to the training set, as it is unlikely that we want to design very large RNAs. We exclude very short RNA strands (< 10 nts).
Evaluation metrics.
For a given data split, we evaluate models on the held-out test set by designing 16 sequences (sampled at temperature 0.1) for each test data point and computing averages for each of the metrics described in Section 2.4: native sequence recovery, structural self-consistency scores and perplexity. We employ early stopping by reporting test set performance for the model checkpoint for the epoch with the best validation set recovery. Standard deviations are reported across 3 consistent random seeds for all models.
Hyperparameters.
All models use 4 encoder and 4 decoder GVP-GNN layers, with 128 scalar/16 vector node features, 64 scalar/4 vector edge features, and drop out probability 0.5, resulting in 2,147,944 trainable parameters. All models are trained for a maximum of 50 epochs using the Adam optimiser with an initial learning rate of 0.0001, which is reduced by a factor 0.9 when validation performance plateaus with patience of 5 epochs. Detailed ablation studies of key modelling decisions are available in Appendix D.
4. Results
4.1. Single-state RNA design benchmark
We set out to compare gRNAde to Rosetta, a state-of-the-art physically based toolkit for biomolecular modelling and design [Leman et al., 2020]. We reproduced the benchmark setup from Das et al. [2010] for Rosetta’s fixed backbone RNA sequence design workflow on 14 RNA structures of interest from the PDB, which mainly includes riboswitches, aptamers, and ribozymes (full listing in Table 2). We trained gRNAde on the single-state split detailed in Section 3, explicitly excluding the 14 RNAs as well as any structurally similar RNAs in order to ensure that we fairly evaluate gRNAde’s generalization abilities vs. Rosetta.
gRNAde improves sequence recovery over Rosetta.
In Figure 4, we compare gRNAde’s native sequence recovery for single-state design with numbers taken from Das et al. [2010] for Rosetta, FARNA (a predecessor of Rosetta), and ViennaRNA (the most popular 2D inverse folding method). gRNAde has higher recovery of 56% on average compared to 45% for Rosetta, 32% for FARNA, and 27% for ViennaRNA. See Appendix Table 2 for per-RNA results.
gRNAde is significantly faster than Rosetta.
In addition to superior sequence recovery, gRNAde is significantly faster than Rosetta for high-throughout design pipelines. Training gRNAde from scratch takes roughly 2–6 hours on a single A100 GPU, depending on the exact hyperparameters. Once trained, gRNAde can design hundreds of sequences for backbones with hundreds of nucleotides in ~1 second with GPU acceleration. On the other hand, Rosetta takes order of hours to produce a single design due to performing expensive Monte Carlo optimisations2. Deep learning methods like gRNAde are arguably easier to use since no expert customization is required and setup is easier compared to Rosetta [Dauparas et al., 2022], potentially making RNA design more broadly accessible.
gRNAde’s perplexity correlates with sequence and structural recovery.
In Figure 4b, we plot native sequence recovery per sample for Rosetta vs. gRNAde, shaded by gRNAde’s average perplexity for each sample. Perplexity is an indicator of the model’s confidence in its own prediction (lower perplexity implies higher confidence) and appears to be correlated with native sequence recovery. Additionally, visualisations of gRNAde’s designs for a riboswitch in Figure 5 show that perplexity is also correlated with structural self-consistency scores. In the subsequent Section 4.3, we further demonstrate the utility of gRNAde’s perplexity for zero-shot ranking of RNA fitness landscapes.
4.2. Multi-state RNA design benchmark
Structured RNAs often adopt multiple distinct conformational states to perform biological functions [Ken et al., 2023]. For instance, riboswitches adopt at least two distinct functional conformations: a ligand bound (holo) and unbound (apo) state, which helps them regulate and control gene expression [Stagno et al., 2017]. If we were to attempt single-state inverse design for such RNAs, each backbone structure may lead to a different set of sampled sequences. It is not obvious how to select the input backbone as well as designed sequence when using single-state models for multi-state design. gRNAde’s multi-state GNN, descibed in Section 2.3, directly ‘bakes in’ the multi-state nature of RNA into the architecture and designs sequences explicitly conditioned on multiple states.
In order to evaluate gRNAde’s multi-state design capabilities, we trained equivalent single-state and multi-state gRNAde models on the multi-state split detailed in Section 3, where the validation and test sets contain progressively more structurally flexible RNAs as measured by median RMSD among multiple available states for an RNA.
Multi-state gRNAde boosts sequence recovery.
In Figure 6a, we compared a single-state variant of gRNAde with otherwise equivalent multi-state models (with up to 3 and 5 states, respectively) in terms of native sequence recovery 3. Multi-state variants show marginal improvements, overall. As a caveat, it is worth noting that multi-state models consume more GPU memory than an equivalent single-state model during mini-batch training (approximate peak GPU usage for max. number of states = 1: 12GB, 3: 28GB, 5: 50GB on a single A100 with at most 3000 total nodes in a mini-batch).
Improved recovery in structurally flexible regions.
In Figure 6b, we evaluated gRNAde’s multi-state sequence recovery at a fine-grained, per-nucleotide level. Multi-state GNNs improve sequence recovery over the single-state variant on structurally flexible nucleotides, as characterised by undergoing changes in base pairing/secondary structure, higher average RMSD between 3D coordinates across states, and larger solvent accessible surface area.
4.3. Zero-shot ranking of RNA fitness landscape
Lastly, we explored the use of gRNAde as a zero-shot ranker of mutants in RNA engineering campaigns. Given the backbone structure of a wild type RNA of interest as well as a candidate set of mutant sequences, we can compute gRNAde’s perplexity of whether a given sequence folds into the backbone structure. Perplexity is inversely related to the likelihood of a sequence conditioned on a structure, as described in Section 2.4. We can then rank sequences based on how ‘compatible’ they are with the backbone structure in order to select a subset to be experimentally validated in wet labs.
Retrospective analysis on ribozyme fitness landscape.
A recent study by McRae et al. [2024] determined a cryo-EM structure of a dimeric RNA polymerase ribozyme at 5Å resolution4, along with fitness landscapes of ~75K mutants for the catalytic subunit 5TU and ~48K mutants for the scaffolding subunit t1. We design a retrospective study using this data of (sequence, fitness value) pairs where we simulate an RNA engineering campaign with the aim of improving catalytic subunit fitness over the wild type 5TU sequence.
We consider various design budgets ranging from hundreds to thousands of sequences selected for experimental validation, and compare 4 unsupervised approaches for ranking/selecting variants: (1) random choice from all ~75,000 sequences; (2) random choice from all 449 single mutant sequences; (3) random choice from all single and double mutant sequences (as sequences with higher mutation order tend to be less fit); and (4) negative gRNAde perplexity (lower perplexity is better). For each design budget and ranking approach, we compute the expected maximum change in fitness over the wild type that could be achieved by screening as many variants as allowed in the given design budget. We run 10,000 simulations to compute confidence intervals for the 3 random baselines.
gRNAde outperforms random baselines in low design budget scenarios.
Figure 7 illustrates the results of our retrospective study. At low design budgets of up to hundreds of sequences, which are relevant in the case of a low throughput fitness screening assay, gRNAde outperforms all random baselines in terms of the maximum change in fitness over the wild type. The top 10 mutants as ranked by gRNAde contain a sequence with 4-fold improved fitness, while the top 200 leads to a 5-fold improvement5. Note that gRNAde is used zero-shot here, i.e. it was not fine-tuned on any assay data.
Perspective.
Overall, it is promising that gRNAde’s perplexity correlates with experimental fitness measurements out-of-the-box (zero-shot) and can be a useful ranker of mutant fitness in our retrospective study. In realistic design scenarios, improvements could likely be obtained by fine-tuning gRNAde on a low amount of experimental fitness data. For example, latent features from gRNAde may be finetuned or used as input to a prediction head with supervised learning on fitness landscape data. This study acts as a sanity check before committing to wet lab validation of gRNAde designs. We see random mutagenesis and directed evolution-based approaches as complementary to de-novo design and inverse folding approaches like gRNAde. Random mutagenesis can be thought of as local exploration around a wild type sequence, optimising fitness within an ‘island’ of activity. Structure-based design approaches are akin to global jumps in sequence space, with the potential to find new islands further away from the wild type [Huang et al., 2016].
5. Conclusion
We introduce gRNAde, a geometric deep learning pipeline for RNA sequence design conditioned on one or more 3D backbone structures. gRNAde is superior to the physically based Rosetta for 3D RNA inverse folding in terms of performance, inference speed, and ease of use. Further, gRNAde enables explicit multi-state design for structurally flexible RNAs which was previously not possible with Rosetta. gRNAde’s perplexity correlates with native sequence and structural recovery, and can be used for zero-shot ranking of mutants in RNA engineering campaigns. To the best of our knowledge, gRNAde is also the first geometric deep learning architecture for multi-state biomolecule representation learning; the model is generic and can be repurposed for other learning tasks on conformational ensembles, including multi-state protein design.
Key avenues for future development of gRNAde include supporting multiple interacting chains, accounting for partner molecules with RNAs, and supporting negative design against undesired conformations. We discuss practical tradeoffs to using gRNAde in real-world RNA design scenarios in Appendix C, including limitations due to the current state of 3D RNA structure prediction tools. Finally, we are hopeful that advances in RNA structure determination and computationally assisted cryo-EM [Kappel et al., 2020, Bonilla and Kieft, 2022] will further increase the amount of RNA structures available for training geometric deep learning models in the future.
Acknowledgements
We would like to thank Roger Foo, Phillip Holliger, Alex Borodavka, Rhiju Das, Janusz Bujnicki, Edoardo Gianni, Ben Porebski, Michael Mohsen, Christian Choe, and John Boom for helpful comments and discussions. CKJ was supported by the A*STAR Singapore National Science Scholarship (PhD). SVM was supported by the UKRI Centre for Doctoral Training in Application of Artificial Intelligence to the study of Environmental Risks (EP/S022961/1). AM was supported by a U.S. NSF grant (DBI2308699) and two U.S. NIH grants (R01GM093123 and R01GM146340). This research was partially supported by Google TPU Research Cloud and Cambridge Dawn Supercomputer Pioneer Project compute grants.
A. 3D Visualisation of gRNAde Designs
B. Related Work
We attempt to briefly summarise recent developments in RNA structure modelling and design, with an emphasis on deep learning-based approaches.
RNA inverse folding.
Most tools for RNA inverse folding focus on secondary structure without considering 3D geometry [Churkin et al., 2018, Runge et al., 2019] and approach the problem from the lens of energy optimisation [Ward et al., 2023]. Rosetta fixed backbone re-design [Das et al., 2010] is the only energy optimisation-based approach that accounts for 3D structure. Deep neural networks such as gRNAde can incorporate 3D structural constraints and are orders of magnitude faster than optimisation-based approaches; this is particularly attractive for high-throughput design pipelines as solving the inverse folding optimisation problem is NP hard [Bonnet et al., 2020].
RNA structure design.
Inverse folding models for protein design have often been coupled with backbone generation models which design structural backbones conditioned on various design constraints [Watson et al., 2023, Ingraham et al., 2023, Didi et al., 2023]. Current approaches for RNA backbone design use classical (non-learnt) algorithms for aligning 3D RNA motifs [Han et al., 2017, Yesselman et al., 2019], which are small modular pieces of RNA that are believed to fold independently. Such algorithms may be restricted by the use of hand-crafted heuristics and we plan to explore data-driven generative models for RNA backbone design in future work.
RNA structure prediction.
There have been several recent efforts to adapt protein folding architectures such as AlphaFold2 [Jumper et al., 2021] and RosettaFold [Baek et al., 2021] for RNA structure prediction [Li et al., 2023b, Wang et al., 2023, Baek et al., 2024]. A previous generation of models used GNNs as ranking functions together with Rosetta energy optimisation [Watkins et al., 2020, Townshend et al., 2021]. None of these architectures aim at capturing conformational flexibility of RNAs, unlike gRNAde which represents RNAs as multi-state conformational ensembles. Neither can structure prediction tools be used for RNA design tasks as they are not generative models.
RNA language models.
Self-supervised language models have been developed for predictive and generative tasks on RNA sequences, including general-purpose models such as RNA FM [Chen et al., 2022] and RiNaLMo [Penic et al., 2024] as well as mRNA-specific CodonBERT [Li et al., 2023a]. RNA sequence data repositories are orders of magnitude larger than those for RNA structure (eg. RiNaLMo is trained on 36 million sequences). However, standard language models can only implicitly capture RNA structure and dynamics through sequence co-occurence statistics, which can pose a chellenge for designing structured RNAs such as riboswitches, aptamers, and ribozymes. RibonanzaNet [He et al., 2024] represents a recent effort in developing structure-informed RNA language models by supervised training on experimental readouts from chemical mapping, although RibonanzaNet cannot be used for RNA design. Inverse folding methods like gRNAde are language models conditioned on 3D structure, making them a natural choice for structure-based design.
C. FAQs on using gRNAde
How to chose the number of states to provide as input to gRNAde?
In general, this would depend on the design objective. For instance, designing riboswitches may necessitate multi-state design, while a single-state pipeline may be more sensible for locking an aptamer into its bound conformation [Yesselman et al., 2019]. Note that it may be possible to benefit from multi-state gRNAde models even when performing single-state design by using slightly noised variations of the same backbone structure as an input conformational ensemble.
How to prioritise or chose amongst designed sequences?
We have currently provided 3 types of evaluation metrics: native sequence recovery, structural self-consistency scores and perplexity, towards this end. We suspect that recovery may not be the ideal choice, except for design scenarios where we require certain regions of the RNA sequence to be conserved or native-like. Self-consistency scores may provide an overall more holistic evaluation metric as they accounts for alternative base pairings which still lead to similar structures as well as better capture the recovery of structural motifs responsible for functionality. However, structural self-consistency scores inherit the limitations of the structure prediction methods used as part of their computation. For instance, computing the self-consistency score between an RNA backbone and its own native sequence provides an upper bounds on the maximum score that designs can obtain under a given structure prediction method. Lastly, gRNAde’s perplexity estimates the likelihood of a sequence given a backbone and can be useful for ranking designs and mutants in RNA engineering campaigns (especially for design scenarios where structure prediction tools are not performant).
In real-world design scenarios, we can pair gRNAde with another machine learning model (an ‘oracle’) for ranking or predicting the suitability of designed sequences for the objective (for instance, binding affinity or some other notion of fitness). We hope to conduct further experimental validation of gRNAde designs in the wet lab in order to better understand these tradeoffs.
Why not average single-state logits over multiple states for multi-state design?
ProteinMPNN [Dauparas et al., 2022] proposes to average logits from multiple backbones for multi-state protein design. Here is a simple example to highlight issues with such an approach: Consider two states A and B, and choice of labels X, Y, and Z. For state A: X, Y, Z are assigned probabilities 75%, 20%, 5%. For state B: X, Y, Z are assigned probabilities 5%, 20%, 75%. Logically, label Y is the only one that is compatible with both states. However, averaging the probabilities would lead to label X or Z being more likely to be sampled in designs. As an alternative, gRNAde is based on multi-state GNNs which can take as input one or more backbone structures and generate sequences conditioned on the conformational ensemble directly.
D. Ablation Study
Table 1 presents an ablation study as well as aggregated benchmark for various configurations of gRNAde. Key takeaways are highlighted below. Note that all results in the main paper are reported for models trained on the maximum length of 5000 nucleotides using autoregressive decoding and rotation-equivariant GNN layers, as this lead to the lowest perplexity values.
Table 1:
Split | Max. #states | Model | GNN | Max. train length | Perplexity (↓) | Native seq. recovery (↑) | Self-consistency metrics |
|||
---|---|---|---|---|---|---|---|---|---|---|
2D – EternaFold scMCC (↑) | scRMSD (↓) | 3D – RhoFold scTM-score (↑) | scGDT_TS (↑) | |||||||
| ||||||||||
Single-state split | 1 | AR | Equiv | 500 | 1.77±0.07 | 0.438±0.01 | 0.624±0.07 | 13.01±1.18 (0.5%) | 0.21±0.0 (14.3%) | 0.22±0.0 (12.7%) |
1 | AR | Equiv | 1000 | 1.73±0.08 | 0.453±0.01 | 0.648±0.01 | 13.10±0.58 (1.0%) | 0.20±0.0 (10.8%) | 0.21±0.0 (10.6%) | |
1 | AR | Equiv | 2500 | 1.41±0.01 | 0.493±0.01 | 0.633±0.03 | 11.76±0.91 (1.4%) | 0.27±0.0 (28.8%) | 0.27±0.0 (28.0%) | |
1 | AR | Equiv | 5000 | 1.29±0.02 | 0.530±0.01 | 0.585±0.03 | 11.70±0.56 (1.3%) | 0.26±0.0 (24.8%) | 0.25±0.0 (20.1%) | |
|
||||||||||
1 | AR | Inv | 5000 | 1.32±0.04 | 0.549±0.00 | 0.612±0.02 | 11.50±0.64 (1.9%) | 0.28±0.0 (32.1%) | 0.28±0.0 (26.2%) | |
|
||||||||||
1 | NAR | Inv | 5000 | 1.54±0.04 | 0.571±0.00 | 0.430±0.02 | 14.26±0.51 (1.3%) | 0.19±0.0 (15.9%) | 0.18±0.0 (12.7%) | |
1 | NAR | Equiv | 5000 | 1.46±0.06 | 0.584±0.00 | 0.473±0.02 | 13.04±0.88 (1.3%) | 0.23±0.0 (24.0%) | 0.22±0.0 (17.9%) | |
| ||||||||||
3 | AR | Equiv | 5000 | 1.23±0.05 | 0.539±0.01 | 0.620±0.01 | 11.47±1.05 (2.5%) | 0.28±0.0 (31.4%) | 0.28±0.0 (27.2%) | |
5 | AR | Equiv | 5000 | 1.25±0.01 | 0.539±0.02 | 0.596±0.03 | 11.90±1.00 (2.9%) | 0.27±0.0 (31.6%) | 0.26±0.0 (26.4%) | |
| ||||||||||
Groundtruth sequence prediction baseline: | - | 1.000±0.00 | 0.686±0.00 | 5.23±0.07 (27.9%) | 0.56±0.0 (68.7%) | 0.55±0.0 (68.7%) | ||||
Random sequence prediction baseline: | - | 0.251±0.00 | 0.012±0.00 | 24.40±0.34 (0.0%) | 0.04±0.0 (0.0%) | 0.02±0.0 (0.0%) | ||||
ViennaRNA 2D-only baseline: | - | 0.259±0.00 | 0.611±0.00 | 20.34±0.10 (0.0%) | 0.07±0.0 (0.6%) | 0.07±0.0 (1.1%) | ||||
| ||||||||||
Multi-state split | 1 | AR | Equiv | 500 | 1.87±0.06 | 0.445±0.01 | 0.603±0.03 | 13.08±0.20 (3.5%) | 0.10±0.0 (1.2%) | 0.25±0.0 (20.7%) |
1 | AR | Equiv | 1000 | 1.84±0.01 | 0.447±0.01 | 0.580±0.01 | 13.02±0.56 (2.3%) | 0.09±0.0 (0.9%) | 0.25±0.0 (20.4%) | |
1 | AR | Equiv | 2500 | 1.73±0.04 | 0.480±0.02 | 0.567±0.01 | 12.83±0.05 (3.4%) | 0.10±0.0 (1.9%) | 0.26±0.0 (21.2%) | |
1 | AR | Equiv | 5000 | 1.68±0.03 | 0.455±0.01 | 0.569±0.02 | 12.88±0.20 (4.1%) | 0.11±0.0 (1.6%) | 0.26±0.0 (22.6%) | |
|
||||||||||
1 | AR | Inv | 5000 | 1.72±0.01 | 0.463±0.01 | 0.559±0.03 | 13.09±0.27 (4.1%) | 0.10±0.0 (2.2%) | 0.27±0.0 (23.0%) | |
|
||||||||||
1 | NAR | Inv | 5000 | 2.01±0.04 | 0.457±0.01 | 0.461±0.01 | 14.06±0.23 (3.2%) | 0.08±0.0 (1.7%) | 0.23±0.0 (16.5%) | |
1 | NAR | Equiv | 5000 | 1.89±0.06 | 0.432±0.01 | 0.423±0.01 | 13.63±0.27 (3.6%) | 0.09±0.0 (1.2%) | 0.24±0.0 (18.3%) | |
| ||||||||||
3 | AR | Equiv | 5000 | 1.60±0.03 | 0.467±0.03 | 0.561±0.03 | 13.31±0.38 (3.4%) | 0.10±0.0 (2.6%) | 0.24±0.0 (19.0%) | |
5 | AR | Equiv | 5000 | 1.55±0.04 | 0.473±0.01 | 0.549±0.03 | 13.48±0.79 (3.3%) | 0.10±0.0 (3.0%) | 0.24±0.0 (20.2%) | |
| ||||||||||
Groundtruth sequence prediction baseline: | - | 1.000±0.00 | 0.570±0.01 | 9.78±0.13 (10.3%) | 0.16±0.0 (11.7%) | 0.36±0.0 (36.7%) | ||||
Random sequence prediction baseline: | - | 0.249±0.00 | 0.128±0.00 | 21.15±0.21 (0.9%) | 0.02±0.0 (0.0%) | 0.09±0.0 (3.3%) | ||||
ViennaRNA 2D-only baseline: | - | 0.258±0.00 | 0.601±0.00 | 15.47±0.20 (2.4%) | 0.05±0.0 (0.2%) | 0.19±0.0 (15.2%) | ||||
| ||||||||||
All data | 1 | AR | Equiv | 5000 | 1.23±0.01 | 0.733±0.00 | 0.627±0.02 | 8.10±0.28 (20.7%) | 0.42±0.0 (46.1%) | 0.41±0.0 (43.0%) |
2 | AR | Equiv | 5000 | 1.21±0.01 | 0.783±0.01 | 0.629±0.03 | 8.40±0.09 (19.1%) | 0.42±0.0 (47.8%) | 0.41±0.0 (41.7%) | |
3 | AR | Equiv | 5000 | 1.19±0.01 | 0.787±0.01 | 0.606±0.02 | 7.88±0.68 (20.5%) | 0.43±0.0 (47.4%) | 0.42±0.0 (44.0%) | |
5 | AR | Equiv | 5000 | 1.15±0.01 | 0.811±0.01 | 0.617±0.02 | 7.51±0.30 (20.7%) | 0.45±0.0 (50.2%) | 0.44±0.0 (46.7%) |
Max. train RNA length
Limiting the maximum length of RNAs used for training can be seen as ablating the use of ribosomal RNA families (which are thousands of nucleotides long and form complexes with specialised ribosomal proteins). We find that training on only short RNAs fewer than 1000s of nucleotides leads to worse sequence recovery and 3D self-consistency scores, even though it improves 2D self-consistency across both evaluation splits. This suggests that tertiary interactions learnt from ribosomal RNAs can generalise to other RNA families to some extent (large ribosomal RNAs were excluded from test sets).
GNN
We ablated whether the internal representations of the GVP-GNN are rotation invariant or equivariant. Equivariant GNNs are theoretically more expressive [Joshi et al., 2023] and we do find them more capable at fitting the training distribution (as shown by lower perplexity). However, we do not find significant differences in terms of other performance metrics across different GNN layers.
Model
‘AR’ implies autoregressive decoding (described in Section 2.3, uses 4 encoder and 4 decoder layers), while ‘NAR’ implies non-autoregressive, one-shot decoding using an MLP (uses 8 encoder layers). Across both evaluation splits, AR models show significantly higher self-consistency scores than NAR, even though NAR lead to higher sequence recovery. AR is more expressive and can condition predictions at each decoding step on past predictions, while one-shot NAR samples from independent probability distributions for each nucleotide. Thus, AR is a better inductive bias for predicting base pairing and base stacking interactions that are drivers of RNA structure [Vicens and Kieft, 2022]. For instance, G-C and A-U pairs can often be swapped for one another, but non-autoregressive decoding does not capture such paired constraints.
Max. #states
We evaluate the impact of increasing the maximum number of states as input to gRNAde. Multi-state models marginally improve native sequence recovery as well as structural self-consistency scores over an equivalent single state variant, even for the single-state benchmark where the multi-state model is being used with only one state as input. This suggests that seeing multiple states during training can be useful for gRNAde’s performance even for single-state design tasks.
Non-learnt baselines.
We report the performance of two non-learnt baselines to contextualise gRNAde’s performance: for each test sample, simply predicting the groundtruth sequence back and predicting a random sequence. Structural self-consistency scores for the Groundtruth baseline provides a rough upper bounds on the maximum score that any gRNAde designs can theoretically obtain given the current state of 2D/3D structure predictors being used. gRNAde always performs better than the random baseline and often reaches 2D self-consistency scores close to the upper bound. Both 2D and 3D self-consistency scores are inherently limited by the performance of the structure prediction methods used.
2D inverse folding baseline.
We additionally report results for ViennaRNA’s 2D-only inverse folding method to further demonstrate the utility of 3D inverse folding. ViennaRNA has improved 2D self-consistency scores over gRNAde but fails to capture tertiary interactions in its designs, as evident by poor recovery and 3D self-consistency scores similar to the random baseline.
Split.
Single- and multi-state splits are described in Section 3; the multi-state split is relatively harder than the single-state split based on overall reduced performance for all baselines and models. Models trained on ‘All data’ use all RNASolo samples for training, solely for the purpose of releasing the best possible gRNAde checkpoints for real-world usage. Evaluation metrics for ‘All data’ are reported on the single-state test set.
E. Additional Results
Table 2:
PDB ID | Description | ViennaRNA Recovery |
FARNA Recovery |
Rosetta Recovery |
gRNAde (single-state) | ||
---|---|---|---|---|---|---|---|
Recovery | Perplexity | 2D self-cons. | |||||
| |||||||
1CSL | RRE high affinity site | 0.25 | 0.20 | 0.44 | 0.5719 | 1.2812 | 0.8644 |
1ET4 | Vitamin B12 binding RNA aptamer | 0.25 | 0.34 | 0.44 | 0.6250 | 1.3457 | −0.0135 |
1F27 | Biotin-binding RNA pseudoknot | 0.30 | 0.36 | 0.37 | 0.3437 | 1.6203 | 0.4523 |
1L2X | Viral RNA pseudoknot | 0.24 | 0.45 | 0.48 | 0.4721 | 1.3181 | 0.5692 |
1LNT | RNA internal loop of SRP | 0.33 | 0.27 | 0.53 | 0.5843 | 1.4337 | 0.1379 |
1Q9A | Sarcin/ricin domain from E.coli 23S rRNA | 0.27 | 0.40 | 0.41 | 0.5044 | 1.3411 | 0.0597 |
4FE5 | Guanine riboswitch aptamer | 0.29 | 0.28 | 0.36 | 0.5300 | 1.3824 | 0.9116 |
1X9C | All-RNA hairpin ribozyme | 0.26 | 0.31 | 0.50 | 0.5000 | 1.3905 | 0.6630 |
1XPE | HIV-1 B RNA dimerization initiation site | 0.27 | 0.24 | 0.40 | 0.7037 | 1.2177 | 0.7768 |
2GCS | Pre-cleavage state of glmS ribozyme | 0.25 | 0.26 | 0.44 | 0.5078 | 1.3053 | 0.4062 |
2GDI | Thiamine pyrophosphate-specific riboswitch | 0.25 | 0.38 | 0.48 | 0.6500 | 1.2363 | −0.0251 |
2OEU | Junctionless hairpin ribozyme | 0.23 | 0.30 | 0.37 | 0.9519 | 1.0913 | 0.7768 |
2R8S | Tetrahymena ribozyme P4-P6 domain | 0.27 | 0.36 | 0.53 | 0.5689 | 1.1881 | 0.7281 |
354D | Loop E from E. coli 5S rRNA | 0.28 | 0.35 | 0.55 | 0.4410 | 1.4938 | 0.0430 |
| |||||||
Overall recovery: | 0.27 | 0.32 | 0.45 | 0.5682 |
F. Additional Figures
Figure 13: gRNAde model architecture.
Figure 14: Multi-graph tensor representation of RNA conformational ensembles.
Listing 1: Pseudocode for multi-state GNN encoder layer.
Figure 15: RNASolo data statistics.
Listing 1:
1 class MultiGVPConv ( MessagePassing ): |
2 ’’’ GVPConv for handling multiple conformations ’’’ |
3 def __init__ (self , ...) : |
5 ... |
6 |
7 def forward (self, x_s , x_v , edge_index, edge_attr ): |
8 |
9 # stack scalar feats along axis 1: |
10 # [ n_nodes , n_conf , d_s ] -> [n_nodes , n_conf * d_s] |
11 x_s = x_s. view (x_s . shape [0] , x_s. shape [1] * x_s. shape [2]) |
12 |
13 # stack vector feat along axis 1: |
14 # [ n_nodes , n_conf , d_v , 3] -> [ n_nodes , n_conf * d_v *3] |
15 x_v = x_v. view (x_v . shape [0] , x_v. shape [1] * x_v. shape [2]*3) |
16 |
17 # message passing and aggregation |
18 message = self . propagate ( |
19 edge_index , s=x_s , v=x_v , edge_attr = edge_attr ) |
20 |
21 # split scalar and vector channels |
22 return _split_multi ( message , d_s , d_v , n_conf ) |
23 |
24 def message (self , s_i , v_i , s_j , v_j , edge_attr ): |
25 |
26 # unstack scalar feats: |
27 # [ n_nodes , n_conf * d] -> [ n_nodes , n_conf , d_s] |
28 s_i = s_i. view (s_i . shape [0] , s_i. shape [1]// d_s , d_s) |
29 s_j = s_j. view (s_j . shape [0] , s_j. shape [1]// d_s , d_s) |
30 |
31 # unstack vector feats: |
32 # [ n_nodes , n_conf * d_v *3] -> [ n_nodes , n_conf , d_v , 3] |
33 v_i = v_i. view (v_i . shape [0] , v_i. shape [1]//( d_v *3) , d_v , 3) |
34 v_j = v_j. view (v_j . shape [0] , v_j. shape [1]//( d_v *3) , d_v , 3) |
35 |
36 # message function for edge j-i |
37 message = tuple_cat (( s_j , v_j), edge_attr , (s_i , v_i)) |
38 message = self . message_func ( message ) # GVP |
39 |
40 # merge scalar and vector channels along axis 1 |
41 return _merge_multi (* message ) |
42 |
43 def _split_multi (x, d_s , d_v , n_conf ): |
44 ’’’ |
45 Splits a merged representation of (s, v) back into a tuple. |
46 ’’’ |
47 s = x[... , :-3 * d_v * n_conf ]. view (x. shape [0] , n_conf , d_s) |
48 v = x[... , -3 * d_v * n_conf :]. view (x. shape [0] , n_conf , d_v , 3) |
49 return s , v |
50 |
51 def _merge_multi(s, v): |
52 ’’’ |
53 Merges a tuple (s, v) into a single ‘torch.Tensor‘, |
54 where the vector channels are flattened and |
55 appended to the scalar channels. |
56 ’’’ |
57 # s: [ n_nodes , n_conf , d] -> [ n_nodes , n_conf * d_s ] |
58 s = s. view (s. shape [0] , s. shape [1] * s. shape [2]) |
59 # v: [ n_nodes , n_conf , d, 3] -> [ n_nodes , n_conf * d_v *3] |
60 v = v. view (v. shape [0] , v. shape [1] * v. shape [2]*3) |
61 return torch .cat ([s, v], −1) |
Footnotes
For each RNA sequence, we compute the pairwise C4’ RMSD among all available structures. We then compute the median RMSD across all sequences within each structural cluster.
While we have not run Rosetta ourselves, we note that its documentation states that “runs on RNA backbones longer than ~ten nucleotides take many minutes or hours”.
Self-consistency scores for multi-state design are perhaps less reliable than single-state design due to the lack of multi-state structure prediction tools. At present, we compute the average score between multiple ground truth structures and the predicted structure for designed sequences.
This RNA was not present in gRNAde’s training data, which contains structures at ≤4.0Å resolution.
As a caveat, the fitness assays from McRae et al. [2024] used for creating the landscape have inherent noise and cannot easily differentiate between mutants of similar activity.
Open source code: github.com/chaitjo/geometric-rna-design
References
- Adamczyk B., Antczak M., and Szachniuk M.. Rnasolo: a repository of cleaned pdb-derived rna 3d structures. Bioinformatics, 2022. (Cited on page 2, 6) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Baek M., DiMaio F., Anishchenko I., Dauparas J., Ovchinnikov S., Lee G. R., Wang J., Cong Q., Kinch L. N., Schaeffer R. D., et al. Accurate prediction of protein structures and interactions using a three-track neural network. Science, 2021. (Cited on page 16) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Baek M., McHugh R., Anishchenko I., Jiang H., Baker D., and DiMaio F.. Accurate prediction of protein–nucleic acid complexes using rosettafoldna. Nature Methods, 2024. (Cited on page 16) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bonilla S. L. and Kieft J. S.. The promise of cryo-em to explore rna structural dynamics. Journal of Molecular Biology, 2022. (Cited on page 10) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bonnet E., Rzazewski P., and Sikora F.. Designing rna secondary structures is hard. Journal of Computational Biology, 2020. (Cited on page 16) [DOI] [PubMed] [Google Scholar]
- Bronstein M. M., Bruna J., Cohen T., and Velickovic P.. Geometric deep learning: Grids, groups, graphs, geodesics, and gauges. arXiv preprint, 2021. (Cited on page 2) [Google Scholar]
- Chen J., Hu Z., Sun S., Tan Q., Wang Y., Yu Q., Zong L., Hong L., Xiao J., Shen T., et al. Interpretable rna foundation model from unannotated data for highly accurate rna structure and function predictions. arXiv preprint, 2022. (Cited on page 16) [Google Scholar]
- Churkin A., Retwitzer M. D., Reinharz V., Ponty Y., Waldispühl J., and Barash D.. Design of rnas: comparing programs for inverse rna folding. Briefings in bioinformatics, 2018. (Cited on page 2, 16) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Damase T. R., Sukhovershin R., Boada C., Taraballi F., Pettigrew R. I., and Cooke J. P.. The limitless future of rna therapeutics. Frontiers in bioengineering and biotechnology, 2021. (Cited on page 2) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Das R., Karanicolas J., and Baker D.. Atomic accuracy in predicting and designing noncanonical rna structure. Nature methods, 2010. (Cited on page 1, 2, 6, 7, 14, 16, 19) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dauparas J., Anishchenko I., Bennett N., Bai H., Ragotte R. J., Milles L. F., Wicky B. I., et al. Robust deep learning based protein sequence design using proteinmpnn. Science, 2022. (Cited on page 2, 3, 5, 7, 17) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dawson W. K., Maciejczyk M., Jankowska E. J., and Bujnicki J. M.. Coarse-grained modeling of rna 3d structure. Methods, 2016. (Cited on page 3) [DOI] [PubMed] [Google Scholar]
- Didi K., Vargas F., Mathis S., Dutordoir V., Mathieu E., Komorowska U. J., and Lio P.. A framework for conditional diffusion modelling with applications in motif scaffolding for protein design. In NeurIPS 2023 Machine Learning for Structural Biology Workshop, 2023. (Cited on page 16) [Google Scholar]
- Doudna J. A. and Charpentier E.. The new frontier of genome engineering with crispr-cas9. Science, 2014. (Cited on page 2) [DOI] [PubMed] [Google Scholar]
- Duval A., Mathis S. V., Joshi C. K., Schmidt V., Miret S., Malliaros F. D., Cohen T., Lio P., Bengio Y., and Bronstein M.. A hitchhiker’s guide to geometric gnns for 3d atomic systems. arXiv preprint, 2023. (Cited on page 2) [Google Scholar]
- Felletti M., Stifel J., Wurmthaler L. A., Geiger S., and Hartig J. S.. Twister ribozymes as highly versatile expression platforms for artificial riboswitches. Nature communications, 2016. (Cited on page 2) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fey M. and Lenssen J. E.. Fast graph representation learning with pytorch geometric. ICLR 2019 Representation Learning on Graphs and Manifolds Workshop, 2019. (Cited on page 4) [Google Scholar]
- Fu L., Niu B., Zhu Z., Wu S., and Li W.. Cd-hit: accelerated for clustering the next-generation sequencing data. Bioinformatics, 2012. (Cited on page 6) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ganser L. R., Kelly M. L., Herschlag D., and Al-Hashimi H. M.. The roles of structural dynamics in the cellular functions of rnas. Nature reviews Molecular cell biology, 2019. (Cited on page 2) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Han D., Qi X., Myhrvold C., Wang B., Dai M., Jiang S., Bates M., Liu Y., An B., Zhang F., et al. Single-stranded dna and rna origami. Science, 2017. (Cited on page 2, 16) [DOI] [PMC free article] [PubMed] [Google Scholar]
- He S., Huang R., Townley J., Kretsch R. C., Karagianes T. G., Cox D. B., Blair H., Penzar D., Vyaltsev V., Aristova E., et al. Ribonanza: deep learning of rna structure through dual crowdsourcing. bioRxiv, 2024. (Cited on page 16) [Google Scholar]
- Hoetzel J. and Suess B.. Structural changes in aptamers are essential for synthetic riboswitch engineering. Journal of Molecular Biology, 2022. (Cited on page 2) [DOI] [PubMed] [Google Scholar]
- Huang P.-S., Boyken S. E., and Baker D.. The coming of age of de novo protein design. Nature, 2016. (Cited on page 3, 10) [DOI] [PubMed] [Google Scholar]
- Ingraham J., Garg V., Barzilay R., and Jaakkola T.. Generative models for graph-based protein design. NeurIPS, 2019. (Cited on page 3) [Google Scholar]
- Ingraham J. B., Baranov M., Costello Z., Barber K. W., Wang W., Ismail A., Frappier V., Lord D. M., Ng-Thow-Hing C., Van Vlack E. R., et al. Illuminating protein space with a programmable generative model. Nature, 2023. (Cited on page 16) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jing B., Eismann S., Suriana P., Townshend R. J. L., and Dror R.. Learning from protein structure with geometric vector perceptrons. In International Conference on Learning Representations, 2020. (Cited on page 3, 4, 20) [Google Scholar]
- Joshi C. K., Bodnar C., Mathis S. V., Cohen T., and Lio P.. On the expressive power of geometric graph neural networks. In International Conference on Machine Learning, 2023. (Cited on page 18) [Google Scholar]
- Jumper J., Evans R., Pritzel A., Green T., Figurnov M., Ronneberger O., Tunyasuvunakool K., Bates R., Zidek A., Potapenko A., et al. Highly accurate protein structure prediction with alphafold. Nature, 2021. (Cited on page 2, 16) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kappel K., Zhang K., Su Z., Watkins A. M., Kladwang W., Li S., Pintilie G., Topkar V. V., Rangan R., Zheludev I. N., et al. Accelerated cryo-em-guided determination of three-dimensional rna-only structures. Nature methods, 2020. (Cited on page 10) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ken M. L., Roy R., Geng A., Ganser L. R., Manghrani A., Cullen B. R., Schulze-Gahmen U., Herschlag D., and Al-Hashimi H. M.. Rna conformational propensities determine cellular activity. Nature, 2023. (Cited on page 2, 3, 8) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Leman J. K., Weitzner B. D., Lewis S. M., Adolf-Bryfogle J., Alam N., Alford R. F., Aprahamian M., Baker D., Barlow K. A., Barth P., et al. Macromolecular modeling and design in rosetta: recent methods and frameworks. Nature methods, 2020. (Cited on page 2, 7) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Leppek K., Das R., and Barna M.. Functional 5’ utr mrna structures in eukaryotic translation regulation and how to find them. Nature reviews Molecular cell biology, 2018. (Cited on page 2) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li S., Moayedpour S., Li R., Bailey M., Riahi S., Kogler-Anele L., Miladi M., Miner J., Zheng D., Wang J., et al. Codonbert: Large language models for mrna design and optimization. bioRxiv, 2023a. (Cited on page 16) [Google Scholar]
- Li Y., Zhang C., Feng C., Pearce R., Lydia Freddolino P., and Zhang Y.. Integrating end-to-end learning with deep geometrical potentials for ab initio rna structure prediction. Nature Communications, 2023b. (Cited on page 16) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mandal M. and Breaker R. R.. Gene regulation by riboswitches. Nature reviews Molecular cell biology, 2004. (Cited on page 2) [DOI] [PubMed] [Google Scholar]
- McRae E. K., Wan C. J., Kristoffersen E. L., Hansen K., Gianni E., Gallego I., Curran J. F., Attwater J., Holliger P., and Andersen E. S.. Cryo-em structure and functional landscape of an rna polymerase ribozyme. Proceedings of the National Academy of Sciences, 2024. (Cited on page 2, 9, 10, 19) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Metkar M., Pepin C. S., and Moore M. J.. Tailor made: the art of therapeutic mrna design. Nature Reviews Drug Discovery, 2024. (Cited on page 2) [DOI] [PubMed] [Google Scholar]
- Mohsen M. G., Midy M. K., Balaji A., and Breaker R. R.. Exploiting natural riboswitches for aptamer engineering and validation. Nucleic Acids Research, 2023. (Cited on page 2) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mustafina K., Fukunaga K., and Yokobayashi Y.. Design of mammalian on-riboswitches based on tandemly fused aptamer and ribozyme. ACS Synthetic Biology, 2019. (Cited on page 2) [DOI] [PubMed] [Google Scholar]
- Penic R. J., Vlasic T., Huber R. G., Wan Y., and Sikic M.. Rinalmo: General-purpose rna language models can generalize well on structure prediction tasks. arXiv preprint, 2024. (Cited on page 16) [Google Scholar]
- Runge F., Stoll D., Falkner S., and Hutter F.. Learning to design RNA. In ICLR, 2019. (Cited on page 16) [Google Scholar]
- Schneider B., Sweeney B. A., Bateman A., Cerny J., Zok T., and Szachniuk M.. When will rna get its alphafold moment? Nucleic Acids Research, 2023. (Cited on page 2, 6) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shen T., Hu Z., Peng Z., Chen J., Xiong P., Hong L., Zheng L., Wang Y., King I., Wang S., et al. E2efold-3d: End-to-end deep learning method for accurate de novo rna 3d structure prediction. arXiv preprint, 2022. (Cited on page 5) [Google Scholar]
- Stagno J., Liu Y., Bhandari Y., Conrad C., Panja S., Swain M., Fan L., Nelson G., Li C., Wendel D., et al. Structures of riboswitch rna reaction states by mix-and-inject xfel serial crystallography. Nature, 2017. (Cited on page 8) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sumida K. H., Nunez-Franco R., Kalvet I., Pellock S. J., Wicky B. I., Milles L. F., Dauparas J., Wang J., Kipnis Y., Jameson N., et al. Improving protein expression, stability, and function with proteinmpnn. Journal of the American Chemical Society, 2024. (Cited on page 5) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Townshend R. J., Eismann S., Watkins A. M., Rangan R., Karelina M., Das R., and Dror R. O.. Geometric deep learning of rna structure. Science, 2021. (Cited on page 16) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vicens Q. and Kieft J. S.. Thoughts on how to think (and talk) about rna structure. Proceedings of the National Academy of Sciences, 2022. (Cited on page 18) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wadley L. M., Keating K. S., Duarte C. M., and Pyle A. M.. Evaluating and learning from rna pseudotorsional space: quantitative validation of a reduced representation for rna structure. Journal of molecular biology, 2007. (Cited on page 3) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang W., Feng C., Han R., Wang Z., Ye L., Du Z., Wei H., Zhang F., Peng Z., and Yang J.. trrosettarna: automated prediction of rna 3d structure with transformer network. Nature Communications, 2023. (Cited on page 16) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ward M., Courtney E., and Rivas E.. Fitness functions for rna structure design. Nucleic Acids Research, 2023. (Cited on page 16) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Watkins A. M., Rangan R., and Das R.. Farfar2: improved de novo rosetta prediction of complex global rna folds. Structure, 2020. (Cited on page 16) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Watson J. L., Juergens D., Bennett N. R., Trippe B. L., Yim J., Eisenach H. E., Ahern W., Borst A. J., Ragotte R. J., Milles L. F., et al. De novo design of protein structure and function with rfdiffusion. Nature, 2023. (Cited on page 2, 6, 16) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wayment-Steele H. K., Kladwang W., Strom A. I., Lee J., Treuille A., Becka A., Participants E., and Das R.. Rna secondary structure packages evaluated and improved by high-throughput experiments. Nature methods, 2022. (Cited on page 5) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Williams R. J. and Zipser D.. A learning algorithm for continually running fully recurrent neural networks. Neural computation, 1989. (Cited on page 5) [Google Scholar]
- Yesselman J. D., Eiler D., Carlson E. D., Gotrik M. R., d’Aquino A. E., Ooms A. N., Kladwang W., Carlson P. D., Shi X., Costantino D. A., et al. Computational design of three-dimensional rna structure and function. Nature nanotechnology, 2019. (Cited on page 2, 16) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zaheer M., Kottur S., Ravanbakhsh S., Poczos B., Salakhutdinov R. R., and Smola A. J.. Deep sets. NeurIPS, 2017. (Cited on page 4, 20) [Google Scholar]
- Zhang C., Shine M., Pyle A. M., and Zhang Y.. Us-align: universal structure alignments of proteins, nucleic acids, and macromolecular complexes. Nature methods, 2022. (Cited on page 6) [DOI] [PubMed] [Google Scholar]
- Zhu Y., Zhu L., Wang X., and Jin H.. Rna-based therapeutics: An overview and prospectus. Cell Death & Disease, 2022. (Cited on page 2) [DOI] [PMC free article] [PubMed] [Google Scholar]