Skip to main content
Biophysics Reviews logoLink to Biophysics Reviews
. 2024 Nov 5;5(4):041304. doi: 10.1063/5.0237319

AI-integrated network for RNA complex structure and dynamic prediction

Haoquan Liu 1, Chen Zhuo 1, Jiaming Gao 1, Chengwei Zeng 1, Yunjie Zhao 1,a)
PMCID: PMC11540444  PMID: 39512332

Abstract

RNA complexes are essential components in many cellular processes. The functions of these complexes are linked to their tertiary structures, which are shaped by detailed interface information, such as binding sites, interface contact, and dynamic conformational changes. Network-based approaches have been widely used to analyze RNA complex structures. With their roots in the graph theory, these methods have a long history of providing insight into the static and dynamic properties of RNA molecules. These approaches have been effective in identifying functional binding sites and analyzing the dynamic behavior of RNA complexes. Recently, the advent of artificial intelligence (AI) has brought transformative changes to the field. These technologies have been increasingly applied to studying RNA complex structures, providing new avenues for understanding the complex interactions within RNA complexes. By integrating AI with traditional network analysis methods, researchers can build more accurate models of RNA complex structures, predict their dynamic behaviors, and even design RNA-based inhibitors. In this review, we introduce the integration of network-based methodologies with AI techniques to enhance the understanding of RNA complex structures. We examine how these advanced computational tools can be used to model and analyze the detailed interface information and dynamic behaviors of RNA molecules. Additionally, we explore the potential future directions of how AI-integrated networks can aid in the modeling and analyzing RNA complex structures.

I. INTRODUCTION AND MOTIVATION

RNA complexes are fundamental biomolecules that bind proteins, DNA, or small molecules. They play essential roles in critical biological processes, such as transcription and translation.1–3 For example, RNA polymerase II (Pol II) recruits the U1 small nuclear ribonucleoprotein particle to initiate co-transcriptional splicing.4 RNA complexes are also closely linked to the occurrence of diseases, such as AIDS and COVID-19.5–7 It is widely known that the function of biomolecules is intricately connected to their tertiary structure. Understanding their structures helps scientists uncover the mechanisms underlying various biological processes, leading to advancements in numerous natural disciplines. This structural insight can inform the development of new therapeutic strategies, enhance our understanding of disease mechanisms, and drive innovations in drug design, ultimately contributing to improved health outcomes and the advancement of biomedical science.8–11

Experimental methods, such as x-ray crystallography, nuclear magnetic resonance (NMR) spectroscopy, and cryo-electron microscopy (cryo-EM), have been developed to determine the structure of biomolecules. However, these techniques are expensive and time-consuming, limiting their widespread use.12–14 Additionally, each method faces technical challenges. For example, samples for x-ray crystallography must be crystallizable, which limits the types of samples with high flexibility.15 NMR spectroscopy presents challenges when analyzing large biomolecules. Cryo-EM density maps have low resolution, making structure determination challenging and expert-intensive.16 The existing constraints are causing a slowdown in the modeling and analysis of biomolecular structures. This impedes the speed at which scientific and medical advancements can be made. Hence, there is an urgent need to develop computational approaches to speed up research on RNA complexes.

Several computational prediction approaches have recently been developed to overcome the experimental limitations. Monomeric structure prediction methods, such as Alphafold3, RoseTTAFold, and 3dRNA, have been proposed and have achieved accuracy comparable to experimental methods. However, predicting RNA complex structures remains challenging.17–20 Current RNA complex prediction methods predominantly rely on molecular docking. These methods face challenges due to RNA's flexible conformation and the complicated features of its interfaces. Information on RNA binding sites contributes to determining the locations where molecules interact with RNA, which is particularly useful for identifying targets for small molecules. However, larger molecules involve more extensive binding interfaces, necessitating more detailed information on interfacial contacts. Over the past decade, advancements in network-based approaches have significantly improved the ability to obtain information on binding sites and interfacial contacts. These methods, known for their robustness and simplicity, have greatly enhanced the molecular docking process. In addition, structure prediction methods usually focus on static structures. The functional RNA complexes are often related to their dynamic behavior, particularly the dynamic changes of their interfaces. Dynamic network analysis methods have been extensively utilized in this area of research. Artificial intelligence (AI) and network-based methods have recently been increasingly proposed and developed to overcome the limitations of experimental techniques in RNA complex modeling and dynamical analysis.

This review article focuses on RNA complex structure modeling and dynamic analysis. We start by summarizing how AI and network-based methods are integrated into RNA complex structure modeling, emphasizing advances in binding site and interface contact prediction [Figs. 1(a) and 1(b)]. These AI-network-driven approaches have provided more accurate and efficient information to model RNA complexes. Following this, we present an overview of AI-integrated network methods for analyzing complex dynamic motions [Fig. 1(c)]. This includes techniques for understanding how these complexes behave and interact over time, which is crucial for comprehending their biological functions and mechanisms. We then discuss potential future directions in this field and the remaining challenges and opportunities for innovation. We also explore emerging technologies and methodologies that could further enhance our understanding of RNA complexes. We aim to provide a comprehensive outlook on the evolving landscape of RNA complex structure prediction and dynamic behavior analysis in the context of AI and network-based approaches.

FIG. 1.

FIG. 1.

The AI-integrated network methods are applied in RNA complex structure modeling and dynamic analysis. (a) Prediction of the binding site of the receptor (R) concerning the ligand (L). (b) Complex contact prediction. (c) Dynamic motions analysis for RNA complex.

II. COMPLEX STRUCTURE MODELING

A. Overview

Classical complex structure prediction algorithms are usually based on molecular docking, which predicts the RNA complex structures using individual components. The idea of docking algorithms originates from the lock-and-key model and the induced fit theory.21 The docking process aims to obtain the optimal binding mode that satisfies the spatial shape complementarity and energy minimization simultaneously.22 Traditional docking methods typically explore conformational space using techniques, such as Fast Fourier Transform (FFT) and Monte Carlo, to generate numerous candidate complex configurations.23 Then, these conformations are ranked by using scoring functions to predict the near-native structures according to energy scores.24,25 Many classical molecular docking algorithms exist, such as HADDOCK and HSYMDOCK for protein–protein docking, HDOCK and 3dRPC for protein-NA docking, and NLDock for NA-ligand docking.24,26–32 RNA complex docking accuracy is limited by two main factors: (1) the conformational flexibility of RNA and (2) the complex interface interaction characteristics between RNA and protein. Therefore, researchers are interested in identifying binding site and interface contact information to guide the complex modeling.

Binding site information can help quickly identify potential RNA binding hotspots, which is valuable for guiding RNA complex modeling. It has also been shown to improve the prediction of RNA–small molecule interactions, thereby accelerating downstream drug discovery tasks.33 For larger proteins with extensive binding interfaces, it is important to consider not only the RNA binding area but also the protein binding pose. Therefore, contact information between the two monomers can also aid in predicting the complex structure. In structure prediction, it has been proven that contact information between residues or nucleotides enhances the accuracy of predictions.34 Recently, AI-integrated network methods are gradually beginning to emerge. In these approaches, biomolecules are viewed as networks, with nodes representing residues (proteins) or nucleotides (RNA) and edges typically defined by a distance threshold. This threshold is primarily based on the length of physical interactions. Two residues/nucleotides are considered connected if the distance between any pair of their heavy atoms is less than the defined distance threshold.35

B. Binding sites prediction

The nucleotides in RNA that interact with other molecules are known as binding sites or binding pockets. Experimental RNA binding sites are determined by the nucleotides located within a specific atomic distance cutoff, such as 4 Å, from other molecules.35,36 RNA binding sites are critical in determining complex structures and provide information on biomolecular interaction and assembly. Accurate knowledge of binding sites can significantly enhance our understanding of molecular regulation, leading to the development of targeted RNA inhibitors.

Recently, traditional methods for identifying RNA binding sites have been developed. These prediction approaches can be broadly categorized into sequence-based and tertiary structure-based. Rsite2 is a sequence-based method for RNA binding site prediction.37 Zeng et al. found a close relationship between the distance of nucleotides and the locations of RNA binding sites. They developed Rsite2, which calculates the Euclidean distances between each nucleotide and other nucleotides using the predicted secondary structures derived from the sequence. It then identifies the nucleotides that are the extreme points on the distance curve as functional sites. This approach has demonstrated that the connectivity between nucleotides holds the potential for discovering the functional sites by testing three ncRNAs [tRNA (Lys), Diels-Alder ribozyme, and RNase P]. Tertiary structure-based approaches include Rsite and RBind. Rsite replaces the secondary structure Euclidean distance with a tertiary structure Euclidean distance, refining the identification process of these functional sites.38 RBind transforms RNA tertiary structures into networks and utilizes network properties (degree and closeness centrality) to identify the binding connectivity.35 By analyzing these network properties, RBind effectively identifies RNA binding sites, offering a comprehensive understanding of the structural and functional relationships within RNA molecules. These methods have shown the effectiveness of network properties in binding site prediction. Recently, AI-integrated network methods have been developed to further explore the role of network features in identifying binding sites. RNetsite considers multiple local and global network features and utilizes an ensemble machine learning strategy to identify RNA binding sites, effectively demonstrating the effectiveness of AI-integrated networks strategy in RNA binding site prediction.39 RNAsite uses a random forest model that combines structural features derived from network properties, such as degree and closeness centrality, with additional structural features such as accessible surface area and Laplacian norm. It also incorporates multiple sequence alignment (MSA) to consider the evolutionary conservation of each position in the RNA sequence.36 Similarly, RLBind employs a deep learning-based pipeline using a convolutional neural network that integrates both structural and sequence features for binding site prediction.40 The sequence properties include nucleotide types and evolutionary conservation, while the structure properties encompass network topological characteristics, biochemical properties, and accessible surface areas. Table I provides a brief description of the methods described above.

TABLE I.

List of RNA binding sites prediction methods.

Name Proposed year Type Technique Features References
Rsite 2015 Structure-based Distance RNA tertiary structures 38
Rsite2 2016 Sequence-based Distance RNA sequence 37
RBind 2018 Structure-based Network Network properties 35
RNAsite 2021 Structure-based AI + Network Network properties 36
RNA tertiary structures
RNA sequence
RLBind 2022 Structure-based AI + Network Network properties 40
RNA tertiary structures
RNA sequence
RNetsite 2023 Structure-based AI + Network Network properties 39

To evaluate the performance of binding site prediction methods, we compare Rsite2 with network-based RBind and AI-integrated network-based RNetsite. The comparison was performed using a combination of the RNA-molecule binding site benchmark datasets TE18 and RB9. After removing three overlapping structures from TE18 and RB9, 24 RNA test cases were included (called TE24). The test results showed that RNetsite achieved the highest performance on TE24 in terms of Precision, Recall, and Matthews correlation coefficient (MCC), with RBind showing the second-best performance [Figs. 2(a)–2(c)]. This demonstrates that network features significantly aid in identifying binding sites, and the integration of AI technologies further enhances the effectiveness of RNA binding site recognition.

FIG. 2.

FIG. 2.

The performance of RNetsite (purple), RBind (green), and Rsite2 (blue) models on the TE24 set. (a) Precision. (b) Recall. (c) MCC. RNetsite produced the best results, followed by RBind and Rsite2.

C. RNA complex contacts

Intermolecular contacts between complexes are crucial for regulating and executing various cellular processes. Typically, two residues or nucleotides from interacting biomolecules are considered to be in contact if the distance between their atoms is within a defined threshold, usually 6 or 8 Å.41 The prediction of complex inter-monomer contacts aims to determine whether two positions from different monomers within an interface are spatially close to each other in a tertiary structure. Understanding these contacts offers valuable insight into the mechanisms underlying cellular communication, regulation, and the organization of biological activities. In addition, accurately predicting contacting monomer pairs between interacting biomolecules is crucial for the structural characterization of complex interactions. This prediction can serve as an intermediate step in complex structure prediction,42,43 as the identified contacts can be incorporated into docking algorithms to enhance the accuracy of complex structure models.26,44,45 Moreover, these predicted contacts are invaluable for guiding the design of interfaces and can be extended to predict novel inter-complex interactions.46

Biomolecules will likely be highly conserved or coevolutionary to maintain fundamental or functional interactions. Due to evolutionary pressure, coevolving monomer pairs are often found to be spatially proximal in complex structures.47 Coevolutionary analysis methods, such as mfDCA and plmDCA, have been employed in previous studies on inter-protein contacts.48,49 However, these methods have certain limitations, such as the need for a large number of homology sequences and relatively low accuracy.

Although AI-based approaches have also been applied to predict inter-contacts,50–54 RNA complex inter-contact prediction is still in its early stages, hindered by challenges, such as scarce data, the flexible RNA structure, and the complexity of RNA complex binding interfaces. Nonetheless, AI-based network methods are being explored to predict intra-RNA contacts.55–57 For example, we have developed DIRECT, a hybrid approach that utilizes a restricted Boltzmann machine (RBM) to extract contact pattern information from the structural distance graph.58 The results demonstrate that the AI-based network approach enhances the accuracy of contact inference from sequence co-variations. Given that protein datasets are much larger than those available for RNA contact prediction, we found that knowledge from a protein-coevolution transformer-based language model can be effectively transferred to the task. To achieve this, we developed CoT-RNA-Transfer, which significantly enhances RNA contact prediction through transfer learning using a publicly available protein language model.59 Our findings suggest that structural patterns learned from proteins can be successfully transferred to RNAs, paving the way for new research opportunities.

III. COMPLEX DYNAMIC ANALYSIS

A. Overview

Numerous biological functions are intricately regulated by complex RNA dynamical processes.60,61 The binding interface is predominantly governed by atomic and residue-level interactions, which play a crucial role in driving complex conformational changes and motions. Understanding these motions is essential for gaining insights into RNA complex binding interfaces' stability and regulating their functions. Molecular dynamics (MD) simulations offer a practical approach for studying these dynamic processes. MD simulations provide a direct perspective on the dynamic behavior of biomolecular structures. However, as system sizes increase and enhanced sampling techniques become more popular, there is a growing need for advanced analysis tools capable of extracting information from vast amounts of data and providing new insights.62 Dynamic network methods have been developed and have become effective tools for extracting interface features from MD simulations, such as key nucleotides/residues and communication pathways within molecular complexes.63–65 The dynamical network method transforms the atomic representation of MD simulations into a graph representation with nodes and edges, facilitating the analysis of dynamic structures in biomolecular systems through graph analysis methods.66,67 Building on the development of dynamic network analysis methods, various AI techniques have recently been introduced to the dynamic analysis community.68–70

B. Dynamic network analysis methods

In the context of MD simulations, residues and nucleotides within an RNA complex are represented as individual network nodes. Edges are established between pairs of nodes when the corresponding residues or nucleotides are in contact during the MD simulation. Two nonconsecutive residues or nucleotides are defined to be in contact if any of their heavy atoms (non-hydrogen atoms) are within a defined distance threshold (such as 4.5 Å) for the majority of frames throughout the MD simulations.63 The binding interface is the residue-nucleotide pairs if the distance between residue and nucleotide is less than 4 Å.39,63

In the dynamic network analysis, the edges are weighted by the correlation values derived from the simulation. The distance between two nodes connected by an edge decreases as the correlation (or interaction energy) between the residues/nucleotides increases. The correlation, denoted as Cij, between the motions of nodes i and j provides a quantitative measure of the information transfer between these nodes. In essence, the motion of residue or nucleotide i can be used to predict the directional motion of residue or nucleotide j, indicating a level of coordinated movement or interaction within the molecular system. The weighted edges can be easily employed to investigate the behavior within the binding interface.

The dynamical cross coefficients (DCC) have been widely used in the analysis of MD simulation data.71–74 The DCC assesses the consistency of motion direction between two nodes, which can be calculated by the following equation:

Cij=Δrit·ΔrjtΔrit2Δrjt212 (1)

where rit is the position of the Cα or P of the i th residue or nucleotide of the protein or RNA at time t. Δrit=ritrit, and · refers to the time average of the quantity within the brackets. The DCC between two residues is represented by a value ranging from −1 to 1. A positive Cij value indicates correlated motion, where the residues move in the same direction in most frames. A negative Cij value indicates anticorrelated motion, with residues moving in opposite directions across most frames. If the correlation value between the two residues is close to zero, their motion is considered uncorrelated. The property of DCC has made it a tool to assess the stability of binding interfaces: a positive DCC indicates a tendency for the interface to maintain stability while a negative DCC suggests a tendency to breakup. However, it may not fully capture the impact of varying velocities on the interfaces. The method DCC studies the consistency of motion direction to determine whether the interface is disintegrating. Due to the constraints imposed by the backbone connections in monomeric structures, the structure tends to be rigid, leading to less variation in velocity. As a result, assessing motion correlation based on direction alone is feasible in monomers. However, even when the motion directions are aligned in multimeric complexes, differences in velocity can also cause the interface to breakup. Therefore, relying solely on direction to assess interface stability is inaccurate.

To address this limitation, we have developed distance-based dynamical network correlation (DDNC) analysis, which integrates direction and velocity to more precisely characterize interface binding behavior.39 The DDNC for residue i and nucleotide j at time t can be calculated as follows:

Mijt=ritrij·rjtrijritrijrjtrij. (2)

rit is the position of the ith residue or nucleotide at time t. rij serves as a steady-state reference point derived from the initial structure, obtained by solving an optimization problem as follows:

min|rijr|,rij  s.t.{(rijm)(rirj)=0,(ririj)(rjrij)=0. (3)

ri” (“ rj”) represents the position of the ith ( jth) residue or nucleotide of the protein or RNA in the initial structure. m is the midpoint of the line connecting ri and rj. r is the average position of the binding interface in the initial structure This ensures that rij is one of the closest points to the average position binding interface and guarantees that (1) the correlation between residue i and nucleotide j calculated by Eq. (2) is uncorrelated, indicating a steady state in the initial structure and (2) the stabilization and breaking of the interface both lead to corresponding changes in the correlation coefficients, highlighting the dynamic motions of the interface. Finally, a period from time T1 to time T2 is denoted as τ, the DDNC of this period is defined as follows:

Mijτ=Mijtτ, (4)

where ·τ denotes the time average of the quantity within the brackets during period τ. To compare the performance of DCC and DDNC, we tested both methods in the context of experimentally validated interface disruption between HIV TAR RNA and P-TEFb/Tat complex caused by an RNA inhibitor.6,75 The HIV viral protein Tat hijacks the cellular positive transcription elongation factor b (P-TEFb) to initiate viral transcription. The P-TEFb/Tat complex binds to the viral transactivation response element (TAR) RNA to overcome the elongation pause. Figure 3(a) shows the dynamic changes of this system. The competitive inhibitor JB181 has been found to disrupt the P-TEFb/Tat and TAR interface based on both experimental data and molecular dynamics simulations. Figures 3(b)–3(d) represent the interface dynamics characterized by DCC, while Figs. 3(e)–3(g) represent the interface dynamics characterized by DDNC. In these figures [Figs. 3(b)–3(g)], positive values indicate that the interface remains stable, whereas negative values suggest that the interface tends to disrupt. The analysis results of DDNC show that the interface breaks up in Fig. 3(e). Subsequently, the interface exhibits fluctuations [Fig. 3(f)] and ultimately breaks up [Fig. 3(g)]. In contrast, the analysis results of DCC depict a different breaking-up process. The interface breaks up in Fig. 3(a), returns to a stable state [Fig. 3(b)], and ultimately breaks up [Fig. 3(c)]. Simulation results indicate that DDNC outperforms the traditional DCC method in capturing the dynamic behavior of the interface [Figs. 3(a)–3(g)].

FIG. 3.

FIG. 3.

Comparison of DCC and DDNC in molecular dynamics simulations of unstable interfaces. (a) MD simulation of HIV TAR RNA and P-TEFb/Tat complex. (b)–(d) DCC characterization of interface behavior. (e)–(g) DDNC characterization of interface behavior. The interface changes characterized by DDNC are more reasonable than DCC.

Biomolecular systems are represented by networks with connected nodes. Various computational methods, such as Dijkstra's and Floyd-Warshall algorithms, can be applied to determine the essential pathways within the system.76 This computational approach has been widely studied, leading to the development of tools that precisely identify information pathways transmitting signals from the allosteric site to the target.77–79 These tools have significantly advanced the ability to model signal propagation within biomolecular systems, offering deeper insight into allosteric regulation and its impact on molecular interactions.63,66,80

IV. FUTURE DIRECTIONS

AI-integrated network methods have significantly influenced the field of structural biology, offering advanced approaches to tackle the modeling of complex and analyzing dynamic motions. These methods enable researchers to learn the intricate interactions within biological structures, facilitating a deeper understanding of their structures, behavior, and functions. There are several future directions to explore in this area.

Predicting binding sites of RNA remains an improvement for further exploration. Unlike proteins, RNA molecules exhibit more flexibility due to the higher number of backbone torsion angles and intricate base pairing, resulting in a more complicated and flexible topology.81,82 The unique structure and dynamic nature of RNA make it challenging to predict its binding sites accurately. Despite these difficulties, network-based binding site prediction using artificial intelligence techniques shows a promising approach for advancing RNA research. Current efforts are focused on understanding the relationship between RNA structure and functional binding sites through the analysis of the RNA structural network. The ongoing exploration and improvement of AI-based network methods, such as graph neural networks, which have shown potential in predicting protein binding sites, have significant potential to advance RNA binding site research. For example, GraphBind uses an end-to-end graph neural network to identify nucleic acid-binding residues on proteins. Since binding sites often exhibit highly conserved patterns within local tertiary structures, GraphBind constructs graphs based on the structural contexts of target residues and their spatial neighborhoods. To recognize binding residues accurately, GraphBind employs hierarchical graph neural networks (HGNNs) to capture latent local patterns of structural and physicochemical characteristics.83 Similarly, GPSite utilizes predicted protein structures to construct protein graphs. It uses an edge-enhanced graph neural network to extract residual and relational geometric contexts end-to-end, facilitating precise binding site identification.84

Predicting interactions within RNA complexes is still in its infancy, presenting challenges that could shape future research directions. One of the primary challenges lies in the structural flexibility of RNA, which differs significantly from proteins. RNA complex interfaces tend to exhibit complicated characteristics. For example, RNA exerts its biological functions by forming complexes with proteins. During the binding process, RNA and proteins recognize and attract each other through long-range electrostatic interactions, followed by short-range interactions, such as van der Waals forces and hydrogen bonds, which optimize their conformations and facilitate tight binding.85 RNA regions often exhibit flexible conformational changes upon binding. Determining how these flexible structural regions interact with the complex is critical. Traditional coevolutionary methods rely heavily on multiple sequence alignment (MSA) information, leading to lower prediction accuracy in “orphan” structures whose homologous sequence data are scarce.86,87 To address this, AI-driven large language models can be employed to overcome the “orphan” effect caused by limited homologous information, thereby improving the accuracy of RNA complex contact predictions. AI-driven large language models have already been applied to predict inter-protein contacts.50–54 Graph neural networks have been also increasingly applied to protein complexes to predict inter-chain residue-residue contacts. GLINTER and PLMGraph-Inter exemplify this approach, using the structures of interacting monomers and their rotationally invariant graph representations as input features.52,88 The network model can further help address the challenges posed by the flexible changes in the RNA binding regions.

Static networks are valuable for identifying potential binding sites and complex binding interface contacts. They provide insight into the potential interaction hotspots and post-binding structure. However, function regulation is closely related to conformational dynamics, especially in flexible RNA structures. Understanding the dynamic changes in complex structures can help elucidate their functional mechanisms, which is particularly important for downstream applications such as drug discovery and RNA inhibitor design (Fig. 4). Dynamic networks offer a deeper understanding of the post-binding behavior by analyzing conformational changes and interaction dynamics over time. Then, researchers can comprehensively understand the static and dynamic aspects of regulating RNA function. We recently successfully designed two competitive inhibitors to RNA hairpin motifs, SL3 and RBE RNA, with the help of this strategy.89 The SL3 RNA plays a crucial role in recognizing and packaging the HIV-1 genome by interacting with the NC protein. Among potential inhibitors, L22#15 has demonstrated a promising low binding affinity of approximately 5 μM in NMR experiments, outperforming other SL3 RNA inhibitors. Similarly, RBE RNA, which regulates viral gene expression through its interaction with the HIV-1 Rev protein, has been targeted by the inhibitor L22 + 16, achieving a binding affinity of around 11 μM. These findings indicate the effectiveness of network-based models in RNA function regulation and inhibitor design.

FIG. 4.

FIG. 4.

Flowchart illustrating the process of designing RNA inhibitor. (a) Predicting potential binding sites for inhibitor binding. (b) Inferring interface interactions within RNA complexes. (c) Analyzing functional regulation through MD simulations.

Recently, AI-based techniques have opened new possibilities to capture changes in simulations, providing deeper insights into biomolecules' complex motions and interactions. One direction is to utilize AI-based techniques to capture subtle MD dynamic behaviors within complex systems. Karamzadeh et al. used statistical machine learning methods to extract critical structural features that exhibit significant differences in dynamics.68 Structural stability and subtle, flexible changes can help us better understand the regulatory mechanisms of biomolecules, facilitating the design of RNA molecules with new functionalities. Another direction is to use short simulation trajectories to predict future simulation states. For example, Zhu et al. employed a neural relational inference model based on a graph neural network to predict latent interactions associated with allostery to understand the allosteric pathway from the ligand binding or mutation site to the active center in a protein.90 Thus, comprehending the patterns of biomolecular changes can save substantial time and resources, thereby accelerating the discovery of structural changes induced by drug design.

Integrating AI and physics-based network models enables rapid identification of binding sites and physical interactions, facilitating the prediction of complex docking structures. However, predicting binding sites' robustness remains a challenge due to the flexibility of RNA. Combining AI with coarse-grained complex network models can significantly improve the accuracy and robustness of binding site predictions. Once the complex structure is determined, studying its dynamic behavior becomes equally important. Considering the nodes' direction and velocity in the network allows for a more precise interface analysis. Integrating more dynamic data with AI-integrating network methods makes predicting long-term dynamic structures from short-term dynamic simulations possible. Integrating AI and physics-based network models can reduce interference from irrelevant information, significantly enhancing the efficiency of learning abstract features.

V. CONCLUSION

This review introduces AI-integrated network methods, highlighting their significance in biomolecular complex modeling and dynamic analysis. We have reviewed the evolution of binding site prediction methods, tracing the development from traditional network analysis to the integration of AI technologies. Subsequently, once the binding sites have been identified, the contacts between the binding interfaces can be studied. The progress in predicting contacts for protein complexes has been successful, whereas the prediction of contacts in RNA complexes is still at an early stage. Further development in these methods could contribute to the structural prediction of RNA complexes, marking a promising area for future research. Moreover, we discussed dynamic network analysis techniques for biomolecular MD simulations, which are crucial for understanding allosteric effects and functional regulation in biomolecules. These methods are helpful to enhance our ability to design inhibitors and tackle other downstream tasks more effectively. As AI progresses, its application in this field is expected to offer valuable insight and guidance for future research in complex structure prediction and practical applications, such as inhibitor design and regulation of molecular function.

ACKNOWLEDGMENTS

This work was supported by the National Natural Science Foundation of China (Grant No. 12175081); the Science Fund for Distinguished Young Scholars of Hubei Province (Grant No. 2024AFA077); and the Fundamental Research Funds for the Central Universities (Grant No. CCNU22QN004). This work was also supported by the Central China Normal University's excellent postgraduate education innovation funding project (Grant No. 2024CXZZ146).

NOMENCLATURE

AI

artificial intelligence

Cryo-EM

cryo-electron microscopy

DCC

dynamical cross coefficients

DDNC

distance-based dynamical network correlation

MCC

Matthews correlation coefficient

MD

molecular dynamics

MSA

multiple sequence alignment

NMR

nuclear magnetic resonance

Note: This paper is part of the BPR Special Topic on Artificial Intelligence for Biophysics.

AUTHOR DECLARATIONS

Conflict of Interest

The authors have no conflicts to disclose.

Author Contributions

Haoquan Liu: Data curation (equal); Methodology (equal); Writing – original draft (lead); Writing – review & editing (equal). Chen Zhuo: Data curation (supporting); Investigation (supporting). Jiamin Gao: Data curation (supporting); Investigation (supporting). Chengwei Zeng: Data curation (supporting); Investigation (supporting). Yunjie Zhao: Funding acquisition (lead); Investigation (lead); Project administration (lead); Supervision (lead); Writing – review & editing (lead).

DATA AVAILABILITY

Data sharing is not applicable to this article as no new data were created or analyzed in this review.

References

  • 1. Mishra P., Sankar S. H. H., Gosavi N., and Bharathavikru R. S., “ RNA nucleoprotein complexes in biological systems,” Proc. Indian Natl. Sci. Acad. 88, 300–323 (2022). 10.1007/s43538-022-00087-0 [DOI] [Google Scholar]
  • 2. Dimitrova-Paternoga L., Jagtap P. K. A., Chen P.-C., and Hennig J., “ Integrative structural biology of protein-RNA complexes,” Structure 28, 6–28 (2020). 10.1016/j.str.2019.11.017 [DOI] [PubMed] [Google Scholar]
  • 3. Liu H. and Zhao Y., “ Integrated modeling of protein and RNA,” Briefings Bioinf. 25, bbae139 (2024). 10.1093/bib/bbae139 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Zhang S., Aibara S., Vos S. M., Agafonov D. E., Lührmann R., and Cramer P., “ Structure of a transcribing RNA polymerase II–U1 snRNP complex,” Science 371, 305–309 (2021). 10.1126/science.abf1870 [DOI] [PubMed] [Google Scholar]
  • 5. Schulze-Gahmen U. and Hurley J. H., “ Structural mechanism for HIV-1 TAR loop recognition by Tat and the super elongation complex,” Proc. Natl. Acad. Sci. U. S. A. 115, 12973–12978 (2018). 10.1073/pnas.1806438115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Ning S., Zeng C., Zeng C., and Zhao Y., “ The TAR binding dynamics and its implication in Tat degradation mechanism,” Biophys. J. 120, 5158–5168 (2021). 10.1016/j.bpj.2021.11.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Wu Y., Wang F., Shen C., Peng W., Li D., Zhao C., Li Z., Li S., Bi Y., Yang Y. et al. , “ A noncompeting pair of human neutralizing antibodies block COVID-19 virus binding to its receptor ACE2,” Science 368, 1274–1278 (2020). 10.1126/science.abc2241 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Yin J., Li C., Ye C., Ruan Z., Liang Y., Li Y., Wu J., and Luo Z., “ Advances in the development of therapeutic strategies against COVID-19 and perspectives in the drug design for emerging SARS-CoV-2 variants,” Comput. Struct. Biotechnol. J. 20, 824–837 (2022). 10.1016/j.csbj.2022.01.026 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Li G., Hilgenfeld R., Whitley R., and De Clercq E., “ Therapeutic strategies for COVID-19: Progress and lessons learned,” Nat. Rev. Drug Discov. 22, 449–475 (2023). 10.1038/s41573-023-00672-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Huiwen W., “ Prediction of protein-ligand binding affinity via deep learning models,” Briefings Bioinf. 25, bbae310 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Ning S., Wang H., Zeng C., and Zhao Y., “ Prediction of allosteric druggable pockets of cyclin-dependent kinases,” Briefings Bioinf. 23, bbac290 (2022). 10.1093/bib/bbac290 [DOI] [PubMed] [Google Scholar]
  • 12. Szpotkowski K., Wójcik K., and Kurzyńska-Kokorniak A., “ Structural studies of protein–nucleic acid complexes: A brief overview of the selected techniques,” Comput. Struct. Biotechnol. J. 21, 2858–2872 (2023). 10.1016/j.csbj.2023.04.028 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Guo L., Ding J., Guo R., Hou Y., Wang D., and Huang L., “ Biochemical and structural insights into RNA binding by Ssh10b, a member of the highly conserved Sac10b protein family in archaea*,” J. Biol. Chem. 289, 1478–1490 (2014). 10.1074/jbc.M113.521351 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Nakagawa R., Kannan S., Altae-Tran H., Takeda S. N., Tomita A., Hirano H., Kusakizako T., Nishizawa T., Yamashita K., Zhang F. et al. , “ Structure and engineering of the minimal type VI CRISPR-Cas13bt3,” Mol. Cell 82, 3178–3192 (2022). 10.1016/j.molcel.2022.08.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Chayen N. E. and Saridakis E., “ Protein crystallization: From purified protein to diffraction-quality crystal,” Nat. Methods 5, 147–153 (2008). 10.1038/nmeth.f.203 [DOI] [PubMed] [Google Scholar]
  • 16. Li T., He J., Cao H., Zhang Y., Chen J., Xiao Y., and Huang S.-Y., “ All-atom RNA structure determination from cryo-EM maps,” Nat. Biotechnol. (published online). 10.1038/s41587-024-02149-8 [DOI] [PubMed] [Google Scholar]
  • 17. Abramson J., Adler J., Dunger J., Evans R., Green T., Pritzel A., Ronneberger O., Willmore L., Ballard A. J., Bambrick J. et al. , “ Accurate structure prediction of biomolecular interactions with AlphaFold 3,” Nature 630, 493–500 (2024). 10.1038/s41586-024-07487-w [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Baek M., DiMaio F., Anishchenko I., Dauparas J., Ovchinnikov S., Lee G. R., Wang J., Cong Q., Kinch L. N., Schaeffer R. D. et al. , “ Accurate prediction of protein structures and interactions using a three-track neural network,” Science 373, 871–876 (2021). 10.1126/science.abj8754 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Wang J., Mao K., Zhao Y., Zeng C., Xiang J., Zhang Y., and Xiao Y., “ Optimization of RNA 3D structure prediction using evolutionary restraints of nucleotide–nucleotide interactions from direct coupling analysis,” Nucl. Acids Res. 45, 6299–6309 (2017). 10.1093/nar/gkx386 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Zhao Y., Huang Y., Gong Z., Wang Y., Man J., and Xiao Y., “ Automated and fast building of three-dimensional RNA structures,” Sci. Rep. 2, 734 (2012). 10.1038/srep00734 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Holyoak T., “ Molecular recognition: Lock-and-key, induced fit, and conformational selection,” in Encyclopedia of Biophysics, edited by Roberts G. C. K. ( Springer, Berlin Heidelberg, 2013). [Google Scholar]
  • 22. Guedes I. A., de Magalhães C. S., and Dardenne L. E., “ Receptor–ligand molecular docking,” Biophys. Rev. 6, 75–87 (2014). 10.1007/s12551-013-0130-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Padhorny D., Kazennov A., Zerbe B. S., Porter K. A., Xia B., Mottarella S. E., Kholodov Y., Ritchie D. W., Vajda S., and Kozakov D., “ Protein–protein docking by fast generalized Fourier transforms on 5D rotational manifolds,” Proc. Natl. Acad. Sci. U. S. A. 113, E4286–E4293 (2016). 10.1073/pnas.1603929113 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Huang Y., Liu S., Guo D., Li L., and Xiao Y., “ A novel protocol for three-dimensional structure prediction of RNA-protein complexes,” Sci. Rep. 3, 1887 (2013). 10.1038/srep01887 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Zeng C., Jian Y., Vosoughi S., Zeng C., and Zhao Y., “ Evaluating native-like structures of RNA-protein complexes through the deep learning method,” Nat. Commun. 14, 1060 (2023). 10.1038/s41467-023-36720-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Dominguez C., Boelens R., and Bonvin A. M. J. J., “ HADDOCK: A protein−protein docking approach based on biochemical or biophysical information,” J. Am. Chem. Soc. 125, 1731–1737 (2003). 10.1021/ja026939x [DOI] [PubMed] [Google Scholar]
  • 27. Yan Y., Tao H., and Huang S.-Y., “ HSYMDOCK: A docking web server for predicting the structure of protein homo-oligomers with Cn or Dn symmetry,” Nucl. Acids Res. 46, W423–W431 (2018). 10.1093/nar/gky398 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Yan Y., Tao H., He J., and Huang S.-Y., “ The HDOCK server for integrated protein–protein docking,” Nat. Protoc. 15, 1829–1852 (2020). 10.1038/s41596-020-0312-x [DOI] [PubMed] [Google Scholar]
  • 29. Huang Y., Li H., and Xiao Y., “ 3dRPC: A web server for 3D RNA–protein structure prediction,” Bioinformatics 34, 1238–1240 (2018). 10.1093/bioinformatics/btx742 [DOI] [PubMed] [Google Scholar]
  • 30. Huang Y., Li H., and Xiao Y., “ Using 3dRPC for RNA–protein complex structure prediction,” Biophys. Rep. 2, 95–99 (2016). 10.1007/s41048-017-0034-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Jiang H., Wang J., Cong W., Huang Y., Ramezani M., Sarma A., Dokholyan N. V., Mahdavi M., and Kandemir M. T., “ Predicting protein–ligand docking structure with graph neural network,” J. Chem. Inf. Model. 62, 2923–2932 (2022). 10.1021/acs.jcim.2c00127 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Feng Y., Zhang K., Wu Q., and Huang S.-Y., “ NLDock: A fast nucleic acid–ligand docking algorithm for modeling RNA/DNA–ligand complexes,” J. Chem. Inf. Model. 61, 4771–4782 (2021). 10.1021/acs.jcim.1c00341 [DOI] [PubMed] [Google Scholar]
  • 33. Zhong L., Li Y., Xiong L., Wang W., Wu M., Yuan T., Yang W., Tian C., Miao Z., Wang T. et al. , “ Small molecules in targeted cancer therapy: Advances, challenges, and future perspectives,” Signal Transduction Targeted Ther. 6, 201 (2021). 10.1038/s41392-021-00572-w [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Maddhuri Venkata Subramaniya S. R., Terashi G., Jain A., Kagaya Y., and Kihara D., “ Protein contact map refinement for improving structure prediction using generative adversarial networks,” Bioinformatics 37, 3168–3174 (2021). 10.1093/bioinformatics/btab220 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Wang K., Jian Y., Wang H., Zeng C., and Zhao Y., “ RBind: Computational network method to predict RNA binding sites,” Bioinformatics 34, 3131–3136 (2018). 10.1093/bioinformatics/bty345 [DOI] [PubMed] [Google Scholar]
  • 36. Su H., Peng Z., and Yang J., “ Recognition of small molecule–RNA binding sites using RNA sequence and structure,” Bioinformatics 37, 36–42 (2021). 10.1093/bioinformatics/btaa1092 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Zeng P. and Cui Q., “ Rsite2: An efficient computational method to predict the functional sites of noncoding RNAs,” Sci. Rep. 6, 19016 (2016). 10.1038/srep19016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Zeng P., Li J., Ma W., and Cui Q., “ Rsite: A computational method to identify the functional sites of noncoding RNAs,” Sci. Rep. 5, 9179 (2015). 10.1038/srep09179 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. Liu H., Jian Y., Hou J., Zeng C., and Zhao Y., “ RNet: A network strategy to predict RNA binding preferences,” Briefings Bioinf. 25, bbad482 (2023). 10.1093/bib/bbad482 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Wang K., Zhou R., Wu Y., and Li M., “ RLBind: A deep learning method to predict RNA–ligand binding sites,” Briefings Bioinf. 24, bbac486 (2023). 10.1093/bib/bbac486 [DOI] [PubMed] [Google Scholar]
  • 41. Collins K. W., Copeland M. M., Brysbaert G., Wodak S. J., Bonvin A. M. J. J., Kundrotas P. J., Vakser I. A., and Lensink M. F., “ CAPRI-Q: The CAPRI resource evaluating the quality of predicted structures of protein complexes,” J. Mol. Biol. 436, 168540 (2024). 10.1016/j.jmb.2024.168540 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Hopf T. A., Schärfe C. P. I., Rodrigues J. P. G. L. M., Green A. G., Kohlbacher O., Sander C., Bonvin A. M. J. J., and Marks D. S., “ Sequence co-evolution gives 3D contacts and structures of protein complexes,” eLife 3, e03430 (2014). 10.7554/eLife.03430 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Ovchinnikov S., Kamisetty H., and Baker D., “ Robust and accurate prediction of residue–residue interactions across protein interfaces using evolutionary information,” eLife 3, e02030 (2014). 10.7554/eLife.02030 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44. Li H. and Huang S.-Y., “ Protein–protein docking with interface residue restraints*,” Chin. Phys. B 30, 018703 (2021). 10.1088/1674-1056/abc14e [DOI] [Google Scholar]
  • 45. Sun D., Liu S., and Gong X., “ Review of multimer protein–protein interaction complex topology and structure prediction*,” Chin. Phys. B 29, 108707 (2020). 10.1088/1674-1056/abb659 [DOI] [Google Scholar]
  • 46. Martino E., Chiarugi S., Margheriti F., and Garau G., “ Mapping, structure and modulation of PPI,” Front. Chem. 9, 718405 (2021). 10.3389/fchem.2021.718405 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47. Weigt M., White R. A., Szurmant H., Hoch J. A., and Hwa T., “ Identification of direct residue contacts in protein–protein interaction by message passing,” Proc. Natl. Acad. Sci. U. S. A. 106, 67–72 (2009). 10.1073/pnas.0805923106 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Morcos F., Pagnani A., Lunt B., Bertolino A., Marks D. S., Sander C., Zecchina R., Onuchic J. N., Hwa T., and Weigt M., “ Direct-coupling analysis of residue coevolution captures native contacts across many protein families,” Proc. Natl. Acad. Sci. U. S. A. 108, E1293–E1301 (2011). 10.1073/pnas.1111471108 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Ekeberg M., Lövkvist C., Lan Y., Weigt M., and Aurell E., “ Improved contact prediction in proteins: Using pseudolikelihoods to infer Potts models,” Phys. Rev. E 87, 012707 (2013). 10.1103/PhysRevE.87.012707 [DOI] [PubMed] [Google Scholar]
  • 50. Guo Z., Liu J., Skolnick J., and Cheng J., “ Prediction of inter-chain distance maps of protein complexes with 2D attention-based deep neural networks,” Nat. Commun. 13, 6963 (2022). 10.1038/s41467-022-34600-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51. Roy R. S., Quadir F., Soltanikazemi E., and Cheng J., “ A deep dilated convolutional residual network for predicting interchain contacts of protein homodimers,” Bioinformatics 38, 1904–1910 (2022). 10.1093/bioinformatics/btac063 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52. Xie Z. and Xu J., “ Deep graph learning of inter-protein contacts,” Bioinformatics 38, 947–953 (2022). 10.1093/bioinformatics/btab761 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53. Yan Y. and Huang S.-Y., “ Accurate prediction of inter-protein residue–residue contacts for homo-oligomeric protein complexes,” Briefings Bioinf. 22, bbab038 (2021). 10.1093/bib/bbab038 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54. Zeng H., Wang S., Zhou T., Zhao F., Li X., Wu Q., and Xu J., “ ComplexContact: A web server for inter-protein contact prediction using deep learning,” Nucl. Acids Res. 46, W432–W437 (2018). 10.1093/nar/gky420 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55. Zerihun M. B., Pucci F., and Schug A., “ CoCoNet—Boosting RNA contact prediction by convolutional neural networks,” Nucl. Acids Res. 49, 12661–12672 (2021). 10.1093/nar/gkab1144 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56. Taubert O., von der Lehr F., Bazarova A., Faber C., Knechtges P., Weiel M., Debus C., Coquelin D., Basermann A., Streit A. et al. , “ RNA contact prediction by data efficient deep learning,” Commun. Biol. 6, 913 (2023). 10.1038/s42003-023-05244-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57. Sun S., Wang W., Peng Z., and Yang J., “ RNA inter-nucleotide 3D closeness prediction by deep residual neural networks,” Bioinformatics 37, 1093–1098 (2021). 10.1093/bioinformatics/btaa932 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58. Jian Y., Wang X., Qiu J., Wang H., Liu Z., Zhao Y., and Zeng C., “ DIRECT: RNA contact predictions by integrating structural patterns,” BMC Bioinf. 20, 497 (2019). 10.1186/s12859-019-3099-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59. Jian Y., Gao C., Zeng C., Zhao Y., and Vosoughi S., “ Knowledge from large-scale protein contact prediction models can be transferred to the data-scarce RNA contact prediction task,” arXiv:2302.06120 (2023).
  • 60. Dethoff E. A., Chugh J., Mustoe A. M., and Al-Hashimi H. M., “ Functional complexity and regulation through RNA dynamics,” Nature 482, 322–330 (2012). 10.1038/nature10885 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61. Ganser L. R., Kelly M. L., Herschlag D., and Al-Hashimi H. M., “ The roles of structural dynamics in the cellular functions of RNAs,” Nat. Rev. Mol. Cell Biol. 20, 474–489 (2019). 10.1038/s41580-019-0136-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62. Shen W., Zhou T., and Shi X., “ Enhanced sampling in molecular dynamics simulations and their latest applications—A review,” Nano Res. 16, 13474–13497 (2023). 10.1007/s12274-023-6311-9 [DOI] [Google Scholar]
  • 63. Sethi A., Eargle J., Black A. A., and Luthey-Schulten Z., “ Dynamical networks in tRNA: Protein complexes,” Proc. Natl. Acad. Sci. U. S. A. 106, 6620–6625 (2009). 10.1073/pnas.0810961106 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64. VanWart A. T., Eargle J., Luthey-Schulten Z., and Amaro R. E., “ Exploring residue component contributions to dynamical network models of allostery,” J. Chem. Theory Comput. 8, 2949–2961 (2012). 10.1021/ct300377a [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65. Holliday M. J., Camilloni C., Armstrong G. S., Vendruscolo M., and Eisenmesser E. Z., “ Networks of dynamic allostery regulate enzyme function,” Structure 25, 276–286 (2017). 10.1016/j.str.2016.12.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66. Eargle J. and Luthey-Schulten Z., “ NetworkView: 3D display and analysis of protein·RNA interaction networks,” Bioinformatics 28, 3000–3001 (2012). 10.1093/bioinformatics/bts546 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67. Feher V. A., Durrant J. D., Van Wart A. T., and Amaro R. E., “ Computational approaches to mapping allosteric pathways,” Curr. Opin. Struct. Biol. 25, 98–103 (2014). 10.1016/j.sbi.2014.02.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68. Karamzadeh R., Karimi-Jafari M. H., Sharifi-Zarchi A., Chitsaz H., Salekdeh G. H., and Moosavi-Movahedi A. A., “ Machine learning and network analysis of molecular dynamics trajectories reveal two chains of red/ox-specific residue interactions in human protein disulfide isomerase,” Sci. Rep. 7, 3666 (2017). 10.1038/s41598-017-03966-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69. Wang Y., Lamim Ribeiro J. M., and Tiwary P., “ Machine learning approaches for analyzing and enhancing molecular dynamics simulations,” Curr. Opin. Struct. Biol. 61, 139–145 (2020). 10.1016/j.sbi.2019.12.016 [DOI] [PubMed] [Google Scholar]
  • 70. Xie T., France-Lanord A., Wang Y., Shao-Horn Y., and Grossman J. C., “ Graph dynamical networks for unsupervised learning of atomic scale dynamics in materials,” Nat. Commun. 10, 2667 (2019). 10.1038/s41467-019-10663-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71. Tai K., Shen T., Börjesson U., Philippopoulos M., and McCammon J. A., “ Analysis of a 10-ns molecular dynamics simulation of mouse acetylcholinesterase,” Biophys. J. 81, 715–724 (2001). 10.1016/S0006-3495(01)75736-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72. Ichiye T. and Karplus M., “ Collective motions in proteins: A covariance analysis of atomic fluctuations in molecular dynamics and normal mode simulations,” Proteins 11, 205–217 (1991). 10.1002/prot.340110305 [DOI] [PubMed] [Google Scholar]
  • 73. Young M. A., Gonfloni S., Superti-Furga G., Roux B., and Kuriyan J., “ Dynamic coupling between the SH2 and SH3 domains of c-Src and Hck underlies their inactivation by C-terminal tyrosine phosphorylation,” Cell 105, 115–126 (2001). 10.1016/S0092-8674(01)00301-4 [DOI] [PubMed] [Google Scholar]
  • 74. Hünenberger P. H., Mark A. E., and van Gunsteren W. F., “ Fluctuation and cross-correlation analysis of protein motions observed in nanosecond molecular dynamics simulations,” J. Mol. Biol. 252, 492–503 (1995). 10.1006/jmbi.1995.0514 [DOI] [PubMed] [Google Scholar]
  • 75. Shortridge M. D., Wille P. T., Jones A. N., Davidson A., Bogdanovic J., Arts E., Karn J., Robinson J. A., and Varani G., “ An ultra-high affinity ligand of HIV-1 TAR reveals the RNA structure recognized by P-TEFb,” Nucl. Acids Res. 47, 1523–1531 (2019). 10.1093/nar/gky1197 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76. Bowerman S. and Wereszczynski J., “ Chapter seventeen - detecting allosteric networks using molecular dynamics simulation,” in Methods in Enzymology, edited by Voth G. A. ( Academic Press, 2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77. Amaro R. E., Sethi A., Myers R. S., Davisson V. J., and Luthey-Schulten Z. A., “ A network of conserved interactions regulates the allosteric signal in a glutamine amidotransferase,” Biochemistry 46, 2156–2173 (2007). 10.1021/bi061708e [DOI] [PubMed] [Google Scholar]
  • 78. Süel G. M., Lockless S. W., Wall M. A., and Ranganathan R., “ Evolutionarily conserved networks of residues mediate allosteric communication in proteins,” Nat. Struct. Biol. 10, 59–69 (2003). 10.1038/nsb881 [DOI] [PubMed] [Google Scholar]
  • 79. Miao Y., Nichols S. E., Gasper P. M., Metzger V. T., and McCammon J. A., “ Activation and dynamic network of the M2 muscarinic receptor,” Proc. Natl. Acad. Sci. U. S. A. 110, 10982–10987 (2013). 10.1073/pnas.1309755110 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80. Van Wart A. T., Durrant J., Votapka L., and Amaro R. E., “ Weighted implementation of suboptimal paths (WISP): An optimized algorithm and tool for dynamical network analysis,” J. Chem. Theory Comput. 10, 511–517 (2014). 10.1021/ct4008603 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81. Schroeder R., Barta A., and Semrad K., “ Strategies for RNA folding and assembly,” Nat. Rev. Mol. Cell Biol. 5, 908–919 (2004). 10.1038/nrm1497 [DOI] [PubMed] [Google Scholar]
  • 82. Zhuo C., Zeng C., Yang R., Liu H., and Zhao Y., “ RPflex: A coarse-grained network model for RNA pocket flexibility study,” Int. J. Mol. Sci. 24, 5497 (2023). 10.3390/ijms24065497 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83. Xia Y., Xia C.-Q., Pan X., and Shen H.-B., “ GraphBind: Protein structural context embedded rules learned by hierarchical graph neural networks for recognizing nucleic-acid-binding residues,” Nucl. Acids Res. 49, e51 (2021). 10.1093/nar/gkab044 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84. Yuan Q., Tian C., and Yang Y., “ Genome-scale annotation of protein binding sites via language model and geometric deep learning,” eLife 13, RP93695 (2024). 10.7554/eLife.93695 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85. Corley M., Burns M. C., and Yeo G. W., “ How RNA-binding proteins interact with RNA: Molecules and mechanisms,” Mol. Cell 78, 9–29 (2020). 10.1016/j.molcel.2020.03.011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86. Michaud J. M., Madani A., and Fraser J. S., “ A language model beats alphafold2 on orphans,” Nat. Biotechnol. 40, 1576–1577 (2022). 10.1038/s41587-022-01466-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87. Singh A., “ Structure prediction for orphan proteins,” Nat. Methods 20, 176–176 (2023). 10.1038/s41592-023-01795-1 [DOI] [PubMed] [Google Scholar]
  • 88. Si Y. and Yan C., “ Protein language model-embedded geometric graphs power inter-protein contact prediction,” eLife 12, RP92184 (2024). 10.7554/eLife.92184 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89. Ning S., Sun M., Dong X., Li A., Zeng C., Liu M., Gong Z., and Zhao Y., “ Dynamic geometry design of cyclic peptide architectures for RNA structure,” Phys. Chem. Chem. Phys. 25, 27967–27980 (2023). 10.1039/D3CP03384H [DOI] [PubMed] [Google Scholar]
  • 90. Zhu J., Wang J., Han W., and Xu D., “ Neural relational inference to learn long-range allosteric interactions in proteins from molecular dynamics simulations,” Nat. Commun. 13, 1661 (2022). 10.1038/s41467-022-29331-3 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Data sharing is not applicable to this article as no new data were created or analyzed in this review.


Articles from Biophysics Reviews are provided here courtesy of American Institute of Physics

RESOURCES