Skip to main content
ACS AuthorChoice logoLink to ACS AuthorChoice
. 2025 Jan 16;65(3):1441–1452. doi: 10.1021/acs.jcim.4c01393

Post-Docking Refinement of Peptide or Protein-RNA Complexes Using Thermal Titration Molecular Dynamics (TTMD): A Stability Insight

Andrea Dodaro 1, Gianluca Novello 1, Silvia Menin 1, Chiara Cavastracci Strascia 1, Mattia Sturlese 1, Veronica Salmaso 1, Stefano Moro 1,*
PMCID: PMC11815843  PMID: 39818831

Abstract

graphic file with name ci4c01393_0003.jpg

RNA–protein interactions drive and regulate fundamental cellular processes like transcription and translation. Despite being still limited, the growing body of structural data significantly contributes to the characterization of these interactions. However, RNA complexes involving proteins or peptides are not always available due to the structural determination challenges that this biopolymer entails. Consequently, modeling approaches like molecular docking are exploited to generate complexes relevant to structural and pharmaceutical purposes, including analysis of putative drug targets. Docking methods, despite their widespread adoption, are often hindered by limitations in scoring accuracy, which affects the ranking of the generated poses. Postdocking refining methods, including molecular dynamics (MD) approaches, have been developed to tackle this issue. Thermal Titration Molecular Dynamics (TTMD) is an enhanced sampling molecular dynamics technique that has been previously effectively applied to refine protein or RNA-small-molecule docking poses. This study presents the first application of TTMD to RNA-peptide complexes, validating this method on more complex systems and extending its applicability domain. Our findings showcase the capability of this technique to refine peptide-RNA docking poses, correctly identifying native binding modes among decoys for different pharmaceutically relevant targets.

Introduction

Ribonucleic acid (RNA) is a crucial molecule within the cell, commonly considered a bridging element between genetic information and protein expression. Beyond its canonical “Central Dogma” role, RNA has emerged as a versatile biopolymer involved in several cellular processes like RNA transcription, translation, and splicing.13

Remarkably, only a fraction of the genome is translated into proteins,4 which have been the main pharmaceutical target for decades, leaving a large part of the genome unexplored. For this reason, noncoding RNAs are now receiving significant attention as their dysregulation is related to conditions like cancer and neurodegenerative diseases,5 and understanding these processes can lead to the identification of new pharmaceutically relevant targets. In particular, flaws in protein-RNA interactions are found in these diseases,6,7 and the proteins that can bind RNA, defined as RNA Binding Proteins (RBP), constitute a not neglectable portion of all the proteins (7.5%).8 Consequently, studying the interactions between RNA and proteins could give insights into cellular processes and could be strategic for identifying new therapeutics.

An increasing, yet limited, number of high-resolution structures of RNA-protein complexes (ribonucleoproteins, RNPs) has been solved and deposited in public databases in the past decade; however, the experimental determination of these complexes is still a time-consuming and complex task due to the peculiar nature of RNA. The high structural flexibility, the negatively charged phosphate backbone, and the transient nature of the interaction involving RNA-protein complexes hamper the resolution of these structures.3 Given the lack of solved structures, modeling approaches could help to predict and study these interactions, with molecular docking being one of the most adopted methods.3

The docking procedure is composed of two stages: ranking and scoring. The first one is the generation of possible binding modes (poses) of a molecule at a given target by sampling its orientation and conformation, while the latter consists of ranking the generated models using a scoring function.9 One of the main issues associated with sampling is that the recognition process and the peptide-RNA complex formation often involve conformational changes. While many tools perform rigid body docking, neglecting this critical aspect in favor of speed and reducing computing costs, some semiflexible programs can address this conformational issue by considering local rearrangements. Still, conformational changes involving backbone and loop rearrangements are considered a critical challenge in modeling macromolecular complexes.9

On the other hand, pose scoring can suffer from inaccuracies.10 Many docking tools were initially conceived or adapted to generate protein–protein complexes, and specific protein-RNA scoring potentials are not always available.2,11 A consensus-scoring approach could improve the results,3 as no protein-RNA scoring function can now reach optimal performances. Limited knowledge of protein-RNA interactions restricts the refinement process of these functions. In contrast, the situation is different for protein complexes not involving RNA, as they have been extensively characterized and studied. Many structures are available in the Protein Data Bank (PDB), and the scoring functions used to evaluate these complexes are better tuned.12

Given the peculiar charge distribution of RNA, another essential aspect neglected by common docking approaches is the role of water and ions,13 which can be crucial for maintaining structural stability and facilitating RNA-protein interactions.14

An alternative approach based on Molecular Dynamics (MD) could solve these limitations, considering both RNA and protein flexibility, simulating the system in explicit solvent and ions, thus providing a better model of a biological environment.

However, classical MD is poorly efficient in observing infrequent events such as binding and unbinding. Although reaching biochemical relevant time scales (microsecond-to-millisecond simulations) is indeed possible with current computer architectures, it still requires prohibitive computational time, at least for virtual screening purposes. Moreover, RNAs are often characterized by a rugged conformational landscape, where the system may become trapped in a metastable state. High energy barriers usually block transitions between states, leading to the preferential sampling of a state that differs from the desired one.

For these reasons, enhanced sampling protocols are routinely adopted to offset the time scale limitation of classical MD, allowing the observation of such phenomena in reduced computational time.

Many of these methods rely on the definition of the so-called collective variables (CVs), along which an energy bias is added during the simulation.15 Some relevant examples are metadynamics, steered molecular dynamics (STD), and umbrella sampling. These techniques are widely employed in diverse contexts, from folding and conformational studies to binding and unbinding simulations16

Some notable applicative examples in the context of nucleic acids include the use of Bias-exchange metadynamics to characterize the binding and unbinding of Mg2+ ions on an RNA duplex,17 steered Molecular dynamics (STD) to characterize the binding between TAR and a cyclic peptide15 and umbrella sampling coupled with STD to simulate the unbinding of two ligands (BMVC and BMVC0) from a G-quadruplex.18

Despite the success of these methods, selecting appropriate reaction coordinates can be challenging and system-dependent. CV-free enhanced sampling methods, instead, introduce a bias independently from collective variables. A few examples included in this category are replica-exchange molecular dynamics (REMD), accelerated molecular dynamics (aMD), temperature-accelerated molecular dynamics (TAMD), Gaussian accelerated Molecular Dynamics (GaMD). For example, in relation to nucleic acids, Gaussian accelerated Molecular Dynamics (GaMD) has been recently exploited to simulate the binding between M2-1 protein, from Human respiratory syncytial virus (HRSV), and SH7 RNA19; while replica exchange has been used in the study of a polycation-siRNA complex to identify possible binding sites.20

Thermal Titration Molecular Dynamics (TTMD)21 also belongs to the CV-free class and, differently from the previously mentioned programs, one of the perks of this method is represented by its simplicity. TTMD aims to qualitatively compare ligand-target complexes and discriminate strong from weak binders, without aiming to reconstruct the free energy landscape of the binding-unbinding process nor to estimate the free energy of the process. TTMD’s workflow is pretty straightforward as it performs a series of MD simulations at increasing temperatures while monitoring the persistence of the original binding mode through an interaction fingerprint-based scoring function22: the persistency of the pose discriminates strong binders from weak ones. Moreover, TTMD is a user-friendly program, and simulations can be easily carried out without a strong modeling background as they are automatically performed.

This tool has been recently applied to study the stability of RNA-small molecule complexes.23 As a result of the positive outcome of applying TTMD to RNA-small molecule complexes, this work aims to assess the applicability domain of the technique to study the stability of RNA-peptide/protein complexes. In particular, we evaluated the application of TTMD as a postdocking filter, capable of discriminating poses with native-like binding modes from decoys based on interaction stability.

Materials and Methods

Hardware Overview

The molecular modeling studies, such as the preparation of RNA-peptide/protein complex structures, the setup for MD simulations, and trajectory analyses, were conducted on a 20 CPU Linux workstation equipped with an Intel Core i9-9820X 3.3 GHz processor. Molecular dynamics simulations were carried out on a cluster composed of 30 NVIDIA GPUs ranging from GTX980 to RTX4090.

Structure Preparation

The three-dimensional structures of the RNA-peptide/protein complexes selected for the analysis were retrieved from the Protein Data Bank (PDB).24 The structures of these complexes are reported in Tables S1–S5 (Supporting Information) with their PDB IDs reported as well in Table 1. In the case of NMR structures, the first conformer was selected according to the deposited selection criteria (lowest energy for 1HJI and 5UZZ, and lowest NMR restraint violation energy for 1ETG). All structures were prepared using tools provided by the Molecular Operating Environment (MOE) suite 2022.02.25

Table 1. Summary of the Investigated Complexes.

PDB accession code peptide/protein RNA technique resolution
1HJI48 nun protein BoxB RNA NMR  
6XH049 TAR binding protein TAR trans-activation response element X-ray 3.1 Å
4PDB50 30S ribosomal protein S8 RNA aptamer X-ray 2.6 Å
1ETG51 REV peptide RRE rev responsive element RNA NMR  
5UZZ52 macrocyclic peptide (L50) pre-miR21 NMR  

The “Structure Preparation” tool of the MOE environment was utilized to address issues within the chosen protein and RNA chains. This preparation process included the addition of missing atoms, capping of N/C-termini with acetyl and N-metylamide respectively, and handling of tautomeric forms and alternate conformations of specific residues and nucleotides. Missing loops were reconstructed using a homology-based approach.

The “Protonate3D” tool was used to add hydrogen atoms and assign ionization states (pH = 7.4, T = 310 K, ionic strength = 0.154 M), determining the protonation states. This tool also calculated the most probable tautomer based on hydrogen-bonding contributions, ensuring the stability of the structure. Partial charges were computed using the AMBER14:EHT force field, and hydrogen atoms added during reconstruction underwent energy minimization using the same force field.

The prepared structures were then used for subsequent molecular dynamics and docking calculations, ensuring comprehensive and accurate preparation for computational analysis.

RNA-Protein Docking

Docking calculations, performed to generate RNA-peptide/protein complexes, were carried out using two protocols: HDOCKlite 1.1 for rigid docking and HADDOCK26 2.5 for flexible docking.

Rigid Docking

HDOCKlite1.1 is a docking program derived from the HDOCK server.27 Unlike the web-based version, the local software does not use a template-based approach. It employs a hierarchical rigid sampling of the rotational and translational space between proteins and/or nucleic acids, with an angle interval of 15°, and a spacing of 1.2 Å for a Fast Fourier Transform (FFT)-based translational search. The best scoring and diverse binding modes are retained.

Flexible Docking

To incorporate the flexibility of RNA and peptide or protein counterparts, flexible docking was performed using a locally installed version of HADDOCK2.5.

HADDOCK2.5 is an integrative modeling docking program derived from its web server interface.26 It can combine experimental and predicted information to guide the structure prediction of biomolecular complexes. The docking calculation is based on a triple consecutive run of docking using protocols of first rigid-body calculation, semiflexible refinement, and, in the end, an explicit solvent analysis.

The docking protocol follows the method developed by Charitou and co-workers,28 designating the entire peptide or protein as passive while defining the RNA interface based on nucleotides within a 4.5 Å distance cutoff from the target in the reference PDB structures. The three-stage docking process generates 5000 poses in a initial rigid body minimization stage, followed by 400 poses during the semiflexible step, terminating with the final refinement in explicit solvent. For the subsequent molecular dynamics (MD) simulations, we employed complexes generated during the semiflexible refinement stage, since solvation and refinement were performed following our standard internal protocol for MD simulation.

Surface of Contact Calculations

The surface of contact of each generated pose and the respective crystal or NMR deposited structure has been computed as interatomic contact surface areas (CSA) through the program dr_sasa (method 1) which exploits the Shrake-Rupley algorithm.29

System Setup for MD Simulations

The RNA-peptide or protein complexes for molecular dynamics (MD) simulations were prepared utilizing Visual Molecular Dynamics (VMD)30 1.9.3 and AmberTools22.31,32 Parameters for RNA atoms were assigned using the ff14SB force field with χ modification tuned for RNA (χOL3),33,34 while protein and peptide ligands were parametrized employing the ff14SB35 force field. Each system was solvated in an orthorhombic box with a padding of 15 Å between the complex and the box boundary, using the TIP3P water model.36 Sodium (Na+) and chloride (Cl) ions were added to neutralize the system and achieve a physiological salt concentration of 0.154 M. A 500-step energy minimization using the conjugate gradient algorithm was performed to eliminate clashes and bad contacts.

A two-step equilibration protocol was carried out before the production phase. The first step involved a 1 ns simulation in the canonical ensemble (NVT), with a 5 kcal mol–1 Å–2 harmonic positional restraint applied to each protein/peptide and RNA atom. The second equilibration step consisted of a simulation of equal length in the isothermal–isobaric ensemble (NPT), where the same restraint was applied only to the RNA and peptide/protein backbone. During the NPT phase, the pressure was maintained at 1 atm using a Monte Carlo barostat,37 while the temperature, for all the equilibration phases, was kept constant at the initial (i.e., lowest) TTMD temperature value through a Langevin thermostat.38

All MD simulations were conducted using the ACEMD 3.539 engine based on the open-source Python library OpenMM.40 An integration time step of 2 fs was employed, and the M-SHAKE algorithm41 was utilized to constrain bonds involving hydrogen atoms. Long-range electrostatic interactions were calculated using the particle-mesh Ewald method,42 applying a 9.0 Å cutoff for Lennard–Jones and real-space electrostatic interactions, and a switching distance of 7.5 Å for Lennard–Jones interactions.

TTMD

Thermal Titration Molecular Dynamics (TTMD)21,43 is an enhanced sampling molecular dynamics protocol, developed for evaluating complex unbinding kinetics,43 initially used for protein–ligand complexes and also applied to RNA-small molecule structures. TTMD operates through a series of short classical MD simulations, performed at progressively increasing temperatures, defined as “TTMD-steps”. Thus, the kinetic energy of the system is progressively increased, thereby reducing the simulation time required to observe unbinding events compared to conventional MD simulations.

In this study, TTMD simulations were carried out using Python 3.10, employing libraries such as NumPy, MDAnalysis,44,45 and Scikit-learn. The starting temperature was set at 300 K, and the simulation proceeded in 10 K increments up to the final temperature of 450 K. Each simulation window, or TTMD-step, lasted for 10 ns.

During the simulation, the preservation of the native binding mode was assessed using an interaction fingerprint-based scoring function (IFPcs).22

This metric, described extensively in the work of Pavan et al.,21 evaluates the conservation of the starting binding mode throughout the MD simulation. In particular, for nucleic acids, Prolif fingerprint package is used to determine the presence of nine types of interaction (Hydrophobic, HBDonor, HBAcceptor, PiStacking, Anionic, Cationic, CationPi, PiCation, VdWContact), based on geometric thresholds, between each nucleotide of the nucleic acid and the peptide/protein.

This information is then organized into a matrix (i.e., interaction fingerprint), which is further processed and compared through cosine similarity to the reference one (i.e., computed from the last frame of equilibration), to evaluate changes in the binding mode.

TTMD Trajectory Analysis

During a TTMD simulation, each frame generated is analyzed using the ProLIF46 Python package to compute the interaction fingerprint between the two molecular entities of the complex. The fingerprint is then compared for each frame to the last frame of the second equilibration phase using the cosine similarity metric, which helps assess the maintenance of the original binding mode during the simulation. The similarity value is multiplied by −1 to provide a negative score range, where 0 represents a complete deviation from the reference pose, i.e., loss of the refence binding mode. The interaction fingerprint cosine similarity (IFPcs) score thus ranges from −1 to 0, where −1 indicates complete retention of the original binding mode, and 0 signifies a total loss of native interactions. If the average IFPcs score, calculated on the last 10% of each TTMD-step, exceeds −0.05, indicating that all native interactions have been lost, the simulation is stopped.

At the end of the simulation, each average IFPcs score for each TTMD-step is plotted in the “Titration Profile” graph, as well as the respective temperature. In this graph the “MS coefficient” is also reported, which is defined by the slope of the line connecting the origin of the axes and the last point of the graph, and it is an indicator of the complex stability. This coefficient can range from 0 to 1, with 0 representing the conservation of the binding mode and 1 indicating the loss of the initial interactions in the first step of the simulation. The MS score is calculated using the following equation.

graphic file with name ci4c01393_m001.jpg 1

Five independent TTMD simulations were executed for each examined complex, and the average MS coefficient was computed over three replicates after discarding the ones with the highest and the lowest values.

In addition, a “Titration Timeline” plot is generated by the automatic TTMD procedure, illustrating the time-dependent evolution of IFPCS scores throughout the simulation. This plot also includes the root-mean-square deviation (RMSD) of protein backbone, of RNA backbone and of the backbone of nucleotides belonging to the binding site (i.e., in contact with the peptide/protein; defined as Bsite in the graph), calculated using the MDAnalysis Python package, providing a comprehensive view of the dynamics and stability of the complex during the simulation.

eRMSD Calculations

All the eRMSD calculations were performed using Barnaba47 software and the eRMSD plots obtained are available in Zenodo at the link provided in the Data Availability section.

Results

To assess the applicability of TTMD to the study of RNA-peptide complexes, we applied this technique to five test cases of pharmaceutical relevance, selected based on the availability of an experimental structure in the Protein Data Bank (PDB) and spanning over different “ligand” classes, ranging from cyclic peptides to proteins, mainly focusing on peptides.

In particular, the test cases included the following complexes: Nun protein-BoxB RNA (PDB ID: 1HJI), TAR binding protein-HIV-1 TAR RNA (PDB ID: 6XH0), 30S ribosomal protein S8-RNA aptamer (PDB ID: 4PDB), REV peptide-RRE RNA (PDB ID: 1ETG) and macrocyclic peptide-pre-miR21 (PDB ID: 5UZZ).

1HJI, 1ETG, and 5UZZ were chosen to assess the technique’s performance with RNA-peptide complexes with the latter characterized by a cyclic peptide, while 4PDB and 6XH0 were used to explore the limits of TTMD application with RNA-protein complexes.

A concise outline of the selected cases, summarized in Table 1, is reported hereafter.

Nun Protein

Bacteriophages rely on the bacteria machinery to replicate their genomes, synthesize viral proteins, and generate offspring. They can also compete for the same host, and in this scenario, some molecular mechanisms enable a lysogenic phage to exclude superinfecting phages. In particular, HK022 and superinfecting λ phage are involved in a well-studied exclusion mechanism, where HK022-encoded Nun protein acts as a transcription terminator for λ early genes, competing with the λ phage-encoded N protein for the major groove-binding of the stem-loop RNA boxB.53 These complexes have been extensively studied as models to elucidate protein-RNA recognition,54 as they both share on the protein side an arginine-rich motif (ARM) like other RNA-binding proteins such as Tat and Rev.55 In this paper, a nun/boxB complex, deposited under the PDB ID: 1HJI,48 has been considered.

BoxB RNA is characterized by a hairpin conformation closed by an apical tetraloop (GNRA), and in the deposited structure, the peptide assumes a bent α-helix conformation upon binding to the major groove, contacting even portions of the apical loop,48 with a predicted surface of contact of 561.76 Å2.

TAR and REV

Human immunodeficiency virus of type 1 (HIV-1) is the causative agent of the acquired immunodeficiency syndrome (AIDS). Despite the significant efforts directed to treat and prevent this retrovirus, its eradication is still hampered by the absence of an effective vaccine and by the limited capability of Antiretroviral Therapy (ART) to completely remove the virus from the host,56 as the presence of latently infected cells could be a source of persistent residual viremia57 and long-term related comorbidities can occur.58 In this scenario, transcription inhibitors could play a key role in controlling viral reactivation, thus reducing viral reservoir. Among the targets involved in the transcription phase, the TAT-TAR complex is one of the most attractive for drug development.59 The elongation of HIV-1 transcription is promoted by host CDK9/cyclin T1, recruited by HIV-1 transactivation response (TAR). TAR is activated by the Tat protein, which is essential in this process as, in its absence, the transcription results in short and abortive transcripts.60 Not surprisingly, the TAR complexes deposited in the PDB mainly include peptidomimetics and cyclic peptides specifically designed to inhibit the formation of the TAR-Tat complex. In this work, a complex that involves a TAR binding protein, deposited under the PDB ID: 6XH0,49 has been selected. This case study involves a protein-RNA complex, and in this structure, TAR is characterized by an A-form helical stem-loop with a central trinucleotide (UCU) bulge and an apical hexaloop, with the major groove and the central bulge being involved in binding with the β2-β3 connecting loop of the protein,49 with a predicted surface of contact of 771.90 Å2.

TAR-TAT complex is not the only target that emerged as a potential candidate against the latent form of HIV-1: REV-RRE complex has also gathered significant attention as it is involved in the process that leads to the translation of structural proteins.61 HIV-1 REV is a regulatory protein that recognizes the Rev response element (RRE), a ∼350 nucleotide sequence located in unspliced and incompletely spliced viral mRNA transcripts. The REV-RRE oligomeric recruits host Crm1/Ran-GTP nuclear export machinery to facilitate the export of these mRNAs from the nucleus to the cytoplasm.62,63 Subsequently, after the complex dissociation in the cytoplasm, viral RNAs are packaged for new virions or translated into essential viral proteins.62,64

The interaction between a purine-rich bulge stem-loop IIb of RRE and the arginine-rich α-helix binding domain of REV drives the recognition between Rev and RRE. In the present study, a complex available under PDB code 1ETG(51) and presenting RRE complexed with a REV peptide portion that includes the arginine-rich motif (ARM) has been selected, prepared, and subjected to docking and TTMD pose refinement.

RRE is characterized by a branched structure with multiple stem-loops and bulges. In the deposited structure, the REV peptide, presenting an α-helical conformation, interacts with the stem-loop IIB of RRE, binding the major groove near a purine-rich internal loop,51 with a predicted surface of contact of 964.49 Å2.

Bacillus anthracis Ribosomal Protein S8

The bacterial S8 ribosomal protein has been studied and taken as a model for the characterization of RNA-protein interaction. This RNA-binding protein contributes to the correct folding of the central domain of 16S rRNA.65 Apart from this function in the ribosome assembly, it has a role in the repression of the translation of the genes encoded by the spc operon mRNA, including the ones of S8 itself and ten other ribosomal proteins.50 Although greater affinity is commonly attributed to the 16S rRNA site, the region targeted on the mRNA is very similar,66 and many interactions between the S8 protein and the two binding sites are the same.50 The study of the interaction between RNA and S8 has also been addressed through aptamers: Davlieva et al.,50 through a SELEX experiment, discovered an RNA aptamer able to bind the Bacillus anthracis S8 protein with high affinity. They aimed to identify RNA sequences that, while retaining the ability to bind S8 protein, do not present the conserved features of helix 21 (16S rRNA),50 and resulted in a valuable insight into the ability of the selected aptamer to adopt a different secondary structure upon binding, stressing the importance of RNA flexibility into RNA-protein recognition. To investigate the impact of RNA flexibility on TTMD performances, we decided to include this complex, registered with the PDB ID: 4PDB, performing both rigid and semiflexible docking. In this structure, the aptamer assumes a hairpin conformation, with the A-form helical stem interrupted by an internal loop, while the S8 protein presents an N-terminal domain with an αβαββ fold and a C-terminal comprising an α-helix followed by an antiparallel four-strand β-sheet. The latter portion is involved in most of the contacts with the RNA aptamer through residues located in the turns at the ends of the β-strands.50 The overall predicted surface area is 879.87 Å2.

The results of prior work from our group also guided the selection of this test case, as a binding simulation between the aptamer and the protein was successfully performed through Supervised Molecular Dynamics (SuMD), with the last frame of the trajectory converging to the crystal structure.67

Pre-miR21

The overexpression of miR21 is implicated in the development of cancer. In many solid tumors, it is linked with cell proliferation, differentiation, and apoptosis, contributing to tumor growth, invasion, and metastasis.68 After the first cleavage of the primary miRNA transcripts inside the nucleus, the pre-miRNA is exported into the cytoplasm, where it is further cleaved at the apical portion of the stem-loop by another RNase III enzyme named Dicer. While the passenger strand of the obtained duplex is degraded, the guide strand forms the miRNA-induced silencing complex,69 which targets complementary RNAs, leading to mRNA cleavage, degradation, and translational repression.70

For these reasons, miR21 is investigated as a potential tumor biomarker for diagnostic purposes. Moreover, targeting miRNA-21 could provide adjuvant therapy for tumors like glioblastoma, in which this RNA acts like an oncogene capable of blocking apoptosis.68

However, inhibition of miR-21 using small molecules has had limited success. A cyclic peptides library screening led to the identification of a cyclic peptide (L50) that can bind pre-miR21 at the Dicer cleavage site,52 and the same authors also deposited the structure of the complex under the PDB ID: 5UZZ.52 In this complex, the cyclic peptide binds the major groove at the junction between the single-stranded RNA loop and the double-stranded RNA,52 with a predicted surface of contact of 636.99 Å2.

In the present work, starting from different docking poses generated through HDOCKlite, the TTMD protocol has been applied to a pharmaceutically relevant target like miR21, evaluating the behavior of the technique in the challenging field of cyclic peptides.

Simulation Workflow

For each RNA-peptide complex, different docking poses were generated by exploiting HDOCKlite and HADDOCK programs, and the obtained structures are reported in Tables S1–S5 (Supporting Information). These two different algorithms were considered to compare a fully rigid docking procedure (HDOCKlite) to one including some extent of flexibility (HADDOCK). The only exception was represented by structure 5UZZ, characterized by the presence of a cyclic peptide with a d-Proline, that was not easily managed with HADDOCK, which was thus excluded from the comparison.

The five top-scoring docking poses were selected for each protocol and subjected to TTMD simulations. When the pose with the lowest RMSD to the experimental reference (i.e., the experimental-like pose) was not included in the top five structures, a lower-ranked experimental-like pose was retrieved, regardless of its ranking position, and subjected to TTMD as well. The results were compared to reference TTMD simulations performed on the experimental structure, to assess the complexes stabilities and evaluate the capability of TTMD to discriminate experimental and experimental-like poses from decoys.

As an example, the poses for 6XH0 are reported in Table 2, while the docking poses for the remaining test cases are reported in Tables S1–S5 (Supporting Information). In this case, as regards HDOCKlite, the 5 top scoring poses have been selected, with the crystal-like pose being at the top of the ranking. In the case of HADDOCK, the 5 top scoring poses were selected as well, but they did not include the crystal-like pose, which was instead at the 212th position of the ranking, and additionally submitted to TTMD simulations.

Table 2. Docking Results of 6XH0.

graphic file with name ci4c01393_0001.jpg

a

The docking pose of 6XH0, obtained exploiting HDOCKlite and HADDOCK, and the structure deposited in the PDB along with its assertion code, are reported. The RNA is colored in Blue while the protein is pictured orange.

Scoring Results

The Prolif fingerprint package, according to the experience gained in the previous application of TTMD to RNA-small molecule complexes,23 was adopted to compute the RNA-peptide interaction fingerprints. In TTMD, the cosine similarity (IFPcs) between the interaction fingerprints during the simulation and the reference position (end of equilibration) is used to monitor the persistence of the binding mode during time. In this work, the starting temperature of the simulation was set to 300 K, the end temperature to 450 K, and the duration of each TTMD-step to 10 ns with a temperature increase of 10 K between each step. The TTMD MS coefficient, i.e., the slope of the line connecting the IFPcs at the beginning and end of the simulation (and deeply explained in the method section), was adopted to rank the poses and characterize the conservation of the binding mode between the start and the end of the simulation, with lower MS values representing more stable poses.

Tables 3 and 4 summarize the TTMD results, for each test case, indicating the average MS values calculated from the TTMD simulations, while the whole set of results is reported in Tables S6–S10 (Supporting Information).

Table 3. TTMD Simulation Results for Each Significant Replica Generated with HADDOCKa.

HADDOCK
6XH0
1ETG
1HJI
4PDB
pose mean MS pose mean MS pose mean MS pose mean MS
crystal 0.0038 NMR 0.0041 NMR 0.0041 crystal 0.0032
1 0.0048 1 0.0060 1 0.0082 1 0.0036
2 0.00516 2 0.00596 2 0.00806 2 0.00388
3 0.00529 3 0.00573 3 0.00665 3 0.00417
4 0.00488 4 0.00552 4 0.67135 4 0.00456
5 0.00580 5 0.00465 5 0.00549 5 0.00444
C-like (Pose212) 0.0041 NMR-like (Pose247) 0.00535 NMR-like (Pose73) 0.0043 C-like (Pose317) 0.0038
a

The PDB accession code, the docking program used, and the average MS score calculated, are reported for each test case simulated. The mean MS value for the experimental structure and experimental-like pose, the docking protocol, and the PDB code of each case are highlighted in bold.

Table 4. TTMD Simulation Results for Each Significant Replica Generated with HDOCKLitea.

HDOCKlite
6XH0
1ETG
1HJI
4PDB
5UZZ
pose mean MS pose mean MS pose mean MS pose mean MS pose mean MS
crystal 0.0038 NMR 0.0041 NMR 0.0041 crystal 0.0032 NMR 0.0052
1 0.0038 1 0.0054 1 0.0042 1 0.0041 1 0.0055
2 0.00503 2 0.00614 2 0.00710 2 0.01154 2 0.00563
3 0.00615 3 0.0053 3 0.00517 3 0.066865 3 0.00561
4 0.00532 4 0.00529 4 0.00464 4 0.00524 4 0.00911
5 0.02103 5 0.00484 5 0.0040 5 0.01168 5 0.00551
C-like (Pose1) 0.0038 NMR-like (Pose52) 0.0045 NMR-like (Pose1) 0.0042 C-like (Pose1) 0.0041 NMR-like (Pose1) 0.0055
a

The PDB accession code, the docking program used, and the average MS score calculated, are reported for each test case simulated. The mean MS value for the experimental structure and experimental-like pose, the docking protocol, and the PDB code of each case are highlighted in bold.

Although the complete unbinding event was never sampled during the simulations, applying TTMD as a postdocking refinement tool with this set of test cases has brought positive results since the stability of the simulated complexes, evaluated as the retention of the binding mode throughout the simulation, has been assessed. In detail, all the experimental structures have lower MS values when compared to the derived docking poses, regardless of the protocol used to generate them. The only exception is represented by pose number 5 (RMSD 5.02 Å) of 1HJI, generated through the HDOCKlite rigid docking approach, which has an MS score slightly lower than the NMR and the NMR-like structure (pose 1, RMSD 1.63 Å), but still comparable to them.

For most of the selected test cases, the technique can assign the lowest MS value, among the different poses, to the experimental-like predicted structures. Detailed information about the docking scores for each simulation and RMSD values for each docking pose can be found in Tables S1–S5 (Supporting Information).

Remarkably, even when exploiting a semiflexible docking protocol, like HADDOCK, TTMD successfully distinguished experimental-like poses from decoys in all selected cases except for 4PDB and 1ETG.

All HADDOCK poses obtained for 30S ribosomal protein S8 (4PDB) have reasonably low and close RMSD values as shown in Table S4 (Supporting Information), and TTMD failed to discriminate their subtle differences. In this situation, TTMD identified the crystal pose as the most stable, but, among the docking poses, the lowest MS was assigned to pose 1 (RMSD of 2.52 Å) rather than pose 317 (RMSD of 1.82 Å). Considering that both RMSD values are below the crystallographic resolution (2.60 Å), the two poses can be described as crystal-like, and TTMD still discriminated them from decoys.

In the case of REV peptide (1ETG), TTMD assigned the lowest MS score to the NMR pose, while the second MS score was not assigned to the NMR-like pose (pose 247), RMSD of 5.62 Å, but to pose 5 (RMSD of 7.28 Å). However, both poses are topologically similar to the NMR structure and are assigned the lowest MS scores among the docking poses.

Despite these situations, the technique was able to identify experimental-like poses for the remaining cases. 6XH0 and 1HJI are particularly significant since, following HADDOCK’s ranking, most of the top poses are characterized by high RMSD. Therefore, the application of TTMD and the evaluation of the MS score are valuable in rescoring HADDOCK’s poses, identifying the more experimental-like poses regardless of the ranking provided by the HADDOCK score.

On the other hand, when used starting from the HDOCKlite-generated poses, TTMD obtained positive results, isolating the experimental-like pose in almost all the cases apart from 1HJI. As mentioned above, the NMR structure, pose 1 (NMR-like pose), and pose 5, share low and close MS score values, 0.0042, 0.00423, and 0.00414 respectively. The technique still manages to identify a stable pose among the others, but it also describes pose number 5 as the most stable. Intriguingly, pose 1 and 5 have the lowest RMSD values among the generated ones. Moreover, as observable in the Titration Timeline graphs Figure S1 (Supporting Information), the fluctuation of the IFPcs is higher for pose 5, whereas, for pose number 1 and the NMR reference, it is steadier over time. The IFF score23 a previously reported additional TTMD scoring metric evaluating the fingerprint fluctuation around the mean IFPcs value, provides further support, suggesting a slightly stabler fingerprint fluctuation for the crystal-like pose, with an IFF value of 0,8 compared to 0,9 attributed to pose 5. The MS score is calculated on the last TTMD-step, as explained in the Materials and Methods section. For this reason, at the end of the simulation, rearrangements can lead the complex to assume native interactions previously lost during the initial part of the simulation, leading to a slightly lower MS score.

Although this docking program was indeed able to identify the NMR-like pose among the others, for 5UZZ the MS scores are high and close, indicating a lower resolution of TTMD arguably determined by a suboptimal temperature ramp for this test case.

Differently from the HADDOCK-originated poses of 1ETG discussed above, in the case of HDOCKlite, TTMD managed to identify the NMR-like pose among the others even if it was out of the five top scoring poses.

Notably, TTMD was able to seamlessly identify the crystal-like pose among the ones generated for 4PDB, as, in contrast with HADDOCK, they are much more diversified, as can be observed in Table S4 (Supporting Information).

This specific test case was also selected because an enhanced sampling MD technique called Supervised Molecular Dynamics (SuMD) has already been used to simulate the binding event with this test case reaching a near-crystal complex.67 This achievement opens the possibility of using SuMD as an alternative tool for the complex generation, coupling it with TTMD to score the obtained structures, an approach already tested for a Berenil-dodecanucleotide complex.71

Starting from the last frame of the SuMD trajectory retrieved from a previous study, we also performed a TTMD simulation as a proof of concept. As shown in Table 5, despite being higher than the crystal one, the average MS score, calculated from the TTMD simulations carried on the last frame of the SuMD trajectory, was lower than those computed from the HDOCKlite and HADDOCK-generated crystal-like poses. Given the encouraging result, SuMD seems able to generate stable crystal-like complexes. Despite the need for further validation and the long computational times required, this technique could represent a viable alternative to standard docking approaches for complex generation.

Table 5. TTMD Simulation Results of the Last Frame from the SuMD Trajectory.

graphic file with name ci4c01393_0002.jpg

a

The PDB accession code, the MD protocol or docking program used, the MS score for each simulation, and the average MS coefficient are reported for 4PDB. The replica with the nearest MS value to the average is highlighted in bold.

TTMD Simulation Analysis

For each test case, an in-depth description of the TTMD simulation is presented hereafter, comparing experimental structures with experimental-like ones and the MS worst-performing pose.

1ETG

The NMR structure remains stable throughout the simulation, obtaining the lowest MS value, and the nucleic acid globally retains the native structure even at high temperatures as can be evinced by low to medium eRMSD72 values, which rise with temperature, just like the following poses. Interactions mediated by key RNA residues U66, G67, U45, G46, G47, and A73, as identified in the original NMR structural analysis, are consistently preserved during the simulation. Despite minor conformational changes in the RNA, the key residues within the binding site remain geometrically stable, likely due to the stabilizing effect of the peptide, which helps maintain the overall integrity of the complex.

Similarly, the NMR-like structures generated by both HDOCKlite and HADDOCK exhibit stable interactions between the peptide and RNA, with key interactions involving U66, G67, U45, G46, G47, and A73 remaining even as the temperature increases.

However, for the MS worst-performing poses from both HDOCKlite and HADDOCK, the interactions deviate from the NMR-like configurations. These poses show a significant reduction in the complex stability throughout the simulation, with an increased peptide flexibility and mobility, and a partial loss of the initial interactions.

1HJI

The NMR structure remains stable throughout the simulation, achieving a low MS value. Key interactions mediated by C2, C3, C4, and A7, as previously described by Faber et al.,48 are largely preserved. Specifically, the stacking interaction between A7 and Tyr39 on the peptide is consistently maintained, supporting binding mode stability.

The HDOCKlite NMR-like structure similarly maintains a low MS score and stable interactions mediated by C2, C3, C4, and A7, including the critical A7-tyrosine stacking interaction, which undergoes minor fluctuations at higher temperatures. The HADDOCK NMR-like structure also preserves a stable binding mode, though it partially loses the A7 stacking interaction.

In contrast, HDOCKlites’s least stable pose quickly loses peptide-RNA interactions due to poor shape complementarity, altered by the increasing temperatures. Similarly, HADDOCK’s least stable pose loses stability early in the simulation, leading to partial peptide dissociation, which impacts the RNA stability as, for this test case, it is the only simulation reported characterized by a rapid drift toward medium eRMSD values.

5UZZ

In the NMR structure, the peptide ligand (L50) binds to the major groove of the RNA helix, where it interacts primarily with residues such as U27, G28, and A29, also interacting with residues of the apical loop as C39 and U40.

In the study of Shortridge52 Arg12 was defined as essential for recognition and binding within the core of the binding cavity, and during the simulation of the NMR structure, these interactions are maintained despite the high dynamic behavior of the complex, including Arg12 starting interaction with G28 and A29.

NMR structure and NMR-like pose share the same RNA-peptide conformation, which guarantees common interactions during the simulations like Arg12-G28 and Arg12-A29, while the MS worst-pose presents the L50 peptide flipped inside the major groove. The starting position falls within the same binding site region of the NMR and NMR-like pose and ensures common starting contacts, but the interactions are not mediated by the same amino acid residues. During the simulation, the peptide drifts and consequently loses part of the starting interactions with a consequent worsening of IFPcs. These interactions are mediated by Arg12, which interacts with the apical loop at C39 and U40 and then moves toward U6 and G7, while Arg8, at the beginning of the simulation, is solvent exposed rapidly heading to the apical loop.

Globally the eRMSD values for these complexes increase during the simulations reaching medium/high values, instability likely due to a suboptimal temperature ramp.

4PDB

The crystal structure is the most stable one, obtaining the lowest MS value. Interactions mediated by residues C16, C17, A24, U25, A26, U27, and C28, described by Davlieva et al.,50 are visibly retained during the simulation.

The crystal-like HADDOCK-generated structure is also stable overall, but toward the end of the simulation, where temperatures are highest, the C28-Tyr88 interaction is lost, contributing to a higher MS score. The U27-Glu126 hydrogen bond also experiences fluctuations, becoming less stable in response to the dynamic changes.

The crystal-like structure generated through HDOCKlite displays a similarly low MS value, comparable to that achieved with the other docking protocol. The simulation reveals some translational movement of the protein in relation to the initial bound state, with coordinated movements in the protein followed by a slight reduction in IFPcs, as shown in Figure S2 (Supporting Information). In this case, the U27-Glu126 interaction is largely preserved, while the hydrogen bond between the hydroxyl group of Ser107 and A26 fluctuates following the geometrical stability.

The worst-performing pose of HDOCKlite (pose 4) is highly unstable from the beginning of the simulation, leading to an early loss of the initial protein-nucleic interactions that are indeed highly divergent from the crystal ones.

The worst-performing HADDOCK-generated pose is stabler than the last described one, as it is still similar to the experimental structure (i.e., low RMSD values, as all the poses generated with this technique), and therefore it presents crystal-like interactions that are retained almost until the end of the simulation. Lastly, the high temperatures favor the movement of the protein leading to the partial loss of interactions and, as a consequence, worsening of the IFPcs, hinting at the sensitivity of the technique.

Both RMSD stable and unstable complexes have a temperature-dependent eRMSD trend starting from low values and reaching medium/high ones.

6XH0

The crystal structure exhibits stability throughout the simulation, achieving a low MS value. The integrity of the binding mode is primarily governed by key interactions mediated by RNA residues G26, G28, and G36. Arginines, in particular Arg47, Arg49, and Arg52, form essential interactions within the TAR major groove. Furthermore, Gln48, located in the β2-β3 loop, contributes to the stabilization of the RNA phosphate backbone, while Thr50 and Gln54 support the structural conformation of the loop by forming intrapeptide hydrogen bonds. Collectively, these interactions form a robust network that ensures the sustained stability of the binding mode throughout the simulation.

The crystal-like structures generated by both HDOCKlite and HADDOCK maintain low MS scores and exhibit similar binding modes, with preservation of the critical interactions with G26, G28, and G36. The persistence of these key interactions contributes to the stability of the binding mode during the simulations.

The least stable pose generated by HADDOCK shows only partial interactions with key RNA residues. This results in an unstable binding mode with structural fluctuations throughout the simulation. Similarly, the lowest-performing pose generated by HDOCKlite lacks the key native interactions with the RNA, resulting in the destabilization of the binding configuration starting from the initial stages of the simulation. All the complexes have low eRMSD values at the start of the simulation, rising to medium values, accordingly to the temperature increase.

Discussion and Conclusion

This work showcases the first application of Thermal Titration Molecular Dynamics (TTMD) to RNA-peptide complexes.

This Python-based tool was initially developed for the qualitative estimation of protein–ligand unbinding kinetic thanks to the analysis of the conservation of the interaction fingerprint during a series of consecutive MD simulations performed at increasing temperatures. In this paper, we employed this program, previously adapted to the study of RNA-ligand complexes,23 as a postdocking refinement tool that identifies experimental-like stable poses according to the conservation of the original binding mode, separating them from decoys.

Five test cases have been identified, basing the decision on structural differentiation and pharmaceutical relevance, and state-of-the-art docking tools like HDOCKlite and HADDOCK have been used to generate the poses starting from the selected deposited structures. We decided to carry out both rigid and semiflexible docking approaches to assess whether a fast posing could affect or invalidate the TTMD capability to identify experimental-like poses, rather than a sophisticated yet slower method, like HADDOCK, that keeps into account local flexibility and generates refined docking poses.

In a previous study, RNA-small molecule complexes were characterized through TTMD using different temperature scales and alternative Python packages to compute the interactions, resulting in the definition of a protocol feasible for nucleic acids.23 In this work, TTMD simulations were performed adopting the same protocol (see Materials and Methods).

Globally, examining Tables S6–S10, TTMD seems capable of refining docking results, being able, in most cases, to discriminate experimental or experimental-like poses, from docking poses that consistently differ from experimental structures. As an example of the discriminatory capability of the technique, Videos S1S4 (Supporting Information) report the TTMD simulations of the crystal structure of 6XH0 compared respectively to the following poses of the same test-case: HADDOCK-generated crystal-like pose (lowest RMSD) and pose 5 (highest MS), HDOCKlite-generated pose 1 (lowest RMSD) and pose 5 (highest MS). As can be observed in the videos, the crystal structure and the crystal-like poses maintain structural integrity despite the high temperatures, while the structural instability is particularly amplified by the rising temperatures for noncrystal-like poses.

Subtle differences between the poses, such as sharing the same interactions and having low RMSD, can reduce the technique’s resolution, like in the scenario where all the poses under discussion may be described as crystal-like, for instance, poses with RMSD below crystallographic resolution, a situation close to the one reported for 4PDB.

The rigid docking approach seems optimal when no local rearrangements are expected upon binding, and the structural data is available like in the current validation, even considering the short computational times: just a few minutes in the case of HDOCKlite. In fact, with this protocol, for each case a native-like pose with RMSD below 3 Å was obtained, and, except in one case, it coincided with the top-scoring pose. On the other side, in a hypothetical scenario where structure prediction programs are exploited due to the absence of a deposited structure and MD pose refinement is needed,73 it is not trivial to get such outstanding performances with a rigid docking approach, thus following a semiflexible docking procedure, like HADDOCK, could be a better choice. Still, it is important to consider that while HADDOCK’s posing seems robust and preferable in specific situations, aside from the longer computational times required (hours), its scoring capabilities are not equally effective, with native-like poses poorly ranked, sometimes even above the 100th position, as shown in Tables S6–S10 (Supporting Information).

Customizing the parameters used to execute HADDOCK could be beneficial in this scenario, but finding the optimal protocol could be difficult. In this circumstance, TTMD could be a critical resource since, as discussed previously, it can discriminate native-like poses from decoys and refine the docking ranking results, at least for the selected test cases. However, coupling TTMD to a docking protocol like HADDOCK, and implementing it in a high-throughput pipeline is still hampered by the long computational times required to carry out MD simulations. Consequently, further testing is needed to determine a robust docking setup, and additional TTMD code optimization and hardware progress are also necessary to reach the computational power required to execute such workflow rapidly. Hopefully, the rate of recent tech development suggests that near-future applications of enhanced sampling methods like TTMD on a large scale will be feasible.74

Alternatively, rescoring or consensus approaches are an option to reduce the number of poses to attention and refine with TTMD, or the top poses generated through rigid docking, followed by MD relaxation, could be used instead. Despite the long simulation time, generating only a few complexes through tools like SuMD (Supervised Molecular Dynamics) could also be a viable solution, although it has not been thoroughly explored. The encouraging results in obtaining stable near-native complexes at the final state of the simulation, as mentioned for 4PDB in the Results section, pave the way for explorative future applications of the technique.

Acknowledgments

MMS lab is very grateful to Chemical Computing Group, OpenEye, and Acellera for their scientific and technical partnership. We also thank Anna Brocchetti for the design of the graphical abstract and the cover art.

Glossary

Abbreviations

GPU

graphic processing unit

PDB

Protein Data Bank

SuMD

supervised molecular dynamics

TTMD

thermal titration molecular dynamics

Data Availability Statement

All molecular structures utilized in this work have been retrieved from the publicly available PDB. The structure of the SuMD simulation last frame of 4PDB, all the TTMD simulations, and the configuration files are available at 10.5281/zenodo.14056191. TTMD Python code to reproduce the simulations performed in this work is available at https://github.com/molecularmodelingsection/TTMD and released under a permissive MIT license.

Supporting Information Available

The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.jcim.4c01393.

  • Docking and TTMD results for all the test cases and titration timeline comparison between HDOCKlite-generated pose 1 and pose 5 of 1HJI (PDF)

  • Comparison between the TTMD replica of the crystal and HADDOCK-generated crystal-like pose for 6XH0 (MP4)

  • Comparison between the TTMD replica of the crystal and HADDOCK-generated pose 5 for 6XH0 (MP4)

  • Comparison between the TTMD replica of the crystal and HDOCKlite-generated crystal-like pose for 6XH0 (MP4)

  • Comparison between the TTMD replica of the crystal and HDOCKlite-generated pose 5 for 6XH0 (MP4)

Author Contributions

The manuscript was written through the contributions of all authors. All authors have approved the final version of the manuscript. A.D. and G.N. contributed equally.

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This work has been funded by the European Union – Next-Generation EU (“PNRR M4C2-Investimento 1.4-CN00000041”).

The authors declare no competing financial interest.

Special Issue

Published as part of Journal of Chemical Information and Modelingspecial issue “Editing DNA and RNA through Computations”.

Supplementary Material

ci4c01393_si_001.pdf (46.7MB, pdf)
ci4c01393_si_002.mp4 (11.3MB, mp4)
ci4c01393_si_003.mp4 (11.9MB, mp4)
ci4c01393_si_004.mp4 (11.7MB, mp4)
ci4c01393_si_005.mp4 (12.4MB, mp4)

References

  1. Mattick J. S.; Makunin I. V. Non-Coding RNA. Hum. Mol. Genet. 2006, 15 (suppl_1), R17–R29. 10.1093/hmg/ddl046. [DOI] [PubMed] [Google Scholar]
  2. Puton T.; Kozlowski L.; Tuszynska I.; Rother K.; Bujnicki J. M. Computational Methods for Prediction of Protein–RNA Interactions. J. Struct. Biol. 2012, 179 (3), 261–268. 10.1016/j.jsb.2011.10.001. [DOI] [PubMed] [Google Scholar]
  3. Nithin C.; Ghosh P.; Bujnicki J. M. Bioinformatics Tools and Benchmarks for Computational Docking and 3D Structure Prediction of RNA-Protein Complexes. Genes 2018, 9 (9), 432. 10.3390/genes9090432. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Zafferani M.; Hargrove A. E. Small Molecule Targeting of Biologically Relevant RNA Tertiary and Quaternary Structures. Cell Chemical Biology 2021, 28 (5), 594–609. 10.1016/j.chembiol.2021.03.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Falese J. P.; Donlic A.; Hargrove A. E. Targeting RNA with Small Molecules: From Fundamental Principles towards the Clinic. Chem. Soc. Rev. 2021, 50 (4), 2224–2243. 10.1039/D0CS01261K. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Cooper T. A.; Wan L.; Dreyfuss G. RNA and Disease. Cell 2009, 136 (4), 777–793. 10.1016/j.cell.2009.02.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Lukong K. E.; Chang K.; Khandjian E. W.; Richard S. RNA-Binding Proteins in Human Genetic Disease. Trends in Genetics 2008, 24 (8), 416–425. 10.1016/j.tig.2008.05.004. [DOI] [PubMed] [Google Scholar]
  8. Gerstberger S.; Hafner M.; Tuschl T. A Census of Human RNA-Binding Proteins. Nat. Rev. Genet 2014, 15 (12), 829–845. 10.1038/nrg3813. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Tuszynska I.; Matelska D.; Magnus M.; Chojnowski G.; Kasprzak J. M.; Kozlowski L. P.; Dunin-Horkawicz S.; Bujnicki J. M. Computational Modeling of Protein–RNA Complex Structures. Methods 2014, 65 (3), 310–319. 10.1016/j.ymeth.2013.09.014. [DOI] [PubMed] [Google Scholar]
  10. Chaput L.; Mouawad L. Efficient Conformational Sampling and Weak Scoring in Docking Programs? Strategy of the Wisdom of Crowds. J. Cheminf. 2017, 9 (1), 37. 10.1186/s13321-017-0227-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Jones S. Protein–RNA Interactions: Structural Biology and Computational Modeling Techniques. Biophys Rev. 2016, 8 (4), 359–367. 10.1007/s12551-016-0223-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Qiu L.; Zou X. Scoring Functions for Protein-RNA Complex Structure Prediction: Advances, Applications, and Future Directions. Comm. Inf. Syst. 2020, 20 (1), 1–22. 10.4310/CIS.2020.v20.n1.a1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Xiao W.; Wang D.; Shen Z.; Li S.; Li H. Multi-Body Interactions in Molecular Docking Program Devised with Key Water Molecules in Protein Binding Sites. Molecules 2018, 23 (9), 2321. 10.3390/molecules23092321. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Fingerhut B. P. The Mutual Interactions of RNA, Counterions and Water–Quantifying the Electrostatics at the Phosphate–Water Interface. Chem. Commun. 2021, 57 (96), 12880–12897. 10.1039/D1CC05367A. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Do T. N.; Carloni P.; Varani G.; Bussi G. RNA/Peptide Binding Driven by Electrostatics - Insight from Bidirectional Pulling Simulations. J. Chem. Theory Comput. 2013, 9 (3), 1720–1730. 10.1021/ct3009914. [DOI] [PubMed] [Google Scholar]
  16. Lazim R.; Suh D.; Choi S. Advances in Molecular Dynamics Simulations and Enhanced Sampling Methods for the Study of Protein Systems. International Journal of Molecular Sciences 2020, 21 (17), 6339. 10.3390/ijms21176339. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Cunha R. A.; Bussi G. Unraveling Mg2+–RNA Binding with Atomistic Molecular Dynamics. RNA 2017, 23 (5), 628–638. 10.1261/rna.060079.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Zhou J.-K.; Yang D.-Y.; Sheu S.-Y. The Molecular Mechanism of Ligand Unbinding from the Human Telomeric G-Quadruplex by Steered Molecular Dynamics and Umbrella Sampling Simulations. Phys. Chem. Chem. Phys. 2015, 17 (19), 12857–12869. 10.1039/C5CP00378D. [DOI] [PubMed] [Google Scholar]
  19. Gao Y.; Cao D.; Pawnikar S.; John K. P.; Ahn H. M.; Hill S.; Ha J. M.; Parikh P.; Ogilvie C.; Swain A.; Yang A.; Bell A.; Salazar A.; Miao Y.; Liang B. Structure of the Human Respiratory Syncytial Virus M2–1 Protein in Complex with a Short Positive-Sense Gene-End RNA. Structure 2020, 28 (9), 979–990.e4. 10.1016/j.str.2020.07.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Grasso G.; Deriu M. A.; Patrulea V.; Borchard G.; Möller M.; Danani A. Free Energy Landscape of siRNA-Polycation Complexation: Elucidating the Effect of Molecular Geometry, Polymer Flexibility, and Charge Neutralization. PLoS One 2017, 12 (10), e0186816 10.1371/journal.pone.0186816. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Pavan M.; Menin S.; Bassani D.; Sturlese M.; Moro S. Qualitative Estimation of Protein-Ligand Complex Stability through Thermal Titration Molecular Dynamics Simulations. J. Chem. Inf. Model. 2022, 62 (22), 5715–5728. 10.1021/acs.jcim.2c00995. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Pavan M.; Menin S.; Bassani D.; Sturlese M.; Moro S. Implementing a Scoring Function Based on Interaction Fingerprint for Autogrow4: Protein Kinase CK1δ as a Case Study. Front. Mol. Biosci. 2022, 9, 909499 10.3389/fmolb.2022.909499. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Dodaro A.; Pavan M.; Menin S.; Salmaso V.; Sturlese M.; Moro S. Thermal Titration Molecular Dynamics (TTMD): Shedding Light on the Stability of RNA-Small Molecule Complexes. Front. Mol. Biosci. 2023, 10, 1294543 10.3389/fmolb.2023.1294543. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Berman H. M.; Westbrook J.; Feng Z.; Gilliland G.; Bhat T. N.; Weissig H.; Shindyalov I. N.; Bourne P. E. The Protein Data Bank. Nucleic Acids Res. 2000, 28 (1), 235–242. 10.1093/nar/28.1.235. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Molecular Operating Environment (MOE), 2022.02, C. C. G. Molecular Operating Environment (MOE), 2022. https://www.chemcomp.com/Research-Citing_MOE.htm. [Google Scholar]
  26. Honorato R. V.; Trellet M. E.; Jiménez-García B.; Schaarschmidt J. J.; Giulini M.; Reys V.; Koukos P. I.; Rodrigues J. P. G. L. M.; Karaca E.; van Zundert G. C. P.; Roel-Touris J.; van Noort C. W.; Jandová Z.; Melquiond A. S. J.; Bonvin A. M. J. J. The HADDOCK2.4 Web Server for Integrative Modeling of Biomolecular Complexes. Nat. Protoc. 2024, 19, 3219–3241. 10.1038/s41596-024-01011-0. [DOI] [PubMed] [Google Scholar]
  27. Yan Y.; Tao H.; He J.; Huang S.-Y. The HDOCK Server for Integrated Protein–Protein Docking. Nat. Protoc 2020, 15 (5), 1829–1852. 10.1038/s41596-020-0312-x. [DOI] [PubMed] [Google Scholar]
  28. Charitou V.; van Keulen S. C.; Bonvin A. M. J. J. Cyclization and Docking Protocol for Cyclic Peptide–Protein Modeling Using HADDOCK2.4. J. Chem. Theory Comput. 2022, 18 (6), 4027–4040. 10.1021/acs.jctc.2c00075. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Ribeiro J.; Ríos-Vera C.; Melo F.; Schüller A. Calculation of Accurate Interatomic Contact Surface Areas for the Quantitative Analysis of Non-Bonded Molecular Interactions. Bioinformatics 2019, 35 (18), 3499–3501. 10.1093/bioinformatics/btz062. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Humphrey W.; Dalke A.; Schulten K. VMD: Visual Molecular Dynamics. J. Mol. Graphics 1996, 14 (1), 33–38. 10.1016/0263-7855(96)00018-5. [DOI] [PubMed] [Google Scholar]
  31. Case D. A.; Cheatham T. E.; Darden T.; Gohlke H.; Luo R.; Merz K. M.; Onufriev A.; Simmerling C.; Wang B.; Woods R. J. The Amber Biomolecular Simulation Programs. J. Comput. Chem. 2005, 26 (16), 1668–1688. 10.1002/jcc.20290. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Case D. A.AmberTools22, 2022.
  33. Pérez A.; Marchán I.; Svozil D.; Sponer J.; Cheatham T. E.; Laughton C. A.; Orozco M. Refinement of the AMBER Force Field for Nucleic Acids: Improving the Description of α/γ Conformers. Biophys. J. 2007, 92 (11), 3817–3829. 10.1529/biophysj.106.097782. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Zgarbová M.; Otyepka M.; Šponer J.; Mládek A.; Banáš P.; Cheatham T. E. I.; Jurečka P. Refinement of the Cornell et al. Nucleic Acids Force Field Based on Reference Quantum Chemical Calculations of Glycosidic Torsion Profiles. J. Chem. Theory Comput. 2011, 7 (9), 2886–2902. 10.1021/ct200162x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Maier J. A.; Martinez C.; Kasavajhala K.; Wickstrom L.; Hauser K. E.; Simmerling C. ff14SB: Improving the Accuracy of Protein Side Chain and Backbone Parameters from ff99SB. J. Chem. Theory Comput 2015, 11 (8), 3696–3713. 10.1021/acs.jctc.5b00255. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Jorgensen W. L.; Chandrasekhar J.; Madura J. D.; Impey R. W.; Klein M. L. Comparison of Simple Potential Functions for Simulating Liquid Water. J. Chem. Phys. 1983, 79 (2), 926–935. 10.1063/1.445869. [DOI] [Google Scholar]
  37. Faller R.; de Pablo J. J. Constant Pressure Hybrid Molecular Dynamics–Monte Carlo Simulations. J. Chem. Phys. 2002, 116 (1), 55–59. 10.1063/1.1420460. [DOI] [Google Scholar]
  38. Davidchack R. L.; Handel R.; Tretyakov M. V. Langevin Thermostat for Rigid Body Dynamics. J. Chem. Phys. 2009, 130 (23), 234101. 10.1063/1.3149788. [DOI] [PubMed] [Google Scholar]
  39. Harvey M. J.; Giupponi G.; Fabritiis G. D. ACEMD: Accelerating Biomolecular Dynamics in the Microsecond Time Scale. J. Chem. Theory Comput. 2009, 5 (6), 1632–1639. 10.1021/ct9000685. [DOI] [PubMed] [Google Scholar]
  40. Eastman P.; Swails J.; Chodera J. D.; McGibbon R. T.; Zhao Y.; Beauchamp K. A.; Wang L. P.; Simmonett A. C.; Harrigan M. P.; Stern C. D.; Wiewiora R. P.; Brooks B. R.; Pande V. S. OpenMM 7: Rapid Development of High Performance Algorithms for Molecular Dynamics. PLOS Computational Biology 2017, 13 (7), e1005659 10.1371/journal.pcbi.1005659. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Kräutler V.; van Gunsteren W. F.; Hünenberger P. H. A Fast SHAKE Algorithm to Solve Distance Constraint Equations for Small Molecules in Molecular Dynamics Simulations. J. Comput. Chem. 2001, 22 (5), 501–508. . [DOI] [Google Scholar]
  42. Essmann U.; Perera L.; Berkowitz M. L.; Darden T.; Lee H.; Pedersen L. G. A Smooth Particle Mesh Ewald Method. J. Chem. Phys. 1995, 103 (19), 8577–8593. 10.1063/1.470117. [DOI] [Google Scholar]
  43. Menin S.; Pavan M.; Salmaso V.; Sturlese M.; Moro S. Thermal Titration Molecular Dynamics (TTMD): Not Your Usual Post-Docking Refinement. Int. J. Mol. Sci. 2023, 24 (4), 3596. 10.3390/ijms24043596. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Gowers R. J.; Linke M.; Barnoud J.; Reddy T. J. E.; Melo M. N.; Seyler S. L.; Domański J.; Dotson D. L.; Buchoux S.; Kenney I. M.; Beckstein O.. MDAnalysis: A Python Package for the Rapid Analysis of Molecular Dynamics Simulations. In Proceedings of the 15th Python in Science Conference; Los Alamos National Laboratory: 2016; pp 98–105. [Google Scholar]
  45. Michaud-Agrawal N.; Denning E. J.; Woolf T. B.; Beckstein O. MDAnalysis: A Toolkit for the Analysis of Molecular Dynamics Simulations. J. Comput. Chem. 2011, 32 (10), 2319–2327. 10.1002/jcc.21787. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Bouysset C.; Fiorucci S. ProLIF: A Library to Encode Molecular Interactions as Fingerprints. J. Cheminf.s 2021, 13 (1), 72. 10.1186/s13321-021-00548-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Barnaba: software for analysis of nucleic acid structures and trajectories. https://rnajournal.cshlp.org/content/25/2/219.long (accessed 2024-11-08). [DOI] [PMC free article] [PubMed]
  48. Faber C.; Schärpf M.; Becker T.; Sticht H.; Rösch P. The Structure of the Coliphage HK022 Nun Protein-λ-Phage boxB RNA Complex. J. Biol. Chem. 2001, 276 (34), 32064–32070. 10.1074/jbc.M102975200. [DOI] [PubMed] [Google Scholar]
  49. Chavali S. S.; Mali S. M.; Jenkins J. L.; Fasan R.; Wedekind J. E. Co-Crystal Structures of HIV TAR RNA Bound to Lab-Evolved Proteins Show Key Roles for Arginine Relevant to the Design of Cyclic Peptide TAR Inhibitors. J. Biol. Chem. 2020, 295 (49), 16470–16486. 10.1074/jbc.RA120.015444. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Davlieva M.; Donarski J.; Wang J.; Shamoo Y.; Nikonowicz E. P. Structure Analysis of Free and Bound States of an RNA Aptamer against Ribosomal Protein S8 from Bacillus Anthracis. Nucleic Acids Res. 2014, 42 (16), 10795–10808. 10.1093/nar/gku743. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Battiste J. L.; Mao H.; Rao N. S.; Tan R.; Muhandiram D. R.; Kay L. E.; Frankel A. D.; Williamson J. R. α Helix-RNA Major Groove Recognition in an HIV-1 Rev Peptide-RRE RNA Complex. Science 1996, 273 (5281), 1547–1551. 10.1126/science.273.5281.1547. [DOI] [PubMed] [Google Scholar]
  52. Shortridge M. D.; Walker M. J.; Pavelitz T.; Chen Y.; Yang W.; Varani G. A Macrocyclic Peptide Ligand Binds the Oncogenic MicroRNA-21 Precursor and Suppresses Dicer Processing. ACS Chem. Biol. 2017, 12 (6), 1611–1620. 10.1021/acschembio.7b00180. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Uc-Mass A.; Khodursky A.; Brown L.; Gottesman M. E. Overexpression of Phage HK022 Nun Protein Is Toxic for Escherichia Coli. J. Mol. Biol. 2008, 380 (5), 812–819. 10.1016/j.jmb.2008.05.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Zhang X.; Lee S. W.; Zhao L.; Xia T.; Qin P. Z. Conformational Distributions at the N-Peptide/boxB RNA Interface Studied Using Site-Directed Spin Labeling. RNA 2010, 16 (12), 2474–2483. 10.1261/rna.2360610. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Watnick R. S.; Herring S. C.; Palmer A. G.; Gottesman M. E. The Carboxyl Terminus of Phage HK022 Nun Includes a Novel Zinc-Binding Motif and a Tryptophan Required for Transcription Termination. Genes Dev. 2000, 14 (6), 731–739. 10.1101/gad.14.6.731. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Sadowski I.; Hashemi F. B. Strategies to Eradicate HIV from Infected Patients: Elimination of Latent Provirus Reservoirs. Cell. Mol. Life Sci. 2019, 76 (18), 3583–3600. 10.1007/s00018-019-03156-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Van Lint C.; Bouchat S.; Marcello A. HIV-1 Transcription and Latency: An Update. Retrovirology 2013, 10 (1), 67. 10.1186/1742-4690-10-67. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Alanazi A.; Ivanov A.; Kumari N.; Lin X.; Wang S.; Kovalskyy D.; Nekhai S. Targeting Tat–Tar Rna Interaction for Hiv-1 Inhibition. Viruses 2021, 13 (10), 2004. 10.3390/v13102004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Mousseau G.; Mediouni S.; Valente S. T.Targeting HIV Transcription: The Quest for a Functional Cure. In The Future of HIV-1 Therapeutics: Resistance Is Futile?; Torbett B. E.; Goodsell D. S.; Richman D. D., Eds.; Springer International Publishing: Cham, 2015; pp 121–145. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Donahue D. A.; Kuhl B. D.; Sloan R. D.; Wainberg M. A. The Viral Protein Tat Can Inhibit the Establishment of HIV-1 Latency. Journal of Virology 2012, 86 (6), 3253–3263. 10.1128/JVI.06648-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Pai S.; Mudgal J.; Kamath B. V.; Pai K. S. R. An Insight on Promising Strategies Hoping to Cure HIV-1 Infection by Targeting Rev Protein—Short Review. Pharmacol. Rep 2021, 73 (5), 1265–1272. 10.1007/s43440-021-00257-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Fernandes J.; Jayaraman B.; Frankel A. The HIV-1 Rev Response Element: An RNA Scaffold That Directs the Cooperative Assembly of a Homo-Oligomeric Ribonucleoprotein Complex. RNA Biology 2012, 9 (1), 6–11. 10.4161/rna.9.1.18178. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Sherpa C.; Rausch J. W.; Le Grice S. F. J.; Hammarskjold M.-L.; Rekosh D. The HIV-1 Rev Response Element (RRE) Adopts Alternative Conformations That Promote Different Rates of Virus Replication. Nucleic Acids Res. 2015, 43 (9), 4676–4686. 10.1093/nar/gkv313. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Rausch J. W.; Grice S. F. J. L. HIV Rev Assembly on the Rev Response Element (RRE): A Structural Perspective. Viruses 2015, 7 (6), 3053–3075. 10.3390/v7062760. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Moine H.; Cachia C.; Westhof E.; Ehresmann B.; Ehresmann C. The RNA Binding Site of S8 Ribosomal Protein of Escherichia Coli: Selex and Hydroxyl Radical Probing Studies. RNA 1997, 3 (3), 255–268. [PMC free article] [PubMed] [Google Scholar]
  66. Davies C.; Ramakrishnan V.; White S. W. Structural Evidence for Specific S8–RNA and S8–Protein Interactions within the 30S Ribosomal Subunit: Ribosomal Protein S8 from Bacillus Stearothermophilus at 1.9 å Resolution. Structure 1996, 4 (9), 1093–1104. 10.1016/S0969-2126(96)00115-3. [DOI] [PubMed] [Google Scholar]
  67. Pavan M.; Bassani D.; Sturlese M.; Moro S.. Investigating RNA–Protein Recognition Mechanisms through Supervised Molecular Dynamics (SuMD) Simulations. NAR Genomics and Bioinformatics 2022, 4 ( (4), ). 10.1093/nargab/lqac088. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Huang Y.; Yang Y. B.; Zhang X. H.; Yu X. L.; Wang Z. B.; Cheng X. C. MicroRNA-21 Gene and Cancer. Med. Oncol 2013, 30 (1), 376. 10.1007/s12032-012-0376-8. [DOI] [PubMed] [Google Scholar]
  69. Medley J. C.; Panzade G.; Zinovyeva A. Y. microRNA Strand Selection: Unwinding the Rules. WIREs RNA 2021, 12 (3), e1627 10.1002/wrna.1627. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Iwakawa H.; Tomari Y. Life of RISC: Formation, Action, and Degradation of RNA-Induced Silencing Complex. Mol. Cell 2022, 82 (1), 30–43. 10.1016/j.molcel.2021.11.026. [DOI] [PubMed] [Google Scholar]
  71. Novello G.; Dodaro A.; Menin S.; Cavastracci C.; Sturlese M.; Salmaso V.; Stefano M. A Second Life for the Crystallographic Structure of Berenil-Dodecanucleotide Complex: A Computational Revisitation Thirty Years after Its Publication. Med. Chem. Res. 2024, 33, 2101. 10.1007/s00044-024-03299-9. [DOI] [Google Scholar]
  72. Bottaro S.; Di Palma F.; Bussi G. The Role of Nucleobase Interactions in RNA Structure and Dynamics. Nucleic Acids Res. 2014, 42 (21), 13306–13314. 10.1093/nar/gku972. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Zhang Y.; Vass M.; Shi D.; Abualrous E.; Chambers J. M.; Chopra N.; Higgs C.; Kasavajhala K.; Li H.; Nandekar P.; Sato H.; Miller E. B.; Repasky M. P.; Jerome S. V. Benchmarking Refined and Unrefined AlphaFold2 Structures for Hit Discovery. J. Chem. Inf. Model. 2023, 63 (6), 1656–1667. 10.1021/acs.jcim.2c01219. [DOI] [PubMed] [Google Scholar]
  74. Salmaso V.; Moro S. Bridging Molecular Docking to Molecular Dynamics in Exploring Ligand-Protein Recognition Process: An Overview. Front. Pharmacol. 2018, 9, 923. 10.3389/fphar.2018.00923. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

ci4c01393_si_001.pdf (46.7MB, pdf)
ci4c01393_si_002.mp4 (11.3MB, mp4)
ci4c01393_si_003.mp4 (11.9MB, mp4)
ci4c01393_si_004.mp4 (11.7MB, mp4)
ci4c01393_si_005.mp4 (12.4MB, mp4)

Data Availability Statement

All molecular structures utilized in this work have been retrieved from the publicly available PDB. The structure of the SuMD simulation last frame of 4PDB, all the TTMD simulations, and the configuration files are available at 10.5281/zenodo.14056191. TTMD Python code to reproduce the simulations performed in this work is available at https://github.com/molecularmodelingsection/TTMD and released under a permissive MIT license.


Articles from Journal of Chemical Information and Modeling are provided here courtesy of American Chemical Society

RESOURCES