Abstract
Long noncoding RNAs (lncRNAs) are increasingly recognized as druggable targets due to their conserved secondary/tertiary structures and regulatory roles in disease. A prototypical example is the MALAT1 triple helix, whose stability supports transcript persistence and is implicated in oncogenesis. Here, we evaluate the ability of a structure-based drug discovery (SBDD) pipeline, integrating molecular dynamics (MD), pocket analysis, ensemble docking, and diverse scoring functions, to capture the binding behavior of 21 congeneric diminazene-based ligands targeting MALAT1. Conformational ensembles were generated using both conventional MD and Hamiltonian Replica Exchange MD, revealing two potential binding sites. Ensemble docking with AutoDock GPU and rDock across representative RNA conformations, followed by rescoring with force-field and machine-learning-based scoring functions, led to the identification of a binding mode with the best agreement across the series. Principal component analysis of interaction fingerprints within clustered poses was used to explain the experimentally observed affinity trends. Our findings highlight the promise and limitations of current SBDD pipelines for flexible RNA targets and offer a benchmark for future improvement in RNA-focused drug discovery.


Introduction
The organization of the eukaryotic genome is highly complex, with nearly 98% of the human genome consisting of noncoding sequences. , Once dismissed as “junk DNA”, these regions are now recognized as a reservoir of functional elements, including multiple classes of noncoding RNAs (ncRNAs). − Long noncoding RNAs (lncRNAs), which exceed 200 nucleotides, represent a highly heterogeneous class of regulatory molecules. − lncRNAs often display greater conservation of secondary and tertiary structure than primary sequence, highlighting their functional relevance but also complicating experimental structure determination even with emerging techniques such as Cryo-EM. As the most abundant class of ncRNAs, lncRNAs participate in essential biological processes such as epigenetic regulation, transcriptional control, X chromosome inactivation, , genomic imprinting, , and cell development, and their dysregulation is strongly associated with disease, particularly cancer.
A key example of the pharmacological relevance of lncRNAs is the metastasis-associated lung adenocarcinoma transcript 1 (MALAT1). Its overexpression has been reported across multiple cancer types, and its knockdown reduces oncogenic phenotypes. − In particular, MALAT1 plays important roles in metastatic processes ,,, and in the development of resistance to several chemotherapeutic agents, − thereby contributing to poor clinical outcomes. Despite extensive study, the molecular mechanisms and interaction partners underlying MALAT1-mediated regulation remain incompletely understood. MALAT1 is an attractive target for small-molecule intervention due to its well-characterized 3′-terminal triple helix, which is essential for transcript stability (Figure A). − This motif, functionally characterized in cell-based assays and resolved through X-ray crystallography, protects the RNA from degradation by sequestering its adenine-rich 3′-tail via base pairing with a uridine-rich stem loop. Recent studies further indicate that local conformational dynamics modulate this protective mechanism, as increased flexibility or unfavorable ionic conditions destabilize the triplex, exposing MALAT1 to exonucleolytic degradation. Because small molecules can reshape RNA stability and conformational preference, − this structural element represents a compelling therapeutic entry point.
1.
MALAT1 RNA triple helix and the diminazene series of ligands. (A) Secondary (left) and tertiary (right, PDB ID: 4PLX) structure of the MALAT1 triplex, with the poly(A) tail highlighted in light purple. (B) The data set of small molecule ligands with diminazene scaffold used in this work, and corresponding experimental relative binding affinities in μM (right panel). Relative affinity is expressed as CD50 (small molecule concentration needed to achieve 50% competitive displacement); molecules labeled as out of range (O.R.) indicate small molecules with CD50 values outside of the range that lead to minimal dye displacement in the assays (>500 μM). The position of the substituents on the central scaffold (ortho, O, meta, M, or para, P), and the ID of the substituent compose the ligand ID.
Structure-based drug discovery (SBDD) is widely used to accelerate the identification and optimization of small-molecule ligands. However, most computational SBDD tools were developed for proteins, reflecting the relatively recent recognition of RNA as a druggable target. − As a result, RNA-focused applications remain comparatively underexplored. ,,− Benchmarking studies have produced mixed conclusions: while RNA-specific tools such as rDock can outperform protein-oriented programs in pose prediction, broader analysis often favor protein-derived methods, including AutoDock GPU and AutoDock VINA among others, in overall performances. Moreover, no docking protocol has shown consistent success across enrichment metrics in RNA virtual screening campaigns. Crucially, these benchmarking studies have largely overlooked the challenge of generating binding-competent RNA conformations and identifying transient ligandable pockets. The recent disclosure of a library of 21 congeneric diminazene-based ligands targeting the MALAT1 triple helix, along with their experimentally measured affinities (Figure B), provides an ideal benchmark for evaluating current SBDD strategies against RNA targets. MALAT1 thus exemplifies the complexity of a real-world scenario, characterized by pronounced conformational heterogeneity and an unknown ligand binding mode and site.
In this work, we evaluate the applicability of SBDD frameworks based on molecular docking and Molecular Dynamics (MD) simulations to the MALAT1 triplex (Figure ). We employ conventional MD and Hamiltonian Replica Exchange (HREX)-MD to extensively explore the RNA’s conformational space − and identify potential ligand binding sites. Ensemble docking of the 21-ligand library is then performed against representative MALAT1 conformations using AutoDock GPU and rDock. Finally, we evaluate docking poses using force field- (AutoDock, AutoDock Vina, rDock) and machine learning-based (AnnapuRNA, SPRank) scoring functions to determine the ability of each protocol to recapitulate the experimentally observed variation in binding affinity and binding mode consistency across the library. This integrated approach reveals two persistent and ligand-accessible binding sites, one of which featuring a highly consistent binding mode across the ligand series, providing insight into affinity trends. Despite limited scoring accuracy, our results uncover meaningful structure–affinity trends and highlight key challenges and opportunities in RNA-targeted drug discovery.
2.
Computational pipeline established in this work. Starting from the crystal structure, standard and advanced MD simulations are employed to identify promising binding regions and to explore the RNA conformational dynamics, respectively. The information is integrated to generate a conformational ensemble of the identified binding regions, which is then used for ensemble docking of the diminazene library of compounds; ligand poses at this stage are produced using the AutoDock 4 and rDock software. Finally, the binding poses are rescored using diverse scoring functions (AutoDock 4, rDock, Vina, SPRank, and AnnapuRNA), and postprocessing of the outcomes allows rationalization of the results.
Methods
System Preparation
To perform molecular dynamics (MD) simulations, the RNA target structure (PDB ID: 4PLX) was modeled using the Amber force field for nucleic acids, comprising the Amber f99SB force field and the bsc0 and χOL3 refinements.
The system was solvated using the four-point OPC water model, while potassium and chloride ions were added to neutralize the system and achieve a physiological salt concentration of 150 mM KCl using the Joung and Cheatham ion model. While these conditions differ from those employed in the indicator displacement assay to determine ligand affinities (52 mM KCl and 0.1 mM MgCl2), the ionic concentration was chosen to approximate physiological conditions and thus promote the sampling of biologically relevant RNA conformational states. Magnesium ions were not explicitly included in our simulations due to the limited size of the simulation box. Subsequently, the system underwent energy minimization followed by equilibration in both the NVT and NPT ensembles, with a total simulation time of 1.2 ns (600 ps for each ensemble) with a time step of 2 fs. Covalent bonds involving hydrogen atoms were constrained using the LINCS algorithm, , while long-range electrostatic interactions were treated with the Particle Mesh Ewald (PME) method, , using a cutoff distance of 1.2 nm.
For conventional MD simulations in the NPT ensemble, the final equilibrated system was used as the starting point. For Hamiltonian Replica Exchange (HREX) simulations, the trajectory frame with volume closest to the mean from the second half of the NPT equilibration was used as initial configuration for production in the NVT ensemble. This choice mitigated artifacts from box size fluctuations that could lead to abnormal pressure values during subsequent HREX MD simulations performed under NVT conditions.
Molecular Dynamics Simulations
Following system preparation and equilibration, production MD simulations were conducted on the MALAT1 RNA system, namely conventional MD and HREX MD. Conventional MD simulations were run in the NPT ensemble at 300 K and 1 bar, maintained by the V-rescale thermostat and C-rescale barostat, respectively. Three independent replicates, each lasting 500 ns long, were performed.
To enhance the conformational sampling of the RNA molecule, we employed the HREX scheme, which involves simulating multiple replicas of the systems with modified Hamiltonians where charges, Lennard–Jones parameters, and dihedral potentials of a predefined region of the system are gradually scaled. In this study, 16 replicas were used with the scaling parameter λ ranging from 1 to 0.7, applied specifically to residues 55 to 76 of the poly(A) in the triple helix (Figure A). HREX simulations were run in the NVT ensemble at 300 K using the V-rescale thermostat. Each replica was simulated for approximately 96 ns, with exchange attempts every 240 steps. The trajectory from the “cold” replica (λ = 1) was extracted and processed for further analysis.
The conformational variability of the poly(A) tail of the triple helix was inspected in both the conventional and HREX MD simulations, focusing on the “coldest” replica in the latter case, by computing the RMSD, aligning and computing on the RNA heavy atoms, and eRMSD, based on nucleotide g-vectors, with respect to the starting structure. While the RMSD measures the overall structural deviation, the eRMSD is specifically designed for RNA and reflects base-pairing rearrangements, with values above 0.7–0.8 typically indicating discrepancy in the base-pairing pattern with respect to the reference..
Pocket Identification
The three trajectories from conventional MD of the RNA were aggregated and analyzed with the Pocketron algorithm to identify dynamic pocket formation and communication. , Pocketron, included in the BiKi Life Sciences software package, tracks the dynamic evolution of binding pockets during MD simulations, using the NanoShaper , algorithm to detect pockets on a frame-by-frame basis. The approach leverages solvent-excluded surface calculations with two probe sizes, and we used here the default radii of 1.4 Å and 3.5 Å for the smaller and larger probes, respectively. To focus on pockets with potential relevance for small-molecule binding and long-range allosteric communication, only cavities larger than the volume of three water molecules were tracked. This filtering step excludes highly transient or solvent-exposed pockets that are unlikely to accommodate ligands. The analysis allows identifying pockets exhibiting the largest volume, highest persistence, and significant long-range communication.
Conformational Ensemble Preparation
The HREX-MD simulation was analyzed to extract a representative ensemble of conformations for subsequent docking studies. The conformational variability of the regions identified as promising binding sites, based on the pocket analysis on the conventional MD runs, was evaluated in the “coldest” replica (λ = 1) of the HREX. We first determined the RMSD, aligning and computing on the RNA heavy atoms, and eRMSD, based on nucleotide g-vectors, with respect to the starting structure.
Principal Component Analysis (PCA) was performed via the scikit-learn Python package, using as input features the g-vectors of residues comprised in the two selected binding regions. Finally, to identify representative conformations of the two sites from the HREX-MD simulation, we applied the Quality Threshold (QT) clustering algorithm to the eRMSD matrix of the frames from the coldest replica, using a cutoff of 0.7 to group similar structures and extracting centroids from each of the identified cluster. We then retained cluster centroids with eRMSD > 0.7 with respect to all other cluster centroids in order to maximize the structural variability of the ensembles. The procedure was repeated separately for the two binding regions identified.
Docking Pose Generation
Docked poses of the 21 diminazene analogs were generated using two software packages, namely AutoDock GPU and rDock, with default settings. Both software require dedicated preparation of the biomolecular target and the ligands, using a united-atom representation with explicit hydrogens on polar heavy atoms only. A notable difference between the two programs is the grid definition for docking. In both software, the grid was centered on the center of mass of the heavy atoms of the residues comprised in the binding sites. In AutoDock GPU, a rectangular grid needs to be defined (Figure S1), for which we employed 110 × 110 × 130 points for the first site and 110 × 80 × 80 points for the second one, with a grid spacing of 0.3 Å between points. These dimensions were chosen to ensure complete coverage of the binding regions identified through Pocketron. Differently, rDock employs a two-sphere approach for grid generation. Here, we used a radius of 17 Å for both sites, with a small probe of 1 Å and a large probe of 17 Å. This aimed at creating a spherical grid encompassing about the dimensions of the AutoDock one, to ensure comparable volumes allowed for pose generation via the two docking software. For each of the 21 ligands, we generated 250 poses with each software.
Pose Rescoring
To evaluate the generated ligand binding poses on the RNA target, we applied multiple scoring functions, encompassing both force field-based and machine learning-based approaches. Specifically, as force-field-based methods, we employed the scoring functions from AutoDock, rDock, and Vina. On the machine learning side, we used the recently developed AnnapuRNA and SPRank scoring functions. AnnapuRNA leverages a coarse-grained representation of RNA and ligand pharmacophores, which is used to determine RNA-ligand interaction scores using a trained ML model. SPRank refines knowledge-based pairwise potentials iteratively against experimental RNA-ligand structures to discriminate correct poses.
The Vina, AnnapuRNA, and SPRank scoring functions can be directly applied to the output poses from AutoDock GPU and rDock after minor formatting conversion. However, artifacts arose when rescoring some of the poses generated by rDock with the AutoDock GPU scoring function, evidenced by high score values. To address this issue, the grid box for both binding regions was extended to 130 × 130 × 130 points in the three dimensions during the rescoring phase. For consistency, also the poses generated by AutoDock GPU were rescored using this grid definition.
Interaction Fingerprints
To characterize RNA-ligand interactions in the generated binding poses, we computed interaction fingerprints via the fingeRNAt Python package. The tool analyzes complexes between nucleic acids and ligands to generate binary vectors indicating presence or absence of specific noncovalent interactions, namely hydrogen bonds, halogen bonds, lipophilic interactions, cation–anion, π–stacking, π–cation, and π–anion. The software inspects the residues in nucleic acid structures using geometric criteria, and the analysis supports three modes: SIMPLE, for detecting basic atomic contacts; PBS, for distinguishing interactions involving phosphate, base, and sugar moieties of nucleic acids; FULL, for classifying the interactions into specific categories. Herein, we applied fingeRNAt to the RNA-ligand binding poses using the FULL method. For every pose, histograms of interaction occurrences were computed at distances ranging from 2 to 8 Å with 80 bins. Additionally, PCA was applied on the interaction fingerprints, using bin populations as input features.
Clustering of the Binding Poses
Cluster analysis of the binding poses was achieved through the Quality Threshold (QT) clustering algorithm applied to the minimum distance RMSD matrix with a threshold of 5 Å. The RMSD matrix was computed focusing on the common diminazene scaffold of the 21 compounds. For proper clustering of the generated ligand binding poses, structural symmetry must be taken into account. To this end, RMSD values between pose pairs were computed on the Cartesian coordinates of the ligand scaffold both directly and after applying symmetry transformations, and the minimum RMSD value was retained. This was particularly important for para-substituted ligands with symmetrical side chains (Figure S2).
Stability of Docking Poses
Standard MD simulations were performed on the docking complexes of compounds O5, P0, and P9 from cluster 3 of Site 2. The systems were parametrized with the OPLS4 force field and minimized with MacroModel, treating the ligands and the surrounding nucleic bases within 5 Å as flexible. The complexes were solvated in explicit TIP3P water using orthorhombic simulation boxes. Potassium and chloride ions were added to neutralize the systems and achieve salt concentrations of 52 and 150 mM. Following 100 ps of Brownian dynamics with positional restraints applied to the solute atoms, the systems were heated to 300 K in the NVT ensemble over 1 ns. This was followed by a gradual release of restraints over 1.5 ns in the NPT ensemble. Prior to production, an additional 5 ns NPT equilibration stage was performed with weak positional restraints of 0.5 kcal mol–1 Å–2 applied to the phosphate atoms and ligand heavy atoms. Production runs were carried out in the NPT ensemble without restraints, using the Nosé-Hoover thermostat , and Martyna–Tuckerman–Klein barostat. For each ligand and ionic condition, five independent replicates of 200 ns each were performed, resulting in 30 MD runs for an aggregate total of 6 μs. Covalent bonds involving hydrogen atoms were constrained using the M-SHAKE algorithm. Short-range electrostatic interactions were truncated at 9 Å, whereas long-range electrostatics were treated using the PME method. A RESPA integrator was employed with a 2 fs time step for bonded and short-range nonbonded interactions, while long-range electrostatics were updated every 6 fs. All simulations were performed using Desmond v8.2.
Results and Discussion
Sampling the Conformational Space of the MALAT1 Triple Helix
Generating an ensemble of drug-target conformations by MD simulations for subsequent virtual screening campaigns is a well-established SBDD approach. Motivated by the need to explore MALAT1’s conformational space and identify suitable binding pockets, we carried out simulations starting from the available X-ray structure (PDB ID: 4PLX). Three independent conventional MD simulations alongside a single HREX-MD simulation were performed. While conventional MD simulations can be limited in their ability to efficiently sample the complex conformational landscape of biomolecular systems, this limitation is expected to be particularly pronounced for RNA targets due to their intrinsic flexibility. ,, Following this rationale, and within a realistic-case scenario in which binding sites and their degree of preorganization are not known a priori, HREX-MD was employed under the assumption that an enhanced sampling protocol would enable a more exhaustive exploration of the conformational space, thereby improving the detection and characterization of transient and alternative binding pockets. The triplicate conventional simulations then served as a baseline for quantifying the benefit of this enhanced sampling protocol.
Figure A illustrates the conformational dynamics of the poly(A) tail under conventional MD sampling. In particular, the RMSD analysis reveals significant structural fluctuations, especially in Run 1, where RMSD values reach up to 8 Å. In contrast, the eRMSD timeseries remain more consistent across all replicates, with values centered around 0.7, and only occasionally approaching 1 in Run 3. This suggests that the high RMSD values observed do not necessarily correspond to significant base pairing rearrangements, emphasizing the importance of eRMSD as a more tailored metric for capturing relative changes in nucleic acid structure. Furthermore, the PCA plot, color-coded by replica, shows that even relatively short simulations access a distinct conformational space in the three runs. As a comparison, Figure B demonstrates the effectiveness of HREX-MD in sampling the conformational landscape of MALAT1 more exhaustively, as evidenced by the corresponding eRMSD timeseries, reaching values up to 1.2. The improved sampling can be better appreciated through dimensionality reduction via PCA (Figure C), which shows that the HREX-MD not only fully covers the PCA space spanned by the three conventional MD runs but also explores previously uncharted regions. By analyzing and visually comparing the centroid structures, we observe that, compared with the conventional MD, the HREX simulations promote greater mobility of the poly(A) residues, particularly in the lower region where the triple helix wraps around the duplex and inserts to form the triple pairing. In this region, where the residues are less constrained, larger displacements are detected due to lost interactions involving particularly residues U56–U57, A60–A61, and U52–U53. This enhanced flexibility also reveals a new cluster of conformations sampled only in the HREX simulations, in which the lower region folds back on itself, bringing residues A60 and A61 into close proximity with U52 and U53, while displacing U56 and U57 away from the duplex. In contrast, in the central region where the triple pairing occurs, no major differences are observed compared with the conventional MD simulations, apart from minor local shifts.
3.
Conformational dynamics of MALAT1 in conventional and enhanced MD simulations. Timeseries and distribution of RMSD and eRMSD computed on the poly(A) residues using the crystallographic conformation as a reference in (A) three independent conventional MD simulations and (B) the HREX MD simulation. (C) PCA of the aggregated conformations sampled in the conventional and HREX MD simulations, color-coded consistently with panels (A, B) and with the crystallographic conformation indicated in purple. The circles indicate the representative conformations of the poly(A) tail from the most populated clusters of each simulation. On the right, the representative conformations are superimposed on the crystal one using the same color scheme.
Pocket Communication Analysis Reveals Potential Binding Sites
Sampling a broad conformational space of the MALAT1 triple helix does not necessarily imply the presence of binding sites capable of accommodating the small molecules in the reference library, nor does it guarantee that the observed conformational variability is reflected in pocket diversity. To address these issues, we carried out a systematic pocket identification and pocket communication analysis using the Pocketron algorithm. The analysis yielded a total of 37 pockets, encompassing those present in the initial structure and those dynamically formed during the simulation. A comprehensive picture of pocket communication within the MALAT1 structure is reported by the 37 × 37 correlation matrix derived from the Pocketron analysis (Figure S3), where each element quantifies the communication between corresponding pocket IDs (pIDs). These pockets can be visually represented as spheres (Figure A), with sphere radius proportional to pocket volume and color indicating pocket persistency (percentage of trajectory frames with nonzero volume).
4.
Generating a conformational ensemble of the promising binding regions. (A) Pocket communication analysis on the standard MD simulations (left) identified two major interconnected regions in the MALAT1 structure, to use as promising binding sites for subsequent docking (right). Yellow, orange, and red spheres correspond to pockets displaying increasing persistence, with sphere size reflecting the pocket volume, and edge thickness proportional to communication strength. (B) PCA of the combined MD simulations (left), and eRMSD and RMSD matrices of most dissimilar cluster centroids from the HREX MD simulations (right), performed separately for the two binding regions.
The color scheme corresponds to the following persistence ranges: yellow (<33%), orange (33–66%), and red (>66%). Notably, the most persistent pockets also exhibit larger volumes, suggesting their potential suitability for further analysis and targeting in drug discovery efforts. To further assess the suitability of these pockets for ligand binding, we employed Pocketron to track residue exchanges between pockets and quantify the extent of communication across different target regions. This involved calculating the average number of “merge” and “split” events, , which can be then visualized in a 3D graph (Figure A), with edge thickness proportional to the strength of communication between pockets. Overall, the analysis reveals two promising regions that harbor the largest and most persistent pockets. In the lower portion of the triple helix, a localized exchange of residues around the high-volume, high-persistence pocket 2 can be observed. In the adjacent region, another short communication pathway is present, including pockets 19 and 3, displaying a slightly lower persistence. Interestingly, a more long-range pathway originating from pocket 19 can also be observed, traversing pockets 3 and 5 to reach the second high-volume, high-persistence pocket 1, in the upper portion of the structure. Based on these insights, we next focused our analysis on the two regions comprising the most persistent pockets (Figure A and Table ). These regions, hereafter named Site 1 (including pIDs 2, 3, and 19, in the lower portion of the triple helix) and Site 2 (including pIDs 1 and 6, in the upper portion), represent promising areas for further exploration as ligand binding sites, and consistent with those previously explored in independent studies. In particular, our Site 1 corresponds to site 3 in François-Moutal et al. and comprises the site explored in the works by Rocca et al. , and the one bound by compound 16 in Abulwerdi et al., while our Site 2 corresponds to site 2 in François-Moutal and to the site bound by compound 5 in Abulwerdi et al. Notably, not only do the pockets engage in frequent residue exchanges with neighboring ones, but they also exhibit communication across the two regions, albeit to a lesser extent. This is interesting, as intersite communication hints at potential allosteric effects that could propagate through the structure and ultimately influence the stability of the entire triple helix.
1. Features of the Major Pockets Identified from the MD Simulations, Namely Pocket ID, Average Volume, Residues Comprised in the Pocket, Pocket Persistency, and Corresponding Binding Region.
| pID | average volume (Å3) | residues | persistency (%) | site |
|---|---|---|---|---|
| 2 | 297 | 3, 4, 5, 6, 52, 53, 54, 56, 57, 58, 59, 60, 61, 62, 63, 64 | 69 | 1 |
| 3 | 179 | 47, 49, 50, 66, 67 | 62 | |
| 19 | 294 | 49, 50, 51, 52 | 35 | |
| 1 | 455 | 11, 12, 13, 41, 42, 70, 71, 72 | 96 | 2 |
| 6 | 99 | 13, 14, 15, 73, 74 | 42 |
Characterization of a Site-Specific Conformational Ensemble
With the binding site definition established, we next characterized the conformational ensemble spanned by Site 1 and Site 2 in terms of the spatial arrangements of their constituent residues. In analogy with the previous analyses, we computed site-specific eRMSD and performed PCA (see Methods section) for both the conventional and HREX MD trajectories. Figure S4 illustrates the eRMSD time series for both sites, with values ranging from approximately 0.6 to 1.3. HREX-MD simulations expanded the conformational space accessible to the two regions beyond that sampled by the three independent conventional MD runs. This effect is particularly evident in the PCA analysis (Figure B, left panels). Comparing the two sites, Site 1 maintained a base pairing pattern closer to the reference compared to Site 2 (Figure S4), whose dynamics were mostly driven by rearrangements of the triple helix. Specifically, Site 1 spanned values between 0.7 and 1.1, whereas Site 2 explored higher eRMSD values, except for Run 3, which remained closer to the reference.
To identify representative conformations of the two sites, we performed cluster analysis. To this end, we applied the Quality Threshold clustering algorithm to the eRMSD matrix of the residues comprised in the two regions. The overall conformational diversity is captured by the pairwise eRMSD matrices (Figure B, center and right panels), with eRMSD values spanning the 0.7–1.4 range for Site 1 and 0.7–1.2 for Site 2, highlighting a greater conformational variability, and thus greater flexibility, for Site 1. Then, to maximize the structural variability of the representative structures, we only retained cluster centroids with eRMSD above the cutoff of 0.7. This resulted in two independent ensembles comprising 10 and 5 structures for Sites 1 and 2, respectively. Taken together, these curated ensembles captured the full range of conformational dynamics observed in the simulations and provide a minimal, yet representative, set of structures to support efficient downstream docking studies.
Complementary Binding Modes by Distinct Docking Programs
In a recent work, a library of 21 congeneric compounds was experimentally tested against the MALAT1 triple helix, providing a reference set of small molecules. The compounds share a diminazene scaffold, decorated with different substituents in the ortho, meta, and para positions (Figure B). Thus, we used the generated conformational ensembles for the two binding regions in the RNA to dock the 21 compounds, in an ensemble docking spirit. , We also included the conformations of Sites 1 and 2 of the crystal structure of the RNA, yielding 17 combined conformations. Using AutoDock GPU and rDock, we generated 250 poses per ligand with each program (500 poses per ligand overall). This process was repeated for all representative structures of the two sites, resulting in 8500 docked poses for each compound. We first assessed the extent of agreement between the poses generated by the two programs by performing dimensionality reduction across all the poses for all the RNA target conformations within each site.
Figure shows the results of PCA on the Cartesian coordinates for 5500 poses across 11 RNA conformations for Site 1 and 3000 poses across 6 RNA conformations for Site 2, obtained with the two software for compound P13, which has the largest number of heavy atoms and represents the most challenging case for accommodation within the target due to its marked steric hindrance. While the poses occupy overlapping regions in the PC space, rDock was able to generate binding modes not accessed by AutoDock GPU, particularly in Site1. This result underscores the value of using multiple docking programs to capture a broader spectrum of potential binding poses on the RNA target. For the smaller Site 2, the poses from both programs occupy similar regions in the PC space, indicating lower diversity between the generated poses. Overall, for both sites, the first PC distinguishes elongated binding modes along the major groove and more compact ones.
5.
Binding poses generated for ligand P13. (A) 2D structure of the diminazene analog P13. (B) PCA on the aggregated poses generated for ligand P13 at the two sites, using the AutoDock GPU (blue) and rDock (orange) docking software. Representative binding poses from extreme regions along PC1 are shown, with ligand carbon atoms colored consistently with the software that generated the pose, while RNA residues within 3 Å of ligands are displayed in licorice, colored white, and labeled.
Scoring Function Performance and Conformational Preference
To investigate how different scoring functions evaluate the same set of docked poses, we rescored all poses using multiple scoring functions, namely AutoDock, rDock, AutoDock Vina, AnnapuRNA, and SPRank. An overview of the results, covering all ligands and all combinations of docking programs and scoring functions for both Site 1 and Site 2, is reported in Figures S5–S8. The general trend that can be inferred is that poses tend to have more favorable scores when evaluated by the native scoring function of the program that generated them. For example, poses from AutoDock GPU scored overall better under the AutoDock scoring function, while poses from rDock performed better when evaluated by the native rDock scoring function (Figures S5 and S6). This effect is more pronounced for rDock, which shows greater variability across the scores. AutoDock Vina exhibited a similar pattern to AutoDock, although to a lesser extent, suggesting it identifies stable poses across both docking programs more uniformly. By contrast, AnnapuRNA showed lower consistency (Figures S7 and S8), with poses generated by both programs largely overlapping within the same score range. However, several poses produced by AutoDock GPU received highly unfavorable scores, likely due to steric clashes. Interestingly, this behavior was absent when scoring rDock-generated poses, suggesting higher compatibility between the RNA-specific docking program and this scoring function. Nevertheless, this trend could not be generalized to all considered RNA-specific scoring functions, since SPRank exhibited the opposite behavior, assigning more favorable scores to AutoDock GPU poses in both binding sites.
These patterns are further illustrated in Figure S9, which shows the top-scoring poses for each ligand across all docking program-scoring function combinations. AutoDock and rDock scoring functions consistently favored their respective docking outputs for both sites. AnnapuRNA favored rDock-generated poses, while Vina showed a site-dependent behavior. In particular, for Site 1, Vina generally preferred rDock poses, though a few AutoDock GPU poses were also highly ranked. Conversely, for Site 2, Vina leaned toward AutoDock GPU poses. Similarly, SPRank favored AutoDock GPU poses for Site 2, while it identified top-scoring poses from both AutoDock GPU and rDock for Site 1.
To assess the effect of target conformation on docking outcomes, we compared the top-ranked pose from each conformational ensemble for every ligand (Figure S10). Site 1 exhibited a broad distribution of best-scoring structures, with conformers 5, 7, 10, and the crystal structure frequently yielding the top scores across several scoring functions. AutoDock produced mixed results, favoring conformers 7, 10, and the crystal structure, whereas AutoDock Vina showed an even stronger preference for the crystal structure. rDock displayed a clear bias toward conformer 10, while ML-based scoring functions yielded more variable rankings: AnnapuRNA tended to favor conformers 5 and 10, and SPRank preferentially selected conformers 5 and 7. In contrast, Site 2 displayed more consistent behavior, with fewer conformers dominating (Figure S11). When considering the best score per ligand irrespective of the target conformation, both AutoDock and Vina most frequently favored conformers 1, 3, and the crystal structure, with Vina showing a marked preference for the latter even at Site 2. Conformer 4 consistently emerged as the preferred structure for rDock across all ligands, as well as for AnnapuRNA, though the latter occasionally favored conformer 1. SPRank generally preferred conformers 4 and 3, with minor deviations across individual ligands.
Evaluation of Docking Scores against Experimental Affinity Trends
Our goal for the ensemble docking exercise was to evaluate whether any combination of docking software and scoring function could reproduce the experimentally observed affinity trends among the ligands. In this respect, the overall picture is summarized in Figure S11, which reveals that no clear correlation with experimental binding affinities could be established, except for AutoDock to a limited extent. For the latter, higher-affinity ligands generally received more favorable scores than lower-affinity ones, suggesting that AutoDock scores may be suitable in distinguishing binders from nonbinders.
Interestingly, different from other scoring functions, AutoDock results displayed highly similar scoring patterns across the ligand set for both binding sites. This implies that the AutoDock scoring is largely independent of the specific binding site, highlighting a potential limitation in its ability to capture site-specific interactions for this case. The comparable scoring behavior across sites may stem from the chemical composition of the two binding regions, both featuring A-U base pairs with two G-C pairs in the middle. This similarity in composition can be particularly relevant for force-field-based scoring functions such as AutoDock, which heavily rely on atomic partial charges and van der Waals interactions.
Although AutoDock showed the most pronounced site-independent behavior, similar but less marked scoring patterns emerged for Vina and SPRank. Particularly interesting is the similarity observed within Site 1 for AnnapuRNA and SPRank. This comparable trend indicates that certain physicochemical properties of the ligands themselves may drive scoring outcomes across different methods. Overall, compared to AutoDock, AnnapuRNA and SPRank preserve some degree of site-specificity in their evaluations.
Interaction-Based Interpretation of Ligand Affinity
To rationalize the observed variation in experimental binding affinities based on the predicted binding modes, we focused on ligand poses generated by AutoDock GPU, as its scoring function was the only one to exhibit a discernible correlation with experimental trends, albeit with the limitations discussed above. In particular, we analyzed ligand interactions within Site 1 and Site 2 using the recently developed fingeRNAt. The tool inspects complexes between nucleic acids and ligands to detect and classify noncovalent interactions. All ligand poses across the conformational ensemble were evaluated to generate an average interaction profile. As shown in Figure S12, the average interaction profiles were broadly similar between the two sites due to their comparable chemical features. A notable exception involved lipophilic interactions: Site 2 showed a sharp peak at approximately 5 Å, while Site 1 exhibited a bimodal distribution with peaks around 4.5 Å and 5.5 Å. To detect dominant interaction patterns among ligand poses, we further applied PCA. Although differences between sites were subtle, a clear trend emerged along PC1. Specifically, poses derived from low-affinity ligands were predominantly located in the negative region of PC1, whereas those from high-affinity ligands clustered toward positive PC1 values, indicating a separation in interaction patterns consistent with binding strength. In particular, analysis of the PC loadings revealed that positive values along PC1 are mainly associated with hydrogen-bond (distance range 2.25–3.88 Å) and cation–anion (2.23–5.45 Å) interactions. By contrast, lipophilic contacts (2.75–3.95 Å) are primarily associated with poses located on the negative side of PC1. All other interactions and distance ranges display no significant contribution in the loading profile (Figure S13).
To improve the interpretability of results, we performed a cluster analysis on the ligand binding poses. To discriminate major configurations of the ligands, we focused on the common diminazene scaffold of the 21 compounds. For this stage, we only retained the most populated clusters, covering about 50% of poses for further analysis (Figure A). PCA of these selected clusters (Figure B) showed improved separation of ligands by affinity, and the variance explained by the first two principal components increased from about 14% to approximately 26%, underscoring the effectiveness of this refinement.
6.
Characterization of the aggregated binding poses generated for all ligands. (A) Cluster analysis and (B) PCA were performed separately on Sites 1 and 2 (top and bottom panels, respectively), using about 50% of the total number of poses. Cluster populations are highlighted by occurrence per ligand, and the same color code is used for the PCA projections, ranging from red (highest relative affinity) to violet (lowest relative affinity).
To further explore affinity-related binding patterns, we analyzed the distribution of PC values within each cluster. For each ligand, we calculated the average PC1 score per cluster (Figures S14–S17). Cluster 3 of Site 2 showed the strongest association with experimental binding affinities (R 2 = 0.6, Figure A), suggesting that poses in this cluster, which feature a distinctive and consistent binding mode of the central scaffold, are most effective at distinguishing strong from weak binders.
7.

Rationalizing ligand affinity by predicted binding mode. (A) Linear relationship between relative affinity and average PC1 score for the entire set of compounds. (B) Proposed binding modes for representative ligands, namely the highest-affinity one (P9), the unsubstituted scaffold (P0), and the lowest-affinity ligand (O5). The RNA backbone is shown in tubes, while RNA nucleotides directly interacting with the ligands are displayed in licorice and labeled. Colored dashed lines indicate RNA-ligand interactions, with green, red, and blue representing lipophilic, cation–anion, and hydrogen bond interactions, respectively.
A closer inspection of the binding poses within this cluster, focusing on ligands P9 (highest experimental affinity), P0 (unsubstituted scaffold), and O5 (lowest affinity), provides additional insight into the observed structure–activity relationships (Figure B). Starting with P0, which represents the common scaffold, the central benzene rings establish lipophilic contacts with the RNA bases, while the diminazene group forms hydrogen bonds with nearby residues. The amidine moiety at the opposite end further participates in hydrogen bonding with bases and ionic interactions with the phosphate backbone via its positively charged nitrogen atoms. The binding mode of P9 closely resembles that of P0 in terms of scaffold and amidine interactions. However, its nitrogen-rich, positively charged substituents enable additional hydrogen bonds and cation–anion contacts with the RNA phosphate groups. This expanded polar interaction network corresponds to the shift of P9 poses toward higher PC1 values in the PCA plot. By contrast, O5, which bears highly lipophilic substituents, predominantly forms lipophilic contacts with the aromatic bases, resulting in lower PC1 values. Overall, the experimental affinity trend can be rationalized based on the substituents’ charge and hydrogen-bonding capacity: nitrogen-rich, positively charged groups generally enhance affinity relative to P0, whereas purely aromatic or hydrophobic substituents reduce it, with the only exceptions being P5 and M7.
To further assess the reliability of the predicted binding poses under explicit-solvent conditions and realistic ion concentrations, we performed atomistic MD simulations for the three representative ligands. At both ionic strengths (52 and 150 mM KCl), the main binding modes remained largely stable throughout the simulations, with occasional unbinding events observed for the lowest affinity ligand. The resulting RMSD profiles qualitatively reflected the experimental affinity ranking, with higher-affinity ligands exhibiting greater binding-mode stability (Figure S18). Notably, simulations conducted at the experimental salt concentration (52 mM) showed slightly more consistent stability profiles, particularly for the highest-affinity ligand, while preserving the same qualitative behavior. Altogether, these results indicate that the docking-derived poses are robust within the explicit-solvent solvation environment and across different ionic strengths.
Conclusion
In this study, we applied a comprehensive structure-based drug discovery (SBDD) pipeline to investigate how a congeneric series of diminazene derivatives interacts with the MALAT1 triple helix, a prototypical structured lncRNA with known druggability. Our approach integrated enhanced molecular dynamics simulations, pocket communication analysis, ensemble docking, and rescoring with both classical and RNA-specific methods. The simulations revealed two persistent and ligand-accessible binding regions (Site 1 and Site 2), while the pocket communication analysis highlighted a degree of dynamic coupling between RNA subregions, hinting at potential for allosteric modulation. Ensemble docking across the generated RNA conformational ensemble expanded the diversity of accessible binding poses, particularly within Site 1, and uncovered complementary binding subpockets depending on the docking tool used. Despite limitations, AutoDock achieved modest discrimination between high- and low-affinity ligands. Crucially, our analysis of clustered interaction patterns using fingeRNAt and principal component analysis offered a clearer view of structure–affinity relationships. One specific subpopulation of poses (Site 2, Cluster 3) showed clear trends in relation to experimental affinity, suggesting this as a potentially relevant binding mode. Nevertheless, several limitations emerged that mirror challenges consistently reported in the field. The accuracy of scoring functions, both general-purpose and RNA-specific, including machine learning–based approaches, proved insufficient for reliably predicting binding affinities across structurally similar ligands, consistently with the general expectation for protein targets. In this respect, the known sensitivity of scoring functions to target conformations and the inherent difficulty in accounting for ligand and target flexibility become even more critical. Overall, our results reinforce prior observations that accurately generating and evaluating docking poses for RNA targets remains an open challenge, further complicated by the lack of high-resolution experimental binding affinity data for nucleic acid-ligand complexes. These data gaps hinder the development, training, and validation of more robust predictive models. In this context, it is important to note that the results obtained are inherently dependent on the docking software and scoring functions employed. We therefore expect that performance may change, and potentially improve, through the application of post-docking strategies such as MD-based relaxation of selected poses and more advanced rescoring approaches, whose systematic assessment was beyond the scope of the present work. In particular, MD-based refinement could be instrumental in explicitly accounting for solvent and ionic effects, as well as in elucidating the role of water and ions in stabilizing specific RNA conformations or ligand binding modes, which are challenging to capture at the docking stage. On the other hand, higher-level rescoring strategies may enable a more accurate estimation of binding affinities by partially mitigating the limitations associated with semiflexible docking approximations. , Notably, free-energy methods based on alchemical transformations have reached a mature and consolidated stage in the context of protein-targeted drug discovery, where they have proven effective in improving affinity ranking and guiding lead optimization. Encouragingly, recent benchmark studies suggest that these approaches can achieve overall promising performances also when applied to RNA targets. ,, While their routine application to RNA-focused drug discovery pipelines remains computationally demanding, continued methodological advances and increasing computational resources are expected to facilitate their broader adoption by the community. Finally, although the RNA model used here was structurally resolved and subjected to extensive conformational sampling, it may not fully capture the plasticity and environmental complexity present in cellular contexts. The absence of experimental confirmation for the predicted binding poses, combined with the limited chemical diversity of the ligand series, also limits the generalizability of our findings.
Altogether, our work underscores both the potential and the limitations of current SBDD strategies when applied to dynamic RNA targets. While advanced computational pipelines can extract useful structure–activity relationships and identify plausible binding modes, achieving predictive reliability remains out of reach. Progress in RNA-targeted drug discovery will require not only better scoring functions and more flexible modeling strategies but also broader access to high-quality experimental data and integrative validation frameworks.
Supplementary Material
Acknowledgments
We gratefully acknowledge the Data Science and Computation Facility and its Support Team at Fondazione Istituto Italiano di Tecnologia for computing time and support on the Franklin HPC system. We also thank the anonymous reviewers for their constructive suggestions, which substantially improved the quality of our work.
Glossary
Abbreviations
- RNA
ribonucleic acid
- MALAT1
metastasis-associated lung adenocarcinoma transcript 1
- MD
molecular dynamics
- HREX
Hamiltonian replica exchange
- SBDD
structure-based drug discovery
All the materials to perform the MD simulations and the molecular docking calculations on the triple helix structure of the MALAT1 RNA, as well as to reproduce the analyses presented in this work is freely available in Zenodo with accession code 10.5281/zenodo.18872932. Specifically, necessary input files to perform both plain (i.e., unbiased) and Hamiltonian replica-exchange (HREX) MD simulations are provided together with the associated output trajectories. RNA and ligand structures for docking via AutoDock-GPU and rDock are also provided, as well as output data to reproduce the analyses and figures reported in this work. The Notebook_malat1.ipynb Jupyter Notebook illustrates how to perform all the analyses. The notebook can also be straightforwardly consulted at https://github.com/CompMedChemLab/project_malat1. Input files to perform plain MD simulations of the docking poses for three representative ligands (P9, P0, and O5), together with the corresponding output trajectories, are provided as well.
The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.jcim.5c02955.
Comparison of AutoDock grid box and rDock cavity mapping for the two binding regions; atom index transformations applied for minimum RMSD calculations; interpocket communication matrix obtained from Pocketron analysis; eRMSD timeseries for the two sites from conventional and HREX MD simulations; docking score distributions for all generated poses on representative RNA conformations of both sites, grouped by scoring function and docking software; evaluation of ligand-target binding using multiple scoring functions, highlighting top-ranked poses across RNA ensembles for both sites; analysis of RNA-ligand interactions via FingeRNAt; loadings of PC1 for different interaction features at both sites; PCA of the molecular fingerprints of all ligand poses for the two sites and resulting correlations between PC1 scores and relative binding affinity; RMSD analysis of compounds P9, P0, and O5 from MD simulations (PDF)
The manuscript was written through the contributions of all authors. All authors have given approval to the final version of the manuscript.
This work was supported by NextGenerationEU–PNRR MUR, M4C2, Action 1.4 ‘Potenziamento strutture di ricerca e di campioni nazionali di R&S’. Funding was provided through the projects National Centre for HPC, Big Data and Quantum Computing (CUP: J33C22001180001, CN00000013–Spoke 8) and National Center for Gene Therapy and Drugs based on RNA Technology (CUP: J33C22001130001, CN00000041).
The authors declare no competing financial interest.
References
- Wilusz J. E., Sunwoo H., Spector D. L.. Long noncoding RNAs: functional surprises from the RNA world. Genes Dev. 2009;23:1494–1504. doi: 10.1101/gad.1800909. [DOI] [PMC free article] [PubMed] [Google Scholar]
- ENCODE Project Consortium. Project Consortium An integrated encyclopedia of DNA elements in the human genome. Nature. 2012;489:57–74. doi: 10.1038/nature11247. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Olsen G. J., Woese C. R.. Archaeal genomics: an overview. Cell. 1997;89:991–994. doi: 10.1016/S0092-8674(00)80284-6. [DOI] [PubMed] [Google Scholar]
- Mortimer S. A., Kidwell M. A., Doudna J. A.. Insights into RNA structure and function from genome-wide studies. Nat. Rev. Genet. 2014;15:469–479. doi: 10.1038/nrg3681. [DOI] [PubMed] [Google Scholar]
- Cech T. R., Steitz J. A.. The Noncoding RNA RevolutionTrashing Old Rules to Forge New Ones. Cell. 2014;157:77–94. doi: 10.1016/j.cell.2014.03.008. [DOI] [PubMed] [Google Scholar]
- Niu D.-K., Jiang L.. Can ENCODE tell us how much junk DNA we carry in our genome? Biochem. Biophys. Res. Commun. 2013;430:1340–1343. doi: 10.1016/j.bbrc.2012.12.074. [DOI] [PubMed] [Google Scholar]
- Prensner J. R., Chinnaiyan A. M.. The emergence of lncRNAs in cancer biology. Cancer Discovery. 2011;1:391–407. doi: 10.1158/2159-8290.CD-11-0209. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jandura A., Krause H. M.. The New RNA World: Growing Evidence for Long Noncoding RNA Functionality. Trends Genet. 2017;33:665–676. doi: 10.1016/j.tig.2017.08.002. [DOI] [PubMed] [Google Scholar]
- Rinn J. L., Chang H. Y.. Genome regulation by long noncoding RNAs. Annu. Rev. Biochem. 2012;81:145–166. doi: 10.1146/annurev-biochem-051410-092902. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yao R.-W., Wang Y., Chen L.. Cellular functions of long noncoding RNAs. Nat. Cell Biol. 2019;21:542–551. doi: 10.1038/s41556-019-0311-8. [DOI] [PubMed] [Google Scholar]
- Kung J. T. Y., Colognori D., Lee J. T.. Long noncoding RNAs: past, present, and future. Genetics. 2013;193:651–669. doi: 10.1534/genetics.112.146704. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li R., Zhu H., Luo Y.. Understanding the Functions of Long Non-Coding RNAs through Their Higher-Order Structures. Int. J. Mol. Sci. 2016;17:702. doi: 10.3390/ijms17050702. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bonilla S. L., Jones A. N., Incarnato D.. Structural and biophysical dissection of RNA conformational ensembles. Curr. Opin. Struct. Biol. 2024;88:102908. doi: 10.1016/j.sbi.2024.102908. [DOI] [PubMed] [Google Scholar]
- Martianov I., Ramadass A., Serra Barros A., Chow N., Akoulitchev A.. Repression of the human dihydrofolate reductase gene by a non-coding interfering transcript. Nature. 2007;445:666–670. doi: 10.1038/nature05519. [DOI] [PubMed] [Google Scholar]
- Tian D., Sun S., Lee J. T.. The long noncoding RNA, Jpx, is a molecular switch for X chromosome inactivation. Cell. 2010;143:390–403. doi: 10.1016/j.cell.2010.09.049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lee J. T., Bartolomei M. S.. X-inactivation, imprinting, and long noncoding RNAs in health and disease. Cell. 2013;152:1308–1323. doi: 10.1016/j.cell.2013.02.016. [DOI] [PubMed] [Google Scholar]
- Bartolomei M. S., Zemel S., Tilghman S. M.. Parental imprinting of the mouse H19 gene. Nature. 1991;351:153–155. doi: 10.1038/351153a0. [DOI] [PubMed] [Google Scholar]
- Sleutels F., Zwart R., Barlow D. P.. The non-coding Air RNA is required for silencing autosomal imprinted genes. Nature. 2002;415:810–813. doi: 10.1038/415810a. [DOI] [PubMed] [Google Scholar]
- Fatica A., Bozzoni I.. Long non-coding RNAs: new players in cell differentiation and development. Nat. Rev. Genet. 2014;15:7–21. doi: 10.1038/nrg3606. [DOI] [PubMed] [Google Scholar]
- Batista P. J., Chang H. Y.. Long noncoding RNAs: cellular address codes in development and disease. Cell. 2013;152:1298–1307. doi: 10.1016/j.cell.2013.02.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang X., Hamblin M. H., Yin K.. The long noncoding RNA Malat1: Its physiological and pathophysiological functions. RNA Biol. 2017;14:1705–1714. doi: 10.1080/15476286.2017.1358347. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Amodio N., Raimondi L., Juli G., Stamato M. A., Caracciolo D., Tagliaferri P., Tassone P.. MALAT1: a druggable long non-coding RNA for targeted anti-cancer approaches. J. Hematol. Oncol. 2018;11:63–64. doi: 10.1186/s13045-018-0606-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gutschner T., Hammerle M., Eissmann M., Hsu J., Kim Y., Hung G., Revenko A., Arun G., Stentrup M., Gross M., Zornig M., MacLeod A. R., Spector D. L., Diederichs S.. The noncoding RNA MALAT1 is a critical regulator of the metastasis phenotype of lung cancer cells. Cancer Res. 2013;73:1180–1189. doi: 10.1158/0008-5472.CAN-12-2850. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Arun G., Diermeier S., Akerman M., Chang K., Wilkinson J. E., Hearn S., Kim Y., MacLeod A. R., Krainer A. R., Norton L., Brogi E., Egeblad M., Spector D. L.. Differentiation of mammary tumors and reduction in metastasis upon Malat1 lncRNA loss. Genes Dev. 2016;30:34–51. doi: 10.1101/gad.270959.115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Arun G., Spector D. L.. MALAT1 long non-coding RNA and breast cancer. RNA Biol. 2019;16:860–863. doi: 10.1080/15476286.2019.1592072. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kim J., Piao H., Kim B., Yao F., Han Z., Wang Y., Xiao Z., Siverly A. N., Lawhon S. E., Ton B. N., Lee H., Zhou Z., Gan B., Nakagawa S., Ellis M. J., Liang H., Hung M., You M. J., Sun Y., Ma L.. Long noncoding RNA MALAT1 suppresses breast cancer metastasis. Nat. Genet. 2018;50:1705–1715. doi: 10.1038/s41588-018-0252-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kwok Z. H., Roche V., Chew X. H., Fadieieva A., Tay Y. A.. non-canonical tumor suppressive role for the long non-coding RNA MALAT1 in colon and breast cancers. Int. J. Cancer. 2018;143:668–678. doi: 10.1002/ijc.31386. [DOI] [PubMed] [Google Scholar]
- Ji P., Diederichs S., Wang W., Boing S., Metzger R., Schneider P. M., Tidow N., Brandt B., Buerger H., Bulk E., Thomas M., Berdel W. E., Serve H., Muller-Tidow C.. MALAT-1, a novel noncoding RNA, and thymosin beta4 predict metastasis and survival in early-stage non-small cell lung cancer. Oncogene. 2003;22:8031–8041. doi: 10.1038/sj.onc.1206928. [DOI] [PubMed] [Google Scholar]
- Li P., Zhang X., Wang H., Wang L., Liu T., Du L., Yang Y., Wang C.. MALAT1 Is Associated with Poor Response to Oxaliplatin-Based Chemotherapy in Colorectal Cancer Patients and Promotes Chemoresistance through EZH2. Mol. Cancer. Ther. 2017;16:739–751. doi: 10.1158/1535-7163.MCT-16-0591. [DOI] [PubMed] [Google Scholar]
- Cui Y., Li G., Zhang X., Dai F., Zhang R.. Increased MALAT1 expression contributes to cisplatin resistance in non-small cell lung cancer. Oncol. Lett. 2018;16:4821–4828. doi: 10.3892/ol.2018.9293. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hou J., Zhang G., Wang X., Wang Y., Wang K.. Functions and mechanisms of lncRNA MALAT1 in cancer chemotherapy resistance. Biomark Res. 2023;11:23–28. doi: 10.1186/s40364-023-00467-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li Z.-X., Zhu Q., Zhang H., Hu Y., Wang G., Zhu Y.. MALAT1: a potential biomarker in cancer. Cancer Manage. Res. 2018;10:6757–6768. doi: 10.2147/CMAR.S169406. [DOI] [PMC free article] [PubMed] [Google Scholar]
- François-Moutal L., Miranda V. G., Mollasalehi N., Gokhale V., Khanna M.. In Silico Targeting of the Long Noncoding RNA MALAT1. ACS Med. Chem. Lett. 2021;12:915–921. doi: 10.1021/acsmedchemlett.1c00060. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Abulwerdi F. A., Xu W., Ageeli A. A., Yonkunas M. J., Arun G., Nam H., Schneekloth J. S. J., Dayie T. K., Spector D., Baird N., Le Grice S. F. J.. Selective Small-Molecule Targeting of a Triple Helix Encoded by the Long Noncoding RNA, MALAT1. ACS Chem. Biol. 2019;14:223–235. doi: 10.1021/acschembio.8b00807. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Donlic A., Morgan B. S., Xu J. L., Liu A., Roble C. J., Hargrove A. E.. Discovery of Small Molecule Ligands for MALAT1 by Tuning an RNA-Binding Scaffold. Angew. Chem., Int. Ed. 2018;57:13242–13247. doi: 10.1002/anie.201808823. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pernak M., Fleurisson C., Delorme C., Moumne R., Benedetti E., Micouin L., Azoulay S., Foricher Y., Duca M.. Development of Comprehensive Screening and Assessment Assays for Small-Molecule Ligands of MALAT1 lncRNA. ACS Chem. Biol. 2025;20:1068–1076. doi: 10.1021/acschembio.5c00061. [DOI] [PubMed] [Google Scholar]
- Rocca R., Polera N., Juli G., Grillone K., Maruca A., Di Martino M. T., Artese A., Amato J., Pagano B., Randazzo A., Tagliaferri P., Tassone P., Alcaro S.. Hit identification of novel small molecules interfering with MALAT1 triplex by a structure-based virtual screening. Arch. Pharm. 2023;356:e2300134. doi: 10.1002/ardp.202300134. [DOI] [PubMed] [Google Scholar]
- Rocca R., Alcaro S., Artese A.. Structure-based virtual screening and molecular dynamics simulations of FDA-approved drugs targeting MALAT1. Med. Chem. Res. 2024;33:2095–2100. doi: 10.1007/s00044-024-03336-7. [DOI] [Google Scholar]
- Wilusz J. E., JnBaptiste C. K., Lu L. Y., Kuhn C., Joshua-Tor L., Sharp P. A.. A triple helix stabilizes the 3′ ends of long noncoding RNAs that lack poly(A) tails. Genes Dev. 2012;26:2392–2407. doi: 10.1101/gad.204438.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brown J. A., Bulkley D., Wang J., Valenstein M. L., Yario T. A., Steitz T. A., Steitz J. A.. Structural insights into the stabilization of MALAT1 noncoding RNA by a bipartite triple helix. Nat. Struct. Mol. Biol. 2014;21:633–640. doi: 10.1038/nsmb.2844. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ageeli A. A., McGovern-Gooch K. R., Kaminska M. M., Baird N. J.. Finely tuned conformational dynamics regulate the protective function of the lncRNA MALAT1 triple helix. Nucleic Acids Res. 2019;47:1468–1481. doi: 10.1093/nar/gky1171. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stelzer A. C., Frank A. T., Kratz J. D., Swanson M. D., Gonzalez-Hernandez M. J., Lee J., Andricioaei I., Markovitz D. M., Al-Hashimi H. M.. Discovery of selective bioactive small molecules by targeting an RNA dynamic ensemble. Nat. Chem. Biol. 2011;7:553–559. doi: 10.1038/nchembio.596. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Davis B., Afshar M., Varani G., Murchie A. I. H., Karn J., Lentzen G., Drysdale M., Bower J., Potter A. J., Starkey I. D., Swarbrick T., Aboul-ela F.. Rational design of inhibitors of HIV-1 TAR RNA through the stabilisation of electrostatic ″hot spots″. J. Mol. Biol. 2004;336:343–356. doi: 10.1016/j.jmb.2003.12.046. [DOI] [PubMed] [Google Scholar]
- Matsumoto S., Caliskan N., Rodnina M. V., Murata A., Nakatani K.. Small synthetic molecule-stabilized RNA pseudoknot as an activator for −1 ribosomal frameshifting. Nucleic Acids Res. 2018;46:8079–8089. doi: 10.1093/nar/gky689. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jorgensen W. L.. The many roles of computation in drug discovery. Science. 2004;303:1813–1818. doi: 10.1126/science.1096361. [DOI] [PubMed] [Google Scholar]
- Warner K. D., Hajdin C. E., Weeks K. M.. Principles for targeting RNA with drug-like small molecules. Nat. Rev. Drug Discovery. 2018;17:547–558. doi: 10.1038/nrd.2018.93. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bernetti M., Aguti R., Bosio S., Recanatini M., Masetti M., Cavalli A.. Computational drug discovery under RNA times. QRB Discovery. 2022;3:e22. doi: 10.1017/qrd.2022.20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Manigrasso J., Marcia M., De Vivo M.. Computer-aided design of RNA-targeted small molecules: A growing need in drug discovery. Chem. 2021;7:2965–2988. doi: 10.1016/j.chempr.2021.05.021. [DOI] [Google Scholar]
- Serra E., Ghidini A., Aguti R., Bernetti M., Decherchi S., Cavalli A.. Path-Based Nonequilibrium Binding Free Energy Estimation, from Protein-Ligand to RNA-Ligand Binding. J. Chem. Inf. Model. 2025;65:6057–6072. doi: 10.1021/acs.jcim.5c00452. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chopra N., Abramyan A. M., Si Chaib Z., Chen W., Xiong Y., Bochicchio A., Rinaldo D., Jerome S. V., Zhang Y.. Enabling In-Silico Hit Discovery Workflows Targeting RNA with Small Molecules. J. Chem. Inf. Model. 2025;65:7393–7398. doi: 10.1021/acs.jcim.5c00814. [DOI] [PubMed] [Google Scholar]
- Kallert E., Fischer T. R., Schneider S., Grimm M., Helm M., Kersten C.. Protein-Based Virtual Screening Tools Applied for RNA-Ligand Docking Identify New Binders of the preQ(1)-Riboswitch. J. Chem. Inf. Model. 2022;62:4134–4148. doi: 10.1021/acs.jcim.2c00751. [DOI] [PubMed] [Google Scholar]
- Jiang D., Du H., Zhao H., Deng Y., Wu Z., Wang J., Zeng Y., Zhang H., Wang X., Wang E., Hou T., Hsieh C.. Assessing the performance of MM/PBSA and MM/GBSA methods. 10. Prediction reliability of binding affinities and binding poses for RNA-ligand complexes. Phys. Chem. Chem. Phys. 2024;26:10323–10335. doi: 10.1039/D3CP04366E. [DOI] [PubMed] [Google Scholar]
- Hoang G. L., Rock M., Tancredi A., Magauer T., Mandelli D., Schulz J. B., Krauss S., Rossetti G., Tollinger M., Carloni P.. Refining Ligand Poses in RNA/Ligand Complexes of Pharmaceutical Relevance: A Perspective by QM/MM Simulations and NMR Measurements. J. Phys. Chem. Lett. 2025;16:1702–1708. doi: 10.1021/acs.jpclett.4c03456. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bosio S., Bernetti M., Rocchia W., Masetti M.. Similarities and Differences in Ligand Binding to Protein and RNA Targets: The Case of Riboflavin. J. Chem. Inf. Model. 2024;64:4570–4586. doi: 10.1021/acs.jcim.4c00420. [DOI] [PubMed] [Google Scholar]
- Agarwal R., T R. R., Smith J. C.. Comparative Assessment of Pose Prediction Accuracy in RNA-Ligand Docking. J. Chem. Inf. Model. 2023;63:7444–7452. doi: 10.1021/acs.jcim.3c01533. [DOI] [PubMed] [Google Scholar]
- Jiang D., Zhao H., Du H., Deng Y., Wu Z., Wang J., Zeng Y., Zhang H., Wang X., Wu J., Hsieh C., Hou T.. How Good Are Current Docking Programs at Nucleic Acid-Ligand Docking? A Comprehensive Evaluation. J. Chem. Theory Comput. 2023;19:5633–5647. doi: 10.1021/acs.jctc.3c00507. [DOI] [PubMed] [Google Scholar]
- Gunasinghe K. K. J., Ginjom I. R. H., San H. S., Rahman T., Wezen X. C.. Can Current Molecular Docking Methods Accurately Predict RNA Inhibitors? J. Chem. Inf. Model. 2024;64:5954–5963. doi: 10.1021/acs.jcim.4c00235. [DOI] [PubMed] [Google Scholar]
- Ansari N., Liu C., Hedin F., Henin J., Ponder J. W., Ren P., Piquemal J., Lagardere L., El Hage K.. Targeting RNA with small molecules using state-of-the-art methods provides highly predictive affinities of riboswitch inhibitors. Commun. Biol. 2025;8:1405. doi: 10.1038/s42003-025-08809-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zafferani M., Martyr J. G., Muralidharan D., Montalvan N. I., Cai Z., Hargrove A. E.. Multiassay Profiling of a Focused Small Molecule Library Reveals Predictive Bidirectional Modulation of the lncRNA MALAT1 Triplex Stability In Vitro. ACS Chem. Biol. 2022;17:2437–2447. doi: 10.1021/acschembio.2c00124. [DOI] [PMC free article] [PubMed] [Google Scholar]
- De Vivo M., Masetti M., Bottegoni G., Cavalli A.. Role of Molecular Dynamics and Related Methods in Drug Discovery. J. Med. Chem. 2016;59:4035–4061. doi: 10.1021/acs.jmedchem.5b01684. [DOI] [PubMed] [Google Scholar]
- Bussi G.. Hamiltonian replica exchange in GROMACS: a flexible implementation. Mol. Phys. 2014;112:379–384. doi: 10.1080/00268976.2013.824126. [DOI] [Google Scholar]
- Muscat S., Martino G., Manigrasso J., Marcia M., De Vivo M.. On the Power and Challenges of Atomistic Molecular Dynamics to Investigate RNA Molecules. J. Chem. Theory Comput. 2024;20:6992–7008. doi: 10.1021/acs.jctc.4c00773. [DOI] [PubMed] [Google Scholar]
- Bussi G., Bonomi M., Gkeka P., Sattler M., Al-Hashimi H. M., Auffinger P., Duca M., Foricher Y., Incarnato D., Jones A. N., Kirmizialtin S., Krepl M., Orozco M., Palermo G., Pasquali S., Salmon L., Schwalbe H., Westhof E., Zacharias M.. RNA dynamics from experimental and computational approaches. Structure. 2024;32:1281–1287. doi: 10.1016/j.str.2024.07.019. [DOI] [PubMed] [Google Scholar]
- Šponer J., Bussi G., Krepl M., Banas P., Bottaro S., Cunha R. A., Gil-Ley A., Pinamonti G., Poblete S., Jurecka P., Walter N. G., Otyepka M.. RNA Structural Dynamics As Captured by Molecular Simulations: A Comprehensive Overview. Chem. Rev. 2018;118:4177–4338. doi: 10.1021/acs.chemrev.7b00427. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Santos-Martins D., Solis-Vasquez L., Tillack A. F., Sanner M. F., Koch A., Forli S.. Accelerating AutoDock4 with GPUs and Gradient-Based Local Search. J. Chem. Theory Comput. 2021;17:1060–1073. doi: 10.1021/acs.jctc.0c01006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ruiz-Carmona S., Alvarez-Garcia D., Foloppe N., Garmendia-Doval A. B., Juhos S., Schmidtke P., Barril X., Hubbard R. E., Morley S. D.. rDock: a fast, versatile and open source program for docking ligands to proteins and nucleic acids. PLoS Comput. Biol. 2014;10:e1003571. doi: 10.1371/journal.pcbi.1003571. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Morris G. M., Huey R., Lindstrom W., Sanner M. F., Belew R. K., Goodsell D. S., Olson A. J.. AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility. J. Comput. Chem. 2009;30:2785–2791. doi: 10.1002/jcc.21256. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Eberhardt J., Santos-Martins D., Tillack A. F., Forli S.. AutoDock Vina 1.2.0: New Docking Methods, Expanded Force Field, and Python Bindings. J. Chem. Inf. Model. 2021;61:3891–3898. doi: 10.1021/acs.jcim.1c00203. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stefaniak F., Bujnicki J. M.. AnnapuRNA: A scoring function for predicting RNA-small molecule binding poses. PLoS Comput. Biol. 2021;17:e1008309. doi: 10.1371/journal.pcbi.1008309. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhou Y., Jiang Y., Chen S.. SPRank horizontal line A Knowledge-Based Scoring Function for RNA-Ligand Pose Prediction and Virtual Screening. J. Chem. Theory Comput. 2024;20(16):7358–7369. doi: 10.1021/acs.jctc.4c00681. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hornak V., Abel R., Okur A., Strockbine B., Roitberg A., Simmerling C.. Comparison of multiple Amber force fields and development of improved protein backbone parameters. Proteins. 2006;65:712–725. doi: 10.1002/prot.21123. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Perez A., Marchan I., Svozil D., Sponer J., Cheatham T. E. 3, Laughton C. A., Orozco M.. Refinement of the AMBER force field for nucleic acids: improving the description of alpha/gamma conformers. Biophys. J. 2007;92:3817–3829. doi: 10.1529/biophysj.106.097782. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zgarbová M., Otyepka M., Sponer J., Mladek A., Banas P., Cheatham T. E. 3, Jurecka P.. Refinement of the Cornell et al. Nucleic Acids Force Field Based on Reference Quantum Chemical Calculations of Glycosidic Torsion Profiles. J. Chem. Theory Comput. 2011;7:2886–2902. doi: 10.1021/ct200162x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Izadi S., Anandakrishnan R., Onufriev A. V.. Building Water Models: A Different Approach. J. Phys. Chem. Lett. 2014;5:3863–3871. doi: 10.1021/jz501780a. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Joung I. S., Cheatham T. E.. 3. Determination of alkali and halide monovalent ion parameters for use in explicitly solvated biomolecular simulations. J. Phys. Chem. B. 2008;112:9020–9041. doi: 10.1021/jp8001614. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hess B., Bekker H., Berendsen H. J. C., Fraaije J. G. E. M.. LINCS: A linear constraint solver for molecular simulations. J. Comput. Chem. 1997;18:1463–1472. doi: 10.1002/(SICI)1096-987X(199709)18:12<1463::AID-JCC4>3.0.CO;2-H. [DOI] [Google Scholar]
- Hess B.. P-LINCS: A Parallel Linear Constraint Solver for Molecular Simulation. J. Chem. Theory Comput. 2008;4:116–122. doi: 10.1021/ct700200b. [DOI] [PubMed] [Google Scholar]
- Darden T., York D., Pedersen L.. Particle mesh Ewald: An N·log(N) method for Ewald sums in large systems. J. Chem. Phys. 1993;98:10089–10092. doi: 10.1063/1.464397. [DOI] [Google Scholar]
- Essmann U., Perera L., Berkowitz M. L., Darden T., Lee H., Pedersen L. G.. A smooth particle mesh Ewald method. J. Chem. Phys. 1995;103:8577–8593. doi: 10.1063/1.470117. [DOI] [Google Scholar]
- Bussi G., Donadio D., Parrinello M.. Canonical sampling through velocity rescaling. J. Chem. Phys. 2007;126:014101. doi: 10.1063/1.2408420. [DOI] [PubMed] [Google Scholar]
- Bernetti M., Bussi G.. Pressure control using stochastic cell rescaling. J. Chem. Phys. 2020;153:114107. doi: 10.1063/5.0020514. [DOI] [PubMed] [Google Scholar]
- Bottaro S., Di Palma F., Bussi G.. The role of nucleobase interactions in RNA structure and dynamics. Nucleic Acids Res. 2014;42:13306–13314. doi: 10.1093/nar/gku972. [DOI] [PMC free article] [PubMed] [Google Scholar]
- La Sala G., Decherchi S., De Vivo M., Rocchia W.. Allosteric Communication Networks in Proteins Revealed through Pocket Crosstalk Analysis. ACS Cent. Sci. 2017;3:949–960. doi: 10.1021/acscentsci.7b00211. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Aguti R., Bernetti M., Bosio S., Decherchi S., Cavalli A.. On the allosteric puzzle and pocket crosstalk through computational means. J. Chem. Phys. 2023;158:165101. doi: 10.1063/5.0145364. [DOI] [PubMed] [Google Scholar]
- Decherchi S., Bottegoni G., Spitaleri A., Rocchia W., Cavalli A.. BiKi Life Sciences: A New Suite for Molecular Dynamics and Related Methods in Drug Discovery. J. Chem. Inf. Model. 2018;58:219–224. doi: 10.1021/acs.jcim.7b00680. [DOI] [PubMed] [Google Scholar]
- Decherchi S., Rocchia W.. A general and robust ray-casting-based algorithm for triangulating surfaces at the nanoscale. PLoS One. 2013;8:e59744. doi: 10.1371/journal.pone.0059744. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Abate C., Serra E., Rocchia W., Cavalli A., Decherchi S.. NanoShaperWeb: Molecular Surface and Pocket Detection Made Visual. J. Chem. Inf. Model. 2025;65:7341–7346. doi: 10.1021/acs.jcim.5c00821. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pedregosa F., Varoquaux G., Gramfort A., Michel V., Thirion B., Grisel O., Blondel M., Prettenhofer P., Weiss R., Dubourg V., Vanderplas J., Passos A., Cournapeau D., Brucher M., Perrot M., Duchesnay É.. Scikit-learn: Machine Learning in Python. J. Mach. Learn. Res. 2011;12:2825–2830. [Google Scholar]
- González-Alemán R., Hernandez-Castillo D., Caballero J., Montero-Cabrera L. A.. Quality Threshold Clustering of Molecular Dynamics: A Word of Caution. J. Chem. Inf. Model. 2020;60:467–472. doi: 10.1021/acs.jcim.9b00558. [DOI] [PubMed] [Google Scholar]
- Szulc N. A., Mackiewicz Z., Bujnicki J. M., Stefaniak F.. fingeRNAt-A novel tool for high-throughput analysis of nucleic acid-ligand interactions. PLoS Comput. Biol. 2022;18:e1009783. doi: 10.1371/journal.pcbi.1009783. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Allen W. J., Rizzo R. C.. Implementation of the Hungarian algorithm to account for ligand symmetry and similarity in structure-based design. J. Chem. Inf. Model. 2014;54:518–529. doi: 10.1021/ci400534h. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Costa F., Ocello R., Guardiani C., Giacomello A., Masetti M.. Integrated Approach Including Docking, MD Simulations, and Network Analysis Highlights the Action Mechanism of the Cardiac hERG Activator RPR260243. J. Chem. Inf. Model. 2023;63:4888–4899. doi: 10.1021/acs.jcim.3c00596. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lu C., Wu C., Ghoreishi D., Chen W., Wang L., Damm W., Ross G. A., Dahlgren M. K., Russell E., Von Bargen C. D., Abel R., Friesner R. A., Harder E. D.. OPLS4: Improving Force Field Accuracy on Challenging Regimes of Chemical Space. J. Chem. Theory Comput. 2021;17:4291–4300. doi: 10.1021/acs.jctc.1c00302. [DOI] [PubMed] [Google Scholar]
- Schrödinger Release 2025–02: MacroModel, Schrödinger; LLC: New York, NY, 2025. [Google Scholar]
- Nosé S.. A molecular dynamics method for simulations in the canonical ensemble. Mol. Phys. 1984;52:255–268. doi: 10.1080/00268978400101201. [DOI] [Google Scholar]
- Hoover W. G.. Canonical dynamics: Equilibrium phase-space distributions. Phys. Rev. A. 1985;31:1695. doi: 10.1103/PhysRevA.31.1695. [DOI] [PubMed] [Google Scholar]
- Martyna G. J., Tobias D. J., Klein M. L.. Constant pressure molecular dynamics algorithms. J. Chem. Phys. 1994;101:4177–4189. doi: 10.1063/1.467468. [DOI] [Google Scholar]
- Kräutler V., van Gunsteren W. F., Hünenberger P. H.. A fast SHAKE algorithm to solve distance constraint equations for small molecules in molecular dynamics simulations. J. Comput. Chem. 2001;22:501–508. doi: 10.1002/1096-987X(20010415)22:5<501::AID-JCC1021>3.0.CO;2-V. [DOI] [Google Scholar]
- Tuckerman M., Berne B. J., Martyna G. J.. Reversible multiple time scale molecular dynamics. J. Chem. Phys. 1992;97:1990–2001. doi: 10.1063/1.463137. [DOI] [Google Scholar]
- Schrödinger Release 2025–02: Desmond, Schrödinger; LLC: New York, NY, 2025. [Google Scholar]
- Amaro R. E., Baudry J., Chodera J., Demir Ö., McCammon J. A., Miao Y., Smith J. C.. Ensemble Docking in Drug Discovery. Biophys. J. 2018;114:2271–2278. doi: 10.1016/j.bpj.2018.02.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Harada R., Takano Y., Baba T., Shigeta Y.. Simple, yet powerful methodologies for conformational sampling of proteins. Phys. Chem. Chem. Phys. 2015;17:6155–6173. doi: 10.1039/C4CP05262E. [DOI] [PubMed] [Google Scholar]
- Mlýnský V., Bussi G.. Exploring RNA structure and dynamics through enhanced sampling simulations. Curr. Opin. Struct. Biol. 2018;49:63–71. doi: 10.1016/j.sbi.2018.01.004. [DOI] [PubMed] [Google Scholar]
- Majumdar S., Di Palma F., Spyrakis F., Decherchi S., Cavalli A.. Molecular Dynamics and Machine Learning Give Insights on the Flexibility-Activity Relationships in Tyrosine Kinome. J. Chem. Inf. Model. 2023;63:4814–4826. doi: 10.1021/acs.jcim.3c00738. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bernetti M., Bosio S., Bresciani V., Falchi F., Masetti M.. Probing allosteric communication with combined molecular dynamics simulations and network analysis. Curr. Opin. Struct. Biol. 2024;86:102820. doi: 10.1016/j.sbi.2024.102820. [DOI] [PubMed] [Google Scholar]
- Zhou Y., Chen S.. Harnessing Computational Approaches for RNA-Targeted Drug Discovery. RNA NanoMed. 2024;1:1–15. doi: 10.59566/ISRNN.2024.0101001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gioia D., Bertazzo M., Recanatini M., Masetti M., Cavalli A.. Dynamic Docking: A Paradigm Shift in Computational Drug Discovery. Molecules. 2017;22:2029. doi: 10.3390/molecules22112029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cournia, Z. ; Chipot, C. ; Roux, B. ; York, D. M. ; Sherman, W. . Free Energy Methods in Drug DiscoveryIntroduction, Free Energy Methods in Drug Discovery: Current State and Future Directions; American Chemical Society, 2021; Vol. 1397, pp 1–38. [Google Scholar]
- Rasouli A., Pickard F. C. IV, Sur S., Grossfield A., Bennett M. I.. Essential Considerations for Free Energy Calculations of RNA-Small Molecule Complexes: Lessons from the Theophylline-Binding RNA Aptamer. J. Chem. Inf. Model. 2025;65:223–239. doi: 10.1021/acs.jcim.4c01505. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Abramyan A. M., Bochicchio A., Wu C., Damm W., Langley D. R., Shivakumar D., Lupyan D., Wang L., Harder E., Oloo E. O.. Accurate Physics-Based Prediction of Binding Affinities of RNA- and DNA-Targeting Ligands. J. Chem. Inf. Model. 2025;65:1392–1403. doi: 10.1021/acs.jcim.4c01708. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All the materials to perform the MD simulations and the molecular docking calculations on the triple helix structure of the MALAT1 RNA, as well as to reproduce the analyses presented in this work is freely available in Zenodo with accession code 10.5281/zenodo.18872932. Specifically, necessary input files to perform both plain (i.e., unbiased) and Hamiltonian replica-exchange (HREX) MD simulations are provided together with the associated output trajectories. RNA and ligand structures for docking via AutoDock-GPU and rDock are also provided, as well as output data to reproduce the analyses and figures reported in this work. The Notebook_malat1.ipynb Jupyter Notebook illustrates how to perform all the analyses. The notebook can also be straightforwardly consulted at https://github.com/CompMedChemLab/project_malat1. Input files to perform plain MD simulations of the docking poses for three representative ligands (P9, P0, and O5), together with the corresponding output trajectories, are provided as well.






