Skip to main content

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

bioRxiv logoLink to bioRxiv
[Preprint]. 2024 Jan 27:2024.01.26.577511. [Version 1] doi: 10.1101/2024.01.26.577511

In silico λ-dynamics predicts protein binding specificities to modified RNAs

Murphy Angelo 1, Wen Zhang 1,2,*, Jonah Z Vilseck 1,3,*, Scott T Aoki 1,2,*
PMCID: PMC10849657  PMID: 38328125

Abstract

RNA modifications shape gene expression through a smorgasbord of chemical changes to canonical RNA bases. Although numbering in the hundreds, only a few RNA modifications are well characterized, in part due to the absence of methods to identify modification sites. Antibodies remain a common tool to identify modified RNA and infer modification sites through straightforward applications. However, specificity issues can result in off-target binding and confound conclusions. This work utilizes in silico λ-dynamics to efficiently estimate binding free energy differences of modification-targeting antibodies between a variety of naturally occurring RNA modifications. Crystal structures of inosine and N6-methyladenosine (m6A) targeting antibodies bound to their modified ribonucleosides were determined and served as structural starting points. λ-Dynamics was utilized to predict RNA modifications that permit or inhibit binding to these antibodies. In vitro RNA-antibody binding assays supported the accuracy of these in silico results. High agreement between experimental and computed binding propensities demonstrated that λ-dynamics can serve as a predictive screen for antibody specificity against libraries of RNA modifications. More importantly, this strategy is an innovative way to elucidate how hundreds of known RNA modifications interact with biological molecules without the limitations imposed by in vitro or in vivo methodologies.

Keywords: RNA modifications, RNA-protein interactions, in silico modeling, molecular dynamics, lambda-dynamics

Introduction

Biology has an RNA complexity problem. Cells must make sense of a vast sea of RNAs that function as protein code, regulatory molecules, enzymes, scaffolds, and other biological tools. Furthermore, the 4 canonical RNA bases can be enzymatically modified into new chemical structures that change their ability to base pair, form secondary structure, and interact with RNA-binding proteins (1). These chemical additions can be as small as a single methyl group or as large as a sugar moiety. Over 140 RNA modifications have been identified across all three kingdoms of life (1). RNA modifications are prevalent in biology and function as an epigenetic code to regulate development (2), respond to infectious diseases (3), and are involved in cancer progression (4). Their combinatorial complexity highlights how individual or collections of RNA modifications may alter an RNA’s fate or function. A current challenge is the development of methods to identify all modification sites to decipher the roles of these RNA modifications in biology.

A variety of methods can identify a few RNA modification sites. For example, chemical treatment can identify m6A (e.g. GLORI (5)) and pseudouridine (e.g. pseudo-seq (6)) by taking advantage of chemistries that affect a modified base differently than an unmodified base. Direct RNA nanopore sequencing can also identify specific modifications like m6A (717) through differences in electrical current perturbations as the modified RNA transverses the sequencing pore. Both strategies, however, require tailor-made approaches to accommodate each RNA modification’s unique biochemical characteristics. Furthermore, without employing enrichment strategies, low abundance modifications remain difficult to detect. Adaptable methods are needed to elucidate the full breadth of modified RNAs found in living organisms.

A common, versatile identification strategy uses antibodies to immunoprecipitate modified RNAs (18). These enriched RNAs are then sequenced to identify RNA targets and infer modification sites. Immunoprecipitation and sequencing methods are well established with straightforward workflows, and enrichment permits identification of less prevalent modification sites. Indeed, much of the work determining the modification sites of N6-methyladenosine (m6A, e.g. (19,20)), N1-methyladenosine (m1A, e.g. (2124)), 5-methylcytosine (m5C, e.g. (25,26)), and others have relied on antibodies.

Antibodies can become de novo RNA-binding proteins through adaptive immunity. Immunoglobulin G (IgG) antibodies are comprised of two heavy and two light polypeptide chains that assemble a pair of six hypervariable complementary-determining region (CDR) loops at their antigen recognition interface (2729). Antibodies recognize a variety of antigens through CDRs that vary in amino acid length and composition. How antibodies recognize proteins is well studied (30), but how antibodies recognize modified RNAs is less clear. A polyinosine-antibody crystal structure was determined bound to various nucleotides (31). Closer inspection of the structure reveals a large, suitably configured pocket adjacent to the bound nucleotide (Fig S1), suggesting that the antibody may have specificity toward nucleic acid, not single bases. Regardless, the lack of antibody structures targeting other modified bases limits insights into how antibodies recognize RNA modifications.

The success of using antibodies for RNA modification site identification depends on the quality of the antibody (32,33). Antibodies with low specificity have assigned erroneous biochemical functions to RNA modifications. For example, published studies reached differing conclusions regarding the mechanism of the m1A modification. Two studies found m1A prevalent in the 5’ ends of mRNA (23,24), suggesting that the modification enhances translation (24), while contrasting studies reported it as rare in mRNA (21,22). In the former studies, it was later discovered that the antibody used for m1A RNA enrichment also had affinity towards 7-methylguanosine (m7G, (21)), an abundant mRNA 5’ cap modification crucial for cap-dependent translation (34). These false positive site identifications led to incorrect conclusions regarding m1A function. Because the identification of RNA targets and their specific modification sites gives insight into their biological and biochemical mechanisms, the development of antibodies with high affinity and high specificity is a key to successfully discovering the biological roles of the many RNA modifications. However, given the large number of RNA modifications and the subtle chemical differences between them, off-targets of RNA modification antibodies will be a continuous, inevitable problem. The current state of RNA chemistry prevents in vitro testing of all known RNA modifications, and thus new methods are required to predict the specificity of RNA modification-targeting antibodies.

Computational approaches have the potential to screen antibodies for their predicted ability to bind modified RNA bases. Physics-based, alchemical free energy calculations are an accurate, rigorous, and cost-effective means to quantify chemical probe interactions with protein structures in silico (3537). These calculations compute relative binding free energies (ΔΔGbind) between two or more molecules by transforming between alternate chemical groups in silico. Because they are at the heart of molecular dynamics simulations, these calculations also provide dynamic structural characterization of macromolecular complexes. With these methods, changes in RNA-protein binding affinities can be monitored as a function of the chemical differences between modified or unmodified RNAs. Hence, modeling different RNA modifications can predict binding selectivity.

λ-Dynamics is an efficient alchemical free energy method that can accurately and rapidly screen hundreds of modified RNAs bound to a protein host. This method holds a key advantage over other in silico strategies in that it can model multiple chemical variations simultaneously within a single simulation (38,39), making it more efficient and higher throughput. In a λ-dynamics calculation, a variable λ parameter allows chemical groups to dynamically scale between “on” and “off” states during a molecular dynamics simulation. Akin to selection in an in vitro competitive binding assay, this dynamic behavior effectively differentiates the varying affinities of target molecules, providing insights into their binding characteristics. Thus, λ-dynamics can rapidly select for the best binders from a library of chemical modifications (40,41). To date, λ-dynamics has accurately measured the relative binding free energy differences of large chemical inhibitor libraries targeting the HIV reverse transcriptase (4244) and β-secretase 1 (45,46), of mutations at various protein-protein interfaces (47,48), as well as of the folding free energies of mutant T4 lysozyme proteins (49). Notably, chemical probe binding studies with λ-dynamics demonstrated 8- to 30-fold efficiency gains over other conventional free energy calculations (42,45). This equates to months of computational time savings.

The following investigation tested whether λ-dynamics could accurately predict how RNA modifications affected RNA-protein interactions. This work determined the structures of two modified RNA-targeting antibodies bound to inosine and m6A, revealing that these antibodies recognize their target ligands similar to other modified RNA binding proteins. The structural models permitted the use of λ-dynamics to perform a computational screen of RNA base modifications bound to inosine and m6A antibodies to predict their binding specificities. These in silico binding predictions were verified with in vitro binding assays. Collectively, the results demonstrate how structural biology can be combined with λ-dynamics to predict modified RNA-protein interactions without the limitations imposed by biochemical experiment methodologies.

Results

The goal was to test whether λ-dynamics could be used as an in silico strategy to accurately probe modified RNA-protein interactions. Antibodies can serve as modified RNA-binding proteins. They are commonly used as reagents to enrich for modified RNAs and determine modification sites in biology (18). Currently, RNA modification targeting antibodies are relatively few in number, have modest affinity toward their targets (32,33), and can have specificity issues that confound biological conclusions (21). An antibody specificity screening method for known RNA modifications will enable a comprehensive view of the RNAs enriched and provide insight into how to improve antibody design.

High-resolution structures of antibodies targeting single modified RNA bases have not been published. An inosine-targeting antibody structure is available (31), but an open pocket adjacent to the nucleoside binding site potentiates the chance of the antibody binding to a dinucleotide substrate (Fig S1). To avoid this confounder, additional antibody structures bound to modified ribonucleosides were pursued. The protein sequences of available antibodies were predicted by mass spectrometry and sequencing (see Methods). Recombinant antibodies were produced in cell culture and used to generate antibody fragments (Fabs). Fabs were screened in crystallizing conditions, and crystals were soaked or grown with target nucleoside ligands (see Methods). These efforts lead to the determination of three modified RNA-targeting antibody crystal structures (Table S1): one targeting inosine at 1.94 Å and two targeting m6A at 2.02 Å and 3.06 Å.

IgG antibodies are composed of heavy and light protein chains, forming 6 variable loops on each arm, or antibody binding fragment (Fab), that typically dictate binding affinity to its target substrate (2729). In the 1.94 Å inosine and 3.06 Å m6A antibody structures, a large, discontinous density was observed at these variable loop regions where a modified purine target nucleoside could be adequately modeled (Fig 1A,B). Rather than binding to loops on the periphery, the modified nucleosides bound to a central cavity created by the 6 variable loops between the heavy and light chains (Fig 1A,B). Binding of small molecules at this location has been observed in other antibody structures (50). In the third 2.02 Å m6A targeting antibody structure, density in this binding pocket was not observed (Fig S2). Thus, two structures yielded high-resolution models of how purine modified bases bind to antibodies.

Fig 1.

Fig 1.

Binding of inosine and m6A targeting antibodies mimics RNA-binding proteins. (A) Crystal structure of the inosine targeting antibody fragment to 1.94 Å (PDB ID: 8SIP). Overview (left) and magnified (right) rendition of the antibody bound to inosine nucleoside. 1FoFc density without ligand in green mesh. Heavy chain (H) in dark blue, light chain (L) in light blue, and inosine in orange. Interacting amino acids include heavy chain residues Asn35, Trp40, Trp50, Gly99, Tyr104, and Leu106 and light chain residues Ser97 and Trp101. Those discussed in the main text are labeled. (B) Crystal structure of the m6A-targeting antibody fragment to 3.06 Å (PDB ID: 8VEV). Labeling same as in (A), except m6A nucleoside in orange. Interacting amino acids include heavy chain residues Trp33, Asn35, Glu50, Tyr61, Trp101, and Phe105 and light chain residues Tyr34, Trp93, and Leu98. Those discussed in the main text are labeled. (C) Structure of a YTH bound to m6A (YTHF1, PDB ID: 4RCJ). Residues in dark blue. m6A in orange. Interacting amino acids include Tyr397, Asp401, Trp411, Cys412, Asn441, Trp465, Lys469, Trp470, and Asp507. Those discussed in the main text are labeled.

Small molecule antibodies are selected through adaptive immunity to target a particular hapten (51). Thus, antibodies become RNA-binding proteins through adaptation and can inform on how biology designs a protein to bind an RNA modification de novo. Modified RNA-binding proteins provide exemplary examples of potential binding architecture. For example, the YTH domains bind to m6A with high specificity (52). This domain arranges its side chains to 1) create a specificity pocket for the parent base and modification, 2) bind the nucleobase through π-π stacking, and 3) line the pocket periphery with positively charged side chains to accommodate the negatively charged RNA phosphate backbone (Fig 1C). Antibodies targeting modified RNAs might also mimic this strategy. Alternatively, they might use a collection of novel binding strategies, each selected randomly through adaptive immune selection.

The inosine and m6A antibody structures both bound to their modified ribonucleoside ligands similarly to other RNA-binding proteins. To specify the modified base, the inosine targeting antibody used an asparagine to select for the O6 oxygen and N1 nitrogen of the inosine nucleobase (Fig 1A). The m6A-targeting antibody created a hydrophobic pocket to accommodate the methyl group (Fig 1B) and a glutamate side chain to hydrogen bond with the adenosine nucleobase N1 nitrogen (Fig 1B). Interestingly, glutamate side chain coordination is also observed in some YTH domains that bind m6A (Fig 1C, (53)). Both antibodies used paired tryptophans to create a slot for favorable π-π stacking and a tyrosine for ribose ring interactions (Fig 1A,B). However, these tryptophans and tyrosine came from differing variable loops in each antibody and are organized differently in their central antibody binding pocket (Fig 1A,B). The difference in binding pocket organization potentially reflects how these two antibodies were isolated from different animals with separate adaptive immune responses. In sum, the antibody-ligand structures revealed that these two antibodies use similar strategies to bind their modified base targets that may permit differentiation between unmodified base counterparts.

The quality of the structures enabled predicting in silico how these antibodies may interact with other RNA nucleobases. There are over 140 different RNA modifications identified in biology, many of which are not available as commercial reagents or lack protocols to synthesize in vitro. A library of 44 modified and 4 unmodified nucleobases was selected based on published thermodynamic parameters for RNA modifications in the CHARMM force field (54) and their commercial availability for experimental testing in vitro (Fig S3). λ-Dynamics was used to assess differences in relative binding free energies between inosine or m6A versus each library nucleobase when bound to their respective antibodies (see Methods, Fig 2, and Fig S4). During the simulations, some of the modified nucleosides unbound from the antibody (Fig S5), presumably due to having poor binding affinity or steric clashes, and were removed from further study (Table S2 and S3). Similar to previously performed studies (4244,4749), relative binding free energies (ΔΔGbind) were calculated for the nucleosides that remained antibody bound. Examples of the results obtained are shown (Fig 3 and 4) with full results reported in the Supplement (Table S2 and S3). A positive ΔΔGbind value indicates poorer binding and a negative value suggests enhanced binding when compared to the native inosine or m6A base. As a control, inosine and m6A modified bases were perturbed into an identical but distinct copy of themselves within their respective antibody complexes. These free energy differences were near zero (Fig 3A and 4), as expected of a base replacing itself, and indicated that the λ-dynamics calculations were working correctly.

Fig 2.

Fig 2.

In silico λ-dynamics workflow for screening potential binders to the inosine and m6A antibodies. A three-step process was used to filter candidates from a library of 48 ribonucleosides for in vitro antibody binding validation. (1) For each mutant library candidate, a λ-dynamics simulation was conducted to calculate a relative binding free energy between the mutant and its respective native ribonucleoside ligand (inosine or m6A). (2) All ribonucleosides that unbound during these simulations were deemed unfavorable and excluded from further processing. (3) Mutant bases with relative binding free energies deemed favorable (ΔΔGbind ≤ −0.7 kcal/mol) were selected for in vitro validation with binding assays based on commercial availability.

Fig 3.

Fig 3.

Highlighted binding trends from the inosine antibody λ-dynamics screen. (A) λ-Dynamics predicts loss of binding (red) for cytidine (C), no change in binding (grey) for inosine and adenosine (A), and enhancement of binding (green) for uridine (U). Estimated relative binding free energies (ΔΔGbind) and uncertainties (±σ) are listed. (B) The predicted inosine antibody promiscuity for U generalizes to many of its derivatives. Estimated relative binding free energies and uncertainties are listed in green. The thickness of each equilibrium arrow is proportional to the favorability of the corresponding transition. Seven other uridine derivatives (U×7) showed enhanced binding but are not depicted. See Table S2 for a complete list.

Fig 4.

Fig 4.

Highlighted binding trends from the m6A antibody λ-dynamics screen. λ-Dynamics predicts loss of binding (red) for cytidine (C), no change in binding (grey) for m6A and adenosine (A), and enhancement of binding (green) for m62A. Estimated relative binding free energies (ΔΔGbind) and uncertainties (±σ) are listed. See Table S3 for a complete list.

λ-Dynamics predicted differing specificities and off-targets for these two antibodies. The inosine antibody had many predicted off-targets that included uridine (Fig 3A) and uridine modifications (Fig 3B). Inspection of the models revealed that hydrogen bonding of the asparagine side chain to the O6 oxygen in inosine could be satisfied by the O4 oxygen in uridine (Fig S6A). Many uridine modifications had an O4 oxygen available for hydrogen bonding, potentially explaining why related molecules all had higher predicted binding affinities in the λ-dynamics calculations. In contrast, cytidine and adenosine were not predicted to enhance binding (Fig 3A and Table S2). Both nucleosides have nitrogens at similar positions, potentially making the pocket less favorable for these bases to interact by removing hydrogen bonding. Finally, a further inspection of the structures revealed a larger binding pocket in the inosine versus the m6A antibody binding pocket (Fig 1A,B). This larger pocket may accommodate a greater variety of shapes and sizes, increasing the propensity for off-targets. Thus, λ-dynamics predicted the inosine antibody to have many off-targets in this modestly sized ribonucleoside library.

In contrast to the inosine antibody, λ-dynamics predicted that the m6A antibody had relatively few off-targets (Table S3). As discussed previously, the binding pocket was smaller (Fig 1A,B) and required a N1 nitrogen on the nucleobase for hydrogen bonding (Fig 1B). Along with m6A, a few adenosine bases were predicted to bind (Fig 4 and Table S3), including adenosine and N6,N6-dimethyladenosine (m62A), a dimethyl modification at the N6 nitrogen position (Fig S6B,C). Closer inspection of the structure revealed that the hydrophobic pocket had enough space to accommodate a second methyl group (Fig S6C). Similar to the inosine antibody, cytidine was predicted to be a poor binder with a high, positive free energy difference (Fig 4). In summary, the m6A antibody had fewer off-targets compared to the inosine antibody but still was predicted to bind to nucleosides other than m6A.

While λ-dynamics has demonstrated accuracy with modeling protein-protein and protein-small molecule binding interactions (4248), it has so far been untested with respect to reproducing protein-RNA interactions. To evaluate our in silico predictions in vitro, Enzyme-Linked Immunosorbent Assays (ELISAs) were used to probe the binding of inosine and m6A antibodies to target and off-target RNA bases. RNAs were synthesized through solid-state chemistry (see Methods) to create biotin-labeled oligomers of inosine, adenosine, uridine, and cytidine to test the inosine antibody binding. Cytidine oligos with single base changes of adenosine, m6A, and m62A were synthesized to test the m6A antibody binding. The biotin-labeled oligos were bound to wells coated with a streptavidin derivative. Wells without oligo served as a background control. After oligo incubation and washing, the inosine and m6A antibodies were incubated at varying concentrations. Bound inosine and m6A antibodies were detected with a secondary horseradish peroxidase (HRP) conjugated antibody that targeted mouse IgG. No inosine or m6A antibody wells were used to control for secondary antibody background. The presence of secondary antibody was detected with an HRP chromogenic substrate, with the absorbance measured as an indirect reading for inosine or m6A antibody binding.

The inosine and m6A antibody in vitro binding results agreed with the λ-dynamics predictions (Fig 5). The inosine antibody bound to inosine and uridine oligos (Fig 5A), although inosine binding was observed at much lower antibody concentrations. In contrast, the inosine antibody did not bind to adenosine or cytosine oligos (Fig 5A). Likewise, the m6A antibody bound to m6A containing cytidine oligos but bound poorly to cytidine only (Fig 5B), as expected. As λ-dynamics predicted, the m6A antibody bound to an m62A-containing oligo (Fig 5B). The antibody also bound to an adenosine-containing oligo (Fig 5B) but to a lesser degree than m6A. Regardless, the in vitro binding results matched the predictions of λ-dynamics, supporting the accuracy of this in silico method to identify modified RNA-protein interactions.

Fig 5.

Fig 5.

ELISA binding assay results confirmed λ-dynamics predictions of antibody off-targets. (A) Absorbance units reported by ELISA indicating the binding affinity of inosine antibody to inosine (I), uridine (U), adenosine (A), and cytidine (C) over varying protein concentrations. Double asterisks (**) denote a p-value < 0.01. Inosine serves as a positive control. In line with λ-dynamics predictions, U identified as an off-target while A and C demonstrated negligible binding. (B) Absorbance units reported by ELISA indicating the binding affinity of m6A antibody to m6A, m62A, adenosine (A), and cytidine (C) at varying protein concentrations. Double asterisks (**) denote a p-value < 0.01. m6A serves as a positive control. Again, matching λ-dynamics predictions, m62A and A are identified as off-targets while C demonstrated negligible binding. All p-values calculated are available in Fig S6D,E.

Discussion

With hundreds of RNA modifications identified in biology, new methods are required to determine the sites of each of these chemical changes to determine their functions. Antibodies targeting RNA modifications are a versatile tool to enrich and determine modification sites, but their reliability hinges upon their accuracy. To this end, inosine and m6A antibody structures bound to their modified ribonucleoside targets were determined to high resolution. These structures then facilitated the use of λ-dynamics, an in silico free energy calculation, to estimate how the antibodies may bind other unmodified and modified RNA bases, with worsened, neutral, or enhanced binding affinities. λ-Dynamics predictions matched well with in vitro binding assay results, supporting the accuracy of using this computational approach to measure untested RNA-protein interactions. In its simplest application and as performed in this work, the method can be used to determine off-target RNA base interactions with antibodies used for modified RNA enrichment and site identification. But the strategy holds greater promise to inject insight into the biochemical mechanisms of RNA modifications by determining how any modified RNA, commercially available for biochemical investigation or not, may interact with proteins and other molecules (Fig 6).

Fig 6.

Fig 6.

Proposed strategy to predict how proteins bind canonical and modified RNA. (1) Starting with an RNA-protein structural model, (2) an in silico λ-dynamics screen can be conducted to assess the favorability of the protein’s interactions with a complete range of RNA bases. (3) This approach provides an economical and effective means to explore the full extent of a protein’s RNA-binding capabilities that can be tested further in vitro.

The determined antibody structures targeting modified purines revealed identical binding strategies toward their respective modified RNA bases, reminiscent of modified RNA-binding proteins. Each antibody had a specificity pocket and used tryptophans to create a slot for π- π stacking with the nucleobase. Only one of these tryptophans had a similar sequence position between the two antibodies. The other came from a separate loop, leading to RNA binding in completely different orientations. These antibodies were created through adaptive immunity, supporting the notion that mimicking modified base RNA-binding proteins by creating a specificity pocket and using π-π stacking for nucleobase interactions is a competent way to bind a modified nucleobase. Thus, convergent adaption may have led both purine-targeting antibodies to follow a similar binding strategy as modified RNA-binding proteins. The results lead to the speculation that all modified RNA-targeting antibodies bind to their targets similarly. Examples of pyrimidine-targeting antibody structures will be necessary to further probe this concept.

Antibodies are heavily used reagents to enrich modified RNA for sequencing and site identification. This strategy has been used to identify sites of many different RNA modifications to deduce their biological and biochemical mechanisms. Regardless of new methodologies to determine RNA modification sites, antibodies will continue to be used to enrich for less abundant modifications. Thus, antibody binding to off-target RNA modifications will continue to be a problem in research. The chemical similarities between many RNA modifications make antibody specificity an expected complication. This work demonstrates how λ-dynamics is a viable in silico tool to determine potential RNA off-targets of antibodies. The method does not require the availability of modified nucleosides, RNA oligomers, or other in vitro reagents that are currently unavailable. With an accurate, high-resolution structural model, λ-dynamics can test the full breadth of RNA modifications in biology. Additionally, λ-dynamics has previously investigated the effects of protein mutations on binding (47,48). The method can thus be used to rationally design antibodies for improved binding specificity and affinity.

This is the first study to use λ-dynamics to probe nucleic acid-protein interactions via nucleic base perturbations. Other in silico molecular modeling and free energy methods have been employed to study nucleic acid-protein interactions, including predictions of DNA binding to proteins (55) and probing mutations in DNA-protein complexes (56,57). λ-Dynamics has several key attributes that make it advantageous over other in silico calculations. First, λ-dynamics enables multiple modified bases to be calculated within a single simulation. This can drastically improve efficiency over other free energy methods that can only investigate a single perturbation at a time, therefore requiring many simulations to study multiple perturbations. Second, λ-dynamics can simultaneously sample modifications at multiple sites within a chemical system. This enables base changes at different RNA sequence positions to yield free energy results for multiple modification combinations. There are limitations to λ-dynamics as well. Many of the calculated free energy differences, such as with uridine bound to the inosine antibody (Fig 3A) or with m62A bound to the m6A antibody (Fig 4), predicted greater enhancement of binding than what was observed in vitro (Fig 5). The starting models for the λ-dynamics calculations were based on the crystal structures of antibody fragments bound to nucleosides, but binding was tested in vitro with RNA oligos. This omission of the RNA phosphate backbone from the model, as well as the potential for sporadic self-associations or secondary structures in the unbound oligo, may have impacted the true binding values. Additional work probing RNA-protein interactions with λ-dynamics will undoubtedly improve the simulations. Moreover, the refinement of molecular dynamics force fields, particularly with respect to nucleic acids, is a bustling area of research, and future advancements promise to further enhance the accuracy of these classical simulations.

While hundreds of RNA modifications have been identified, only a few dozen are available for experimental testing in vitro. Novel methods must be developed to examine how all modifications affect molecular interactions to decipher their biological mechanisms. This study establishes a workflow for using λ-dynamics to probe nucleic acid-protein interactions in silico (Fig 6). The combinatorial efficiency of λ-dynamics enables rapid in silico examination of currently known and newly discovered RNA modifications. With high-resolution structures of nucleic acid-protein complexes, modified and unmodified nucleoside bases can be probed to explore how chemical changes to RNA affect protein binding interactions. This computational approach can be used for DNA or RNA and is not limited by available chemistry. The work presented demonstrates how this strategy can probe for the specificity of antibodies. Future work can utilize this method to test how hundreds of RNA modifications affect their molecular interactions with any RNA-binding protein or other nucleic acids, delivering novel insights into their molecular functions.

Materials and Methods

Recombinant antibodies.

Commercial antibodies targeting inosine and m6A were sequenced by Abterra Biosciences (San Diego, CA) (5860). Briefly, the antibodies were fragmented and submitted for MS/MS mass spectrometry. The data was then analyzed to predict the probable antibody sequence. Full-length monoclonal antibodies (mAb) and antibody fragments (Fab) were produced recombinantly in human cells by Sino Biological (Wayne, PA). Fabs were made from mAbs by papain protease digestion, Fc removal by protein A, and size exclusion chromatography. All mAbs and Fabs were shipped and stored in phosphate buffered saline (PBS; 137 mM NaCl, 2.7 mM KCl, 10 mM Na2HPO4, 1.8 mM KH2PO4).

Crystallography.

Recombinant Fabs were concentrated to approximately 3–5 mg/ml and sitting drop crystal trays were set with an Oryx4 (Douglas Instruments; Hungerford, United Kingdom). The m6A Fab was set up without and with 1 mM m6A nucleoside (MedChemExpress, HY-N0086). Crystals were observed by 4 weeks in the following conditions: 1) the inosine Fab in 50 mM Tris pH 8.3, 15% PEG 4000, 0.1 mM EDTA; 2) the m6A Fab only in 20% (v/v) PEG 2K, 0.2 M MgCl2, 100 mM Tris pH 8.0; and 3) m6A Fab with 1 mM m6A nucleoside in 0.17 M ammonium sulfate, 25.5% (w/v) PEG 4000. The inosine and m6A Fab only crystals were incubated in freezing conditions (inosine: 21% PEG 4K, 50 mM Tris pH 8.3, 0.1 mM EDTA, 20% glycerol, 0.2 mM inosine nucleoside (Sigma, I4125–1G); m6A: 20% (v/v) PEG 2K, 0.2 M MgCl2, 100 mM Tris pH 8.0, 5–15% (v/v) glycerol, 1 mM m6A nucleoside) with addition of 10 mM inosine and 10 mM m6A nucleoside for 30–60 minutes prior to freezing, respectively. X-ray diffraction data was collected at Lilly Research Laboratories Collaborative Access Team (LRL-CAT; Argonne National Laboratory; Argonne, IL) and ESRF ID30B (Life Sciences Collaborative Access Team (LS-CAT) operating at the European Synchrotron Radiation Facility (ESRF); Grenoble, France). Data was collected and processed by Lilly, UW-Madison Crystallography Core, and the authors. All data was indexed, merged, and scaled in XDS/Aimless (61). Space groups were determined in XDS/pointless (61). Model building and refinement were performed in Coot (62) and Phenix (63), respectively. In some of the inosine and m6A Fab density maps, a large density was observed at the Fab antigen binding site. The respective modified RNA nucleosides used in crystallization and in freezing modeled well into these densities (Fig 1A,B). The final structures and merged reflection files are deposited at wwPDB (wwpdb.org; PDB IDs: 8SIP, 8TCA, 8VEV). Unmerged reflection data were deposited at Integrated Resource for Reproducibility in Macromolecular Crystallography (proteindiffraction.org).

System setup for molecular modeling.

Coordinates for the inosine and m6A Fabs were obtained from our Protein Data Bank (PDB) entries 8SIP and 8VEV. Residue flips for His, Glu, and Asn were assessed using the MolProbity webserver (64). Protonation states of titratable residues were assigned based on their predicted pKa values at pH 7.0 using PROPKA (65,66). The protein-nucleoside complexes were then solvated using the CHARMM-GUI Solution Builder (67), requiring a minimum of 10 Å of solvent padding from each face. The resulting cubic water box dimensions were 101 Å per edge for the inosine system and 98 Å per edge for the m6A system. Sufficient K+ or Cl ions were added to neutralize the net charge of each system. Additional K+, Mg2+, and Cl ions were then added to achieve a final ionic strength of 150 mM KCl and 0–5 mM MgCl2. This process was repeated to solvate the individual nucleosides without their respective Fabs, yielding unbound model systems with cubic box dimensions of 30 Å per edge for inosine and 32 Å per edge for m6A.

All simulations were performed using the CHARMM molecular simulation package ((68,69), developmental version c47a2) with the Basic λ-Dynamics Engine (BLaDE) on graphics processing units (GPUs) (70). Prior to running molecular dynamics, each system was subjected to 250 steps of steepest descent minimization. Molecular dynamics (MD) simulations were then run in the isothermal-isobaric (NPT) ensemble at 25°C and 1 atm using a Langevin thermostat and Monte Carlo barostat (7072). The g-BAOAB integrator was used with an integration timestep of 2 fs and trajectory frames were saved every 1000 steps (70,73). Bond lengths between hydrogens and heavy atoms were constrained using the SHAKE algorithm (7477). Periodic boundary conditions were employed in conjunction with Particle Mesh Ewald (PME) electrostatics (7880), to compute long-range electrostatic forces, and force-switched van der Waals (vdW) interactions (81). Nonbonded cutoffs were set to 10 Å, with force switching taking effect starting at 9 Å.

All explicit solvent calculations were conducted using the TIP3P water model (82). The CHARMM36 protein force field was used to represent the inosine and m6A Fabs, and the CHARMM36 nucleic acid force field was used to represent the RNA oligos (8387). Modified ribonucleobase parameters were used to model noncanonical bases in the ribonucleoside (54). For the alchemical perturbations performed with λ-dynamics, ribonucleoside base mutations were represented using a hybrid multiple-topology approach (88). In the case of purine-to-purine mutations, analogous atoms in the shared core were harmonically restrained to one another using the Scaling of Constrained Atoms (SCAT) interface described previously (89).

λ-Dynamics calculations.

From 112 parameterized modified ribonucleobases available (54), a library of 48 bases, comprising 44 modified and 4 unmodified base candidates, were selected for in silico screening with λ-dynamics. Those with charged functional groups, bulky side chains, or modifications to the ribose sugar were excluded. Simulations were conducted for each of the 48 ribonucleosides with λ-dynamics to alchemically transform wild-type nucleoside bases (inosine or m6A) into a corresponding mutant base and compute relative differences in binding affinities. Prior to initiating λ-dynamics production sampling, appropriate biasing potentials must first be identified. The Adaptive Landscape Flattening (ALF) (49,90) algorithm was used to identify optimal biasing potentials to facilitate dynamic and frequent alchemical transitions between the perturbed bases. For each perturbation, ALF identified initial biases by first conducting one hundred simulations of 100 ps MD sampling, followed by 13 simulations of 1 ns each. These biases were then further refined via five replicate simulations of 5 ns each. With optimal biases identified, five independent production simulations of 25 ns were conducted, with an initial 5 ns of sampling removed from free energy determinations for equilibration. Ribonucleosides that unbound from the Fab binding site during λ-dynamics production sampling were labeled as unfavorable and were not pursued further. In all other cases, final ΔΔGbind values were calculated by Boltzmann reweighting the end-state populations to the original biases with WHAM (49,91). Uncertainties (σ) were calculated by bootstrapping the standard deviation of the mean across each of the five independent trials. From these results, modified oligonucleotides were selected for synthesis based on commercial availability.

RNA oligonucleotide preparation.

RNA oligonucleotides used for binding affinity measurements and crystallographic studies were synthesized on an ABI 394 DNA/RNA synthesizer (Applied Biosystems (ABI); Waltham, MA). m6A (10-3005-90; Glen Research; Sterline, VA), m62A (ANP-8626; Chemgene; Wilmington, MA), and inosine (ANP-5680; Chemgene) modified RNA phosphoramidites; Biotin phosphoramidite (CLP-1517; Chemgene); and canonical RNA (A, ANP-5671; U, ANP-5674; C, ANP-6676; Chemgene) phosphoramidites were purchased from commercial sources. The canonical and modified phosphoramidites were concentrated to 0.1 M in acetonitrile. Coupling was carried out using a 5-benzylthio-1H-tetrazole (5-BTT) solution (0.25 M) as the catalyst. The coupling time was 650 seconds. 3% trichloroacetic acid in methylene chloride was used for the detritylation. Syntheses were performed on control pore glass (CPG-1000) immobilized with the appropriate nucleosides. All L-oligonucleotides were prepared with DMTr-on and in-house deprotected using AMA (1:1 v/v aqueous mixture of 30% w/v ammonium hydroxide and 40% w/v methylamine) for 15 minutes at 65°C. The RNA strands were additionally desilylated with Et3N•3HF solution to remove TBDMS groups. The 5’-DMTr deprotection was carried out using the commercial Glen-Pak purification cartridge (Glen Research). Purification was initially performed by the commercial Glen-Pak purification cartridge, followed by further purification with a 15% denaturing PAGE gel. The oligonucleotides were collected, lyophilized, desalted, re-dissolved in water, and then concentrated as appropriate for downstream experiments. Concentrations of the aqueous RNA samples were determined by their UV absorption at 260 nm, using the Thermo Scientific Nanodrop One Spectrophotometer. The theoretical molar extinction coefficients of these samples at 260 nm were provided by Integrated DNA Technologies.

ELISA.

Biotin-labeled, RNA oligos were diluted to 100 nM in ELISA blocking buffer (PBS, 0.05% Tween-20, 0.2 mg/ml bovine serum albumin (BSA, BP9706100; Fisher Scientific; Hampton, NH)), and 100 ul were incubated in clear, 96-well NeutrAvidin Coated Plates (PI15217; Pierce; Waltham, MA) overnight at 4°C. Two technical replicates were set for each RNA oligo. ELISA blocking buffer without oligo condition was used as a negative control. The plates were washed with PBS-T (PBS with 0.05% Tween-20) 3 times, and varying concentrations of recombinant mAb incubated in each well for 1 hour at room temperature (approximately 20°C). A no-mAb condition was used as a no primary antibody control. Plates were washed 3 times again with PBS-T and incubated with goat anti-mouse IgG conjugated to horseradish peroxidase (HRP, NBP2–30347H; Novus Biologicals; Centennial, CO) at 0.05 μg/ml in ELISA blocking buffer for 1 hour at room temperature (approximately 20°C). The plates were washed again with PBS-T and incubated with 50 ul of room temperature 1-Step Ultra TMB-ELISA Substrate Solution (PI34028; Pierce). After 15 minutes, the reaction was stopped with 50 ul of 2M Sulfuric Acid (A300S-500, Fisher Scientific). The plates were analyzed by 450 nm absorbance with a Synergy H1 microplate reader (BioTek Instruments; Winooski, VT). All ELISA experiments were replicated at least 3 times. The 3 cleanest runs were reported. Averages, standard deviations, and graphs were performed and made in GraphPad Prism version 10.1.1 for MacOS (GraphPad Software, Boston, MA).

Supplementary Material

Supplement 1
Download video file (8.6MB, mov)
Supplement 2
Download video file (6.8MB, mov)
1

Acknowledgements

The authors thank Dr. Millie Georgiadis for help with crystal data collection and model building, and members of the Aoki, Vilseck, and Zhang Labs for their helpful discussion. Data collection and processing was also facilitated by Craig A. Bingman of the Department of Biochemistry Collaborative Crystallography Core and the University of Wisconsin, which is supported by user fees and the department. X-ray diffraction data were collected on Advanced Photon Source beamline LRL-CAT 31-ID and the European Synchrotron Radiation Facility (ESRF) ID30B operated by the Life Sciences Collaborative Access Team (LS-CAT). This research used resources of the Advanced Photon Source, a US Department of Energy (DOE) Office of Science User Facility operated for the DOE Office of Science by Argonne National Laboratory under Contract No. DE-AC02-06CH11357. Use of the Lilly Research Laboratories Collaborative Access Team (LRL-CAT) beamline at Sector 31 of the Advanced Photon Source was provided by Eli Lilly Company, which operates the facility. The authors acknowledge the Indiana University Pervasive Technology Institute for providing supercomputing and storage resources that have contributed to the research results reported within this paper. S.T.A, W.Z., and J.Z.V. received start-up funds from the Indiana University School of Medicine and its Precision Health Initiative (PHI). J.Z.V. gratefully acknowledges the National Institutes of Health (NIH) for financial support through grant R35GM146888. S.T.A. is funded by the NIH/NIGMS (R35GM142691) and an Indiana University Research Support Funds Grant (RSFG).

References

  • 1.McCown P.J., Ruszkowska A., Kunkler C.N., Breger K., Hulewicz J.P., Wang M.C., Springer N.A. and Brown J.A. (2020) Naturally occurring modified ribonucleosides. Wiley Interdiscip Rev RNA, 11, e1595. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Roundtree I.A., Evans M.E., Pan T. and He C. (2017) Dynamic RNA Modifications in Gene Expression Regulation. Cell, 169, 1187–1200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Kennedy E.M., Courtney D.G., Tsai K. and Cullen B.R. (2017) Viral Epitranscriptomics. J Virol, 91. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Wu X., Sang L. and Gong Y. (2018) N6-methyladenine RNA modification and cancers. Am J Cancer Res, 8, 1957–1966. [PMC free article] [PubMed] [Google Scholar]
  • 5.Liu C., Sun H., Yi Y., Shen W., Li K., Xiao Y., Li F., Li Y., Hou Y., Lu B. et al. (2023) Absolute quantification of single-base m(6)A methylation in the mammalian transcriptome using GLORI. Nat Biotechnol, 41, 355–366. [DOI] [PubMed] [Google Scholar]
  • 6.Carlile T.M., Rojas-Duran M.F., Zinshteyn B., Shin H., Bartoli K.M. and Gilbert W.V. (2014) Pseudouridine profiling reveals regulated mRNA pseudouridylation in yeast and human cells. Nature, 515, 143–146. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Stoiber M., Quick J., Egan R., Lee J.E., Celniker S., Neely R.K., Loman N., Pennacchio L.A. and Brown J. (2017) De novo Identification of DNA Modifications Enabled by Genome-Guided Nanopore Signal Processing. bioRxiv, 094672. [Google Scholar]
  • 8.Lorenz D.A., Sathe S., Einstein J.M. and Yeo G.W. (2020) Direct RNA sequencing enables m(6)A detection in endogenous transcript isoforms at base-specific resolution. RNA, 26, 19–28. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Gao Y., Liu X., Wu B., Wang H., Xi F., Kohnen M.V., Reddy A.S.N. and Gu L. (2021) Quantitative profiling of N(6)-methyladenosine at single-base resolution in stem-differentiating xylem of Populus trichocarpa using Nanopore direct RNA sequencing. Genome Biol, 22, 22. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Hendra C., Pratanwanich P.N., Wan Y.K., Goh W.S.S., Thiery A. and Goke J. (2022) Detection of m6A from direct RNA sequencing using a multiple instance learning framework. Nat Methods, 19, 1590–1598. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Leger A., Amaral P.P., Pandolfini L., Capitanchik C., Capraro F., Miano V., Migliori V., Toolan-Kerr P., Sideri T., Enright A.J. et al. (2021) RNA modifications detection by comparative Nanopore direct RNA sequencing. Nat Commun, 12, 7198. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Pratanwanich P.N., Yao F., Chen Y., Koh C.W.Q., Wan Y.K., Hendra C., Poon P., Goh Y.T., Yap P.M.L., Chooi J.Y. et al. (2021) Identification of differential RNA modifications from nanopore direct RNA sequencing with xPore. Nat Biotechnol, 39, 1394–1402. [DOI] [PubMed] [Google Scholar]
  • 13.Liu H., Begik O., Lucas M.C., Ramirez J.M., Mason C.E., Wiener D., Schwartz S., Mattick J.S., Smith M.A. and Novoa E.M. (2019) Accurate detection of m(6)A RNA modifications in native RNA sequences. Nat Commun, 10, 4079. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Parker M.T., Knop K., Sherwood A.V., Schurch N.J., Mackinnon K., Gould P.D., Hall A.J., Barton G.J. and Simpson G.G. (2020) Nanopore direct RNA sequencing maps the complexity of Arabidopsis mRNA processing and m(6)A modification. Elife, 9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Price A.M., Hayer K.E., McIntyre A.B.R., Gokhale N.S., Abebe J.S., Della Fera A.N., Mason C.E., Horner S.M., Wilson A.C., Depledge D.P. and Weitzman M.D. (2020) Direct RNA sequencing reveals m(6)A modifications on adenovirus RNA are necessary for efficient splicing. Nat Commun, 11, 6016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Jenjaroenpun P., Wongsurawat T., Wadley T.D., Wassenaar T.M., Liu J., Dai Q., Wanchai V., Akel N.S., Jamshidi-Parsian A., Franco A.T. et al. (2021) Decoding the epitranscriptional landscape from native RNA sequences. Nucleic Acids Res, 49, e7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Zhong Z.D., Xie Y.Y., Chen H.X., Lan Y.L., Liu X.H., Ji J.Y., Wu F., Jin L., Chen J., Mak D.W. et al. (2023) Systematic comparison of tools used for m(6)A mapping from nanopore direct RNA sequencing. Nat Commun, 14, 1906. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Sarkar A., Gasperi W., Begley U., Nevins S., Huber S.M., Dedon P.C. and Begley T.J. (2021) Detecting the epitranscriptome. Wiley Interdiscip Rev RNA, 12, e1663. [DOI] [PubMed] [Google Scholar]
  • 19.Horowitz S., Horowitz A., Nilsen T.W., Munns T.W. and Rottman F.M. (1984) Mapping of N6-methyladenosine residues in bovine prolactin mRNA. Proc Natl Acad Sci U S A, 81, 5667–5671. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Dominissini D., Moshitch-Moshkovitz S., Schwartz S., Salmon-Divon M., Ungar L., Osenberg S., Cesarkas K., Jacob-Hirsch J., Amariglio N., Kupiec M. et al. (2012) Topology of the human and mouse m6A RNA methylomes revealed by m6A-seq. Nature, 485, 201–206. [DOI] [PubMed] [Google Scholar]
  • 21.Grozhik A.V., Olarerin-George A.O., Sindelar M., Li X., Gross S.S. and Jaffrey S.R. (2019) Antibody cross-reactivity accounts for widespread appearance of m(1)A in 5’UTRs. Nat Commun, 10, 5126. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Safra M., Sas-Chen A., Nir R., Winkler R., Nachshon A., Bar-Yaacov D., Erlacher M., Rossmanith W., Stern-Ginossar N. and Schwartz S. (2017) The m1A landscape on cytosolic and mitochondrial mRNA at single-base resolution. Nature, 551, 251–255. [DOI] [PubMed] [Google Scholar]
  • 23.Li X., Xiong X., Wang K., Wang L., Shu X., Ma S. and Yi C. (2016) Transcriptome-wide mapping reveals reversible and dynamic N(1)-methyladenosine methylome. Nat Chem Biol, 12, 311–316. [DOI] [PubMed] [Google Scholar]
  • 24.Dominissini D., Nachtergaele S., Moshitch-Moshkovitz S., Peer E., Kol N., Ben-Haim M.S., Dai Q., Di Segni A., Salmon-Divon M., Clark W.C. et al. (2016) The dynamic N(1)-methyladenosine methylome in eukaryotic messenger RNA. Nature, 530, 441–446. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Delatte B., Wang F., Ngoc L.V., Collignon E., Bonvin E., Deplus R., Calonne E., Hassabi B., Putmans P., Awe S. et al. (2016) RNA biochemistry. Transcriptome-wide distribution and function of RNA hydroxymethylcytosine. Science, 351, 282–285. [DOI] [PubMed] [Google Scholar]
  • 26.Amort T., Rieder D., Wille A., Khokhlova-Cubberley D., Riml C., Trixl L., Jia X.Y., Micura R. and Lusser A. (2017) Distinct 5-methylcytosine profiles in poly(A) RNA from mouse embryonic stem cells and brain. Genome Biol, 18, 1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Stanfield R.L. and Wilson I.A. (2014) Antibody Structure. Microbiol Spectr, 2. [DOI] [PubMed] [Google Scholar]
  • 28.Lipman N.S., Jackson L.R., Trudel L.J. and Weis-Garcia F. (2005) Monoclonal versus polyclonal antibodies: distinguishing characteristics, applications, and information resources. ILAR J, 46, 258–268. [DOI] [PubMed] [Google Scholar]
  • 29.Lee C.C., Perchiacca J.M. and Tessier P.M. (2013) Toward aggregation-resistant antibodies by design. Trends Biotechnol, 31, 612–620. [DOI] [PubMed] [Google Scholar]
  • 30.Wilson I.A. and Stanfield R.L. (2021) 50 Years of structural immunology. J Biol Chem, 296, 100745. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Pokkuluri P.R., Bouthillier F., Li Y., Kuderova A., Lee J. and Cygler M. (1994) Preparation, characterization and crystallization of an antibody Fab fragment that recognizes RNA. Crystal structures of native Fab and three Fab-mononucleotide complexes. J Mol Biol, 243, 283–297. [DOI] [PubMed] [Google Scholar]
  • 32.Weichmann F., Hett R., Schepers A., Ito-Kureha T., Flatley A., Slama K., Hastert F.D., Angstman N.B., Cardoso M.C., Konig J. et al. (2020) Validation strategies for antibodies targeting modified ribonucleotides. RNA, 26, 1489–1506. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Helm M. and Motorin Y. (2017) Detecting RNA modifications in the epitranscriptome: predict and validate. Nat Rev Genet, 18, 275–291. [DOI] [PubMed] [Google Scholar]
  • 34.Sonenberg N. (2008) eIF4E, the mRNA cap-binding protein: from basic discovery to translational research. Biochem Cell Biol, 86, 178–183. [DOI] [PubMed] [Google Scholar]
  • 35.Song L.F. and Merz K.M. Jr. (2020) Evolution of Alchemical Free Energy Methods in Drug Discovery. J Chem Inf Model, 60, 5308–5318. [DOI] [PubMed] [Google Scholar]
  • 36.Chodera J.D., Mobley D.L., Shirts M.R., Dixon R.W., Branson K. and Pande V.S. (2011) Alchemical free energy methods for drug discovery: progress and challenges. Curr Opin Struct Biol, 21, 150–160. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Kollman P. (2002) Free energy calculations: Applications to chemical and biochemical phenomena. Chemical Reviews, 93, 2395–2417. [Google Scholar]
  • 38.Knight J.L. and Brooks C.L. (2011) Multisite λ Dynamics for Simulated Structure–Activity Relationship Studies. Journal of Chemical Theory and Computation, 7, 2728–2739. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Knight J.L. and Brooks C.L. (2011) Applying efficient implicit nongeometric constraints in alchemical free energy simulations. Journal of Computational Chemistry, 32, 3423–3432. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Kong X. and Brooks C.L. (1996) λ-dynamics: A new approach to free energy calculations. The Journal of Chemical Physics, 105, 2414–2423. [Google Scholar]
  • 41.Knight J.L. and Brooks C.L. (2009) λ-Dynamics free energy simulation methods. Journal of Computational Chemistry, 30, 1692–1700. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Vilseck J.Z., Armacost K.A., Hayes R.L., Goh G.B. and Brooks C.L. (2018) Predicting Binding Free Energies in a Large Combinatorial Chemical Space Using Multisite λ Dynamics. The Journal of Physical Chemistry Letters, 9, 3328–3332. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Lee W.-G., Gallardo-Macias R., Frey K.M., Spasov K.A., Bollini M., Anderson K.S. and Jorgensen W.L. (2013) Picomolar Inhibitors of HIV Reverse Transcriptase Featuring Bicyclic Replacement of a Cyanovinylphenyl Group. Journal of the American Chemical Society, 135, 16705–16713. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Lee W.-G., Frey K.M., Gallardo-Macias R., Spasov K.A., Bollini M., Anderson K.S. and Jorgensen W.L. (2014) Picomolar Inhibitors of HIV-1 Reverse Transcriptase: Design and Crystallography of Naphthyl Phenyl Ethers. ACS Medicinal Chemistry Letters, 5, 1259–1262. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Vilseck J.Z., Sohail N., Hayes R.L. and Brooks C.L. (2019) Overcoming Challenging Substituent Perturbations with Multisite λ-Dynamics: A Case Study Targeting β-Secretase 1. The Journal of Physical Chemistry Letters, 10, 4875–4880. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Keränen H., Pérez-Benito L., Ciordia M., Delgado F., Steinbrecher T.B., Oehlrich D., van Vlijmen H.W.T., Trabanco A.A. and Tresadern G. (2017) Acylguanidine Beta Secretase 1 Inhibitors: A Combined Experimental and Free Energy Perturbation Study. Journal of Chemical Theory and Computation, 13, 1439–1453. [DOI] [PubMed] [Google Scholar]
  • 47.Hanquier J.N., Sanders K., Berryhill C.A., Sahoo F.K., Hudmon A., Vilseck J.Z. and Cornett E.M. (2023) Identification of nonhistone substrates of the lysine methyltransferase PRDM9. J Biol Chem, 299, 104651. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Peck Justice S.A., Barron M.P., Qi G.D., Wijeratne H.R.S., Victorino J.F., Simpson E.R., Vilseck J.Z., Wijeratne A.B. and Mosley A.L. (2020) Mutant thermal proteome profiling for characterization of missense protein variants and their associated phenotypes within the proteome. J Biol Chem, 295, 16219–16238. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Hayes R.L., Vilseck J.Z. and Brooks C.L. 3rd. (2018) Approaching protein design with multisite lambda dynamics: Accurate and scalable mutational folding free energies in T4 lysozyme. Protein Sci, 27, 1910–1922. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Fanning S.W. and Horn J.R. (2011) An anti-hapten camelid antibody reveals a cryptic binding site with significant energetic contributions from a nonhypervariable loop. Protein Sci, 20, 1196–1207. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Yau K.Y., Lee H. and Hall J.C. (2003) Emerging trends in the synthesis and improvement of hapten-specific recombinant antibodies. Biotechnol Adv, 21, 599–637. [DOI] [PubMed] [Google Scholar]
  • 52.Patil D.P., Pickering B.F. and Jaffrey S.R. (2018) Reading m(6)A in the Transcriptome: m(6)A-Binding Proteins. Trends Cell Biol, 28, 113–127. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Xu C., Liu K., Ahmed H., Loppnau P., Schapira M. and Min J. (2015) Structural Basis for the Discriminative Recognition of N6-Methyladenosine RNA by the Human YT521-B Homology Domain Family of Proteins. J Biol Chem, 290, 24902–24913. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Xu Y., Vanommeslaeghe K., Aleksandrov A., MacKerell A.D. Jr. and Nilsson L. (2016) Additive CHARMM force field for naturally occurring modified ribonucleotides. J Comput Chem, 37, 896–912. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Seeliger D., Buelens F.P., Goette M., de Groot B.L. and Grubmuller H. (2011) Towards computational specificity screening of DNA-binding proteins. Nucleic Acids Res, 39, 8281–8290. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Beierlein F.R., Kneale G.G. and Clark T. (2011) Predicting the effects of basepair mutations in DNA-protein complexes by thermodynamic integration. Biophys J, 101, 1130–1138. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Gapsys V. and de Groot B.L. (2017) Alchemical Free Energy Calculations for Nucleotide Mutations in Protein-DNA Complexes. J Chem Theory Comput, 13, 6275–6289. [DOI] [PubMed] [Google Scholar]
  • 58.Bandeira N., Pham V., Pevzner P., Arnott D. and Lill J.R. (2008) Automated de novo protein sequencing of monoclonal antibodies. Nat Biotechnol, 26, 1336–1338. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Castellana N.E., Pham V., Arnott D., Lill J.R. and Bafna V. (2010) Template proteogenomics: sequencing whole proteins using an imperfect database. Mol Cell Proteomics, 9, 1260–1270. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Castellana N.E., McCutcheon K., Pham V.C., Harden K., Nguyen A., Young J., Adams C., Schroeder K., Arnott D., Bafna V. et al. (2011) Resurrection of a clinical antibody: template proteogenomic de novo proteomic sequencing and reverse engineering of an anti-lymphotoxin-alpha antibody. Proteomics, 11, 395–405. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Kabsch W. (2010) Xds. Acta Crystallogr D Biol Crystallogr, 66, 125–132. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Emsley P., Lohkamp B., Scott W.G. and Cowtan K. (2010) Features and development of Coot. Acta Crystallogr D Biol Crystallogr, 66, 486–501. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Liebschner D., Afonine P.V., Baker M.L., Bunkoczi G., Chen V.B., Croll T.I., Hintze B., Hung L.W., Jain S., McCoy A.J. et al. (2019) Macromolecular structure determination using X-rays, neutrons and electrons: recent developments in Phenix. Acta Crystallogr D Struct Biol, 75, 861–877. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Williams C.J., Headd J.J., Moriarty N.W., Prisant M.G., Videau L.L., Deis L.N., Verma V., Keedy D.A., Hintze B.J., Chen V.B. et al. (2018) MolProbity: More and better reference data for improved all-atom structure validation. Protein Sci, 27, 293–315. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Olsson M.H.M., Søndergaard C.R., Rostkowski M. and Jensen J.H. (2011) PROPKA3: Consistent Treatment of Internal and Surface Residues in Empirical pKa Predictions. Journal of Chemical Theory and Computation, 7, 525–537. [DOI] [PubMed] [Google Scholar]
  • 66.Sondergaard C.R., Olsson M.H., Rostkowski M. and Jensen J.H. (2011) Improved Treatment of Ligands and Coupling Effects in Empirical Calculation and Rationalization of pKa Values. J Chem Theory Comput, 7, 2284–2295. [DOI] [PubMed] [Google Scholar]
  • 67.Jo S., Kim T., Iyer V.G. and Im W. (2008) CHARMM-GUI: a web-based graphical user interface for CHARMM. J Comput Chem, 29, 1859–1865. [DOI] [PubMed] [Google Scholar]
  • 68.Brooks B.R., Brooks C.L. 3rd, Mackerell A.D. Jr., Nilsson L., Petrella R.J., Roux B., Won Y., Archontis G., Bartels C., Boresch S. et al. (2009) CHARMM: the biomolecular simulation program. J Comput Chem, 30, 1545–1614. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Brooks B.R., Bruccoleri R.E., Olafson B.D., States D.J., Swaminathan S. and Karplus M. (1983) CHARMM: A program for macromolecular energy, minimization, and dynamics calculations. Journal of Computational Chemistry, 4, 187–217. [Google Scholar]
  • 70.Hayes R.L., Buckner J. and Brooks C.L. 3rd. (2021) BLaDE: A Basic Lambda Dynamics Engine for GPU-Accelerated Molecular Dynamics Free Energy Calculations. J Chem Theory Comput, 17, 6799–6807. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Åqvist J., Wennerström P., Nervall M., Bjelic S. and Brandsdal B.O. (2004) Molecular dynamics simulations of water and biomolecules with a Monte Carlo constant pressure algorithm. Chemical physics letters, 384, 288–294. [Google Scholar]
  • 72.Chow K.-H. and Ferguson D.M. (1995) Isothermal-isobaric molecular dynamics simulations with Monte Carlo volume sampling. Computer physics communications, 91, 283–289. [Google Scholar]
  • 73.Leimkuhler B. and Matthews C. (2016) Efficient molecular dynamics using geodesic integration and solvent-solute splitting. Proc Math Phys Eng Sci, 472, 20160138. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Ryckaert J.-P., Ciccotti G. and Berendsen H.J. (1977) Numerical integration of the cartesian equations of motion of a system with constraints: molecular dynamics of n-alkanes. Journal of computational physics, 23, 327–341. [Google Scholar]
  • 75.van Gunsteren W.F. and Berendsen H.J. (1977) Algorithms for macromolecular dynamics and constraint dynamics. Molecular Physics, 34, 1311–1327. [Google Scholar]
  • 76.Andersen H.C. (1983) Rattle: A “velocity” version of the shake algorithm for molecular dynamics calculations. Journal of computational Physics, 52, 24–34. [Google Scholar]
  • 77.Kräutler V., Van Gunsteren W.F. and Hünenberger P.H. (2001) A fast SHAKE algorithm to solve distance constraint equations for small molecules in molecular dynamics simulations. Journal of computational chemistry, 22, 501–508. [Google Scholar]
  • 78.Darden T., York D. and Pedersen L. (1993) Particle mesh Ewald: An N· log (N) method for Ewald sums in large systems. The Journal of chemical physics, 98, 10089–10092. [Google Scholar]
  • 79.Essmann U., Perera L., Berkowitz M.L., Darden T., Lee H. and Pedersen L.G. (1995) A smooth particle mesh Ewald method. The Journal of chemical physics, 103, 8577–8593. [Google Scholar]
  • 80.Huang Y., Chen W., Wallace J.A. and Shen J. (2016) All-Atom Continuous Constant pH Molecular Dynamics With Particle Mesh Ewald and Titratable Water. J Chem Theory Comput, 12, 5411–5421. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Steinbach P.J. and Brooks B.R. (1994) New spherical-cutoff methods for long-range forces in macromolecular simulation. Journal of Computational Chemistry, 15, 667–683. [Google Scholar]
  • 82.Jorgensen W., Chandrasekhar J., Madura J., Impey R. and Klein M. (1983) Comparison of Simple Potential Functions for Simulating Liquid Water. J. Chem. Phys., 79, 926–935. [Google Scholar]
  • 83.Huang J., Rauscher S., Nawrocki G., Ran T., Feig M., de Groot B.L., Grubmüller H. and MacKerell A.D. (2017) CHARMM36m: an improved force field for folded and intrinsically disordered proteins. Nature Methods, 14, 71–73. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Mackerell A.D. Jr, Feig M. and Brooks Iii C.L. (2004) Extending the treatment of backbone energetics in protein force fields: Limitations of gas-phase quantum mechanics in reproducing protein conformational distributions in molecular dynamics simulations. Journal of Computational Chemistry, 25, 1400–1415. [DOI] [PubMed] [Google Scholar]
  • 85.Best R.B., Zhu X., Shim J., Lopes P.E.M., Mittal J., Feig M. and MacKerell A.D. Jr. (2012) Optimization of the Additive CHARMM All-Atom Protein Force Field Targeting Improved Sampling of the Backbone ϕ, ψ and Side-Chain χ1 and χ2 Dihedral Angles. Journal of Chemical Theory and Computation, 8, 3257–3273. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Denning E.J., Priyakumar U.D., Nilsson L. and Mackerell A.D. Jr. (2011) Impact of 2’-hydroxyl sampling on the conformational properties of RNA: update of the CHARMM all-atom additive force field for RNA. J Comput Chem, 32, 1929–1943. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Foloppe N. and MacKerell J.A.D. (2000) All-atom empirical force field for nucleic acids: I. Parameter optimization based on small molecule and condensed phase macromolecular target data. Journal of Computational Chemistry, 21, 86–104. [Google Scholar]
  • 88.Vilseck J.Z., Cervantes L.F., Hayes R.L. and Brooks C.L. III. (2022) Optimizing Multisite λ-Dynamics Throughput with Charge Renormalization. Journal of Chemical Information and Modeling, 62, 1479–1488. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Hayes R.L. and Brooks C.L. 3rd. (2021) A strategy for proline and glycine mutations to proteins with alchemical free energy calculations. J Comput Chem, 42, 1088–1094. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Hayes R.L., Armacost K.A., Vilseck J.Z. and Brooks C.L. 3rd. (2017) Adaptive Landscape Flattening Accelerates Sampling of Alchemical Space in Multisite lambda Dynamics. J Phys Chem B, 121, 3626–3635. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91.Kumar S., Rosenberg J.M., Bouzida D., Swendsen R.H. and Kollman P.A. (1992) THE weighted histogram analysis method for free-energy calculations on biomolecules. I. The method. Journal of Computational Chemistry, 13, 1011–1021. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplement 1
Download video file (8.6MB, mov)
Supplement 2
Download video file (6.8MB, mov)
1

Articles from bioRxiv are provided here courtesy of Cold Spring Harbor Laboratory Preprints

RESOURCES