Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2026 Apr 8.
Published in final edited form as: J Chem Phys. 2026 Feb 14;164(6):064109. doi: 10.1063/5.0309358

A theoretical framework for random accelerated molecular dynamics simulations

Yiwei Ding 1, Alessia Ghidini 1, Pilar Cossio 2,3, Benoît Roux 4,a)
PMCID: PMC13054613  NIHMSID: NIHMS2162513  PMID: 41665444

Abstract

The dissociation of a ligand bound to a receptor is a rare events occurring on a timescale that is far longer than can be afforded by standard simulation methodologies. A number of specialized approaches have been designed to overcome this challenge. One of these, Random Accelerated Molecular Dynamics (RAMD), is a biased simulation method that is particularly effective at accelerating the dissociation of ligands bound to protein receptors. However, no theoretical framework is currently available to extract unbiased estimates from the RAMD simulation results. In this work, we develop a simple theory for its interpretation based on the assumption that the RAMD simulations can be approximated by an overdamped Langevin dynamics with an additional force of fixed magnitude changing direction randomly at exponentially distributed times. We shown that the dynamics of the model is consistent with a Smoluchowski equation with an effective temperature and diffusion coefficient that depend quadratically on the magnitude of the random force. These observations explain why the RAMD simulations yields accelerated dynamics. Importantly, we derive an analytical expression for the unbiased escape time of a particle over a free-energy barrier based on the modified Smoluchowski equation. The theoretical framework was used to analyse RAMD simulations monitoring the dissociation of the Trypsin-Benzamidine protein-ligand complex. The results indicate that the theoretical framework is sound and could potentially be used to unbias the results from RAMD simulations.

I. INTRODUCTION

Many processes of interest in biophysics and biochemistry involve transitions over high free-energy barriers and occur over long time scales. In particular, the dissociation kinetics in protein-ligand interactions has recently gained attention as an important factor in drug discovery1. Specifically, the residence time of a ligand, i.e., the average time it spends in contact with a protein receptor before unbinding appears to be closely related to the in vivo efficacy of potential drug candidates2,3. However, these times can span a broad range, as the residence times of potential drug candidates may range from seconds to even minutes4, which makes their study particularly challenging.

In principle, molecular dynamics (MD) simulations represent a powerful tool to investigate the underlying mechanism of such processes. However, as the transitions of interest occur very infrequently on the time scale of molecular motions, a direct observation of ligand unbinding is computationally prohibitive using simple unbiased MD simulations. The extremely long time scales limit the usefulness of standard unbiased MD trajectories in predicting protein-ligand kinetics. For this reason, such slow processes are commonly described as “rare events” and their investigations require specialized computational methodologies5,6. A wide class of approaches seek to accelerate the transitions across the free-energy barriers by modifying the potential energy, e.g., accelerated MD710, solute tempering11, or by introducing various biasing potentials, e.g., adaptive biasing force12, local elevation13, conformational flooding14, and metadynamics1517. Other methods seek to combine short unbiased simulations to determine the long-time kinetics, e.g., Markov State Models1820 milestoning2123, and the adaptive multilevel splitting method24. Another class of strategies seek to alter the inherent dynamical evolution of the system, e.g., as with the self-guided molecular dynamics25 self-guided Langevin dynamics26.

Perhaps in a class of its own, the Random Accelerated Molecular Dynamics (RAMD) was specifically designed to facilitate and accelerate the dissociation of a ligand from its receptor binding site2729. In RAMD, an external force of a fixed magnitude is applied to the center of mass (COM) of the ligand in a random direction that is periodically updated according to specific criteria in order to increases the propensity of the bound ligand to explore various regions of the binding pocket, and find an exit pathway27. For example, in the so-called τRAMD protocol30, the COM of the ligand must cover a minimum threshold distance, otherwise a new direction of the force is generated and applied for a period of time tRAMD. This process is repeated until the ligand dissociates from the binding pocket.

An important appeal of RAMD is its great effectiveness at accelerating the ligand unbinding process and its broad applicability1,3,29,3134. RAMD simulations is commonly used to estimate relative protein-ligand residence times, helping predict how long a ligand stays bound to its target. Additionally, it can be used to identify and define ligand unbinding pathways or determine collective variables for more complex systems. These identified collective variables delineating the dissociation pathway can then with other sampling techniques1215 to gain deeper quantitative insights into ligand binding.

An empirical analysis of RAMD previously showed that the logarithm of the ligand dissociation time correlates with the magnitude of the applied force in RAMD simulations35. However, no theoretical framework currently explains the fundamental significance of RAMD or describes an efficient selection of its optimal parameters. Importantly, no established protocol currently exists to eliminate or correct the bias introduced by RAMD simulations. Motivated by these open questions, we develop a theoretical framework to elucidate the significance of RAMD simulations. In particular, we establish a theory that accounts for the influence of the random force in RAMD under Langevin dynamics. This formal analysis is then tested and validated with numerical simulations. Afterwards, the theoretical framework is illustrated in the context the protein-ligand complex Trypsin-Benzamidine, an experimentally well characterized system36, that has been used as a benchmarking system for computational studies of unbinding kinetics. Although some previous computational estimates are close to experiment24,3739, the reported values span a wide range1623,4043, reflecting the uncertainty in methods and force fields to accurately predict unbinding rates44.

II. THEORY

A. The original RAMD protocol

In RAMD an external random force is applied to the COM of a ligand to accelerate its dissociation from its binding site27. Let us write the RAMD force as FRAMD(t)=FRAMDu(t), where FRAMD is the magnitude of the force and u(t) is a time-dependent unit vector. The magnitude of the external force is an input parameter that remains constant throughout the RAMD simulation, while its direction given by the unit vector u(t) is randomly changed when the ligand does not make sufficient progress toward dissociation according to a minimum distance threshold applied at a constant period of time tRAMD2729. If the ligand travels a distance greater than a selected threshold over the time tRAMD then the direction u is kept constant for the subsequent period of time tRAMD. However, if the distance traveled by the ligand is smaller than the threshold then a new random direction u is generated and applied for the subsequent period of time tRAMD. The process is repeated until the ligand dissociates from the binding pocket. As designed, the applied random force FRAMD(t) increases the propensity of the bound ligand to explore various regions of the binding pocket, and eventually find an exit pathway. Specifically, the “τRAMD protocol”, outlining strategies for system preparation and statistical analysis, was developed to estimate the relative unbinding times of a set of drug-like molecules from a receptor30. In the τRAMD protocol, the COM of the ligand must cover a distance of at least 0.025 Å during 0.1 ps of RAMD simulation, otherwise a new direction of the force is selected. The RAMD simulations is carried out until the distance between the ligand and the protein COM is greater than 30 Å. The magnitude FRAMD is the only tunable input parameter of this protocol, and its value influences the computational time required for ligand unbinding.

B. Langevin dynamics model of RAMD

In RAMD simulations, the applied random force changes its direction randomly according to a minimum distance threshold. For this reason, the dynamical propagation comprises elements of spatial and temporal memory that are difficult to model mathematically in a simple form. To make progress, we consider a simpler dynamical model that mimics the true RAMD protocol while remaining mathematically tractable. To this end, we represent the RAMD dynamics of a ligand by the Langevin equation,

mdvdt=-𝛁W(r)-γv+FRAMD(t)+f(t) (1)

where r is the position in a d-dimensional space, v is the velocity, W(r) is the potential of mean force (PMF), γ is the dissipative friction, and f(t) is a gaussian-distributed random force obeying f(t)=0, and f(t)f(0)=2dγkBTδ(t). kB is Boltzmann’s constant and T is the temperature. In one-dimension, we recover the familiar fluctuation-dissipation relation, f(t)f(0)=2γkBTδ(t). Eq. (1) only provides an approximate description to the dynamics because the influence of the environment is modeled using the classic Langevin equation and memory effects are neglected. While one could, in principle, adopt a more detailed representation such as the Generalized Langevin Equation (GLE)45, any potential benefit would be severely constrained by the strong approximations needed for the treatment of RAMD.

Two stochastic factors determine the dynamics of the system based on Eq. (1). The first is the random Langevin force, and the second is the RAMD force via the time-dependent random unit vector u. In this model, we assume the RAMD force FRAMD(t) to have a constant magnitude FRAMD in a random direction that persists for an exponentially distributed random time period. Therefore, the autocorrelation function of this RAMD force is FRAMD(t)FRAMD(0)=FRAMD2u(t)u(0)=e-t/τ where u is a normalized unit vector (uu=1) and τ is a correlation time. We note that this is an approximation because in the original RAMD protocol the direction of the force persists for a length of time that depends on a minimum distance threshold. This simplifying assumption will be illustrated and validated below using RAMD simulations of ligand dissociation simulations of the Trypsin-Benzamidine complex.

It can be anticipated that a system propagated via Eq. (1) will exhibit some peculiarities because it is a stochastic dynamics driven by a time-dependent random force FRAMD(t)+(t) with a finite correlation time, which is counterbalanced by the Markovian friction -γv. This construction does not satisfy the generalized fluctuation-dissipation condition45. Let us examine the dynamical consequences of Eq. (1) for a free particle in the absence of a potential W. Starting from Eq. (1), we use the Wiener-Khinchin theorem to express the Fourier transform of the velocity autocorrelation function as,

C~vv(ω)=1/m2ω2+(γ/m)2C~FF(ω)+C~ff(ω) (2)

where C~FF(ω), and C~ff(ω) are the Fourier transform of the RAMD and Langevin force autocorrelation functions, respectively. Knowing that C~ff=2γkBT, and

C~FF(ω)=2FRAMD2τ1(ωτ)2+1 (3)

It follows that

C~vv(ω)=2FRAMD2m2(1/τ)ω2+(γ/m)2ω2+(1/τ)2+2dγkBTm21ω2+(γ/m)2 (4)

The effective diffusion coefficient is given by

Deff=12dC~vv0=2FRAMD2dm21/τγ/m21/τ2+2γkBTdm21γ/m2=1γFRAMD2τdγ+kBT (5)

Using the simple Einstein relation D=kBT/γ, the effective diffusion coefficient can be expressed as,

Deff=DkBTeffkBT (6)

Assuming that an Einstein-like fluctuation-dissipation relation holds, γDeff=kBTeff, this analysis suggests the existence of an effective RAMD temperature,

kBTeff=FRAMD2τdγ+kBT (7)

As expected, if γ is very large then the Langevin terms (dissipation and random force) act as a strong thermostat on the system and the effect of the RAMD force becomes negligible and TeffT. It is also possible to consider the average kinetic energy of the free particle. The inverse Fourier of C~vv(ω) yields the time-correlation function,

Cvv(t)=FRAMD2m21/τ1/τ2-(γ/m)2e-|t|γ/mγ/m-e-|t|/τ1/τ+kBTme-|t|γ/m (8)

assuming that γ/m1/τ. At t=0, we have,

mv2=mCvv(0) (9)
=FRAMD2τdγ11+(γ/m)τ+kBT (10)

We note that if the factor (γ/m)τ is small then the average kinetic energy of the particle is consistent with the effective temperature Teff identified in Eq. (7). While this analysis will not be pursued further, one could also derive formal results for a particle in a harmonic well based on Eq. (1).

C. Modified Smoluchowski equation

To make progress, we neglect the inertial term in the Langevin equation, mdv/dt, and consider an overdamped random diffusion model of RAMD,

γdrdt=-𝛁W+FRAMD(t)+f(t) (11)

where γ=kBT/D. The assumption is that the long-time evolution of the system can be pictured as some effective Brownian Dynamics (BD)46. To determine the long-time evolution of the associated probability P(r,t), we use the Kramers-Moyal expansion47,

Pt=1Δt𝛁-M1P+12𝛁M2P (12)

where M1 and M2 are the first and second moments, respectively, with respect to the displacement Δr(Δt)=r(Δt)-r(0) over a time step Δt. To determine the first and second moments, we integrate Eq. (11) over a time-step Δt,

r(Δt)=r(0)+1γ-𝛁WΔt+0ΔtFRAMD(t)+f(t)dt (13)

The first moment is the mean displacement over a time-step of Δt,

M1=Δr(Δt)=-DkBT𝛁WΔt

The second moment is the mean-square deviation of the displacement over a time-step of Δt,

M2=1dΔrΔt2=1dγ20Δtdt0Δtdt(FRAMDtFRAMDt+ftft (14)

where FRAMDtFRAMDt=CFFt-t is the time correlation function of the RAMD force. Assuming that CFF(t)=FRAMD2e-|t|/τ, we find that

M2=1dγ20Δtdt0ΔtdtCFFt-t+2DΔt (15)

where D=kBT/γ. We perform a change of variables for the double integral t1=t-t and t2=t+t/2,

M2=1dγ2ΔtΔtdt112t1Δt12t1dt2CFFt1+2DΔt2DFRAMD2τdγ+kBTΔt (16)

In the last step, it was assumed that Δtτ. In this approximation, the evolution of the probability P(r,t) associated with the overdamped RAMD diffusion process follows the modified Smoluchowski equation,

Pt=𝛁DeffkBTeff𝛁WP+Deff𝛁P (17)

where Deff and Teff are consistent with Eqs. (5) and (7) from the analysis of the RAMD Langevin equation. The simple diffusion model based on the modified Smoluchowski Eq. (17) makes it possible to examine the relaxation of the system at very long time in the presence of a PMF. In the following, this theory is exploited to derive important results used for the analysis of RAMD simulations.

D. Analytical results from the modified Smoluchowski equation

1. A particle in a harmonic well

The modified Smoluchowski theory allows us to derive explicit expressions regarding the statistical behavior of a system propagated by the overdamped diffusion RAMD dynamics. The objective is to then exploit those expressions to extract the maximum of information from all-atom RAMD simulations. However, the modified Smoluchowski theory associated with the overdamped diffusion RAMD model involves an effective temperature and diffusion coefficient, which are defined by several parameters, FRAMD, and τ, as well as the underlying free energy landscape W(r) and friction γ. While the present theoretical analysis provides coherent framework, it is important to keep in mind that it is an approximation to a real RAMD simulation. For this reason, there are hidden dependencies among the various parameters when trying to match the results of the (approximate) overdamped diffusion RAMD model to the real RAMD simulations. For example, the lag-time τ governing the period of time during which the RAMD force is kept constant may be affected by the magnitude of the random force in the true RAMD simulations. Furthermore, the PMF W(r) and the friction γ are unknown. All these may shift their value when the magnitude of the external force is altered in real RAMD simulations. For this reason, it may not be possible to extract all the parameters with high confidence. However, it is possible to bypass the uncertainties about these parameters by structuring the analysis around simple expressions corresponding to the situation of a ligand harmonically restrained in the binding site.

Let us first consider the simple situation where the COM of a ligand is restrained by a harmonic potential,

Uharm=12KtotΔr2 (18)

where Ktot is the total force constant affecting the ligand, and Δr is the displacement of the COM relative to its mean position. By the equipartition theorem, the fluctuations of the ligand from a RAMD simulation are,

Δr2RAMD=dkBTeffKtot (19)

where d is the dimension. In practice, when the analysis is carried out for a ligand in a binding site, Ktot=Wmin+K, where Wmin is the intrinsic curvature of the PMF in the binding site and K is the force constant from the harmonic restraining potential. It follows that the effective temperature can be expressed as,

kBTeff=Ktot1dΔr2RAMD (20)

Similarly, the effective diffusion coefficient Deff can be estimated from the relaxation time of the harmonically restrained ligand in RAMD simulations,

Deff=1τrrdΔr2RAMD (21)

where τrr is calculated from the position-position time-correlation function,

Crr(t)=Δr(t)Δr(0)RAMD=Δr2RAMDet/τrr (22)

using the identity,

τrr=0CrrtCrr(0)dt (23)

In the absence of RAMD, τrr reduces to the unperturbed diffusive relaxation time of the particle in the harmonic well, τharm=kBT/DKtot. This simple analysis provides direct estimates of the effective temperature Teff and effective diffusion coefficient Deff. This also makes possible to examine the linear relation between Teff and Deff to verify the validity and accuracy of the modified Smoluchowski overdamped dynamic RAMD model based on Eq. (17).

2. Mean escape time over a free-energy barrier

We now consider the mean escape time of a bound ligand over a free-energy barrier. This represents a critical step toward a rational interpretation of ligand residence times determined from real RAMD simulations. We consider a system with a bound (A) and unbound (B) states separated by a large energy barrier, and assume that the unbinding process occurs along the reaction coordinate x. Fluctuations orthogonal to the reaction coordinate x are assumed to have no impact on the escape process. The mean escape time from state A can be derived following the classic Kramers-Smoluchowski theory48,49, with a effective temperature Teff and diffusion coefficient Deff,

τesc=1DeffxAeWx/kBTeffdxxAxBeWx/kBTeffdx (24)

where the first integral is over the basin A and the second is over the barrier. The integrals are approximated using a quadratic expansion at the bottom of the reactant well at xA and the top of the free energy barrier located at x is given by5052,

τesc=kBTeffDeff2πWxAWxeΔW/kBTeff=τ0eΔW/kBTeff (25)

where ΔW=Wx-WxA is the activation free energy. According to the overdamped RAMD model, the dynamical prefactor τ0 is unaffected by the the random force FRAMD since Deff/kBTeff=D/kBT. The implication is that the mean escape time τesc in RAMD can be written as τ0eΔW/kBTeff, where only the dimensionless quantity ΔW/kBTeff depends on the presence of the random force. If this assumption about the linear dependence of Deff and kBTeff is not verified in practice, one can treat τ0 as some constant times lnDeff/kBTeff.

We can rewrite the expression for the mean escape time from a well as,

lnτesc=ΔWkBTα+lnτ0 (26)

where the dimensionless quantity α=T/Teff corresponds to the inverse of the relative effective temperature. This result explains the relation between FRAMD and the logarithm of the ligand dissociation time that was noted previously35. Because α<1, Eq. (26) makes it clear that the RAMD algorithm reduces the mean escape time of a bound ligand via an increased effective temperature. Furthermore, this expression, displaying a linear relationship between lnτesc and α, is very convenient because it provides a simple route to determine an unbiased estimate of the mean escape time from the RAMD simulations.

E. Limitations of the overdamped RAMD approximation

The above results, obtained in the overdamped diffusive limit, assume implicitly that the RAMD correlation time τ is much shorter than τharm, the unperturbed relaxation time of the ligand in the harmonic well. Further analysis (see Appendix) reveal that the effective temperature estimated from the quadratic fluctuations in a harmonic well has the form,

kBTeff=KtotdΔr2RAMD=FRAMD2τdγ(1η+1)+kBT (27)

where η=τ/τharm. Thus, Eq. (7) is valid only if the dimensionless parameter η=τ/τharm1. Eq. (27) shows that the effective temperature as defined is actually dependent on η. Similarly, the effective diffusion coefficient estimated from the relaxation time τrr in a harmonic retain the same form as in Eq. (21), but with τrr given by,

τrr=τharmkBTeff(η=0)kBTeff(η) (28)

If η is negligible then τrr is equal to τharm. A similar analysis would affect the imaginary relaxation time at the top of an inverted harmonic barrier in the case of an activated process53.

These results, which will be illustrated with explicit BD simulations in the next section, shows the limitations of an analysis of RAMD simulations based on the overdamped diffusive model. For this reason, the analysis is expected to be more accurate in the limit of high friction and low curvature of the potential. Hopefully, those conditions are satisfied in the context of RAMD simulations of ligand unbinding, but should be verified for each particular case.

III. RESULTS

A. Simulations of the overdamped RAMD model

To verify and validate the theoretical analysis, simulations were carried out according to the simplified overdamped diffusive model of the RAMD algorithm. In these RAMD simulations, the particle is evolving in d dimensions and an external force of fixed magnitude FRAMD is applied in a random direction over an exponentially distributed time period, after which a new direction is picked randomly. The exponential distribution is characterized by a correlation time τ. In practice, the behavior of the system was examined for three simple scenarios. The simplest situation concerns the effective diffusion coefficient Deff of a free particle in d dimension as expressed from its mean squared displacement (MSD),

Deff=limt12dt(R(t)-R(0))2 (29)

According to Eq. (5), Deff depends quadratically on the magnitude of the applied random force FRAMD. This was explicitly tested by carrying out simulations with different values of FRAMD. The results are shown in Figure 1. Direct comparison of the effective diffusion coefficient, Deff, as a function of the applied force, FRAMD, with the theoretical prediction (Eq. 5) validates the theoretical analysis.

Figure 1.

Figure 1.

Comparison between theoretical predictions and simulation results for the effective diffusion coefficient Deff as a function of the magnitude of the external force FRAMD. The theory is from Eq. (5). In these simulations, T=300K,D=0.00052/ps. The integration time step is 0.01 ps, and the total simulation time is 100000 ps. The correlation time of the RAMD random force was set to 50 ps. Statistics are obtained by repeating the simulation 10 times for each value of FRAMD.

The second situation concerns the fluctuations of a particle restrained by a harmonic potential. Both the effective temperature Teff and effective diffusion coefficient Deff depend quadratically on FRAMD. A comparison between the theoretical analysis and the simulated results is shown in Figure 2. As with the freely diffusing particle, the good agreement confirms the validity of the theoretical analysis. One may note that the variations in the case of Deff are noticeably larger than for Teff, suggesting that the effective temperature is perhaps a more robust feature of the system than the diffusion coefficient. Nonetheless, the comparison shows that the fluctuations of a harmonically restrained ligand can be analyzed using Eqs. (20) and (21), we can estimate to determine the effective temperatures and diffusion coefficient in RAMD simulations.

Figure 2.

Figure 2.

Comparison of simulation and theoretical results for the effective temperature Teff (top) and effective diffusion constant Deff (bottom) of a harmonically restrained particle as a function of the magnitude of the random forces FRAMD and total restraining force constants Ktot. Dashed lines represent theoretical predictions, while solid markers with error bars represent simulation results. The theory is from Eq. (27) for Teff, and from Eq. (21) for Deff with τrr from Eq. (28).

The last situation concerns the escape of a bound particle over an energy barrier. To model the system using the three-dimensional potential,

W(x,y,z)=12kx2+y2+50e-10z+ΔWexp-(z-p)22w (30)

where k is the spring constant confining the particle in the x-y plane, ΔW is the height of the free energy barrier along the z-axis, and w determines the barrier width. We used k=10,ΔW=2.9575,p=5, and w=1 in the simulations. The escape time as a function of α=T/Teff are shown in Figure 3. Taking the dependence of the renormalized temperature Teff on FRAMD into account, it is observed that the results obtained from the RAMD simulations closely match the predictions of KS theory. The excellent agreement suggests that the theoretical analysis based on Eq. (26) provides a viable approach for estimating the effect of FRAMD on the escape times obtained from RAMD simulations.

Figure 3.

Figure 3.

Comparison between simulations and theoretical predictions for the escape time over a potential energy barrier. Dash lines represent theoretical results, while markers with error bars represent simulation data. The result of linear regression is y=5.00x+9.3405, which is near the slope of 5 predicted from KS theory.

Based on these results, we propose the ensuing protocol to obtain unbiased estimates of the escape time τ0 from RAMD simulations:

  1. Carry out several RAMD simulations to determine the biased escape time of the ligand, τesc as a function of FRAMD. The values of the RAMD force should be sufficiently large to accelerate the unbinding within a reasonable simulation time, while still generating semi-realistic trajectories.

  2. Carry out additional RAMD simulations for the same set of FRAMD with the ligand harmonically restrained in the binding pocket to determine the fluctuation of the ligand COM, hence the effective temperature Teff (Eq. (20)) as well as the effective diffusion coefficient Deff (Eq. (21)).

  3. Perform a linear least-square fit using the pair of data points α=Teff/T (as abscissa) and lnτesc (as ordinate) determined for each value of FRAMD on the basis of Eq. (26). The slope of the linear relation corresponds to the activation free energy ΔW, and the extrapolation α1 yields the unbiased mean escape time along the ordinate.

While this itemized protocol can be rationally justified, it is important to recognize that the theory and analysis presented here is an approximate construct based on specific assumptions about the RAMD simulations. One of those assumptions is the exponential distribution of the time during which the random force remains constant before it changes its direction. Another potential weakness is the inherent friction affecting the the ligand dynamics, which translates into a sensitivity of the effective temperature on the dimensionless factor η related to the correlation time τ and the relaxation time τrr of the ligand in the binding pocket. It shall generally make sense to complete the analysis by validating these assumptions and verify if they are valid in the RAMD simulations.

B. RAMD simulations of Trypsin-Benzamidine with explicit solvent

The RAMD simulation method is applied to the Trypsin-Benzamidine protein-ligand complex, depicted in Figure 4. The complex model was modeled starting from the crystallographic structure with PDB ID 3PTB54. More information about the modeling can be found in Supplementary Information. The RAMD simulations were carried out using NAMD355, following the τRAMD protocol describe previously30. The temperature was kept constant at 300 K using a Langevin thermostat, and the pressure was kept constant at 1 atm using a Langevin barostat. The distance between the COMs of Trypsin and Benzamidine was evaluated every 0.1 ps (50 steps). If this distance exceeded a threshold of 0.025 Å, the current force direction was maintained; otherwise, a new random force direction was selected. This process was repeated until the distance between the protein and ligand COMs reached 40 Å.

Figure 4.

Figure 4.

Structural representation of the Trypsin-Benzamidine complex. The ligand Benzamidine is depicted in light green, while the protein Trypsin is shown using a grey ribbon representation overlaid with a grey surface.

The most relevant parameter to define in τRAMD is the magnitude of the random force, FRAMD. Experimental measurements indicate that Benzamidine exhibits a short residence time in complex with Trypsin (≈ 0.0016 s); thus we selected a range of relatively small force magnitudes: FRAMD from 1.4 to 14 kcal/molÅ (from 58.5 to 585 kJ/mol nm). For each force value, ten independent RAMD simulations were performed to simulate the dissociation of Benzamidine from Trypsin. Different random seeds were used in the different replicas, generating different initial force directions. The escape time at each FRAMD (i.e., computational unbinding time) was defined as the bootstrap average of the escape times registered in the ten independent RAMD simulations (Table S1). The logarithm of the escape time as a function of FRAMD is shown in Figure S1. However, at the lowest force magnitude (FRAMD=1.4kcal/mol), the ligand did not dissociate in all simulated trajectories. The escape time for this force magnitude was estimated by accounting for the probability of the unbinding event to occur and not to occur within the simulated time, as better detailed in Supplementary Information.

To determine the fluctuation of the ligand COM, and hence α=T/Teff, additional RAMD simulations of the Trypsin-Benzamidine protein-ligand complex were carried out in the presence of a harmonic restraining potential to prevent the dissociation of the ligand. For each RAMD force in the considered range (FRAMD=1.4-14kcal/mol/), independent simulations were run with different force constants of the harmonic restraining potential K applied to the ligand COM. The force constant varied from 1 to 20 kcal/mol Å2 depending on the value of FRAMD to ensure that the ligand remains inside the binding pocket (more information about the combination of FRAMD and K used can be found in Supplementary Information: Figures S2 and S3, Tables S2 and S3).

As shown in Figure 5, the magnitude of the RAMD force has a considerable impact on the distribution of the lag time τ, i.e., the time interval between consecutive force changes. For large RAMD forces, only a limited number of fore directions are sample during the RAMD simulations. This occurs because the distance traveled by the ligand COM is in most cases higher than the defined threshold of 0.025 Å. In contrast, for smaller FRAMD a larger number of random directions is sampled during RAMD simulations, and the lag time τ exhibits an exponential distribution.

Figure 5.

Figure 5.

Analysis of the lag time τ between consecutive changes in the applied RAMD force. Top) Probability distribution of the lag time extracted from a RAMD simulation with a force FRAMD equal to 1.4 kcal/molÅ. The escape time of the ligand is 113 ns, with 2245 changes in the direction of the RAMD force. Bottom) Probability distribution of the lag time extracted from a RAMD simulation with a a force FRAMD equal to 5.6 kcal/molÅ. The escape time of the ligand is 953 ps, with 23 changes in the direction of the force.

To better sample the distribution of lag times between consecutive force changes, we calculated the lag time τ from the RAMD simulations in the presence of a harmonic restraining potential. The logarithm of the correlation function FRAMD(t)FRAMD(0) for different magnitudes of the RAMD force is shown in Figure 6. The result shows that the exponential decay with a single correlation time is remarkably robust, even when the magnitude of FRAMD is excessive (i.e., larger than 5 kcal/mol Å). This is encouraging because an important assumption of our approximate model of RAMD based on Eq. (1) is that the random force is a random variable that follows an exponential distribution with a fixed correlation time τ. This assumption is strongly supported by Figure 6. This empirical finding validates the assumption underlying the theoretical development.

Figure 6.

Figure 6.

Fitting the correlation time τ for different RAMD simulations. The dashed lines are simulation results, and the solid lines are the fits. FRAMD=1.4,τ=48.03;FRAMD=2.8,τ=40.71;FRAMD=4.2,τ=42.82;FRAMD=5.6,τ=42.94;FRAMDF=7,τ=49.85;FRAMD=10,τ=39.13;FRAMD=12,τ=42.03;FRAMD=14,τ=45.34.FRAMD is in kcal/mol Å, and τ is in ps.

It is possible to determine the stationary fluctuations of the COM of the ligand, Δr2RAMD, from the RAMD simulations performed with the harmonic restraining potential. From the stationary fluctuations, one can estimate the effective temperature, Teff, based on Eq. (20) and the effective diffusion coefficient, Deff, based on Eq. (21), both as a function of the random force magnitude FRAMD. The results are shown in Figure 7. The diffusion coefficient is not absolutely necessary for the unbiasing procedure, but it provides a valuable piece of information to better interpret the results. As shown in Figure 7, both the effective temperature and the effective diffusion coefficient exhibit a dependence on FRAMD2. However, while the estimated effective temperature appears to be consistent with Eq. (7), there are considerable variations with regards to the effective diffusion coefficient. In Figure 7, the estimated Deff for different values of the harmonic restraint as a function of FRAMD increase faster than predicted by Eq. (5). Since this analysis is based on an overdamped diffusion in a harmonic potential, we conjecture that these deviations in Deff from the prediction are due to the presence of some spatial ruggedness δw in the underlying PMF felt by the ligand in the binding pocket. In one-dimension, the effect of δw in a rugged potential gives rise to an effective diffusion coefficient,56,57

Drugged=De-δw/kBTeff-1e+δw/kBTeff-1 (31)

where the bracket represents a spatial average over the diffusion space and D=kBT/γ is the unperturbed diffusion coefficient from the overdamped BD dynamics Eq. (11). As the effective temperature rises, the diffusion coefficient increases because the effect of the ruggedness decreases. While the inaccuracy of the estimated diffusion coefficient from Eq. (5) shows the limitations of the approximate treatment, the consequences on the present analysis are limited. In practice, the value of Deff/D varies at most between 1 and around 3.5 in Figure 7. While the value Deff is helpful to assess the validity of the analysis, ultimately this quantity does not have a large effect on the mean escape time, which depends exponentially on Teff. Therefore, the quantitative impact of Deff is limited.

Figure 7.

Figure 7.

Estimates of Teff and Deff from RAMD simulations of the Trypsin-Benzamidine complex with a harmonic restraint using Eqs. (20) and (21). Simulations were performed with different values of the random force FRAMD=1.4,2.8,4.2,5.6,7.0,10.0,12.0,14.0kcal/mol) and of the force constant of harmonic restraining potential relative the equilibrium bound pose (K=1,2,3,5,5,7,10,12,15,17,20kcal/mol2). The black line follows the effective temperature as Teff/T=0.022FRAMD2+1.0.

Using the estimated effective temperature determined from Figure 7, we used the linear form Eq. (26) to analyze the results of the escape time obtained from the RAMD simulations. The natural logarithm of the escape time at each force magnitude is plotted against the corresponding T/Teff in Figure 8 and fitted with a linear regression, according to Eq. (26). The fitting was also performed considering only force magnitudes from 1.4 to 7 kcal/mol Å, as shown in Figure S4.

Figure 8.

Figure 8.

Fitting the linear form based on Eq. (26). The solid line follows lnτesc=12.29α+0.38, where α=T/Teff.ΔW=7.32kcal/mol and τ0=1.46ps.

It is clear from Figure 8 that the linear form extracted from the theoretical analysis is satisfied, which is indicative that the natural logarithm of the average escape time is related to the magnitude of the random force FRAMD. The slope is related to the activation free energy ΔW, suggesting a value on the order of 12.3kBT. The unbiased residence time of the Trypsin-Benzamidine complex can be estimated by linearly extrapolating the data to α=1, yielding a value about 320 ns (considering only FRAMD=1.4-7kcal/mol, the unbiased τesc is 900 ns). While this estimate is much shorter than the experimental value of about 1.6 ms36, reported values from various computational methods span a wide range, from 10μs to 300 ms1624,3743.

Although the application of the proposed approach to the Tryspin-Benzamidine system resulted in an absolute unbinding time not consistent with the experimental data, it showcases the consistency of the theoretical framework, as demonstrated by the linear relationship between lnτesc and α reported in Figure 8. Importantly, this application showed that a wide range of force magnitudes is not required to obtain this linear relation; it can also be captured using only relatively high random force values. The advantage is that instead of estimating the absolute unbinding time relying on few long simulations, several short and parallel RAMD trajectories can be used. Therefore, this theoretical framework represents a promising basis to improve the feasibility of absolute kinetics estimation in drug discovery.

For the Trypsin-Benzamidine system, considering the short experimental time that the ligand spends in contact with the protein (≈ 0.0016 s), it is expected that high force magnitudes results in very fast escape times. Indeed, only for force magnitudes lower than 10 kcal/mol Å the average escape time was in the order of nanoseconds, reaching 101 and 102 ns for smaller forces. Thus, simulations in the order of nanoseconds or shorter can be used to obtain the linear relationship shown in Figure 8.

IV. CONCLUSION

In this article, we have developed a novel theoretical framework to better understand the underlying effect of the RAMD simulation algorithm. In particular, a special focus was put on how RAMD simulations accelerate the mean escape time of a ligand bound to a receptor protein. In our analysis, we assumed that the RAMD simulations based on the τRAMD protocol30 can be approximated by an overdamped Langevin dynamics with an additional random force. By construction, we assumed that the force had a fixed magnitude FRAMD but changed its direction randomly at exponentially distributed random times. These simplifying assumptions allowed us to represent the RAMD dynamics of a bound ligand in terms of a modified overdamped Langevin equation. A number of formal results could be obtained on the basis of this simple theory. In particular, it was shown that dynamics of the model is roughly consistent with that of an effective Smoluchowski equation with effective temperature Teff and diffusion coefficient Deff which depend on the magnitude of the random force FRAMD. This analysis shows that the overall RAMD dynamics is “accelerated” because the effective temperature and the diffusion coefficient increase quadratically with FRAMD. In retrospect, the most useful result of the present analysis is the expression for the effective RAMD temperature given by Eq. (7). Following a classic Kramers treatment of the escape of a particle over a free energy barrier based on the effective Smoluchowski equation it becomes immediately clear how the RAMD simulations accelerate the dissociation rate of a bound ligand. To some extent, the effect of the effective temperature bears some similarities to what is achieved with the solute-tempering REST2 method58, although RAMD is not explicitly scaling any component of the potential as is done in REST2. Furthermore, there are also additional kinetic effects as the diffusion coefficient is also increased. All these factors contribute to accelerate activated processes in RAMD simulations. The theoretical framework was used to analyze RAMD simulations monitoring the dissociation of the Trypsin-Benzamidine protein-ligand complex. The results show that the theoretical framework is sound and could potentially be used to unbias the results from RAMD simulations. The present treatment drawing from an effective Smoluchowski equation is based on the strong assumption of overdamped diffusive dynamics, and it is unavoidably imperfect. One possible source of inaccuracy may arise from the neglect non-Markovian effects related to the inherent relaxation times in the system relative to the RAMD lag time τ. The test with the Trypsin-Benzamidine protein-ligand complex suggest that the analysis of RAMD simulations can overcome those limitations. Lastly, while the method has been only applied in the context of ligand unbinding, it could in principle be used to accelerate the dynamics of any selected degree of freedom.

Supplementary Material

SI

The Supporting Information is available free of charge and includes:

  • Modeling of the Trypsin-Benzamidine complex;

  • Escape time for RAMD simulations with a low force magnitude;

  • Escape times values at different force magnitudes;

  • RAMD simulations with the ligand harmonically restrained in the binding pocket;

  • Unbiased escape time from RAMD simulation with FRAMD=1.4-7kcal/mol

APPENDIX: PARTICLE IN A HARMONIC WELL

Assuming that the overdamped RAMD dynamics of a particle in a harmonic well follows Eq. (11), the Fourier transform of the position-position time-correlation function is,

C~rr(ω)=1ω2+1/τharm2DkBT1γC~FF(ω)+C~ff(ω) (32)

where C~ff(ω)=2dD, and C~RAMD(ω)=2FRAMD2τ/1+(ωτ)2. The Fourier transform C~rr(ω) can be simplified as

C~rrω=2FRAMD2γτDkBT11/τharm2(1/τ)21ω2+(1/τ)2+2dD2FRAMD2γτDkBT11/τharm(1/τ)21ω2+1/τharm2 (33)

The inverse Fourier of C~rr(ω) yields the time-correlation function,

Crr(t)=FRAMD2τ2γDkBT1η2-1e-|t|/τ+1KdkBT-FRAMD2τγ1η2-1e-|t|/τharm (34)

where η=τ/τharm. At t=0, we have

Crr0=Δr2RAMD=dKFRAMD2τdγ1η+1+kBT=dKkBTeffη (35)

The effective diffusion coefficient estimated from the relaxation time τrr in a harmonic potential is given by Eq. (21), Deff=Δr2RAMD/τrrd, with τrr given by,

τrr=C˜rr0/2Δr2RAMD=dτharm2DkBTFRAMD2τdγ+DΔr2RAMD=τharmTeffη=0Teffη (36)

When η is negligible, then τrr is identically equal to τharm.

REFERENCES

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

SI

RESOURCES