Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2020 Oct 21;117(44):27116–27123. doi: 10.1073/pnas.2008307117

Broad distributions of transition-path times are fingerprints of multidimensionality of the underlying free energy landscapes

Rohit Satija a, Alexander M Berezhkovskii b, Dmitrii E Makarov a,c,1
PMCID: PMC7959536  PMID: 33087575

Significance

The function of biomolecules is controlled by the energy landscapes traversed when they fold, bind to other molecules, generate mechanical motion, or catalyze chemical reactions. Information about such landscapes is encoded in single-molecule observations of transition paths, i.e., brief events where molecules cross activation barriers, but teasing out dynamical details from inherently low-dimensional experimental signals is a challenge. Here, we ask a question of principle: How can one decide whether an observed biomolecular transition can be represented as motion along a one-dimensional reaction coordinate or multidimensionality is essential? We establish a rigorous constraint on the width of the distribution of transition path times, violation of which necessitates use of multidimensional models for the interpretation of experimental data.

Keywords: single molecules, protein folding, force spectroscopy, transition paths

Abstract

Recent single-molecule experiments have observed transition paths, i.e., brief events where molecules (particularly biomolecules) are caught in the act of surmounting activation barriers. Such measurements offer unprecedented mechanistic insights into the dynamics of biomolecular folding and binding, molecular machines, and biological membrane channels. A key challenge to these studies is to infer the complex details of the multidimensional energy landscape traversed by the transition paths from inherently low-dimensional experimental signals. A common minimalist model attempting to do so is that of one-dimensional diffusion along a reaction coordinate, yet its validity has been called into question. Here, we show that the distribution of the transition path time, which is a common experimental observable, can be used to differentiate between the dynamics described by models of one-dimensional diffusion from the dynamics in which multidimensionality is essential. Specifically, we prove that the coefficient of variation obtained from this distribution cannot possibly exceed 1 for any one-dimensional diffusive model, no matter how rugged its underlying free energy landscape is: In other words, this distribution cannot be broader than the single-exponential one. Thus, a coefficient of variation exceeding 1 is a fingerprint of multidimensional dynamics. Analysis of transition paths in atomistic simulations of proteins shows that this coefficient often exceeds 1, signifying essential multidimensionality of those systems.


In recent years, single-molecule studies of the folding (1, 2) and binding (3, 4) dynamics of biomolecules have brought unprecedented mechanistic insights into those phenomena. A key advance that has made this possible was achieving a high temporal resolution allowing observation of fast events where the molecule is caught en route between its stable molecular conformations (e.g., the folded and unfolded states) as it crosses, and possibly recrosses, the free energy barrier that separates them. More precisely, one can observe a “transition path,” a piece of a molecular trajectory x(t) that starts when this trajectory enters the barrier region xa<x<xb through one of its boundaries (e.g., xa) and exits it through the other boundary, while staying within this region continuously between these two events (Fig. 1). The temporal duration of a transition path, known as the transition path time or the direct transit time, has been of particular interest because of its experimental accessibility as well as its interesting mathematical properties (5, 6).

Fig. 1.

Fig. 1.

In the simplest view of chemical reaction dynamics, a large conformational rearrangement (such as biomolecular folding) is considered as a transition, along a reaction coordinate x, from one minimum of a 1D free energy landscape to another. Transition path (shown in red) is a segment of a trajectory x(t) that enters the transition region, xa<x<xb, through one of its boundaries and exits through the other staying continuously within the transition region in between these events. Properties of transition paths, such as the distribution of their temporal duration, depend on the free energy landscape in the transition region. Two examples are shown here: In one (solid line), there is a potential barrier separating two (meta)stable molecular conformations, and in the other (dashed line) there is an on-pathway intermediate represented by a potential well.

The distribution of the transition path time is sensitive to the details of the barrier crossing dynamics. The most common model used to rationalize the dynamics of biomolecular folding and binding is that of diffusive motion over a one-dimensional (1D) free energy landscape U(x), with x being a suitable “reaction coordinate” (79) (Fig. 1). When applied to experimental observations, the reaction coordinate x is related to the experimental observable, such as the mechanical extension of the molecule in force spectroscopy measurements or the distance between fluorescent probes in fluorescence experiments. Several recent studies have suggested, however, that the simple, 1D picture of diffusive barrier crossing along experimental reaction coordinates is inadequate as a description of transition path time distributions. Specifically, the distribution of the transition path time, pTP(t), is often “broader” than what the 1D model of diffusive barrier crossing would predict. For example, Neupane et al. (10) report that this distribution can be fitted using the diffusion model (11, 12), but only when the assumed barrier is much lower than the actual free energy barrier measured along the reaction coordinate. A lower barrier corresponds to a broader distribution of transition path times, as illustrated in Fig. 2.

Fig. 2.

Fig. 2.

Distribution of the transition path time, plotted here for the case where the potential in the transition region is parabolic, becomes more narrow as the barrier height ΔU (measured relative to the transition region boundaries) becomes greater. This is manifested by the decreasing values of the coefficient of variation C (Eq. 1). When the potential in the transition region is a parabolic well (ΔU<0), the distribution of transition path times approaches a single-exponential distribution, and the coefficient of variation C approaches 1 as the well depth increases. The numerical data shown here were generated using the method of ref. 12.

Even more dramatic discrepancy between the predictions of the 1D diffusion model and the observed transition path times is encountered in atomistic simulations of protein folding (13, 14) and other conformational changes in proteins (Fig. 3). In particular, the observed distributions are much broader than those predicted by the 1D diffusion model: The latter, when the diffusivity is adjusted to give the correct mean transition path time, significantly underestimates the contribution of both short and long transition paths (13). Several explanations to this discrepancy have been proposed, including roughness of the free energy landscape (15), entropic barriers (16, 17), heterogeneity of the pathways (14, 18), and memory effects (1921), but so far the experimental or simulation data have not allowed one to differentiate among such different possibilities. A further complication is that the distribution of the transition path time for the case of diffusive dynamics depends on the shape of the potential of mean force (Fig. 1). While this potential is accessible experimentally in single-molecule force spectroscopy studies (2224), it is generally unknown in fluorescence-based measurements of transition path times. This necessitates the use of an additional adjustable parameter, the barrier height ΔU, in fitting the experimental data with the 1D diffusion model, thus diminishing both the predictive power and the credibility of this model.

Fig. 3.

Fig. 3.

Distributions of transition path times for conformational changes in proteins, based on long-time atomistic simulations (37, 56). The transitions in question are the folding of the norleucine double mutant C-terminal fragment of the villin headpiece HP35 (13) and end-to-end loop formation (57) in the following intrinsically disordered proteins: N-terminal zinc-binding domain of HIV-1 Integrase (IN), cold shock protein from Thermotoga maritima (CspTm), and the immunoglobulin-binding domain of protein-L (protL). As indicated in the legend, the coefficient of variation C exceeds 1 in each case, indicating that no 1D diffusion model can describe these phenomena. The Inset presents the same distributions on a semilog scale, showing the behavior of the distribution tails.

Here, we show that a simple measure of the distribution width, known as the coefficient of variation (denoted C), also known as the relative SD, can be used to differentiate between data interpretations based on 1D diffusive dynamics and on multidimensional dynamics (Methods). Specifically, the 1D interpretation is unequivocally ruled out when this coefficient is greater than 1. We use this result to show that, according to this measure, the distributions of transition path times observed in atomistic simulations of a number of proteins are inconsistent with the model of 1D diffusive dynamics along the reaction coordinate and indicate essential multidimensionality of their dynamics.

Results

Coefficient of Variation.

The coefficient of variation is a common statistical measure of a distribution used in a variety of fields, from engineering to economics. For a distribution of the transition-path time, pTP(t), it is defined as the ratio of the SD to the mean:

C=[t2t2]1/2t, [1]

where the angular brackets denote averages over this distribution.

The coefficient of variation is large for broad distributions and small for narrow ones (approaching zero for sharply peaked distributions). To understand the range of values of this coefficient in the case of 1D diffusion, consider the simple example of diffusion across a symmetric parabolic potential,

U(x)=kx2/2, [2]

where negative values of the curvature k correspond to a barrier and positive ones to a potential well. The height of the barrier measured relative to the boundaries (xa=xb) of the transition region is as follows:

ΔU=kxa2/2=kxb2/2, [3]

with the negative barrier height corresponding to a potential well (Fig. 1). Distributions of the transition path time computed numerically as described in ref. 12 for different values of ΔU(measured in the units of the thermal energy kBT) are shown in Fig. 2. We observe that higher barriers entail narrower distributions (smaller values of C), and, as the curvature k becomes positive and its absolute value increases, the coefficient C approaches a value of 1. This limit is easy to understand: When a particle traverses a deep potential well, it becomes trapped in this well for a long time. The transition path time in this limit becomes the same as the time it takes to escape the well via activated barrier crossing, which is known to have an exponential distribution, pTP(t)=(1/τ)exp(t/τ), where τ is the average lifetime inside the well. For this distribution, Eq. 1 immediately gives C = 1. For the case of k = 0, the distribution of the transition path time and its first two moments can be obtained analytically (25), resulting in C=2/50.63.

Rigorous Bound on the Coefficient of Variation for 1D Diffusion.

The central finding of this work is that C<1 for the diffusive dynamics in any potential U(x) (Methods). In other words, the distribution of transition path times for the diffusive dynamics in any 1D potential can never be “broader” (as quantified by Eq. 1) than the single-exponential distribution. Thus, an observed value of C > 1 would immediately necessitate that the 1D Markov diffusion model should be abandoned.

This result may appear surprising. Consider, for instance, the dynamics on a rugged (but 1D) free energy landscape, as illustrated in Fig. 4A, with multiple potential wells that can trap the particle. It is known that a broad distribution of trap depths leads to nonergodic behavior and anomalous-diffusion–type phenomena (2629). For example, if the average lifetime τ inside each trapped state obeys the Arrhenius law, τ=τ0exp(βE), where β is the inverse thermal energy and E > 0 is the trap depth, then an exponential distribution of the trap depths, p(E)=αexp(αE)θ(E) [where θ(E) is the Heaviside step function ensuring that a trap depth is positive], results in a power-law distribution of the (average) trapping times, p(τ)=(α/βτ0)(τ/τ0)(1+α/β)θ(ττ0). When α<β, the first moment of the trapping time distribution, i.e., the mean trapping time, diverges. It seems tempting to argue that such a heavy-tailed distribution of the trap lifetime will result in a heavy-tailed distribution of the transition path time, pTP(t), since each typical individual transition path involves multiple trapping events, resulting in C > 1. It is, however, not true.

Fig. 4.

Fig. 4.

(A) Diffusion on a rough landscape, with multiple potential wells acting as traps. (B) Two-site model of a particle dynamics in a channel connecting left and right reservoirs. (C) A two-site model with a single off-pathway intermediate.

Coefficient of Variation for Discrete Markov State Networks.

Our finding can be restated within the framework of discrete models. For example, the scenario depicted in Fig. 4A can be mapped onto a random walk, wherein each potential well corresponds to a discrete state with a randomly drawn average lifetime. Surprisingly, then, for any linear chain of discrete states within the transition region, the coefficient of variation will always remain below 1 (that is, the distribution will always be narrower than the single-exponential one) regardless of how the average lifetimes of the discrete states are distributed.

Values of C greater than 1 are, however, possible for networks with nonlinear topologies. This observation is particularly important because existence of trapped, long-lived states is often viewed as the origin of anomalous-diffusion–type dynamics of reaction coordinates in proteins (3033); yet our result shows that such trapping alone cannot account for long-tailed distributions of transition path times observed in Fig. 3.

The role of dimensionality (or network topology in the discrete case) can be illustrated by two two-site models of biological membrane channels shown in Fig. 4 B and C. In the first, sites 1 and 2 represent the left and right sides of a channel (34, 35). A transition path, say, from left to right starts when a translocating particle enters the channel from the left reservoir and occupies site 1. It ends when the particle leaves the channel into the bulk on the right-hand side escaping the channel from site 2. The transition path may, of course, involve multiple transitions between the sites as long as the particle does not leave the channel. The distribution of the transition path time in this case can be found analytically:

pTP(t)=γγ+k1+k2k1+k2e(γ+k1+k2)t(e(k1+k2)t1), [4]

where k1 and k2 are the effective exchange rates between sites 1 and 2, and γ is the effective particle escape rate from the channel into the bulk, as shown in Fig. 4B. Accordingly, using Eq. 1, one finds

C=12γk1+k2+γ(k1+k2+2γ)2<1. [5]

Thus the coefficient of variation is strictly less than 1 in this case.

The second example, Fig. 4C, is the simplest model of a discrete network with nonlinear topology. In this case, the transition region also consists of two sites, 1 and 2, but the site 2 is an off-pathway intermediate, and thus each transition path both begins when the particle arrives in site 1 and ends when it leaves site 1 to the bulk. A similar calculation shows that, in this case,

pTP(t)=γ2Γ{(γ+k1k2+Γ)eγk1k2+Γ2t+(γk1+k2+Γ)eγ+k1+k2+Γ2t}, [6]

where Γ=(γ+k1+k2)24γk2. This gives the following value of the coefficient of variation,

C=1+2γk1(k1+k2)2, [7]

which is strictly greater than 1. Notice that, in both cases, C approaches unity in the limit k1+k2γ. In this limit, the internal exchange dynamics between the sites 1 and 2 is so fast that the transition region effectively behaves as a single state; and since for a single state the transition-path-time distribution is exponential (36), we find C → 1.

Application to HIV-1 Integrase.

We now apply these considerations to the end-to-end dynamics in the intrinsically disordered N-terminal domain of HIV-1 integrase (37), a system that attracted recent experimental attention (38). In this case, the “reaction” in question is the closure of a loop where the end-to-end distance (which is viewed as the reaction coordinate x) drops below a specified value xa. The other boundary of the transition region, xb, is chosen comparable to the average end-to-end distance of the protein. See SI Appendix for further details. The distribution of the transition path time obtained from molecular dynamics simulations of this protein (37) is shown in Fig. 3. Similarly to the case of the folding dynamics of another protein (13, 14), the distribution of transition path times for this protein exhibits a long tail (Fig. 3). Consistent with this, we find a coefficient of variation to be equal to C1.4>1.

To gain further insight into the physical origin of this broad distribution, we have mapped the atomistically resolved dynamics of HIV-1 integrase onto that of a Markov-state network (39, 40), whose connectivity is illustrated in Fig. 5A. The resulting Markov-state model (MSM) provides a good description of the distribution of the transition path time pTP(t) (Fig. 5B), which agrees with that from the original atomistic model except at short times (because, by construction, a MSM cannot reproduce the dynamics at timescales where the Markov assumption is invalid). See SI Appendix for other tests performed to validate the MSM. Importantly, the MSM also predicts a heavy-tailed distribution with C = 1.6.

Fig. 5.

Fig. 5.

(A) Connectivity of the Markov state network obtained for the intrinsically disordered N-terminal domain of HIV-1 integrase indicates a multidimensional free energy landscape. (B) Transition path time distributions computed directly from the dynamics of the reaction coordinate in the atomistic simulations of the protein (black squares) agree with the predictions of the MSM (black solid line) at times that are longer than the lag time (∼20 ns) used to construct the model. When the free energy landscape of the MSM is smoothed by setting the rates of all internode transitions to be the same, the resulting distribution of the transition path time remains broader than exponential (blue dashed line). Inset shows the same distribution on a semilog scale.

To find out whether the nontrivial network topology alone could explain the observed distribution, we have smoothed the MSM’s free energy landscape by setting the transition rates between every pair of connected nodes to be a constant value (and by adjusting this value to give the correct mean transition path time). This modified MSM still predicts a heavy-tailed distribution of transition path times (Fig. 5B) (C = 2.0). Since the modified network, by construction, lacks states that act as long-lived kinetic traps, we conclude that nontrivial connectivity of the network rather than its energetic ruggedness is the primary physical origin of the observed broad distribution of the transition-path times. Indeed, as stated above (and proven below), a linear network of states cannot predict a distribution with C > 1 regardless of the transition rates between its nodes and, correspondingly, regardless of the free energies of the nodes.

The Role of Experimental Artifacts and Multidimensional Diffusion Model.

Single-molecule experiments do not usually probe the molecular degrees of freedom directly. For example, in single-molecule force spectroscopy experiments, the intrinsic molecular extension x is inferred from the observable experimental degrees of freedom such as the position y of the force probe (23, 4144). As a result, even the minimalist model describing such experiments may require dimensionality higher than 1. Does such instrument-induced multidimensionality lead to a broad distribution of the transition path time with C > 1?

Although the coupling of the molecular coordinates to those of the experimental setup has been reported to broaden the distributions of the transition path time as compared to the 1D case, both experimental observations and simulations often report still narrower-than-single-exponential distributions (10, 19, 44). Here, we have studied this case in more detail by considering the standard model where the coupling between the two degrees of freedom is described as a harmonic spring with a free energy of the form k(xy)2/2, where k represents the stiffness of the polymeric handle connecting the molecule and the force probe (SI Appendix). Although we do not have a rigorous proof of the inequality C < 1 regardless of the model parameters in this case, extensive search in the parameter space yielded no regimes with C exceeding 1. An implication of this observation is that, if a value C > 1 is observed experimentally, it is more likely to result from inherent multidimensionality of the molecular dynamics rather than from experimental artifacts.

What properties of multidimensional dynamics and/or of the underlying multidimensional energy landscape are then responsible for values C that exceed 1? Some insight into this question can be achieved by considering a potential energy landscape that has two separate saddles corresponding to two different pathways (reaction channels) leading from the reactant to the product (Fig. 6). Such landscapes are common even in reactions involving small molecules, stilbene isomerization being a classic example (45), and they are ubiquitous in conformational transitions that take place in biomolecules (46). We emphasize that, although in general multiple pathways may lead to nonexponential kinetics, existence of parallel reaction pathways is not inconsistent with the exponential kinetics of the overall reaction as long as all activation barriers are high enough (6). Let now pTP(1)(t) and pTP(2)(t) be the distributions of the transition path time for reactions proceeding via channels 1 and 2 and let f and 1 − f be the corresponding fractions of transitions occurring via each channel. Then the observed distribution of the transition path time is given by the following:

pTP(t)=fpTP(1)(t)+(1f)pTP(2)(t), [8]

and the resulting coefficient of variation is as follows:

C={ft12+(1f)t22[ft1+(1f)t2]21}1/2, [9]

where ti and ti2 are the first and second moments of the distribution pTP(i)(t), i = 1, 2. Introducing coefficients of variation C1 and C2 for the individual distributions pTP(1)(t) and pTP(2)(t), one finds from Eq. 9 that C > 1 if

t1t21+C122f1f+t2t11+C22+2ff>4. [10]

Regardless of the values of f, C1, and C2, the inequality of Eq. 10 is satisfied if the ratio t1/t2 is sufficiently large or small. In particular, the two individual transition-path-time distributions may be narrow (C1<1,C2<1), yet the distribution of Eq. 8 can be broad, with C > 1, given sufficient disparity between the average timescales t1 and t2. Interestingly, if both distributions are single-exponential, C1=C2=1, then the inequality of Eq. 10 becomes t1/t2+t2/t1>2, which is always true unless t1=t2!

Fig. 6.

Fig. 6.

A two-pathway potential energy surface with two transition-state saddles (TS1 and TS2) is an example of a multidimensional energy landscape that may lead to a broad distribution of the transition path time with C > 1. This happens when there is sufficient disparity between the mean transition path times proceeding via each saddle (roughly following the green and red lines). See text for further details.

The above simple example suggests that heterogeneity of the ensemble of transition paths may account for broad distributions with C > 1. Existence of multiple activation barriers is, however, not a necessary condition. Consider, for example, the scenario where the system has, in addition to the reaction coordinate x, a degree of freedom y that is slow in comparison with the typical transition path time. Then y remains nearly constant for the duration of each transition path, and the distribution of their times can be written as follows:

pTP(t)=dyf(y)pTP(y)(t), [11]

where pTP(y)(t) is the distribution of the transition path time at a fixed y and f(y) is the distribution of y for the ensemble of transition paths. As Eq. 11 is a generalization of Eq. 8 to the continuous case, we expect that a value C > 1 can also be achieved in this case. Indeed, in SI Appendix, we present a numerical study of a 2D toy model with a single transition-state saddle, in which a soft y-mode results in transition paths wandering far away from the saddle (SI Appendix, Fig. S6) and leads to broadly distributed transition path times.

Discussion

Until recently, rate coefficients, which are related to mean first passage times, have been the focus of kinetic studies of biological molecules. Distributions of first passage times, as opposed to their mean values, are however becoming increasingly accessible to single-molecule measurements. Such distributions are beginning to provide detailed information about energy landscapes traversed by biomolecules en route between stable molecular species. For example, analysis of the short-time behavior of distributions of first passage times enables one to detect the number of intermediate states encountered in a molecular transition (47).

Transition path times are conditional first passage times that are particularly interesting, as they probe the dynamics during the fleeting yet rate-defining events where the activation barrier of a chemical reaction is being surmounted or a biological membrane channel is being traversed. Here, we showed that the width of the distribution of the transition path time contains important information about the dimensionality of the underlying free energy landscape. Specifically, we have rigorously proven that no 1D diffusive model, regardless of the ruggedness of the underlying free energy landscape or position dependence of the diffusivity, can possibly predict a distribution whose coefficient of variation C exceeds 1. A coefficient of variation exceeding 1 is thus a fingerprint of multidimensional dynamics.

A value of C that is below 1, however, does not necessarily mean that the system is 1D, nor does it imply that a 1D description is a good approximation. Indeed, a common reason requiring one to invoke multidimensionality to describe single-molecule force spectroscopy experiments is the coupling of the molecular coordinate x to the observable experimental degrees of freedom such as the position y of the force probe (23, 4144).This coupling leads to memory effects in the dynamics of both x and y even if the intrinsic dynamics of these degrees of freedom is Markovian (19, 48), and thus a 1D diffusion model may no longer be an adequate description of the observed experimental signal even when the intrinsic dynamics along x is diffusive. However, we find here that elastic coupling of the molecule to the experimental observables is not likely to lead to a broad distribution of the transition path time (with C > 1). Consequently, coefficients of variation greater than 1, if observed experimentally, are unlikely to arise from experimental artifacts; rather, they indicate “inherent multidimensionality” of the underlying free energy landscape.

Our analysis of toy 2D diffusion models and of the MSM of a protein suggests that broader-than-single-exponential distributions of the transition path time arise as a result of multiple parallel reaction pathways. Consistent with this proposal, recent experimental (49, 50) and computational (18) observations have already indicated heterogeneity of the ensembles of the biomolecular transition paths, but such heterogeneity is a property that is not easy to quantify. In particular, Hoffer et al. (50) invoke energy landscapes with multiple saddles or with broad saddles (as in SI Appendix, Fig. S6) as a possible explanation of the high variability of transition path shapes observed experimentally for some DNA hairpins. The above considerations suggest that the coefficient of variation of the transition-path-time distribution may serve as a useful proxy for the “diversity” of pathways.

Other physical models may also be used to describe dynamics that yields values of C higher than 1, particularly the fractional Brownian motion model, which was shown to accurately predict the broad distribution of the transition-path time in the villin headpiece HP35 (13). Nondiffusive dynamics and memory along the reaction coordinate could also be captured using the generalized Langevin equation (GLE) (15, 20, 21). Although analytical approximations for pTP(t) exist in this case, they are only valid in the limit of high barriers, and so numerical simulations that account for memory effects would be required to establish whether they could lead to values of C greater than 1. Note that a GLE with exponential memory for the reaction coordinate x is equivalent mathematically to the dynamics arising from the above 2D model, where the reaction coordinate x is coupled to a mode y via a harmonic spring term (19, 51); thus, we know that C < 1 in this case, but whether the same conclusion holds for more complicated memory kernels is an open question.

From an experimental perspective, use of coefficients of variation offers a significant advantage over analyzing the transition-path-time distributions via fitting them to some analytical function. First, such fitting usually assumes a specific shape of the potential of mean force (e.g., parabolic) and often makes additional approximations such as the barrier height being much greater than the thermal energy. Second, fitting is especially unreliable at long and short times where the statistics are poor. Calculation of the coefficient of variation, in contrast, only requires the first and the second moments of the distribution and should be much more robust with respect to experimental constraints than the distribution itself.

Finally, we note that calculating how the value of the coefficient of variation depends on the transition region boundaries offers a method to probe the dynamics of the system locally. For example, one could choose the boundaries of the transition region to be (xa,xb)=(x0Δ/2,x0+Δ/2) to probe the dynamics around some point x0, as long as the region width Δ satisfies the constraints imposed by the temporal and spatial resolution of the experiment. This not only allows one to probe the dynamics in more detail but also offers potentially a more critical test of the model used to describe the data. For example, we know that, in the 1D case, the values of C=C(x0,Δ) are correlated with the barrier height (cf. Fig. 2), with higher barriers corresponding to narrower distributions. Thus, if a value C=C(x0,Δ)<1 is observed in the case of a high barrier, multidimensional effects may be masked by those of the barrier height. However, the barrier can be made lower by choosing a smaller value of the width Δ (to the extent that the temporal and spatial resolution permit), thus offering greater sensitivity to deviations from the 1D picture of diffusive dynamics. This can be illustrated using the atomistic simulation data for the villin headpiece HP35 (SI Appendix, Figs. S10 and S11). Although in this case we find C(x0,Δ)>1 even when the transition region boundaries are placed near the minima of the potential of mean force, this coefficient increases as the width Δ is decreased. At sufficiently small values of Δ, however, the coefficient of variation decreases again, dropping below 1 at Δ0. Moreover, C(x0,Δ), considered as a function of x0 at fixed Δ, reaches a maximum when x0 is placed near the barrier top.

To gain additional insight into these observations, particularly into the nonmonotonic Δ dependence of the coefficient of variation, we performed the same analysis for the toy 2D model with a broad saddle (SI Appendix, Figs. S7 and S8). Again, we find that the value of C first increases as the width Δ decreases, but then it decreases again approaching the 1D result in the limit Δ0, where the effect of the potential on the dynamics within the transition region becomes negligible. In general, the precise dependence of the coefficient C on the boundaries of the transition region is determined by the topography of the underlying free energy landscape in relationship to the observed reaction coordinate or, equivalently, to the transition region boundaries (Fig. 6). One hopes that such properties can be deduced from the observed functions C(x0,Δ).

Methods

Outline of the Proof That Transition Path Distributions Are Narrow (C < 1) for 1D Diffusive Dynamics in an Arbitrary Potential.

We start with the known expression (5254) for the mean transition path time for transitions between boundaries xa and xb:

tTP=[1Dxaxbdypeq(y)][xaxbdxΦa(x)Φb(x)peq(x)]=xaxbdxeβU(x)[xaxeβU(y)dy][xxbeβU(y)dy]DxaxbdyeβU(y). [12]

Here, D is the diffusivity, which, for simplicity, is assumed to be position independent here. It is, however, straightforward to generalize this derivation to allow any position-dependent diffusivity function D(x). The function peq(x)eβU(x) is the equilibrium distribution of x, and

Φb(x)=1Φa(x)=xaxeβU(y)dyxaxBeβU(y)dy [13]

is the splitting probability (55) defined as the probability that the system starting at a point x, xa<x<xb, will reach the boundary xb before reaching xa. It can be shown (SI Appendix) that the second moment of the distribution of the transition path time is given by a similar relationship:

tTP2=0dtt2pTP(t)=[2Dxaxbdypeq(y)][xaxbdxΦa(x)Φb(x)peq(x)t(xxb|xa)]. [14]

Here, t(xxb|xa) is the mean time it takes to reach the boundary xb starting from x (xa<x<xb), conditional upon not exiting the transition region via the boundary xa. Note the identity limxxa+t(xxb|xa)=tTP(xaxb). We now use the intuitively appealing inequality, t(xxb|xa)<tTP(xaxb)tTP for any intermediate point x such that xa<x<xb (SI Appendix). Thus, from Eqs. 12 and 14, we have the following:

tTP2<2tTP[1Dxaxbdypeq(y)][xaxbdxΦa(x)Φb(x)peq(x)]=2tTP2. [15]

As a consequence, we have the following:

C2=tTP2tTP2tTP2<1, [16]

which completes the proof.

Supplementary Material

Supplementary File

Acknowledgments

A.M.B. was supported by the Intramural Research Program of the NIH, Center for Information Technology. R.S. and D.E.M. were supported by the Robert A. Welch Foundation (Grant F-1514) and the NSF (Grants CHE 1566001 and CHE 1955552).

Footnotes

The authors declare no competing interest.

This article is a PNAS Direct Submission. G.H. is a guest editor invited by the Editorial Board.

This article contains supporting information online at https://www.pnas.org/lookup/suppl/doi:10.1073/pnas.2008307117/-/DCSupplemental.

Data Availability.

Codes and trajectory data have been deposited in the GitHub repository (https://github.com/rsatijaUT/Broad-TPT) (58).

References

  • 1.Chung H. S., Eaton W. A., Protein folding transition path times from single molecule FRET. Curr. Opin. Struct. Biol. 48, 30–39 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Hoffer N. Q., Woodside M. T., Probing microscopic conformational dynamics in folding reactions by measuring transition paths. Curr. Opin. Chem. Biol. 53, 68–74 (2019). [DOI] [PubMed] [Google Scholar]
  • 3.Sturzenegger F., et al., Transition path times of coupled folding and binding reveal the formation of an encounter complex. Nat. Commun. 9, 4708 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Kim J.-Y., Meng F., Yoo J., Chung H. S., Diffusion-limited association of disordered protein by non-native electrostatic interactions. Nat. Commun. 9, 4707 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Berezhkovskii A. M., Hummer G., Bezrukov S. M., Identity of distributions of direct uphill and downhill translocation times for particles traversing membrane channels. Phys. Rev. Lett. 97, 020601 (2006). [DOI] [PubMed] [Google Scholar]
  • 6.Makarov D. E., Single Molecule Science: Physical Principles and Models (CRC Press, Boca Raton, FL, 2015). [Google Scholar]
  • 7.Szabo A., Schulten K., Schulten Z., First passage time approach to diffusion controlled reactions. J. Chem. Phys. 72, 4350 (1980). [Google Scholar]
  • 8.Socci N. D., Onuchic J. N., Wolynes P. G., Diffusive dynamics of the reaction coordinate for protein folding funnels. J. Chem. Phys. 104, 5860–5868 (1996). [Google Scholar]
  • 9.Klimov D., Thirumalai D., Viscosity dependence of the folding rates of proteins. Phys. Rev. Lett. 79, 317 (1997). [Google Scholar]
  • 10.Neupane K., et al., Direct observation of transition paths during the folding of proteins and nucleic acids. Science 352, 239–242 (2016). [DOI] [PubMed] [Google Scholar]
  • 11.Zhang B. W., Jasnow D., Zuckerman D. M., Transition-event durations in one-dimensional activated processes. J. Chem. Phys. 126, 074504 (2007). [DOI] [PubMed] [Google Scholar]
  • 12.Chaudhury S., Makarov D. E., A harmonic transition state approximation for the duration of reactive events in complex molecular rearrangements. J. Chem. Phys. 133, 034118 (2010). [DOI] [PubMed] [Google Scholar]
  • 13.Satija R., Das A., Makarov D. E., Transition path times reveal memory effects and anomalous diffusion in the dynamics of protein folding. J. Chem. Phys. 147, 152707 (2017). [DOI] [PubMed] [Google Scholar]
  • 14.Mori T., Saito S., Molecular mechanism behind the fast folding/unfolding transitions of villin headpiece subdomain: Hierarchy and heterogeneity. J. Phys. Chem. B 120, 11683–11691 (2016). [DOI] [PubMed] [Google Scholar]
  • 15.Pollak E., Transition path time distribution and the transition path free energy barrier. Phys. Chem. Chem. Phys. 18, 28872–28882 (2016). [DOI] [PubMed] [Google Scholar]
  • 16.Makarov D. E., Reconciling transition path time and rate measurements in reactions with large entropic barriers. J. Chem. Phys. 146, 071101 (2017). [DOI] [PubMed] [Google Scholar]
  • 17.Cheng R. R., Makarov D. E., Failure of one-dimensional Smoluchowski diffusion models to describe the duration of conformational rearrangements in floppy, diffusive molecular systems: A case study of polymer cyclization. J. Chem. Phys. 134, 085104 (2011). [DOI] [PubMed] [Google Scholar]
  • 18.Jacobs W. M., Shakhnovich E. I., Accurate protein-folding transition-path statistics from a simple free-energy landscape. J. Phys. Chem. B 122, 11126–11136 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Medina E., Satija R., Makarov D. E., Transition path times in non-Markovian activated rate processes. J. Phys. Chem. B 122, 11400–11413 (2018). [DOI] [PubMed] [Google Scholar]
  • 20.Satija R., Makarov D. E., Generalized Langevin equation as a model for barrier crossing dynamics in biomolecular folding. J. Phys. Chem. B 123, 802–810 (2019). [DOI] [PubMed] [Google Scholar]
  • 21.Carlon E., Orland H., Sakaue T., Vanderzande C., Effect of memory and active forces on transition path time distributions. J. Phys. Chem. B 122, 11186–11194 (2018). [DOI] [PubMed] [Google Scholar]
  • 22.Covino R., Woodside M. T., Hummer G., Szabo A., Cossio P., Molecular free energy profiles from force spectroscopy experiments by inversion of observed committors. J. Chem. Phys. 151, 154115 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Hummer G., Szabo A., Free energy profiles from single-molecule pulling experiments. Proc. Natl. Acad. Sci. U.S.A. 107, 21441–21446 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Woodside M. T., Block S. M., Reconstructing folding energy landscapes by single-molecule force spectroscopy. Annu. Rev. Biophys. 43, 19–39 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Berezhkovskii A. M., Dagdug L., Bezrukov S. M., Exact solutions for distributions of first-passage, direct-transit, and looping times in symmetric cusp potential barriers and wells. J. Phys. Chem. B 123, 3786–3796 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Klafter J., Sokolov I. M., First Steps in Random Walks: From Tools to Applications (Oxford University Press, Oxford, 2011). [Google Scholar]
  • 27.Metzler R., Jeon J. H., Cherstvy A. G., Barkai E., Anomalous diffusion models and their properties: Non-stationarity, non-ergodicity, and ageing at the centenary of single particle tracking. Phys. Chem. Chem. Phys. 16, 24128–24164 (2014). [DOI] [PubMed] [Google Scholar]
  • 28.Metzler R., Klafter J., The random walk’s guide to anomalous diffusion: A fractional dynamics approach. Phys. Rep. 339, 1–77 (2000). [Google Scholar]
  • 29.Plotkin S. S., Wolynes P. G., Non-Markovian configurational diffusion and reaction coordinates for protein folding. Phys. Rev. Lett. 80, 5015 (1998). [Google Scholar]
  • 30.Luo G., Andricioaei I., Xie X. S., Karplus M., Dynamic distance disorder in proteins is caused by trapping. J. Phys. Chem. B 110, 9363–9367 (2006). [DOI] [PubMed] [Google Scholar]
  • 31.Krivov S. V., Is protein folding sub-diffusive? PLOS Comput. Biol. 6, 1000921 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Avdoshenko S. M., Das A., Satija R., Papoian G. A., Makarov D. E., Theoretical and computational validation of the Kuhn barrier friction mechanism in unfolded proteins. Sci. Rep. 7, 269 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Das A., Makarov D. E., Dynamics of disordered proteins under confinement: Memory effects and internal friction. J. Phys. Chem. B 122, 9049–9060 (2018). [DOI] [PubMed] [Google Scholar]
  • 34.Bezrukov S. M., Berezhkovskii A. M., Szabo A., Diffusion model of solute dynamics in a membrane channel: Mapping onto the two-site model and optimizing the flux. J. Chem. Phys. 127, 115101 (2007). [DOI] [PubMed] [Google Scholar]
  • 35.Berezhkovskii A. M., Bezrukov S. M., Mapping intrachannel diffusive dynamics of interacting molecules onto a two-site model: Crossover in flux concentration dependence. J. Phys. Chem. B 122, 10996–11001 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Gopich I. V., Szabo A., Decoding the pattern of photon colors in single-molecule FRET. J. Phys. Chem. B 113, 10965–10973 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Piana S., Donchev A. G., Robustelli P., Shaw D. E., Water dispersion interactions strongly influence simulated structural properties of disordered protein states. J. Phys. Chem. B 119, 5113–5123 (2015). [DOI] [PubMed] [Google Scholar]
  • 38.Zosel F., Haenni D., Soranno A., Nettels D., Schuler B., Combining short- and long-range fluorescence reporters with simulations to explore the intramolecular dynamics of an intrinsically disordered protein. J. Chem. Phys. 147, 152708 (2017). [DOI] [PubMed] [Google Scholar]
  • 39.Prinz J. H., et al., Markov models of molecular kinetics: Generation and validation. J. Chem. Phys. 134, 174105 (2011). [DOI] [PubMed] [Google Scholar]
  • 40.Pande V. S., Beauchamp K., Bowman G. R., Everything you wanted to know about Markov state models but were afraid to ask. Methods 52, 99–105 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Makarov D. E., Communication: Does force spectroscopy of biomolecules probe their intrinsic dynamic properties? J. Chem. Phys. 141, 241103 (2014). [DOI] [PubMed] [Google Scholar]
  • 42.Neupane K., Woodside M. T., Quantifying instrumental artifacts in folding kinetics measured by single-molecule force spectroscopy. Biophys. J. 111, 283–286 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Cossio P., Hummer G., Szabo A., On artifacts in single-molecule force spectroscopy. Proc. Natl. Acad. Sci. U.S.A. 112, 14248–14253 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Cossio P., Hummer G., Szabo A., Transition paths in single-molecule force spectroscopy. J. Chem. Phys. 148, 123309 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Agmon N., Kosloff R., Dynamics of two-dimensional diffusional barrier crossing. J. Phys. Chem. 91, 1988–1996 (1987). [Google Scholar]
  • 46.Wales D. J., Energy Landscapes: Applications to Clusters, Biomolecules and Glasses (Cambridge University Press, 2003). [Google Scholar]
  • 47.Thorneywork A. L., et al., Direct detection of molecular intermediates from first-passage times. Sci. Adv. 6, eaaz4642 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Pyo A. G. T., Woodside M. T., Memory effects in single-molecule force spectroscopy measurements of biomolecular folding. Phys. Chem. Chem. Phys. 21, 24527–24534 (2019). [DOI] [PubMed] [Google Scholar]
  • 49.Kim J. Y., Chung H. S., Disordered proteins follow diverse transition paths as they fold and bind to a partner. Science 368, 1253–1257 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Hoffer N. Q., Neupane K., Pyo A. G. T., Woodside M. T., Measuring the average shape of transition paths during the folding of a single biological molecule. Proc. Natl. Acad. Sci. U.S.A. 116, 8125–8130 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Elber R., Makarov D. E., Orland H., Molecular Kinetics in Condensed Phases: Theory, Simulation, and Analysis (Wiley, 2020). [Google Scholar]
  • 52.Hummer G., From transition paths to transition states and rate coefficients. J. Chem. Phys. 120, 516–523 (2004). [DOI] [PubMed] [Google Scholar]
  • 53.Chung H. S., Gopich I. V., Fast single-molecule FRET spectroscopy: Theory and experiment. Phys. Chem. Chem. Phys. 16, 18644–18657 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Berezhkovskii A. M., Pustovoit M. A., Bezrukov S. M., Channel-facilitated membrane transport: Average lifetimes in the channel. J. Chem. Phys. 119, 3943–3951 (2003). [Google Scholar]
  • 55.Gardiner C. W., Handbook of Stochastic Methods for Physics, Chemistry and the Natural Sciences (Springer, Berlin, 1983). [Google Scholar]
  • 56.Piana S., Lindorff-Larsen K., Shaw D. E., Protein folding kinetics and thermodynamics from atomistic simulation. Proc. Natl. Acad. Sci. U.S.A. 109, 17845–17850 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Satija R., Das A., Mühle S., Enderlein J., Makarov D. E., Kinetics of loop closure in disordered proteins: Theory vs simulations vs experiments. J. Phys. Chem. B 124, 3482–3493 (2020). [DOI] [PubMed] [Google Scholar]
  • 58.Satija R., Berezhkovskii A. M., Makarov D. E., Data from "Broad distributions of transition-path times are fingerprints of multidimensionality of the underlying free energy landscapes." GitHub. Available at https://github.com/rsatijaUT/Broad-TPT/tree/master/Data/. Deposited 10 October 2020. [DOI] [PMC free article] [PubMed]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary File

Data Availability Statement

Codes and trajectory data have been deposited in the GitHub repository (https://github.com/rsatijaUT/Broad-TPT) (58).


Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES