Skip to main content
Biophysical Reviews logoLink to Biophysical Reviews
. 2018 Mar 15;10(2):375–389. doi: 10.1007/s12551-018-0406-7

Relaxation mode analysis for molecular dynamics simulations of proteins

Ayori Mitsutake 1,, Hiroshi Takano 1
PMCID: PMC5899748  PMID: 29546562

Abstract

Molecular dynamics simulation is a powerful method for investigating the structural stability, dynamics, and function of biopolymers at the atomic level. In recent years, it has become possible to perform simulations on time scales of the order of milliseconds using special hardware. However, it is necessary to derive the important factors contributing to structural change or function from the complicated movements of biopolymers obtained from long simulations. Although some analysis methods for protein systems have been developed using increasing simulation times, many of these methods are static in nature (i.e., no information on time). In recent years, dynamic analysis methods have been developed, such as the Markov state model and relaxation mode analysis (RMA), which was introduced based on spin and homopolymer systems. The RMA method approximately extracts slow relaxation modes and rates from trajectories and decomposes the structural fluctuations into slow relaxation modes, which characterize the slow relaxation dynamics of the system. Recently, this method has been applied to biomolecular systems. In this article, we review RMA and its improved versions for protein systems.

Keywords: Protein, Simulation, Analysis, Dynamics

Introduction

Molecular dynamics simulation is widely used for protein research. In general, the focus of this research is to extract information on the physical properties of individual proteins. The results from such simulations are then often compared with experimental results. Since these experiments are generally conducted in solvents, it is necessary to simulate protein and water molecular systems, which are complicated systems. These simulations are conducted for a variety of purposes such as to analyze the stability and dynamics of the structures around crystal structures and to determine folding from an extended structure into a native structure. There are three difficulties in current approaches for protein simulations (Freddolino et al. 2010). The first is the potential function of the protein systems. In recent years, it has become possible to evaluate the molecular force field by improving the sampling, and accuracy has consequently improved. The second problem is related to the sampling. With respect to the folding mechanism, simulation at the millisecond scale is necessary. Recently, it has become possible to perform simulations at the millisecond scale by using special hardware such as Anton (Lindorff-Larsen et al. 2011, 2012; Dror et al. 2012, Lane et al. 2013), but sampling problems still exist for complex systems such as ligand-binding systems and other even more complex systems. The third issue is related to the analysis methods. It is important to extract the characteristic degrees of freedom (order parameters) from the complex protein movements obtained from simulations, which are good indicators for analyzing trajectories.

In normal mode analysis, the normal mode near the minimum point of the potential energy of the protein molecule is obtained (Go et al. 1983; Brooks and Karplus 1983; Levitt et al. 1985). Langevin mode analysis investigates modes around the native structure, including the water effect (Lamm and Szabo 1986; Kottalam and Case 1990; Kitao et al. 1991; Hayward et al. 1993). An elastic network model and Gaussian network model approximately estimate normal modes with large amplitudes by using the harmonic potential of coarse-grained models (Tirion 1996; Baher et al. 1997; Tama and Sanejouand 2001; Cui and Bahar 2005; Miyashita and Tama 2008). This method extracts collective modes with large amplitudes in the case of huge protein systems such as viruses, because huge proteins have rigid-like motions (Tama and Brooks III 2002).

Principal component analysis (PCA), also called quasiharmonic analysis or the essential dynamics method (Levy et al. 1984; Ichiye and Karplus 1991; Abagyan and Argos 1992; Garcia 1992; Hayward et al. 1993; Amadei et al. 1993; Kitao and Go 1999), is one of the most popular methods adopted for analyzing the structural fluctuations around the average structure. The modes with large structure fluctuations are extracted and are regarded as cooperative movement, and the relation of these fluctuations with function has been widely investigated. The obtained modes are also used as the axis of the free-energy surface. Moreover, various other analysis methods have been proposed, such as full correlation analysis (Lange and Grubmüller 2007), subspace joint approximate diagonalization of eigenmatrices (Sakuraba et al. 2010), and wavelet analysis (Kamada et al. 2011), among others (Moritsugu et al. 2015; Matsunaga et al. 2015).

In recent years, it has become possible to perform an extensively long simulation; thus, development of dynamic analysis methods to identify the local minimum-energy states and analyze the transitions between them is required. Accordingly, many methods to analyze the dynamics and kinetics of protein simulations have been developed (Zuckerman 2010; Komatsuzaki et al. 2011; Bowman et al. 2014). In particular, the Markov state model has been presented and applied to many protein systems (Schütte et al. 1999; Swope et al. 2004; Singhal et al. 2004; Chodera et al. 2006, 2007; Chodera and Noé 2014; Noé et al. 2007; Noé and Fischer 2008; Noé and Clementi 2017 Buchete and Hummer 2008; Prinz et al. 2011; Pérez-Hernández et al. 2013; Schwantes and Pande 2013; Schwantes et al. 2014; Bowman et al. 2014; Wu et al. 2017). The Markov state model can analyze transitions between local minimum-energy states, which are identified from clustering analysis methods. This is a powerful method for analyzing dynamics in the context of both long and short simulations of proteins.

Relaxation mode analysis (RMA) was developed to investigate the “dynamic” properties of spin systems (Takano and Miyashita 1995) and homopolymer systems for Monte Carlo (Koseki et al. 1997) and molecular dynamics (Hirao et al. 1997) analyses, and has been applied to various polymer systems (Hagita and Takano 2002; Saka and Takano 2008; Iwaoka et al. 2015; Natori and Takano 2017) to investigate their slow relaxation dynamics (de Gennes 1984; Doi and Edwards 1986). Recently, RMA has also been applied to biomolecular systems (Mitsutake et al. 2011; Mitsutake et al. 2005; Mitsutake and Takano 2015; Nagai et al. 2009, 2013). RMA approximately estimates slow relaxation modes and rates from trajectories obtained from simulations.

The relaxation modes {Xp} satisfy

Xp(t)Xq(0)=δp,qeλpt. 1

Here, 〈A(t)B(0)〉 denotes the equilibrium correlation of A at time t and B at time 0:

A(t)B(0)=Q,QA(Q)Tt(Q|Q)B(Q)Peq(Q), 2

where Tt(Q|Q) is the conditional probability that the system is in state Q at time t given that it is in state Q at time t = 0. Further, Peq(Q) denotes the probability that the system is in state Q at equilibrium. The relaxation rate of Xp is denoted by λp. The relaxation time is given by 1/λp. Note that the relaxation modes and rates are given as left eigenfunctions and eigenvalues of the time evolution operator of the master equation of the system, respectively, from the viewpoint of the statistical mechanics (Hirao et al. 1997; Koseki et al. 1997; Mitsutake and Takano 2015) (see the “Relaxation modes {Xp} and rates λp” section). The point of RMA is that we consider the variational problem, which is equivalent to the eigenvalue problem of the time evolution operator, and choose an appropriate trial function to estimate the slow relaxation modes and rates in the system (see the “RMA” section). From these processes, we obtain the generalized eigenvalue problem of the time correlation matrices for two different times. From the eigenvectors and eigenvalues, we approximately estimate slow relaxation modes and rates.

Conventional RMA approximately estimates slow relaxation modes by solving the generalized eigenvalue problem of the time correlation matrices of coordinates for two different times, C(τ + t0) and C(t0), which are calculated from the trajectory. Recently, dynamical analysis methods for molecular simulations of biopolymer systems have been developed to investigate slow dynamics. In these techniques such as time structure-based independent component analysis (tICA) (Naritomi and Fuchigami 2011, 2013), time-lagged independent component analysis (TICA) (Pérez-Hernández et al. 2013; Schwantes and Pande 2013), and dynamic component analysis (DCA) (Mori et al. 2015, 2016), time correlation matrices of certain physical quantities or states are used. (Note that tICA is a special case of RMA with t0 = 0. See Mitsutake et al. (2011) and Naritomi and Fuchigami (2011) for more details on the differences between tICA and RMA.) In tICA, TICA, and DCA, the time correlation functions C(τ) and C(0) are used, whereas C(τ + t0) and C(t0) are used in RMA. The relaxation modes and rates are given as left eigenfunctions and eigenvalues of the time evolution operator of the master equation of the system, respectively. From this point of view, RMA is related to Markov state models. (The relationship among the Markov state model, tICA, and TICA is explained in Pérez-Hernández et al. (2013), Schwantes and Pande (2013), and Mitsutake and Takano (2015).) The combination method of tICA and a Markov state model was also proposed (Pérez-Hernández et al. 2013; Schwantes and Pande 2013). A Markov state model was constructed from clustering in the subspace determined by tICA.

In this review, we first provide a definition of relaxation modes and rates from the viewpoint of the statistical mechanics in the “Relaxation modes {Xp} and rates λp” section. The “RMA” section explains the original RMA (RMA with a single evolution time) and the process of RMA using coordinates for the trial function in detail. The “Improvement of RMA” section explains the improved versions of RMA, including RMA with multiple evolution times, principal component RMA (PCRMA), two-step RMA, and Markov-state RMA (MSRMA). Finally, in the “Application of RMA to a system with large conformational changes” section, we present results from studies in which RMA was applied to a system with large conformational changes. The “Conclusions” section provides conclusions and perspectives on the state of the field.

Relaxation modes {Xp} and rates λp

In this section, we provide the definition of relaxation modes and rates from the viewpoint of the statistical mechanics (Risken 1989; Zwanzig 2001). The relaxation modes {Xp} satisfy Eq. 1. The relaxation modes and rates are given as left eigenfunctions and eigenvalues of the time evolution operator of the master equation of the system, respectively. We first explain the relation in three types of simulations satisfying the detailed balance condition.

In a Monte Carlo simulation satisfying the detailed balance condition, the time evolution of the probability P(Q;t) that the biomolecule is in a state Q=r1T,r2T, ,rNTT at time t is described by a master equation:

∂tP(Q;t)=QΓ(Q|Q)P(Q;t). 3

Here, Γ(Q|Q) denotes the (Q, Q)-component of the time evolution matrix Γ, and Q denotes the summation over all possible states. Γ(Q|Q) is also chosen so that the detailed balance for the equilibrium distribution function Peq(Q) is satisfied:

Γ(Q|Q)Peq(Q)=Γ(Q|Q)Peq(Q). 4

In the Brownian dynamics simulation, the time evolution of coordinates ri,(i = 1,⋯ ,N) is given by the Langevin equation for a biomolecule with N atoms:

dridt=1ζriU({rj})+wi. 5

Here, ri(t) denotes the position of the i th atom at time t, and ζ is the friction constant. The interaction between atoms is described by the potential U({ri}) = U(r1,…,rN). The random force wi(t) acting on the i th atom is a Gaussian white stochastic process and satisfies

wi,α(t)wj,β(t)=2ζkBTδα,βδi,jδ(tt), 6

where wi, α, kB, and T denote the α-component of wi (α = x, y, or z), the Boltzmann constant, and the temperature of the system, respectively. The Smoluchowski equation equivalent to Eq. 5 can be written as

∂tP(Q,t)=Γ(Q)P(Q,t)=i=1ri1ζkBTri+∂UriP. 7

Here, Q = {r1,…,rN} denotes a point in the phase space of the system, and P(Q, t)dQ denotes the probability that the system is found at time t in an infinitesimal volume dQ at point Q in the phase space. The time evolution operator Γ satisfies the detailed balance condition (Risken 1989):

Peq(Q)Γ(Q)δ(QQ)=Peq(Q)Γ(Q)δ(QQ), 8

where Peq(Q)expU({rj})kBT. Here, Γ(Q)δ(QQ) and the adjoint operator Γ(Q)δ(QQ) act only on Q in δ(QQ). In the matrix representation, so that Γ(Q)δ(QQ) = Γ(Q|Q) and Γ(Q)δ(QQ) = Γ(Q|Q), the detailed balance condition is the same as that in Eq. 4.

In a molecular dynamics simulation with the Langevin thermostat, the time evolution of coordinates ri,(i = 1,⋯ ,N) is given by the Langevin equation for a biomolecule with N atoms:

midvidt=ζviriU({rj})+wi, 9

with

dridt=vi. 10

Here, ri(t) and vi(t) denote the position and the velocity of the i th atom at time t, respectively. The mass of the i th atom is denoted by mi and ζ is the friction constant.

The Kramers equation, equivalent to Eqs. 9 and 10, can be written as

∂tP(Q,t)=Γ(Q)P(Q,t)=i=1Nrivi1mivi∂Uriζmivivi+kBTmiviP. 11

Here, Q = {r1,…,rN,v1,…,vN} denotes a point in the phase space of the system. The time evolution operator Γ satisfies the detailed balance condition:

Peq(Q)Γ(Q)δ(QQ)=Peq(𝜖Q)Γ(𝜖Q)δ(𝜖Q𝜖Q), 12

where Peq(Q)exp1kBT12imivi2+U({rj}) and Peq(Q) = Peq(𝜖Q). Here, 𝜖Q denotes the time-reversed state of the state Q, namely, 𝜖Q = {𝜖1r1,…,𝜖NrN,.𝜖N+ 1v1,…,𝜖2NvN} with

𝜖i=1fori=1,,N,1fori=N+1,,2N. 13

In the matrix representation, the detailed balance condition is written as follows:

Γ(Q|Q)Peq(Q)=Γ(𝜖Q|𝜖Q)Peq(𝜖Q). 14

The time evolution equation of P(Q;t) of Eqs. 7 and 11 corresponds to Eq. 3 in the matrix representation. In Monte Carlo and Brownian dynamics, because only coordinates are the degrees of freedom in the system, 𝜖Q = Q, the detailed balance condition in all three cases is given by Eq. 14.

We now consider the eigenvalue problem of the time evolution operator Γ(Q|Q) of the master equation:

Qϕn(Q)Γ(Q|Q)=λnϕn(Q). 15
QΓ(Q|Q)ψn(Q)=λnψn(Q). 16

Here, ϕn(Q) and ψn(Q) are the left and right eigenfunctions of the time evolution operator Γ with eigenvalue λn, respectively. When we define a quantity ϕ^n(Q) through

ψn(Q)=ϕ^n(Q)Peq(Q), 17

then ϕ^n(Q)=ϕn(𝜖Q). The eigenfunctions are chosen to satisfy the orthonormal condition:

Qϕm(Q)ψn(Q)=ϕm(Q)ϕ^nPeq(Q)=ϕmϕ^n=δm,n. 18

The equilibrium time-displaced correlation function of ϕn(Q) and ϕ^m(Q) is given by the following:

ϕm(t)ϕ^n(0)=QQϕm(Q)Tt(Q|Q)ϕ^n(Q)Peq(Q)=QQϕm(Q)eΓt(Q|Q)ϕ^n(Q)Peq(Q)=QQϕm(Q)eΓt(Q|Q)ψn(Q)=Qϕm(Q)eλntψn(Q)=δm,neλnt, 19

where Tt(Q|Q) = e−Γτ(Q|Q) is the conditional probability that the system is found at time t at Q given that the system is at Q at time 0.

If two quantities A(Q) and B(Q) are expanded as

A(Q)=nanϕn(Q)andB(Q)=nb^nϕ^n(Q), 20

then the time correlation function of A and B in the equilibrium state is given by

A(t)B(0)=nanb^nexp(λnt). 21

Thus, in terms of ϕn(Q) and ϕ^n(Q), the correlation function 〈A(t)B(0)〉 is decomposed into a sum of exponentially relaxing contributions. Therefore, we use two sets of functions, {ϕn(Q)} and {ϕ^n(Q)}, as relaxation modes, and refer to {λn} as their relaxation rates. The relaxation modes and rates are given as left eigenfunctions and eigenvalues of the time evolution operator of the master equation of the system, respectively.

RMA

RMA with a single evolution time, t0

RMA approximately estimates slow relaxation modes and rates from trajectories obtained from simulations. Herein, we explain how to obtain the slow relaxation modes and rates. The point of this method is that we consider the variational problem, which is equivalent to the eigenvalue problem of the time evolution operator, and choose an appropriate trial function in order to estimate the slow relaxation modes and rates in the system.

We consider the equations for the conditional probability:

Qϕn(Q)Tτ(Q|Q)=eλnτϕn(Q), 22
QTτ(Q|Q)ψn(Q)=eλnτψn(Q). 23

The eigenvalue problem in Eqs. 22 and 23 is equivalent to the variational problem

δR=0 24

with

R[ϕn]=ϕn(τ)ϕ^n(0)ϕn(0)ϕ^n(0), 25

and the stationary value of R gives the eigenvalue exp(−λnτ). RMA treats the variational problem of Eqs. 24 and 25 using trial functions instead of the eigenvalue problem of Eqs. 22 and 23. To choose the trial function given by a linear combination of important relevant quantities, we can evaluate the relaxation modes and rates from simulation data.

Herein, we consider a biopolymer composed of N atoms and only treat the coordinates, because the velocities have faster relaxations (∼ picosecond order) than coordinates in protein systems. We assume that R is a 3N-dimensional column vector that consists of a set of atomic coordinates relative to their average coordinates

RT=(r1T,r2T,,rNT)=(x1,y1,z1,,xN,yN,zN), 26

with

ri=riri, 27

where ri is the coordinate of the i th atom of the biopolymer in the center-of-mass coordinate system, and 〈ri〉 is its average. Note that because we consider the coordinates only, ϕ^n(Q)=ϕn(𝜖Q)=ϕn(Q) holds.

In RMA, we use the following function as an approximate relaxation mode:

Xp(Q)=i=13Nfp,iRi(t0/2;Q), 28

with

Ri(t;Q)=QRi(Q)Tt(Q|Q). 29

Here, Ri(Q) is the i th component of R. The quantity Ri(t;Q) is the expectation value of Ri after a period t starting from a state Q and satisfies Ri(t;Q)|t= 0 = Ri(Q). The parameter t0 is introduced in order to reduce the relative weight of the faster modes contained in R, and it is expected that Eq. 28 becomes a better approximation as t0 becomes larger.

For the trial function (28), R defined by Eq. 25 is given by

R[Xp]=i=13Nj=13Nfp,iCi,j(t0+τ)fp,ji=13Nj=13Nfp,iCi,j(t0)fp,j, 30

where Ci, j(t) is a component of a 3N × 3N symmetric matrix C(t) defined by

Ci,j(t)=Ri(t)Rj(0). 31

Then, the variational problem of Eq. 25 becomes a generalized eigenvalue problem

j=13NCi,j(t0+τ)fp,j=exp(λpτ)j=13NCi,j(t0)fp,j. 32

The orthonormal condition of Eq. 18 for Xp is written as

i=13Nj=13Nfp,iCi,j(t0)fp,j=δp,q. 33

Equations 32 and 33 determine the relaxation rates λp and the corresponding relaxation modes fp, i. We chose the indices of λp so that 0 < λ1λ2 ≤⋯ holds. Here, the relation

Tt(Q|Q)Peq(Q)=Tt(Q|Q)Peq(Q), 34

which is equivalent to the detailed balance condition of Eq. 14 with 𝜖Q = Q, and the Markovian property

QTt1(Q|Q)Tt2(Q|Q′′)=Tt1+t2(Q|Q′′) 35

are used.

The inverse transformation of Eq. 28 is given by

Ri(t0/2;Q)=p=13Ngi,pXp(Q) 36

with

gi,p=j=13NCi,j(t0)fp,j. 37

The time correlation functions of Ri are reproduced by

Ri(t)Rj(0)=pqgi,pgj,qXptt0Xq(0),pgi,pgj,pexpλptt0,=pg~i,pg~j,pexp(λpt), 38

for tt0. Here,

g~i,p=gi,pexp(λpt0/2). 39

Because we are considering position coordinates only, the detailed balance condition yields the following consequences: C(t) is a symmetric matrix, Ci, j(t) = Cj, i(t); {λp} are real and positive, which corresponds to pure relaxation. We refer to this method as the “RMA method with a single evolution time,” which is t0/2.

In practice, the time correlation matrices for the two different times are calculated through simulations. Then, by solving the generalized eigenvalue problem, {λp} and {Xp} are obtained from the eigenvalues and eigenvectors, respectively. To examine the validity of the present analysis, the autocorrelation functions Ci, i(t) are reconstructed from the estimated eigenvalues and eigenvectors and are compared with those directly calculated via simulation.

Herein, we comment on the trial function. When RMA was first introduced to a spin system, states of spins on a lattice were used as the trial function (Takano and Miyashita (1995)). When RMA was first introduced to polymer systems, the coordinates of polymers were used as the trial function (Koseki et al. (1997); Hirao et al. (1997)). In polymer systems, the Rouse modes, which were derived from the theory of polymer physics (Doi and Edwards 1986), correspond to the relaxation modes. Rouse modes are given as linear combinations of coordinates. Thus, when RMA was applied to polymer systems, the modes obtained by RMA were compared with the Rouse modes. In protein systems, PCA using coordinates has been widely used. In PCA, the eigenvalue problem of the covariance matrix of coordinates is solved. Therefore, when we first applied RMA to a hetero polymer system (protein system), it seemed to be better to use coordinates as trial functions. The results of RMA and PCA were directly compared with each other. Recently, we have proposed to use physical quantities with slow motions as the trial functions and PCRMA and two-step RMA have been introduced (see the “Improvement of RMA” section). However, RMA using coordinates as the trial functions has an advantage that we can easily convert the information on the slow relaxation modes to the information in coordinate space.

RMA for protein systems

In homopolymer systems, relaxation of the positions of a polymer relative to the center of the mass is investigated. This means that the translational degrees of freedom are removed from the coordinates of the polymer. Because the rotational degrees of freedom remain, the rotational relaxation of the polymer is observed as slow relaxations. In protein systems, it is of interest to evaluate fluctuations of the conformations of a biomolecule around its average conformation. Thus, the translational and rotational degrees of freedom are removed from the sampled conformations of a biomolecule. In practice, treatment of the generalized-eigenvalue problem for removing the translational degrees of freedom in the homopolymer system was given by Koseki et al. (1997). Herein, we explain how to treat the generalized eigenvalue problem for removing the translational and rotational degrees of freedom when using the coordinates for the trial function (Mitsutake et al. 2011). The point of this process is that the generalized eigenvalue problem for real symmetric matrices can be easily solved numerically if the matrices are positive definite. Therefore, we shift the zero eigenvalues to finite positive values without changing the other eigenvalues and the corresponding eigenvectors.

A schematic illustration of the process for RMA using coordinates for the trial function is shown in Fig. 1. First, we remove the translational and rotational degrees of freedom as well as conduct PCA (Eckart 1935; McLachlan 1979). After the average structure converges, the origin of the coordinate system is chosen to be the center of the mass of the average positions, 〈ri〉 with i = 1,…,N, and the axes of the coordinate system are chosen to be the principal axes of the moment of the inertia tensor of the average positions. We calculate Ci,j(t)=Ci,j(t)+Cj,i(t)2 and C(t):

C(t)=C(t)+α=x,y,zexp(λαtr(tt0))dαtrdαtrT+α=x,y,zexp(λαrot(tt0))dαrotdαrotT, 40

where dxtr, dytr, and dztr are unit vectors given by

dxtr=1N(1,0,0,1,0,0,,1,0,0)T,dytr=1N(0,1,0,0,1,0,,0,1,0)T,dztr=1N(0,0,1,0,0,1,,0,0,1)T, 41

and dxrot, dyrot, and dzrot are unit vectors given by

dxrot=1i=1n(zi2+yi2)×(0,z1,y1,0,z2,y2,,0,zN,yN)T,dyrot=1i=1n(zi2+xi2)×(z1,0,x1,z2,0,x2,,zN,0,xN)T,anddzrot=1i=1n(yi2+xi2)×(y1,x1,0,y2,x2,0,,yN,xN,0)T. 42

Fig. 1.

Fig. 1

Schematic illustration of the RMA process using the coordinate R for the trial function

The values of λαtr and λαrot are usually set to zero. These unit vectors satisfy the following relations:

dαadβb=dαaTdβb=δα,βδa,b 43

and

C(t)dαa=0, 44

where α, β = x, y, z and a, b = tr,rot. Then, we solve the generalized eigenvalue problem for C(t0 + τ) and C(t0), C(t0+τ)vp=exp(λpτ)C(t0)vp, with the orthonormal condition vpTC(t0)vq=δp,q. The unit vectors dαa are eigenvectors of this generalized eigenvalue problem with eigenvalues exp(λαaτ). We denote fp as the eigenvectors other than dαa. Because dαaTC(t)fp=exp(λαa(tt0))dαaTfp=0 , C(t)fp=C(t)fp holds. Therefore, fp are identical with the eigenvectors fp = (fp,1,fp,2,…,fp,3N)T of the generalized-eigenvalue problem for C(t0 + τ) and C(t0) with the same eigenvalues exp(−λpτ). Thus, fp and exp(−λpτ) can be obtained by solving the generalized eigenvalue problem for C(t0 + τ) and C(t0), which are real symmetric positive definite matrices.

After obtaining relaxation modes and rates, we confirm whether or not the slow relaxation modes and rates obtained using τ and t0 are appropriate. For this purpose, the convergences of slow relaxation times as a function of τ are examined. The autocorrelation functions Ci, i(t) are reconstructed from the estimated eigenvalues and eigenvectors and are compared with those directly calculated via simulation (especially the slow relaxation behavior). After examining the validity, we use the obtained relaxation modes and rates for analysis.

Improvement of RMA

Selection of τ and t0 and relevant quantities for the trial function

The relaxation times {1/λp} and the {Xp} obtained via RMA depend on the manner in which t0 and τ are selected in practice. For simplification, we here consider the case of one physical quantity, R. From the variational problem of Eqs. 24 and 25, the relaxation time 1/λ is obtained from the gradient of the straight line connecting two points at t = t0 and t = t0 + τ in the semi-log plot of the correlation function C(t) = 〈R(t)R(0)〉−〈R2 versus t, as shown in Fig. 2a. If the time correlation function of the physical quantity contains several {1/λp}, and if we choose t0 = 0 (tICA case) or a small t0 and small τ, as shown in Fig. 2a (green line), the obtained 1/λ does not correspond to the slow relaxation behavior of log C(t) at long times. To investigate the slow relaxation, we wish to choose values of t0 and τ that are as large as possible, as shown in Fig. 2a (blue line). However, the choice of a longer t0 and τ is also limited, because of the decreasing accuracy of the time correlation function over long time periods. Therefore, we must choose the appropriate t0 and τ.

Fig. 2.

Fig. 2

Schematic illustration of RMA with a single evolution time t0 (a), and multiple evolution times (1) using t1 and t2 (b) and (2) using ti (c)

We can improve the RMA explained above by using two different approaches: introduction of multiple evolution times and using the different relevant physical quantities obtained from coordinates (and velocities) for the trial function. For the first improvement, we describe two types of methods with multiple evolution times, as shown in Fig. 2b, c. (The detailed descriptions are given by Nagai et al. (2013), Natori and Takano (2017), and Karasawa et al. (2017).) For the second improvement, we describe the PCRMA (Nagai et al. 2013), in which the relevant physical quantities for the trial function are given by the PC modes with large structural fluctuations and the two-step RMA (Natori and Takano 2017; Karasawa et al. 2017), which are in turn given by the slowest relaxation modes roughly obtained by RMA. Moreover, the MSRMA (Mitsutake and Takano 2015) is also proposed. We will describe these two improved RMAs in detail below.

RMA with multiple evolution times

RMA with multiple evolution times t1 and t2(1)

In this method, the following trial functions are used as approximate relaxation modes:

Xp(Q)=i=13Nfp,i1Ri(t1/2;Q)+i=13Nfp,i2Ri(t2/2;Q). 45

Note that two evolution times, t1/2 and t2/2, are used instead of a single evolution time, t0/2. Because the contributions of faster modes in R time-evolved for t1/2 and those for t2/2 are different, the approximate relaxation modes can extract the faster modes, which cannot be extracted by the approximate relaxation modes using a single evolution time (see Fig. 2b). Using Eq. 45 as a trial function for the variational problem, the following generalized eigenvalue problem is obtained:

j=16NCi,j(t0+τ)fp,j=exp(λpτ)j=16NCi,j(t0)fp,j, 46

with fp=(fp1T,fp2T)T. Here, C(t) is a 6N × 6N matrix defined by

C(t)=C1,1(t)C1,2(t)C2,1(t)C2,2(t), 47

and Cμ1,μ2(t) is an 3N × 3N matrix defined by

Ci,jμ1,μ2(t)=Ritμ12+tμ22+tRj(0), 48

where μ1, μ2 = 1 or 2. The orthonormal condition is written as

i=16Nj=16Nfp,iCi,j(0)fp,j=δp,q. 49

The inverse transformation of Eq. 45 is given by

Ri(t1/2;Q)=p=16Ngi,p1Xp(Q)Ri(t2/2;Q)=p=16Ngi,p2Xp(Q) 50

with

gi,p=j=16NCi,j(0)fp,j, 51

where gp=gp1T,gp2TT. The time correlation functions of Ri are reproduced by

Ri(t)Rj(0)p=16Ng~i,pavg~j,pavexpλpt, 52

where

g~i,pav=(exp(λpt1/2)gi,p1+exp(λpt2/2)gi,p2)/2. 53

RMA with multiple evolution times ti (2)

When the relevant physical quantities R in the trial function exhibit different relaxations, it is preferable to use different evolution times for the different physical quantities, as shown in Fig. 1c. That is, if we know the characteristic time scales of the relevant physical quantities, we can choose a specific evolution time ti for each relevant physical quantity Ri based on its characteristic time scale. This RMA method is referred to as “RMA with multiple evolution times {ti/2}.” In this method, we use the following trial function:

Xp(Q)=i=13Nfp,iRi(ti/2;Q). 54

The parameter ti is introduced in order to reduce the relative weight of the faster modes contained in Ri. Further, it is expected that Eq. 54 would yield a superior approximation for larger ti values.

The variational problem becomes a generalized-eigenvalue problem:

j=13NCi,jti+tj2+τfp,j=exp(λpτ)j=13NCi,jti+tj2fp,j. 55

Here, Ci, j(t) = 〈Ri(t)Rj(0)〉 and the orthonormal condition for Xp is expressed as

i=13Nj=13Nfp,iCi,jti+tj2fq,j=δp,q. 56

Equations 5455, and 56 determine the relaxation rates λp and the corresponding relaxation modes. We chose the indices of λp such that 0 < λ1λ2 ≤⋯ holds. The inverse transformation of Eq. 54 is given by

Ri(ti/2;Q)=p=13N6gi,pXp(Q), 57

with

gi,p=j=13NCi,jti+tj2fp,j. 58

The time correlation functions of Ri are given by

Ri(t)Rj(0)=pQgi,pgj,qXptti+tj2Xq(0),pgi,pgj,pexpλptti+tj2,=pg~i,pg~j,pexp(λpt), 59

for t ≥ (ti + tj)/2. Here,

g~i,p=gi,pexp(λpti/2). 60

RMAs to automatically reduce the degrees of freedom of relevant quantities for the trial function

RMA requires relatively high statistical precision of the time correlation matrices because of treatment for the generalized eigenvalue problem; thus, it is difficult for RMA to handle a large number of degrees of freedom directly. We must therefore reduce the number of degrees of freedom automatically.

In an original RMA, the coordinates (and velocity) are used for the trial function. The results may change depending on which relevant quantities are used for the trial function because their correlation functions are fitted using t0 and τ. (For the Markov state model, the dependence of relaxation times on the selection of states is discussed in Swope et al. (2004) and Pérez-Hernández et al. (2013).) It is better to use the relevant quantities that include the slow behavior. For the second improvement, we describe the PCRMA in which the relevant quantities are given by the PC modes with large structural fluctuations, and the two-step RMA in which the quantities are given by the slowest relaxation modes roughly obtained by the first RMA. A schematic illustration of PCRMA and two-step RMA is given in Fig. 3.

Fig. 3.

Fig. 3

Schematic illustration of PCRMA (a) and two-step RMA (b)

PCRMA

To apply RMA to a protein system by reducing its degrees of freedom, we proposed an improved method, which is referred to as the PCRMA method (Nagai et al. 2013). In this method, PCA is carried out first, and then, RMA is applied to a small number of principal components with large fluctuations (Φ =(Φ1,Φ2,,ΦNc)T). We use the following function as an approximate relaxation mode:

Xp(Q)=i=1Ncfp,iΦi(t0/2;Q). 61

Because the degrees of freedom is reduced to Nc and the relevant quantities with large variance tend to have slow relaxations, the slow relaxation times can be estimated by setting t0 and τ as large values. Note that because the selected principal components also contain faster relaxation modes, as shown in Fig. 4, Nagai et al. (2013) also combined PCRMA with the RMA using multiple evolution times (1) explained above. Note that in PCRMA, if the Ncth or more PC modes (with relatively small fluctuations) have slow relaxation, the slow behaviors may not be extracted; thus, there is a possibility that the slow relaxations would not be estimated with small structural fluctuations.

Fig. 4.

Fig. 4

Schematic illustrationfor PCRMA

Two-step RMA

Using a similar process to that of PCRMA, we proposed a two-step RMA method (Natori and Takano 2017; Karasawa et al. 2017). Based on our experience, the slow {Xp} obtained from the conventional RMA with small t0 and τ contains the true slow {Xp} (Mitsutake et al. 2011), although the {1/λp} values are underestimated. The slow relaxation modes obtained by the first RMA may contain the true slow relaxation modes. Thus, we use the slow relaxation modes roughly obtained from the first RMA as the relevant quantities for the trial function. In this technique, RMA with a single evolution time using small t0 and τ is implemented first, and {Xp} and {λp} are roughly estimated. We then apply the second RMA to a small number of the obtained slowest {Xp}. We denote the number of {Xp} used in the second RMA as Nm. In the second RMA, we also use the previously presented technique of RMA with multiple evolution times (2), because the characteristic time scales of the {Xp} obtained from the first RMA are roughly given by the relaxation times {1/λp}. In the second RMA, we use the following trial function:

Xu(Q)=p=1Nmfu,pXp(tp/2;Q). 62

Here, Xp(Q) is the relaxation mode obtained from the first RMA and tp is determined from 1/λp. A detailed explanation is given by Natori and Takano (2017) and Karasawa et al. (2017).

In the second RMA, the time interval τ can be chosen to be large, because the number of degrees of freedom is reduced and the physical quantities {Xp} exhibit slow relaxations. Using the second RMA, the estimation accuracy of the relaxation modes and times can be improved.

Markov state RMA

As mentioned above, in RMA, the relaxation modes and rates are given as left eigenfunctions and eigenvalues of the time evolution operator of the master equation of the system, respectively. From this point of view, RMA is related to Markov state models. Herein, we consider the relation between RMA and Markov state models and propose the new method of MSRMA.

In the simplest Markov state model, the phase space of the system, where only the position coordinates are considered, is divided into clusters (subsets) Si, i = 1,…,n. First, the joint probability P¯i,j(τ)=P(QSi,τ;QSj,0) that the state of the system Q is in the j th cluster at time 0 and is in the i th cluster at time τ > 0 is calculated in a simulation. Second, the transition probability T¯i,j(τ) that the state of the system is found in the i th cluster after time τ starting from a state in the j th cluster is calculated by

T¯i,j(τ)=P¯i,j(τ)/p¯j, 63

where p¯j=P(QSj) is the probability that the state of the system is found in the j th cluster, which is estimated in the simulation. Then, by solving the eigenvalue problem

f¯pTT¯(τ)=f¯pTΛ¯p 64

for the transition matrix T¯(τ)=(T¯i,j(τ)), the p th eigenvector f¯p and its eigenvalue Λ¯p are obtained. The eigenvector f¯1(1,1,,1)T corresponds to the equilibrium state and its eigenvalue Λ¯1=1. Other eigenvectors f¯p represent structural transitions and the corresponding eigenvalues Λ¯p give their relaxation time scales τ¯p as

τ¯p=τlnΛ¯p. 65

Note that in the Markov description, it is important that the states are defined in a kinetically meaningful way (Swope et al. 2004; Pérez-Hernández et al. 2013). We need to define the states that are classified by order parameters representing the dynamics and kinetics of the system. Even with a good choice of states, in order for a Markov description of the process to be accurate, the time interval τ should also be chosen carefully. In other words, for the Markov description to work, the time interval of the transition matrix τ must be chosen appropriately so that it is as large as the slowest relaxation time of the states. When plotting τ¯p as a function of τ, τ¯p slowly converges to the appropriate time scale when τ is increased. In addition, when a much longer τ than the slowest relaxation time of the states is used, the Markov state model is not expected to be accurate. Thus, we usually set the time interval τ to the value when the variation of τp is sufficiently flat (Swope et al. 2004; Pérez-Hernández et al. 2013).

The abovementioned procedure of the Markov state model is related to the following procedure of RMA. We consider an approximate relaxation mode given by

X¯p=i=1nfp,iδi(t0/2;Q), 66

where δi(t;Q) is defined in the same way as Ri(t;Q) in Eq. 29 from δi(Q) given as a function of the state Q of the system by

δi(Q)=1forQSi,0forQSi. 67

Then, the generalized eigenvalue problem is given by

jC¯i,j(t0+τ)f¯p,j=eλ¯pτjC¯i,j(t0)f¯p,j, 68

with

i,jf¯p,iC¯i,j(t0)f¯q,j=δp,q, 69

where

C¯i,j(t)=δi(t)δj(0). 70

According to the definition of δi(Q), it follows that C¯i,j(t) is the joint probability P¯i,j(t).

If we set t0 = 0, the generalized eigenvalue problem (68) becomes the eigenvalue problem (64) with Λ¯p=eλ¯pτ or τ¯p=1/λ¯p, because C¯(0)=diag(p¯1,,p¯n) and C¯(τ)C¯(0)1=T¯(τ). Thus, the Markov state model is a special case of MSRMA with t0 = 0.

Because δi(t0/2;Q) in Eq. 66 reduces the contributions of faster modes in δi(Q), the solutions of the generalized eigenvalue problem (68) provides better approximations to the slow relaxation modes and rates as t0 becomes larger. Therefore, the relaxation times τ¯p obtained by the Markov state model are expected to be improved by solving Eq. 68 with t0 > 0 rather than Eq. 64.

Application of RMA to a system with large conformational changes

In this section, we apply RMA to a protein system simulation to show the effectiveness of RMA. The selection of order parameters in simulations is important to analyze the trajectory. PCA, which is a static analysis method, extracts large structural fluctuations from simulations, and the obtained PC mode is used to obtain the order parameters. Moreover, it has now become possible to perform long simulations such as those of unfolded and folded protein structures, and when the simulation involves large structural changes, the difference between local minimum-energy states is relatively small compared with that between the folded and unfolded states. In this case, it is difficult for PCA to extract the effective modes or order parameters to accurately identify the local minimum-energy states. By contrast, RMA extracts slow relaxation modes. It is thought that the local minimum-energy states are usually stable so that the system remains in this state for a long time during a simulation. The order parameters with slow relaxation may correspond to the directions between local minimum-energy states. Thus, slow relaxation modes may be suitable order parameters to identify local minimum-energy states and the transitions between them. To validate this concept, we applied RMA to the 10-residue peptide, chignolin in water near its folding transition temperature.

The detailed results are described in Mitsutake et al. (2011). Chignolin consists of a 10-amino acid sequence, GYDPETGTWG and adopts a β-hairpin turn structure (Honda et al. 2004). Several simulations of chignolin have been reported to date (Satoh et al. 2006; Suenaga et al. 2007; Harada and Kitao 2011; Kührova et al. 2012; Okumura 2012). Previous research has shown that chignolin has a stable (native) and a misfolded state, which are both found as hairpin-like structures (see Fig. 5c). These two states have a common turn structure from Asp3 to Glu5 but slightly different hydrogen bond patterns. RMA requires a relatively high level of statistical precision for the time correlation matrices and therefore requires a long simulation where many transitions between local minimum-energy states occur. In addition, we sought to analyze the system with large conformational changes. Thus, we performed a 750-ns molecular dynamics simulation of chignolin in aqueous solution near the transition temperature from an extended structure (Case et al. 2014). We observed many transitions among structures, including the native, misfolded, and unfolded states, by performing the simulation at 450 K. We used the coordinates of Cα atoms on the backbone as coordinates so that the degrees of freedom were 30. After removing the translational and rotational motions from the coordinates of Cα atoms, PCA and RMA were carried out on the coordinates of Cα atoms (see Fig. 1). For RMA, we set t0 and τ to 10.0 and 20.0 ps, respectively.

Fig. 5.

Fig. 5

The free-energy surfaces for a the first PC mode Φ1 and the second PC mode Φ2, and for b the first slowest RM and the second slowest RM in the case of t0 = 10.0 ps and τ = 20.0 ps. c Snapshots of the native, misfolded, intermediate, and unfolded states classified by RMA, and d distributions for the native (red), misfolded (green), and intermediate (blue) states on the free-energy surface of the first PC mode and the second PC mode. e Relaxation times of the second relaxation mode obtained by MSRMA as a function of the time interval τ. In e, the line of t0 ps corresponds to the results of a simple Markov state model. The figure was reproduced from Mitsutake and Takano (2015)

Figure 5 shows the free-energy surfaces obtained from PCA (a) and RMA (b). From the free-energy surface of PCA, the native and misfolded states were not distinguished because the conformational difference between them is much smaller than the conformational fluctuations of the system (the third PC mode distinguished the native and misfolded states). By contrast, in RMA, the transition between the native and misfolded structures is slow, and the slowest relaxation mode was found to be the axis distinguishing them. This analysis showed that the slow relaxation mode is a good order parameter to distinguish the native and misfolded structure. Interestingly, we could also identify the intermediate structure. By extracting the structures in the center part of the free-energy surface shown in Fig. 5b, the cluster was formed with a turn structure common to the native and misfolded structures. Because the structures at both terminals fluctuate, a cluster of intermediate structures forming a turn is also obtained, while ignoring the fast relaxing movement of both terminals. The upper part of the free-energy surface shown in Fig. 5b corresponded to the extended structure. Figure 5c shows the characteristic structures for the four states. When plotting the points for the obtained intermediate structure on the free-energy surface of PCA in Fig 5d, the points were distributed widely because both terminals fluctuate. Thus, RMA can identify the characteristic structure, even when it is only partially formed. From the free-energy surface obtained by RMA, it is clarified that chignolin folds to the native or misfolded structures through the intermediate (turn) structure from the extended structures.

Because the structures were classified into a smaller number of states using the free-energy surface obtained by RMA, we then applied the Markov state model and MSRMA to analyze these four states: native, misfolded, intermediate, and unfolded states. Figure 5e shows the relaxation time τp = 1/λp obtained by MSRMA as a function of τ when t0 = 0,10,50,100,200, and 500 ps. Because the first eigenvector corresponds to the steady state with infinite relaxation time τ1 = , we show the second slowest relaxation times. The line of t0 = 0 corresponds to the results of a simple Markov state model. In the case of t0 = 0, the τp values slowly approach the appropriate time scale, i.e., the values for plateau regions or peak values of the solid lines, when τ is increased. For the lines of t0 > 0, the values of τp quickly approach the appropriate time scale, i.e., those corresponding to the values for plateau regions or peak values. Thus, the slow relaxation times can be improved when applying MSRMA with t0 > 0, which is introduced to reduce the relative weight of the faster modes.

Overall, RMA can be used to effectively analyze long simulations at room temperature and is also useful for investigating systems with large conformational changes, such as intrinsically disordered proteins and protein folding.

Conclusions

In this paper, we have reviewed the method and application of RMA, a dynamic analysis method for protein simulations. We described the definition of relaxation modes and rates, which correspond to the left eigenfunctions and eigenvalues of the time evolution operator of the master equation of the system, respectively. After providing the definition, we explained how to estimate the slow relaxation modes and rates from simulation data. We also summarized several new RMAs proposed, including RMA with multiple evolution times, PCRMA, two-step RMA, and MSRMA. Finally, to demonstrate the effectiveness of RMA, we briefly presented the analysis results of the unfolding/folding simulation of the 10-residue peptide chignolin detected near the transition temperature. The simulation results showed that the relaxation mode is a good order parameter for not only extracting the transition between the native state and misfolded state but also for identifying the intermediate state, which is partially folded. This suggests that RMA is suitable to investigate a system with large structural changes and naturally denatured protein systems. Although RMA is efficient for a longer simulation than the longest relaxation time of the system, it can also extract rare events in a finite-time simulation such as that conducted at the microsecond scale. By examining the extent to which the correlation function can be reconstructed, we can clarify the information that can be obtained on dynamics using the obtained relaxation modes and rates. Theoretical studies to compare data of the Markov state model with experimental data from nuclear magnetic resonance and neutron scattering analyses have emerged recently (Xia et al. 2013; Lindner et al. 2013; Zheng et al. 2013; Bowman et al. 2014). In the future, it will also be important to interpret the theoretical relationships in light of experimental data.

Acknowledgements

The authors would like to thank Mr. Toshiki Nagai, Mr. Taku Yamamoto, Mr. Yuta Koizumi, Mr. Satoshi Natori, and Mr. Naoyuki Karasawa at Keio University for fruitful discussions.

Funding information

This work was supported by JST PRESTO (JPMJPR13LB). This work was also partially supported by a Grant-in-Aid for Scientific Research (C) (No. 24540441) from the Japan Society for the Promotion of Science.

Compliance with ethical standards

Conflict of Interests

Ayori Mitsutake declares that he has no conflicts of interest. Hiroshi Takano declares that he has no conflicts of interest.

Ethical approval

This article does not contain any studies with human participants or animals performed by any og the authors.

Footnotes

This article is part of a Special Issue on ‘Biomolecules to Bio-nanomachines - Fumio Arisaka 70th Birthday’ edited by Damien Hall, Junichi Takagi and Haruki Nakamura.

References

  1. Abagyan R, Argos P. Optimal protocol and trajectory visualization for conformational searches of peptides and proteins. J Mol Biol. 1992;225:519–532. doi: 10.1016/0022-2836(92)90936-E. [DOI] [PubMed] [Google Scholar]
  2. Amadei A, Linssen ABM, Berendsen HJC. Essential dynamics of proteins. Proteins Struct Funct Genet. 1993;17:412–425. doi: 10.1002/prot.340170408. [DOI] [PubMed] [Google Scholar]
  3. Baher I, Atilgan AR, Erman B. Direct evaluation of thermal fluctuations in proteins using a single-parameter harmonic potential. Fold Des. 1997;2:173. doi: 10.1016/S1359-0278(97)00024-2. [DOI] [PubMed] [Google Scholar]
  4. Bowman GR, Pande VS, Noé F, editors. An introduction to Markov state models and their application to long timescale molecular simulation. Dordrecht: Springer; 2014. [Google Scholar]
  5. Brooks B, Karplus M. Harmonic dynamics of proteins: normal modes and fluctuations in bovine pancreatic trypsin inhibitor. Proc Natl Acad Sci USA. 1983;80:6571. doi: 10.1073/pnas.80.21.6571. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Buchete N, Hummer G. Coarse master equations for peptide folding dynamics. J Phys Chem B. 2008;112:6057. doi: 10.1021/jp0761665. [DOI] [PubMed] [Google Scholar]
  7. Case DA, Babin V, Betz RM, Cai Q, Cerutti DS, Cheatham IIITE, Darden TA, Duke RE, Gohlke H, Götz AW, Gusarov S, Homeyer N, Janowski P, Kaus J, Kolossváry I, Kovalenko A, Lee TS, Le Grand S, Luchko T, Luo R, Madej B, Merz KM, Paesani F, Roe DR, Roitberg A, Sagui C, Salomon-Ferrer R, Seabra G, Simmerling CL, Smith W, Swails J, Walker RC, Wang J, Wolf RM, Wu X, Kollman PA. AMBER 14. San Francisco: University of California; 2014. [Google Scholar]
  8. Chodera JD, Swope WC, Pitera JW, Dill KA. Long-time protein folding dynamics from short-time molecular dynamics simulations. Multiscale Model Simul. 2006;5:1214. doi: 10.1137/06065146X. [DOI] [Google Scholar]
  9. Chodera JD, Singhal N, Vande VS, Dill KA, Swope WC. Automatic discovery of metastable states for the construction of Markov models of macromolecular conformational dynamics. J Chem Phys. 2007;126:155101. doi: 10.1063/1.2714538. [DOI] [PubMed] [Google Scholar]
  10. Chodera JD, Noé F. Markov state models of biomolecular conformational dynamics. Curr Opin Struct Biol. 2014;25:135. doi: 10.1016/j.sbi.2014.04.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Cui Q, Bahar I, editors. Normal mode analysis: theory and applications to biological and chemical systems. London: Chapman & Hall/CRC; 2005. [Google Scholar]
  12. Dror RO, Dirks RM, Grossman JP, Xu H, Shaw DE. Biomolecular simulation: a computational microscope for molecular biology. Annu Rev Biophys. 2012;41:429. doi: 10.1146/annurev-biophys-042910-155245. [DOI] [PubMed] [Google Scholar]
  13. de Gennes PG. Scaling concepts in polymer physics. Ithaca: Cornell University Press; 1984. [Google Scholar]
  14. Doi M, Edwards SF. The theory of polymer dynamics. Oxford: Oxford University Press; 1986. [Google Scholar]
  15. Eckart C. Some studies concerning rotating axes and polyatomic molecules. Phys Rev. 1935;47:552–558. doi: 10.1103/PhysRev.47.552. [DOI] [Google Scholar]
  16. Freddolino PL, Harrison CB, Liu Y, Schulten K. Challenges in protein folding simulations: timescale, representation, and analysis. Nat Phys. 2010;6:751. doi: 10.1038/nphys1713. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Garcia AE. Large-amplitude nonlinear motions in proteins. Phys Rev Lett. 1992;68:2696–2699. doi: 10.1103/PhysRevLett.68.2696. [DOI] [PubMed] [Google Scholar]
  18. Go N, Noguti T, Nishikawa T. Dynamics of a small globular protein in terms of low-frequency vibrational modes. Proc Natl Acad Sci USA. 1983;80:3696. doi: 10.1073/pnas.80.12.3696. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Hagita K, Takano H. Relaxation mode analysis of a single polymer chain in a melt. J Phys Soc Jpn. 2002;71:673–676. doi: 10.1143/JPSJ.71.673. [DOI] [Google Scholar]
  20. Harada R, Kitao A. Exploring the folding free energy landscape of a β-hairpin miniprotein, chignolin, using multiscale free energy landscape calculation method. J Phys Chem B. 2011;115:8806. doi: 10.1021/jp2008623. [DOI] [PubMed] [Google Scholar]
  21. Hayward S, Kitao A, Hirata F, Go N. Effect of solvent on collective motions in globular protein. J Mol Biol. 1993;234:1207–1217. doi: 10.1006/jmbi.1993.1671. [DOI] [PubMed] [Google Scholar]
  22. Hirao H, Koseki S, Takano H. Molecular dynamics study of relaxation modes of a single polymer chain. J Phys Soc Jpn. 1997;66:3399–3405. doi: 10.1143/JPSJ.66.3399. [DOI] [Google Scholar]
  23. Honda S, Yamasaki K, Sawada Y, Morii H. 10 residue folded peptide designed by segment statistics. Structure. 2004;12:1507. doi: 10.1016/j.str.2004.05.022. [DOI] [PubMed] [Google Scholar]
  24. Ichiye T, Karplus M. Collective motions in proteins: a covariance analysis of atomic fluctuations in molecular dynamics and normal mode simulations. Protein. 1991;11:205–217. doi: 10.1002/prot.340110305. [DOI] [PubMed] [Google Scholar]
  25. Iwaoka N, Hagita K, Takano H. Estimation of relaxation modulus of polymer melts by molecular dynamics simulations: application of relaxation mode analysis. J Phys Soc Jpn. 2015;84:044801. doi: 10.7566/JPSJ.84.044801. [DOI] [Google Scholar]
  26. Kamada M, Toda M, Sekijima M, Takata M, Joe J. Analysis of motion features for molecular dynamics simulation of proteins. Chem Phys Lett. 2011;502:241. doi: 10.1016/j.cplett.2010.12.028. [DOI] [Google Scholar]
  27. Karasawa N, Mitsutake A, Takano H. Two-step relaxation mode analysis with multiple evolution times applied to all-atom molecular dynamics protein simulation. Phys Rev E. 2017;96:062408. doi: 10.1103/PhysRevE.96.062408. [DOI] [PubMed] [Google Scholar]
  28. Kitao A, Hirata F, Go N. The effects of solvent on the conformation and the collective motions of protein: normal mode analysis and molecular dynamics simulations of melittin in water and in vacuum. Chem Phys. 1991;158:447. doi: 10.1016/0301-0104(91)87082-7. [DOI] [Google Scholar]
  29. Kitao A, Go N. Investigating protein dynamics in collective coordinate space. Curr Opin Struct Biol. 1999;9:164. doi: 10.1016/S0959-440X(99)80023-2. [DOI] [PubMed] [Google Scholar]
  30. Komatsuzaki T, Berry RS, Leitner DM. Advancing theory for kinetics and dynamics of complex, many-dimensional systems. Canada: Wiley; 2011. [Google Scholar]
  31. Kottalam J, Case DA. Langevin modes of macromolecules: applications to crambin and DNA hexamers. Biopolymers. 1990;29:1409. doi: 10.1002/bip.360291008. [DOI] [PubMed] [Google Scholar]
  32. Koseki S, Hirao H, Takano H. Monte Carlo study of relaxation modes of a single polymer chain. J Phys Soc Jpn. 1997;66:1631–1637. doi: 10.1143/JPSJ.66.1631. [DOI] [Google Scholar]
  33. Kührova P, Simone AD, Otyepka M, Best RB. Force-field dependence of chignolin folding and misfolding: comparison with experiment and redesign. Biophys J. 2012;102:1897. doi: 10.1016/j.bpj.2012.03.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Lamm G, Szabo A. Langevin modes of macromolecules. J Chem Phys. 1986;85:7334. doi: 10.1063/1.451373. [DOI] [Google Scholar]
  35. Lane TJ, Shukla D, Beauchamp KA, Pande VS. To milliseconds and beyond: challenges in the simulation of protein folding. Curr Opin. 2013;23:58. doi: 10.1016/j.sbi.2012.11.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Lange OF, Grubmüller H. Full correlation analysis of conformational protein dynamics. Proteins. 2007;70:1294. doi: 10.1002/prot.21618. [DOI] [PubMed] [Google Scholar]
  37. Levy RM, Srinivasan AR, Olson WK, McCammon JA. Quasi-harmonic method for studying very low frequency modes in proteins. Biopolymers. 1984;23:1099–1112. doi: 10.1002/bip.360230610. [DOI] [PubMed] [Google Scholar]
  38. Levitt M, Sander C, Stern PS. Protein normal-mode dynamics: trypsin inhibitor, crambin, ribonuclease and lysozyme. J Mol Biol. 1985;181:423. doi: 10.1016/0022-2836(85)90230-X. [DOI] [PubMed] [Google Scholar]
  39. Lindorff-Larsen K, Piana S, Dror RO, Shaw DE. How fast-folding proteins fold. Science. 2011;334:517. doi: 10.1126/science.1208351. [DOI] [PubMed] [Google Scholar]
  40. Lindorff-Larsen K, Margakis P, Piana S, Eastwood MP, Dror RO, Shaw DE. Systematic validation of protein force fields against experimental data. PLos ONE. 2012;7:e32131. doi: 10.1371/journal.pone.0032131. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Lindner B, Yi Z, Prinz JH, Smith J, Noé F. Dynamic neutron scattering from conformational dynamics I: theory and Markov models. J Chem Phys. 2013;139:175101. doi: 10.1063/1.4824070. [DOI] [PubMed] [Google Scholar]
  42. Matsunaga Y, Kidera A, Sugita Y. Sequential data assimilation for single-molecule FRET photon-counting data. J Chem Phys. 2015;142:214115. doi: 10.1063/1.4921983. [DOI] [PubMed] [Google Scholar]
  43. McLachlan AD. Gene duplications in the structural evolution of chymotrypsin. J Mol Biol. 1979;128:49–79. doi: 10.1016/0022-2836(79)90308-5. [DOI] [PubMed] [Google Scholar]
  44. Mitsutake A, Iijima H, Takano H (2005) Principal component analysis and relaxation mode analysis of a peptide. Biophysics. 45: Supplement S214. Abstracts for the 43th Annual Meeting, The Biophysical Society of Japan. (in Japanese)
  45. Mitsutake A, Iijima H, Takano H. Relaxation mode analysis of a peptide system: comparison with principal component analysis. J Chem Phys. 2011;135:164102. doi: 10.1063/1.3652959. [DOI] [PubMed] [Google Scholar]
  46. Mitsutake A, Takano H. Relaxation mode analysis and Markov state relaxation mode analysis for chignolin in aqueous solution near a transition temperature. J Chem Phys. 2015;143:124111. doi: 10.1063/1.4931813. [DOI] [PubMed] [Google Scholar]
  47. Miyashita O, Tama F. Coarse-graining of condensed phase and biomolecular systems. Boca Raton: CRC Press; 2008. pp. 267–267. [Google Scholar]
  48. Moritsugu K, Koike R, Yamada K, Kato H, Kidera A. Motion tree delineates hierarchical structure of protein dynamics observed in molecular dynamics simulation. PLoS ONE. 2015;10:e0131583. doi: 10.1371/journal.pone.0131583. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Mori T, Saito S. Dynamic heterogeneity in the folding/unfolding transitions of FiP35. J Chem Phys. 2015;142:135101. doi: 10.1063/1.4916641. [DOI] [PubMed] [Google Scholar]
  50. Mori T, Saito S. Molecular mechanism behind the fast folding/unfolding transitions of villin headpiece subdomain: hierarchy and heterogeneity. J Phys Chem B. 2016;120:11683. doi: 10.1021/acs.jpcb.6b08066. [DOI] [PubMed] [Google Scholar]
  51. Nagai T, Mitsutake A, Takano H. Principal component relaxation mode analysis of an all-atom molecular dynamics simulation of human lysozyme. J Phys Soc Jpn. 2013;82:023803. doi: 10.7566/JPSJ.82.023803. [DOI] [Google Scholar]
  52. Nagai T, Mitsutake A, Takano H (2009) Relaxation mode analysis of a biopolymer system by molecular dynamics. Biophysics. 49 Supplement S75. (Abstracts for the 47th Annual Meeting, The Biophysical Society of Japan
  53. Naritomi Y, Fuchigami S. Slow dynamics in protein fluctuations revealed by time-structure based independent component analysis: the case of domain motions. J Chem Phys. 2011;134:065101. doi: 10.1063/1.3554380. [DOI] [PubMed] [Google Scholar]
  54. Naritomi Y, Fuchigami S. Slow dynamics of a protein backbone in molecular dynamics simulation revealed by time-structure based independent component analysis. J Chem Phys. 2013;139:215102. doi: 10.1063/1.4834695. [DOI] [PubMed] [Google Scholar]
  55. Natori S, Takano H. Two-step relaxation mode analysis with multiple evolution times: application to a single [n]polycatenane. J Phys Soc Jpn. 2017;86:43003. doi: 10.7566/JPSJ.86.043003. [DOI] [Google Scholar]
  56. Noé F, Horenko I, Schütte C, Smith JC. Hierarchical analysis of conformational dynamics in biomolecules: transition networks of metastable states. J Chem Phys. 2007;126:155102. doi: 10.1063/1.2714539. [DOI] [PubMed] [Google Scholar]
  57. Noé F, Fischer S. Transition networks for modeling the kinetics of conformational change in macromolecules. Curr Opin Struct Biol. 2008;18:154. doi: 10.1016/j.sbi.2008.01.008. [DOI] [PubMed] [Google Scholar]
  58. Noé F, Clementi C. Collective variables for the study of long-time kinetics from molecular trajectories: theory and methods. Curr Opin Struct Biol. 2017;43:141. doi: 10.1016/j.sbi.2017.02.006. [DOI] [PubMed] [Google Scholar]
  59. Okumura H. Temperature and pressure denaturation of chignolin: folding and unfolding simulation by multibaric-multithermal molecular dynamics method. Proteins. 2012;80:2397. doi: 10.1002/prot.24125. [DOI] [PubMed] [Google Scholar]
  60. Pérez-Hernández G, Paul F, Giorgino TG, Fabritiis D, Noé F. Identification of slow molecular order parameters for Markov model construction. J Chem Phys. 2013;139:015102. doi: 10.1063/1.4811489. [DOI] [PubMed] [Google Scholar]
  61. Prinz J, Wu H, Sarich M, Keller B, Senne M, Held M, Chodera JD, Schütte C, Noé F. Markov models of molecular kinetics: generation and validation. J Chem Phys. 2011;134:174105. doi: 10.1063/1.3565032. [DOI] [PubMed] [Google Scholar]
  62. Risken H. The Fokker-Planck equation: methods of solution and applications 2nd Ed. Springer-Verlag. Heidelberg: Berlin; 1989. [Google Scholar]
  63. Zwanzig R. Nonequilibrium statistical mechanics. New York: Oxford university press; 2001. [Google Scholar]
  64. Saka S, Takano H. Relaxation of a single knotted ring polymer. J Phys Soc Jpn. 2008;77:034001. doi: 10.1143/JPSJ.77.034001. [DOI] [Google Scholar]
  65. Sakuraba S, Joti Y, Kitao A. Detecting coupled collective motions in protein by independent subspace analysis. J Chem Phys. 2010;133:185102. doi: 10.1063/1.3498745. [DOI] [PubMed] [Google Scholar]
  66. Satoh D, Shimizu K, Nakamura S, Terada T. Folding free-energy landscape of a 10-residue mini-protein, chignolin. FEBS Letters. 2006;580:3422. doi: 10.1016/j.febslet.2006.05.015. [DOI] [PubMed] [Google Scholar]
  67. Schütte C, Fischer A, Huisinga W, Deuflhard P. A direct approach to conformational dynamics based on hybrid Monte Carlo. J Comput Phys. 1999;151:146. doi: 10.1006/jcph.1999.6231. [DOI] [Google Scholar]
  68. Schwantes CR, Pande VS. Improvements in Markov state model construction reveal many non-native interactions in the folding of NTL9. J Chem Theor Comput. 2013;9:2000. doi: 10.1021/ct300878a. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Schwantes CR, McGibbon RT, Pande VS. Perspective: Markov models for long-timescale biomolecular dynamics. J Chem Phys. 2014;141:090901. doi: 10.1063/1.4895044. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Singhal N, Snow CD, Pande VS. Using path sampling to build better Markovian state models: predicting the folding rate and mechanism of a tryptophan zipper beta hairpin. J Chem Phys. 2004;121:415. doi: 10.1063/1.1738647. [DOI] [PubMed] [Google Scholar]
  71. Suenaga A, Narumi T, Futatsugi N, Yanai R, Ohno Y, Okimoto N, Taiji M. Folding dynamics of 10-residue beta-hairpin peptide chignolin. Chem Asian J. 2007;2:591. doi: 10.1002/asia.200600385. [DOI] [PubMed] [Google Scholar]
  72. Swope WC, Pitera JW, Suits F. Describing protein folding kinetics by molecular dynamics simulations. 1. Theory J Phys Chem B. 2004;108:6571. doi: 10.1021/jp037421y. [DOI] [Google Scholar]
  73. Takano H, Miyashita S. Relaxation modes in random spin systems. J Phys Soc Jpn. 1995;64:3688–3698. doi: 10.1143/JPSJ.64.3688. [DOI] [Google Scholar]
  74. Tama F, Sanejouand YH. Conformational change of proteins arising from normal mode calculations. Protein Engin. 2001;14:1. doi: 10.1093/protein/14.1.1. [DOI] [PubMed] [Google Scholar]
  75. Tama F, Brooks CL., III The mechanism and pathway of pH induced swelling in Cowpea chlorotic mottle virus. J Mol Biol. 2002;318:733. doi: 10.1016/S0022-2836(02)00135-3. [DOI] [PubMed] [Google Scholar]
  76. Tirion MM. Large amplitude elastic motions in proteins from a single-parameter, atomic analysis. Phys Lev Lett. 1996;77:1905. doi: 10.1103/PhysRevLett.77.1905. [DOI] [PubMed] [Google Scholar]
  77. Wu H, Nüske F, Paul F, Klus S, Koltai P, Noé F. Variational Koopman models: slow collective variables and molecular kinetics from short off-equilibrium simulations. J Chem Phys. 2017;146:154104. doi: 10.1063/1.4979344. [DOI] [PubMed] [Google Scholar]
  78. Xia J, Deng JN, Levy RM. NMR relaxation in proteins with fast internal motions and slow conformational exchange: model-free framework and Markov state simulations. J Phys Chem B. 2013;117:6625. doi: 10.1021/jp400797y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Zheng Y, Lindner B, Prinz JH, Noé F, Smith J. Dynamic neutron scattering from conformational dynamics II: application using molecular dynamics simulation and Markov modeling. J Chem Phys. 2013;139:175102. doi: 10.1063/1.4824071. [DOI] [PubMed] [Google Scholar]
  80. Zuckerman DM. Statistical physics of biomolecules: an introduction. New York: CRC Press; 2010. [Google Scholar]

Articles from Biophysical Reviews are provided here courtesy of Springer

RESOURCES