Skip to main content
Biophysics and Physicobiology logoLink to Biophysics and Physicobiology
. 2019 Nov 29;16:310–321. doi: 10.2142/biophysico.16.0_310

Building a macro-mixing dual-basin Gō model using the Multistate Bennett Acceptance Ratio

Ai Shinobu 1, Chigusa Kobayashi 1, Yasuhiro Matsunaga 2, Yuji Sugita 1,3,4,
PMCID: PMC6975896  PMID: 31984186

Abstract

The dual-basin Gō-model is a structural-based coarsegrained model for simulating a conformational transition between two known structures of a protein. Two parameters are required to produce a dual-basin potential mixed using two single-basin potentials, although the determination of mixing parameters is usually not straightforward. Here, we have developed an efficient scheme to determine the mixing parameters using the Multistate Bennett Acceptance Ratio (MBAR) method after short simulations with a set of parameters. In the scheme, MBAR allows us to predict observables at various unsimulated conditions, which are useful to improve the mixing parameters in the next round of iterative simulations. The number of iterations that are necessary for obtaining the converged mixing parameters are significantly reduced in the scheme. We applied the scheme to two proteins, the glutamine binding protein and the ribose binding protein, for showing the effectiveness in the parameter determination. After obtaining the converged parameters, both proteins show frequent conformational transitions between open and closed states, providing the theoretical basis to investigate structure-dynamics-function relationships of the proteins.

Keywords: molecular dynamics simulations, coarse-grained model, structure-based potential, protein conformational transitions, multiple basin potentials


Significance.

Structure-based coarse-grained (CG) Gō models are used with great success to sample protein dynamics on long time scales. Here, we combine two Gō type single-basin potentials to sample conformational transitions between two states of a protein. CG models are highly parameter-dependent and detecting appropriate parameters is a time-consuming task. In this study, we propose a scheme for parameter refinement of CG models using the Multistate Bennett Acceptance Ratio (MBAR), a statistical weighing method. Using MBAR, free energy profiles can be calibrated without simulations. We demonstrate conformational transitions between two reaction states of soluble proteins by applying the scheme.

Proteins often undergo conformational transitions during their function [15]. In many cases, the binding or unbinding of a ligand induces a large amplitude domain-level transition between active and inactive states [611]. Elucidating the mechanism for such conformational transitions connects protein structural information with functional understanding and opens the way for interference with existing processes and design of new compounds.

Molecular dynamics (MD) simulations are widely used to investigate conformational dynamics of proteins [1218]. Recent advancement in the development of high-performance computers has sparked the interest in simulating the dynamics of systems of enormous size scales such as large biomolecular complexes and biomolecules in cellular environment [1923]. However, all-atom MD simulations are able to simulate conformational dynamics that happen for several tens of microseconds using conventional PC-clusters or super-computers. For reaching the millisecond time scale, in which large-scale conformational motions of proteins occur, MD simulations based on coarse-grained (CG) models [2429] are more suitable [24,3033].

Structure-based off-lattice Gō models [3436] are often used in CG MD simulations with remarkable success [3644]. Based on the original Gō model [4547], the potential function used in such simulations consists of an attractive potential between non-bonded contacts that exist in the native structure (native contacts) and a repulsive potential between all other non-bonded pairs. The simulations realize funnel-shaped free energy landscapes for protein folding and dynamics, successfully explaining many experimental results [4850]. A revised model was developed by Karanicolas and Brooks [51,52], in which residue-type-dependent knowledge-based energy terms (the Miyazawa-Jernigan contact energies [53]) are applied to the native contacts (hereafter, we refer to it as KBGo model). Folding pathways for proteins predicted using this model were in excellent agreement with experimental results. Moreover, the robustness of the folding pathways against competing non-native interactions was demonstrated. The Domain Motion Enhanced (DoME) model [54], developed more recently, emphasizes inter-domain motions while keeping intra-domain regions relatively rigid. This model is, in particular, useful for simulating large-scale domain motions of proteins, by making the simulation robust against temperature changes.

The off-lattice Gō models presented above predominantly stabilize a single conformation and are referred to as single-basin models. For simulating transitions between two stable states, two single-basin potentials are mixed to form a dual-basin potential. The mixing of potentials can be obtained using two main approaches, the microscopic (micro-mixing) and macroscopic (macro-mixing). In micro-mixing [39,5557], energetic terms are added as individual terms to form the mixed potential. The mixed potential is in fact a single-basin potential of one state, to which contact energies of the other state are added as perturbations. In macro-mixing [37,38,43,58,59], the two single-basin potentials are predefined and all energetic terms are coupled to each other in the mixed potential. The micro-mixing approach is suitable for systems with a high degree of similarity, in which transitions between the two states can be described by addition of contacts. The macro-mixing approach is often successful in describing large-scale transitions with little overlap between the contact sets of the two states. For example, the sheet to helix transition in Arc repressor was successfully described using the macro-mixing potential [37].

The exponential Boltzmann weighing [37,43,60] (exponential mixing) is a macro-mixing approach in which a dual-basin potential is constructed by summing the partition functions of the two single-basin potentials. In the superposition approach [38,59], the mixed potential is resolved by solving an eigenvalue equation. Both schemes ensure a smooth transition between the two basins. Structure-based CG models in general heavily rely on system-dependent parameters, which are determined properly for successful MD simulations. Conventionally, the parameter determination in exponential mixing is done by trial and error, by performing multiple rounds of short simulations with different parameters while assessing the frequency of conformational transitions. This laborious process can be very time consuming, especially for large systems or for potentials in which more than a single parameter needs to be determined.

Here, we apply the Multistate Bennett Acceptance Ratio (MBAR) analysis method [61] for an efficient determination of parameters in the exponential-mixing dual-basin potential. The scheme that we propose consists of short simulations with multiple candidates of mixing parameters and an estimation of improved parameters based on MBAR. MBAR statistically calculates the optimal estimators for computing free energies of unsampled data using sampled data at different conditions. Without performing an enormous number of simulations, this scheme can sample the parameter space efficiently and allows a severalfold faster parameter determination. We apply this scheme to the dual-basin Gō simulations of the Glutamine Binding Protein [6] (GBP, Fig. 1) and the Ribose Binding Protein [62,63] (RBP, Supplementary Fig. S1), and discuss the quality of the predicted parameters and the stability of the extended sampling simulations.

Figure 1.

Figure 1

Structures of GBP in the unbound (left side at each panel) and bound (right side) forms, colored according to secondary elements (A) or domains (B). PDB codes written below the protein structures. RMSD values were calculated between the unbound and bound forms, fitted by their Cα atoms. The bound form in panel A shows the bound ligand (glutamine) in a stick representation. Domain classification in panel B was performed according to the Motion Tree [64,65]. Domain name notations were taken from Ref. [6].

Theory and Methods

The potential energy function of the dual-basin structure-based Gō model

We use the single-basin KBGo potential [51,52] as described in the original works [37,43]. The DoME model [54] potential carries the same functional form as the KBGo potential. It differs only in the non-bonded native contact parameters, which are set inversely proportional to the magnitude of inter-domain motions (and a constant value for intra-domain contacts). Domain classification is obtained from the Motion Tree calculation [64,65].

The macro-mixing approach of exponential Boltzmann weighing [37] (exponential mixing) for mixing two single-basin potentials, which we use in the study, is shown in Eq. (1).

exp (-1kBTmixE(R))=exp [-1kBTmix(V1(R)+C)]+exp [-1kBTmixV2(R)], (1)

where kB is the Boltzmann constant, V1 and V2 are the single-basin potentials to be mixed, and Tmix and C are the mixing parameters. Tmix is related to the barrier height between the two basins (a lower value corresponds to a higher barrier). C determines the energetic offset between the basins. The mixed potential function was implemented while adhering to all the details described in the original works [37,43]. We define the open (OP) basin as basin 1 and the closed (CL) basin as basin 2.

Parameter searching

Given the structures in the OP and CL states, we need to determine a pair of parameters (Tmix, C), which allow proper transitions between the two states under the dual-basin exponential mixing potential. The overall scheme for determining the mixing parameters is presented in Figure 2. Our workflow consists of three major stages that are: the manual detection of an approximate range of proper parameters (stage C and Supplementary Fig. S2), the MBAR-assisted parameter refinement (stage D and D1–D11 in Supplementary Fig. S3), and the final selection of parameters (stage E and E1–E4 in Supplementary Fig. S3). In order to estimate the potential of mean force (PMF) from MBAR, both OP and CL states as well as transitions between them needs to be sampled. Thus, manually detecting an approximate parameter range is required (stage C) prior to using the MBAR analysis for the parameter refinement (stage D). In both stages C and D, the approach is to iteratively perform simulations with a set of parameters, asses their quality, set modified parameters, and use them for the next round.

Figure 2.

Figure 2

Overall flowchart describing the procedure presented in this work. Detailed schemes are shown in Supplementary Figures S2 and S3. Letters assigned to each stage are compatible to naming of stages in Supplementary Figures S2 and S3.

We use the distance root-mean-square displacement [66] (dRMS) as a reaction coordinate for quantifying the similarity of frames during the simulations to the OP (dRMSOP) and CL (dRMSCL) native structures. dRMS is calculated between atom pairs, which in the reference state were separated by at least four atoms in sequence and between 6–50 Å in space, and for which the difference in distance between the two reference states was larger or equal to 5 Å. Along the manuscript, time-series of dRMSCL is used for visualizing state populations during each simulation.

The equilibrium constant Keq between the CL and OP states is defined as:

Keq=pCLpOP, (2)

where pCL and pOP are the populations in the CL and OP states, respectively. The populations at each dRMSCL bin are obtained from the PMF along dRMSCL, using the relation dG=−kBTlnp. Trajectory frames are assigned to either the OP or CL states according to their dRMSCL values, where the cutoff value is determined at the location of the barrier in between the two basins of the PMF plot. It is possible to determine proper mixing parameters for any target Keq. In this study, we set the target Keq value to 1.

Manual parameter search until two states are sampled

At this stage (C in Fig. 2 and Supplementary Fig. S2), we detect Tmix and C for an approximate range, which produces transitions between the two states within a single simulation. We start by using large increments of extreme values and gradually narrow the search range by reducing the increments until an acceptable range is detected. At each round, we perform a series of simulations (five in this case) with different sets of parameters, and initiate simulations from both the OP and CL states (ten simulations in total). Tmix or C are fixed alternatively at each round, while the other parameter is varied.

Parameters for the next round are determined according to the behavior of the current round by varying their values according to the direction which leads to proper state transitions. In case one of the states is sampled in excess, C is varied in the direction which stabilizes the other state, or in case transition frequency is too low, Tmix is increased.

MBAR-assisted parameter refinement

Once two states are sampled, the MBAR analysis method is used for speeding up the parameter search process (D in Fig. 2 and D1–D11 in Supplementary Fig. S3).

The MBAR method [61] allows to estimate the expectation of any physical quantity 〈A(x)〉 (in this case, the PMF) for an unsimulated condition by reweighing simulated data performed under various conditions (in this study, conditions refer to combinations of Tmix and C).

The first step of MBAR method estimates the free energies i of simulated conditions by iteratively solving the following nonlinearly coupled equations:

f^i=-lnj=1Kn=1Njexp [-ui(xjn)]k=1KNkexp [f^k-uk(xjn)] (3)

where i and j indicate conditions, and K is the total number of conditions. Nj is the total number of configurations (simulation frames) in condition j. ui is the reduced potential energy defined as:

ui(x)=βEi(x)=1kBTEi(x), (4)

kB is the Boltzmann constant, and T is the simulation temperature. Ei(x) is the potential energy for condition i (here, combinations of Tmix and C). We mark this step as MBAR1.

In the next step, we can estimate the expectation of a physical quantity 〈A(x)〉 for an unsimulated target condition (marked by ‘estim’ and a hat symbol) by calculating the ratio of partition functions using the results of eq. (3) [61],

A(x)^estim=A(x)exp[-uestim(x)]dxexp [-uestim(x)]dx=jnwjnA(xjn) (5)

Here, for the estimation of the PMF, indicator functions are employed for A(x) along the axis of the dRMS. uestim(x) is the reduced potential energy in the unsimulated target condition in which we attempt to estimate the PMF. wjn is the weight of configuration n in condition j with respect to the target condition. A(x)^estim is the estimated probability density defined as the expectation of an indicator function. We mark this step as MBAR2. MBAR1 and MBAR2 were performed using the MATLAB MDToolbox (https://github.com/ymatsunaga/mdtoolbox) [67].

The procedure in the MBAR-assisted refinement stage follows a similar progression as the preliminary manual stage. However, instead of manually guessing the parameters for the next round and confirming by MD simulation, we perform several iterations of MBAR2. Once the weight factors are obtained by solving Eq. (3) using simulation trajectories, iterations for estimation of parameters can be performed without simulations (D2–D7 in Supplementary Fig. S3). The inclusion of MBAR-assisted parameter estimation reduces the number of MD simulation rounds with unsuccessful parameter choice.

The Keq value for an estimated parameter set (KMBAR) is obtained from the estimated PMF (PMFMBAR). This process of guessing Tmix and C and estimating KMBAR is performed iteratively until the target Keq (Ktarget) is included in the range of KMBAR.

The results of each simulation round are assessed and a decision is made on whether to end the parameter search (step D10 in Supplementary Fig. S3). For each of the simulations in the round, in addition to Ksim values, dRMSCL and dRMSOP are calculated at the free energy minimum of each of the two basins. The conditions for ending the refinement stage are as follows: 1) Ksim for all simulations are in an acceptable range around Ktarget (in this case we used the range 0.2–5), 2) There is at least one pair of Tmix and C for which Ksim is close enough to Ktarget (in this case we allowed a deviation of 25% from Ktarget), and 3) both basins’ minima are located outside the intermediate basin region in the (dRMSCL, dRMSOP) space, defined as the rectangular area confined by 2 Å at the bottom limit and the dRMS value between the native states in the top limit. This ensures that the simulated ensembles do not sway too far from the native states.

Final selection of mixing parameters

Once a satisfactory parameter pair is detected in stage D, all simulations of the last round are elongated to 2 μs, Ksim values are calculated, and the parameter pair used for the simulations for which Ksim values have the smallest average (between the OP and CL simulations) deviation from Ktarget is selected as the final parameter set.

Simulation details

For both target systems GBP and RBP, crystal (native) structures of the OP (unbound) and CL (bound) states are available (PDB codes: 1GGG/1WDN [6] and 1URP [63]/ 2DRI [62] for OP/CL of GBP and RBP, respectively, Fig. 1A and Supplementary Fig. S1A). Residues 5–224 were used for GBP (one residue from the N-terminus and two from the C-terminus were removed from the CL structure) and 1–271 for RBP. dRMS values between the native structures were 8.964 Å and 7.125 Å for GBP and RBP, respectively. The Cα models were created and single-basin KBGo potentials were built using the MMTSB server (https://mmtsb.org/) [51,52].

Initial values for Tmix and C were set in ranges of 15000 to 30000 and −50 to +50, respectively. Values were chosen based on a previous study [43], in which macro-mixing parameters for Adenylate Kinase (AdK) were determined.

Simulations were performed using the MD program GENESIS [68,69], in which both single-basin and dual-basin potential schemes were implemented. Simulations were performed in the NVT ensemble using the Langevin thermostat. The timestep of integration was 0.02 ps. All bonds were constrained using the SHAKE [70] algorithm. Native interactions were calculated without truncation, whereas repulsive non-native interactions were truncated at a distance of 20 Å.

The simulation temperature in the dual-basin potential was set to ~0.9 Tf. Tf is the folding temperature of the protein, and was determined in a 500 ns single-basin potential simulation in the KBGo model as the temperature for which the protein shares its time equally between the folded and unfolded states. Unfolding was identified using the RMSD of the protein to its own initial native structure. If the OP and CL simulations exhibite different Tf values, the lower value of the two is used. Dual-basin simulations were performed at 240 K and 260 K for GBP and RBP, respectively. After identifying the parameters which satisfy the mixing criteria, we performed a 10 μs sampling simulation with the final parameters. The final parameters and Keq calculated for the sampling simulations are summarized in Table 1 for all four cases.

Table 1.

Final parameters for macro-mixing simulations

System Property KBGo DoME
GBP Cfinal −22.4 −29.0
Tmix,final 21000 25000
Ksim,OP/CL 1.25/1.25 1.00/1.12

RBP Cfinal −1.12 −11.25
Tmix,final 4500 7000
Ksim,OP/CL 0.85/0.86 1.15/0.89

Results and Discussion

Acceleration of parameter searching using MBAR

Figure 3 shows the dRMSCL time-series for the parameter search rounds (C–E in Fig. 2 and Supplementary Figs. S2 and S3) for GBP in the KBGo model. For this system, two manual iterations (stage C) were required for sampling two states, and five rounds of MBAR-assisted iterations and an additional 2 μs elongation for identifying Tmix,final and Cfinal.

Figure 3.

Figure 3

Time-series of dRMSCL for simulation rounds performed for determining the macro-mixing parameters for GBP in the KBGo model. Five simulations per initial structure (OP and CL, left and right panels at each round, respectively) were performed at each round. At each round one parameter was fixed for all simulations (a single value which is written above the plot) and the other was varied (five values written in order of top to bottom in the dRMS plots). The length of the simulation for all rounds except round 8 was 500 ns. “manual”, “MBAR” and “2 μs elongation” stand for the three stages along the parameter tuning process. “manual” for the manually adjusting the parameters until two states are sampled (stage C in Fig. 2). “MBAR” for stages in which parameters were estimated using MBAR analysis using simulation data of the previous stage (stage D), and “2 μs elongation” for elongating the last “MBAR” round (round 7) to confidently determine a single set (Tmix, C) of optimal parameters (stage E).

In round 1 (stage C1 in Supplementary Fig. S2), observing dRMSCL values, a few conformational transitions between the OP and CL states occur in the simulations with C=−25, while for C=−50 the protein resides solely in the OP state, and for C=0, +25, +50 solely in the CL state. We selected C=−25 and executed simulations with different Tmix values in round 2 (stage C5). Transitions are observed at simulations with Tmix=15000 (only OP simulation), 20000, and 25000 (both OP and CL). At this stage, the procedure advances to the MBAR-assisted simulations rounds. In rounds 3–7 (stages D2–D11 in Supplementary Fig. S3), we performed simulations with parameters selected by estimating PMFMBAR and KMBAR using i given from the previous simulations (stage D8). During these rounds, conformational transitions are observed 50–100 times per simulation. In round 7, criteria for ending the search are fulfilled where a pair of simulations produced Ksim values within the desired range (encircled). In round 8, the set of simulations with identical conditions to round 7 were performed for 2 μs, after which Tmix,final and Cfinal were selected according to the simulation pair that produced Ksim values closest to Ktarget (encircled). A single simulation of 500 ns required roughly 100 minutes to complete on 8 cores of Intel® Xeon® E5-2680 v3 CPUs. The available computing resources enable running all 10 simulations of a round in parallel. Thus, detecting Tmix,final and Cfinal (not including the 10 μs-long sampling) for the current system required approximately 18 hours.

The MBAR2 iterations (D2–D7 in Supplementary Fig. S3) serve as substitutes for MD simulations and therefore constitute the core of the speedup achieved in the current method. The number of MBAR2 rounds for the current system was between 2–4 per round (data not shown). Assuming that each MBAR2 iteration is equivalent to one round of MD simulation, we would need to perform from 10 (2 MBAR2 rounds ×5 rounds of MBAR-assisted rounds) to 20 (4×5) additional simulation rounds per system. Therefore, without using the MBAR analysis, we estimate that an extra simulation time of 50–100 μs will be required. Using the MBAR analysis results in substantial acceleration of the parameter searching process. This effect is expected to be even more prominent for larger systems in which physical simulation times are severalfold longer. The MBAR analysis calculation time is independent of system size and is negligible comparing to MD simulation times, thus the reduction in total calculation time is expected to be even greater.

Characterizing the quality of the MBAR-prediction

PMFs along dRMSCL and the values of Keq calculated from the PMFs (Ksim) are shown for each round of the parameter searching process in Figure 4. In rounds 1–2, PMFsim does not cover both the OP and CL states and Ksim values (in cases where they could be calculated) are far from the targeted Keq. MBAR-predicted PMF (PMFMBAR) plots are shown in black for rounds 3–8. In round 3, there is poor overlap between PMFsim and PMFMBAR. The overlap becomes progressively better with the rounds. Figures 5A and 5B present values for Ksim and the difference between Ksim and KMBAR, respectively. From the figures we learn that the MBAR-assisted search drives Ksim closer to Ktarget and Ksim closer to KMBAR.

Figure 4.

Figure 4

PMF vs. dRMSCL, presented for simulation rounds performed for determining the macro-mixing parameters for GBP in the KBGo model. Panels representing each round correspond to those in Figure 3. Black curves (“MBAR”, for rounds 3 to 8) represent PMF for each parameter set (Tmix, C), estimated by the MBAR analysis performed using data from the previous round (PMFMBAR). Red (OP) and blue (CL) curves represent PMF calculated from simulation data of the current round (PMFsim). Keq values per each individual simulation are written on the right side of their respective PMF plots. For the three bottom simulations of round 1 the OP state was not sampled, thus Keq is not defined (marked as “#”). Encircled text represent Ksim values which satisfied the criterion for ending the MBAR-assisted search stage (round 7) and the Ksim values of the simulations with parameters selected as Tmix,final and Cfinal.

Figure 5.

Figure 5

Ksim (A) and Ksim-KMBAR (B) for rounds 3–9 of the GBP KBGo parameter search (round numbers are according to the notation in Figure 3, round 9 is the 10 μs sampling simulations with Tmix,final, Cfinal). Each point represents calculated value from one simulation. Data from simulations starting from the OP and CL states are shown in red and blue, respectively. Horizontal lines represent the value of Ktarget (A), and the location of ‘0’ (B). For round 3 in B: four of the data points have a very large difference between the predicted KMBAR and simulated Ksim (−165, −163, −10.8, and −11.7) and are not shown in the figure.

We characterize the quality of the MBAR prediction by the similarity between PMFMBAR and PMFsim. There are two aspects which influence the quality of prediction. The first is the similarity between the parameters with which the weight factor i is calculated (data from the previous simulation round) and the parameters for which MBAR2 predicts the PMF for (to be used in the current simulation round). Higher similarity between the two parameter sets results in a more accurate prediction. Indeed, values of Tmix ranging from 5000 to 25000 were used for estimating PMFMBAR in round 3, whereas a much smaller Tmix range (18000–23000) was used for all later rounds. This illustrates the importance of narrowing the parameter range for achieving accuracy in the prediction.

Another aspect regarding the quality of prediction lies in the frequency of transitions between the basins. When one basin is exceedingly stabilized with respect to the other, transitions to the less stable basin are difficult and the chances of the simulation residing in the stable basin for prolonged times increase. This brings to uneven sampling, which results in poor convergence between the two simulations as well as with predicted properties (PMFMBAR). This is observed during the MBAR-assisted parameter searching where parameters which produce large or small KMBAR values are used. For example, the top simulation in round 5 in Figure 4 for which KMBAR=1.52 and the two Ksim values deviate both from each other and from KMBAR (2.66 and 0.74 for the OP and CL simulations, respectively). This issue can be resolved by sufficient simulation time. Thus, if the desired Ktarget is much larger or smaller than 1, longer simulation times should be used for an accurate parameter determination.

We start the search with a wide range of parameters (5000 to 25000 and −50 to +50 for Tmix and C, respectively), and gradually narrow the search until the desired behavior (Keq) is obtained with good enough precision, finally converging to parameter values of 21000 and −22.4 for Tmix and C, respectively. This ensures the robustness of the method in a sense that the final answer does not depend on the choice of initial guess. Structure-based models such as the one used here are constructed from structural information of the protein. The number of contacts, as well as the boundaries of the rigid domains are different from system to system. Therefore, the parameter values differ according to the system. A robust method, such as the one presented here can be advantageous for constructing a suitable structure-based model.

Figure 6 presents free energy minima near each basin in the dRMSCL/dRMSOP space. The most populated configuration near each basin (the configuration with the lowest free energy) does not penetrate the “intermediate region” (the grey area) for all but one simulation (in round 7 using C=−22.3). This indicates that the original native states are sampled accurately along all simulation rounds. Near the OP basin, most simulations sample configurations closer to the CL state than the native OP structure, whereas near the CL basin, simulations are spread evenly around the native configuration.

Figure 6.

Figure 6

Location of the free energy minima near the OP (bottom right) and CL (top left) basins. Data for different simulation rounds are shown in different colors. The gray area is the rectangle confined within 2 Å from the bottom and the dRMS between the native structures from top (the “intermediate region”).

Characterizing the behavior of the mixed potential in the extended sampling simulation

Figure 7 and Table 2 display various properties calculated for the 10 μs-long sampling simulations with Tmix,final and Cfinal. The time-series of dRMSCL in Figure 7A displays frequent transitions between the two states. The transition rate was calculated as 157 and 156 μs−1, showing an excellent convergence between the OP and CL simulations. The total number of transitions is above 1500 for both simulations, which is sufficient for accurately calculating properties which characterize the systems’ behavior. Maximal residence times were calculated as 115 and 160 ns for the OP and CL simulations, respectively, constituting under 2% of the whole simulation time. Transition rates obtained in the current study are higher than those of actual conformational transitions (which occur at time ranges of milliseconds). Indeed, simulating realistic transition rates is still challenging with the currently available computational resources. In the current work, we focus on simulating enough transition events that will allow us to characterize transition paths, or to treat very large systems.

Figure 7.

Figure 7

Results for the 10 μs sampling simulations using the final macro-mixing parameters for GBP with the KBGo model. A) Time-series of dRMSCL for simulations starting from the OP (top) and the CL (bottom) states. B) Two-dimensional PMF surfaces along dRMSCL (x-axis) and dRMSOP (y-axis). Black circles represent dRMS values of the most probable structures from the single-basin simulations of the OP and CL states (structure closest to the free energy minimum near each basin). Representative structures are shown for the unbound (OP), bound (CL) for the frame closest to the free energy minimum of each basin, and for an intermediate state residing along the path connecting the end states. Domain coloring corresponds to Figure 1B. C) PMF vs. dRMSCL, calculated for 0.5, 1, 2, 4, and 10 μs along the sampling simulation.

Table 2.

Properties calculated for 10 μs simulations of GBP in KBGo model using Tmix,final and Cfinal

Property OP simulationa CL simulationa
Ksim 1.25 1.25
Transition rateb,c, μs−1 157 156
dRMSnear,OPb,d, Å 1.30 1.35
dRMSnear,CLb,d, Å 1.40 1.74
Average residence timeb,c, ns 6.38±0.21 6.39±0.22
Maximal residence timeb,c, ns 115 160
a

Initial structure from which the simulation started.

b

OP and CL states during the simulations assigned according to the native state to which the dRMS of the frame was smaller.

c

Not including short transitions (<0.4 ns).

d

dRMS calculated at the minimum of free energy near each basin, with respect to the native structure of each basin.

Figure 7B shows two-dimensional free energy surfaces in the dRMSCL/dRMSOP space. The dual-basin simulations near the CL basin sample configurations very similar to those sampled in the single-basin simulation, with the most populated configuration approximately overlapping with that in the single-basin simulation (shown as black circles). For the OP state, the most populated configuration is located closer to both the CL and the OP native states than the single basin simulations. This is likely due to the fact that under the single-basin potential, fluctuations towards the opposite direction (excessive opening in the OP state) are larger than in the dual-basin where the two domains are held more tightly against each other.

An estimated path for the states transition can be detected in Figure 7B as the line connecting the two states. From the dRMS time-series (Figs. 3 and 7A) it is apparent that coordinates shift instantaneously without sampling intermediate states. Thus, the intermediate states visible in panel B are not states which were sampled during transitions, but rather fluctuations within each basin and the most probable path is estimated as the line connecting the fluctuations. Below panel B, representative structures are shown for the OP and CL most populated configurations and for an intermediate state, represented by a configuration with dRMS values residing midway between the two native states.

Free energy landscapes of GBP [71] and RBP [72] obtained from simulations using all-atom force fields show two basins near the crystal structures with no intermediate states in between the basins. In this aspect, our model resembles the results of the all-atom simulations.

Final Ksim values were 1.25 for both the OP and CL simulations (Table 2), slightly above the MBAR-predicted value of 0.96, but still within the allowed deviation range. Figure 7C shows PMF plots for the sampling simulation after 0.5, 1, 2, 4, and finally 10 μs. We observe that the OP and CL PMFs converge as the simulation time increases. Also, the transition state becomes progressively more defined. It takes approximately 4 μs until full convergence of the two simulations is observed. MBAR-assisted parameter determination was performed with 0.5 μs-long simulations, where convergence is not full. For obtaining a more accurate prediction of Keq, one should either increase the length of each individual MBAR-assisted round simulation, or increase the number of MBAR-assisted rounds while modifying the guessed parameters in smaller increments.

The effect of system and model on the mixing behavior

We also applied the procedure to GBP in the DoME model and to RBP in the KBGo and the DoME models. The number of required rounds and simulation times are shown for all four systems in Supplementary Tables S1 and S2.

Supplementary Figure S4A shows PMF plots for the 10 μs-long simulations with Tmix,final and Cfinal for GBP and RBP in the KBGo and DoME models. The dual-basin potential is asymmetric for all cases, with the OP basin higher in energy and broader than the CL basin. Ksim values close to 1 imply equal populations between the two basins. We note the relation between the basins’ relative heights and the breadths where the narrower basin will be lower in energy in order to compensate for a smaller number of available configurations. This effect is demonstrated in Supplementary Figure S4A in which PMFs of GBP (top) and RBP (bottom) are displayed, and the OP basin in RBP appears broader and higher in energy than the GBP OP basin.

The position of the two single-basin potentials relative to each other is controlled by the mixing parameters, which are optimized to produce the desired mixing behavior. However, the shapes of the individual basins are innate to the system and carried on from the single-basin potentials. As noted earlier, in RBP, the OP basin is broader than in GBP (Supplementary Fig. S4A). Observing the two-dimensional free energy surfaces (Supplementary Figs. S4B and C) it is apparent that in RBP the OP state highly fluctuates away from its native configuration (compared to GBP in Fig. 7B). This behavior occurs in the single-basin simulations as well (data not shown).

The separation between the basins is related to the magnitude of the transition (characterized by the RMSD between the two native states). Distant basins will have a higher transition state. In order to facilitate transitions, the barrier is lowered by increasing Tmix. This is the case for GBP, in which the OP and CL basins are farther apart (RMSD of 5.3 Å compared to 4.1 Å in RBP), therefore its Tmix is higher (see Table 1). Lowering the barrier by increasing Tmix increases the population of intermediate states. Thus, the upper limit for Tmix will be as high as the native states can still be sampled accurately. This means that the transition rate can only be controlled to some extent, and is largely dictated by the magnitude of the transition. Systems with larger transition magnitudes will naturally display slower transition rates and longer simulation times are required for parameter searching as well as for the actual sampling.

Finally, comparing between the KBGo and the DoME models, we observe that C is more negative for the DoME model (the CL state is naturally more stable so the OP state needs to be stabilized with respect to it). In the DoME model, fluctuations near each basin are smaller, therefore the basins are narrower (Supplementary Fig. S4A). This mostly affects the OP basin which has less contacts, which effectively makes the OP basin narrower than in the KBGo model, reducing its population. Agreeing with the relation between basin population and relative height discussed previously, reduction of population should be compensated by lowering the basin energy.

Practical issues in the current scheme

For the systems used in the current study, which had transition RMSDs of 4–5 Å, short iterations of 500 ns MD simulations and an additional final round of 2 μs are sufficient for accurately determining the parameters. We demonstrated that using the detected parameters, the two basins are sampled accurately, and transition rate is fast enough for free energies to converge. For larger systems, barriers are naturally higher and longer simulation times will be required in order not to compensate for accuracy of basin sampling.

Other issue to be considered in the scheme is the target population ratio between the states. In the current study, the ratio was set to 1, but can, in principle be set to any desired value. This aspect is important because this way, the model can be calibrated against experimental values to construct a system which exhibits a realistic behavior. The rate is not explicitly controlled in the current application but can be controlled to some extent. The structural based CG MD simulations heavily rely on the system-dependent parameters. The target population ratio might be one of such parameters in the simulations for other proteins or protein complexes.

Conclusions

In this study, we have developed a scheme to determine parameters for exponential mixing dual-basin Gō model potential using the MBAR analysis method. Using MBAR, we can perform fast iterations for predicting the behavior of a hypothetical simulation, reducing the total simulation time. We estimate that parameter searching can be accelerated by at least three-fold using the procedure to the two protein systems. The systematic scheme for determining mixing parameters in dual-basin Gō model potential makes the CG MD simulation more powerful in future applications. The systematic scheme developed here is generally applicable to the parameter determinations that are necessary in other types of simulations.

Supplementary Material

16_310_1.pdf (966.7KB, pdf)

Acknowledgment

The computer resources of K computer were provided by the RIKEN Center for Computational Science through HPCI System Research project (Project ID: ra000009). We thank the RIKEN Integrated Cluster of Clusters (RICC) and HOKUSAI GreatWave for additional computer resources. This research was supported in part by RIKEN pioneering researches of “Glycolipidologue” and “Dynamic Structural Biology” (to YS), a Grant-in-Aid for Scientific Research (C) (JSPS KAKENHI Grant no. 16K05526) (to CK).

Footnotes

Informed Consent

The authors declare no conflict of interest.

Author Contribution

Y. S. directed the entire study. A. S., C. K., Y. M., and Y. S. co-wrote the manuscript. A. S., C. K., and Y. M. wrote the simulation program and implemented potentials, mixing schemes and MBAR analysis code. A. S. performed the simulations and the analysis.

References

  • 1.Thirumalai D, Lorimer GH. Chaperonin-mediated protein folding. Annu Rev Biophys Biomol Struct. 2001;30:245–269. doi: 10.1146/annurev.biophys.30.1.245. [DOI] [PubMed] [Google Scholar]
  • 2.Huse M, Kuriyan J. The conformational plasticity of protein kinases. Cell. 2002;109:275–282. doi: 10.1016/s0092-8674(02)00741-9. [DOI] [PubMed] [Google Scholar]
  • 3.Margittai M, Widengren J, Schweinberger E, Schröder G, Felekyan S, Haustein E, et al. Single-molecule fluorescence resonance energy transfer reveals a dynamic equilibrium between closed and open conformations of syntaxin 1. Proc Natl Acad Sci USA. 2003;100:15516–15521. doi: 10.1073/pnas.2331232100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Munro JB, Sanbonmatsu KY, Spahn CM, Blanchard SC. Navigating the ribosome’s metastable energy landscape. Trends Biochem Sci. 2009;34:390–400. doi: 10.1016/j.tibs.2009.04.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Ma B, Nussinov R. Enzyme dynamics point to stepwise conformational selection in catalysis. Cur Opin Chem Biol. 2010;14:652–659. doi: 10.1016/j.cbpa.2010.08.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Sun Y-J, Rose J, Wang B-C, Hsiao C-D. The structure of glutamine-binding protein complexed with glutamine at 1.94 Å resolution: comparisons with other amino acid binding proteins. J Mol Biol. 1998;278:219–229. doi: 10.1006/jmbi.1998.1675. [DOI] [PubMed] [Google Scholar]
  • 7.Mowbray SL, Björkman AJ. Conformational changes of ribose-binding protein and two related repressors are tailored to fit the functional need. J Mol Biol. 1999;294:487–499. doi: 10.1006/jmbi.1999.3271. [DOI] [PubMed] [Google Scholar]
  • 8.Kern D, Zuiderweg ER. The role of dynamics in allosteric regulation. Curr Opin Struct Biol. 2003;13:748–757. doi: 10.1016/j.sbi.2003.10.008. [DOI] [PubMed] [Google Scholar]
  • 9.Wong L, Lieser SA, Miyashita O, Miller M, Tasken K, Onuchic JN, et al. Coupled motions in the SH2 and kinase domains of Csk control Src phosphorylation. J Mol Biol. 2005;351:131–143. doi: 10.1016/j.jmb.2005.05.042. [DOI] [PubMed] [Google Scholar]
  • 10.Hanson JA, Duderstadt K, Watkins LP, Bhattacharyya S, Brokaw J, Chu J-W, et al. Illuminating the mechanistic roles of enzyme conformational dynamics. Proc Natl Acad Sci USA. 2007;104:18055–18060. doi: 10.1073/pnas.0708600104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Boehr DD, Nussinov R, Wright PE. The role of dynamic conformational ensembles in biomolecular recognition. Nat Chem Biol. 2009;5:789–796. doi: 10.1038/nchembio.232. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.MacKerell AD, Jr, Bashford D, Bellott M, Dunbrack RL, Jr, Evanseck JD, Field MJ, et al. All-atom empirical potential for molecular modeling and dynamics studies of proteins. J Phys Chem B. 1998;102:3586–3616. doi: 10.1021/jp973084f. [DOI] [PubMed] [Google Scholar]
  • 13.Ponder JW, Case DA. Adv Protein Chem. Vol. 66. Elsevier; 2003. Force fields for protein simulations; pp. 27–85. [DOI] [PubMed] [Google Scholar]
  • 14.Phillips JC, Braun R, Wang W, Gumbart J, Tajkhorshid E, Villa E, et al. Scalable molecular dynamics with NAMD. J Comput Chem. 2005;26:1781–1802. doi: 10.1002/jcc.20289. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Adcock SA, McCammon JA. Molecular dynamics: survey of methods for simulating the activity of proteins. Chem Rev. 2006;106:1589–1615. doi: 10.1021/cr040426m. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Klepeis JL, Lindorff-Larsen K, Dror RO, Shaw DE. Long-timescale molecular dynamics simulations of protein structure and function. Curr Opin Struct Biol. 2009;19:120–127. doi: 10.1016/j.sbi.2009.03.004. [DOI] [PubMed] [Google Scholar]
  • 17.Hospital A, Goñi JR, Orozco M, Gelpí JL. Molecular dynamics simulations: advances and applications. Adv Appl Bioinform Chem. 2015;8:37–47. doi: 10.2147/AABC.S70333. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Abraham MJ, Murtola T, Schulz R, Páll S, Smith JC, Hess B, et al. GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX. 2015;1:19–25. [Google Scholar]
  • 19.Shaw DE, Deneroff MM, Dror RO, Kuskin JS, Larson RH, Salmon JK, et al. Anton, a special-purpose machine for molecular dynamics simulation. Commun ACM. 2008;51:91–97. [Google Scholar]
  • 20.Yu I, Mori T, Ando T, Harada R, Jung J, Sugita Y, et al. Biomolecular interactions modulate macromolecular structure and dynamics in atomistic model of a bacterial cytoplasm. Elife. 2016;5:e19274. doi: 10.7554/eLife.19274. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Durrant JD, Bush RM, Amaro RE. Microsecond molecular dynamics simulations of influenza neuraminidase suggest a mechanism for the increased virulence of stalk-deletion mutants. J Phys Chem B. 2016;120:8590–8599. doi: 10.1021/acs.jpcb.6b02655. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Trovato F, Fumagalli G. Molecular simulations of cellular processes. Biophys Rev. 2017;9:941–958. doi: 10.1007/s12551-017-0363-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Acun B, Hardy D, Kale LV, Li K, Phillips J, Stone J. Scalable molecular dynamics with NAMD on the Summit system. IBM J Res Dev. 2018;62:4:1–4:9. doi: 10.1147/jrd.2018.2888986. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Hyeon C, Thirumalai D. Capturing the essence of folding and functions of biomolecules using coarse-grained models. Nat Commun. 2011;2:487. doi: 10.1038/ncomms1481. [DOI] [PubMed] [Google Scholar]
  • 25.Takada S. Coarse-grained molecular simulations of large biomolecules. Curr Opin Struct Biol. 2012;22:130–137. doi: 10.1016/j.sbi.2012.01.010. [DOI] [PubMed] [Google Scholar]
  • 26.Noid W. Perspective: Coarse-grained models for biomolecular systems. J Chem Phys. 2013;139:090901. doi: 10.1063/1.4818908. [DOI] [PubMed] [Google Scholar]
  • 27.Ingólfsson HI, Lopez CA, Uusitalo JJ, de Jong DH, Gopal SM, Periole X, et al. The power of coarse graining in biomolecular simulations. Wiley Interdiscip Rev Comput Mol Sci. 2014;4:225–248. doi: 10.1002/wcms.1169. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Kmiecik S, Gront D, Kolinski M, Wieteska L, Dawid AE, Kolinski A. Coarse-grained protein models and their applications. Chem Rev. 2016;116:7898–7936. doi: 10.1021/acs.chemrev.6b00163. [DOI] [PubMed] [Google Scholar]
  • 29.Pak AJ, Voth GA. Advances in coarse-grained modeling of macromolecular complexes. Curr Opin Struct Biol. 2018;52:119–126. doi: 10.1016/j.sbi.2018.11.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Sanbonmatsu KY, Joseph S, Tung C-S. Simulating movement of tRNA into the ribosome during decoding. Proc Natl Acad Sci USA. 2005;102:15854–15859. doi: 10.1073/pnas.0503456102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Koga N, Takada S. Folding-based molecular simulations reveal mechanisms of the rotary motor F1–ATPase. Proc Natl Acad Sci USA. 2006;103:5367–5372. doi: 10.1073/pnas.0509642103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Pellarin R, Guarnera E, Caflisch A. Pathways and intermediates of amyloid fibril formation. J Mol Biol. 2007;374:917–924. doi: 10.1016/j.jmb.2007.09.090. [DOI] [PubMed] [Google Scholar]
  • 33.Yao X-Q, Kenzaki H, Murakami S, Takada S. Drug export and allosteric coupling in a multidrug transporter revealed by molecular simulations. Nat Commun. 2010;1:117. doi: 10.1038/ncomms1116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Clementi C, Nymeyer H, Onuchic JN. Topological and energetic factors: what determines the structural details of the transition state ensemble and “en-route” intermediates for protein folding? An investigation for small globular proteins. J Mol Biol. 2000;298:937–953. doi: 10.1006/jmbi.2000.3693. [DOI] [PubMed] [Google Scholar]
  • 35.Clementi C, Jennings PA, Onuchic JN. How native-state topology affects the folding of dihydrofolate reductase and interleukin-1β. Proc Natl Acad Sci USA. 2000;97:5871–5876. doi: 10.1073/pnas.100547897. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Koga N, Takada S. Roles of native topology and chain-length scaling in protein folding: a simulation study with a Gō-like model. J Mol Biol. 2001;313:171–180. doi: 10.1006/jmbi.2001.5037. [DOI] [PubMed] [Google Scholar]
  • 37.Best RB, Chen Y-G, Hummer G. Slow protein conformational dynamics from multiple experimental structures: the helix/sheet transition of arc repressor. Structure. 2005;13:1755–1763. doi: 10.1016/j.str.2005.08.009. [DOI] [PubMed] [Google Scholar]
  • 38.Okazaki K, Koga N, Takada S, Onuchic JN, Wolynes PG. Multiple-basin energy landscapes for large-amplitude conformational motions of proteins: Structure-based molecular dynamics simulations. Proc Natl Acad Sci USA. 2006;103:11844–11849. doi: 10.1073/pnas.0604375103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Whitford PC, Miyashita O, Levy Y, Onuchic JN. Conformational transitions of adenylate kinase: switching by cracking. J Mol Biol. 2007;366:1661–1671. doi: 10.1016/j.jmb.2006.11.085. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Takagi F, Kikuchi M. Structural change and nucleotide dissociation of myosin motor domain: dual Gō model simulation. Biophys J. 2007;93:3820–3827. doi: 10.1529/biophysj.106.103796. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Clementi C. Coarse-grained models of protein folding: toy models or predictive tools? Curr Opin Struct Biol. 2008;18:10–15. doi: 10.1016/j.sbi.2007.10.005. [DOI] [PubMed] [Google Scholar]
  • 42.Hills R, Brooks C. Insights from coarse-grained Gō models for protein folding and dynamics. Int J Mol Sci. 2009;10:889–905. doi: 10.3390/ijms10030889. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Daily MD, Phillips GN, Jr, Cui Q. Many local motions cooperate to produce the adenylate kinase conformational transition. J Mol Biol. 2010;400:618–631. doi: 10.1016/j.jmb.2010.05.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Takada S, Kanada R, Tan C, Terakawa T, Li W, Kenzaki H. Modeling structural dynamics of biomolecular complexes by coarse-grained molecular simulations. Acc Chem Res. 2015;48:3026–3035. doi: 10.1021/acs.accounts.5b00338. [DOI] [PubMed] [Google Scholar]
  • 45.Taketomi H, Ueda Y, Gō N. Studies on Protein Folding, Unfolding and Fluctuations by Computer Simulation: I. The effect of specific amino acid sequence represented by specific inter-unit interactions. Int J Pept Protein Res. 1975;7:445–459. [PubMed] [Google Scholar]
  • 46.Ueda Y, Taketomi H, Gō N. Studies on protein folding, unfolding, and fluctuations by computer simulation. II. A. Three-dimensional lattice model of lysozyme. Biopolymers. 1978;17:1531–1548. [Google Scholar]
  • 47.Go N. Theoretical studies of protein folding. Annu Rev Biophys Bioeng. 1983;12:183–210. doi: 10.1146/annurev.bb.12.060183.001151. [DOI] [PubMed] [Google Scholar]
  • 48.Bryngelson JD, Onuchic JN, Socci ND, Wolynes PG. Funnels, pathways, and the energy landscape of protein folding: a synthesis. Proteins. 1995;21:167–195. doi: 10.1002/prot.340210302. [DOI] [PubMed] [Google Scholar]
  • 49.Onuchic JN, Socci ND, Luthey-Schulten Z, Wolynes PG. Protein folding funnels: the nature of the transition state ensemble. Fold Des. 1996;1:441–450. doi: 10.1016/S1359-0278(96)00060-0. [DOI] [PubMed] [Google Scholar]
  • 50.Onuchic JN, Luthey-Schulten Z, Wolynes PG. Theory of protein folding: the energy landscape perspective. Annu Rev Phys Chem. 1997;48:545–600. doi: 10.1146/annurev.physchem.48.1.545. [DOI] [PubMed] [Google Scholar]
  • 51.Karanicolas J, Brooks CL., III The origins of asymmetry in the folding transition states of protein L and protein G. Protein Sci. 2002;11:2351–2361. doi: 10.1110/ps.0205402. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Karanicolas J, Brooks CL., III Improved Gō-like models demonstrate the robustness of protein folding mechanisms towards non-native interactions. J Mol Biol. 2003;334:309–325. doi: 10.1016/j.jmb.2003.09.047. [DOI] [PubMed] [Google Scholar]
  • 53.Miyazawa S, Jernigan RL. Residue–residue potentials with a favorable contact pair term and an unfavorable high packing density term, for simulation and threading. J Mol Biol. 1996;256:623–644. doi: 10.1006/jmbi.1996.0114. [DOI] [PubMed] [Google Scholar]
  • 54.Kobayashi C, Matsunaga Y, Koike R, Ota M, Sugita Y. Domain Motion Enhanced (DoME) Model for Efficient Conformational Sampling of Multidomain Proteins. J Phys Chem B. 2015;119:14584–14593. doi: 10.1021/acs.jpcb.5b07668. [DOI] [PubMed] [Google Scholar]
  • 55.Whitford PC, Gosavi S, Onuchic JN. Conformational transitions in adenylate kinase allosteric communication reduces misligation. J Biol Chem. 2008;283:2042–2048. doi: 10.1074/jbc.M707632200. [DOI] [PubMed] [Google Scholar]
  • 56.Hyeon C, Jennings PA, Adams JA, Onuchic J. Ligand-induced global transitions in the catalytic domain of protein kinase A. Proc Natl Acad Sci USA. 2009;106:3023–3028. doi: 10.1073/pnas.0813266106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Lu Q, Wang J. Kinetics and statistical distributions of single-molecule conformational dynamics. J Phys Chem B. 2009;113:1517–1521. doi: 10.1021/jp808923a. [DOI] [PubMed] [Google Scholar]
  • 58.Maragakis P, Karplus M. Large amplitude conformational change in proteins explored with a plastic network model: adenylate kinase. J Mol Biol. 2005;352:807–822. doi: 10.1016/j.jmb.2005.07.031. [DOI] [PubMed] [Google Scholar]
  • 59.Okazaki K, Takada S. Dynamic energy landscape view of coupled binding and protein conformational change: induced-fit versus population-shift mechanisms. Proc Natl Acad Sci USA. 2008;105:11182–11187. doi: 10.1073/pnas.0802524105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Hummer G, Pratt LR, García AE. Multistate Gaussian model for electrostatic solvation free energies. J Am Chem Soc. 1997;119:8523–8527. [Google Scholar]
  • 61.Shirts MR, Chodera JD. Statistically optimal analysis of samples from multiple equilibrium states. J Chem Phys. 2008;129:124105. doi: 10.1063/1.2978177. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Björkman A, Binnie RA, Zhang H, Cole LB, Hermodson MA, Mowbray SL. Probing protein-protein interactions. The ribose-binding protein in bacterial transport and chemotaxis. J Biol Chem. 1994;269:30206–30211. [PubMed] [Google Scholar]
  • 63.Björkman AJ, Mowbray SL. Multiple open forms of ribose-binding protein trace the path of its conformational change. J Mol Biol. 1998;279:651–664. doi: 10.1006/jmbi.1998.1785. [DOI] [PubMed] [Google Scholar]
  • 64.Koike R, Ota M, Kidera A. Hierarchical description and extensive classification of protein structural changes by Motion Tree. J Mol Biol. 2014;426:752–762. doi: 10.1016/j.jmb.2013.10.034. [DOI] [PubMed] [Google Scholar]
  • 65.Moritsugu K, Koike R, Yamada K, Kato H, Kidera A. Motion tree delineates hierarchical structure of protein dynamics observed in molecular dynamics simulation. PLoS One. 2015;10:e0131583. doi: 10.1371/journal.pone.0131583. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Domański J, Hedger G, Best RB, Stansfeld PJ, Sansom MS. Convergence and sampling in determining free energy landscapes for membrane protein association. J Phys Chem B. 2017;121:3364–3375. doi: 10.1021/acs.jpcb.6b08445. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Matsunaga Y, Sugita Y. Refining Markov state models for conformational dynamics using ensemble-averaged data and time-series trajectories. J Chem Phys. 2018;148:241731. doi: 10.1063/1.5019750. [DOI] [PubMed] [Google Scholar]
  • 68.Jung J, Mori T, Kobayashi C, Matsunaga Y, Yoda T, Feig M, et al. GENESIS: a hybrid—parallel and multi— scale molecular dynamics simulator with enhanced sampling algorithms for biomolecular and cellular simulations. Wiley Interdiscip Rev: Comput Mol Sci. 2015;5:310–323. doi: 10.1002/wcms.1220. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Kobayashi C, Jung J, Matsunaga Y, Mori T, Ando T, Tamura K, et al. GENESIS 1.1: A hybrid-parallel molecular dynamics simulator with enhanced sampling algorithms on multiple computational platforms. J Comput Chem. 2017;38:2193–2206. doi: 10.1002/jcc.24874. [DOI] [PubMed] [Google Scholar]
  • 70.Ryckaert J-P, Ciccotti G, Berendsen HJ. Numerical integration of the cartesian equations of motion of a system with constraints: molecular dynamics of n-alkanes. J Comput Phys. 1977;23:327–341. [Google Scholar]
  • 71.Loeffler HH, Kitao A. Collective dynamics of periplasmic glutamine binding protein upon domain closure. Biophys J. 2009;97:2541–2549. doi: 10.1016/j.bpj.2009.08.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Ravindranathan KP, Gallicchio E, Levy RM. Conformational equilibria and free energy profiles for the allosteric transition of the ribose-binding protein. J Mol Biol. 2005;353:196–210. doi: 10.1016/j.jmb.2005.08.009. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

16_310_1.pdf (966.7KB, pdf)

Articles from Biophysics and Physicobiology are provided here courtesy of The Biophysical Society of Japan

RESOURCES