Abstract
Molecular Dynamics (MD) simulations seek to provide atomic-level insights into conformationally dynamic biological systems at experimentally relevant time resolutions, such as those afforded by single-molecule fluorescence measurements. However, limitations in the time scales of MD simulations and the time resolution of single-molecule measurements have challenged efforts to obtain overlapping temporal regimes required for close quantitative comparisons. Achieving such overlap has the potential to provide novel theories, hypotheses, and interpretations that can inform idealized experimental designs that maximize the detection of the desired reaction coordinate. Here, we report MD simulations at time scales overlapping with in vitro single-molecule Förster (fluorescence) resonance energy transfer (smFRET) measurements of the amino acid binding protein LIV-BPSS at sub-millisecond resolution. Computationally efficient all-atom structure-based simulations, calibrated against explicit solvent simulations, were employed for sampling multiple cycles of LIV-BPSS clamshell-like conformational changes on the time scale of seconds, examining the relationship between these events and those observed by smFRET. The MD simulations agree with the smFRET measurements and provide valuable information on local dynamics of fluorophores at their sites of attachment on LIV-BPSS and the correlations between fluorophore motions and large-scale conformational changes between LIV-BPSS domains. We further utilize the MD simulations to inform the interpretation of smFRET data, including Förster radius (R0) and fluorophore orientation factor (κ2) determinations. The approach we describe can be readily extended to distinct biochemical systems, allowing for the interpretation of any FRET system conjugated to protein or ribonucleoprotein complexes, including those with more conformational processes, as well as those implementing multi-color smFRET.
Author summary
Förster (fluorescence) resonance energy transfer (FRET) has been used extensively by biophysicists as a molecular-scale ruler that yields fundamental structural and kinetic insights into transient processes including complex formation and conformational rearrangements required for biological function. FRET techniques require the identification of informative fluorophore labeling sites, spaced at defined distances to inform on a reaction coordinate of interest and consideration of noise sources that have the potential to obscure quantitative interpretations. Here, we describe an approach to leverage advancements in computationally efficient all-atom structure-based molecular dynamics simulations in which structural dynamics observed via FRET can be interpreted in full atomistic detail on commensurate time scales. We demonstrate the potential of this approach using a model FRET system, the amino acid binding protein LIV-BPSS conjugated to self-healing organic fluorophores. LIV-BPSS exhibits large scale, sub-millisecond clamshell-like conformational changes between open and closed conformations associated with ligand unbinding and binding, respectively. Our findings inform on the molecular basis of the dynamics observed by smFRET and on strategies to optimize fluorophore labeling sites, the manner of fluorophore attachment, and fluorophore composition.
Introduction
Computational advancements have enabled the field of molecular dynamics (MD) simulations of biomolecular components to progress from sub-nanosecond to millisecond simulation time scales [1–3]. The combination of enhanced sampling algorithms and increased compute power has led to extended time scale simulations and rapid growth in the number of MD publications [1]. As MD simulations for studying conformational dynamics in biological systems venture into the realm of milliseconds and beyond, we are now at the precipice of MD simulation time scales that are temporally commensurate to an increasing number of in vitro experimental techniques. Congruence between MD and experimental time scales is also made increasingly possible as in vitro techniques advance towards collecting data at increasingly rapid time resolutions by leveraging improvements in laser, fluorophore, and detector performance [4–7]. Quantitative comparison of in vitro biochemical and spectroscopic data with in silico MD simulation on comparable time scales has the potential to provide rigorous structural and molecular interpretations of dynamic biomolecular events in atomic detail.
It is becoming standard practice in the field of MD simulations to perform explicit-solvent equilibrium simulations of small proteins in the microsecond range, while coarse-graining approaches have reached millisecond time scales [8,9]. Simulations have been used to predict chemical shifts and directly compare these results to nuclear magnetic resonance (NMR) experiments [10–23]. Simulations have also been used to recapitulate small angle X-ray scattering (SAXS) data [24–37]. Temporal information can be gleaned in NMR or SAXS by a diverse variety of means, including peak broadening in NMR, or by performing SAXS in time-resolved modes [38]. When data from these techniques are directly compared to MD simulations, the time scale of the simulations is often not considered critical, so long as sampling is sufficient to avoid equilibration artifacts [10–35]. Beyond these techniques, MD simulations can provide atomistic insights into structural dynamics for a multitude of experimental approaches [39–46]. Yet, close quantitative comparisons between in vitro experimental data and in silico simulations at commensurate time scales are relatively rare.
Single-molecule Förster Resonance Energy Transfer (smFRET), an experimental technique used to study the dynamics of a wide variety of biomolecules, from polymerases to ribosomes, to G-protein coupled receptors, could benefit greatly from quantitative MD simulation comparison [47–51]. Investigations involving MD interpretations of smFRET experiments have been a major focus for some time [52]. Recent pioneering studies utilized a 400 microsecond MD simulation in combination with machine learning and restraints derived smFRET experiments to recapitulate or interpret smFRET states and the transitions between them [53–56]. Multiple groups have also employed MD simulations to model FRET efficiencies derived from protein conformational ensembles [57–59]. Still others have used smFRET data to inform simulations of peptides [60]. To our knowledge, direct one-to-one comparisons of MD simulations with smFRET data of full-length proteins on congruent time scales have, however, yet to be performed.
To expand the predictive power of MD simulations for direct one-to-one comparisons with experiment, all-atom simulations must be performed on time scales that overlap with experimental observables (i.e., aggregate sampling of seconds). Explicit solvent simulations on the time scale of smFRET investigations are not trivial and can require hundreds of GPU core years. Explicit solvent simulations of small biological systems, such as a single protein, are also difficult to scale efficiently to supercomputers. For a 23,558 atom system, the Anton 2 supercomputer has proven capable of producing 85 μs/day [61]. To achieve explicit solvent simulations on Anton on time scales commensurate with smFRET, i.e. 10 seconds, MD calculations would, therefore, have to be performed continuously for several years.
To address this issue, we sought to leverage advances in the computationally economical, all-atom Gō-like structure-based models and to assess our findings with kinetic theory to compute rates [62]. More specifically, we employed a multibasin Gō-like structure-based simulation approach where multiple FRET states can be defined as native basins in the simulation, allowing for interconversion between states[63]. Simulations of this kind define contacts within a given state as a native basin, but allow for large-scale, spontaneous conformational changes [64,65]. For this technique, specific hydrogen bonds, electrostatic interactions, and rotamer angles are implicitly included in the native basin. By combining this approach with explicit-solvent equilibrium simulations, one can achieve comprehensive views of the dynamic ensemble of a biomolecule by explicitly resolving even more subtle interactions, such as explicit hydrogen bonds, electrostatics, or solvation effects [65,66].
Here, we take advantage of such methods to perform comparisons of MD simulations and smFRET data on commensurate time scales. This effort has been enabled by numerous advances in both fields of research, including the aforementioned breakthroughs in MD simulations and rapidly evolving smFRET instrumentation, self-healing organic fluorophores, and the identification of the amino acid binding protein leucine-isoleucine-valine binding protein (LIV-BP) as a robust model system [5,67–69]. This biosensor, with the inclusion of mutations to remove native cysteines (LIV-BPSS), rapidly binds leucine, isoleucine, and valine amino acids with a kon of 30 μM-1 s-1 for Leu [68]. We use this system to develop a pipeline for recapitulating smFRET data in silico for a direct structural interpretation of the underlying conformational processes associated with amino acid binding and unbinding. We then employ kinetic theory to determine rates of transitions between FRET states to enable direct comparisons between our in silico and in vitro results [62]. Using all-atom structure-based simulations on comparable time scales to in vitro experiments, together with explicit solvent simulations on shorter time scales, we examine the key variables and determinants of smFRET data, including quantitative descriptions of Förster radius (R0) and fluorophore orientation factor (κ2) parameters that inform on the FRET-distance relationship. Importantly, this pipeline is not restricted to LIV-BPSS. It can be reconstituted for any smFRET assay to gain all-atom structural perspectives on experimental data.
Results
Single-molecule FRET experiments and molecular simulations resolve LIV-BPSS conformational changes on commensurate time scales
The LIV-BP model system employed here for smFRET experiments and MD simulations corresponds to the LIV-BPSS protein described previously [68]. Previous experimental investigations using a fluorescently labeled LIV-BPSS protein have shown that ligand binding rates approach the diffusion limit (ca. ~108 M-1 s-1) [68]. In the present study, analogous to prior investigations [68,70], LIV-BPSS was labelled at positions 67 and 181 by mutating native residues to Cys and performing maleimide chemistry with the self-healing fluorophores LD555 and LD655 (Fig 1A). In the MD simulations, LD555 and LD655 fluorophores were explicitly modeled at these same positions. As previously described [68], the sites of labeling in LIV-BPSS were selected based on X-ray crystal structures of ligand-bound (closed conformation) and ligand-free (open conformation) states (PDB ID: 1Z15, 1Z16, 1Z17, and 1Z18) (Fig 1B) [71] to report on the conformational changes upon amino acid binding and dissociation, respectively. This conformational change can be described as a “clamshell” or “fly-trap” motion, where the two domains of LIV-BPSS approach each other in the ligand-bound state (closed conformation) (Fig 1B). Capturing the conformational dynamics between the closed and open LIV-BPSS conformations using smFRET require camera exposure times below 5 ms (Fig 2A) [68,70]. Notably, typical explicit solvent MD simulations have time steps of 1–4 fs. By contrast, reduced-description MD simulations can exhibit up to nanosecond individual time steps. Thus, any smFRET time resolution can, in principle, be matched with MD simulations. However, to simulate the large number of conformational transitions observed in smFRET experiments, time scales on the order of hundreds of milliseconds to seconds are required. Such an undertaking is not trivial to achieve.
For comparison to MD simulations, we examined the conformational changes of LIV-BPSS in smFRET experiments at varied camera frame rates (100 ms to 0.25 ms) (Fig 2A) [70]. To ensure a robust comparison between experimental FRET efficiencies and those derived from simulations, we corrected the experimental smFRET data for spectral crosstalk, relative detection efficiencies and quantum yields (QD), and acceptor direct excitation (see Methods). Notably, in the presence of the leucine (Leu) ligand at the KD of ligand binding (~4.5 μM) in the 100 ms exposure regime, only one FRET state is evidenced that exhibits ~0.52 FRET efficiency (see Methods). This finding reflects time-averaging effects related to the rapid association and dissociation rates of leucine to LIV-BPSS(~30 μM-1 s-1 and ~212 s-1, respectively) [68,70]. Consistent with this interpretation, at exposure times of 3 ms and below, two distinct FRET states exhibiting ~0.65 and ~0.35 FRET efficiency, corresponding to open and closed LIV-BPSS conformations, respectively, are resolved (Fig 2A).
To perform MD simulations of LIV-BPSS congruent with these experimental studies, site-specifically labelled LIV-BPSS proteins, as described above, were used in dual-basin all-atom structure-based simulations to describe conformational changes between open and closed conformations (Fig 2B). In addition, explicit solvent simulations of apo and Leu-bound states were performed for an in-depth investigation of the dynamics of LIV-BPSS near the open and closed basin minima. Structure-based simulations in the presence of Leu were used to observe transitions between the open and closed conformations. However, due to technical considerations related to the structure-based potential, Leu binding and unbinding events were not explicitly simulated (S1 Text). Because the ligand itself contributes relatively few native contacts to the bound (closed) state, this exclusion has little effect on the number and time scale of simulated transitions.
An advantage of structure-based potentials is that they can be relatively easily calibrated to experimental systems. Here, the structure-based potentials were calibrated to the opening and closing events observed at the KD of Leu binding (~4.5 μM) to LIV-BPSS in smFRET experiments. This approach enabled us to simplify the simulations, while recapitulating LIV-BPSS opening and closing dynamics. In so doing, we were able to clearly define a double Gaussian potential for the LIV-BPSS protein, where each basin is defined by the native contacts present in both the open and closed conformations (S1 Fig). In this approach, both conformations are set as native basins, allowing for the protein to transition between each conformation through barrier crossing events. Previous implementations of Gaussian basins involved studying kinetic processes such as protein folding and conformational changes [72,73]. By contrast, our simulations using the double Gaussian potential are expected to provide information regarding the conformational changes of LIV-BPSS. This implementation also enables analyses of LD555 and LD655 fluorophore dynamics, tumbling behaviors, and the simulation of FRET efficiencies based on the measured inter-fluorophore distances according to Förster theory.
Proper conversion of FRET efficiency values to fluorophore distances, or vice versa, requires accurate determination of the fluorophore positions in the system in three-dimensional space. To characterize the fluorophore distance correlation between MD simulations and smFRET experiments, we measured the distance in the MD simulations between the centers of mass of the explicit LD555 and LD655 chromophores (Rdye) attached to positions 67 and 181 of the protein, respectively (S2 Fig). As expected, the fluorophore distances were observed to fluctuate, where the dye center of masses exhibited mean inter-fluorophore distance values of 69 ± 5 Å and 56 ± 8 Å for the open and closed conformations, respectively.
Precisely ascertaining the time interval of each time step of any simulation based on a reduced description potential is non-trivial. In the case of structure-based potentials, the time intervals are dependent on the size of the system, the number of particles, the contact weights, and the harmonic potentials employed. However, through direct comparison with in vitro experiments or explicit solvent simulations, estimates of the time scales for structure-based simulations can be achieved [65,74]. To estimate the time scales of the simulated LIV-BPSS conformational changes, we directly compared the simulated dwell times of the closed conformation evidenced in our structure-based simulations to the corresponding dwell times observed in high-FRET state by smFRET.
At the KD of Leu binding (4.5 μM), LIV-BPSS spends approximately 50% of its time in both open and closed conformations (Fig 3A–3C) [68,70]. The experimentally estimated opening rate under these conditions (~212 s-1) corresponds to Leu unbinding [68]. This measurement indicates that the average duration of the Leu-bound state is approximately 4.7 ms. By definition of the KD, the average lifetime of the open conformation must, therefore, also be ~4.7 ms. These data are globally consistent with the estimated KD (~4.5 μM) and a near diffusion-limited ligand binding rate (~108 M-1s-1).
The average dwell time of the closed conformation in the MD simulations was 1.9 ± 0.19 x 107 time steps of simulation time (Fig 2B). This dwell time was determined by measuring the number of time steps LIV-BPSS stays in the closed conformation (Rdye < 60 Å), where 0.19 x 107 corresponds to the standard deviation of this value. Considering the dwell times evidenced in smFRET experiments, we correspondingly infer that each time step reflects approximately 0.25 ns. These estimates, which relate the time steps in structure-based models to physiological time scales, are similar in nature to those reported by Yang, et al for structure-based simulations, which estimated each simulated time step to be 0.05–1 ns [74].
A key determinant of the FRET-distance relationship is the fluorophore orientation factor (κ2), which relates to the relative orientation of the transition dipole moments of each fluorophore. The transition dipole moments of individual fluorophores are often assumed to be randomized by rotational diffusion. In such cases, the average fluorophore orientation is approximately isotropic, yielding a κ2 value of 2/3. Accurate κ2 values are required to accurately calculate the Förster distance (R0).
Fluorophore conjugation to a biomolecule can, however, influence its tumbling behaviors due to steric restrictions and hydrophobic and/or electrostatic effects. Correspondingly, a fluorophore’s position may be non-random. To ascertain if the fluorophores conjugated to LIV-BPSS are randomly orientated, we first determined the transition dipole moment of the fluorophores with quantum mechanical (QM) calculations (S3 Fig). Using these dipole moments in conjunction with our structure-based simulations, we determined that the self-healing LD555 and LD655 fluorophores conjugated to LIV-BPSS have κ2 = 0.58 ± 0.22, close to the theoretical 2/3 value (S1 Table). From explicit solvent simulations of open and closed LIV-BPSS conformations performed in triplicate, we find κ2 values of 0.45 ± 0.19 and 0.41 ± 0.10, respectively. The lower estimation of κ2 from explicit solvent simulations is likely due to the limited sampling of fluorophore positions during the 1 μs simulation. The deviations of κ2, calculated in structure-based simulations (0.58 ± 0.22) from the ideal value (0.66) is expected to shorten R0 by 1 Å. More skewed fluorophore orientations would have greater effects. Such findings are consistent with the notion that the tumbling behaviors of fluorophores at their sites of attachment can influence estimations of the FRET-distance relationship, albeit modestly in the case of LIV-BPSS. Using the calculated κ2 value (0.58) based on structure-based simulations, and the experimentally determined LD555 quantum yield attached to LIV-BPSS (0.48; see Methods), we estimated the R0 of the LD555/LD655 pair to be approximately 62.0 Å. This finding is in good agreement with the inter-fluorophore distance (Rdye) changes estimated from our structure-based MD simulations of 69 ± 5 Å (open) and 56 ± 8 Å (closed) and the experimentally derived FRET efficiencies (~0.65 and 0.35 FRET, respectively), which are above and below 50% transfer efficiency for the LD555/LD655 pair (~62.0 Å) (discussed below).
Correlating smFRET efficiencies with computed FRET efficiencies from MD simulations
Using the estimated duration of the simulated time steps (0.25 ns) and the estimated R0 (62.0 Å), we generated simulated FRET efficiency traces from the all-atom structure-based MD simulations (S4 Fig). As expected from experiment, simulated smFRET efficiencies at 1 ms exposure time (averaging 4 x 106 time steps to reach 1 ms exposure) exhibited step-like transitions between high- and low-FRET states. However, the averaged FRET values were higher than those expected from smFRET experiments (~0.65 and ~0.35 for the experimentally observed open and closed conformations, respectively) (Figs 2A and 3A). We note in this context that the open and closed LIV-BPSS conformations in our all-atom structure-based MD simulations, exhibit estimated FRET efficiencies of ~0.84 and ~0.47, respectively, roughly 0.1–0.2 higher FRET efficiency than those observed experimentally (S4 and S5 Figs.). In considering the origins of this difference, we noted that the structure-based simulations did not include the potential intramolecular contacts for the fluorophores observed in our explicit solvent simulations. As a result, LD555 and LD655 were highly dynamic and extended outward from the protein surface throughout the structure-based simulations, similar to antennae from an ant.
To examine the hypothesis that such a scenario may be unrealistic—for instance, by discounting fluorophore-linker or fluorophore protein interactions—we added contacts between the fluorophore and its linker (dye-linker interactions), which we discerned from our three, 1 μs explicit solvent simulations of LIV-BPSS in apo and Leu-bound states. Here, fluorophore atoms that come within 3 Å of a linker atom were considered and added as a native contact to the structure-based simulations. This analysis led to the addition of 111 native contacts in the structure-based model for both fluorophores combined (S5 Fig). In the explicit solvent simulations, fluorophore atoms approach the protein as well. Nevertheless, these interactions were an order of magnitude fewer (39) in number and they were observed to be relatively transient in nature. We therefore did not consider them likely to significantly alter the observed fluorophore dynamics to the same extent as the intramolecular fluorophore-linker contacts. For example, the longest enduring interaction (distance shorter than 3 Å) between LD555 and the protein lasted for < 500 ns, and was only formed in one of the 6 explicit solvent simulations. By contrast, dye-linker interactions were observed in all explicit solvent simulations, most of which maintained for > 500 ns in both the open and closed conformations. Correspondingly, interactions between the fluorophores and protein were not considered for the structure-based simulations. The inclusion of the fluorophore-linker interactions, used to yield more accurate FRET values do not significantly impact the κ2 values (S1 Table), yet were introduced to provide a more precise fluorophore-fluorophore distances.
To examine the impacts of these contacts on the average fluorophore positions in our structure-based simulations, we scaled their strength from 0.1 to 1.0. Using this procedure, we empirically determined that a scaling of 0.5 for dye-linker contacts closely recapitulated the FRET values of the experimental traces (Fig 2A and 2B). At 100 ms exposure time for both the smFRET and MD simulations, only a single FRET population at ~0.5 FRET efficiency was observed (Fig 2A and 2B), consistent with the experimental observations. In line with the experimental smFRET trajectories, at the equivalent of 3 ms, 1 ms, and 0.25 ms exposure time, the MD simulations also clearly revealed two distinct FRET populations (~0.6 and 0.4 FRET efficiency) that rapidly interconverted (Fig 2A and 2B). Decreasing the time step averaging to 1 ms or 0.25 ms, revealed that the FRET efficiencies of these two states reached a maximum separation of ~0.3 FRET, corresponding to FRET efficiency values of approximately 0.65 and 0.35 FRET for open and closed conformations, respectively (Fig 2B).
Importantly, the estimated κ2 value in this system (0.61 ± 0.11) revealed that that the fluorophore tumbling behaviors remained within 8% of the expected value for perfect isotropic tumbling (0.66). These findings corroborate the accuracy of the experimentally derived approximations of R0 and κ2 employed for our simulations and suggest that the average positions of the LD555 and LD655 fluorophores on LIV-BPSS may be modestly compacted towards their sites of attachment at the LIV-BPSS surface. Such findings are consistent with the expected flexibility of the alkyl chain linker connecting the protein and fluorophore, which may be exacerbated by short-lived fluorophore-linker and/or fluorophore-protein interactions that lead to compaction of the dye center of mass towards the protein surface.
MD simulations recapitulate population-level smFRET data
Single-molecule fluorescence and FRET experiments seek to determine how a system functions by evaluating the compositional and conformational dynamics of large ensembles of individual molecules. In the case of LIV-BPSS, function is inferred from the opening and closing dynamics that arise from amino acid binding and unbinding in the experimental system.
To accurately determine the mean FRET efficiencies and standard deviations of the experimentally measured FRET states, individual smFRET traces were compiled into population FRET histograms that were then fit to Gaussian distributions (Fig 3A–3C). Histograms of this kind clearly illustrate that the low- and high-FRET states of LIV-BPSS, which correspond to open and closed conformations, respectively, only become fully resolved at exposure times below ~5 ms in the experiments. At 5 ms exposure time high- (~0.6) and low- (~0.4) FRET states exhibit significant overlap due to the existence of exchange processes between open and closed conformations that occur on the integration time scale. Decreasing the integration time in the experiments leads to gradual resolution of two distinct FRET peaks, where the open and closed conformations exhibited ~0.65 and ~0.35 mean FRET efficiency values, respectively (Table 1).
Table 1. FRET efficiencies determined from Gaussian distribution of FRET distributions.
Low FRET State | High FRET State | |||
---|---|---|---|---|
Experimental | Simulation | Experimental | Simulation | |
5 ms | 0.41 ± 0.10 | 0.35 ± 0.06 | 0.61 ± 0.10 | 0.56 ± 0.15 |
3 ms | 0.40 ± 0.10 | 0.35 ± 0.06 | 0.64 ± 0.07 | 0.56 ± 0.15 |
1 ms | 0.36 ± 0.09 | 0.32 ± 0.09 | 0.65 ± 0.07 | 0.63 ± 0.11 |
0.5 ms | 0.34 ± 0.09 | 0.32 ± 0.10 | 0.65 ± 0.07 | 0.63 ± 0.13 |
In the MD simulations, we observe a similar behavior to the experiments. In particular, the open and closed FRET efficiencies become more clearly differentiated as the estimated exposure time is decreased (Fig 3B). To identify the mean of these populations we fit the histograms of the simulated FRET data to a sum of two Gaussian distributions (Fig 3D). These fits revealed that the mean FRET efficiencies obtained from MD simulations are approximately 0.33 and 0.62 for open and closed conformations, respectively (Table 1). These computationally estimated FRET values, while slightly lower than expected from experiment—the observed differences of ~0.02 to 0.03 correspond to distances of ~1–2 Å, respectively—remain in close agreement with those estimated by smFRET (Table 1).
The modest differences in the mean FRET efficiency values between the MD and smFRET experiments may arise from a number of potential sources. This may include inaccuracies in the fluorophore-linker and fluorophore-protein contacts used to calibrate structure-based simulations, corrections used to estimate experimentally derived FRET efficiencies, or the absolute values of specific fluorophore parameters (namely R0) used to convert the distances observed in MD simulations to FRET efficiency.
In addition to the open and closed conformations of LIV-BPSS, we also observed an intermediate population in our simulations at relatively slow time regimes (3 and 5 ms) (Fig 3D). This may be akin to the blurring of low and high-FRET states observed experimentally in slow exposure regimes, (Fig 3C and 3D), where artifactual intermediates appear to arise due to the time averaging of the simulated FRET signal.
The slight difference in FRET efficiency values or time-averaging intermediates observed in MD may arise from subtle distinctions in the simulated fluorophore-linker interactions compared to those present in the fully solvated experimental setting. As the simulations performed treat hydrogen bonding and electrostatic interactions as implicit contacts, it is likely the fluorophore dynamics differ slightly from those that actually occur in vitro. Nonetheless, the simulated dynamics of LIV-BPSS closely recapitulate those observed experimentally by smFRET. Further iterations of the simulated contacts can be explored to determine how the FRET efficiency values for open (apo) and closed (Leu-bound) LIV-BPSS conformations vary with linker and dye composition. Such efforts are expected to inform on potential optimizations and the influence of the site of protein attachment on experimentally derived and simulated FRET efficiency values.
Single-molecule and molecular dynamics simulated-FRET changes reflect interdomain movements in LIV-BPSS
We next sought to quantitatively examine the extent to which the fluctuations in FRET efficiency observed in our simulations correspond to conformational changes of the LIV-BPSS protein. Due to the flexible nature of the fluorophore linkers, LD555 and LD655 can, in principle, approach each other without the protein undergoing a complete conformational transition from apo (open) to Leu-bound (closed) state, and vice versa. In other words, the fluorophores could fail to precisely track the changes in distance between the two LIV-BPSS domains to which they are attached.
To assess the potential contributions of such divergences to our analyses, we sought to examine the correlation between the inter-fluorophore distance–and therefore FRET efficiencies–with the actual LIV-BPSS domain movements in the simulations. While these two values are often assumed to be closely correlated, it is not obvious that they are actually correlated in light of, for example, relatively long linker lengths required for smFRET experiments. The assumption of close correlation between the fluorophore and the domain to which it is attached actually entails two implicit assumptions: (i) the residue position on the protein accurately reflects movement of an entire domain, as opposed to a local rearrangement, and (ii) the flexible linkers can be approximated by rigid bodies, fully correlated with movement of the protein.
To examine the extent to which this assumption holds for the entire domain to which the fluorophores are attached, we sought out an alternative reaction coordinate that captures the maximum amplitude of LIV-BPSS conformational change, independent of any possible restrictions on labeling positions imposed by the practicalities of experiments. We chose a maximum amplitude reaction coordinate that optimizes separation of conformations on approximate free energy landscapes, aiding in estimations of barrier height. To find this reaction coordinate we measured the distance between all Cα atoms in domain 1 to all Cα atoms in domain 2. These distances were compared between the open and closed conformations to generate changes in distance, Δ, for each amino acid pair between domains 1 and 2 (S6 Fig). This analysis revealed areas of the protein that undergo the largest interdomain distance changes during conformational change.
We found that the amino acid pair Ser 12 and Ala 237, displayed the largest distance change (Δ ~13 Å) between open and closed conformations. Thus, we define the alternative reaction coordinate, Rdomain, to be the distance between Cα atoms of Ser 12 and Ala 237. We note that, while the change in Rdomain is larger than the change for the label positions used in the experiments, the Rdomain states are too close (~35 ± 4 Å and ~22 ± 3 Å for open and closed states, respectively) for FRET-based experimental measurements due to practical considerations (S2 Fig). A Boltzmann-weighted approximate free energy landscape was calculated, indicating that Rdomain and Rdye are correlated (Pearson’s correlation of 0.73) (Fig 4A). Here, the open conformation exhibits a mean Rdye value of ~69 Å and a Rdomain of ~35 Å, whereas, the closed conformation exhibits a mean Rdye value of ~56 Å and a Rdomain of ~20 Å. The 34 Å or 36 Å distance difference between Rdye and Rdomain is a result of fluorophore linker length and positions 12 and 237, used to measure Rdomain, being closer together than positions 67 and 181, the sites of fluorophore conjugation in our smFRET experiments. From Rdomain or Rdye, the apparent barrier heights of LIV-BPSS conformational change can be approximated to 3.8 kcal/mol or 2.5 kcal/mol, respectively (Fig 4B and 4C). To connect the barrier heights to rates we used the relationship between protein diffusion coefficients, rates, and free energy barriers [75,76]. As the reaction coordinate Rdomain leads to the larger barrier height we used it to determine the rate of LIV-BPSS conformational change (S2 Text). From these calculations we observed Leu-bound and apo LIV-BPSS to undergo conformational changes at rates of 750 s-1 to 6000 s-1 and 14773 s-1 to 19530 s-1, respectively, where the lower limit is approaching the rate determined by smFRET of 210 s-1 (S7 Fig; S2 Table) [68]. As diffusion is determined by the degree of roughness in the landscape, the overestimation of the rate of conformational transitions by LIV-BPSS is likely a result of the low degree of roughness in the free energy landscapes from structure-based model approaches. The rate difference between the apo and Leu-bound states is a result of the up to 10-fold difference in prefactor (S2 Table). The prefactor represents the attempt frequency, and is measured from the time dependent variance of Rdomain in explicit solvent simulations. As the apo state is more dynamic, the time dependent variance of Rdomain is larger, leading to the larger prefactor (see below) (Fig 4D–4F). The rate is proportional to the prefactor and to the exponential term (dependent on the barrier height), resulting in both making important contributions to rate estimates. Barrier height calculations are often limited by sampling; however, the all-atom structure-based potential significantly enhances sampling. On the other hand, the prefactor, determined by explicit solvent simulations, is somewhat limited in terms of sampling. Thus, longer simulations or implementation of explicit solvent enhanced sampling techniques to estimate diffusion may improve the accuracy of our methodology.
To examine the effect of finite linker length, we considered a third reaction coordinate, RCα, which measures the distances between the Cα atoms at the labeling positions used in the smFRET experiments (positions 67 and 181). This pair exhibited a distance change (Δ) between open (~57 Å) and closed (~49 Å) conformations of approximately 7.7 Å (S6 Fig). Comparing this reaction coordinate with Rdye, the distance between the fluorophores (each of which is attached to a linker), we obtained a Pearson correlation coefficient of 0.8, indicating the fluorophore fluctuations correlate with domain motions of LIV-BPSS (S8 Fig).
The open state of LIV-BPSS is inherently dynamic
We performed explicit solvent simulations of two systems: (i) LIV-BPSS in the open state without ligand (apo) and (ii) LIV-BPSS in the closed state with ligand (Leu-bound). Comparison of Rdomain and Rdye for explicit solvent simulations of the apo state indicates that this LIV-BPSS state is significantly more flexible than the Leu-bound state (Fig 4D). Three 1 μs simulations of LIV-BPSS in the apo (open) or Leu-bound (closed) states were combined to create approximate free energy landscapes comparable to those from structure-based simulations (Fig 4D–4F). In the Leu-bound state simulation, a single conformational basin was observed with a minimum at a Rdomain and Rdye of 20 Å and 38 Å, respectively (Fig 4E). Upon closer inspection, this minimum was found to exhibit some heterogeneity (Fig 4F), indicative of modest interdomain and interdye distance variations in the Leu-bound state, albeit orders of magnitude less than variations in the all-atom structure-based MD simulations.
By contrast, the apo state explicit solvent simulations exhibited two well-separated FRET populations, corresponding to a Rdomain value of either ~45 Å or ~25 Å and Rdye values of ~75 Å and ~50 Å, respectively. As the explicit solvent simulations were initiated from the open conformation, the differences in the occupied positions for each simulation on the free energy landscape are a result of the fluctuations sampled. Close inspection of these simulations revealed that the two LIV-BPSS domains can indeed approach each other in the absence of ligand to achieve a conformation similar in nature to the closed conformation (S9 Fig). Here, we note that while the two domains of LIV-BPSS approach each other, they do not reach the same conformational space adopted by Leu-bound LIV-BPSS (Fig 4D–4F). The trajectories of the apo state simulations that approach the closed conformation all initiate their transition fairly rapidly in the simulation (within the first 10 ns). Interconversion between the closed and open conformation is not observed in the explicit solvent simulations in the 1 μs sampling regime. However, the apo state simulations are observed to be more dynamic as reflected by width of the free energy landscape populations (Fig 4D and 4E) and the RMSD and RMSF values, which were both on average ~1.5 Å larger than the Leu-bound simulations of LIV-BPSS (S10 and S11 Figs). Altogether, the results indicate that the open state is more dynamic and correspondingly more likely to interconvert between the closed and open conformations, with longer sampling.
Discussion
Efforts to quantify the relationships between data obtained through smFRET imaging and MD simulation have the potential to provide atomic descriptions of rate-limiting, large-scale conformational changes in biological systems critical to cellular function and regulation. Atomistic-resolution information on such large-scale events are currently limited by the computational costs of explicit solvent simulations over long time scales.
To overcome these limitations, we have used a computationally efficient all-atom structure-based approach to simulate the model protein LIV-BPSS that recapitulates smFRET data on the same time scale as in vitro experiments. The LIV-BPSS system undergoes a relatively simple clamshell-like transition in converting between open and closed conformations. The integration of smFRET and simulation is not, however, limited to small proteins. In fact, it is also suited to larger and more complex systems that undergo a diversity of conformational changes potentially dominated by more than one mode. More complex systems, such as the ribosome, can be observed to undergo large-scale conformational transitions by smFRET, but the nature of these conformational changes is difficult to understand even when informed by high-resolution structural information. Our long-term goal is to employ the integration of smFRET with MD simulations to enable atomistic, structural dynamic descriptions of the complex rotations, counter rotations, swivel- and tilt-like domain movements that have been evidenced to underpin ribosome functions during translation.
As demonstrated, MD simulations of the kind described herein provide direct insight into the relationship between dynamics of a system and the observed FRET values evidenced experimentally. In doing so they have the potential to provide dynamic structural descriptions of the underlying physical barriers separating states of a system. Such efforts hold the promise of providing complete descriptions of the reaction coordinate evidenced by smFRET investigations and critical information about the nature of the conformational changes observed. At the same time, MD simulations can be used to inform on strategies to maximize smFRET signals or to investigate alternative reaction coordinates. Such strategies could involve optimization of FRET pairs utilized, given the impact on κ2 from the labelling position environment measured in simulations. Improvements from this symbiotic relationship between experiment and simulations can be extended to gain a deeper understanding of the fluorophore-linker, fluorophore-protein and fluorophore tumbling behaviors at their chosen sites of attachment. Ultimately, this relationship aims to delineate the most accurate FRET distance relationships possible. For instance, simulations could be employed to inform chemical synthesis efforts to alter the fluorophore linker lengths for a given FRET pair to optimize κ2, all of which can be performed in silico using the approach described above. In so doing, the functional reaction coordinate of interest can be fully revealed to the experimentalist and more informative experiments can be implemented.
Future endeavors in this area of inquiry will benefit from further advances in the experimental strategies used to site-specifically incorporate extrinsic fluorophores within compositionally diverse biomolecules. The bridge between experiment and simulation will be further enhanced by continuing imaging platform and fluorophore developments that further extend smFRET studies into the microsecond time domain. Such time scales will reduce computational burdens and ultimately afford explicit solvent simulations of smFRET data on commensurate time scales. Likewise, as computational resources extend the simulated time scales, the gap between experiment and simulation approaches will decrease. Synergistic endeavors of this kind provide a deeper understanding of complex biological systems and the structural dynamic events that occur during conformational transitions between distinct states evidenced in smFRET experiments. Continued focus on this frontier is ultimately expected to inform the developments that will enable MD simulations to recapitulate more fully biological systems of interest and extend their predictive capacity for increasingly diverse areas of inquiry.
Methods
Modelling of Liv-BP Conjugated to self-healing Cy3 and Cy5 fluorophores
The coordinates for the E. coli LIV-BP protein were accessed from the protein data bank, PDB ID: 1Z15 –super open form, 1Z16 –LIV-BP•Leu, 1Z17 –LIV-BP•Ile, and 1Z18 –LIV-BP•Val [71]. Positions Cys53 and Cys78 were altered to Ser to recapitulate the Liv-BPSS variant rapid kinetics used in Fitzgerald et al. 2019 [68]. Coordinates for the self-healing fluorophores, LD555 and LD655, were designed and geometries optimized with Avogadro 1.2.0 and GAUSSIAN 09, respectively [77,78]. B3LYP theory and a 6-31G(d) basis was used for the GAUSSIAN geometry optimization [79,80]. Positions Asn67 and Asp181 of LIV-BPSS were altered to Cys and the self-healing fluorophores LD555 and LD655 were conjugated to these positions, respectively, through thiol-maleimide reaction. This was repeated for the super open (apo), Leu, Ile, and Val bound LIV-BP complexes.
Explicit solvent equilibrium simulations
Each of the LIV-BPSS systems were solvated in a 12.0 Å SPCE water box and neutralized to a salt concentration of 100 mM KCl and 5 mM MgCl2 with the tleap package in AMBER18. The parameters for the self-healing LD555 and LD655 fluorophores were created in the antechamber AMBER package [82,83]. The potential energy of the water and total LIV-BPSS system was minimized for 1,000 and 10,000 steps, respectively, using a steepest descent approach in AMBER18 using AMBER ff19SB parameters [81]. Each system was heated to 300 K in steps of 10 K over 50 ps using a Langevin thermostat and equilibrated for 300 ps prior to simulation. Simulations were performed with AMBER18 using Lennard-Jones interactions with a 10 Å cut-off, with periodic boundaries and the particle-mesh Ewald method [81]. The SHAKE algorithm was used for all bonds involving a constrained hydrogen. A step size of 2 fs was used in the simulations and all explicit solvent simulations were performed for a minimum of 1 μs, totaling 12 μs of aggregate simulation time.
All atom Gō-like structure-based simulations
All atom Gō-like structure-based simulations were performed as previously described [48,66]. AMBER minimized structures of the apo LIV-BPSS protein or bound to Leu, Ile, or Val (excluding hydrogens) were used as starting structures for structure-based models. Coordinate and topology files for structure-based models were constructed using Smog-2.2 [84]. A multi-basin Gaussian contact potential (Cij) described previously by Noel et al. 2012 [85], was used, defined by:
(1) |
Where, rij is the distance between atoms i and j, α and β are the open and closed conformations, respectively, σNC = 2.5 Å is the excluded volume size, and
(2) |
where, σ is the width of the gaussian well which is set to a depth of -1. With this approach both the open and closed conformations non-bonded contacts, bond-distances, and angles are set as native states. The potential (Vij) used for the simulations is defined by:
(3) |
where, εr = 50 ε0, εθ = 40 ε0, εχi = 10 ε0, εχp = 40 ε0, εNC = 0.1 ε0, ε0 = 1, σNC = 2.5 Å and εFD = ε (1—cosΦ) + ε/2 (1 –cos 3Φ). The parameters r0, θi,0, χi,0, and Φi,0 correspond to the initial bond distances, angles, planar angles, or dihedral angles, respectively, of the structure the simulation was initiated from.
All simulations were performed using a modified version of Gromacs v4.5.4, totaling 300 simulations each at 500 million time steps of size 0.002, for an aggregated 150 billion time steps [86]. The temperature of the simulations was set to 0.02–1.16 in reduced units which was maintained by Langevin dynamics to identify a temperature that reflects dynamics of the LIV-BPSS protein at 300 K under explicit solvent conditions (S1 Text, S12 Fig.). A temperature of 0.3 reduced units was used for all subsequent simulations of LIV-BPSS. Simulations were calibrated against smFRET data to estimate the time scale of each time step ~ 0.25 ns. Additionally, the weights of the contacts were set to 0.2–0.46 to find a conformational equilibrium (S1 Text, S13 Fig.).
Measurements of reaction coordinates
The reaction coordinate Rdye is the distance between the center of mass of the fluorophores, excluding linkers. To obtain a reaction coordinate describing the movements of LIV-BPSS domains that displays the most variation between the open and closed conformations, the distance from the Cαs in domain 1 was measured to each Cα in domain 2 of LIV-BPSS, culminating in a matrix of domain 1 to domain 2 Cα distances. Ser 12 and Ala 237 displayed the largest distance difference between the open and closed conformation, therefore, the distance between these amino acids was used to measure the movements of LIV-BPSS domains (Rdomain).
Rdye was converted into a FRET efficiency value with the equation:
(4) |
where, R0 is the Förster distance of the fluorophore pair defined by:
(5) |
where, NA is Avogadro’s constant, κ2 is the orientation factor, QD is the quantum yield of the donor fluorophore (0.48), and n is the refractive index of the medium and J is the spectral overlap of the two dyes. Simulations performed in implicit solvent were considered to be in a refractive index of water (1.33). The orientation factor was determined by:
(6) |
where, θAD is the angle between the transition dipole moment vector of the acceptor and donor fluorophore, θD is the angle between the transition dipole moment vector of the donor fluorophore and the separation vector between the fluorophores, and θA is the angle between the transition dipole moment vector of the acceptor fluorophore and the separation vector between the fluorophores. The transition dipole moment vectors for the fluorophores was calculated with the Schrodinger computational suite [87].
Resulting FRET efficiencies were fit to a two gaussian distribution to identify the low and high FRET states corresponding to the apo and Leu-bound states. The two gaussian distribution is defined as:
(7) |
where, A and B are the height of the first and second gaussian distribution, and are the mean of the first and second distribution, and σa and σb are the deviation of distribution 1 and 2.
Analysis of structure-based simulations
Approximate Boltzmann-weighted free-energy landscapes were prepared as previously described [41]. In brief, landscapes were calculated using the g_sham package in GROMACS v4.5.4, using the equation:
(8) |
where, ΔG* is the approximate free energy, P(xi) is the probability of being in state i, Pmax(x) is the probability of the most observed state, kB is the Boltzmann constant, and T is the temperature (300 K). The free energy landscape evaluated as a function of Rdom and Rdye. The barrier crossing rate (k) was calculated using the relationship between LIV-BPSS kinetics and free energy profile:
(9) |
where, G is Gibb’s free energy, D(Rdomain) is the effective diffusion coefficient along the reaction coordinate Rdomain, and R is the reaction coordinate. The integral was simplified by using a constant value for D(Rdomain); however, the maximum and minimum diffusion coefficients were considered providing a range of estimated rates.
Analysis of smFRET data
Experimental smFRET data were previously published [70]. In that study, smFRET imaging was performed using a custom-built total internal reflection fluorescence (TIRF) microscope using scientific complementary metal-oxide semiconductor (sCMOS) detectors [5]. LD555 was excited using a 532 nm laser (Laser Quantum) and imaged at the specified time resolution. The fluorescence from LD555 and LD655 was separated using a T635lpxr dichoric (Chroma) and projected onto two Flash 4.0 v2 sCMOS cameras (Hamamatsu) using a MultiCam LS device (Cairn). Microfluidic imaging chambers passivated with a mixture of PEG and biotin-PEG were incubated for 5 min each with 0.8 μM streptavidin (Invitrogen) and 10 nM biotin-tris-NTA-Ni2+ [68,70,88]. His-tagged LIV-BPSS (a construct with native disulfide residues removed and labeled at engineered cysteine residues in positions 67 and 181) was surface immobilized via the His-tag:Ni2+ interaction in for 2 minutes. All experiments were performed with 30 mM Tris (pH 7) and 150 mM NaCl buffer and with leucine present at 4.5 μM, which corresponds to the KD for LIV-BPSS where bound and unbound states are equally occupied. Wide-field TIRF movies acquired in this way were analyzed with SPARTAN [5], including corrections for donor to acceptor crosstalk, gamma, and acceptor direct excitation [89,90].
Fluorescence quantum yield and anisotropy measurements
Absolute fluorescence quantum yield of LD555 labeled LIVBPSS was measured in a FluoTime 300 spectrometer using integrating sphere accessories (PicoQuant GmbH, Berlin) and a 300 W xenon excitation lamp. The absorbance of the sample at the excitation wavelength was adjusted to 0.02 in a Shimadzu UV-2600 spectrometer to minimize re-absorption of emitted photons. For the absolute quantum yield measurements, the sample was photoexcited at 517 nm and the photons were detected from 512 to 700 nm.
Steady-state fluorescence anisotropy of LD555-LIVBPSS was also recorded in FluoTime 300 spectrometer using 300 W xenon lamp as an excitation source. The sample was photoexcited at 517 nm and the anisotropy was recorded in the emission range 550–650 nm. All the measurements were carried out in a buffer containing 30 mM Tris (pH 7) and 150 mM NaCl at room temperature using standard 1 cm path length quartz cuvettes (Starna Cells, Inc.). The data were analyzed in EasyTau software (PicoQuant GmbH, Berlin).
Supporting information
Acknowledgments
The authors would like to thank all the members of the Sanbonmatsu and Blanchard lab for insightful and valuable comments as well as support from the single-molecule imaging center at St. Jude Children's Hospital. We thank Gerhard Hummer for useful discussions.
Data Availability
All relevant data are within the manuscript and its Supporting Information files.
Funding Statement
This work was supported by NIH NIGMS grant R01-GM072686 and DOE LANL LDRD 20200222DR (to KS), by NSF, by NIH NIGMS grants R01-GM079238-13 and R01-GM098859-07 (to SCB). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Hollingsworth SA, Dror RO. Molecular dynamics simulation for all. Neuron. 2018;99:1129–43. 10.1016/j.neuron.2018.08.011 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Karplus M. Molecular dynamics of biologic macromolecules: a brief history and perspective. Biopolymers. 2003;68(3):350–8. 10.1002/bip.10266 [DOI] [PubMed] [Google Scholar]
- 3.McCammon JA, Gelin BR, Karplus M. Dynamics of folded proteins. Nature. 1977;267(5612):585–90. 10.1038/267585a0 [DOI] [PubMed] [Google Scholar]
- 4.Chung HS, McHale K, Louis JM, Eaton WA. Single-molecule fluorescence experiments determine protein folding transition path times. Science. 2012;335(6071):981–4. 10.1126/science.1215768 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Juette MF, Terry DS, Wasserman MR, Altman RB, Zhou Z, Zhao H, et al. Single-molecule imaging of non-equilibrium molecular ensembles on the millisecond timescale. Nat Methods. 2016;13(4):341–4. 10.1038/nmeth.3769 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Muñoz V, Cerminara M. When fast is better: protein folding fundamentals and mechanisms from ultrafast approaches. Biochem J. 2016;473(17):2545–59. 10.1042/BCJ20160107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Sturzenegger F, Zosel F, Holmstrom ED, Buholzer KJ, Makarov DE, Nettels D, et al. Transition path times of coupled folding and binding reveal the formation of an encounter complex. Nat Commun. 2018;9(1):4708 10.1038/s41467-018-07043-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Pan AC, Jacobson D, Yatsenko K, Sritharan D, Weinreich TM, Shaw DE. Atomic-level characterization of protein-protein association. Proc Natl Acad Sci U S A. 2019;116(10):4244–9. 10.1073/pnas.1815431116 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Wang Q, Pechersky Y, Sagawa S, Pan AC, Shaw DE. Structural mechanism for Bruton’s tyrosine kinase activation at the cell membrane. Proc Natl Acad Sci USA. 2019;116(19):9390–9. 10.1073/pnas.1819301116 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Bucher D, Grant BJ, Markwick PR, McCammon JA. Accessing a hidden conformation of the maltose binding protein using accelerated molecular dynamics. PLoS Comput Biol. 2011;7(4). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Lindorff-Larsen K, Maragakis P, Piana S, Shaw DE. Picosecond to millsecond structural dynamics in human ubiquitin. J Phys Chem B. 2016;120:83313–8320. [DOI] [PubMed] [Google Scholar]
- 12.Olsson S, Noé F. Mechanistic models of chemical excahnge induced relaxation in protein NMR. J Am Chem Soc. 2016;139:200–10. 10.1021/jacs.6b09460 [DOI] [PubMed] [Google Scholar]
- 13.Patel DS, Pendrill R, Mallajosyula SS, Widmalm G, MacKerells AD. Conformational properties of α- or β-(1→6)-Linked Oligosaccharides:Hamiltonian Replica Exchange MD Simulations and NMR Experiments. J Phys Chem B. 2014;118:2851–71. 10.1021/jp412051v [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Shaw DE, Maragakis P, Lindorff-Larsen K, Piana S, Dror RO, Eastwood MP, et al. Atomic-level characterization of the structural dynamics of proteins. Science. 2010;330(6002):341–6. 10.1126/science.1187409 [DOI] [PubMed] [Google Scholar]
- 15.Trbovic N, Kim B, Friesner RA, Palmer A. Structural analysis of protein dynamics by MD simulations and spin-relaxation. Proteins. 2009;71(2):684–94. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Xue Y, Ward JM, Yuwen T, Podkorytov IS, Skrynnikov NR. Microsecond time-scale conformational exchange in proteins: using long molecular dynamics trajectory to simulate NMR relaxation dispersion data. J Am Chem Soc. 2012;134:2555–62. 10.1021/ja206442c [DOI] [PubMed] [Google Scholar]
- 17.Xue Y, Yuwen T, Zhu F, Skrynnikov NR. Role of electrostatic interactions in binding of peptides and intrinsically disordered proteins to their folded targets. 1. NMR and MD characterization of the complex between the c-Crk N-SH3 domain and the peptide Sos. Biochemistry. 2014;53:6473–95. 10.1021/bi500904f [DOI] [PubMed] [Google Scholar]
- 18.Yang M, Angles d'Ortoli T, Sawen E, Jana M, Widmalm G, MacKerell AD. Delineating the conformational flexibility of trisaccharides from NMR spectroscopy experiments and computer simulations. Phys Chem Chem Phys. 2016;18(28):18776–94. 10.1039/c6cp02970a [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Yuwen T, Xue Y, Skrynnikov NR. Role of electrostatic interactions in binding of peptides and intrinsically disordered proteins to their folded targets: 2. The model of encounter complex involving the double mutant of the c-Crk N-SH3 domain and peptide Sos. Biochemistry. 2016;55(1784–1800). [DOI] [PubMed] [Google Scholar]
- 20.Zhang L, Bouguet-Bonnet S, Buck M. Combining NMR and Molecular Dynamics studies for insights into allostery of small GTPase-protein interactions. Methods Mol Biol. 2012;796:235–59. 10.1007/978-1-61779-334-9_13 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Fujinami D, Motomura H, Oshima H, Mahin A-A, Elsayed KM, Zendo T, et al. Mosaic cooperativity in slow polypeptide topological isomerization revealed by residue-specific NMR thermodynmaic analysis. J Phys Chem Lett. 2020;11:1934–9. 10.1021/acs.jpclett.9b03591 [DOI] [PubMed] [Google Scholar]
- 22.Jang H, Banerjee A, Marcus K, Makowski L, Mattos C, Gaponenko V, et al. The Structural Basis of the Farnesylated and Methylated KRas4B Interaction with Calmodulin. Structure. 2019;27(11):1647–59 e4. 10.1016/j.str.2019.08.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Muratcioglu S, Chavan TS, Freed BC, Jang H, Khavrutskii L, Freed RN, et al. GTP-Dependent K-Ras Dimerization. Structure. 2015;23(7):1325–35. 10.1016/j.str.2015.04.019 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Bowerman S, Rana ASJB, Rice A, Pham GH, Strieter ER, Wereszczynski J. Determining atomistic SAXS models of tri-ubiquitin chains from bayesian analysis of accelerated molecular dynamics simulations. J Chem Theory Comput. 2017;13:2418–29. 10.1021/acs.jctc.7b00059 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Brosey CA, Tainer JA. Evolving SAXS versatility: solution X-ray scattering for macromolecular architecture, funcitonal landscapes, and integrative structural biology. Curr Opin Struct Biol. 2019;58:197–213. 10.1016/j.sbi.2019.04.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Brosey CA, Yan C, Tsutakawa SE, Heller WT, Rambo RP, Tainer JA, et al. A new structural framework for integrating replication protein A into DNA processing machinery. Nucleic Acids Res. 2013;41(4):2313–27. 10.1093/nar/gks1332 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Chen PC, Hub JS. Validating solution ensembles from molecular dynamics simulation by wide-angle X-ray scattering data. Biophys J. 2014;107(2):435–47. 10.1016/j.bpj.2014.06.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Debiec KT, Whitley MJ, Koharudin LMI, Chong LT, Gronenborn AM. Integrating NMR, SAXS, and Atomistic Simulations: Structure and Dynamics of a Two-Domain Protein. Biophys J. 2018;114(4):839–55. 10.1016/j.bpj.2018.01.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Kim DN, Thiel BC, Mrozowich T, Hennelly SP, Hofacker IL, Patel TR, et al. Zinc-finger protein CNBP alters the 3-D structure of lncRNA Braveheart in solution. Nat Commun. 2020;11(1):148. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Knight CJ, Hub JS. WAXSiS: a web server for the calculation of SAXS/WAXS curves based on explicit-solvent molecular dynamics. Nucleic Acids Res. 2015;43(W1):W225–30. 10.1093/nar/gkv309 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Plumridge A, Katz AM, Calvey GD, Elber R, Kirmizialtin S, Pollack L. Revealing the distinct folding phases of an RNA three-helix junction. Nucleic Acids Res. 2018;46(14):7354–65. 10.1093/nar/gky363 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Shi J, Nobrega RP, Schwantes C, Kathuria SV, Bilsel O, Matthews CR, et al. Atomistic structural ensemble refinement reveals non-native structure stabilizes a sub-millisecond folding intermediate of CheY. Sci Rep. 2017;7:44116 10.1038/srep44116 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Tsutakawa SE, Van Wynsberghe AW, Freudenthal BD, Weinacht CP, Gakhar L, Washington MT, et al. Solution X-ray scattering combined with computational modeling reveals multiple conformations of covalently bound ubiquitin on PCNA. Proc Natl Acad Sci U S A. 2011;108(43):17672–7. 10.1073/pnas.1110480108 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Weiel M, Reinartz I, Schug A. Rapid interpretation of small-angle X-ray scattering data. PLoS Comput Biol. 2019;15(3):e1006900 10.1371/journal.pcbi.1006900 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Yang K, Różycki B, Cui F, Shi C, Chen W, Li Y. Sampling enrichment toward target structures using hybrid molecular dynamics-Monte Carlo simualtions. PLoS One. 2016;11(5). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Peng Y, Cao S, Kiselar J, Xiao X, Du Z, Hsieh A, et al. A metastable contact and structural disorder in the estrogen receptor transactivation domain. Structure. 2019;27(2):229–40 e4. 10.1016/j.str.2018.10.026 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Yang S. Methods for SAXS-based structure determination of biomolecular complexes. Adv Mater. 2014;26(46):7902–10. 10.1002/adma.201304475 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Tuukkanen AT, Spilotros A, Svergun DI. Progress in small-angle scattering from biological solutions at high-brilliance synchrotrons. IUCrJ. 2017;4:518–28. 10.1107/S2052252517008740 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Song L, Yang L, Meng J, Yang S. Thermodynamics of Hydrophobic Amino Acids in Solution: A Combined Experimental-Computational Study. J Phys Chem Lett. 2017;8(2):347–51. 10.1021/acs.jpclett.6b02673 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Hurst T, Xu X, Zhao P, Chen S-J. Qantitative understanding of SHAPE mechanism from RNA structure and dynamics analysis. J Phys Chem B. 2018;122(18):4771–83. 10.1021/acs.jpcb.8b00575 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Doyle SM, Shastry S, Kravats AN, Shih Y-H, Miot M, Hoskins JR, et al. Interplay between E. coli DnaK, ClpB and GrpE during protein disaggregation. J Mol Biol. 2015;427:312–27. 10.1016/j.jmb.2014.10.013 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Bao H, Das D, Courtney NA, Jiang Y, Briguglio JS, Lou X, et al. Dynamics and number of trans-SNARE complexes determine nascent fusion pore properties. Nature. 2018;554(7691):260–3. 10.1038/nature25481 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Leonarski F, Jasiński M, Trylska J. Thermodynamics of the fourU RNA thermal switch derived from molecular dynamics simualtions and spectroscopic techniques. Biochimie. 2018;156:22–32. 10.1016/j.biochi.2018.09.005 [DOI] [PubMed] [Google Scholar]
- 44.Wojciechowska M, Dudek M, Trylska J. Thermodynamics of the pseudo-knot in helix 18 of 16S ribosomal RNA. Biopolymers. 2017;109(4):e23116. [DOI] [PubMed] [Google Scholar]
- 45.Jasiński M, Kulik M, Wojciechowska M, Stolarski R, Trylska J. Interactions of 2’-O-methyl oligoribonucleotides with the RNA models of the 30S subunit A-site. PLoS One. 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Miao Z, Adamiak RW, Antczak M, Boniecki MJ, Bujnicki JM, Chen SJ, et al. RNA-Puzzles Round IV: 3D structure predictions of four ribozymes and two aptamers. RNA. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Gahlon HL, Romano LJ, Rueda D. Influence of DNA lesions on polymerase-mediated DNA replication at single-molecule resolution. Chem Res Toxicol. 2017;30:1972–83. 10.1021/acs.chemrestox.7b00224 [DOI] [PubMed] [Google Scholar]
- 48.Morse JC, Girodat D, Burnett BJ, Holm M, Altman RB, Sanbonmatsu KY, et al. Elongation factor-Tu can repetively engage aminoacyl-tRNA within the ribosome during the proofreading stage of tRNA selection. Proc Natl Acad Sci USA. 2020;117(7):3610–20. 10.1073/pnas.1904469117 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Gahlon HL, Walker AR, Cisneros GA, Lamers MH, Rueda DS. Reduced structural flexibility for an exonuclease deficient DNA polymerase III mutant. Phys Chem Chem Phys. 2018;20(42):26892–902. 10.1039/c8cp04112a [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Blanchard SC, Gonzalez RL, Kim HD, Chu S, Puglisi JD. tRNA selection and kinetic proofreading in translation. Nat Struct Mol Biol. 2004;11(10):1008–14. 10.1038/nsmb831 [DOI] [PubMed] [Google Scholar]
- 51.Gregorio GG, Masureel M, Hilger D, Terry DS, Juette M, Zhao H, et al. Single-molecule analysis of ligand efficacy in β2AR-G-protein activation. Nature. 2017;547:68–73. 10.1038/nature22354 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Geggier P, Dave R, Feldman MB, Terry DS, Altman RB, Munro JB, et al. Conformational sampling of aminoacyl-tRNA during selection on the bacterial ribosome. J Mol Biol. 2010;399(4):576–95. 10.1016/j.jmb.2010.04.038 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Matsunaga Y, Sugita Y. Linking time-series of single-molecular experiments with moelcular dynamics simulations by machine learning. Elife. 2018;7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Matsunaga Y, Sugita Y. Use of single-molecule time-series data for refining conformational dynamics in molecular simulations. Curr Opin Struct Biol. 2020;61:153–9. 10.1016/j.sbi.2019.12.022 [DOI] [PubMed] [Google Scholar]
- 55.LeVine MV, Terry DS, Khelashvili G, Siegel ZS, Wuick M, Javitch JA, et al. The allosteric mechanism of substrate-specific tranpsort in SLC6 is mediated by a volumetric sensor. Proc Natl Acad Sci USA. 2019;116(32):15947–56. 10.1073/pnas.1903020116 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Zheng Y, Cui Q. Multiple pathways and time scales for conformational transition in apo—adenylate kinase. J Chem Theory Comput. 2018;14(3):1716–26. 10.1021/acs.jctc.7b01064 [DOI] [PubMed] [Google Scholar]
- 57.Fernandes DD, Bamrah J, Kailasam S, Gomes GW, Li Y, Wieden HJ, et al. Characterization of fluorescein arsenical hairpin (FlAsH) as a probe for single-molecule fluorescence spectroscopy. Sci Rep. 2017;7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Hoefling M, Lima N, Haennia D, Seidel CAM, Schuler B, Grubmüller H. Structural heterogeneity and quantitative FRET efficiency distributions of polyprolines thorugh a hybrid atomisitic simulation and Monte Carlo approach. PloS One. 2016;11(5). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Woźniak AK, Schröder GF, Grubmüller H, Seidel CAM, Oesterhelt F. Single-molecule FRET measures bends and kinds in DNA. Proc Natl Acad Sci USA. 2008;105(47):18337–42. 10.1073/pnas.0800977105 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Matsunaga Y, Kidera A, Sugita Y. Sequential data assimilation for single-molecuel FRET photon-counting data. J Chem Phys. 2015;142. [DOI] [PubMed] [Google Scholar]
- 61.Shaw DE, Grossman JP, Bank JA, Batson B, Butts JA, Chao JC, et al. Anton 2: raising the bar for performance and programmability in a special-purpose molecular dynamics supercomputer. SC14: international conference for high performance computing, networking, storage, and analysis 2014. [Google Scholar]
- 62.Stelzl LS, Kells A, Rosta E, Hummer G. Dynamic histogram analysis to determine free energies and rates from biased simulations. J Chem Theory Comput. 2017;13(12):6328–42. 10.1021/acs.jctc.7b00373 [DOI] [PubMed] [Google Scholar]
- 63.Best RB, Chen YG, Hummer G. Slow protein conformational dynamics from mutiple experimental structures: the helix/sheet transition of Arc repressor. Structure. 2005;13:1755–63. 10.1016/j.str.2005.08.009 [DOI] [PubMed] [Google Scholar]
- 64.Whitford PC, Noel JK, Gosavi S, Schug A, Sanbonmatsu KY, Onuchic JN. An all-atom structure-based potential for proteins: bridging minimal models with all-atom empirical forcefields. Proteins. 2009;75(2):430–41. 10.1002/prot.22253 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Whitford PC, Geggier P, Altman RB, Blanchard SC, Onuchic JN, Sanbonmatsu KY. Accommodation of aminoacyl-tRNA into the ribosome involves reversible excursions along multiple pathways. RNA. 2010;16(6):1196–204. 10.1261/rna.2035410 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Girodat D, Blanchard SC, Wieden HJ, Sanbonmatsu KY. Elongation Factor-Tu switch I is a gate for aminoacyl-tRNA selection. J Mol Biol. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Altman RB, Terry DS, Zhou Z, Zheng Q, Geggier P, Kolster RA, et al. Cyanine fluorophore derivatives with enhanced photostability. Nat Methods. 2012;9(1):68–71. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Fitzgerald GA, Terry DS, Warren AL, Quick M, Javitch JA, Blanchard SC. Quantifying secondary transport at single-molecule resolution. Nature. 2019;575:528–34. 10.1038/s41586-019-1747-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Zheng Q, Juette MF, Jockusch S, Wasserman MR, Zhou Z, Altman RB, et al. Ultra-stable organic fluorophores for single-molecule research. Chem Soc Rev. 2014;43(4):1044–56. 10.1039/c3cs60237k [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Pati AK, Bakouri OE, Jockushc S, Zhou Z, Altman RB, Fitzgerald GA, et al. Tuning the Baird aromatic triplet state energy of cyclooctatetraene to maximize the self-healing mechanism in organic fluorophores. Proc Natl Acad Sci USA. 2020; 117(39); 24305–15. 10.1073/pnas.2006517117 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Trakhanov S, Vyas NK, Luecke H, Kristensen DM, Ma J, Quiocho FA. Ligand-free and -bound structure of the biding protein (Liv J) of the Escherichi coli ABC leucine/isoleucine/valine transport system: trajectory and dynamics of the interdomain rotation and ligand specificity. Biochemistry. 2005;44:6597–608. 10.1021/bi047302o [DOI] [PubMed] [Google Scholar]
- 72.Lammert H, Schug A, Onuchic JN. Robustness and generalization of structure-based models for portien folding and function. Proteins. 2010;77:881–91. [DOI] [PubMed] [Google Scholar]
- 73.Noel JK, Sulkowska JI, Onuchic JN. Slipknotting upon native-like loop formation in a trefoil knot protein. Proc Natl Acad Sci USA. 2010;107(35):15403–8. 10.1073/pnas.1009522107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Yang H, Bandarkar P, Horne R, Leite VBP, Chahine J, Whitford PC. Diffusion of tRNA inside the ribosome is position-dependent. J Chem Phys. 2019;151. [DOI] [PubMed] [Google Scholar]
- 75.Bryngelson JD, Wolynes PG. Intermediates and barrier crossing in a random energy model (with applications to protein folding). J Phys Chem. 1989;93:6902–15. [Google Scholar]
- 76.Zwanzig R. Diffusion in a rough potential. Proc Natl Acad Sci USA. 1988;85:2029–30. 10.1073/pnas.85.7.2029 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Frisch MJ, Trucks GW, Schlegel HB, Scuseria GE, Robb MA, CRJ R. et al. Gaussian 09, Revision E.01. Wallingford CT: Gaussian, Inc; 2016. [Google Scholar]
- 78.Hanwell MD, Curtis DE, Lonie DC, Vandermeersch T, Zurek E, Hutchison GR. Avogadro: an advanced semantic chemical editor, visualization, and analysis platform. J Cheminform. 2012;4(1):17 10.1186/1758-2946-4-17 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Becke AD. Density-functional thermochemistry. III. The role of exact exchange. J Chem Phys. 1993;98(7):5648. [Google Scholar]
- 80.Stephen PJ, Devlin FJ, Chablowski CF, Frisch MJ. Ab initio calculation of vibrational absorption and circular dichroism spectra using density functional force fields. J Phys Chem. 1994;98(45):11623–7. [Google Scholar]
- 81.Case DA, Ben-Shalom IY, Brozell SR, Cerutti DS, Cheatham TEI, Cruzeiro VWD, et al. AMBER 2018. University of California, San Francisco: 2018. [Google Scholar]
- 82.Wang J, Wolf RM, Caldwell JW, Kollman PA, Case DA. Development and testing of a general amber force field. J Comput Chem. 2004;25(9):1157–74. 10.1002/jcc.20035 [DOI] [PubMed] [Google Scholar]
- 83.Wang J, Wang W, Kollman PA, Case DA. Automatic atom type and bond type perception in molecular mechanical calculations. J Mol Graph Model. 2006;25(2):247–60. 10.1016/j.jmgm.2005.12.005 [DOI] [PubMed] [Google Scholar]
- 84.Noel JK, Levi M, Raghunathan M, Lammert H, Hayes RL, Onuchic JN, et al. SMOG 2: A Versatile Software Package for Generating Structure-Based Models. PLoS Comput Biol. 2016;12(3):e1004794 10.1371/journal.pcbi.1004794 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Noel JK, Onuchic JN. The many faces of structure-based potentials: from protein folding landscapes to structural characterization of complex biomolecules In: Dokholyan N, editor. Computational modeling of biological systems Biological and Medical Physics, Biomedical Engineering. Boston, MA: Springer; 2012. [Google Scholar]
- 86.Hess B, Kutzner C, Van Der Spool D, Lindahl E. GROMACS 4: algorithms for highly efficient, load-balanced, and scalable molecular simulations. J Chem Theory Comput. 2008;4:435–47. 10.1021/ct700301q [DOI] [PubMed] [Google Scholar]
- 87.Schrödinger Release 2020–1. New York, NY: Jaguar, Schrödinger, LLC; 2020. [Google Scholar]
- 88.Lata S, Reichel A, Brock A, Tampé R, Piehler J. High-affinity adaptors for switchable recognition of histidine-tagged proteins. J Am Chem Soc. 2005;127(29):10205–15. 10.1021/ja050690c [DOI] [PubMed] [Google Scholar]
- 89.Hellenkamp B, Schmid S, Doroshenko O, Opanasyuk O, Kühnemuth R, Adariani SR, et al. Precision and accuracy of single-molecule FRET measurements—a multi-laboratory benchmark study. Nat Methods. 2018;15:669–76. 10.1038/s41592-018-0085-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Schuler B. Application of single molecule Förster Resonance Energy Transfer to protein folding In: Bai Y, Nussinov R, editors. Protein Folding Protocols Methods in Molecular Biology. 350: Humana Press; 2007. 10.1385/1-59745-189-4:169 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All relevant data are within the manuscript and its Supporting Information files.