Skip to main content

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

bioRxiv logoLink to bioRxiv
[Preprint]. 2024 Jun 3:2024.06.03.597137. [Version 1] doi: 10.1101/2024.06.03.597137

Accounting for fast vs slow exchange in single molecule FRET experiments reveals hidden conformational states

Justin J Miller 1,2, Upasana L Mallimadugula 2, Maxwell I Zimmerman 2, Melissa D Stuchell-Brereton 2, Andrea Soranno 2, Gregory R Bowman 1,2,
PMCID: PMC11185552  PMID: 38895430

Abstract

Proteins are dynamic systems whose structural preferences determine their function. Unfortunately, building atomically detailed models of protein structural ensembles remains challenging, limiting our understanding of the relationships between sequence, structure, and function. Combining single molecule Förster resonance energy transfer (smFRET) experiments with molecular dynamics simulations could provide experimentally grounded, all-atom models of a protein’s structural ensemble. However, agreement between the two techniques is often insufficient to achieve this goal. Here, we explore whether accounting for important experimental details like averaging across structures sampled during a given smFRET measurement is responsible for this apparent discrepancy. We present an approach to account for this time-averaging by leveraging the kinetic information available from Markov state models of a protein’s dynamics. This allows us to accurately assess which timescales are averaged during an experiment. We find this approach significantly improves agreement between simulations and experiments in proteins with varying degrees of dynamics, including the well-ordered protein T4 lysozyme, the partially disordered protein apolipoprotein E (ApoE), and a disordered amyloid protein (Aβ40). We find evidence for hidden states that are not apparent in smFRET experiments because of time averaging with other structures, akin to states in fast exchange in NMR, and evaluate different force fields. Finally, we show how remaining discrepancies between computations and experiments can be used to guide additional simulations and build structural models for states that were previously unaccounted for. We expect our approach will enable combining simulations and experiments to understand the link between sequence, structure, and function in many settings.

Introduction

A protein’s function is determined by the ensemble of structures that it adopts16, but building atomically-detailed models of these ensembles to probe ensemble-function relationships remains challenging7. Of course, the high-resolution structures that structural biologists have become adept at solving are of enormous value. Despite this, ongoing challenges with tasks like drug and protein design highlight the limits of the structure-function paradigm 8. We expect having detailed models of the rest of a protein’s structural ensemble would lead to dramatic improvements in our understanding and ability to design such systems911. Unfortunately, most of the alternative structures a protein can adopt are difficult to detect and/or characterize experimentally because they have too low of a probability (i.e. high energy)1214.

Single molecule Förster resonance energy transfer (smFRET) experiments are a powerful tool for studying the distribution of structures that a protein adopts, including high energy states that are invisible to many other techniques 1517. In these experiments, a donor and an acceptor fluorophore are attached to two different residues in a protein. The donor fluorophore on a single protein is then excited and one measures how many acceptor and donor photons are emitted 18,19. The probability of transferring energy from the donor to the acceptor fluorophore, called the FRET efficiency, reports on a variety of valuable structural properties, including the distance between the fluorophores, their relative orientations, and the timescale on which they are rotating. Making many measurements results in a probability distribution of FRET efficiencies. These FRET efficiency distributions report on the distribution of structures the protein adopts and have proved to be a powerful means of revealing the conformational heterogeneity of proteins 2023.

Unfortunately, one cannot extract atomically-detailed structural models from smFRET data in a manner analogous to fitting structures to electron density from crystallography or cryoEM. smFRET data is inherently sparse, with each experiment reporting on the structure and dynamics of a single pair of dyes. One can perform experiments for multiple dye positions to learn about more of the protein structure. However, each experiment is independent, making it hard to discern any correlations between the behavior of different parts of the protein. While multi-color FRET experiments are being developed24, they are quite challenging to perform and still can’t measure many distances in parallel. Another challenge is that there is not a one-to-one mapping between the FRET efficiency and the distance between a pair of residues.

Combining atomically detailed computer simulations with smFRET experiments could yield experimentally grounded models of protein conformational ensembles with the desired resolution25,26. Ideally, there would be a method to predict energy transfer distributions from simulations to show that these predictions were in perfect agreement with smFRET experiments. Then one could analyze the simulations, making use of the atomistic structural and dynamical information they provide to generate new hypotheses, and test those hypotheses experimentally.

While there are cases where smFRET experiments and simulations are in good agreement, the agreement between the two approaches is often limited 25. A variety of approaches have been employed to close this gap. For example, scaling factors have been used to shift computational predictions into closer alignment with experiments 27,28. Others have employed reweighting schemes to shift the relative probabilities of structures from their simulations and bring their predictions into closer agreement with experiments 2931. There have also been efforts to develop improved methods for predicting the probability of energy transfer from simulations (e.g. by modeling in the dyes) and to improve force fields 26,28,3242. However, there is still room for improvement.

Here, we explore the importance of accounting for kinetic effects in smFRET experiments when connecting with simulations. Each FRET efficiency measured in an smFRET experiment is the ratio of acceptor photons to all photons emitted during some time interval. This time interval typically ranges from one to ten milliseconds depending on the experimental setup (e.g. TIRF vs diffusion confocal, laser power, etc.) 22,23. It has long been recognized in the smFRET community that this means each FRET efficiency measured is, therefore, averaging across whatever conformational dynamics occur during the one to ten millisecond time interval. As in NMR experiments, conformations that are exchanging more quickly than this measurement time will be averaged together, while conformations that are exchanging more slowly will not. Significant effort has gone into dealing with this time-averaging when analyzing experiments 4353. For example, it is common to perform global fits to many measurements with different dye positions and solvent conditions 5456, fit hidden Markov models to photon traces 21,46,51, or dissect the correlation between FRET efficiency and other fluorescence observables that report on shorter timescales 44,45,55. New experimental approaches are also being developed to shorten the timescale over which FRET efficiencies are measured 5761. Nonetheless, accounting for time-averaging could dramatically improve agreement between experiments and simulations enabling these two approaches to be used even more effectively to advance our understanding of the ensemble-function relationship.

We present an approach for accounting for time-averaging when predicting FRET efficiencies from simulations and assess its performance on three well-studied systems that exemplify different extents of dynamics. We start with Apolipoprotein E4 (ApoE4), as it contains both ordered and disordered regions and addresses the applicability of our approach to each 54,62. We also apply our approach to T4 Lysozyme, a well-ordered system that has recently been extensively characterized using 33 distinct smFRET labeling positions 55. Finally, we apply our approach to amyloid-β40 (Aβ40), a 40 amino acid highly disordered protein 28. One recent study produced simulations of Aβ40 with a variety of force fields, giving us the chance to test how well different force fields perform when combined with our approach for accounting for time-averaging when predicting the experimentally observed energy transfer distribution 63.

Results

Accounting for time-averaging dramatically improves agreement between simulations and experiments for a partially-disordered protein

We developed an approach for predicting FRET efficiencies from simulations in a manner that accounts for time-averaging by drawing on Markov state models (MSMs) 6467 built from molecular dynamics simulations. An MSM is a network model that describes a molecule’s conformational space in terms of the structural states it adopts and the probabilities of hopping between every pair of states in a fixed time interval. These models integrate information from many independent simulations to capture length and time scales that are far beyond the reach of any individual simulation. Importantly, we can use an MSM to generate a synthetic trajectory using a kinetic Monte Carlo scheme in which one chooses a random starting state and then iteratively adds new random states based on the transition probabilities from the current state to all other states. To mimic a smFRET experiment, we use one MSM to describe the protein’s conformational dynamics and separate MSMs for each of the dyes. First, we select a random experimental photon time trace and use our protein MSM to generate a synthetic trajectory of the same length. Then we identify conformations in our synthetic trajectory that correspond to the times photons were detected in the experiment. We assume these are the conformations that emit photons and then choose whether to label each photon as coming from the donor or acceptor dye as follows. First, we generate a set of plausible dye conformations by mapping representative structures from each state in our dye MSM onto the protein structure and removing any that form steric clashes with the protein. Next, we use our dye MSMs to simulate the dynamics of the dye (on a fixed protein structure) leading up to emission of a photon. At each step of these dye simulations, we use a Monte Carlo move to decide if the donor emits a photon, transfers energy leading to emission of an acceptor photon, or stays excited similar to previous efforts 40,41. If the dye remains excited, both dyes are allowed to hop to another state in the MSM. We repeat this process for the remainder of the synthetic trajectory to simulate a photon burst, returning the average FRET efficiency, or the number of acceptor photons divided by the total photons, for that burst. Finally, we repeat this process over multiple trajectories until an adequate number of photon bursts have been sampled. Since we model dyes as a post-processing step instead of including the dyes in the simulations, it is easy to scale this approach to predict the observed FRET for many dye positions. Furthermore, simulating the dye dynamics allows us to minimize the number of adjustable parameters, as we do not need to select constant values like a Förster radius that are required by other approaches.

To test our approach, we applied it to the partially disordered protein ApoE4. Our recent work presented smFRET measurements for five different pairs of dye positions on this protein. Some of these dye positions report on dynamics within the largely folded N-terminal domain while others report on the partially disordered and highly flexible C-terminal domain 54. Therefore, comparisons between simulations of this protein and experiments speak to the utility of our approach for both well-folded and disordered structures. We previously showed that ApoE4 predominately adopts three conformational states: a closed state, an open state, and an extended state. Identifying these three states experimentally required an enormous number of measurements, including multiple dye positions each at varying levels of denaturant. Even with this wealth of data, building atomically detailed structural models of the different states required extensive molecular dynamics simulations, totaling over 3 ms of aggregate simulation, which we showed were in reasonable agreement with experiments using a simplified version of the approach presented here. That approach did not model dye dynamics and, therefore, required us to choose constant values for parameters like the fluorescence anisotropy used in the Förster radius.

Assessing different ways of predicting FRET efficiencies from simulations and examining the actual distance distributions in those simulations highlights the importance of accounting for time-averaging (Figure 1). For example, Figure 1A shows the modeled inter-dye distance distribution between residues 5 and 86 in the folded, N-terminal domain. This distribution has two peaks, which roughly correspond to the closed and open states of ApoE4 (Figure 1B). If one assumes that smFRET measurements are instantaneous (i.e. there is no time-averaging), then the distribution of FRET efficiencies that one predicts retains these two peaks. However, accounting for time-averaging causes these two peaks to collapse into a single peak because the different populations are in fast exchange (Figure 1C). Importantly, the FRET efficiency we predict by accounting for time-averaging is in good agreement with the experimental data. Without accounting for time-averaging, we would have come to the erroneous conclusion that our simulations were in poor agreement with experiments. By accounting for time-averaging, we instead find good agreement with experiments and can use the simulation data to help identify the different populations that give rise to the experimentally observed smFRET data.

Figure 1: Accounting for time averaging significantly alters the apparent structural distribution from our model and increases agreement with experiments.

Figure 1:

(A) Inter-dye distances for apolipoprotein E labeled with Alexafluor 488 and Alexafluor 594 at positions 5 and 86 or (D) 223 and 291. In red is the equilibrium (instantaneous) distribution accounting for the distance added or subtracted by dye positioning (B,E) Exemplar structures of ApoE at two distinct dye-distance positions. Arrows indicate the portion of the distance distribution the structure occupies. (C) FRET efficiencies obtained for positions 5 and 86 or (F) 223 and 291. In black is the experimental distribution, in red is the result when not accounting for conformational dynamics of ApoE, and in purple is the time-averaged trace of the red trace.

Repeating the analysis above for the other dye positions supports the importance of accounting for time-averaging. For example, the modeled inter-dye distance distribution between residues 223 and 291 in the disordered C-terminal domain is broad and symmetrical (Figure 1D). The distribution of FRET efficiencies one would predict without accounting for time-averaging is skewed to large FRET efficiencies, in poor agreement with the experimentally observed distribution of FRET efficiencies. Accounting for time-averaging improves the agreement between simulations and experiments (Figure 1F). Importantly, the MSM accounts for motions occurring over multiple timescales enabling us to automatically average together states which are interconverting rapidly in a single energy basin while simultaneously capturing the differences between states that are interconverting slowly and thus broadening the histogram. Similar results are found for other dye positions (Figure S1). Treating the dyes as a point cloud rather than modeling their dynamics also gives similar results (Figure S2), though this approach requires the choice of a constant Förster radius that can be a source of error if a poor choice is made or if the dyes are not isotropically rotating.

Time-averaging improves experiment-simulation agreement for the entire spectrum from ordered to disordered proteins

Given that ApoE has a mix of ordered and disordered regions, we reasoned that our time-averaging approach should be equally applicable to fully ordered and disordered systems. To test this hypothesis further, we used our approach on the highly ordered protein, T4 lysozyme, and the intrinsically disordered protein (IDP), Aβ40. Both T4 lysozyme and Aβ40 benefit from a plethora of prior structural studies, including experimental smFRET characterization 28,55. For Aβ40, we make use of an existing 30 μs long simulation in the amber99sb forcefield, which was found to match NMR order parameters reasonably well 63. For lysozyme, we performed 5 independent 5 μs long all-atom molecular dynamics simulations in explicit tip3p solvent and the amber03 force field as described in the methods section. For both Aβ40 and lysozyme, we clustered our datasets, made MSMs, modeled on the appropriate dye pairs to match the experimental setup, and investigated the predicted FRET efficiencies using our time-averaging approach.

As expected, we found that accounting for time-averaging is important for both systems (Figure 2). We first calculate the lysozyme FRET efficiency for one of the experimental FRET probe distances, residues 44–150, using Alexa 488 maleimide and Alexa 647 hydroxylamine dyes. We find strong agreement between our time-averaging approach and the experimental results for lysozyme (Figure 2A). We note that both time-averaging and instantaneous FRET are in reasonable agreement with experimental data for this probe position, though both miss a population at high FRET efficiency. We next calculate the FRET efficiency for Aβ40 using Alexa 488 hydroxylamine and Alexa 647 maleimide attached to positions 1 and 40. We find that the results significantly improve upon the distribution obtained without time-averaging. However, the distribution is shifted overall towards higher FRET efficiencies, suggesting either insufficient sampling or force fields issues (Figure 2B). Overall, these findings demonstrate that accounting for time-averaging is helpful when there are conformations in fast exchange and is equivalent to other approaches when such exchange is absent.

Figure 2: smFRET time averaging impacts proteins across the ordered.

Figure 2:

A) FRET efficiencies for T4 Lysozyme labeled at 44 (para-acetylphenylalanine ) and 150 (cysteine) with Alexa 488 and Alexa 647 or B) Aβ40 labeled at positions 1 (para-acetylphenylalanine) and 40 (cysteine) with Alexa 488 and Alexa 647. In black is the experimental distribution, red the result when not accounting for protein dynamics, and purple accounting for protein dynamics via time-averaging. Protein structures are the 15 most probable states in the MSM with labeling positions indicated in orange spheres. Experimental donor only counts (E < 0.25) have been removed for ease of comparison.

Directing sampling based on discrepancies between predicted and observed FRET reveals a novel conformation of lysozyme.

Given the strong agreement between our predicted energy transfer distributions and experiments for folded and partially disordered systems, we reasoned that remaining discrepancies may point to under sampled regions of conformational space in simulations. Indeed, the prior study on lysozyme concluded that the minor population could not be explained by any structure of lysozyme existing in the protein data bank 55. If this is true, then we should be able to improve the agreement between simulations and experiments by driving simulations to sample structures with FRET values that are not observed often enough compared to experiments.

To explore this possibility, we sought to provide a structural model for a minor population that was previously observed in an extensive smFRET study of lysozyme. That study presented smFRET measurements for 33 pairs of dye positions. For 17 of these dye pairs, the authors observe a minor state in the FRET efficiency that they could not explain based on any of the numerous published crystal structures of this protein. When probing residues 44–150, this minor population has high FRET efficiency, a metric which would require the dyes to come closer together than is conceivable based on a clamshell motion of the two lobes of lysozyme. Furthermore, we would not expect our simulations (aggregate simulation time of 25 μs) to reach this minor state since the experiments suggest that it is accessed with a rate of ~4 ms−1. Indeed, we find that our simulations stay near the starting structure and that our predicted energy transfer distributions agreed well with the major population seen experimentally but missed the minor population seen for constructs like lysozyme44–150 (Figure 3A).

Figure 3: Discrepancies between smFRET and time averaging results enable discovery of a novel lysozyme fold.

Figure 3:

A) FRET efficiencies and (C) observed inter-dye distances for T4 Lysozyme labeled at position 44 and 150 with Alexa 488 and Alexa 647. Experimental traces are in black, calculations resulting from an MSM that only included crystal-like states in purple, and calculations resulting from an MSM including the alternate state in orange. Experimental donor only counts (E < 0.25) have been removed for ease of comparison. B) Example conformations of the crystal-like state of lysozyme (blue), or the alternate state (red). Residues 44 and 150 labeled for clarity. D) Qualitative comparison of smFRET distributions from experiment and simulation results including the alternate pose of lysozyme. E) Wasserstein distance between experimental and limited sampling dataset (purple) or extended sampling dataset (orange) for all labeled pairs.

To provide a structural explanation for the minor population seen experimentally, we employed a combination of metadynamics and MSMs. First, we used metadynamics simulations to find structures that are consistent with the high FRET of the minor population. In metadynamics, one adds an external biasing force to drive dynamics along a pre-selected collective variable. In this case, we pushed the system along the distance between residues 44 and 150 to see if we could find structures where they come close together, as this would result in a high FRET efficiency. These simulations revealed that the β domain can undergo minor unfolding which enables a swiveling motion to bring residue 44 much closer to residue 150 of the α domain (Figure 3B). To test if this alternative structural state is metastable, we selected four conformations where residues 44 and 150 are near one another and ran >500 ns conventional molecular dynamics simulations of each of them. All the simulations stayed near the starting point, confirming that the alternative structure state we discovered in metadynamics is a metastable free energy state. To determine the relative probability of this alternative state and those observed in our original simulations, we sought to build an MSM that captured transitions between the crystal-like states and the new alternative state. We ran goal-oriented adaptive sampling simulations using the Fluctuation Amplification of Specific Traits (FAST) algorithm68 to promote transitions from the crystallographic state to the alternative state and vice versa. FAST works by iteratively running a batch of simulations, building an MSM, and choosing states from the MSM as starting points for new simulations in a manner that balances between exploring further around states with a specified geometric property (called exploitation) and broad exploration of conformational space. In this case, we started one batch of FAST simulations from the crystallographic structure and set the exploitation term to favor states with a low RMSD to the novel fold we discovered and a second with the targets reversed. Both sets of simulations captured transitions between the two folds, providing a basis for building an MSM that captures the relative probabilities of both folds of lysozyme.

After building a new MSM that incorporates this data, we find that the computationally calculated energy transfer distribution now includes a minor state in agreement with experiments with multiple dye positions (Figure 3). Specifically, including the novel fold of lysozyme greatly improves agreement between our time-averaged results and the experimental traces for labeling pair 44–150 (Figure 3A,B). As a further test of our model, we then calculated energy transfer distributions for the remaining 32 FRET probe positions. Agreement between our model and these experiments would be strong support for our model, given that none of these experiments influenced our simulation strategy. In support of the alternative fold we predicted, we see the addition of minor peaks to 11 of the 17 FRET probe positions that were sensitive to this minor population, and only one additional peak in probe positions not reporting on the minor population (Figure 3D, S3). Inclusion of the alternate state greatly reduces the Wasserstein distance between the experimental and predicted histograms for all states except 2: 8–69 and 44–132.

Results for IDPs are more sensitive to force field choice than globular proteins

We hypothesized that accounting for time-averaging in our Aβ40 prediction was insufficient to give strong agreement with experiment because the force field preferred overly compact states of Aβ40. Historically, force fields which govern the underlying physics of the simulation, have yielded large differences between simulations and the corresponding experimental data for IDPs 63,69,70. While amber99sb notably performs better with IDPs than others, force fields that were parameterized for folded proteins tend to lead to over compaction of IDPs, largely due to an imbalance between protein-protein and protein-water interactions. This systematic compaction of Aβ40 would skew the observed FRET values towards higher efficiencies, exactly as we have observed (Figure 2B).

To test whether discrepancies in experimental agreement are due to force field errors, we predicted Aβ40 smFRET using simulations conducted with a suite of nine different force fields/water models. Seven of these datasets were taken from a previous study that examined how well 30 μs simulations with each force field recapitulated NMR measurements. The seven force field/water combinations are: amber99SB*-ILDN with TIP3P, C22* with TIP3P-CHARMM, C36m with TIP3P-CHARMM, a03ws with TIP4P, a99SB with TIP4P-Ew with Head-Gordon vdW and dihedral modifications (a99sb-ucb), a99SB-ILDN with TIP4P-D, and a99SB-disp with a modified TIP4P water. In addition to these datasets, we also ran our own simulations using amber0371 and a99sb-ws, both with TIP3P water. For each of our simulations, we ran 250 ns long simulations in triplicate starting from the 10 most distinct Aβ40 structures captured in the previous simulations.

We find that force field choice substantially affects the quality of smFRET prediction for IDPs and that there is still room for improvement. For each of the above force field-water combinations, we generated MSMs and calculated the expected smFRET. All force field-water combinations result in Aβ40 distributions that are more collapsed than the experimental distribution (Figure 4A). We note that in our datasets, we explicitly started simulations from expanded states of Aβ40. However, these states exhibit a rapid compaction event which is not reversed during the simulation, consistent with previous findings that most force fields are biased towards more compact IDP structures than are experimentally observed 42,69,72. Of the force field-water combinations, a99SB-ILDN-TIP4PD and a99sb-ucb showed the strongest agreement with experiment.

Figure 4: Force field choice has a significant impact on the level of agreement between simulations and experiments on IDPs even when accounting for time averaging.

Figure 4:

A) FRET efficiencies for Aβ40 labeled at positions 1 and 40 with Alexa 488 and Alexa 647 in different force fields. In each trace, the black line represents the experimental smFRET result, individual force field and water combinations indicated in the legend below the column. B) FRET efficiency distributions for T4-lysozyme residues 44 and 150 in amber03 with TIP3P water (purple) or charmm36m with TIP3P water (orange) or the experimental result (black). Experimental donor only counts (E < 0.25) have been removed for ease of comparison.

While there is large variation in force field performance for IDPs, we find less variation between force fields for lysozyme. We performed another 5 independent, 5μs long replicate simulations of lysozyme in tip3p water using charmm36m as a force field. As with our amber03 simulations, we fail to uncover the third, minor population, of lysozyme (Figure S4). Using the previously discovered minor state of lysozyme as both a starting and target structure, we again performed goal-oriented sampling to promote transitions between the crystal-like poses of lysozyme and the minor state in the charmm36m force field. Both sets of simulations again capture transitions between the two states, enabling us to construct an MSM. Though there are slight differences in energy transfer efficiency between our datasets constructed using amber03 and charmm36m, both produce good agreement between the experimental energy transfer distributions and those predicted from our MSMs (Figure 4B).

Discussion/Conclusion

Here, we have explored how conformational averaging during smFRET measurements impacts the observed distribution of FRET efficiencies. Our results show that accounting for time-averaging across protein and dye dynamics improves the agreement between simulations and experiments for three proteins across the ordered spectrum (Figure 1, 2). These results agree with an existing experimental understanding that single molecule energy transfer distributions report an average of all protein motion during the measurement window. Our work adds to these prior experimental efforts by both identifying which states are being averaged together, while also providing an atomistic view of the protein conformations. While prior computational efforts have often focused on accounting for protein dynamics, improving protein and dye force field accuracy, or accounting for dye dynamics in FRET predictions, often these efforts either require additional simulations for every labeled dye position, or are unable to account for the effect of dye dynamics without a priori knowledge of the dynamical nature of the dyes. Our approach is unique in that it leverages MSMs to account for both protein and dye dynamics without the need for additional, computationally expensive, simulations. This removes additional modelling choices, such as choosing a Förster radius or timescale to average dye motions over, from the calculation.

Historically it has been difficult to determine why simulations and experiments have failed to agree. While simulations can have systematic errors due to parameterization or incomplete conformational sampling, experimental limitations and artifacts may also lead to disagreements. Here, we highlight examples where predicted and experimentally obtained energy transfer efficiency measurements appeared to disagree until we properly accounted for details of the experiment like time-averaging. Accounting for these experimental details in our modeling approach did not provide a structural rationale for a previously observed minor population of T4 lysozyme. However, we were able to use this persistent discrepancy to guide additional simulations to find structures that are consistent with the energy transfer in the minor populations. This approach allowed us to propose a structural model for the previously unexplained minor population that is consistent with most of the experimental measurements (Figure 3). We also find that simulations of Aβ40 with force field and water combinations parameterized for folded proteins result in an overly compact ensemble compared to the experimentally determined ensemble (Figure 4). As expected, based on previous publications, force fields designed to improve performance on disordered proteins performed better in our tests. While there were still differences between amber03 and charmm36m force fields for our lysozyme simulations, the choice of force field was less impactful for lysozyme than Aβ40.

We expect our approach will enable combining simulations and experiments to understand the link between sequence, structure, and function in many settings. While smFRET experiments are extremely valuable, one cannot readily derive atomistic models of conformational distributions from this data alone. The approach we have outlined here enables robust calculation of energy transfer distributions from protein ensembles, providing a direct link between energy transfer distributions and atomic models. While our approach led to strong agreement between simulations and experiments, there are some exceptions where our datasets diverge. One explanation could be that we do not consider alternative mechanisms of donor energy emission – such as quenching via nearby residues such as tryptophan and tyrosine. Another explanation could be that the mutagenesis required for dye attachment during experiments, as well as the attachment of the dye itself, disrupt the conformational landscape of the protein in question. Indeed, in many of our apo-simulation models we observe states that are incompatible with dye labeling, such as when amino acid to-be-labeled becomes buried or interacts closely with other regions of the protein. Nonetheless, our method is implemented post-simulation, modeling additional dye positions is rapid and requires minimal additional computational cost. Accordingly, once one has a satisfactory simulation dataset, it is facile to use these tools to design novel probe pairs which report on identified motions of interest.

Brief Methods

Molecular dynamics simulations

All simulations were performed in explicit solvent at 300K.

Simulations of Apolipoprotein E4 were generated using OpenMM8.073 and amber0371 with TIP3P water74 and a timestep of 4 fs. The dataset was generated using a diverse composition of starting structures of ApoE and totals 3.61 ms. Clustering was performed based on the pairwise distances between 15 selected residue pairs and the MSM was created using row normalization and a 2 ns lagtime.

Simulations of Aβ40 in the following force fields were obtained from prior work: a99SB*-ILDN with TIP3P, C22* with TIP3P-CHARMM, C36m with TIP3P-CHARMM, a03ws with TIP4P/2005 interactions, a99SB with TIP4P-Ew with the Head-Gordon vdW and dihedral modifications (a99SB-UCB), a99SB-ILDN with TIP4P-D, and a99SB-disp with a modified TIP4P-D water. For each combination, a total of ~30 μs data were collected using Anton hardware. Simulations of Aβ40 in amber03 + TIP3P and amber99sb-ws + TIP3P were generated in this work using GROMACS and 10 diverse starting structures from the above Aβ40 dataset and running each simulation for 250 ns in triplicate using unique initial velocities for each (aggregate 7.5 μs). Clustering for both simulation datasets was performed using the distance between every 5th residue as a feature, and the MSM was created using row normalization and a 5 ns (Anton datasets) or a 0.2 ns lag time (GROMACS).

Simulations of T4 lysozyme were performed in amber03 with TIP3P water. Initial unbiased simulations were started from PDB structure 5LZM and 5 replicates were performed for 5 μs each using differing initial velocities. Metadynamics simulations were performed using PLUMED and a biasing potential of 0.3 between residues 44 and 150 for a total of 250 ns. Unbiased simulations were started from 4 alternate states uncovered by metadynamics with 5 replicas using differing initial velocities, each for a total length of 1 μs. FAST adaptive sampling was performed from both alternate and crystal-like states to the opposing state to capture the transition pathways in forward and reverse using RMSD as a progress metric. MSMs were built based on cluster centers from initial unbiased simulations, or the entire dataset excluding the metadynamics runs (total 94.8 μs). Clustering was performed using backbone RMSD to a radius of 2.5Å and the MSM was built using row normalization and a lag time of 2 ns.

Simulations of dyes were performed in amber03ws using the modified amber dye parameters28,39 with TIP3P water and a timestep of 2 fs. A single run was performed for each dye for 500ns. Simulation frames were saved every 20 fs. A 5000 state MSM with a lag time of 2 ps was built for each dye using RMSD of heavy atoms as a clustering metric.

Simulation of smFRET

Dye color determination was achieved by building a MSM for each dye of interest (see simulations, above). Briefly, the dye MSM is modeled onto each state in the protein MSM, removing any positions from the dye MSM that clash with the protein. Next, a random dye position is chosen for both the donor and acceptor dye. Probabilities of radiative decay, energy transfer, non-radiative decay, or remaining excited are calculated and a random outcome is chosen accordingly, similar to prior work 40,41. If the dye remains excited, dye positions are allowed to update along with transfer probabilities until the donor dye is no longer excited. In the case of point clouds, a conformational ensemble of dyes 26 was modeled onto the protein and all steric clashes were discarded. Photon colors were determined by choosing a random distance between the donor and acceptor dye emission centers and the Förster relationship (Equation 1).

E=R06/R06+r6 Equation 1

To determine which protein states to average, we recolor an experimental photon trace from Apolipoprotein E4 54. We choose a random state from our protein MSM and build a synthetic trajectory to match the length of the experimental photon burst. We apply a time correction factor of 10,000 to slow the simulation timescale to match the experimental timescale. Each time an experimental photon is observed, we select the corresponding state in our trajectory and evaluate the photon identity as above. We determine the overall energy transfer efficiency as the ratio of acceptor photons to the total observed photons and repeat this process for all bursts (~14,000), yielding the displayed distributions. The code for these calculations is available on github (https://github.com/bowman-lab/enspara).

Experimental smFRET data

Data for Apolipoprotein E4 was obtained from Stuchell-Brereton et al54. Data for T4 lysozyme was obtained from Sanabria et al55. Data for Aβ40 was obtained from Meng et al28.

Analysis/Software

Simulations generated during this manuscript were performed in GROMACS202075 or OpenMM8.073 as noted. Adaptive sampling was performed using FAST68, and metadynamics76 simulations were performed using PLUMED77 and GROMACS2020. Structure imaging was performed in PyMOL. Trajectory analysis was performed using MDtraj78. Clustering and MSMs were created using ENSPARA79. All graphs were generated using Matplotlib80.

Supplementary Material

Supplement 1

Acknowledgements

We thank the citizen scientists who contributed computational time and technical expertise for the simulation of ApoE via Folding@home. This work was supported by the National Institutes of Health through U19AG069701 (project 1: A.S, core B: G.R.B.), RF1AG067194 (G.R.B. and A.S.), NIGMS R35GM152085 (G.R.B.), and NSF MCB 2218156 (G.R.B.). J.J.M was funded by the NIH training grant T32AG05851804.

Footnotes

Competing interests

The authors declare no competing interests.

References

  • 1.Knoverek C. R. et al. Opening of a cryptic pocket in β-lactamase increases penicillinase activity. Proc. Natl. Acad. Sci. 118, e2106473118 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Cruz M. A. et al. A cryptic pocket in Ebola VP35 allosterically controls RNA binding. Nat. Commun. 13, 2269 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Xie T., Saleh T., Rossi P. & Kalodimos C. G. Conformational states dynamically populated by a kinase determine its function. Science 370, eabc2754 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Fraser J. S. et al. Hidden alternative structures of proline isomerase essential for catalysis. Nature 462, 669–673 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Knoverek C. R., Amarasinghe G. K. & Bowman G. R. Advanced methods for accessing protein shape-shifting present new therapeutic opportunities. Trends Biochem. Sci. 44, 351–364 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Chen S. et al. The dynamic conformational landscape of the protein methyltransferase SETD8. eLife 8, e45403 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Miller M. D. & Phillips G. N. Moving beyond static snapshots: Protein dynamics and the Protein Data Bank. J. Biol. Chem. 296, (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Fraser J. S. & Murcko M. A. Structure is beauty, but not always truth. Cell 187, 517–520 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Smith L. G., Novak B., Osato M., Mobley D. L. & Bowman G. R. PopShift: A Thermodynamically Sound Approach to Estimate Binding Free Energies by Accounting for Ligand-Induced Population Shifts from a Ligand-Free Markov State Model. J. Chem. Theory Comput. 20, 1036–1050 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Jones D. et al. Improved Protein–Ligand Binding Affinity Prediction with Structure-Based Deep Fusion Inference. J. Chem. Inf. Model. 61, 1583–1592 (2021). [DOI] [PubMed] [Google Scholar]
  • 11.Nam K. & Wolf-Watz M. Protein dynamics: The future is bright and complicated! Struct. Dyn. 10, 014301 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Boehr D. D., McElheny D., Dyson H. J. & Wright P. E. The Dynamic Energy Landscape of Dihydrofolate Reductase Catalysis. Science 313, 1638–1642 (2006). [DOI] [PubMed] [Google Scholar]
  • 13.Sekhar A. & Kay L. E. An NMR View of Protein Dynamics in Health and Disease. Annu. Rev. Biophys. 48, 297–319 (2019). [DOI] [PubMed] [Google Scholar]
  • 14.Agarwal P. K., Doucet N., Chennubhotla C., Ramanathan A. & Narayanan C. Conformational Sub-states and Populations in Enzyme Catalysis. Methods Enzymol. 578, 273–297 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Roy R., Hohng S. & Ha T. A practical guide to single-molecule FRET. Nat. Methods 5, 507–516 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Ha T. et al. Probing the interaction between two single molecules: fluorescence resonance energy transfer between a single donor and a single acceptor. Proc. Natl. Acad. Sci. 93, 6264–6268 (1996). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Schuler B., Soranno A., Hofmann H. & Nettels D. Single-Molecule FRET Spectroscopy and the Polymer Physics of Unfolded and Intrinsically Disordered Proteins. Annu. Rev. Biophys. 45, 207–231 (2016). [DOI] [PubMed] [Google Scholar]
  • 18.Förster T. Zwischenmolekulare Energiewanderung und Fluoreszenz. Ann. Phys 437, 55–75 (1948). [Google Scholar]
  • 19.Stryer L. Fluorescence Energy Transfer as a Spectroscopic Ruler. Annu. Rev. Biochem. 47, 819–846 (1978). [DOI] [PubMed] [Google Scholar]
  • 20.Agam G. et al. Reliability and accuracy of single-molecule FRET studies for characterization of structural dynamics and distances in proteins. Nat. Methods 20, 523–535 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Mazal H. & Haran G. Single-molecule FRET methods to study the dynamics of proteins at work. Curr. Opin. Biomed. Eng. 12, 8–17 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Lerner E. et al. FRET-based dynamic structural biology: Challenges, perspectives and an appeal for open-science practices. eLife 10, e60416 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Lerner E. et al. Toward dynamic structural biology: Two decades of single-molecule Förster resonance energy transfer. Science 359, eaan1133 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Lee J. et al. Single-molecule Four-color FRET. Angew. Chem. Int. Ed Engl. 49, 9922–9925 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Alston J. J., Soranno A. & Holehouse A. S. Integrating single-molecule spectroscopy and simulations for the study of intrinsically disordered proteins. Methods San Diego Calif 193, 116–135 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Montepietra D. et al. FRETpredict: a Python package for FRET efficiency predictions using rotamer libraries. Commun. Biol. 7, 1–10 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Zheng W. et al. Inferring properties of disordered chains from FRET transfer efficiencies. J. Chem. Phys. 148, (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Meng F. et al. Highly Disordered Amyloid-β Monomer Probed by Single-Molecule FRET and MD Simulation. Biophys. J. 114, 870–884 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Peran I. et al. Unfolded states under folding conditions accommodate sequence-specific conformational preferences with random coil-like dimensions. Proc. Natl. Acad. Sci. U. S. A. 116, 12301–12310 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Bottaro S., Bengtsen T. & Lindorff-Larsen K. Integrating Molecular Simulation and Experimental Data: A Bayesian/Maximum Entropy Reweighting Approach. in Structural Bioinformatics: Methods and Protocols (ed. Gáspári Z.) 219–240 (Springer US, New York, NY, 2020). doi: 10.1007/978-1-0716-0270-6_15. [DOI] [PubMed] [Google Scholar]
  • 31.Köfinger J., Różycki B. & Hummer G. Inferring Structural Ensembles of Flexible and Dynamic Macromolecules Using Bayesian, Maximum Entropy, and Minimal-Ensemble Refinement Methods. in Biomolecular Simulations: Methods and Protocols (eds. Bonomi M. & Camilloni C.) 341–352 (Springer, New York, NY, 2019). doi: 10.1007/978-1-4939-9608-7_14. [DOI] [PubMed] [Google Scholar]
  • 32.Woźniak A. K., Schröder G. F., Grubmüller H., Seidel C. A. M. & Oesterhelt F. Single-molecule FRET measures bends and kinks in DNA. Proc. Natl. Acad. Sci. U. S. A. 105, 18337–18342 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Lehmann K. et al. Dynamics of the nucleosomal histone H3 N-terminal tail revealed by high precision single-molecule FRET. Nucleic Acids Res. 48, 1551–1571 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Kalinin S. et al. A toolkit and benchmark study for FRET-restrained high-precision structural modeling. Nat. Methods 9, 1218–1225 (2012). [DOI] [PubMed] [Google Scholar]
  • 35.Walczewska-Szewc K. & Corry B. Accounting for dye diffusion and orientation when relating FRET measurements to distances: three simple computational methods. Phys. Chem. Chem. Phys. 16, 12317–12326 (2014). [DOI] [PubMed] [Google Scholar]
  • 36.Sindbert S. et al. Accurate Distance Determination of Nucleic Acids via Förster Resonance Energy Transfer: Implications of Dye Linker Length and Rigidity. J. Am. Chem. Soc. 133, 2463–2480 (2011). [DOI] [PubMed] [Google Scholar]
  • 37.Merchant K. A., Best R. B., Louis J. M., Gopich I. V. & Eaton W. A. Characterizing the unfolded states of proteins using single-molecule FRET spectroscopy and molecular simulations. Proc. Natl. Acad. Sci. 104, 1528–1533 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Dimura M. et al. Quantitative FRET studies and integrative modeling unravel the structure and dynamics of biomolecular systems. Curr. Opin. Struct. Biol. 40, 163–185 (2016). [DOI] [PubMed] [Google Scholar]
  • 39.Graen T., Hoefling M. & Grubmüller H. AMBER-DYES: Characterization of Charge Fluctuations and Force Field Parameterization of Fluorescent Dyes for Molecular Dynamics Simulations. J. Chem. Theory Comput. 10, 5505–5512 (2014). [DOI] [PubMed] [Google Scholar]
  • 40.Speelman A. L. et al. Using Molecular Dynamics and Quantum Mechanics Calculations To Model Fluorescence Observables. J. Phys. Chem. A 115, 3997–4008 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Hoefling M. et al. Structural Heterogeneity and Quantitative FRET Efficiency Distributions of Polyprolines through a Hybrid Atomistic Simulation and Monte Carlo Approach. PLOS ONE 6, e19791 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Rauscher S. et al. Structural Ensembles of Intrinsically Disordered Proteins Depend Strongly on Force Field: A Comparison to Experiment. J. Chem. Theory Comput. 11, 5513–5524 (2015). [DOI] [PubMed] [Google Scholar]
  • 43.Tomov T. E. et al. Disentangling Subpopulations in Single-Molecule FRET and ALEX Experiments with Photon Distribution Analysis. Biophys. J. 102, 1163–1173 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Barth A. et al. Unraveling multi-state molecular dynamics in single-molecule FRET experiments. I. Theory of FRET-lines. J. Chem. Phys. 156, 141501 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Opanasyuk O. et al. Unraveling multi-state molecular dynamics in single-molecule FRET experiments. II. Quantitative analysis of multi-state kinetic networks. J. Chem. Phys. 157, 031501 (2022). [DOI] [PubMed] [Google Scholar]
  • 46.Pirchi M. et al. Photon-by-Photon Hidden Markov Model Analysis for Microsecond Single-Molecule FRET Kinetics. J. Phys. Chem. B 120, 13065–13075 (2016). [DOI] [PubMed] [Google Scholar]
  • 47.Gopich I. V. & Szabo A. Decoding the Pattern of Photon Colors in Single-Molecule FRET. J. Phys. Chem. B 113, 10965–10973 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Chung H. S. & Gopich I. V. Fast single-molecule FRET spectroscopy: theory and experiment. Phys. Chem. Chem. Phys. PCCP 16, 18644–18657 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Torella J. P., Holden S. J., Santoso Y., Hohlbein J. & Kapanidis A. N. Identifying Molecular Dynamics in Single-Molecule FRET Experiments with Burst Variance Analysis. Biophys. J. 100, 1568–1577 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Chung H. S., Cellmer T., Louis J. M. & Eaton W. A. Measuring ultrafast protein folding rates from photon-by-photon analysis of single molecule fluorescence trajectories. Chem. Phys. 422, 229–237 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Harris P. D. et al. Multi-parameter photon-by-photon hidden Markov modeling. Nat. Commun. 13, 1000 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Laurence T. A., Kong X., Jäger M. & Weiss S. Probing structural heterogeneities and fluctuations of nucleic acids and denatured proteins. Proc. Natl. Acad. Sci. 102, 17348–17353 (2005). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Nettels D., Hoffmann A. & Schuler B. Unfolded Protein and Peptide Dynamics Investigated with Single-Molecule FRET and Correlation Spectroscopy from Picoseconds to Seconds. J. Phys. Chem. B 112, 6137–6146 (2008). [DOI] [PubMed] [Google Scholar]
  • 54.Stuchell-Brereton M. D. et al. Apolipoprotein E4 has extensive conformational heterogeneity in lipid-free and lipid-bound forms. Proc. Natl. Acad. Sci. 120, e2215371120 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Sanabria H. et al. Resolving dynamics and function of transient states in single enzyme molecules. Nat. Commun. 11, 1231 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Müller-Späth S. et al. Charge interactions can dominate the dimensions of intrinsically disordered proteins. Proc. Natl. Acad. Sci. 107, 14609–14614 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Acuna G. P. et al. Fluorescence Enhancement at Docking Sites of DNA-Directed Self-Assembled Nanoantennas. Science 338, 506–510 (2012). [DOI] [PubMed] [Google Scholar]
  • 58.Farooq S. & Hohlbein J. Camera-based single-molecule FRET detection with improved time resolution. Phys. Chem. Chem. Phys. 17, 27862–27872 (2015). [DOI] [PubMed] [Google Scholar]
  • 59.Bohlen J. et al. Plasmon-assisted Förster resonance energy transfer at the single-molecule level in the moderate quenching regime. Nanoscale 11, 7674–7681 (2019). [DOI] [PubMed] [Google Scholar]
  • 60.Nüesch M. F. et al. Single-molecule Detection of Ultrafast Biomolecular Dynamics with Nanophotonics. J. Am. Chem. Soc. 144, 52–56 (2022). [DOI] [PubMed] [Google Scholar]
  • 61.Grabenhorst L., Sturzenegger F., Hasler M., Schuler B. & Tinnefeld P. Single-Molecule FRET at 10 MHz Count Rates. J. Am. Chem. Soc. 146, 3539–3544 (2024). [DOI] [PubMed] [Google Scholar]
  • 62.Frieden C., Wang H. & Ho C. M. W. A mechanism for lipid binding to apoE and the role of intrinsically disordered regions coupled to domain–domain interactions. Proc. Natl. Acad. Sci. 114, 6292–6297 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Robustelli P., Piana S. & Shaw D. E. Developing a molecular dynamics force field for both folded and disordered protein states. Proc. Natl. Acad. Sci. 115, E4758–E4766 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Husic B. E. & Pande V. S. Markov State Models: From an Art to a Science. J. Am. Chem. Soc. 140, 2386–2396 (2018). [DOI] [PubMed] [Google Scholar]
  • 65.Konovalov K. A., Unarta I. C., Cao S., Goonetilleke E. C. & Huang X. Markov State Models to Study the Functional Dynamics of Proteins in the Wake of Machine Learning. JACS Au 1, 1330–1341 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Pande V. S., Beauchamp K. & Bowman G. R. Everything you wanted to know about Markov State Models but were afraid to ask. Methods San Diego Calif 52, 99–105 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Suárez E. et al. What Markov State Models Can and Cannot Do: Correlation versus Path-Based Observables in Protein-Folding Models. J. Chem. Theory Comput. 17, 3119–3133 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Zimmerman M. I. & Bowman G. R. FAST Conformational Searches by Balancing Exploration/Exploitation Trade-Offs. J. Chem. Theory Comput. 11, 5747–5757 (2015). [DOI] [PubMed] [Google Scholar]
  • 69.Best R. B., Zheng W. & Mittal J. Balanced Protein–Water Interactions Improve Properties of Disordered Proteins and Non-Specific Protein Association. J. Chem. Theory Comput. 10, 5113–5124 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Lincoff J. et al. Extended experimental inferential structure determination method in determining the structural ensembles of disordered protein states. Commun. Chem. 3, 74 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Duan Y. et al. A point-charge force field for molecular mechanics simulations of proteins based on condensed-phase quantum mechanical calculations. J. Comput. Chem. 24, 1999–2012 (2003). [DOI] [PubMed] [Google Scholar]
  • 72.Best R. B. & Mittal J. Protein simulations with an optimized water model: cooperative helix formation and temperature-induced unfolded state collapse. J. Phys. Chem. B 114, 14916–14923 (2010). [DOI] [PubMed] [Google Scholar]
  • 73.Eastman P. et al. OpenMM 7: Rapid development of high performance algorithms for molecular dynamics. PLOS Comput. Biol. 13, e1005659 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Jorgensen W. L., Chandrasekhar J., Madura J. D., Impey R. W. & Klein M. L. Comparison of simple potential functions for simulating liquid water. J. Chem. Phys. 79, 926–935 (1983). [Google Scholar]
  • 75.Abraham M. J. et al. GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX 1–2, 19–25 (2015). [Google Scholar]
  • 76.Laio A. & Parrinello M. Escaping free-energy minima. Proc. Natl. Acad. Sci. 99, 12562–12566 (2002). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Tribello G. A., Bonomi M., Branduardi D., Camilloni C. & Bussi G. PLUMED 2: New feathers for an old bird. Comput. Phys. Commun. 185, 604–613 (2014). [Google Scholar]
  • 78.McGibbon R. T. et al. MDTraj: A Modern Open Library for the Analysis of Molecular Dynamics Trajectories. Biophys. J. 109, 1528–1532 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Porter J. R., Zimmerman M. I. & Bowman G. R. Enspara: Modeling molecular ensembles with scalable data structures and parallel computing. J. Chem. Phys. 150, 044108 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Matplotlib: A 2D Graphics Environment | IEEE Journals & Magazine | IEEE Xplore. https://ieeexplore.ieee.org/document/4160265. [Google Scholar]
  • 81.Chen J., Li Q. & Wang J. Topology of human apolipoprotein E3 uniquely regulates its diverse biological functions. Proc. Natl. Acad. Sci. U. S. A. 108, 14813 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplement 1

Articles from bioRxiv are provided here courtesy of Cold Spring Harbor Laboratory Preprints

RESOURCES