Abstract
Here, we introduce FRETpredict, a Python software program to predict FRET efficiencies from ensembles of protein conformations. FRETpredict uses an established Rotamer Library Approach to describe the FRET probes covalently bound to the protein. The software efficiently operates on large conformational ensembles such as those generated by molecular dynamics simulations to facilitate the validation or refinement of molecular models and the interpretation of experimental data. We demonstrate the performance and accuracy of the software for different types of systems: a relatively structured peptide (polyproline 11), an intrinsically disordered protein (ACTR), and three folded proteins (HiSiaP, SBD2, and MalE). We also describe a general approach to generate new rotamer libraries for FRET probes of interest. FRETpredict is open source (GPLv3) and is available at github.com/KULL-Centre/FRETpredict and as a Python PyPI package at pypi.org/project/FRETpredict.
Introduction
Single-molecule Förster Resonance Energy Transfer (smFRET) is a well-established technique to measure distances and dynamics between two fluorophores [1, 2]. smFRET has been broadly used to study protein and nucleic acid conformational states and dynamics [3, 4], binding events [5, 6], and intramolecular transitions [7, 8]. The high spatial (nm) and temporal (ns) resolutions enable smFRET experiments to uncover individual species in heterogeneous and dynamic biomolecular complexes, as well as transient intermediates [9–13].
In a typical smFRET experiment on proteins, two residues are labeled with a donor and an acceptor FRET probe, respectively. While the FRET probes may sometimes be fluorescent proteins, they are more commonly organic molecules optimized for spectral and photophysical properties. Each such probe consists of a fluorophore and a linker, which can vary in length and is covalently attached to the protein [14]. For FRET to occur, the donor and acceptor fluorophores must have respective emission and absorption spectra that partially overlap, and the efficiency of the energy transfer depends on the proximity and relative orientation of the fluorophores.
Computational advancements, combined with enhanced sampling methods and approaches to coarse-grain, have enabled Molecular Dynamics (MD) simulations of biomolecules to explore time scales up to the millisecond or beyond [15–17]. Concomitantly, the molecular-level insights into protein structural dynamics provided by MD simulations are routinely employed to aid the interpretation of a multitude of experimental approaches, including smFRET [13, 18]. Irrespective of whether the underlying protein structure is static or dynamic, the conformational ensembles of the fluorescent probes must be taken into account to accurately predict FRET efficiencies from MD simulations [19].
To model the conformational space of dyes attached to a protein, several methods have been developed [20]. At the low end of the spectrum of computational cost, the Available Volume (AV) method uses a coarse-grained description of the probe for predicting the geometric volume encompassing the conformational ensemble of the probe [21, 22]. At the high end, MD simulations can be performed with explicit FRET probes [18, 23, 24]. Although, this approach provides unique insight into the motion of and interactions between protein and FRET probes, it must be often preceded by the parameterization of force field for the fluorescent dyes [23]. Furthermore, comparison with studies which integrate results from multiple pairs of probe positions require running independent MD simulations for each probe pair. Somewhere in the middle of the scale of computational cost and resolution is the Rotamer Library Approach (RLA), where multiple rotamer conformations of the FRET probe are placed at the labeled site of a protein conformation, and the statistical weight of each conformer is estimated using a simplified potential [25]. Polyhach et al. [25] introduced the RLA in the context of electron paramagnetic resonance [26]. The RLA may, however, also be employed to predict FRET [27, 28], in addition to double electron-electron resonance (DEER) and paramagnetic relaxation enhancement (PRE) nuclear magnetic resonance data [25, 26, 29, 30].
In this work we introduce FRETpredict, a Python module based on the RLA for calculating FRET efficiency based on protein conformational ensembles and MD trajectories. We describe a general methodology to generate rotamer libraries for FRET probes and present case studies for both intrinsically disordered and folded proteins (ACTR, Polyproline 11, HiSiaP, SBD2, and MalE).
Design and Implementation
FRETpredict is written in Python and is available as a Python package. The FRETpredict class carries out the FRET efficiency predictions. The class is initialized with (i) a protein structure or trajectory (provided as MDAnalysis Universe objects [31]), (ii) the residue indices to which the fluorescent probes are attached, and (iii) the rotamer libraries for the fluorophores and linkers to be used in the calculation. The lib/libraries.yml file lists all the available Rotamer Libraries, along with necessary fluorophore information, including atom indices for calculating transition dipole moments and distances between fluorophores. As shown in the Results section, the calculations are triggered by the run function.
The main requirements are Python 3.6–3.8 and MDAnalysis 2.0 [31]. FRETpredict can be installed through the package manager PIP by executing
1 pip install FRETpredict
Tests reproducing FRET data for a Hsp90 system can be run locally using the test running tool pytest.
Rotamer library Generation
Each FRET probe consists of two parts: the fluorescent dye, responsible for the FRET, and the linker, which comprises (i) a spacer, to distance the dye from the protein and (ii) a moiety to attach the probe covalently to the protein. For example, many of the most widely used probes can be purchased with maleimide (to link to Cys), N-hydroxysuccinimide (to link to Lys), or azide (for click chemistry) functional groups.
In FRETpredict, rotamer libraries are created through the following steps: (i) generation of the conformational ensemble of the FRET probe using MD simulations, (ii) selection of the peaks of the dihedral distributions of the linker, (iii) two clustering steps, and (iv) filtering. These steps are detailed in S1 Text and implemented in FRETpredict/rotamer libraries.py. In this work, we created rotamer libraries for AlexaFluor, ATTO, and Lumiprobe dyes with different linkers, using the force fields developed by Graen et al. [32]. This selection of rotamer libraries of widely used FRET probes are made available as a part of the FRETpredict package. Moreover, we provide a Jupyter Notebook tutorial (tests/tutorials/Tutorial_generate_new_rotamer_libraries.ipynb) which illustrates how to generate new rotamer libraries for FRETpredict.
FRETpredict algorithm
For each protein structure to be analysed—either individually or as an ensemble—the FRETpredict algorithm places the FRET probes at the selected protein sites independently of each other. Relative orientations and distances between the dyes are then computed for all combinations of the donor and acceptor rotamers. Further, nonbonded interaction energies between each rotamer and the surrounding protein atoms are calculated within a radius of 1.0 nm. Using these energies, statistical weights are first estimated for donor and acceptor rotamers independently and subsequently combined to calculate average FRET efficiencies. S2 Text details the rotamer library placement and weighting steps. In the following, we will detail the calculation of the average FRET efficiency in different averaging regimes.
FRET efficiency calculation.
FRET efficiency is defined as the fraction of donor excitations that result in energy transfer to the acceptor, and can be calculated as , where kET is the instantaneous FRET rate and kD is the spontaneous decay rate of donor excitation by non-FRET mechanisms (e.g. donor emission or non-radiative mechanisms). kET can be calculated as 6, where R0 is the Förster radius, and κ2 is the orientation factor, related to the relative orientation of the dipole moments of the dyes. The Förster radius is defined as
(1) |
where J is the spectral overlap integral between the fluorescence emission of the donor and the absorption spectrum of the acceptor, QD is the quantum yield of the donor in the absence of the acceptor, and n is the refractive index of the medium. Of these parameters, the most challenging to estimate is κ2. While it can be difficult to measure κ2 experimentally due to the rapid isomerization of the linker region of the probes, κ2 is often approximated to its freely diffusing isotropic average of 2/3 by considering that the fluorophore dynamics occur on a timescale that is sufficiently shorter than the donor lifetime. By assuming a fixed donor–acceptor distance, r, and κ2 = 2/3, we obtain
(2) |
For most cases, this approximation is acceptable due to the length of the linker region and rapid fluorophore reorientation. However, the placement of the probes on a protein structure may restrict the motions of the dyes due to interactions with the surrounding protein environment. Because of such potentially restricted fluorophore motions, sometimes κ2 ≠ 2/3. Therefore, a more general formula for calculating FRET efficiency is
(3) |
In this case, it is still assumed that the chromophore is reorienting faster than the donor lifetime, but that its motion is restricted in space. Due to the discrete nature of the RLA, FRETpredict allows precise computation of κ2 and the possibility to compute R0 in a κ2-dependent way. κ2-dependent R0 calculations (Eq. 1) are the default in FRETpredict, but users can also adopt a fixed R0 value by setting fixed_R0=True and specifying the R0 value with the r0 option. R0 values for the most common FRET probes are reported in lib/R0/R0_pairs.csv.
Averaging regimes.
Protein, linker and dye motions may all contribute to FRET and so dynamics on different timescales may be important; here we simplify these as the protein correlation time (τp), the linker-distance correlation time (τl), the orientation correlation time of the dye (τk), and the fluorescence lifetime (τf). Given a conformational ensemble, but no explicit representation of the dynamical motion and timescales, the “average” FRET efficiency depends on how rapidly the various time-dependent components of E (i.e., r and κ2 in Eq. 3) are averaged relative to the fluorescence lifetime. If a specific motion occurs much faster than the fluorescence decay, the effective kET will be completely averaged over that degree of freedom. Assuming that protein fluctuations are slow (i.e., τp >> τf), we obtain three different regimes for the relationship between the experimentally measured efficiency and the underlying donor–acceptor distance distribution. [33]
Static Regime (τk >> τf and τl >> τf). In this scenario, dye distance and orientation fluctuations are both slow, thus, there is no averaging of transfer rate, and every combination of protein configurations, linker distance, and dye orientation gives a separate kET. In this case, the FRET efficiency is averaged over N protein conformations as well as over the m and l rotamers for the donor and the acceptor, respectively,
(4) |
In this regime, is an instantaneous value calculated for a given combination of donor and acceptor rotamers as
(5) |
where and are the transition dipole moment unit vectors of the donor and acceptor, respectively, and denotes the normalized inter-fluorophore displacement. In FRETpredict, the atom pairs defining , , and are specified in lib/libraries.yml.
Dynamic Regime (τk << τf and τl >> τf). The dynamic regime is commonly assumed in the treatment of experimental data, where the complete conformational sampling is achieved within the fluorescence lifetime of the donor. The FRET efficiency is calculated as:
(6) |
Here, is calculated over all the protein conformations and combinations of probe rotamers:
(7) |
Dynamic+ Regime (τk << τf and τl << τf). In this regime, both dye distances and orientations are very fast, and the kET for each protein frame is averaged over all dye configurations, considering both distances and orientations. The FRET efficiency is calculated as
(8) |
where
(9) |
Results
Rotamer libraries
To illustrate the extent to which the conformational ensemble of the probes is reduced upon the generation of the rotamer libraries, we plotted the projection on the xy-plane of the distance vectors between the Cα atom and the central atom of the fluorophore (Fig 2 and S6, S7, and S8 Figs) of all the generated rotamer libraries (S3, S4, and S5 Figs). Compared to the unfiltered rotamer libraries (S6 Fig), the distribution of fluorophore positions for the large rotamer libraries (cutoff = 10) are less isotropic and homogeneous, as evidenced by the deviation of the scatter plot from a circular shape. Unsurprisingly, the anisotropicity is increasingly more pronounced for the medium and small rotamer libraries which were obtained by filtering out cluster of less than 20 and 30 conformers, respectively (S7 and S8 Figs).
The rotamer libraries of some FRET probes show pronounced anisotropy, illustrated by the deviation of the scatter plots from a circular shape (A48 L1R, A53 L1R, A56 L1R, A59 L1R, and A48 B1R). The observed anisotropy can be related to the length of the linker, and hence to its the rotational degrees of freedom. For example, the rotamer library A48 C1R is more isotropic than A48 L1R because L1R is a shorter linker than C1R (S3 Fig). On the other hand, a comparison between A48 L1R and T42 L1R suggests that the more isotropic nature of T42 might be due to the structure of the T42 fluorophore which effectively provides an extension to the linker length (S4 Fig).
The RLA relies on a trade-off between thorough conformational sampling and computational cost, as the latter increases with the increased size of the library (S9 Fig), which ideally should not exceed ∼ 1, 000 rotamers. To provide an idea of the time differences involved in using rotamer libraries with different numbers of rotamers, we report the times required to calculate the FRET efficiencies for Polyproline 11 (S10 Table).
Below we showcase how FRETpredict can be used to calculate FRET efficiencies using different labels, different averaging schemes and different types and sources of protein/peptide conformations. Our goal here is not to discuss the biophysics of the individual systems, but rather to highlight the capabilities of FRETpredict.
Case study 1: Protein Trajectory (pp11)
Polyproline 11 (pp11) has been described as behaving like a rigid rod, and was used as a “spectroscopic ruler” in the seminal paper by Stryer and Haugland [34]; subsequent work showed additional complexity [24, 33, 35, 36]. The pp11 system is thus a classical example of the importance of comparing molecular models with FRET data. Here, we compared FRET efficiency values estimated using FRETpredict with reference values from experiments [33] and from extensive all-atom MD simulations of pp11 with explicit FRET probes [23]. For analyses with the RLA we removed these FRET probes to ensure that the conformational ensembles were comparable, and thus compare the different ways of representing the dyes (explicitly or via RLA). In both experiments and simulations, the terminal residues were labeled with AlexaFluor 488 - C1R (donor) and AlexaFluor 594 - C1R (acceptor), and the R0 value was fixed to 5.4 nm. We used large rotamer libraries to estimate the FRET efficiency of pp11 in the three averaging regimes. Comparison with the reference values (Fig 3 and S11 Table) shows that FRETpredict calculations yield predictions that are comparable to MD simulations with explicit representation of the probes when compared with the experimental values, suggesting that the RLA provides a relatively accurate FRET calculation. In particular, the Dynamic regime best approximates the experimental value. As a convenient approach to calculate FRET efficiencies when there is no information about which averaging regime to use, we also calculate the average, ⟨E⟩, over the estimates of the Static, Dynamic, and Dynamic+ regimes.
FRET efficiencies were calculated from the pp11 trajectory through the following lines of code:
1 from FRETpredict import FRETpredict 2 u = MDAnalysis. Universe (“ pp11. pdb “, “ pp11. xtc “) 3 FRET = FRETpredict ( protein =u, residues =[0, 12], electrostatic =True, 4 donor=“ AlexaFluorப488”, acceptor=“ Alexa Fluorப594”, 5 libname_1 =“ AlexaFluorப488பC1R பcutoff10”, 6 libname_2 =“ AlexaFluorப594பC1Rபcutoff10”) 7 FRET. run ()
Line two generates the MDAnalysis Universe object from an XTC trajectory and a PDB topology. Line three initializes the FRETpredict object with the labeled residue numbers, the FRET probes from the available rotamer libraries, and turns the electrostatic calculations on. Line seven runs the calculations and saves per-frame and ensemble-averaged data to file. R0 was computed for each combination of FRET probes via Eq. 1.
Case study 2: Ensemble of an Intrinsically Disordered Protein (ACTR)
ACTR (activator for thyroid hormone and retinoid receptors) is an intrinsically disordered protein that has previously been extensively studied [38, 39]. Here, we used ACTR to demonstrate how FRETpredict can be used on conformational ensembles for intrinsically disordered proteins.
We used previous experimental FRET measurements and MD simulations for ACTR solutions at different urea concentrations that were used to assess the effect of chemical denaturants on protein structure [37, 40]. As in the experiments, we labeled the residue pairs 3–61, 3–75, and 33–75 with Alexa Fluor 488 - C1R as the donor and Alexa Fluor 594 - C1R as the acceptor. To account for the dependence of R0 on urea concentration, we used Eq. 4 in Zheng et al. [40] and estimated R0 = 5.40 Å, 5.34 Å, and 5.29 Å for [urea] = 0 M, 2.5 M, and 5 M, respectively.
Fig 4 and S12 Table show the FRET efficiency values predicted by FRETpredict at the different urea concentrations (0 M, 2.5 M, and 5 M) using medium rotamer libraries. The absolute values of predicted FRET efficiency differ from the experimental values on average by 13.1, 7.2, and 12.1% for [urea] = 0 M, 2.5 M, and 5 M, respectively. Notably, the prediction trend is consistent with the experimental data for all the pairs of labeled residues of ACTR and at the three urea concentrations. The agreement between calculated and experimental trends for the E values shown in Fig 4 relies on the thorough and accurate sampling of conformational ensembles obtained via MD simulations by Zheng et al. [40] while it also contributes to validating FRETpredict as a model for calculating E.
To determine which regime most accurately predicts the FRET efficiency, we calculated the root-mean-square error (RMSE) between the predicted and experimental values for all the residue pairs. For the ACTR data, RMSE values obtained for the Static, Dynamic, and Dynamic+ regimes and their average are 0.233, 0.177, 0.315, and 0.171, respectively. As observed in Case Study 1, the Dynamic regime and the average best approximate the experimental FRET efficiency data.
The following lines of code were used to calculate the E values from the ACTR trajectory at [urea] = 0 M:
1 from FRETpredict import FRETpredict 2 u_0M = MDAnalysis. Universe (“ actr. gro “, “ actr_urea 0. xtc “) 3 FRET = FRETpredict ( protein =u_0M, residues =[3, 61], 4 fixed_R 0 =True, r0 =5.40, 5 electrostatic =True, 6 libname_1 =“ AlexaFluorப488பC1Rபcutoff20”, 7 libname_2 =“ AlexaFluorப594பC1Rபcutoff20”) 8 FRET. run ()
Line two generates the MDAnalysis Universe object from an XTC trajectory and a GRO topology. Line three initializes the FRETpredict object with the labeled residue numbers, the FRET probes from the available rotamer libraries, and fixes the R0 value corresponding to the specific urea concentration listed above. Line eight runs the calculations and saves per-frame and ensemble-averaged data to file.
Case study 3: Single protein structures (HiSiaP, SBD2, MalE)
Although we generated rotamer libraries for several of the most common FRET probes, in some cases smFRET experiments might be performed with probes that are currently not available in FRETpredict. In this case study, we illustrate how, in the absence of the exact probes, accurate trends can be predicted by (i) choosing rotamer libraries with similar structural characteristics (linker length, linker dihedrals, fluorophore structure) and (ii) entering the R0 for the experimental pair of dyes (S14 Fig). We apply this strategy to the single structures of HiSiaP, SBD2, and MalE and show that it leads to results that are consistent with the experimental trends. The reference FRET efficiency data of this case study was obtained from the experimental study of Peter et al. [41], wherein Alexa Fluor 555 - C2R and Alexa Fluor 647 - C2R dyes were employed as donor and acceptor, respectively. In FRETpredict, both donor and acceptor were replaced by AlexaFluor 647 - C2R, the available rotamer library with the most similar steric hindrance (S3 Fig), whereas we used the R0 value of the FRET pair used in the actual experiments.
HiSiaP
HiSiaP is the periplasmic substrate-binding protein from the sialic acid tripartite ATP-independent periplasmic transporter of Haemophilus influenzae. In this protein, ligand binding induces a conformational rearrangement from an open to a closed state [42].
We calculated E values for the labeled residue pairs measured by Peter et al. [41] (58–134, 55–175, 175–228, and 112–175) using structures deposited in the Protein Data Bank (PDB) for the open and closed structures (PDB codes 2CEY [43] and 3B50 [44], respectively).
The absolute values of the FRET efficiency predicted for HiSiaP differ on average by 20.6 and 24.3% from the experimental values of the open and closed conformation, respectively (Fig 5 A and B, and S13 Table). The trend of the FRETpredict prediction is about equally consistent with the experimental data for both conformations and for all the pairs of labeled residues.
The code used to calculate the FRET efficiency for the single HiSiaP open structure with FRETpredict is:
1 from FRETpredict import FRETpredict 2 u_open = MDAnalysis. Universe (“ 2 cey. pdb “) 3 FRET = FRETpredict ( protein = u_open, residues =[58, 134], temperature =298, 4 fixed_R 0 =True, r0 =5.1, 5 electrostatic =True, 6 libname_1 =“ AlexaFluorப647பC2Rபcutoff10 “, 7 libname_2 =“ AlexaFluorப647பC2Rபcutoff10 “) 8 FRET. run ()
Line two generates the MDAnalysis Universe object for the open structure from a PDB topology. Line three initializes the FRETpredict object with the labeled residue numbers, the FRET probes from the available rotamer libraries, and fixes the R0 value to the experimental one. Line eight runs the calculations and saves per-frame and ensemble-averaged data to file. The same FRETpredict code structure has been used for the other single structure tests of SBD2 and MalE.
SBD2
SBD2 is the second of two substrate-binding domains constituting the glutamine ABC transporter GlnPQ from Lactococcus lactis. As for HiSiaP, upon binding of high-affinity ligands SBD2, undergoes a transition from an open to a closed state [45].
Peter et al. [41] performed FRET efficiency measurements on SBD2 by labeling the residue pairs 319–392 and 369–451. We used the structures for the open and closed structures deposited in the PDB (PDB codes 4KR5 [46] and 4KQP [46], respectively) in combination with AlexaFluor 647 - C2R as both donor and acceptor.
The absolute values of the FRET efficiency predicted for SBD2 differ on average by 21.6 and 21.1% from the experimental values of the open and closed conformation, respectively (Fig 5 C and D, and S13 Table).
MalE
The maltose binding protein from Escherichia coli, MalE, plays an important role in the uptake of maltose and maltodextrins by the maltose transporter complex MalFGK2 [47]. MalE undergoes structural transition between the apo and holo states upon sugar binding, resulting in a ∼35◦ rigid body domain reorientation [48].
Peter et al. [41] performed FRET measurements on MalE by labeling the residue pairs 87–127, 134–186, 36–352, and 29–352. We used open and closed structures (PDB codes 1OMP [49] and 1ANF [50], respectively) with AlexaFluor 647 - C2R as both donor and acceptor.
The absolute values of the FRET efficiency predicted for MalE differ on average by 15.1 and 10.0% from the experimental values of the open and closed conformation, respectively (Fig 5 E and F, and S13 Table).
The RMSE values associated with the averaging regimes over all single-frame structures of HiSiaP, SBD2, and MalE are 0.097 (Static), 0.094 (Dynamic), 0.141 (Dynamic+), and 0.086 (Average). Based on these results, we observe that in the case of single-frame structures, the best predictions correspond to the Average regime.
In this case study, we used probes that are similar but not identical to those used in the experiments. The main physicochemical factors to take into consideration to assess the similarity between probes are the steric bulk of dye, the length and flexibility of the linker, and the presence of charged groups. We already noted that the steric bulk of the FRET probe and the rigidity of the linker have a strong influence on the clustering of the rotamers. Accordingly, these structural features also affect the external weights calculated upon placement of the rotamers at the binding site. On the other hand, we observed that including electrostatic interactions in FRETpredict calculations (electrostatic=true) had little effect on the accuracy of FRET efficiency prediction for the studied systems (S14 Fig). In summary, we found that using the rotamer library for a probe with similar steric hindrance, in combination with the R0 value for the correct dye pair, yields FRET efficiency trends in good agreement with the experimental data (S14 Fig).
Conclusions
We have introduced FRETpredict, an open-source software program with a fast implementation of the RLA for the calculation of FRET efficiency data, along with the rotamer libraries of many of the most commonly used FRET probes. Using three case studies, we have highlighted the capabilities of our implementation in the case of a peptide trajectory (pp11), an IDP trajectory (ACTR), and single protein structures (HiSiaP, SBD2, and MalE). The FRET efficiency prediction trends are in most cases in good agreement with the experimental data; However, we note that the accuracy of the method depends on the quality and relevance of the protein conformational ensembles that are used as input.
In FRETpredict, the average FRET efficiency can be calculated in three different regimes: Static, Dynamic, and Dynamic+. We suggest using the Dynamic regime when making predictions on protein trajectories and the Static regime for single protein structures. In the absence of information about the different timescales, we find that simply averaging the results from the three regimes often leads to good agreement with experiments.
FRETpredict calculations and, more generally, FRET efficiency predictions from protein trajectories involve a trade-off between computation time and prediction accuracy. Accordingly, the choice of the optimal rotamer library selection must take its size into consideration. Large rotamer libraries may lead to greater accuracy but are also more computationally expensive than smaller libraries. On the other hand, both medium and small rotamer libraries are a good compromise between calculation time and accuracy when long simulation trajectories are used. However, using a small number of rotamer clusters may compromise the prediction of FRET efficiency, especially in case of tight placement at the labeled site, in which many rotamers may be excluded from the calculation due to probe-protein steric clashes. Therefore, we recommend using large rotamer libraries when the computational cost is not a limiting factor and medium libraries for larger conformational ensembles.
Availability and Future Directions
The software is available on GitHub at github.com/KULL-Centre/FRETpredict, where it is published and distributed under GPL license, version 3. Tutorials for predicting FRET efficiency with FRETpredict and creating new rotamer libraries were also created and made available on the GitHub repository. FRETpredict is also distributed as a PyPI package (pypi.org/project/FRETpredict). FRETpredict has a general framework and can be readily extended to encompass non-protein biomolecules and additional rotamer libraries of FRET probes. In the current implementation, we consider all combinations of rotamers from the respective donor and acceptor libraries and independently weigh each dye based on protein-dye interaction energies, which are evaluated for the two rotamers independently. The approach could be further developed to randomly sample pairs of rotamers and to account for dye-dye interactions in the calculation of the statistical weights assigned to each pair. Further, the calculation of average FRET efficiencies could be based on the diffusive motion of the FRET probes in a potential of mean force derived from donor–acceptor distance distributions, as recently described [51] and implemented in the MMM software-tool [27].
Supplementary Material
Acknowledgments
M.B.A.K. acknowledges funding from the Lundbeck Foundation (lundbeckfonden.com). K.L.-L. acknowledges funding via a Sapere Aude Starting Grant from the Danish Council for Independent Research (Natur og Univers, Det Frie Forskningsråd, 12-126214, https://dff.dk/) and the Lundbeck Foundation BRAINSTRUC initiative in structural biology (R155-2015-2666, lundbeckfonden.com). We acknowledge the use of resources at the core facility for biocomputing at the Department of Biology.
References
- 1.Förster T. Zwischenmolekulare energiewanderung und fluoreszenz. Annal Phys. 1948;2:55–75. [Google Scholar]
- 2.Főrster T. 10th Spiers Memorial Lecture. Transfer mechanisms of electronic excitation. Discuss Faraday Soc. 1959;27:7–17. doi: 10.1039/DF9592700007. [DOI] [Google Scholar]
- 3.Greenleaf WJ, Woodside MT, Block SM. High-resolution, single-molecule measurements of biomolecular motion. Annu Rev Biophys Biomol Struct. 2007;36(1):171–190. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Schuler B. Perspective: Chain dynamics of unfolded and intrinsically disordered proteins from nanosecond fluorescence correlation spectroscopy combined with single-molecule FRET. J Chem Phys. 2018;149:010910. [DOI] [PubMed] [Google Scholar]
- 5.Coats JE, Lin Y, Rueter E, Maher I L James, Rasnik I. Single-molecule FRET analysis of DNA binding and bending by yeast HMGB protein Nhp6A. Nucleic Acids Research. 2012;41(2):1372–1381. doi: 10.1093/nar/gks1208. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Schuler B, Borgia A, Borgia MB, Heidarsson PO, Holmstrom ED, Nettels D, et al. Binding without folding – the biomolecular function of disordered polyelectrolyte complexes. Current Opinion in Structural Biology. 2020;60:66–76. doi: 10.1016/j.sbi.2019.12.006. [DOI] [PubMed] [Google Scholar]
- 7.Santoso Y, Joyce CM, Potapova O, Reste LL, Hohlbein J, Torella JP, et al. Conformational transitions in DNA polymerase I revealed by single-molecule FRET. Proceedings of the National Academy of Sciences. 2010;107(2):715–720. doi: 10.1073/pnas.0910909107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Gauer JW, LeBlanc S, Hao P, Qiu R, Case BC, Sakato M, et al. Single-molecule FRET to measure conformational dynamics of DNA mismatch repair proteins. Methods Enzymol. 2016;581:285–315. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Ha T, Enderle T, Ogletree DF, Chemla DS, Selvin PR, Weiss S. Probing the interaction between two single molecules: fluorescence resonance energy transfer between a single donor and a single acceptor. Proceedings of the National Academy of Sciences. 1996;93(13):6264–6268. doi: 10.1073/pnas.93.13.6264. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Lee NK, Kapanidis AN, Wang Y, Michalet X, Mukhopadhyay J, Ebright RH, et al. Accurate FRET Measurements within Single Diffusing Biomolecules Using Alternating-Laser Excitation. Biophysical Journal. 2005;88(4):2939–2953. doi: 10.1529/biophysj.104.054114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Schuler B, Soranno A, Hofmann H, Nettels D. Single-molecule FRET spectroscopy and the polymer physics of unfolded and intrinsically disordered proteins. Annu Rev Biophys. 2016;45(1):207–231. [DOI] [PubMed] [Google Scholar]
- 12.Dimura M, Peulen TO, Sanabria H, Rodnin D, Hemmen K, Hanke CA, et al. Automated and optimally FRET-assisted structural modeling. Nature communications. 2020;11(1):1–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Alston JJ, Soranno A, Holehouse AS. Integrating single-molecule spectroscopy and simulations for the study of intrinsically disordered proteins. Methods. 2021;193:116–135. doi: 10.1016/j.ymeth.2021.03.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Hellenkamp B, Schmid S, Doroshenko O, Opanasyuk O, Kühnemuth R, Rezaei Adariani S, et al. Precision and accuracy of single-molecule FRET measurements-a multi-laboratory benchmark study. Nat Methods. 2018;15(9):669–676. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Shaw DE, Maragakis P, Lindorff-Larsen K, Piana S, Dror RO, Eastwood MP, et al. Atomic-level characterization of the structural dynamics of proteins. Science. 2010;330(6002):341–346. [DOI] [PubMed] [Google Scholar]
- 16.Girodat D, Pati AK, Terry DS, Blanchard SC, Sanbonmatsu KY. Quantitative comparison between sub-millisecond time resolution single-molecule FRET measurements and 10-second molecular simulations of a biosensor protein. PLoS Comput Biol. 2020;16(11):e1008293. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Zimmerman MI, Porter JR, Ward MD, Singh S, Vithani N, Meller A, et al. SARS-CoV-2 simulations go exascale to predict dramatic spike opening and cryptic pockets across the proteome. Nature chemistry. 2021;13(7):651–659. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Merchant KA, Best RB, Louis JM, Gopich IV, Eaton WA. Characterizing the unfolded states of proteins using single-molecule FRET spectroscopy and molecular simulations. Proceedings of the National Academy of Sciences. 2007;104(5):1528–1533. doi: 10.1073/pnas.0607097104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Fajer P, Fajer M, Zawrotny M, Yang W. Full atom simulations of spin label conformations. Methods Enzymol. 2015;563:623–642. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Dimura M, Peulen TO, Hanke CA, Prakash A, Gohlke H, Seidel CA. Quantitative FRET studies and integrative modeling unravel the structure and dynamics of biomolecular systems. Current Opinion in Structural Biology. 2016;40:163–185. doi: 10.1016/j.sbi.2016.11.012. [DOI] [PubMed] [Google Scholar]
- 21.Sindbert S, Kalinin S, Nguyen H, Kienzler A, Clima L, Bannwarth W, et al. Accurate Distance Determination of Nucleic Acids via Förster Resonance Energy Transfer: Implications of Dye Linker Length and Rigidity. Journal of the American Chemical Society. 2011;133(8):2463–2480. doi: 10.1021/ja105725e. [DOI] [PubMed] [Google Scholar]
- 22.Kalinin S, Peulen T, Sindbert S, Rothwell PJ, Berger S, Restle T, et al. A toolkit and benchmark study for FRET-restrained high-precision structural modeling. Nature Methods. 2012;9(12):1218–1225. doi: 10.1038/nmeth.2222. [DOI] [PubMed] [Google Scholar]
- 23.Best RB, Hofmann H, Nettels D, Schuler B. Quantitative interpretation of FRET experiments via molecular simulation: force field and validation. Biophys J. 2015;108(11):2721–2731. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Best RB, Merchant KA, Gopich IV, Schuler B, Bax A, Eaton WA. Effect of flexibility and ¡i¿cis¡/i¿ residues in single-molecule FRET studies of polyproline. Proceedings of the National Academy of Sciences. 2007;104(48):18964–18969. doi: 10.1073/pnas.0709567104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Polyhach Y, Bordignon E, Jeschke G. Rotamer libraries of spin labelled cysteines for protein studies. Phys Chem Chem Phys. 2011;13:2356–2366. doi: 10.1039/C0CP01865A. [DOI] [PubMed] [Google Scholar]
- 26.Jeschke G. MMM: A toolbox for integrative structure modeling. Protein Science. 2017;27(1):76–85. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Klose D, Holla A, Gmeiner C, Nettels D, Ritsch I, Bross N, et al. Resolving distance variations by single-molecule FRET and EPR spectroscopy using rotamer libraries. Biophys J. 2021;120(21):4842–4858. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Walczewska-Szewc K, Corry B. Accounting for dye diffusion and orientation when relating FRET measurements to distances: three simple computational methods. Phys Chem Chem Phys. 2014;16:12317–12326. doi: 10.1039/C4CP01222D. [DOI] [PubMed] [Google Scholar]
- 29.Tesei G, Martins JM, Kunze MBA, Wang Y, Crehuet R, Lindorff-Larsen K. DEER-PREdict: Software for efficient calculation of spin-labeling EPR and NMR data from conformational ensembles. PLOS Computational Biology. 2021;17(1):1–18. doi: 10.1371/journal.pcbi.1008551. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Jeschke G, Esteban-Hofer L. Integrative ensemble modeling of proteins and their complexes with distance distribution restraints. Methods in Enzymology. 2022;666:145–169. [DOI] [PubMed] [Google Scholar]
- 31.Gowers Richard J, Linke Max, Barnoud Jonathan, Reddy Tyler J E, Melo Manuel N, Seyler Sean L, et al. MDAnalysis: A Python Package for the Rapid Analysis of Molecular Dynamics Simulations. In: Benthall Sebastian, Rostrup Scott, editors. Proceedings of the 15th Python in Science Conference; 2016. p. 98 – 105. [Google Scholar]
- 32.Graen T, Hoefling M, Grubmüller H. AMBER-DYES: Characterization of Charge Fluctuations and Force Field Parameterization of Fluorescent Dyes for Molecular Dynamics Simulations. Journal of Chemical Theory and Computation. 2014;10(12):5505–5512. doi: 10.1021/ct500869p. [DOI] [PubMed] [Google Scholar]
- 33.Schuler B, Lipman EA, Steinbach PJ, Kumke M, Eaton WA. Polyproline and the “spectroscopic ruler” revisited with single-molecule fluorescence. Proceedings of the National Academy of Sciences. 2005;102(8):2754–2759. doi: 10.1073/pnas.0408164102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Stryer L, Haugland RP. Energy transfer: a spectroscopic ruler. Proc Natl Acad Sci U S A. 1967;58(2):719–726. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Watkins LP, Chang H, Yang H. Quantitative single-molecule conformational distributions: a case study with poly-(L-proline). J Phys Chem A. 2006;110(15):5191–5203. [DOI] [PubMed] [Google Scholar]
- 36.Schuler B, Lipman EA, Eaton WA. Probing the free-energy surface for protein folding with single-molecule fluorescence spectroscopy. Nature. 2002;419(6908):743–747. doi: 10.1038/nature01060. [DOI] [PubMed] [Google Scholar]
- 37.Borgia A, Zheng W, Buholzer K, Borgia MB, Schüler A, Hofmann H, et al. Consistent View of Polypeptide Chain Expansion in Chemical Denaturants from Multiple Experimental Methods. Journal of the American Chemical Society. 2016;138(36):11714–11726. doi: 10.1021/jacs.6b05917. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Demarest SJ, Martinez-Yamout M, Chung J, Chen H, Xu W, Dyson HJ, et al. Mutual synergistic folding in recruitment of CBP/p300 by p160 nuclear receptor coactivators. Nature. 2002;415(6871):549–553. [DOI] [PubMed] [Google Scholar]
- 39.Soranno A, Koenig I, Borgia MB, Hofmann H, Zosel F, Nettels D, et al. Single-molecule spectroscopy reveals polymer effects of disordered proteins in crowded environments. Proc Natl Acad Sci U S A. 2014;111(13):4874–4879. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Zheng W, Borgia A, Buholzer K, Grishaev A, Schuler B, Best RB. Probing the Action of Chemical Denaturant on an Intrinsically Disordered Protein by Simulation and Experiment. Journal of the American Chemical Society. 2016;138(36):11702–11713. doi: 10.1021/jacs.6b05443. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Peter MF, Gebhardt C, Mächtel R, Muñoz GGM, Glaenzer J, Narducci A, et al. Cross-validation of distance measurements in proteins by PELDOR/DEER and single-molecule FRET. Nature Communications. 2022;13(1):4396. doi: 10.1038/s41467-022-31945-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Mulligan C, Fischer M, Thomas GH. Tripartite ATP-independent periplasmic (TRAP) transporters in bacteria and archaea. FEMS Microbiology Reviews. 2011;35(1):68–86. doi: 10.1111/j.1574-6976.2010.00236.x. [DOI] [PubMed] [Google Scholar]
- 43.Müller A, Severi E, Mulligan C, Watts AG, Kelly DJ, Wilson KS, et al. Conservation of Structure and Mechanism in Primary and Secondary Transporters Exemplified by SiaP, a Sialic Acid Binding Virulence Factor from Haemophilus influenzae*. Journal of Biological Chemistry. 2006;281(31):22212–22222. doi: 10.1074/jbc.M603463200. [DOI] [PubMed] [Google Scholar]
- 44.Johnston JW, Coussens NP, Allen S, Houtman JCD, Turner KH, Zaleski A, et al. Characterization of the N-Acetyl-5-neuraminic Acid-binding Site of the Extracytoplasmic Solute Receptor (SiaP) of Nontypeable Haemophilus influenzae Strain 2019*. Journal of Biological Chemistry. 2008;283(2):855–865. doi: 10.1074/jbc.M706603200. [DOI] [PubMed] [Google Scholar]
- 45.Gouridis G, Schuurman-Wolters GK, Ploetz E, Husada F, Vietrov R, de Boer M, et al. Conformational dynamics in substrate-binding domains influences transport in the ABC importer GlnPQ. Nature Structural & Molecular Biology. 2015;22(1):57–64. doi: 10.1038/nsmb.2929. [DOI] [PubMed] [Google Scholar]
- 46.Fulyani F, Schuurman-Wolters G, Žagar A, Guskov A, Slotboom DJ, Poolman B. Functional Diversity of Tandem Substrate-Binding Domains in ABC Transporters from Pathogenic Bacteria. Structure. 2013;21(10):1879–1888. doi: 10.1016/j.str.2013.07.020. [DOI] [PubMed] [Google Scholar]
- 47.Hall JA, Ganesan AK, Chen J, Nikaido H. Two Modes of Ligand Binding in Maltose-binding Protein of Escherichia coli: FUNCTIONAL SIGNIFICANCE IN ACTIVE TRANSPORT. Journal of Biological Chemistry. 1997;272(28):17615–17622. doi: 10.1074/jbc.272.28.17615. [DOI] [PubMed] [Google Scholar]
- 48.Tang C, Schwieters CD, Clore GM. Open-to-closed transition in apo maltose-binding protein observed by paramagnetic NMR. Nature. 2007;449(7165):1078–1082. doi: 10.1038/nature06232. [DOI] [PubMed] [Google Scholar]
- 49.Sharff AJ, Rodseth LE, Spurlino JC, Quiocho FA. Crystallographic evidence of a large ligand-induced hinge-twist motion between the two domains of the maltodextrin binding protein involved in active transport and chemotaxis. Biochemistry. 1992;31(44):10657–10663. doi: 10.1021/bi00159a003. [DOI] [PubMed] [Google Scholar]
- 50.Quiocho FA, Spurlino JC, Rodseth LE. Extensive features of tight oligosaccharide binding revealed in high-resolution structures of the maltodextrin transport/chemosensory receptor. Structure. 1997;5(8):997–1015. doi: 10.1016/S0969-2126(97)00253-0. [DOI] [PubMed] [Google Scholar]
- 51.Dingfelder F, Benke S, Nettels D, Schuler B. Mapping an Equilibrium Folding Intermediate of the Cytolytic Pore Toxin ClyA with Single-Molecule FRET. The Journal of Physical Chemistry B. 2018;122(49):11251–11261. doi: 10.1021/acs.jpcb.8b07026. [DOI] [PubMed] [Google Scholar]
- 52.Meng F, Bellaiche MMJ, Kim JY, Zerze GH, Best RB, Chung HS. Highly disordered amyloid-beta monomer probed by single-molecule FRET and MD simulation. Biophys J. 2018;114(4):870–884. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Huang J, MacKerell AD Jr. CHARMM36 all-atom additive protein force field: validation based on comparison to NMR data. J Comput Chem. 2013;34(25):2135–2145. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.