Abstract
Histograms of single-molecule Förster resonance energy transfer (FRET) efficiency are often used to study the structures of biomolecules and relate these structures to function. Methods like probability distribution analysis analyze FRET histograms to detect heterogeneities in molecular structure, but they cannot determine whether this heterogeneity arises from dynamic processes or from the coexistence of several static structures. To this end, we introduce burst variance analysis (BVA), a method that detects dynamics by comparing the standard deviation of FRET from individual molecules over time to that expected from theory. Both simulations and experiments on DNA hairpins show that BVA can distinguish between static and dynamic sources of heterogeneity in single-molecule FRET histograms and can test models of dynamics against the observed standard deviation information. Using BVA, we analyzed the fingers-closing transition in the Klenow fragment of Escherichia coli DNA polymerase I and identified substantial dynamics in polymerase complexes formed prior to nucleotide incorporation; these dynamics may be important for the fidelity of DNA synthesis. We expect BVA to be broadly applicable to single-molecule FRET studies of molecular structure and to complement approaches such as probability distribution analysis and fluorescence correlation spectroscopy in studying molecular dynamics.
Introduction
Single-molecule Förster resonance energy transfer (FRET) is an important tool for studying the dynamics of biological molecules and has contributed to fields such as protein folding (1,2), nucleic acid structure and dynamics (3,4), and the function of polymerases (5–8). A common method for analyzing single-pair FRET data is through histograms, which report on the distribution of FRET efficiencies and corresponding donor-acceptor distances for a given molecular species (5–7,9,10). Typically, FRET experiments focus on interpreting changes in mean FRET efficiency, which reflect structural changes in the molecules of interest. Besides mean FRET efficiency, the widths and shapes of these distributions also contain information (9,11–18). Broad FRET distributions may indicate the presence of static heterogeneity, dynamic heterogeneity, or a combination of the two. Static heterogeneity is due to the coexistence of multiple species with static but distinct FRET efficiencies in the same sample, whereas dynamic heterogeneity is due to a single molecular species that fluctuates between multiple distinct FRET states. Dynamic heterogeneity is of special interest, since it can report on the relationship between the conformational states of a biomolecule and its mechanism of action (19,20).
Recent methods such as probability distribution analysis (PDA) (13,14,21–23) and proximity ratio histogram analysis (PRH) (12) have helped interpret the widths of FRET distributions. These methods use the experimental distribution of photon counts from either single fluorescence bursts in solution-phase experiments (PRH) or equally sized time windows (PDA), to calculate the shot-noise-limited distribution of FRET values, i.e., the distribution corresponding to a single, static FRET value, broadened only by photon statistics (shot noise). Whereas a shot-noise-limited distribution is consistent with a single donor-acceptor distance and structural homogeneity, additional broadening indicates the presence of heterogeneity. Recent PDA extensions have made it possible to fit models of static or dynamic heterogeneity to these broad distributions (12,14,22,23). However, it is difficult for either PDA or PRH to determine the exact origin of the broadening.
Fluorescence correlation spectroscopy (FCS) methods have been used extensively to identify molecular dynamics (24); however, it is difficult to resolve dynamics on the diffusion timescale with these methods, and they operate at the small-ensemble level, hindering the study of samples with multiple molecular subpopulations or imperfect fluorescent labeling. Although recent FCS-based methods have broadened the range of detectable dynamic timescales (25,26), they require additional experimental controls and are better suited for small-ensemble data. Moreover, although several studies have used correlation-based methods to resolve dynamics in single-molecule subpopulations (27–29), these suffer from the diffusion-timescale insensitivity of ensemble-based correlation methods. There is therefore a need for methods that can detect diffusion-timescale dynamics in single-molecule FRET experiments.
Here, we introduce burst variance analysis (BVA), which directly detects dynamics in single-molecule FRET data by examining how FRET efficiency fluctuates over time in individual molecules. Whereas the standard deviation of FRET for a static molecule is a simple analytical function of its mean FRET, molecules with dynamic fluctuations in FRET are characterized by an increased standard deviation. BVA compares the static and experimentally observed standard deviations, using a strict statistical criterion to determine whether a given sample exhibits dynamic FRET fluctuations. We demonstrate the ability of BVA to distinguish between static and dynamic heterogeneity using simulations and experiments on both static and dynamic DNA standards. We also show that BVA can be used to analyze the shot-noise predictions generated by PDA, providing a second dimension along which to test models of biomolecular dynamics against experimental data. Finally, we apply BVA to study fingers-closing dynamics in the Klenow fragment (KF) of Escherichia coli DNA polymerase I (Pol I). This conformational change precedes nucleotide incorporation and is thought to contribute to the polymerase's impressive fidelity. We found evidence for previously unidentified fingers-closing dynamics in both KF-DNA (binary) and KF-DNA-deoxynucleotide triphosphate (dNTP) (ternary) complexes, which may be functionally important for the fidelity of DNA synthesis.
Materials and Methods
Single-molecule fluorescence
Solution-phase single-molecule fluorescence experiments were performed using alternating laser excitation as described in previous studies (5,30). The excitation powers measured in continuous-wave mode at 532 and 638 nm were 200 μW and 80 μW, respectively, for DNA samples, and 400 μW and 60 μW, respectively, for KF samples. Samples were analyzed at a concentration of 10–50 pM to minimize multimolecule bursts. DNA samples were measured in 400 mM NaCl, 10 mM Tris-HCl, pH 8.0, 1 mM EDTA, and 100 μg/mL BSA; KF samples were measured in 40 mM HEPES-NaOH, pH 7.3, 10 mM MgCl2, 1 mM DTT, 100 μg/ml BSA, 5% glycerol, and 1 mM β-mercaptoethylamine. Fluorescent labeling and purification of DNA and KF samples is described in the Supporting Material.
Data analysis and simulations
Analysis software was written in MATLAB (The MathWorks, Natick, MA) or C++. Fluorescent bursts were detected as described previously (31) and analyzed to determine the proximity ratio, E∗, the donor-excitation photon count, N, the burst duration, T, and the donor/acceptor stoichiometry, S. Unless otherwise noted, all data were thresholded using S ≥ 0.45, eliminating acceptor-only fluorescent species from our analysis. BVA was implemented as described (see Theory), and PDA was carried out as described (22). BVA histograms were normalized such that the darkest shade represented the densest point on the histogram; white represented zero density. Simulation software was written in C++ and is described in the Supporting Material.
Results and Discussion
Theory
Proximity ratio
A fluorescence burst contains N donor-excitation photons, each detected in either the donor (D) or acceptor (A) channel. The photons arriving in each channel include contributions from both fluorescence, FD and FA, and background, BD and BA. Fluorescence due to leakage of donor fluorescence into the acceptor channel, and to direct excitation of the acceptor fluorophores by the donor-excitation laser (32), can also be observed; we include these contributions in the acceptor fluorescence term, FA. The experimentally observed FRET, or proximity ratio E∗, therefore includes contributions from background, leakage, and direct excitation and is simply the ratio of photons observed in the acceptor channel to total number of photons observed:
(1) |
The proximity ratio is a standard FRET-based reporter for measuring relative distance changes between two fluorophores (33), and we use it throughout this work.
Probability distribution analysis
In its simplest form, PDA predicts the distribution of observed FRET efficiencies, P(E), when the true FRET efficiency, 〈E〉, is the same for all molecules, and the only source of P(E) broadening is photon statistics (13). For the proximity ratio E∗, this expected distribution is
(2) |
where P(F) is the distribution of fluorescence photons per burst, and is the mean proximity ratio. Assuming no background, P(F) = P(N), the experimental distribution of photon counts. Moreover, , the distribution of fluorescence photons in the acceptor channel (including leakage and direct excitation), follows a binomial distribution (13):
(3) |
The mean FRET, , is typically a floating parameter, which we fit by minimizing a reduced chi-square objective function, (see Supporting Material). For simplicity, we ignore the contribution of background fluorescence, which is negligible in our experiments (background counts of ≤6 kHz have a negligible effect on FRET histograms under typical experimental conditions (12,22)), though such contributions can be incorporated (Eq. S1, Eq. S2, and Eq. S3) (13).
Extensions of PDA to predict the shot-noise-limited histograms of samples with static or dynamic heterogeneity are presented in Eq. S4, Eq. S5, Eq. S6, Eq. S7, and Eq. S8 in the Supporting Material, and in detail elsewhere (14,22,23). In a previous publication, we developed a method to predict the shot-noise-limited FRET histograms of molecules with kinetic schemes of arbitrary complexity (22). Whereas this method relied on the simplifying assumption of a uniform photon arrival-time distribution, we have since modified the method to incorporate the experimental distribution of arrival times, improving the accuracy of our PDA predictions. We refer to this method as arrival-time PDA (explained in detail in the Supporting Material); the arrival-time-PDA method preserves photon arrival-time information, allowing the PDA prediction itself to be analyzed by BVA for the presence of dynamics. For validation of the arrival-time-PDA method, see Fig. S2 in the Supporting Material.
Burst variance analysis
If the observed FRET distribution is broader than the expected shot-noise distribution, PDA can be used to fit for multiple static components, or multiple interconverting states (12,14,22,23) (also see the Supporting Material); however, it cannot discriminate between these two sources of heterogeneity. In its simplest form, we use BVA to determine whether the observed broadening of the distribution is due to dynamics.
Whereas PDA examines the heterogeneity in FRET among all molecules in a sample, BVA analyzes the heterogeneity in the FRET of individual molecules over time (Fig. 1). For static heterogeneity, the width of the E∗ distribution expands beyond shot noise, because different molecules have different originating FRET values; however, the FRET distribution over time for any individual static molecule is consistent with a shot-noise-limited distribution (Fig. 1 A, magenta); in contrast, this single-molecule FRET distribution will be wider than shot noise if the molecule exhibits FRET dynamics (Fig. 1 A, blue). In BVA, we test for dynamics by comparing the expected shot-noise-limited standard deviation for a given mean E∗, σE∗, against the observed standard deviation, sE∗, for individual molecules.
For a static species, the expected standard deviation due to shot noise, σE∗, depends only on photon statistics. Any set of n consecutive photons will follow a binomial distribution with respect to emission in the donor and acceptor channels. Assuming no background, the expected standard deviation of FA is that of a binomial, , with the standard deviation of being
(4) |
To calculate the experimental burstwide standard deviation, , we segment each burst, i, into Mi consecutive (and nonoverlapping) windows of n photons each (where Mi is the maximum number of windows in burst i; Fig. 1 B), and calculate the standard deviation of all windows within the burst:
(5) |
where ɛij is the proximity ratio of window j in burst i, and μi is the mean FRET of all such windows in burst i. In this work, we set n = 5 (rationale and discussion of window size effects is provided in the Supporting Material).
Individual bursts often contain only a few photon windows, resulting in large errors in the calculated si. To increase the statistical power of BVA, we segment the E∗ axis into R bins, each centered on a given value of E∗ and bearing a width w. For each bin, we calculate the expected standard deviation, sE∗, of all windows belonging to bursts in the interval , where and are lower and upper bounds, respectively, of the bin (Fig. 1 C),
(6) |
is the proximity ratio of burst i, and μ is the mean FRET of all windows belonging to bursts with . Unless otherwise indicated, we define R = 20 bins, each with a width of 0.05 along the axis; for instance, the bin centering on includes windows from all bursts with . In this work, we consider sE∗ values only from those bins with at least 50 bursts, to ensure that any dynamics detected are representative of the sample. For simplicity, we ignore the contribution of background to BVA; as in PDA, such contributions are usually negligible (see Fig. S3).
We note that, like any method of detecting FRET dynamics, BVA may be sensitive to dynamic changes in fluorophore quantum yield or orientation factor that also give rise to dynamic changes in FRET. It is therefore important to ensure that these artifacts are identified and eliminated with proper controls or else occur on timescales distinct from the dynamics of interest (12,34,35).
Confidence intervals
We calculate upper-limit confidence intervals on by considering the sampling distribution of standard deviations, P(σ), expected for M windows of n photons. Although this distribution has an approximate analytical solution (see Eq. S9 and accompanying text), we use a computationally expensive, but more precise, Monte Carlo approach to calculate P(σ).To implement the Monte Carlo approach, we simulate the sampling distribution of σ,
(7) |
and are random variables drawn from a binomial distribution with n trials (i.e., the number of photons/window) and E∗ probability of success. We define the resulting Monte Carlo distribution as .
We use the distribution to calculate the upper-tail confidence interval on the standard deviation, , and test for dynamics by comparing it to the observed sE∗. As we calculate this interval on R = 20 data bins, we employ a Bonferroni correction for multiple hypothesis testing (implementation described in Abidi (36), and in the Supporting Material). Unless otherwise indicated, we set our per-experiment confidence level to ; deviations beyond the value of sE∗ corresponding to this level should reflect the presence of dynamics.
Simulation results
BVA can distinguish between static and dynamic heterogeneity
We first tested whether BVA can distinguish between static and dynamic heterogeneity as sources of broadening in E∗ distributions. In addition to a series of single, static FRET species (Fig. S5), we simulated an equimolar mixture of three species with distinct but static FRET efficiencies (, , and ) and analyzed it via PDA (Fig. 2 A, upper). As expected, the PDA prediction assuming these static species achieved a good fit to the data (black line; ); however, the same data could also be fit well by considering a single species with two dynamically interconverting states. To be specific, we performed a two-parameter fit assuming symmetry about E∗ = 0.5 and equal forward and backward kinetic rates (red line; , , ; ). The ability of both static and dynamic PDA predictions to account for the observed E∗ histogram (Fig. 2 A, upper; ) demonstrates the difficulty of resolving static versus dynamic heterogeneity with PDA alone. We then analyzed the static sample with BVA; as expected, all sE∗ values fell well within the predicted 99.9% confidence interval (Fig. 2 A, lower, triangles in gray region) correctly suggesting that the observed heterogeneity was due to static, rather than dynamic, sources.
We next simulated a simple, two-state dynamic sample (Fig. 2 B) using the FRET values and first-order rate constants from the dynamic PDA prediction in Fig. 2 A (, , ). Again, PDA predictions assuming either the given two-state dynamic model, or a three-species static model (, (fixed), ), could account for the observed E∗ distribution (Fig. 2 B, upper; ). BVA, however, showed a clear increase in sE∗ beyond the confidence interval for intermediate values of E∗, indicating dynamics (Fig. 2 B, lower, red triangles). Therefore, despite the similarity of E∗ histograms resulting from static or dynamic heterogeneity, BVA could correctly determine the type of heterogeneity present.
For the dynamic sample, the sE∗ values at intermediate E∗ are above the confidence interval, whereas those nearer to the E∗ of each individual state are not. This occurs because the diffusion time of the molecules (∼1 ms) is similar to the timescale of dynamics (in our simulations, molecules fluctuate on a timescale of ); some molecules will therefore sample only one state (e.g., E∗ ∼ 0.3) during their diffusion through the confocal spot, producing an E∗ value characteristic of the sampled state, and giving rise to an sE∗ value consistent with static behavior. Molecules with an intermediate E∗ value, however, sample both FRET states, and therefore show an increased sE∗.
BVA detects FRET dynamics in a timescale-dependent fashion
Studies on dynamic systems have shown that although FRET histograms broaden in response to dynamics near the diffusion timescale, they appear shot-noise-limited when dynamics are much faster or much slower than diffusion (12). In the former case, molecules interconvert so rapidly that the FRET efficiencies of their states average out, and each burst exhibits an apparently constant (intermediate) FRET efficiency; in the latter, molecules interconvert so slowly that every burst is spent in one state or the other, yielding a shot-noise-limited population corresponding to each state. As BVA detects dynamics through intraburst FRET fluctuations, we expected it to show a timescale dependence similar to that observed when analyzing E∗ histogram broadening alone.
To test the ability of BVA to detect dynamics on different timescales, we studied the effects of fluctuation timescales on both E∗ histograms and the sE∗ calculated with BVA. We simulated dynamic species fluctuating between two FRET states, and , at timescales on the order of diffusion (), or three orders of magnitude above () or below it (). As expected, the molecule fluctuating on the diffusion timescale exhibited broadening by PDA, and an increased sE∗ (Fig. 3 B). In contrast, molecules fluctuating much slower or faster than the diffusion timescale appeared static by both PDA and BVA (Fig. 3, A and C); thus, at timescales >3 orders of magnitude slower or faster than diffusion, BVA could not detect dynamics.
To determine the timescales over which BVA can detect dynamics, we simulated the same two-state fluctuation () at timescales from to . To quantify our ability to detect dynamics, we calculated a dynamic score (DS), the sum of squared residuals between the observed standard deviation sE∗ and the upper-tail confidence interval for all significant sE∗ (i.e., those above the confidence interval):
(8) |
The DS is a least-squares-like objective function providing an intuitive measure of dynamics: the DS is zero when all sE∗ are within the confidence interval and the molecule appears static, and nonzero when there is strong evidence for dynamics (i.e., some sE∗ > 0).
Using the same two-state dynamic species (), we calculated the DS over many timescales, and detected dynamics over four orders of magnitude (Fig. 3 D, black line). BVA was most sensitive to FRET fluctuations near the diffusion timescale, regardless of diffusion coefficient (, , ); in all cases, the maximal DS coincided with mean molecular diffusion time (Fig. 3 D, arrow). As diffusion time depends on the dimensions of the confocal spot, we expect features of the experimental setup to affect the sensitivity of BVA to dynamics. We also tested the effects of photon window size, n (Fig. S4), and fluctuation amplitude, (Fig. 3 E), on the ability of BVA to detect dynamics. Fluctuation amplitude had a large impact, with a doubling of fluctuation amplitude yielding a roughly fourfold increase in DS at the diffusion timescale.
Experimental results
BVA can detect dynamic heterogeneity in DNA samples examined using smFRET
For the experimental validation of BVA, we prepared a series of DNAs modeled after a DNA hairpin standard (Fig. S1). The hairpin is a stem-loop structure that interconverts dynamically between an open and a closed conformation on the timescale of diffusion (Figs. 4 A and Fig. S1 B) (22,37,38). The donor and acceptor fluorophores are on the stem and loop, respectively, such that opening and closing result in large FRET changes.
To test whether BVA can distinguish between static and dynamic FRET species, we prepared both a dynamic hairpin and hairpin-like static controls, which remain permanently in either the closed or open conformation (Fig. S1, C and D). We verified that the individual closed and open hairpin conformations are static by testing a mixture of control hairpins (Fig. 4 B); each distribution was broader than expected from shot noise alone (Fig. S6), and consistent with a Gaussian distribution of in the range of 0.15–0.23 nm, as seen previously (12,13,22,34). Such heterogeneity has been attributed to either acceptor dye photophysics, or long-lived states in which fluorophores occupy different positions and/or orientations with respect to the DNA (13,34). Consistent with this proposed quasistatic heterogeneity, BVA analysis showed no evidence for dynamics in either of the two control DNA populations (Fig. 4 B).
We then analyzed the DNA hairpin, which interconverts at the millisecond timescale between the FRET states represented by the two controls, and should yield a large dynamic signal, as we detected previously using a simple form of BVA (31). As expected, BVA revealed a dramatic increase in sE∗ at intermediate E∗ (Fig. 4 C, red triangles); in these bursts, the hairpin switched between the open and closed conformations during its transit through the confocal volume, yielding an intermediate E∗ and high sE∗.
Previously, hairpin dynamics were proposed to occur via a simple two-state kinetic model (22,38,39) wherein the hairpin fluctuates between open and closed conformations with first-order rate constants and ; we thus tested whether the observed distribution and sE∗ values were consistent with such a model. We first used PDA to fit the data to a two-state dynamic model while fixing the FRET and Gaussian broadening of each state equal to that of the static controls (Fig. 4 C, upper). We obtained best-fit kinetic values of and , which yielded a good fit to the data () and an overall reaction time of , consistent with the results of previous correlation-based analyses ( of 0.5–1.0 ms) (31,37).
We then used BVA to determine the sE∗ of this PDA prediction, and found close agreement between the sE∗ of the data and the two-state kinetic model (Fig. 4 C, green triangles); in contrast, the sE∗ data could not be explained by a PDA prediction assuming a distribution of static underlying E∗ values (Fig. 4 C, blue triangles; blue line, upper). This supported earlier PDA-based work suggesting that a two-state dynamic model can account for the dynamics of the hairpin on the timescale of diffusion (25). We note, however, that our best-fit solution exhibits small but systematic deviations in sE∗ compared to the data, possibly due to the presence of minor additional states. Indeed, recent work found that a similar hairpin fluctuated with double-exponential kinetics, suggesting an intermediate state between the open and closed forms (40). Overall, our analysis demonstrates that BVA can both detect dynamics in experimental samples and test models of molecular heterogeneity against experimental data.
Dynamics in the Klenow fragment of E. coli DNA Pol I
The bacterial DNA Pol I is an essential component of DNA replication and repair, and it exhibits remarkable fidelity in selecting the correct template-directed nucleotide for addition to a growing DNA chain. Substantial effort has been invested in studying how this fidelity is achieved, with many reports pointing to conformational changes in the polymerase prior to nucleotide incorporation, and especially to a “fingers-closing” transition during which the polymerase forms a tight pocket around both its DNA substrate and an incoming nucleotide, positioning them for catalysis (41,42) (Fig. 5 A).
Previously, we used single-molecule FRET to monitor the fingers-closing transition in the Klenow Fragment (KF) of E. coli DNA Pol I by labeling it with donor and acceptor fluorophores on the fingers and thumb KF subdomains, respectively (8,25). We showed that fingers-closing and opening occurs dynamically in the absence of a DNA template; this was based on the fact that E∗ distributions of the unliganded KF were too wide to be accounted for by either one or two shot-noise-limited distributions, and that the unliganded KF appeared dynamic via FCS-based methods and an early form of BVA (8). Recently, we showed that the E∗ histogram for unliganded KF was consistent with a simple two-state kinetic model using PDA (25), such that unliganded KF appears to fluctuate between its open and closed states on a millisecond timescale, with rates in agreement with the first report of KF dynamics (8).
For the DNA-Pol binary complex and DNA-Pol-dNTP ternary complex, however, the results were less clear: FCS-based methods failed to reveal clear dynamics in these complexes, despite the apparent existence of both open and closed conformations in equilibrium (6,22), and stopped-flow data showing diffusion-timescale fingers-closing during ternary complex formation (43). Moreover, the distributions could be fitted well by either sums of (static) Gaussian distributions (6) or by a PDA-based two-state dynamic model (22). Finally, application of an early form of BVA, which relied on visual comparisons between BVA contour plots of experimental and simulated data, did not clearly identify dynamics, though it lacked the statistical power to do so conclusively (6).
To determine whether the binary and ternary samples are dynamic, we studied them using BVA. In both samples, KF was bound to a nonextensible hairpin DNA with A as the templating nucleotide (Fig. S1 E); in the ternary complex sample, dTTP was added to form the Pol-DNA-dNTP complex (see Materials and Methods).
We first used PDA to test whether the observed E∗ distributions for each sample could be accounted for with one of two models: a dynamic two-state model or a static two-species model (Fig. 5, B–D). For the dynamic two-state model, we assumed that the polymerase fluctuates between the open and closed complex with first-order rate constants kclose and kopen. In each of the three KF complexes, we fixed the means and Gaussian widths of each state, fit for kclose and kopen, and obtained a good fit to the observed E∗ distribution (Fig. 5, B–D, upper, red lines; in all cases). Furthermore, all fitted kopen and kclose fell in the 100- to 500-s−1 range, consistent with the rate of fingers-closing dynamics expected from previous studies (6,43). We then fit each E∗ histogram assuming the existence of two static (or slowly interconverting) states, corresponding to the open and closed conformations; as in the dynamic model, we held the mean FRET of each state constant, but fit for the Gaussian widths about these means, and . To avoid introducing new parameters, we fixed the relative occupancy of each state according to the apparent equilibrium from the dynamic fit. As in the dynamic model, this static model achieved a reasonable fit for all three polymerase complexes (Fig. 5, B–D, upper, black lines; in all cases). Together, these data suggest that PDA analysis of the observed E∗ histogram is consistent with both static and dynamic models of polymerase behavior in all three complexes.
We then analyzed each sample and its corresponding PDA predictions with BVA, and compared the resulting sE∗ values (Fig. 5, B–D, lower; full contour plots of si shown in Fig. S7). The sE∗ of the actual data were in qualitative agreement with the sE∗ from the dynamic predictions in all cases; in contrast, the sE∗ of the static predictions were consistently and substantially lower than the sE∗ for either the data or the dynamic prediction. To quantify this difference, we calculated the sum of squared residuals (SSR) between the actual sE∗ and those of each prediction, where a smaller SSR indicates a better fit. In all three KF species, the SSR of the dynamic prediction was about an order of magnitude smaller than that of the static prediction (Fig. 5, B–D), suggesting that these wide E∗ distributions were due to dynamic, rather than static, heterogeneity. To ensure that this result was independent of the model of static heterogeneity employed, we tested a model that includes a third static species with an E∗ between that of the open and closed complexes; this model, too, did not agree with the data (Fig. S7). We also investigated how the specific timescales used in the dynamic prediction affected their agreement with the experimental data. In addition to finding that predictions using the rates extracted with PDA (100–500 s−1, consistent with fingers-closing dynamics) matched the observed sE∗ very well, we found that these rates could be altered by more than an order of magnitude before achieving an SSR comparable to the static prediction (Fig. S8). In conclusion, the full BVA method identified clear dynamics in both the binary and ternary complexes of KF (beyond a strict 99.9% confidence interval) and was able to reproduce the observed standard deviation data accurately using a two-state kinetic model (Fig. 5). These results raise the intriguing possibility that the fingers-closing transition in KF may be dynamic throughout its reaction trajectory, whether DNA-bound or poised for dNTP incorporation.
Conclusions
We introduced BVA, an analytical method that enables the detection of dynamics in single-molecule FRET experiments; it accomplishes this by comparing the standard deviation of FRET from individual molecules over time to that expected from theory. To characterize BVA, we analyzed static DNA molecules, a dynamic DNA hairpin, and the KF of DNA Pol I and its complexes. BVA analysis showed a lack of dynamics in static DNA standards but clear dynamics in a previously characterized dynamic DNA hairpin. We combined BVA with PDA to test specific models of static or dynamic heterogeneity against smFRET data on the DNA hairpin; whereas both static and dynamic sources of heterogeneity could explain the observed E∗ distribution, only a two-state dynamic model could account for the experimental data when incorporating standard deviation information via BVA.
We used BVA to analyze the conformational dynamics of KF binary and ternary complexes, uncovering millisecond-timescale dynamics not previously detected using a correlation-based approach (6), and suggested (but not conclusively identified) using PDA (21). The presence of dynamics at the millisecond timescale is consistent with recent studies showing that the fingers-closing transition is not rate-limiting for nucleotide addition, supporting a model in which fingers-closing precedes, but does not commit the polymerase to, dNTP incorporation (43,44). Together with our previous results (6), this suggests a model in which fingers-closing, a prechemistry step important for discriminating between matched and mismatched nucleotides, may occur several times prior to successful dNTP incorporation.
Our results illustrate the usefulness of BVA in detecting structural dynamics using single-molecule FRET. Due to its diffusion-timescale sensitivity, BVA complements correlation-based methods, as well as time-resolved smFRET measurements (15), which may have difficulty detecting dynamics near the diffusion timescale. BVA also complements a recent PDA-based method to identify the presence of FRET dynamics (23), both by offering model-free detection of dynamics, and by adding a dimension along which to hypothesis-test specific models of dynamics; the latter is useful in rejecting incorrect models that produce distributions consistent with experiments, but show poor agreement between predicted and observed BVA data. Since BVA is performed on single-molecule data, it can also be applied to subpopulations of interest, or imperfectly labeled samples, without producing the artifacts inherent to correlation-based methods. BVA should be broadly useful in single-molecule FRET studies of enzyme structures and dynamics, and of protein and nucleic acid folding.
Acknowledgments
The authors thank C. M. Joyce and N. D. F. Grindley for helpful discussions, and C. M. Joyce and O. Potapova for providing fluorescently labeled Klenow fragment.
This work was supported by a European Commission Seventh Framework Programme (FP7/2007–2013) grant (HEALTH-F4-2008-201418, entitled READNA) and a Biotechnology and Biological Sciences Research Council grant (BB/H01795X/1) to A. N. Kapanidis, an Environmental Protection Agency Cephalosporin Scholarship (Linacre College, University of Oxford, UK) to Yusdi Santoso, and a Clarendon Award (Oxford University, Oxford, UK) to J. P. Torella. The authors declare that they have no competing financial interests.
Supporting Material
References
- 1.Deniz A.A., Laurence T.A., Weiss S. Single-molecule protein folding: diffusion fluorescence resonance energy transfer studies of the denaturation of chymotrypsin inhibitor 2. Proc. Natl. Acad. Sci. USA. 2000;97:5179–5184. doi: 10.1073/pnas.090104997. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Schuler B., Eaton W.A. Protein folding studied by single-molecule FRET. Curr. Opin. Struct. Biol. 2008;18:16–26. doi: 10.1016/j.sbi.2007.12.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Karymov M.A., Chinnaraj M., Lyubchenko Y.L. Structure, dynamics, and branch migration of a DNA Holliday junction: a single-molecule fluorescence and modeling study. Biophys. J. 2008;95:4372–4383. doi: 10.1529/biophysj.108.135103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Zhao R., Rueda D. RNA folding dynamics by single-molecule fluorescence resonance energy transfer. Methods. 2009;49:112–117. doi: 10.1016/j.ymeth.2009.04.017. [DOI] [PubMed] [Google Scholar]
- 5.Kapanidis A.N., Margeat E., Ebright R.H. Initial transcription by RNA polymerase proceeds through a DNA-scrunching mechanism. Science. 2006;314:1144–1147. doi: 10.1126/science.1131399. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Santoso Y., Joyce C.M., Kapanidis A.N. Conformational transitions in DNA polymerase I revealed by single-molecule FRET. Proc. Natl. Acad. Sci. USA. 2010;107:715–720. doi: 10.1073/pnas.0910909107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Liu S.X., Abbondanzieri E.A., Zhuang X. Slide into action: dynamic shuttling of HIV reverse transcriptase on nucleic acid substrates. Science. 2008;322:1092–1097. doi: 10.1126/science.1163108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Coban O., Lamb D.C., Nienhaus G.U. Conformational heterogeneity in RNA polymerase observed by single-pair FRET microscopy. Biophys. J. 2006;90:4605–4617. doi: 10.1529/biophysj.105.078840. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Majumdar D.S., Smirnova I., Kaback H.R. Single-molecule FRET reveals sugar-induced conformational dynamics in LacY. Proc. Natl. Acad. Sci. USA. 2007;104:12640–12645. doi: 10.1073/pnas.0700969104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Gansen A., Valeri A., Seidel C.A. Nucleosome disassembly intermediates characterized by single-molecule FRET. Proc. Natl. Acad. Sci. USA. 2009;106:15308–15313. doi: 10.1073/pnas.0903005106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Gopich I.V., Szabo A. Single-molecule FRET with diffusion and conformational dynamics. J. Phys. Chem. B. 2007;111:12925–12932. doi: 10.1021/jp075255e. [DOI] [PubMed] [Google Scholar]
- 12.Nir E., Michalet X., Weiss S. Shot-noise limited single-molecule FRET histograms: comparison between theory and experiments. J. Phys. Chem. B. 2006;110:22103–22124. doi: 10.1021/jp063483n. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Antonik M., Felekyan S., Seidel C.A. Separating structural heterogeneities from stochastic variations in fluorescence resonance energy transfer distributions via photon distribution analysis. J. Phys. Chem. B. 2006;110:6970–6978. doi: 10.1021/jp057257+. [DOI] [PubMed] [Google Scholar]
- 14.Kalinin S., Felekyan S., Seidel C.A. Characterizing multiple molecular states in single-molecule multiparameter fluorescence detection by probability distribution analysis. J. Phys. Chem. B. 2008;112:8361–8374. doi: 10.1021/jp711942q. [DOI] [PubMed] [Google Scholar]
- 15.Laurence T.A., Kong X.X., Weiss S. Probing structural heterogeneities and fluctuations of nucleic acids and denatured proteins. Proc. Natl. Acad. Sci. USA. 2005;102:17348–17353. doi: 10.1073/pnas.0508584102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Deniz A.A., Dahan M., Schultz P.G. Single-pair fluorescence resonance energy transfer on freely diffusing molecules: observation of Förster distance dependence and subpopulations. Proc. Natl. Acad. Sci. USA. 1999;96:3670–3675. doi: 10.1073/pnas.96.7.3670. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Watkins L.P., Chang H.Y., Yang H. Quantitative single-molecule conformational distributions: a case study with poly-(L-proline) J. Phys. Chem. A. 2006;110:5191–5203. doi: 10.1021/jp055886d. [DOI] [PubMed] [Google Scholar]
- 18.Hanson J.A., Duderstadt K., Yang H. Illuminating the mechanistic roles of enzyme conformational dynamics. Proc. Natl. Acad. Sci. USA. 2007;104:18055–18060. doi: 10.1073/pnas.0708600104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Henzler-Wildman K., Kern D. Dynamic personalities of proteins. Nature. 2007;450:964–972. doi: 10.1038/nature06522. [DOI] [PubMed] [Google Scholar]
- 20.Henzler-Wildman K.A., Thai V., Kern D. Intrinsic motions along an enzymatic reaction trajectory. Nature. 2007;450:838–844. doi: 10.1038/nature06410. [DOI] [PubMed] [Google Scholar]
- 21.Kalinin S., Felekyan S., Seidel C.A. Probability distribution analysis of single-molecule fluorescence anisotropy and resonance energy transfer. J. Phys. Chem. B. 2007;111:10253–10262. doi: 10.1021/jp072293p. [DOI] [PubMed] [Google Scholar]
- 22.Santoso Y., Torella J.P., Kapanidis A.N. Characterizing single-molecule FRET dynamics with probability distribution analysis. ChemPhysChem. 2010;11:2209–2219. doi: 10.1002/cphc.201000129. [DOI] [PubMed] [Google Scholar]
- 23.Kalinin S., Valeri A., Seidel C.A. Detection of structural dynamics by FRET: a photon distribution and fluorescence lifetime analysis of systems with multiple states. J. Phys. Chem. B. 2010;114:7983–7995. doi: 10.1021/jp102156t. [DOI] [PubMed] [Google Scholar]
- 24.Gurunathan K., Levitus M. Applications of fluorescence correlation spectroscopy to the study of nucleic acid conformational dynamics. Prog. Nucleic Acid Res. Mol. Biol. 2008;82:33–69. doi: 10.1016/S0079-6603(08)00002-0. [DOI] [PubMed] [Google Scholar]
- 25.Torres T., Levitus M. Measuring conformational dynamics: a new FCS-FRET approach. J. Phys. Chem. B. 2007;111:7392–7400. doi: 10.1021/jp070659s. [DOI] [PubMed] [Google Scholar]
- 26.Hohlbein J., Steinhart M., Hübner C.G. Confined diffusion in ordered nanoporous alumina membranes. Small. 2007;3:380–385. doi: 10.1002/smll.200600398. [DOI] [PubMed] [Google Scholar]
- 27.Eggeling C., Fries J.R., Seidel C.A. Monitoring conformational dynamics of a single molecule by selective fluorescence spectroscopy. Proc. Natl. Acad. Sci. USA. 1998;95:1556–1561. doi: 10.1073/pnas.95.4.1556. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Laurence T.A., Kwon Y., Barsky D. Correlation spectroscopy of minor fluorescent species: signal purification and distribution analysis. Biophys. J. 2007;92:2184–2198. doi: 10.1529/biophysj.106.093591. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Laurence T.A., Kwon Y., Barsky D. Motion of a DNA sliding clamp observed by single molecule fluorescence spectroscopy. J. Biol. Chem. 2008;283:22895–22906. doi: 10.1074/jbc.M800174200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Kapanidis A.N., Lee N.K., Weiss S. Fluorescence-aided molecule sorting: analysis of structure and interactions by alternating-laser excitation of single molecules. Proc. Natl. Acad. Sci. USA. 2004;101:8936–8941. doi: 10.1073/pnas.0401690101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Santoso Y., Kapanidis A.N. Probing biomolecular structures and dynamics of single molecules using in-gel alternating-laser excitation. Anal. Chem. 2009;81:9561–9570. doi: 10.1021/ac901423e. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Lee N.K., Kapanidis A.N., Weiss S. Accurate FRET measurements within single diffusing biomolecules using alternating-laser excitation. Biophys. J. 2005;88:2939–2953. doi: 10.1529/biophysj.104.054114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Dahan M., Deniz A.A., Weiss S. Ratiometric measurement and identification of single diffusing molecules. Chem. Phys. 1999;247:85–106. [Google Scholar]
- 34.Kalinin S., Sisamakis E., Seidel C.A. On the origin of broadening of single-molecule FRET efficiency distributions beyond shot noise limits. J. Phys. Chem. B. 2010;114:6197–6206. doi: 10.1021/jp100025v. [DOI] [PubMed] [Google Scholar]
- 35.Chung H.S., Louis J.M., Eaton W.A. Distinguishing between protein dynamics and dye photophysics in single-molecule FRET experiments. Biophys. J. 2010;98:696–706. doi: 10.1016/j.bpj.2009.12.4322. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Abidi H. Encyclopedia of Measurement and Statistics. Sage; Thousand Oaks, CA: 2007. Bonferroni and Sidak corrections for multiple comparisons; pp. 103–107. [Google Scholar]
- 37.Wallace M.I., Ying L.M., Klenerman D. FRET fluctuation spectroscopy: exploring the conformational dynamics of a DNA hairpin loop. J. Phys. Chem. B. 2000;104:11551–11555. [Google Scholar]
- 38.Ying L.M., Wallace M.I., Klenerman D. Two-state model of conformational fluctuation in a DNA hairpin-loop. Chem. Phys. Lett. 2001;334:145–150. [Google Scholar]
- 39.Bonnet G., Krichevsky O., Libchaber A. Kinetics of conformational fluctuations in DNA hairpin-loops. Proc. Natl. Acad. Sci. USA. 1998;95:8602–8606. doi: 10.1073/pnas.95.15.8602. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Jung J.Y., Van Orden A. A three-state mechanism for DNA hairpin folding characterized by multiparameter fluorescence fluctuation spectroscopy. J. Am. Chem. Soc. 2006;128:1240–1249. doi: 10.1021/ja0560736. [DOI] [PubMed] [Google Scholar]
- 41.Li Y., Korolev S., Waksman G. Crystal structures of open and closed forms of binary and ternary complexes of the large fragment of Thermus aquaticus DNA polymerase I: structural basis for nucleotide incorporation. EMBO J. 1998;17:7514–7525. doi: 10.1093/emboj/17.24.7514. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Joyce C.M., Benkovic S.J. DNA polymerase fidelity: kinetics, structure, and checkpoints. Biochemistry. 2004;43:14317–14324. doi: 10.1021/bi048422z. [DOI] [PubMed] [Google Scholar]
- 43.Joyce C.M., Potapova O., Grindley N.D. Fingers-closing and other rapid conformational changes in DNA polymerase I (Klenow fragment) and their role in nucleotide selectivity. Biochemistry. 2008;47:6103–6116. doi: 10.1021/bi7021848. [DOI] [PubMed] [Google Scholar]
- 44.Rothwell P.J., Mitaksov V., Waksman G. Motions of the fingers subdomain of klentaq1 are fast and not rate limiting: implications for the molecular basis of fidelity in DNA polymerases. Mol. Cell. 2005;19:345–355. doi: 10.1016/j.molcel.2005.06.032. [DOI] [PubMed] [Google Scholar]
- 45.Johnson S.J., Taylor J.S., Beese L.S. Processive DNA synthesis observed in a polymerase crystal suggests a mechanism for the prevention of frameshift mutations. Proc. Natl. Acad. Sci. USA. 2003;100:3895–3900. doi: 10.1073/pnas.0630532100. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.