Abstract
Photon sequences from single-molecule Förster resonance energy transfer (FRET) experiments can be analyzed using a maximum likelihood method. Parameters of the underlying kinetic model (FRET efficiencies of the states and transition rates between conformational states) are obtained by maximizing the appropriate likelihood function. In addition, the errors (uncertainties) of the extracted parameters can be obtained from the curvature of the likelihood function at the maximum. We study the standard deviations of the parameters of a two-state model obtained from photon sequences with recorded colors and arrival times. The standard deviations can be obtained analytically in a special case when the FRET efficiencies of the states are 0 and 1 and in the limiting cases of fast and slow conformational dynamics. These results are compared with the results of numerical simulations. The accuracy and, therefore, the ability to predict model parameters depend on how fast the transition rates are compared to the photon count rate. In the limit of slow transitions, the key parameters that determine the accuracy are the number of transitions between the states and the number of independent photon sequences. In the fast transition limit, the accuracy is determined by the small fraction of photons that are correlated with their neighbors. The relative standard deviation of the relaxation rate has a “chevron” shape as a function of the transition rate in the log-log scale. The location of the minimum of this function dramatically depends on how well the FRET efficiencies of the states are separated.
I. INTRODUCTION
In single-molecule Förster resonance energy transfer (FRET) spectroscopy, likelihood-based methods can be used to get the maximum information about a molecule’s conformational dynamics from available experimental data.1–6 Fluctuations of the distance between donor and acceptor labels attached to the molecule lead to the fluctuation of the donor and acceptor photon intensity. The sequence of photons can be analyzed using the likelihood function of a model that describes the interdye distance changes. The parameters of the model are estimated by maximizing the likelihood function.
When the processes being studied are slow, photon sequences can be divided into time bins and studied using various methods, including Hidden Markov Models.7,8 The observables are the numbers of donor and acceptor photons detected during bin time. This analysis is applicable when the bin time is much shorter than the time scale of conformational changes.9 To study fast conformational dynamics, photon sequences must be analyzed without binning.10–16 The observables in this case are the photon colors (i.e., donor, green, or acceptor, red) and photon arrival times (or the interphoton times) so that all information in the measurements are utilized. Such analysis of single-molecule photon sequences has been used to find transition rates of fast folding proteins17,18 (in which case there can be several transitions during smallest bin time) and transition path time, which is comparable to the interphoton times.5,19,20
In this paper, we study the likelihood function that is used to analyze photon sequences without binning with recorded photon colors and arrival times.12 We are interested in the accuracy of the parameters obtained by the Maximum Likelihood (ML) method. The problem of the accuracy of ML estimates has been a focus of numerous studies, including those in single-molecule FRET.21,22 There can be two types of errors. First, the model used to construct the likelihood function may not be accurate. Second, even when the model is perfect, there are uncertainties due to the fact that the amount of data is limited. This is the type of errors that is the subject of the present paper.
We consider the simplest two-state model of conformational dynamics. For this model, we study the uncertainties of the parameters obtained from the arrival times and colors of photons. Statistical properties of the ML estimates from photon arrival trajectories for the two-state model have been studied numerically by Waligórska and Molski.14 The results of simulations for this model have also been reported in Ref. 23. Here, we present an extensive study of the two-state ML estimates and derive analytical expressions for the standard deviations of the extracted parameters in some special and limiting cases.
Our aim is to understand the behavior of the errors in the extracted parameters. We would like to know how the errors depend on various parameters such as the number of photons (photon intensity), transition rates, and duration of the photon sequences. Intuitively, the parameters should be more accurately estimated when the transitions between the states are slow and the separation between the two FRET efficiencies is large. In this limit, many photons are detected before the interdye distance changes; the states can be readily identified and the sequence of photons can be transformed into a state trajectory. The error in the transition rate decreases when the number of transitions between the two states increases so that faster transitions result in more accurate estimates of the transition rates given photon sequences with the same total duration. In the case when the transition rates are very large and the states of the molecule cannot be directly determined, the accuracy should decrease. In this paper, we make these intuitive ideas more precise and provide simple expressions for the variance of the ML estimates obtained from analytical theory and validated by simulations.
We start the paper by formulating the model and presenting our main results for the errors of the ML estimates in Sec. II. The variances of the ML estimates were obtained analytically in a special case (when the FRET efficiencies in two states are 0 and 1) and in two limiting cases of fast and slow conformational dynamics. The results are discussed and compared with simulations in Sec. III. The last three sections contain the details of the derivation.
II. MODEL AND RESULTS
A. Model
Consider a molecule at equilibrium undergoing transitions between two conformational states with a small and a large interdye distance corresponding to high and low FRET efficiencies. The transitions between the two states are described by the rate constants k1 (1 → 2) and k2 (2 → 1). The equilibrium populations of the states are p1 = k2/(k1 + k2) and p2 = 1 − p1. The statistics of the acceptor and donor photon counts in each state are Poissonian and determined by the acceptor, nAi (i = 1, 2), and donor, nDi, photon count rates (or fluorescence intensities). The photon count rates are the mean numbers of photon counts detected per unit time. These parameters depend on the distance between donor and acceptor fluorophores, laser intensity, background noise, etc.24 The assumption of Poisson statistics implies that all these parameters are fixed when the molecule is in a given conformational state.
The ratio of the acceptor count rate to the total count rate is the apparent FRET efficiency, so that each state can be characterized by the FRET efficiencies and , which are the probabilities of observing an acceptor rather than a donor photon when the molecule is in state i = 1, 2. Unlike the photon count rates, the apparent FRET efficiencies usually do not depend on the laser intensity.
A sequence of photons with recorded photon colors and arrival times is analyzed by constructing a likelihood function and optimizing it with respect to the model parameters. In general, the likelihood function involves all four photon count rates.12,14 Here, we assume that the total (acceptor and donor) photon count rate in two states is the same, nA1 + nD1 = nA2 + nD2 ≡ n. In this case, the likelihood function can be presented in a form that does not involve photon count rates.12 The likelihood function of jth sequence of photons (or photon trajectory) with Nj photons of color ci (acceptor or donor) and interphoton times τi is
| (1) |
where F(acceptor) = E, F(donor) = I − E, E is the diagonal matrix of FRET efficiencies, I is the 2 × 2 unity matrix, K is the rate matrix, 1⊤ is a row vector with unit elements, and peq is the column vector of equilibrium populations
| (2) |
Here, k = k1 + k2 is the relaxation rate. The likelihood function in Eq. (1) reads from the right to the left. Note that the states of the molecule are not observed, so the first photon can be emitted from the first or second state with probabilities p1 and p2 = 1 − p1. Therefore, the first term on the right in Eq. (1) is the vector of equilibrium probabilities peq. The next term F(c1) corresponds to the first detected photon and depends on the color of this photon (donor or acceptor). The evolution of the conformational states until the second photon is detected at time τ2 is described by the matrix exponential, exp(Kτ2). The next term F(c2) depends on the color of the second detected photon, and so on. The final multiplication by 1⊤ sums over all conformational states.
The above likelihood function can be used to analyze the experiments with both immobilized and diffusing molecules17 without need to infer photon intensities. In the experiments with freely diffusing molecules, many short photon sequences (bursts of photons) emitted by different molecules are observed.25–27 The total likelihood of M independent photon sequences is the product of the likelihoods of individual sequences, . The total duration of all photon sequences is T and the total number of all photons is . It is assumed that there are many photons in each photon sequence (or burst) so that N ≫ M.
The resulting likelihood function depends on four model parameters, FRET efficiencies of the states and , the relaxation rate k = k1 + k2, and the population p1 = k2/k = 1 − p2. These parameters are found by numerical optimization of the log-likelihood. We assume that all four parameters are unknown, except for the special case considered in Subsection II B, in which the FRET efficiencies of the states are known.
The likelihood function in Eq. (1) is exact for the model described above. This means that when photon sequences corresponding to the model are analyzed with the likelihood function in Eq. (1), the ML estimates converge to the true values in the limit of large number of photons. When the number of photons is infinite, the likelihood function is a delta-function at the true parameter values. When the number of photons is large, but finite, the likelihood function at the maximum can be approximated by a multidimensional Gaussian. The errors of the parameters can be estimated from the curvature of the likelihood function at the maximum, which is related to the Fisher information.21,28,29 This approximation of normal distribution is appropriate when the number of photons is large, so the error of the parameter estimation is small. In addition, the estimates should not be close to the boundaries (which are 0 and 1 for the FRET efficiencies and population and 0 for the relaxation rate).
Below, we first present the main results (and the key ideas behind them) for the variances of ML estimates, which are found from the curvature of the likelihood function at the maximum. Detailed derivations are given in the subsequent sections. In general, the above likelihood function is a complicated function of parameters and can be studied only numerically. To study this likelihood analytically, we first consider a special case in which the interdye distance in the two states is much smaller and much larger than the Förster radius, so the FRET efficiencies of the states are 0 and 1. Then, we present the analytic results for arbitrary FRET efficiencies when conformational changes are fast and slow compared to the time between photons. These analytic results are validated by simulations.
B. Exactly solvable model
Assume that the FRET efficiencies in two states are known and equal to 0 and 1. This means that the state of the molecule is known at the moment when a photon is detected, i.e., the donor photons are emitted only when the molecule is in state 1 and the acceptor photons are from state 2 (see Fig. 1(a)). Then, the likelihood function in Eq. (1), which is a product of matrices, simplifies and becomes a product of scalar functions (see Eq. (11)) and can be treated analytically. There are only two parameters that are found by maximizing the log-likelihood, i.e., the relaxation rate k and the population p1.
FIG. 1.

Transitions between two states with FRET efficiencies 0 and 1. (a) Acceptor (red) and donor (green) photon trajectory and state (blue) trajectory. Photon arrival times correspond to the centers of circles. The states are equally populated, total photon count rate n = 100 in arbitrary time units, two-state relaxation rate k = 2k1 = 2k2 = 100. (b) and (c) Relative standard deviation of the ML estimate of the relaxation rate (b) and population (c) as a function of the ratio of the relaxation rate, k, and photon count rate, n (the square roots of Eqs. (3) and (4)). The number of photon sequences is M = 1, 102, 103, 104; the number of photons in all sequences is N = 200 000.
The relative variance of the relaxation rate can be found in terms of special functions (see Sec. IV) and becomes relatively simple for equally populated states (p1 = p2 = 1/2)
| (3) |
where N is the total number of all photons and is the Hurwitz zeta function (a generalization of the Riemann zeta function ).
The square root of Eq. (3) is the relative standard deviation of the relaxation rate, which is shown in Fig. 1(b). This function has a “chevron” (or “V”) shape as a function of k/n in the log-log scale, where n is the photon count rate. At small relaxation rates, k ≪ n, the relative quadratic error is 〈δk2〉/k2 → 2n/kN = 2/kT, where T = N/n is the duration of all photon sequences. kT/2 is the mean number of transitions during T. Thus, when the relaxation rate is small, the error is large because there are few transitions in all photon sequences. When the relaxation rate is large, k ≫ n, the error is 〈δk2〉/k2 → 4k/(Nnζ(3)). The transition between the fast and slow regions is at the minimum of the above function, k/n ≈ 0.756. Thus, the relative standard deviation of the relaxation rate is large when k is small (small number of transitions) and when k is large (many transitions between two photons). The optimal standard deviation is when the numbers of photons and transitions are comparable.
The relative variance of the population for equally populated states is
| (4) |
where ψ(z) = Γ′(z)/Γ(z) is the digamma function (the logarithmic derivative of the gamma function). The relative standard deviation of the population (the square root of Eq. (4)) is shown in Fig. 1(c). When the relaxation rate is small, k ≪ n, then 〈δp2〉/p2 ∼ 1/(Nk/(2n) + M) = 1/(kT/2 + M). Even when there are few transitions in all photon sequences, the population can be accurately determines if there are enough independent photon sequences (M ≫ 1). When the relaxation rate is large, k ≫ n, the quadratic error is ∝1/N (assuming that N ≫ M). The population is very accurately determined even when the relaxation rate is much faster than the interphoton times (see Fig. 1(c)). This is because the FRET efficiencies in this special case are assumed to be known, and the population is related to the average FRET efficiency (), which can be determined with high accuracy from the total number of photons. This is not the case when all parameters are unknown.
C. Slow transitions
The above results can be extended to arbitrary FRET efficiencies by considering the limiting cases of fast and slow transitions separately. Here, we present the results when the transitions between the states are slow. In this case, there are many photons detected while the molecule stays in one state and so one can readily determine a state trajectory from the sequence of photons (by comparing color patterns or by binning and comparing FRET efficiencies in bins). One can also count the number of transitions between the states and the number of photons in each state. The likelihood function in Eq. (1) can be replaced by a simpler likelihood with new observables which are the number of photons in each state and the times between transitions. The details of the analysis of this likelihood function are given in Sec. V.
The error of the FRET efficiency in this limit does not depend on the relaxation rate
| (5) |
where piN is the mean number of photons emitted from state i (i = 1, 2). The above variance of the FRET efficiency ML estimate is the same as the variance due to shot noise.24,30
The variances of the relaxation rate and of population are
| (6a) |
| (6b) |
| (6c) |
The above equations reduce to those for the exactly solvable model in Eqs. (3) and (4) in the limit k/n ≪ 1 when p1 = p2 = 1/2.
The errors of the relaxation rate and of the population in Eq. (6) are independent of the FRET efficiencies of the states and photon intensity (n). They are determined by the number of transitions between the states, kT/2, and the number of independent photon sequences, M. The relaxation rate and the population are uncoupled when the populations of the states are equal.
When there are more transitions than photon sequences, kT ≫ 2M, the errors of both k and p1 can be estimated from the following simple arguments. The mean number of transitions 1 → 2 (the same as those of 2 → 1 on average) in the photon trajectory of duration T = N/n is N1→2 = p1k1T = p1p2kT. In the limit of slow transitions, it can be shown (see Sec. V) that parameters k1 and k2 are independent and have variance . The error of k = k1 + k2 can be obtained using . The error of p1 = k2/(k1 + k2) can be obtained by calculating and using 〈δk1δk2〉 = 0. Similarly, the covariance of k and p1 can be obtained using 〈δkδp1〉 = 〈(δk1 + δk2)(k1δk2 − k2δk1)/k2〉. In this way, we get the results in Eq. (6) with M/kT set to zero.
In the opposite limit when transitions during a single photon sequence are infrequent and there are more sequences than transitions (kT ≪ 2M), the error of k is still determined by the number of transitions in all photon sequences. However, the error of p1 is determined by the number of photon trajectories M
| (7) |
Thus, the populations (but not the rates) can be accurately estimated even when there are no transitions during photon trajectories. Since it has been assumed that each photon trajectory starts randomly, the color of the first photon in the trajectory has information about the equilibrium populations.
D. Fast transitions
The case where the transition rates are fast is more challenging. In the limit of very fast transitions, all photons are uncorrelated and the only parameter that can be accurately determined is the mean FRET efficiency, . This is why the errors of all four parameters increase as k/n increases. When k/n is large, but finite, only photons separated by short interphoton times are correlated. The pairs of consecutive photons that are separated by times ∼1/k provide additional information about those parameters that determine the interphoton time distributions. The number of such pairs is ∝Nn/k. The interphoton time distributions involve only the mean FRET efficiency, , the relaxation rate, k, and the FRET efficiency variance due to two-state transitions, . Consequently, the relative quadratic error of the relaxation rate is ∝k/n (i.e., inversely proportional to the number of the correlated pairs of photons). To get the remaining parameters, , , and p1, a significant number of three consecutive photons must be correlated (see Sec. VI). The probability of detecting such photons is proportional to (n/k)2. As a result, the quadratic errors of the FRET efficiencies and of the population increase as ∝(k/n)2.
The details of the derivation are considered in Sec. VI. It will be shown that the quadratic errors (variances) in the limit of large relaxation rate, k/n ≫ 1, and small separation between FRET efficiencies of the states, , are
| (8a) |
| (8b) |
| (8c) |
| (8d) |
These equations are applicable only when the variances are small (≪1). When the error is large, the assumptions used to derive these equations do not hold. The above variances confirm the dependence on k/n mentioned above. They are very sensitive to .
The restriction of small separation between the states can be removed for the case when the populations are equal and . In this case, it can be shown that
| (9a) |
| (9b) |
| (9c) |
where , is the polylogarithm function, and . When the separation between the two FRET efficiencies is small, Δ ≪ 1, Lim(Δ4) → Δ4, F(Δ−2 − 1) → Δ2, and Eqs. (9) reduce to Eqs. (8) (with , p1 = p2 = 1/2). When and , Δ = 1, Li3(1) = ζ(3), and Eq. (9c) become 〈δk2〉/k2 = 4k/(nNζ(3)). This coincides with the exactly solvable model in Eq. (3) in the limit k/n ≫ 1. However, the population variances in Eqs. (9b) and (4) differ. This is because in the exactly solvable model the FRET efficiencies are assumed to be known, whereas here all four parameters are unknown.
The errors in Eq. (9) depend on the separation of the FRET efficiencies, , as Δ2 and Δ4. Since Δ ≤ 1, the approximation of small Δ in Eq. (8) turns out to be pretty accurate. Even when and , the difference between Eq. (9c) and its approximate version Eq. (8d) is only 11%.
III. COMPARISON WITH SIMULATIONS AND DISCUSSION
Simulations to find the errors of the ML estimates were performed for various relaxation rates and FRET efficiencies. We generated photon sequences and analyzed them to find FRET efficiencies and , the relaxation rate k, and the population of the first state, p1. These parameters were found by maximizing the log-likelihood , with Lj in Eq. (1). The standard deviations were obtained by evaluating numerically the covariance matrix at the maximum from the Hessian matrix. The parameters of the simulations were chosen to keep the errors small (less than 30%), so that the likelihood function can be well approximated by a multidimensional Gaussian near the maximum.14 For each data set, we simulated 200 photon sequences of 10 ms duration and photon count rate n = 100 ms−1. These parameters are similar to those used in single-molecule FRET experiments on fast protein folding.23 Each estimation experiment was repeated 10 times and the resulting standard deviations were averaged. All sequences used to extract parameters have the same number of photons (200 000) and same total duration (2 s). The states in the model were equally populated; the relaxation rate k varied from 0.01 ms−1 to 1000 ms−1. There were four different sets of FRET efficiencies, including ones that are very close (0.4 and 0.6) or well separated (0.1 and 0.9). The values of FRET efficiencies were chosen so that . This allows us to find simple expressions for the standard deviation analytically, Eq. (9), and to test the applicability of the results in Eq. (8). Simulations with the non-symmetric FRET efficiencies and various state populations show similar trends.23
The results of the simulations for the standard deviations of the parameters are shown in Fig. 2. It can be seen that there are two qualitatively different regions, which we refer to as slow and fast transition regions. When the transitions between the states are fast, the standard deviations dramatically depend on how different the FRET efficiencies of the states are. When the transitions are slow, the errors are insensitive to the values of FRET efficiencies. The results of the simulations agree with the standard deviations obtained in the limiting cases of fast (Eqs. (8) and (9)) and slow (Eqs. (5) and (6)) transitions, confirming that these equations capture the dependence of the standard deviation on the model parameters.
FIG. 2.
Accuracy of the maximum likelihood estimates for a two-state model as a function of the ratio of the relaxation rate and the photon count rate, k/n, at various FRET efficiencies. Relative standard deviation of the relaxation rate (a) and of the population (b) (circles). (c) Standard deviation of the FRET efficiencies of the first (reverse triangles) and second (triangles) states normalized to , i = 1, 2. The states are equally populated, the FRET efficiencies and are 0.4 and 0.6 (red), 0.3 and 0.7 (orange), 0.2 and 0.8 (green), and 0.1 and 0.9 (blue). Purple lines are the standard deviations for and (calculated using Eqs. (3) and (4)). Dashed lines are the errors in the fast transition limit (the square roots of Eqs. (8), black, and (9), cyan) and in the slow transition limit (the square roots of Eqs. (5) and (6), magenta).
Let us consider the plots in more detail. The relative standard deviation of the relaxation rate in Fig. 2(a) has a “chevron” (“V”) shape, similar to the analytically solvable model in Fig. 1(b). The shape of the curves in the slow transitions regime is independent of the FRET efficiencies and is described by Eq. (6a) (magenta dashed line). As follows from Eq. (6a), the accuracy of the relaxation rate is mainly determined by the mean number of transitions in all sequences, kT/2. It does not matter whether there are many short sequences or few long sequences as long as they have the same total duration.
Unlike the relaxation rate, the accuracy of the population in the slow transition region is determined by the sum of the number of transitions, kT/2, and the number of independent sequences, M, as follows from Eq. (6b). In the limit of very slow transitions, the standard deviation of the population does not depend on the relaxation rate (see Fig. 2(b) and Eq. (7)). For the slowest transition rate used in the simulations, k = 0.01 ms−1, there are only 10 transitions during all 200 sequences on average so that the error of the relaxation rate is large (>30%). However, the error of the population in this case is 7%. This is because the initial state of the molecule in each photon sequence is random, so that additional information about populations comes from the first photon of the sequence.
The FRET efficiencies in the slow transition region are estimated very accurately, independent of the transition rates (see Eq. (5)). When normalized to , i = 1, 2, the standard deviations of the FRET efficiency in the first and second states are the same and equal to for all FRET efficiencies (see Fig. 2(c)).
Now consider the behavior of the plots in the fast transition region. The standard deviations of all parameters increase as the transition rate increases. As follows from the theory in Sec. VI, the estimates of the FRET efficiencies and the population are coupled in the fast transition region. The mean FRET efficiency, which is a combination of these parameters (), can be determined very accurately even when the transitions are extremely fast. To estimate all three parameters, photon sequences must have a sufficient number of three consecutive photons separated by short times ∼1/k. The probability of such triads is ∝n2/k2, so the standard deviation of the FRET efficiencies and the population is ∝k/n (see Eqs. (8)). On the other hand, the estimation of the relaxation rate requires only pairs of consecutive photons to be correlated and hence the relative standard deviation of the relaxation rate is (see Eq. (8d)).
As follows from Eqs. (8), the standard deviations of the FRET efficiencies and the relaxation rate depend on the difference between the FRET efficiencies as , while the population is even more sensitive to the difference, . As can be seen in Fig. 2, the simulation results agree with these asymptotics (black dashed lines) when the relaxation rate is large and all standard deviations are small. When any of the standard deviations is large, the Gaussian approximation cannot be used to estimate the variance of the parameters. This is why the simulated data in Fig. 2(c) (red and orange triangles for k/n = 2, 5, 10) deviate from the asymptotics.
The results in Eqs. (8) were derived in Sec. VI assuming that the difference between the FRET efficiencies of the states is small. To check how accurate Eqs. (8) are, we also derived the variances in the fast transition region for the special parameter values that were used in the plots in Fig. 2, Eqs. (9), which are applicable for arbitrary separation of the FRET efficiencies (cyan lines in Fig. 2). It turns out that the approximation in Eqs. (8) is pretty accurate and can be used even when the separation between FRET efficiencies is not small.
In the above analysis, all four parameters are unknown. In the exactly solvable model, which we also plot in Figs. 2(a) and 2(b) for the reference, the FRET efficiencies are assumed to be known (0 and 1). In the slow transition region, the estimates of the FRET efficiencies are independent from the estimates of the rates and population, so the plots agree with the exactly solvable model. In the fast transition region, the error of the population in the exactly solvable model is small (see Fig. 2(b), purple line). This is because the FRET efficiencies are known and the population in the exactly solvable model is the same as the average FRET efficiency, which can be estimated very accurately.
Note that in the slow transition region, the standard deviations of both the relaxation rate and the population are independent of photon intensity n. The key parameter here is the number of transitions during all photon sequences rather than the number of photons. In the slow transition region, increasing photon intensity does not improve the accuracy of the transition rates. In contrast, in the fast transition region, the standard deviations depend on the photon intensity as 1/n (the relaxation rate) and (the population and FRET efficiencies). The relaxation rate where the slow transition region changes to the fast transition region (the minimum in the plot in Fig. 2) can be estimated by equating the fast and slow asymptotics in Eqs. (8d) and (6a) (with M ≪ kN/n)
| (10) |
This border between the two regions is intensity-dependent. It moves towards larger relaxation rates as the photon count rate increases.
The above results for the standard deviations allow one to determine the amount of data required to estimate parameters to a given accuracy. For example, when the transitions between the two states are slow, one needs the trajectories with at least 100 transitions between the states in order to estimate the transition rates with the accuracy 10%. When the transition rate is fast and comparable to the photon count rate (k ∼ n), one needs at least ∼1 560 000 photons to estimate all four parameters with the accuracy 10% for the molecule with equally populated states and and . However, when and , the amount of required photons reduces to ∼25 000. It should be noted that these estimates of the required data are for the ideal case when the model is known and accurate. In reality, there are other factors, such as fluorophore blinking or submicrosecond chain dynamics, that lead to the deviation from the assumed two-state kinetic model and result in additional errors. The effect of these factors on the accuracy of the ML estimates is beyond the scope of this paper.
In summary, the analysis of the variances of the ML estimates shows that there are two regions of parameter space where the accuracy of these estimates is qualitatively different. In the slow transition region, there are two key parameters, i.e., the number of transitions that occur during all photon sequences and the number of independent photon sequences. The error of the transition rate is determined by the number of transitions (see Eq. (6a)), whereas the error of the population, Eq. (6b), is determined by both parameters. The error of the FRET efficiencies in the slow transition region is small and depends on the number of photons emitted by the corresponding states in all photon sequences (see Eq. (5)). In the fast transition region, the accuracy of the relaxation rate is determined by the number of correlated pairs of photons, Eq. (8d), whereas the accuracy of the population and FRET efficiencies are determined by the number of correlated triplets of photons, Eqs. (8a)–(8c).
The rest of the paper deals with the technical aspects of the derivation of the results in the paper.
IV. EXACTLY SOLVABLE MODEL
In this section, we consider the special case when the FRET efficiencies in the two states are 0 and 1 and provide the details of the derivation of the parameter variances. The estimated parameters are just the relaxation rate and the population, since the FRET efficiencies are assumed to be known. In this case, the state of the molecule is known at the time the photon is detected, i.e., if the photon is green (i.e., from the donor), it has been emitted by the molecule in the first state with ; red (acceptor) photons correspond to the second state with (see Fig. 1(a)). The color of ith photon, ci, is directly connected to the state of the molecule (donor → state 1, acceptor → state 2), therefore, in this particular case, ci denotes both the color of the photon and the state of the molecule. The product of matrices in the likelihood function, Eq. (1), becomes the product of scalar functions (because matrices F(ci) have only one non-zero element)
| (11) |
Here, G(ci+1, τi+1; ci) depends on the colors of the consecutive photons, ci and ci+1, and the time between ith and (i + 1)th photon, τi+1; the product is over all photons; pin is the population of the state corresponding to the first photon in the trajectory (i.e., p1 when the first photon is green and p2 when it is red). The conditional transition probability G(α, τ; β) (α, β = 1, 2) is the probability to be in state α at time τ provided that initially the molecule was in state β
| (12) |
where k = k1 + k2 is the two-state relaxation rate, p1 = k2/(k1 + k2) = 1 − p2, δαβ is the Kronecker delta (δαβ is 1 when α and β are equal and 0 otherwise). The last equality is obtained by evaluating the matrix exponential, exp(Kτ), with K from Eq. (2).
The likelihoods Lj of various photon sequences are multiplied together, so that the log-likelihood of M sequences is
| (13) |
where M1 and M2 = M − M1 are the numbers of the trajectories that start with donor and acceptor photon, respectively. The sum over i is performed over photons in all sequences. This log-likelihood depends on the observed interphoton times τi and photon colors ci. The log-likelihood is a function of two variables, the population p1 and the relaxation rate k.
The parameters and their errors are found by optimizing numerically the log-likelihood and evaluating the curvature of the likelihood function at the maximum. To obtain the errors analytically in the limit of large photon counts, we regroup the terms in Eq. (13) by color and replace the summation by an integration over the distribution of interphoton times. For example, the summation with respect to the times between two acceptor consecutive photons is replaced by
| (14) |
where N is the total number of photons in all photon trajectories and n is the total count rate, which is the same in both states. Here, nexp(−nt) is the probability density that two consecutive photons are separated by time t and G22(t) p2 is the probability that both these photons are red (acceptor) and thus were emitted by the molecule in state 2. (More accurately, the factor in the above equation is N − M instead of N, which accounts for the last photon in the trajectory, however, since we assume N ≫ M, this correction can be neglected.)
Similar replacements are performed for the sums with other colors, so that, for any function ϕ(ci+1, τi+1; ci) ≡ ϕci+1,ci(τi+1) that depends on colors ci
| (15) |
Using this in Eq. (13), we get the log-likelihood function
| (16) |
Here, we used the fact that, in the limit of large number of photons, the number of photon sequences starting with the donor (acceptor) photon is M1 = p1M (M2 = p2M). The primes in indicate the dependence on the parameters that are being varied, k′ and . These differ from the true parameters k and p1.
The estimates for the population and relaxation rate are found by maximizing the log-likelihood with respect to v1 = k′ and . For large numbers of photons, these estimates converge to the true values. The variances are found from the curvature of the log-likelihood at the maximum. The curvature is determined by the Hessian matrix H (the Fisher information matrix) with the elements [H]ij = − ∂vi∂vjlnL. The inverse of the Hessian matrix results in the covariance matrix in the limit of large number of photons, which determines the errors of the extracted parameters
| (17) |
The first derivatives of the log-likelihood in Eq. (16) are zero when k′ = k and , as expected. This can verified by differentiating Eq. (16) and using and .
Calculation of the second derivative can be simplified using
The first term in the right-hand side vanishes after removing the primes (i.e., setting k′ = k and ) and summing using . The second term contributes to the second derivative of Eq. (16) at the maximum
| (18) |
Using the expressions for Gαβ in Eq. (12) and replacing z = kt, we find that the elements of the Hessian matrix are
| (19a) |
| (19b) |
| (19c) |
The above integrals can be expressed in terms of special functions (Lerch’s transcendent, the Hurwitz zeta function, and the hypergeometric function). The covariance matrix is the inverse of the Hessian matrix (see Eq. (17)). When the two states are equally populated, the off-diagonal terms vanish. The variances of the relaxation rate and populations in this case are the reciprocals of Eqs. (19a) and (19b), which can be shown to be given by Eqs. (3) and (4).
V. SLOW TRANSITIONS
In this section, we consider photon sequences in the limit when the transitions between the states are slow. All four parameters (the relaxation rate, population, and FRET efficiencies) are assumed to be unknown. An observed sequence of photons may result from different state trajectories. However, when transitions between the states are rare and there are enough photons so that the identity of a state can be determined before the state changes, there is a single (most probable) state trajectory defined from the observed photons, and all other state trajectories are very unlikely. In this section, we study such a case assuming that the state trajectory can be readily obtained from the observed photon trajectory. Once the state trajectory has been found, then the new observables are the numbers of photons emitted by each state, the number of transitions between the states, the time spent in each state, and the initial state for each trajectory (when there are many independent photon sequences).
The corresponding likelihood function of M photon trajectories is
| (20) |
where NAi, NDi are the total numbers of the acceptor and donor photons in state i (i = 1, 2), Ti is the total time spent in state i in all trajectories, N1→2 and N2→1 are the numbers of 1 → 2 and 2 → 1 transitions, respectively; Mi is the number of photon trajectories that start from state i. In this equation, is the probability that M1 photon sequences start with the molecule in the first state and M2 = M − M1 sequences start with the molecule is in second state. exp(−k1T1 − k2T2) is the probability that there are no transitions during times T1 and T2. Since the probability of one 1 → 2 transition during short time Δt is k1Δt, N1→2 transitions result in the factor in the above likelihood. Similarly, corresponds to the 2 → 1 transitions. Finally, we note that and are the probabilities that a photon emitted by the molecule in state i (=1, 2) is red (acceptor) and green (donor), respectively. Therefore, NAi acceptor and NDi donor photons result in the factor .
The log of the above likelihood is
| (21) |
When there are just few sequences, the last two terms in the above equation can be neglected. In this case, the parameters , , k1, and k2 are independent. The most likely FRET efficiencies are found by setting the first derivatives with respect to to zero, resulting in the estimates , i = 1, 2. In the limit of large photon counts, , , where N is the total number of photons, so that the maximum likelihood estimates converge to the true parameters. The errors of the FRET efficiencies are obtained from the second derivatives, . In the limit of large photons, this becomes .
Similarly, most likely rates are estimated as ki = Ni→j/Ti and the corresponding variances are . In the limit of large number of photons, Ti = piT, N1→2 = N2→1 = p1p2kT, so that the ML estimates of ki reduce to the true values and .
When the number of photon sequences is large so the last terms in Eq. (21) become significant, are still independent, but k1 and k2 are coupled. We consider the estimates of the relaxation rate k and the population p1 in this case. Replacing k1 = p2k and k2 = p1k and rearranging the terms, we get from Eq. (21)
| (22) |
The estimation of the FRET efficiencies is the same as before, the variances of the FRET efficiencies result in Eq. (5). To find the ML estimates of the relaxation rate and the population, we set the first derivatives of the above equation with respect to k and p1 to zero. One can verify that these estimates coincide with the true values in the limit of large number of photons. The variances of the parameters v1 = k and v2 = p1 are obtained by evaluating the second derivatives at the maximum. In this way, we find that the elements of the Hessian matrix [H]ij = − ∂vi∂vjlnL are
| (23) |
Inverting the Hessian matrix and replacing Ti = piT, Mi = piM, (i = 1, 2), and N1→2 = N2→1 = p1p2kT, we get the variance and covariance in the limit of large number of photons given in Eq. (6).
VI. FAST TRANSITIONS
In this section, we consider the case when the conformational relaxation rate is much larger than the photon count rate. The number of independent photon trajectories does not matter in this limit, so we consider just one trajectory. When the transitions between the states are infinitely fast, the correlation between any two photons is lost, so that the likelihood function in Eq. (1) converges to
| (24) |
where
| (25) |
Here, ci is the color of ith photon (donor or acceptor), , , is the mean FRET efficiency. The product in Eq. (24) is over all photons, NA and ND = N − NA are the total numbers of acceptor and donor photons, respectively. In this likelihood function, there is no information about the relaxation rate. Only the mean FRET efficiency can be accurately determined with the variance .
When the transition rates are large but finite, only photons separated by short interphoton times (∼1/k) are correlated. They determine the dependence of the likelihood on the rates. Assume that no more than two consecutive photons are correlated. Then, the likelihood function in Eq. (1) can be approximated as
| (26) |
where the product is over all photons and
| (27) |
For example, for two consecutive acceptor photons separated by time τ, ci = ci+1 = A and , where . This function is equal to the auto-correlation function of the FRET efficiency when the system jumps between and .
When the interphoton times are longer than the times between transitions, g(ci+1, τi+1; ci)/g(ci+1) → g(ci), and thus Eq. (26) reduces to Eq. (24). In the special case, when the FRET efficiency of the first and second states are 0 and 1, respectively, g(D) = p1, g(A) = p2, and g(ci+1, τi+1; ci)/g(ci+1) = G(ci+1, τi+1; ci), where G(α, τ; β) is the Green’s function defined in Eq. (12). The likelihood function in Eq. (26) in this case coincides with Eq. (11) and is, therefore, exact.
The above approximation scheme can be extended to include correlation between three consecutive photons
| (28) |
where g(ci+2, τi+2; ci+1, τi+1; ci) involves three consecutive photons
| (29) |
If any one of the τ’s is long compared to the time between transitions (kτ ≫ 1), the corresponding factor in the product in Eq. (28) becomes equal to that in Eqs. (24) and (26). For example, when τi+2 → ∞, g(ci+2, τi+2; ci+1, τi+1; ci)/g(ci+2, τi+2; ci+1) → g(ci+1, τi+1; ci)/g(ci+1). Only when all three photons are bunched together is there a difference between Eqs. (28) and (26).
We now take the log of Eq. (28), regroup the terms in the sums by color (e.g., the sum of lng(ci+1, τi,i+1; ci) has four terms, corresponding to the acceptor-acceptor, donor-donor, acceptor-donor, and donor-acceptor photons), and replace the sums by the averages. For any functions, ϕ(ci+1, τi+1; ci) ≡ ϕci+1,ci(τi+1) and ϕ(ci+2, τi+2; ci+1, τi+1; ci) ≡ ϕci+2,ci+1,ci(τi+2, τi+1) that depend on colors
| (30) |
where the indexes α, β, γ indicate color (donor, D, or acceptor, A) gαβ(t) = g(α, t; β) and gαβγ(t1, t2) = g(α, t1; β, t2; γ). The first equation here is similar to Eq. (15), except that Gαβ(t) pβ is replaced by a more general probability gαβ(t) that two photons separated by time t have colors α and β. In the second equation, three consecutive photons are involved in the averaging; gαβγ(t1, t2) is the probability that these photons have colors α, β, and γ and n2exp(−n(t1 + t2)) is the probability density that the times between the photons are t1 and t2.
Applying the averaging in Eq. (30) to the log of Eq. (28), we have
| (31) |
Here, the primes in and indicate that these functions depend on the parameters that will be found by optimizing the log-likelihood (the variables of the likelihood function) as opposed to the parameters that are given (or used to simulate photon sequences). The sums in the above expressions are over all color combinations. For example, the second sum includes four terms. Explicitly, the functions gαβ(t) and gαβγ(t) in the above equations can be written as
| (32) |
where gα = g(α), which is equal to , when α is acceptor, or , when α is donor, Sα is the sign function (SA = 1 and SD = − 1), is the FRET efficiency variance, and , xD = 1 − xA.
The log-likelihood in Eq. (31) is our approximation in the case of fast transitions. In the limit of very fast transitions, , which corresponds to the likelihood in Eq. (24) where all photons are uncorrelated. It involves only the mean FRET efficiency, which can be determined very accurately. Other parameters cannot be determined from the likelihood in Eq. (24). The approximation in Eq. (26) is the correction to the limit of uncorrelated photons. It involves the FRET efficiency variance and the relaxation rate k. To determine all four parameters, we have to consider the approximation in Eq. (28) (and (31)), which involves three bunched photons.
The parameters we are interested in are , , u3 = p1, and u4 = k. Explicit expressions in Eq. (32) show that more convenient parameters are , v2 = k, , and . To get the variances of the ML estimates of ui, we evaluate the second derivatives with respect to vi, convert them into the derivatives with respect to ui, and invert the resulting Hessian matrix. The derivatives with respect to ui are (the first derivatives of lnL are zero at the maximum). In matrix notation, the Hessian matrixes Hv (with the elements [Hv]ij = − ∂vi∂vjlnL) and Hu (with the elements [Hu]ij = − ∂ui∂ujlnL) are related by
| (33) |
where ∂uv is the matrix with elements ∂vi/∂uj
| (34) |
where . The variances are the diagonal terms of .
The first derivative of Eq. (31) with respect to a parameter v leads to the expression that contains in the first term and a similar expression in the second term. When the parameters coincide with the true ones, g′ = g and the derivative is zero as it should be since the sum of over all colors is 1. In the second derivative, we use
and the fact that in the maximum g′ = g, so that the term with the second derivative disappears after summation over the colors. Therefore, the derivative with respect to parameters vi and vj is
| (35) |
We now will evaluate these derivatives to lowest order in n/k. Consider the second derivative with respect to . The terms in gαβγ in Eq. (32) that contain can be neglected because they result in the terms ∝n/k, which are small. Therefore,
| (36) |
where we used and (dgα/dv1)2 = 1. This term does not depend on n/k.
The derivatives with respect to v2 and v3 are found by collecting the leading terms ∝n/k; therefore, the terms containing exp(−k(t1 + t2)) can be neglected since they result in the next order terms. When these terms are omitted, it follows from Eq. (32) that
| (37) |
For example, using the above approximations in Eq. (35), we get the second derivative of lnL with respect to v2
| (38) |
Here, the cross term in the first integrand is ∝(dgαβ(t1)/dv2)(dgβγ(t2)/dv2) ∝ exp(−k(t1 + t2)), and therefore is ne glected. In the remaining terms, we performed the integration with respect to one of the t’s by replacing by the leading term 1/gμgν.
Performing similar calculations, we get the leading terms (with respect to n/k) of the derivatives
| (39a) |
| (39b) |
| (39c) |
| (39d) |
where gαβ and gαβγ are given in Eq. (32), with the replacement kt → z
| (40) |
and with the replacements kt1 → z1, kt2 → z2.
These calculations show that [Hv]22, [Hv]33, and [Hv]23 are expressed in terms of the functions for two consecutive photons, gαβ, and are of the order of n/k which is considered here as a small parameter. [Hv]44 is expressed in terms of the function for three consecutive photons, gαβγ, and is . Similar reasonings show that all [Hv]i4 (i = 1, 2, 3, 4) are quadratic in n/k, . All other terms are linear in n/k, except [Hv]11, which is not small (). Using this in Eq. (33) and collecting the leading terms in n/k, we get
| (41) |
Thus, the variance of k is ∝k/n and is given in terms of [Hv]22, [Hv]33, and [Hv]23. These elements are the derivatives of the likelihood function with respect to k and , which characterize the two-photon function gαβ(t). The variances of , , and p1 are ∝k2/n2 and all are expressed in terms of a single element, [Hv]44. This element is the derivative of the likelihood function with respect to xA, which only enters into the three-photon function gαβγ(t1, t2).
A. Small variance limit
The above errors can be simply expressed when the FRET efficiency variance is small. In this case, and . Using these in Eq. (39) (with n/k in the exponent set to 0), we get
| (42) |
Using these in Eq. (41), we get the equations given above in Eq. (8).
B. Special case of equal populations
In the special case used in the simulations in Fig. 2, p1 = p2 = 1/2 and , so that and . In this case, in Eq. (40) becomes . The integrals in Eqs. (39a), (39b), and (39c) can be evaluated using
| (43) |
where is the polylogarithm function. In this way, we get the elements [Hv]22, [Hv]33, and [Hv]23, which, after using in Eq. (41), result in the variance of the relaxation rate in Eq. (9c).
The calculation of [Hv]44 is more complex. Using Mathematica (Wolfram Research, Inc.) in evaluating the integral in Eq. (39d), we can get
| (44) |
The variances of FRET efficiencies and population are and , according to Eq. (41). This results in Eqs. (9a) and (9b) given in Sec. II.
Acknowledgments
I thank A. Szabo, H. S. Chung, and S. Pressé for helpful discussions and very useful comments. This work was supported by the Intramural Research Program of the National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK), National Institutes of Health.
REFERENCES
- 1.Schröder G. F. and Grubmüller H., J. Chem. Phys. 119, 9920 (2003). 10.1063/1.1616511 [DOI] [Google Scholar]
- 2.Andrec M., Levy R. M., and Talaga D. S., J. Phys. Chem. A 107, 7454 (2003). 10.1021/jp035514+ [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.McKinney S. A., Joo C., and Ha T., Biophys. J. 91, 1941 (2006). 10.1529/biophysj.106.082487 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Watkins L. P., Chang H., and Yang H., J. Phys. Chem. A 110, 5191 (2006). 10.1021/jp055886d [DOI] [PubMed] [Google Scholar]
- 5.Chung H. S., Louis J. M., and Eaton W. A., Proc. Natl. Acad. Sci. U.S.A. 106, 11837 (2009). 10.1073/pnas.0901178106 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Pirchi M., Ziv G., Riven I., Cohen S. S., Zohar N., Barak Y., and Haran G., Nat. Commun. 2, 493 (2011). 10.1038/ncomms1504 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Liu Y., Park J., Dahmen K., Chemla Y., and Ha T., J. Phys. Chem. B 114, 5386 (2010). 10.1021/jp9057669 [DOI] [PubMed] [Google Scholar]
- 8.Keller B. G., Kobitski A., Jas¨chke A., Nienhaus G. U., and Noé F., J. Amer. Chem. Soc. 136, 4534 (2014). 10.1021/ja4098719 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Gopich I. V., Chem. Phys. 396, 53 (2012). 10.1016/j.chemphys.2011.06.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Kou S., Xie X., and Liu J., J. R. Stat. Soc.: Ser. C (Appl. Stat.) 54, 469 (2005). 10.1111/j.1467-9876.2005.00509.x [DOI] [Google Scholar]
- 11.Jäger M., Kiel A., Herten D., and Hamprecht F., ChemPhysChem. 10, 2486 (2009). 10.1002/cphc.200900331 [DOI] [PubMed] [Google Scholar]
- 12.Gopich I. V. and Szabo A., J. Phys. Chem. B 113, 10965 (2009). 10.1021/jp903671p [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Hajdziona M. and Molski A., J. Chem. Phys. 134, 054112 (2011). 10.1063/1.3544494 [DOI] [PubMed] [Google Scholar]
- 14.Waligórska M. and Molski A., Chem. Phys. 403, 52 (2012). 10.1016/j.chemphys.2012.05.009 [DOI] [Google Scholar]
- 15.Watkins L. P. and Yang H., J. Phys. Chem. B 109, 617 (2005). 10.1021/jp0467548 [DOI] [PubMed] [Google Scholar]
- 16.Haas K. R., Yang H., and Chu J. W., J. Phys. Chem. B 117, 15591 (2013). 10.1021/jp405983d [DOI] [PubMed] [Google Scholar]
- 17.Chung H. S., Gopich I. V., McHale K., Cellmer T., Louis J. M., and Eaton W. A., J. Phys. Chem. A 115, 3642 (2011). 10.1021/jp1009669 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Chung H. S., Cellmer T., Louis J. M., and Eaton W. A., Chem. Phys. 422, 229 (2013). 10.1016/j.chemphys.2012.08.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Chung H. S., McHale K., Louis J. M., and Eaton W. A., Science 335, 981 (2012). 10.1126/science.1215768 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Chung H. S. and Eaton W. A., Nature 502, 685 (2013). 10.1038/nature12649 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Watkins L. P. and Yang H., Biophys. J. 86, 4015 (2004). 10.1529/biophysj.103.037739 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Kawai S., Cooper D., Landes C., Mootz H. D., Yang H., and Komatsuzaki T., J. Phys. Chem. B 117, 8061 (2013). 10.1021/jp402328m [DOI] [PubMed] [Google Scholar]
- 23.Chung H. S. and Gopich I. V., Phys. Chem. Chem. Phys. 16, 18644 (2014). 10.1039/c4cp02489c [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Gopich I. V. and Szabo A. , in Single-Molecule Biophysics: Experiment and Theory, Advances in Chemical Physics, edited by Komatsuzaki T., Kawakami M., Takahashi S., Yang H., and Silbey R. J. (Wiley, New York, 2012), Vol. 146, pp. 245–297 10.1002/9781118131374 [DOI] [Google Scholar]
- 25.Deniz A., Laurence T., Dahan M., Chemla D., Schultz P., and Weiss S., Annu. Rev. Phys. Chem. 52, 233 (2001). 10.1146/annurev.physchem.52.1.233 [DOI] [PubMed] [Google Scholar]
- 26.Schuler B., Lipman E. A., and Eaton W. A., Nature 419, 743 (2002). 10.1038/nature01060 [DOI] [PubMed] [Google Scholar]
- 27.Sisamakis E., Valeri A., Kalinin S., Rothwell P. J., and Seidel C. A. M., Methods Enzymol. 475, 455 (2010). 10.1016/s0076-6879(10)75018-7 [DOI] [PubMed] [Google Scholar]
- 28.Van den Bos A., Parameter Estimation for Scientists and Engineers (Wiley, 2007). [Google Scholar]
- 29.Haas K. R., Yang H., and Chu J. -W., J. Chem. Phys. 139, 121931 (2013). 10.1063/1.4820491 [DOI] [PubMed] [Google Scholar]
- 30.Dahan M., Deniz A. A., Ha T., Chemla D. S., Schultz P. G., and Weiss S., Chem. Phys. 247, 85 (1999). 10.1016/s0301-0104(99)00132-9 [DOI] [Google Scholar]

