Skip to main content
Biophysical Reports logoLink to Biophysical Reports
. 2022 Nov 25;2(4):100088. doi: 10.1016/j.bpr.2022.100088

Single-photon smFRET. III. Application to pulsed illumination

Matthew Safar 1,2, Ayush Saurabh 1,3, Bidyut Sarkar 4,5, Mohamadreza Fazel 1,3, Kunihiko Ishii 4,5, Tahei Tahara 4,5, Ioannis Sgouralis 6, Steve Pressé 1,3,7,
PMCID: PMC9747580  PMID: 36530182

Abstract

Förster resonance energy transfer (FRET) using pulsed illumination has been pivotal in leveraging lifetime information in FRET analysis. However, there remain major challenges in quantitative single-photon, single-molecule FRET (smFRET) data analysis under pulsed illumination including 1) simultaneously deducing kinetics and number of system states; 2) providing uncertainties over estimates, particularly uncertainty over the number of system states; and 3) taking into account detector noise sources such as cross talk and the instrument response function contributing to uncertainty; in addition to 4) other experimental noise sources such as background. Here, we implement the Bayesian nonparametric framework described in the first companion article that addresses all aforementioned issues in smFRET data analysis specialized for the case of pulsed illumination. Furthermore, we apply our method to both synthetic as well as experimental data acquired using Holliday junctions.

Why it matters?

In the first companion article of this series, we developed new methods to analyze noisy single-molecule Förster resonance energy transfer data. These methods eliminate the requirement of a priori specifying the dimensionality of the physical model describing a molecular complex’s kinetics. Here, we apply these methods to experimentally obtained datasets with samples illuminated by laser pulses at regular time intervals. In particular, we study conformational dynamics of Holliday junctions.

Introduction

Among the many fluorescence methods available (1,2,3,4,5,6,7), single-molecule Förster resonance energy transfer (smFRET) has been useful in probing interactions and conformational changes on nanometer scales (8,9,10,11,12). This is typically achieved by estimating FRET efficiencies (and system states) at all instants of an smFRET trace and subsequently estimating transition rates. Furthermore, among different FRET modalities, FRET efficiencies are most accurately determined under pulsed illumination (13,14,15), where the FRET dyes are illuminated by short laser bursts at known times.

Under this illumination procedure, photon arrival times are recorded with respect to the immediately preceding pulse, thereby facilitating an accurate estimation of fluorescence lifetimes as well as FRET rates. As such, in this article, we will focus on single-photon smFRET analysis under pulsed illumination.

Under pulsed illumination, information on kinetic parameters present in smFRET data is traditionally learned by binned photon methods, thereby eliminating lifetime information altogether (16,17,18); bulk correlative methods (19,20,21); and single-photon methods (14,22,23). However, these methods are parametric, i.e., require fixing the number of system states a priori, and necessarily only learn system kinetics even though information on the number of system states is encoded in the data.

In this article, we implement a general smFRET analysis framework that was presented in Sec. 2.5.1 of the first companion manuscript (24) for the case of pulsed illumination to learn full distributions. In other words, probability distributions over parameters take into account uncertainties from all existing sources such as cross talk and background. These parameters include the system transition probabilities and photophysical rates, that is, donor and acceptor relaxation and FRET rates, with special attention paid to uncertainty arising from sources such as inherent stochasticity in photon arrival times and detectors. As our main concern is deducing the number of system states using single-photon arrivals while incorporating detector effects, we leverage the formalism of infinite hidden Markov models (iHMMs) (25,26,27,28,29,30) within the Bayesian nonparametric (BNP) paradigm (25,26,31,32,33,34,35,36,37,38). The iHMM framework assumes an a priori infinite number of system states with associated transition probabilities, where the number of system states warranted by input data is enumerated by those states most visited over the course of the system state trajectory.

Next, to benchmark our BNP-FRET sampler, we analyzed synthetic and experimental smFRET data acquired using a single confocal microscope with pulsed illumination optimized to excite donor dyes.

In particular, we employ a broad range of experimental data acquired from Holliday junctions (HJs) with an array of different kinetic rates due to varying buffer concentration of MgCl2 (39,40,41,42).

Materials and methods

Terminology convention

To be consistent throughout our three-part article, we precisely define some terms as follows.

  • 1. A macromolecular complex under study is always referred to as a system.

  • 2. The configurations through which a system transitions are termed system states, typically labeled using σ.

  • 3. FRET dyes undergo quantum mechanical transitions between photophysical states, typically labeled using ψ.

  • 4. A system-FRET combination is always referred to as a composite.

  • 5. A composite undergoes transitions among its superstates, typically labeled using φ.

  • 6. All transition rates are typically labeled using λ.

  • 7. The symbol N is generally used to represent the total number of discretized time windows, typically labeled with n.

  • 8. The symbol wn is generally used to represent the observations in the n-th time window.

Forward model and inverse strategy

In this section, we first briefly illustrate the adaptation of the general formalism described in our first companion article (24) to the pulsed illumination case. Next, we present a specialized inference procedure. The details of the framework not provided herein can be found in the supporting material.

As before, we consider a molecular complex labeled with a donor-acceptor FRET pair. As the molecular complex transitions through its Mσ system states indexed by σ1:Mσ, laser pulses (optimized to excite the donor) separated by time τ may excite either the donor or acceptor to drive transitions among the photophysical states, ψ1:Mψ, as defined in the first companion article (24). Such photophysical transitions lead to photon emissions that may be detected in either donor or acceptor channels. The set of N observations, e.g., photon arrival times, from N pulses are recorded as

w={w1,w2,,wN}. (1)

Here, each individual measurement is a pair wn=(μnd,μna), where μnd and μna are the recorded arrival times (also known as microtimes) after the n-th pulse in both donor and acceptor channels, respectively. In cases where there is no photon detection, we denote the absent microtimes with μnd= and μna= for donor and acceptor channels, respectively.

As is clear from Fig. 1, smFRET traces are inherently stochastic due to the nature of photon excitation, emission, and noise introduced by detector electronics. To analyze such stochastic systems, we begin with the most generic likelihood derived in Eq. 51 of the first companion article (24),

LρstartQ1QnQNρnormT, (2)

where ρstart is the initial probability vector for the system-FRET composite to be in one of Mϕ=Mσ×Mψ superstates, and ρnorm is a vector that sums the elements of the propagated probability vector. Here, we recall that Qn is the transition probability matrix between pulses at tn and tn+1, characterizing system-FRET composite transitions among superstates.

Figure 1.

Figure 1

Events over a pulsed illumination experiment pulse window. Here, the beginning of the n-th interpulse window of size τ is marked by time tn. The FRET labels originally in state GG (donor and acceptor, respectively, in ground states) are excited by a high intensity burst (shown in green) to the state EG (only donor excited) for a very short time, δpulse. If FRET occurs, the donor transfers its energy to the acceptor and resides in the ground state, leaving the FRET labels in the GE state (only acceptor excited). The acceptor then emits a photon to be registered by the detector at microtime μn. When using ideal detectors, the microtime is the same as the photon emission time as shown in (a). However, when the timing hardware has jitter (shown in red), a small delay εn is added to the microtime as shown in (b). For convenience, we have reproduced this figure from our first companion article (24).

The propagators Qn above adopt different forms depending on whether a photon is detected or not during the associated period. Their most general forms are derived in the section on illumination features in the first companion article (24). However, these propagators involve computationally expensive integrals, and thus we make a few approximations here as follows: 1) we assume that the system state remains the same over an interpulse period since typical system kinetic timescales (typically 1 ms or more) are much longer than interpulse periods (100 ns) (41,43), and 2) the interpulse period (100 ns) is longer than the donor and acceptor lifetimes ( a few ns) (41,43) such that they relax to the ground state before the next pulse. Furthermore, we will demonstrate a specialized sampling scheme under these physically motivated approximations.

The immediate implications of the first assumption are that the system transitions may now, to a good approximation, only occur at the beginning of each pulse. Consequently, the evolution of the FRET pair between two consecutive pulses is now exclusively photophysical, as the system state remains the same during interpulse times. As such, the system now evolves in equally spaced discrete time steps of size τ, where the system state trajectory can be written as

s1:N={s1,s2,,sn,,sN1,sN},

where sn is the system state between pulses n and n+1. The stochastic evolution of the system states in such discrete steps is then determined by the transition probability matrix designated by Πσ. For example, in the simplest case of a molecular complex with two system states σ1:2, this matrix is computed as follows:

Πσ=exp(τ[λσ1σ2λσ2σ2])=[πσ1σ1πσ1σ2πσ2σ1πσ2σ2], (3)

where the matrix in the exponential contains transition rates among the system states and the represents the negative row sum.

Next, by assumption two, we can further suppose that the fluorophores always start in the ground state at the beginning of every pulse. As a result, we treat pulses independently and write the probability of observation wn as

p(wn|sn,Gψ)=ρgroundQnψ(sn)ρnormT, (4)

where ρground denotes the probability vector when the FRET pair is in the ground state at the beginning of each pulse, Gψ is the generator matrix with only photophysical transition rates, and Qnψ(sn) is the photophysical propagator for the n-th interpulse period.

We further organize the observation probabilities of Eq. 4 into a newly defined detection matrix Dnσ with its elements given by (Dnσ)snσj=p(wn|sn,Gψ). Here, we note that the index j does not appear on the right-hand side because the system state does not change during an interpulse window, resulting in the independence of observation probability from the next system state, sn+1. The explicit formulas for the observation probabilities are provided in the supporting material.

Now, using the matrix Dnσ, we define the reduced propagators for each interpulse period as

Πnσ=ΠσDnσ, (5)

where denotes the element-by-element product.

Finally, using these simplified propagators, we can write the likelihood for an smFRET trace under pulsed illumination as

L=p(w|ρstart,Πσ,Gψ)ρstartΠ1σΠ2σΠNσρnormT, (6)

as also introduced in the section on illumination features in the first companion article (24). This form of the likelihood is advantageous in that it allows empty pulses to be computed as a simple product, greatly reducing computational cost.

In the following, we first illustrate a parametric inference procedure assuming a given number of system states. We next generalize the procedure developed to the nonparametric case to deduce the number of system states along with the rest of parameters.

Inference procedure: Parametric sampler

With the likelihood at hand, we construct the posterior as follows

p(ρstart,Πσ,Gψ|w)p(w|ρstart,Πσ,Gψ)p(ρstart)p(Gψ)p(Πσ), (7)

where we assume that the unknown parameters, including the initial probability vector, ρstart, the photophysical transition rates in the generator matrix Gψ, and the transition probabilities among system states in propagator Πσ are independent, allowing us to conveniently write the prior on these parameters as a product (the last three terms on right hand side). Here, we can sample the set of unknowns using the above posterior with the Gibbs sampling procedure described in the first companion article (see the section describing inverse strategy in (24)). However, a computationally more convenient inference procedure that allows direct sampling is accomplished by writing the posterior of Eq. 7 as a marginalization (sum) over state trajectories as follows

p(ρstart,Πσ,Gψ|w)=s1:Np(ρstart,Πσ,Gψ,s1:N|w)s1:Np(w|Πσ,Gψ,s1:N)p(ρstart)p(Gψ)p(Πσ)p(s1:N|ρstart,Πσ), (8)

where s1:N={s1,s2,sN} denotes a system state trajectory. Now, we can use the nonmarginal posterior

p(ρstart,Πσ,Gψ,s1:N|w)p(w|Πσ,Gψ,s1:N)p(ρstart)p(Gψ)p(Πσ)p(s1:N|ρstart,Πσ) (9)

to sample the trajectory s1:N, which, in turn, allows direct sampling of the elements of propagator Πσ described shortly. For priors on ρstart and rates in Gψ, we, respectively, use Dirichlet and Gamma distributions similar to Eqs. 65 and 66 of the first companion article (24). We sample the system state trajectory s1:N by recursively sampling the states using a forward filtering backward sampling algorithm described in section S4.3.

Finally, for each row in the propagator Πσ, we use a Dirichlet prior

πmDirichlet(αβ),m=1,2,,Mσ, (10)

where Mσ is the number of system states and πm denotes the m-th row of the propagator. Here, the hyperparameters α and β are, respectively, the concentration parameter and a vector of length Mσ described in the first companion article (see Section 3.2.2 of (24)). We can now directly generate samples for the transition probability vectors πm of length Mσ via prior-likelihood conjugacy as (see section S4.3)

πmDirichlet(nm+αβ),m=1,2,,Mσmax,

where the vector nm collects the number of times each transition out of system state σm occurs obtained using the system state trajectory.

After constructing the posterior, we can make inferences on the parameters by drawing samples from the posterior. However, as the resulting posterior has a nonanalytical form, it cannot be directly sampled. Therefore, we develop a Markov chain Monte Carlo sampling procedure (37,38,44,45,46,47) to draw samples from the posterior.

Our Markov chain Monte Carlo sampling scheme follows a Gibbs sampling technique, sweeping through updates of the set of parameters in the following order: 1) photophysical transition rates including donor relaxation rates λd (inverse of donor lifetime), acceptor relaxation rate λa (inverse of acceptor lifetime), FRET rates λσ1:MσFRET for each system state, and excitation rate (inverse of excitation probability πex) using the Metropolis-Hastings(MH) step; 2) transition probabilities between system states, π1:Mσ, by directly drawing samples from the posterior; 3) the system states trajectory, S, using a forward-backward sampling procedure (48); and 4) the initial probabilities, ρstart, by taking direct samples. In the end, the chains of samples drawn can be used for subsequent numerical analysis.

Inference procedure: Nonparametrics sampler

The smFRET data analysis method illustrated above assumes a given number of system states, Mσ. However, in many applications, the number of system states is not specified a priori. Here, we describe a generalization of our parametric method to address this shortcoming and estimate the number of system states simultaneously along with other unknown parameters.

We accomplish this by modifying our previously introduced parametric posterior as follows. First, we suppose an infinite number of system states (Mσ) for the likelihood introduced previously and learn the transition matrix Πσ. The number of system states can then be interpreted as those appreciably visited over the course of the trajectory.

To incorporate this infinite system state space into our inference strategy, we leverage the iHMMs (25,26,28,29,30) from the BNP repertoire, placing a hierarchical Dirichlet process prior over the infinite set of system states as described in the first companion article (the inverse strategy section in (24)). However, as detailed in the first companion manuscript (the inverse strategy section in (24)), dealing with an infinite number of random variables, though feasible, is not computationally efficient, and we approximate this infinite value with a large number, Mσmax, reducing our hierarchical Dirichlet process prior to

βDirichlet(γMσmax,,γMσmax),πmDirichlet(αβ),m=1,,Mσmax.

Here, β denotes the base probability vector of length Mσmax serving itself as a prior on the probability transition matrix Πσ, and πm is the m-th row of Πσ. Moreover, γ is a positive scalar hyperparameter of the Dirichlet process prior often chosen to be one. As such, we ascribe identical weights across the state space a priori for computational convenience (28,29,49).

Now, equipped with the nonparametric posterior, we proceed to simultaneously make inferences on transition probabilities, excited-state escape rates, and the remaining parameters. To do so, we employ the Gibbs sampling scheme detailed in the inverse strategy section in the first companion article (24), except that we must now also sample the system state trajectory s1:N. More details on the overall sampling scheme are found in section S4 of the supporting material.

Results

The main objective of our method is to learn full distributions over 1) transition probabilities among Mσmax system states determining, in turn, the corresponding system transition rates and the effective number of system states, and 2) photophysical transition rates, including FRET rates λ1:MFRET, and fluorophores’ relaxation rates (inverse of lifetimes) λa and λd.

To sample from distributions over these parameters, the BNP-FRET sampler requires input data comprised of photon arrival time traces from both donor and acceptor channels as well as a set of precalibrated input parameters including camera effects such as cross talk matrix and detection efficiency (see Sec. 2.4 and example V of the first companion article (24)); background emission (see the section on background in the first companion article and Section S2.4); and the instrument response function (IRF) (see illumination features sectoin in the first companion article (24) and Section S2.3).

Here, we first show that our method samples posteriors over a set of parameters employing realistic synthetic data generated using the Gillespie algorithm (50) to simulate system and photophysical transitions while incorporating detector artefacts such as crosstalk (see the synthetic data generation section in the first companion article (24)). The list of parameters used in data generation for all figures is provided in Section S6. Furthermore, prior hyperparameters used in the analysis of synthetic and experimental data are listed in Section S3.

We first show that our method works for the simplest case of slow transitions compared with the interpulse period (25 ns) with two system states using synthetic data (see Fig. 2). Next, we proceed to tackle more challenging synthetic data with three system states and higher transition rates (Fig. 3). We show that our nonparametric algorithm correctly infers system transition probabilities and thus the number of system states (see Fig. 3).

Figure 2.

Figure 2

Analysis on synthetic data for a system with two system states. In (a), we show a section of synthetic data produced with the values in Table S2. Furthermore, the system state trajectory is shown in blue. Below this, the arrival times of donor and acceptor photons μnd and μna are shown in green and red, respectively. In (b), we plot the bivariate distribution over escape rates and FRET efficiencies. The ground truth is shown with red dots corresponding to an escape rate of 40 s−1 and FRET efficiencies of 0.22 and 0.59. λescεFRET. As seen, the BNP-FRET sampler clearly distinguishes two system states with maximum a posteriori (MAP) estimates for the associated escape rates of 387+7 and 407+7 s−1 and for FRET efficiencies of 0.210.03+0.03 and 0.590.03+0.03. We have smoothed the distributions using kernel density estimation for illustration purposes only.

Figure 3.

Figure 3

Analysis on synthetic data for three system states. In (a), we have a section of synthetic data produced with the values from Table S3. The system state trajectory is seen in blue. Below this, the arrival times of donor and acceptor photons μnd and μna are shown in green and red, respectively. In (b), we plot the distribution over escape rates and FRET efficiencies εFRET. The red dots show ground truths corresponding to escape rates of 1,200, 2,400, and 1,200 s−1 and FRET efficiencies of 0.22, 0.53, and 0.7. From our maximum a posteriori (MAP) estimate, λesc εFRET we clearly see three system states with escape rates of 1,10060+60, 2,300128+131, and 1,05080+80 s−1.

After demonstrating the performance of our method using synthetic data, we use experimental data to investigate the kinetics of HJs under different MgCl2 concentrations in buffer (see Fig. 4).

Figure 4.

Figure 4

The bivariate posterior for the conformational transition rates λesc and FRET efficiencies εFRET for experimental data acquired in the presence of different HJ concentrations. Here, we show our bivariate posteriors where red dots show MAP estimates. In (a), we show the posterior for a sample with 1 mMMgCl2. We report escape rates of 1,530550+500 and 1,240420+420 s−1 in this case. The posterior for a sample with 3 mMMgCl2 is shown in (b). We report escape rates of 14038+38 and 14232+32 s−1 for this case. In (c), we show our posterior for a sample with 5 mMMgCl2. Here, we report escape rates of 649+9 and 8010+10 s−1. The posterior in (d) is for a sample with 10 mMMgCl2. We report escape rates of 3912+17 and 4112+23 s−1.

Simulated data analysis

To help validate BNPs on smFRET single-photon data, we start with a simple case of a two-state system and select kinetics similar to those of the experimental data sets, c.f., the HJ in 10 mm MgCl2, with escape rates of 40 s−1 for both system states (51). The generated system state trajectory and photon traces over a period of 500 ms from both channels are shown in Fig. 2 a.

Fig. 2b shows the bivariate posterior distribution over FRET efficiencies, εFRET, defined as εFRET=λFRET/(λFRET+λd), and system escape rates, i.e., obtained by computing the logarithm of the propagator matrix, with two peaks corresponding to the two system states most visited by the sampler. Furthermore, the ground truths, designated by red dots, fall within the posterior with a relative error of less than 3% from the posterior modes. The results for the remaining parameters, including donor and acceptor transition rates, FRET transition rates, and system transition probabilities, are presented in Section S7.

To showcase the critical role played by BNPs, we also consider the more difficult case of a sample with three system states and faster system state kinetics ranging over 1,200–2,500 s1. We do so by simulating photon traces in both donor and acceptor channels over a period of ∼150 ms. A 50 ms section of the synthetic photon trace is shown in Fig 3 a.

Using direct photon arrivals from the generated photon trace, we find that the most probable system state trajectories sampled by BNP-FRET visit the correct number of system states, as shown in Fig 3 b, while inferring all other parameters. Furthermore, the BNP-FRET sampler estimates the system transition rates and thus the escape rates (i.e., sum of transition rates out of a given state) where the ground-truth escape rates differ from the posterior peaks by a relative average error of less than 8%. The results for the remaining parameters are provided in Section S7.

Experimental data analysis: HJ

In this section, we benchmark our method over a wide range of kinetic rates employing experimental data acquired using HJ under varying buffer MgCl2 concentrations (15,51).

HJs are four-way double-helical DNA junctions existing in various structural configurations (41,52,53). When not interacting with multivalent metal ions, electrostatic repulsion between negatively charged phosphate groups of the four helical arms forces HJs to assume a wide configuration where the arms lie along the two diagonals of a square. However, in the presence of ions, such as Mg2+, interaction with the phosphate groups results in electrostatic screening. This reduced repulsion induces transitions to what is believed to be primarily two compact stacked configurations/conformations. The transitions between both conformations necessitates passing through the intermediate open configuration. Since, at high ion concentrations, displacing ions away from the phosphate group becomes increasingly difficult, in this scenario, we anticipate smaller transition rates between both conformations.

The HJ kinetic rates have been studied using both fluorescence lifetime correlation spectroscopy (15) and HMM analysis (54) on diffusing HJs assuming a priori a pair of high and low FRET system states. As expected, these previous studies show kinetic rates decreasing with increasing MgCl2 concentrations (41,43) and correspondingly longer dwells.

Here, our method, free from averaging and binning that are otherwise common in HMM analysis, is particularly well suited to learn the rapid kinetics at low Mg2+ concentrations. We apply our BNP-FRET to data acquired from HJs at 1, 3, 5, and 10 mM MgCl2 concentrations and sample the photophysical transition rates and the system transition probabilities.

The acquired bivariate posterior distributions over the FRET efficiencies and escape rates (computed via the logarithm of the system transition probability matrix Πσ) are presented in Fig. 4. Moreover, estimates for the other parameters can be found in Section S7. We note that our results are obtained on a single-molecule basis with a photon budget of 104105 photons.

For all four concentrations (see Fig. 4), our BNP-FRET sampler most frequently visited only two system states, while this was given as an input to the other analysis methods (15,54). Moreover, both escape rates are found to have similar values with an average of approximately 1,400s−1 (1 mM MgCl2), 140s−1 (3 mM MgCl2), 72s−1 (5 mM MgCl2), and 41 s−1 (10 mM MgCl2). These escape rates are in close agreement with values reported by fluorescence lifetime correlation spectroscopy and H2MM methods (15,54) of 1,300 s−1 (1 mM MgCl2), 170 (3 mM MgCl2), 100 (5 mM MgCl2), and 60 s−1 (10 mM MgCl2), which lie well within the bounds of our posteriors shown in Fig. 4 while simultaneously, and self-consistently, learning a number of system states.

Experimental data acquisition

In this section, we describe the protocol for preparing the surface-immobilized HJ sample labeled with a FRET pair and the procedure for recording smFRET traces from individual immobilized molecules. The sample preparation and recording of data follow previous work (55).

Sample preparation

The HJ used in this work consists of four DNA strands whose sequences are as follows:

  • R-strand: 5′-CGA TGA GCA CCG CTC GGC TCA ACT GGC AGT CG-3′

  • H-strand: 5′-CAT CTT AGT AGC AGC GCG AGC GGT GCT CAT CG-3′

  • X-strand: 5′-biotin-TCTTT CGA CTG CCA GTT GAG CGC TTG CTA GGA GGA GC-3′

  • B-strand: 5′-GCT CCT CCT AGC AAG CCG CTG CTA CTA AGA TG-3’

For surface immobilization, the X-strand was labeled with biotin at the 5′ end. For FRET measurements, the donor (ATTO-532) and acceptor (ATTO-647N) dyes were introduced into the H- and B-strands, respectively. In both cases, the dyes were labeled to thymine nucleotide at the 6th position from the 5′ ends of the respective strands (shown as T). All DNA samples (labeled or unlabeled) were purchased from JBioS (Shinjuku-ku, Japan) in the high-performance liquid chromatography purified form and were used without any further purification.

The HJ complex was prepared by mixing 1 mM solutions of R-, H-, B-, and X-strands in TN buffer (10 mM Tris-HCl with 50 mm NaCl, pH 8) at a 3:2:3:3 molar ratio, annealing the mixture at 94°C for 4 minutes, and gradually cooling it down (2°C–3°C  min−1) to room temperature (25°C ). For smFRET measurements, we used a sample chamber (SecureSeal, GBL621502, Grace Bio-Labs, Bend, OR, USA) with a biotin-PEG-SVA (biotin-poly(ethylene glycol)-succinimidyl valerate)-coated coverslip. The chamber was first incubated with streptavidin (0.1 mg mL−1 in TN buffer) for 20 min. This was followed by washing the chamber with TN buffer (3 times) and injection of 1 nM HJ solution (with respect to its H-strand) for 3–10 s. After this incubation period, the chamber was rinsed with TN buffer (3 times) to remove unbound DNA, and it was filled with TN buffer containing 1 mM (or 5 mM) MgCl2 and 2 mM Trolox for smFRET measurements.

smFRET measurements

The smFRET traces from individual HJs were recorded using a custom-built confocal microscope (Eclipse Ti, Nikon, Tokyo, Japan) equipped with the Perfect Focus System, a sample scanning piezo stage (Nano control B16-055), and a time-correlated single-photon counting module (SPC-130EM, Berlin, Germany).

The broadband light generated by a supercontinuum laser operating at 40 MHz (SC-400-4, Fianium, Southampton, UK) was filtered with a band-pass filter (FF01-525/30, Semrock, West Henrietta, NJ, USA) for exciting the donor dye, ATTO-532. This excitation light was introduced to the microscope using a single-mode optical fiber (P5-460B-PCAPC-1, Thorlabs, Newton, NJ, USA) and directed onto the sample using a dichroic mirror (ZT532/640rpc, Chroma, Cambridge, MA, USA) and a water immersion objective lens (Nikon Plan Apo IR 60×, numerical aperture: 1.27).

The excitation light was focused onto the top surface of the coverslip, and, during measurements, the focusing condition was maintained using the Perfect Focus System. The fluorescence signals were collected by the same objective, passed through the dichroic mirror, and guided to the detection assembly (Thorlabs DFM1/M) using a multimode fiber (Thorlabs M50L02S-A). Note that this multimode fiber (core diameter: 50 μm) also acts as the confocal pinhole. In the detection assembly, the fluorescence signals from the donor and acceptor dyes were separated using a dichroic mirror (ZT633rdc, Chroma Technology, Bellows Falls, VT, USA), filtered using band-pass filters (Chroma ET585/65m for donor and Semrock FF02-685/40 for acceptor), and detected using separate hybrid detectors (Becker and Hickl HPM-100-40-C).

For each detected photon, its macrotime (absolute arrival time from the start of the measurement) was recorded with 25.2 ns resolution and its microtime (relative delay from the excitation pulse) was recorded with 6.1 ps resolution using the time-correlated single-photon counting module operating in time-tagging mode. A router (Becker and Hickl HRT-41) was used to process the signals from the donor and acceptor detectors.

For recording smFRET traces from individual HJs, we first imaged a 10 × 3 μm area of the sample using the piezo stage by scanning it linearly at a speed of 1 μm s−1 in the x direction and with an increment of 0.1 μm in the y direction. Individual HJs appeared as isolated bright spots in the image.

Next, we fitted the obtained donor and acceptor intensity images with multiple 2D Gaussian functions to determine the precise locations of individual HJs. Note that, during this image acquisition, the laser excitation power was kept to a minimum (∼1 μW at the back aperture of the objective lens) to avoid photobleaching the dyes. In addition, we also employed an electronic shutter (Suruga Seiki, Shizuoka, Japan) in the laser excitation path to control the sample excitation as required.

Using the precise locations of individual HJs obtained, we recorded 30 s-long smFRET traces for each molecule by moving them to the center of the excitation beam using the piezo stage. For each trace, the laser excitation was blocked (using the shutter) for the first 5 s and was allowed to excite the sample for the remaining 25 s. Note that the smFRET traces were recorded using 40 μW laser excitation (at the back aperture of the objective lens) to maximize the fluorescence photons emitted from the dyes. We automated the process of acquiring smFRET traces from different molecules sequentially and executed it using a program written in house on Igor Pro (Wavemetrics, Portland, OR, USA).

Discussion

The sensitivity of smFRET under pulsed illumination has been exploited to investigate many different molecular interactions and geometries (8,9,10,11,56). However, quantitative interpretation of smFRET data faces serious challenges including an unknown number of system states and robust propagation of uncertainty from noise sources such as detectors and background. These challenges ultimately mitigate our ability to determine full distributions over all relevant unknowns and, traditionally, have resulted in data pre- or postprocessing compromising the information otherwise encoded in the rawest form of data: single-photon arrivals.

Here, we provide a general BNP framework for smFRET data analysis starting from single-photon arrivals under a pulsed illumination setting. We simultaneously learn transition probabilities among system states as well as determine photophysical rates by incorporating existing sources of uncertainty such as background and cross talk.

We benchmark our method using both experimental and simulated data. That is, we first show that our method correctly learns parameters for the simplest case with two system states and slow system transition rates. Moreover, we test our method on more challenging cases with more than two states using synthetic data and obtain correct estimations for the system state transition probabilities and thus the number of system states along with the remaining parameters of interest. To further assess our method’s performance, we analyzed experimental data from HJs suspended in solutions with a range of MgCl2 concentrations. These data were previously processed using other techniques assuming a fixed number of system states by binning photon arrival times (15).

Despite multiple advantages mentioned above for BNP-FRET, BNPs always come with an added computational cost as they take full advantage of information from single-photon arrival times and all existing sources of uncertainty. For this version of our general BNP method simplified for pulsed illumination, we further reduced the computational complexity by grouping empty pulses together. Therefore, the computational complexity increased only linearly with the number of input photons as the photons are treated independently.

The method described in this paper assumes a Gaussian IRF. However, the developed framework is not limited to a specific form for the IRF and can be used for data collected using any type of IRF by modifying Eq. 4. Furthermore, the framework is flexible in accommodating different illumination techniques such as alternating color pulses, which are typically used to directly excite the acceptor fluorophores. This can be achieved by simple modification of the propagator Qnψ in Eq. 4. A future extension of this method could relax the assumption of a static sample by adding spatial dependence to the excitation rate as we explored in previous works (35,47,57). This would allow our method to learn the dynamics of diffusing molecules, as well as their photophysical and system state transition rates.

Code availability

The BNP-FRET software package is available on Github at https://github.com/LabPresse/BNP-FRET.

Acknowledgments

We thank Dr. Zeliha Kilic, Weiqing Xu, and J. Shephard Bryan IV for their contributions and insight into the project. We also thank Dr. Douglas Shepherd for providing insight into the workings of detectors and other experimental equipment and Irina Gopich for discussions on FRET likelihoods. T.T. acknowledges support from the JSPS KAKENHI (grant no. 18H05265). S.P. acknowledges support from the NIH NIGMS (R01GM130745) for supporting early efforts in nonparametrics and NIH NIGMS (R01GM134426) for supporting single-photon efforts.

Declaration of interests

The authors declare no competing interests.

Editor: Jorg Enderlein.

Footnotes

Supporting material can be found online at https://doi.org/10.1016/j.bpr.2022.100088.

Supporting material

Document S1. Supporting material, Figures S1–S7, and Tables S1–S3
mmc1.pdf (3.6MB, pdf)
Document S2, Article plus supporting material
mmc2.pdf (4.4MB, pdf)

References

  • 1.Weiss S. Fluorescence spectroscopy of single biomolecules. Science. 1999;283:1676. doi: 10.1126/science.283.5408.1676. [DOI] [PubMed] [Google Scholar]
  • 2.Lippincott-Schwartz J., Snapp E., Kenworthy A. Studying protein dynamics in living cells. Nat. Rev. Mol. Cell Biol. 2001;2:444. doi: 10.1038/35073068. [DOI] [PubMed] [Google Scholar]
  • 3.Huang B., Bates M., Zhuang X. Super-resolution fluorescence microscopy. Annu. Rev. Biochem. 2009;78:993–1016. doi: 10.1146/annurev.biochem.77.061906.092014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Lelek M., Gyparaki M.T., et al. Zimmer C. Single-molecule localization microscopy. Nat. Rev. Methods Primers. 2021;1:39. doi: 10.1038/s43586-021-00038-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Fazel M., Wester M.J. Analysis of super-resolution single molecule localization microscopy data: A tutorial. AIP Adv. 2022;12:010701. [Google Scholar]
  • 6.Datta R., Heaster T.M., et al. Skala M.C. Fluorescence lifetime imaging microscopy: fundamentals and advances in instrumentation, analysis, and applications. J. Biomed. Opt. 2020;25:1–43. doi: 10.1117/1.JBO.25.7.071203. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Garini Y., Young I.T., McNamara G. Spectral imaging: principles and applications. Cytometry A. 2006;69:735–747. doi: 10.1002/cyto.a.20311. [DOI] [PubMed] [Google Scholar]
  • 8.Roy R., Hohng S., Ha T. A practical guide to single-molecule FRET. Nat. Methods. 2008;5:507–516. doi: 10.1038/nmeth.1208. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Mazal H., Haran G. Single-molecule FRET methods to study the dynamics of proteins at work. Curr. Opin. Biomed. Eng. 2019;12:8–17. doi: 10.1016/j.cobme.2019.08.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Schuler B. Single-molecule FRET of protein structure and dynamics - a primer. J. Nanobiotechnol. 2013;11:S2. doi: 10.1186/1477-3155-11-S1-S2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Lu M., Ma X., et al. Mothes W. Associating HIV-1 envelope glycoprotein structures with states on the virus observed by smFRET. Nature. 2019;568:415–419. doi: 10.1038/s41586-019-1101-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Mooney S.M., Qiu R., et al. Weninger K.R. Cancer/testis antigen PAGE4, a regulator of c-Jun transactivation, is phosphorylated by homeodomain-interacting protein kinase 1, a component of the stress-response pathway. Biochemistry. 2014;53:1670–1679. doi: 10.1021/bi500013w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Wozniak A.K., Schröder G.F., et al. Oesterhelt F. Single-molecule FRET measures bends and kinks in DNA. Proc. Natl. Acad. Sci. USA. 2008;105:18337–18342. doi: 10.1073/pnas.0800977105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Chung H.S., Louis J.M., Gopich I.V. Analysis of fluorescence lifetime and energy transfer efficiency in single-molecule photon trajectories of fast-folding proteins. J. Phys. Chem. B. 2016;120:680–699. doi: 10.1021/acs.jpcb.5b11351. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Heo W., Hasegawa K., et al. Tahara T. Scanning two-dimensional fluorescence lifetime correlation spectroscopy: Conformational dynamics of DNA Holliday junction from microsecond to subsecond. J. Phys. Chem. Lett. 2022;13:1249–1257. doi: 10.1021/acs.jpclett.1c03787. [DOI] [PubMed] [Google Scholar]
  • 16.Zeug A., Woehler A., et al. Ponimaskin E.G. Quantitative intensity-based fret approaches—a comparative snapshot. Biophys. J. 2012;103:1821–1827. doi: 10.1016/j.bpj.2012.09.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Kuppa S., Deveryshetty J., et al. Antony E. Rtt105 regulates rpa function by configurationally stapling the flexible domains. Nat. Commun. 2022;13:5152. doi: 10.1038/s41467-022-32860-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Kilic Z., Sgouralis I., Pressé S. Generalizing hmms to continuous time for fast kinetics: hidden markov jump processes. Biophys. J. 2021;120:409–423. doi: 10.1016/j.bpj.2020.12.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Kapusta P., Wahl M., et al. Enderlein J. Fluorescence lifetime correlation spectroscopy. J. Fluoresc. 2007;17:43–48. doi: 10.1007/s10895-006-0145-1. [DOI] [PubMed] [Google Scholar]
  • 20.Ishii K., Tahara T. Two-dimensional fluorescence lifetime correlation spectroscopy. 1. principle. J. Phys. Chem. B. 2013;117:11414–11422. doi: 10.1021/jp406861u. [DOI] [PubMed] [Google Scholar]
  • 21.Otosu T., Ishii K., Tahara T. Microsecond protein dynamics observed at the single-molecule level. Nat. Commun. 2015;6:7685. doi: 10.1038/ncomms8685. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Yoo J., Kim J.-Y., et al. Chung H.S. Fast three-color single-molecule FRET using statistical inference. Nat. Commun. 2020;11:3336. doi: 10.1038/s41467-020-17149-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Lerner E., Ingargiola A., Weiss S. Characterizing highly dynamic conformational states: The transcription bubble in RNAP-promoter open complex as an example. J. Chem. Phys. 2018;148:123315. doi: 10.1063/1.5004606. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Saurabh A., Safar M., et al. Pressé S. Single photon smFRET. I. theory and conceptual basis. bioRxiv. 2022 doi: 10.1101/2022.07.20.500887. Preprint at. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Sgouralis I., Pressé S. An introduction to infinite HMMs for single-molecule data analysis. Biophys. J. 2017;112:2021–2029. doi: 10.1016/j.bpj.2017.04.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Sgouralis I., Madaan S., et al. Pressé S. A Bayesian nonparametric approach to single molecule Förster resonance energy transfer. J. Phys. Chem. B. 2019;123:675–688. doi: 10.1021/acs.jpcb.8b09752. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Fox E.B., Sudderth E.B., et al. Willsky A.S. A sticky HDP-HMM with application to speaker diarization. Ann. Appl. Stat. 2011;5:1020. [Google Scholar]
  • 28.Teh Y.W., Jordan M.I., et al. Blei D.M. Hierarchical Dirichlet processes. J. Am. Stat. Assoc. 2006;101:1566–1581. [Google Scholar]
  • 29.Jayaram S. A constructive definition of Dirichlet priors. Stat. Sin. 1994;4:639. [Google Scholar]
  • 30.Pitman J. Poisson–Dirichlet and GEM invariant distributions for split-and-merge transformations of an interval partition. Combinator. Probab. Comput. 2002;11:501–514. [Google Scholar]
  • 31.Ferguson T.S. A Bayesian analysis of some nonparametric problems. Ann. Stat. 1973;1:209. [Google Scholar]
  • 32.Gershman S.J., Blei D.M. A tutorial on Bayesian nonparametric models. J. Math. Psychol. 2012;56:1–12. [Google Scholar]
  • 33.Sgouralis I., Whitmore M., et al. Pressé S. Single molecule force spectroscopy at high data acquisition: A Bayesian nonparametric analysis. J. Chem. Phys. 2018;148:123320. doi: 10.1063/1.5008842. [DOI] [PubMed] [Google Scholar]
  • 34.Tavakoli M., Taylor J.N., et al. Pressé S. Single molecule data analysis: An introduction. Adv. Chem. Phys. 2017;162:205–305. [Google Scholar]
  • 35.Tavakoli M., Jazani S., et al. Pressé S. Pitching single-focus confocal data analysis one photon at a time with Bayesian nonparametrics. Phys. Rev. X. 2020;10:011021. doi: 10.1103/physrevx.10.011021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Tavakoli M., Jazani S., et al. Pressé S. Direct photon-by-photon analysis of time-resolved pulsed excitation data using Bayesian nonparametrics. Cell Rep. Phys. Sci. 2020;1:100234. doi: 10.1016/j.xcrp.2020.100234. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Bryan J.S., 4th, Sgouralis I., Pressé S. Diffraction-limited molecular cluster quantification with Bayesian nonparametrics. Nat. Comput. Sci. 2022;2:102–111. doi: 10.1038/s43588-022-00197-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Fazel M., Jazani S., et al. Pressé S. High resolution fluorescence lifetime maps from minimal photon counts. ACS Photonics. 2022;9:1015–1025. doi: 10.1021/acsphotonics.1c01936. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Okamoto K., Sako Y. State transition analysis of spontaneous branch migration of the Holliday junction by photon-based single-molecule fluorescence resonance energy transfer. Biophys. Chem. 2016;209:21–27. doi: 10.1016/j.bpc.2015.11.004. [DOI] [PubMed] [Google Scholar]
  • 40.Hohng S., Joo C., Ha T. Single-molecule three-color FRET. Biophys. J. 2004;87:1328–1337. doi: 10.1529/biophysj.104.043935. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.McKinney S.A., Déclais A.C., et al. Ha T. Structural dynamics of individual Holliday junctions. Nat. Struct. Biol. 2003;10:93–97. doi: 10.1038/nsb883. [DOI] [PubMed] [Google Scholar]
  • 42.McKinney S.A., Freeman A.D.J., et al. Ha T. Observing spontaneous branch migration of Holliday junctions one step at a time. Proc. Natl. Acad. Sci. USA. 2005;102:5715–5720. doi: 10.1073/pnas.0409328102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Panyutin I.G., Biswas I., Hsieh P. A pivotal role for the structure of the Holliday junction in DNA branch migration. EMBO J. 1995;14:1819–1826. doi: 10.1002/j.1460-2075.1995.tb07170.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Metropolis N., Rosenbluth A.W., et al. Teller E. Equation of state calculations by fast computing machines. J. Chem. Phys. 1953;21:1087–1092. [Google Scholar]
  • 45.Hastings W.K. Monte Carlo sampling methods using Markov chains and their applications. Biometrika. 1970;57:97–109. [Google Scholar]
  • 46.Fazel M., Wester M.J., et al. Lidke K.A. Bayesian multiple emitter fitting using reversible jump Markov chain Monte Carlo. Sci. Rep. 2019;9:13791. doi: 10.1038/s41598-019-50232-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Jazani S., Sgouralis I., et al. Pressé S. An alternative framework for fluorescence correlation spectroscopy. Nat. Commun. 2019;10:3662. doi: 10.1038/s41467-019-11574-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Rabiner L.R. A tutorial on hidden Markov models and selected applications in speech recognition. Proc. IEEE. 1989;77:257–286. [Google Scholar]
  • 49.Fazel M., Alexander V., et al. Pressé S. Fluorescence lifetime: beating the irf and interpulse window. bioRxiv. 2022 doi: 10.1101/2022.09.08.507224. Preprint at. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Gillespie D.T. A general method for numerically simulating the stochastic time evolution of coupled chemical reactions. J. Comput. Phys. 1976;22:403–434. [Google Scholar]
  • 51.Kilic Z., Sgouralis I., et al. Pressé S. Extraction of rapid kinetics from smFRET measurements using integrative detectors. Cell Rep. Phys. Sci. 2021;2:100409. doi: 10.1016/j.xcrp.2021.100409. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Karymov M., Daniel D., et al. Lyubchenko Y.L. Holliday junction dynamics and branch migration: single-molecule analysis. Proc. Natl. Acad. Sci. USA. 2005;102:8186–8191. doi: 10.1073/pnas.0407210102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Ferapontova E.E., Mountford C.P., et al. Mount A.R. Electrochemical control of a DNA Holliday Junction nanoswitch by Mg2+ ions. Biosens. Bioelectron. 2008;24:422–428. doi: 10.1016/j.bios.2008.04.021. [DOI] [PubMed] [Google Scholar]
  • 54.Pirchi M., Tsukanov R., et al. Nir E. Photon-by-photon hidden Markov model analysis for microsecond single-molecule FRET kinetics. J. Phys. Chem. B. 2016;120:13065–13075. doi: 10.1021/acs.jpcb.6b10726. [DOI] [PubMed] [Google Scholar]
  • 55.Kilic Z., Sgouralis I., Pressé S. Generalizing HMMs to continuous time for fast kinetics: Hidden Markov jump processes. Biophys. J. 2021;120:409–423. doi: 10.1016/j.bpj.2020.12.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Sebolt-Leopold J.S., English J.M. Mechanisms of drug inhibition of signalling molecules. Nature. 2006;441:457–462. doi: 10.1038/nature04874. [DOI] [PubMed] [Google Scholar]
  • 57.Jazani S., Sgouralis I., Pressé S. A method for single molecule tracking using a conventional single-focus confocal setup. J. Chem. Phys. 2019;150 doi: 10.1063/1.5083869. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Document S1. Supporting material, Figures S1–S7, and Tables S1–S3
mmc1.pdf (3.6MB, pdf)
Document S2, Article plus supporting material
mmc2.pdf (4.4MB, pdf)

Articles from Biophysical Reports are provided here courtesy of Elsevier

RESOURCES