Abstract
Thermally driven conformational fluctuations (or ‘breathing’) of DNA plays important roles in the function and regulation of the ‘macromolecular machinery of genome expression.’ Fluctuations in double-stranded (ds) DNA are involved in the transient exposure of pathways to protein binding sites within the DNA framework, leading to the binding of regulatory proteins to single-stranded (ss) DNA templates. These interactions often require that the ssDNA sequences, as well as the proteins involved, assume transient conformations critical for successful binding. Here we use microsecond-resolved single-molecule Fӧrster Resonance Energy Transfer (smFRET) experiments to investigate the backbone fluctuations of short [oligo(dT)n] templates within DNA constructs that also serve as models for ss-dsDNA junctions. Such junctions, together with the attached ssDNA sequences, are involved in interactions with the ssDNA binding (ssb) proteins that control and integrate the functions of DNA replication complexes. We analyze these data using a chemical network model based on multi-order time-correlation functions (TCFs) and probability distribution functions (PDFs) that characterize the kinetic and thermodynamic behavior of the system. We find that the oligo(dT)n tails of ss-dsDNA constructs inter-convert, on sub-millisecond time-scales, between three macrostates with distinctly different end-to-end distances. These are: (i) a ‘compact’ macrostate that represents the dominant species at equilibrium; (ii) a ‘partially extended’ macrostate that exists as a minority species; and (iii) a ‘highly extended’ macrostate that is present in trace amounts. We propose a model for ssDNA secondary structure that advances our understanding of how spontaneously formed nucleic acid conformations may facilitate the activities of ssDNA-associating proteins.
Graphical Abstract

1. Introduction
The genome of living cells exists primarily as B-form double-stranded (ds) DNA, which is packaged in the cell nucleus as chromatin (in eukaryotes) or nucleoid bodies (in bacteria) (1). During various stages of the cell cycle, segments of single-stranded (ss) DNA are exposed by the action of the DNA replication helicase to make available the ssDNA templates required by the DNA polymerases for the synthesis of the complementary daughter strands for cell division, and to allow access of other functional and regulatory proteins to these transiently formed ssDNA sequences. Within the cell, and in functional complexes such as the replisome, these exposed ssDNA segments are coated by ssDNA binding (ssb) proteins that function to protect the genetic information from nucleases and to favorably configure the single strands for further manipulation by the ‘machinery of genome expression’ (2, 3). These proteins interact with ssDNA to form stable, cooperatively bound nucleoprotein filaments, in which the bound ssDNA chain segments are nearly fully extended to their maximum contour lengths permitted by the sugar-phosphate backbones, resulting also in largely unstacking the attached bases (4, 5). Because such fully extended conformations of ssDNA are likely to be unstable in the absence of bound protein, a fundamental question arises concerning how ssb protein-ssDNA interactions evolve along a molecular reaction coordinate to form extended nucleoprotein filaments.
Unlike the cooperatively structured and stabilized dsDNA conformation, many possible conformations can be adopted by ssDNA in solution, and the specifics of these conformations depend on a variety of interactions between the nucleic acid components with their aqueous surrounds. These interactions include base stacking and unstacking, intra-chain repulsion between adjacent backbone phosphates, counterion condensation along the polyelectrolyte ssDNA backbone, binding and orientation of polar water molecules around these counterions and the backbone phosphates, strain within the sugar-phosphate backbone and increases in configurational entropy of the backbone as a consequence of base-unstacking and the concomitant rotation around multiple single-bonds of the extended backbone (6–8).
The cumulative effect of these interactions is a free energy landscape that determines the probability with which a given ssDNA configuration will occur. In general, the coordinate space over which the free energy landscape is defined can be partitioned into distinct sub-spaces, herein referred to as ‘macrostates,’ which correspond to local basins of stability separated by relatively high activation barriers (9). Each macrostate consists of a ‘family’ of energetically-degenerate microscopic conformations (or ‘microstates’) that rapidly interconvert (on tens-of-nanoseconds time-scales) in a non-Markovian manner (10–12). In contrast, higher thermal barriers separate conformations associated with different macrostates, such that their interconversion occurs on tens-of-microseconds and longer time scales. The relatively high activation barriers associated with macrostate interconversion ensures that the observed dynamics of the overall system can be accurately described using a kinetic master equation in which the rate constants contain no ‘hidden variables’ and memory effects need not be considered (13).
In this work, we present microsecond-resolved single-molecule Förster resonance energy transfer (smFRET) experiments to monitor the conformational fluctuations of short single-stranded oligo-deoxythymidine [oligo(dT)n] templates of varying length (n = 14 and 15) and polarity (3’ versus 5’). We note that these constructs have been used in previous studies as model systems to monitor the dynamics of the non-cooperative and cooperative binding and assembly of the T4 bacteriophage ssb proteins (gene product 32, or gp32) onto target ssDNA lattices (14, 15). The binding site size of the gp32 monomer is 7 nucleotides (nts) so that for the templates with a total length, n, of 14 nts there is excess room for the non-cooperative binding of a single gp32 monomer or, alternatively, precisely enough room for the cooperative assembly of a gp32 dimer. For templates with n = 15 nts, there are two possible conformations for the gp32 dimer-bound state; either at nucleotide positions 1 – 14 or 2 – 15. Furthermore, the gp32 proteins are expected to assemble onto ssDNA lattices in a polar manner, so that the assembly mechanism will depend on the polarity of the ss-ds DNA junction. Our previous studies showed that gp32 proteins bind to, and dissociate from, the ss-dsDNA junction as monomers on tens-of-milliseconds time scales, and that gp32 monomer-bound intermediates must encounter (also, on tens-of-milliseconds time scales) a ‘productive’ ssDNA conformation near the ss-dsDNA junction to permit a second ssb protein to form a cooperatively-bound gp32 dimer-cluster (14). By examining the sub-millisecond dynamics of the ssDNA lattice fluctuations in these constructs in the absence of proteins, we may ask whether short-lived conformational intermediates – which can serve as potential transition states for ssb binding – play a role in the assembly mechanism.
Each of the oligo(dT)n conformations that we investigated were formed within the overhanging ‘tail’ of a ss-dsDNA junction construct, in which the distal end of the ssDNA chain is labeled with the FRET donor chromophore Cy3 and the conjugate DNA strand is labeled, at the ss-dsDNA junction, with the FRET acceptor chromophore Cy5 (see Fig. 1). Such ss-dsDNA constructs labeled in this way were previously used by Lohman and Ha and co-workers in related studies with the E. coli SSB system using standard smFRET methods (3, 16, 17). Similar constructs were also used by Pollack and co-workers in their studies of ssDNA conformations by small-angle x-ray scattering (SAXS), smFRET and other biophysical techniques and analysis methods (18). Our experiments detect individual fluorescence photons from a continuously irradiated single molecule sample using instrumental methods (19) and data analyses developed previously (14, 20). Because these experiments resolve individual photon detection events, the dominant contribution to measurement uncertainty is the sparse statistical sampling of the ensemble-averaged signal rate (21), which varies stochastically in time due to the conformational fluctuations of the macromolecule. The time-dependent signal rates of the Cy3 donor (D) and the Cy5 acceptor (A) FRET-pair chromophores are given by ID(A)(t) = ND(A)(t)⁄Tw, where ND(A) is the number of photons detected during a fixed time window Tw (≥ 1 μs) beginning at the time t. Thus, the smFRET efficiency, EFRET(t) = IA(t)⁄[ID(t) + IA(t)], is a time-averaged signal, which represents an average of the macromolecular configurations detected during the sampling period Tw.
Figure 1.

Schematic of one of the four single-stranded (ss) – double-stranded (ds) DNA constructs [3’-Cy3/Cy5-oligo(dT)15-ss-dsDNA – see Table 1] used in our experiments. (A) The double-stranded region of the ss-dsDNA construct was attached to the surface of a fused silica quartz (SiO2) slide coated with PEG and containing biotin-neutravidin linkers. (B) The Cy3 donor chromophore was attached to the distal end of the oligo(dT)n overhanging ‘tail’ region of the ss-dsDNA construct, and the Cy5 acceptor was attached to the opposite strand at the ss-dsDNA junction, as shown. Note that Panel B represents a more chemically detailed version of the lower part of the construct cartooned in Panel A.
Our microsecond-resolved smFRET measurements contain both structural and kinetic information about the microscopic configurations of the oligo(dT)n templates. To extract the maximum amount of information possible from our data, we applied an ‘inverse-problem’ procedure for analyzing photon correlation measurements of this type (14, 20). From the photon count data stream, we determined the following three ensemble-averaged functions of the smFRET signal: (i) the two-point time correlation function (TCF) that describes the average correlations between two successive measurements separated by the time interval τ, and which was sampled on microsecond time scales; (iii) the four-point TCF, that describes the average correlations between four successive measurements separated by the time intervals τ1, τ2 and τ3; and (iii) the probability distribution function (PDF) of the mean FRET efficiency P(EFRET), which was sampled on the millisecond time scale required for a randomly selected microstate to relax to a metastable intermediate. The above functions were used to characterize the equilibrium and kinetic properties of multi-step conformational transition pathways at equilibrium. While the two-point TCF, , contains information about the characteristic relaxation times of the system, the four-point TCF, , contains additional information about the exchange times between pathway intermediates (20, 22–24).
A key observation is that both the two-point and four-point TCFs exhibit two principal relaxation components that are well-separated in time (~20 μs and ~275 μs) and reflect conformation changes of the oligo(dT)n lattices. As discussed in Sect. 3.4, this observation suggests that the simplest kinetic model for the underlying dynamics is a network of three coupled macrostates in thermal equilibrium. We simulated the PDF and the TCFs using a master equation (25, 26), for which each macrostate is distinguished by a different mean FRET efficiency. Kinetic rate constants were varied as input parameters to the master equation to achieve the ‘best fit’ between simulated and experimentally derived functions. As we discuss further below, we compared the results of a linear versus cyclical three-state network model from which we obtained optimized values for the forward and backward rate constants that connect the conformational macrostates, in addition to the probability of observing each macrostate. The optimized rate constants so obtained determine the free energy barriers that separate each macrostate (applying the Arrhenius equation), and the optimized probabilities determine the free energy minima of each macrostate (applying the Boltzmann distribution).
A major finding of this work is that all four of the oligo(dT)n templates that we investigated undergo conformational dynamics over a broad range of time scales, spanning tens-to-hundreds of microseconds. We observed that all four of these systems interconvert between three distinct macrostates: (i) a ‘compact’ and relatively stable macrostate; (ii) a ‘partially extended’ macrostate of intermediate stability; and (iii) a ‘highly extended’ macrostate of relatively low stability. The free energy minima that we determined for the three macrostates increase rapidly with decreasing FRET efficiency, suggesting that the primary mechanism of macrostate destabilization is the loss of configurational entropy associated with increasing chain extension (10). Moreover, our results indicate the existence of transition state barriers that are intrinsic to the oligo(dT)n templates themselves, and which prolong the lifetimes of extended chain configurations relative to the more stable compact configurations. The existence of metastable extended chain conformations suggests that these short-lived species may serve as intermediates along the reaction pathway for the formation of ssb nucleoprotein clusters or filaments.
2. Materials and Methods
2.1. Single-Stranded (ss) – Double-Stranded (ds) DNA Constructs.
We performed microsecond-resolved smFRET experiments on four ss-dsDNA constructs with a ssDNA oligo(dT)n overhanging ‘tail’ region that varied in length (n = 14 and 15) and polarity (3’ and 5’, see Table 1). In Fig. 1 we show a representative schematic diagram using the 3’-Cy3/Cy5-oligo(dT)15-ss-dsDNA construct. These samples were fluorescently labeled using the Cy3/Cy5 FRET donor-acceptor chromophore pair, as indicated.
Table 1.
Nucleotide base sequences for ss-dsDNA constructs studied in this work.
| DNA construct | Nucleotide base sequence |
|---|---|
| 3’-Cy3/Cy5-oligo(dT)15-ss-dsDNA | 5’-GTCGCCAGCCTCGCAGCCTTTTTTTTTTTTTTT/Cy3/-3’ 3’-biotin/CAGCGGTCGGAGCGTCGG-Cy5/-5’ |
| 3’-Cy3/Cy5-oligo(dT)14-ss-dsDNA | 5’-GTCGCCAGCCTCGCAGCCTTTTTTTTTTTTTT/Cy3/-3’ 3’-biotin/CAGCGGTCGGAGCGTCGG-Cy5/-5’ |
| 5’-Cy3/Cy5-oligo(dT)15-ss-dsDNA | 3’-GTCGCCAGCCTCGCAGCCTTTTTTTTTTTTTTT/Cy3/-5’ 5’-biotin/CAGCGGTCGGAGCGTCGG-Cy5/-3’ |
| 5’-Cy3/Cy5-oligo(dT)14-ss-dsDNA | 3’-GTCGCCAGCCTCGCAGCCTTTTTTTTTTTTTT/Cy3/-5’ 5’-biotin/CAGCGGTCGGAGCGTCGG-Cy5/-3’ |
2.2. Sample Preparation.
Sample solutions were prepared in an aqueous buffer containing 100 mM NaCl, 6 mM MgCl2 and 10 mM Tris (pH 8.0), as described previously (14, 19, 27). Oxygen scavenging and triplet quenching reagents were used to extend the excited singlet state lifetime and to minimize triplet state blinking (28, 29). Microfluidic sample chambers were constructed using fused silica quartz microscope slides, as shown in Fig. S1A (30). Single molecules were attached to the slide surface using a neutravidin linker, which binds strongly to biotin attached to both the ss-dsDNA constructs (Table 1) and to the poly(ethylene glycol) (PEG) layer that coats the slide surface (see Fig. 1A). Additional details of the sample preparation are included in the SI.
2.3. Microsecond-Resolved Single-Molecule (sm)FRET.
We performed our smFRET experiments on the ss-dsDNA constructs using instrumentation and procedures described previously (14, 19). Additional details of the data acquisition and signal calibration are described in the SI. A schematic diagram of the instrumental setup is shown in Fig. S1B. The sample was illuminated using a continuous wave laser at 532 nm set up in a prism-type total internal reflection fluorescence (TIRF) configuration. The laser beam was focused to a 50 μm diameter spot at the sample and the incident power was adjusted to 10 mW. The Cy3 donor (D) and Cy5 acceptor (A) fluorescence from a single ss-dsDNA construct was imaged through a pinhole, and spatially separated into D and A paths using a dichroic beam splitter. Individual photons from the D and A paths were detected using avalanche photodiodes (APDs), and each detection event was assigned a ‘time-stamp’ with 0.1 μs resolution and recorded to the computer hard disk. In post-data acquisition, we determined the time-dependent D(A) signal rates according to ID(A)(t) = ND(A)(t)⁄Tw, where ND(A) was the number of D(A) photons detected during an adjustable dwell period Tw (≥ 1 μs) at time t. The mean total fluorescence photon detection rate, , was typically ~15,000 – 20,000 cps. The signal ‘background rate’ was determined to be ~ 2,000 cps by scanning the xy-stage position away from the center of a molecule. Thus, the signal-to-background ratio of our experiments is ~ 10. Each single-molecule data set was recorded for a total duration of thirty seconds. From the time-dependent signal rates we determined the FRET efficiency according to EFRET(t) = IA(t)⁄IT(t). We measured the instrument response function (IRF) of our apparatus, which we determined to have a full-width-at-half-maximum (FWHM) value of ~1 μs (see SI).
We used different values for the dwell period Tw, depending on the statistical function that we calculated from our data. For our calculations of two-point and four-point time-correlation functions (TCFs), we used the value for Tw corresponding to the shortest time interval sampled in the calculation. We used Tw = 1 μs for our TCF calculations that focused on the first 20 μs of the decay dynamics. However, for the majority of our TCF calculations, we used Tw = 10 μs, as discussed in further detail below. For our calculations of the probability distribution function (PDF) of EFRET values we used Tw = 1 ms, which according to our analysis is roughly equal to the time required for the system to relax from a randomly sampled initial state to one of the possible metastable intermediates.
3. Theoretical Background
In our experiments, we monitored the oligo(dT)n end-to-end distance RDA between the D (Cy3) and A (Cy5) chromophore labels of the ss-dsDNA constructs (see Fig. 1) through measurements of the smFRET efficiency: EFRET = [1 + (RDA⁄R0)6]−1. Here R0 = 0.211(JDAκ2n−4ΦD)1/6 is the characteristic Förster distance (in units of Å) where ΦD is the D fluorescence quantum yield, n is the refractive index of the medium, JDA is the overlap integral between the D emission and the A absorbance spectra, and κ2 is the transition dipole-dipole orientation factor (31). The orientation factor accounts for the relative angles between the D and A chromophores where is the unit vector that points in the direction of the D (A) transition dipole moment and is the unit vector that points from the D to the A. For cases in which an isotropic distribution of relative D and A orientations can be assumed, which would require that at least one of the two dyes be freely rotating around its ‘tether,’ the orientation factor takes on the value κ2 = 2/3 and the Förster distance R0 ≅ 56 Å (32, 33). In the remainder of this work, we focus on the properties of the observable EFRET as an indicator of the end-to-end distance, Ree (assumed ≈ |RDA|), of the oligo(dT)n ‘tail’ regions of the ss-dsDNA constructs that we studied, although we return to the interpretation of the observable in terms of Ree in the Conclusions.
3.1. Macrostates and Free Energy Minima.
The oligo(dT)n templates can adopt a broad range of microscopic chain configurations (i.e., microstates). Assuming there are N total microstates available at a given temperature T, the probability that the ith microstate occurs is given by the Boltzmann distribution (34)
| (1) |
where kB is Boltzmann’s constant, Gi is the Gibbs free energy of the ith microstate, and is the partition function. We note that the normalized probabilities sum to unity: . Individual microstates are expected to interconvert on tens-of-nanosecond time scales, which is much faster than the microsecond resolution of our experiments (10). Thus, the ‘observed’ D and A fluorescence signals each represents a time-average over the possible microstates that occur at a particular time t during the sampling period Tw (= 10 μsec). It is useful to define a ‘macrostate’ as a collection of pseudo-degenerate microstates that occupy a common, locally stable region of the free energy landscape. We may then regard each D and A fluorescence photon detection event as a stochastic sample of the partial-ensemble-average of microstates defined within the phase-space boundary of a single macrostate.
In our analysis, we model the probability distribution function (PDF) of EFRET values as a sum of contributions from M macrostates , where the jth macrostate has the Gaussian form , with mean value 〈EFRET〉j, standard deviation σj, and amplitude Aj. The Gaussian model for the distribution of configurational microstates (within a given macrostate) is consistent with a random-flight, finite-length polymer chain (10, 35). Macrostates that contain a relatively large number of pseudo-degenerate microstates contribute a correspondingly broad distribution of EFRET values. Thus, the relative probability that the jth macrostate is present at equilibrium is given by the integrated area of its component Gaussian . We note that the sum of normalized probabilities over all macrostates is equal to unity: . Applying Eq. (1) to macrostates, we may thus write the free energy of the jth macrostate (in units of kBT) as a function of EFRET
| (2) |
In the context of the above model, the free energy of the jth macrostate is represented as a parabolic function of EFRET with vertical displacement . The total free energy surface as a function of EFRET can then be constructed by summing over the individual macrostate contributions according to .
3.2. Free Energies of Activation.
We assume that under equilibrium conditions, the microstates of the oligo(dT)n template are partitioned amongst M quasi-stable macrostates, and there exist conformational reaction channels that permit discreet, stochastic transitions to occur from the ith to the jth macrostate. In addition, we make the standard approximation that these transformations can be modeled as continuous processes (25, 36). We thus define the time-dependent population of the ith macrostate, pi(t), as a continuous, single-valued function that obeys the coupled, first-order ordinary differential equations . Here the forward rate constant kij is associated with the transition from macrostate i to macrostate j, and the backward rate constant kji is associated with the transition from macrostate j to macrostate i. The rate constant kij is related to the free energy of activation , which is given by the Arrhenius equation (37)
| (3) |
In Eq. (3), the maximum rate constant kmax corresponds to a barrierless process, to which all the thermally activated processes of the system are referenced.
3.3. Two-Point and Four-Point Time-Correlation Functions (TCFs).
We obtain kinetic information from our time-resolved smFRET data through our analyses of two-point and four-point time-correlation functions (TCFs) (20). We define the time-dependent fluctuation of the EFRET observable δEFRET(t) = EFRET(t) − 〈EFRET〉, where 〈EFRET〉 is the mean value determined from a time series of measurements performed on a single molecule. The two-point TCF is the average product of two successive measurements separated by the interval τ
| (4) |
In Eq. (4), the angle brackets indicate a running average over all possible initial measurement times. As we discuss further below, the function contains information about the minimum number of possible macrostates of the system, each macrostate’s mean FRET efficiency value, in addition to the characteristic time scales of state-to-state interconversion. The two-point TCF may be expressed using a standard statistical mechanical model according to:
| (5) |
Here is the probability that the ith macrostate is present at equilibrium, is the value of the fluctuation observable corresponding to the ith macrostate, and pij(τ) is the conditional probability that the system undergoes a transformation from macrostate i to macrostate j during the interval τ. Equation (5) states that is the statistically weighted average product of consecutive (two-point) observations, which are expected to occur within the interval τ as the system undergoes spontaneous transitions between the M macrostates.
We define the four-point TCF as the time-averaged product of four successive measurements separated by the intervals τ1, τ2, and τ3, respectively.
| (6) |
The function contains information about the roles of intermediates within the conformational kinetic pathways that affect the rates of interconversion between macrostates. The first term on the right-hand-side of Eq. (6) may be modeled according to:
| (7) |
Equation (7) expresses in terms of the weighted average of four-point products of the observable δEFRET, in which a transition occurs from macrostate i to macrostate j during the interval τ1, a subsequent transition occurs from macrostate j to macrostate k during the period τ2, and a final transition occurs from macrostate k to macrostate l during the interval τ3. We may thus view the four-point TCF as the τ1, τ2, τ3-dependent weighted average of four-point transitions (i.e., elementary kinetic pathways) that connect the M macrostates. As we discuss in the next section, the two-point and four-point TCFs described by Eqs. (5) and (7), respectively, in addition to the equilibrium PDF can be modeled theoretically using a kinetic master equation.
3.4. Master Equation Analysis.
To model the ensemble-averaged quantities discussed in the previous sections, it is necessary to determine for a given conformational reaction scheme the equilibrium distribution of macrostates, , and the time-dependent conditional probability, pij(τ), that the system will undergo a transformation from macrostate i to macrostate j during the interval τ. These probabilities are solutions to the master equation, which is the set of coupled first-order ordinary differential equations that describe the time-dependent macrostate populations.
| (8) |
In Eq. (8), p(t) = [p1(t), p2(t), . . , pM(t)] is the time-dependent macrostate population vector, and is its time-derivative. The M × M rate matrix K contains the rate constants kij according to
| (9) |
The rate matrix K is written in the ‘site-basis’ of elementary kinetic steps that comprise the coupled reaction scheme. The off-diagonal elements Kij = kij correspond to the gain in population of macrostate j due to the transition i → j. For cases in which there is no transition i → j, Kij = 0. The diagonal elements are written , and represent the sum of all reactions that deplete population from macrostate i. Equation (8) must satisfy completeness, , and the detailed balance conditions with . Moreover, the vector of equilibrium macrostate populations peq can be obtained by solving the master equation subject to the time-stationary boundary condition .
Given a particular set of rate constants, we solve Eq. (8) by finding the eigenvalues, λ1, λ2, … λ3, and the eigenvectors, v1, v2, … , v3, which satisfy the eigenvector equation Kvi = λivi. The eigenvalues are equal to the characteristic relaxation rates of the coupled dynamical system, and the eigenvectors represent the collective modes (i.e., linear combinations of elementary transitions) associated with the characteristic relaxations. An important symmetry property of the rate matrix K is that the elements within each of its columns sum to zero (i.e., ), which results in λ1 = 0 and the remaining eigenvalues λ2, … λM < 0 (38).
A general solution to the master equation [Eq. (8)] is
| (10) |
where the constants are expansion coefficients that depend on the specific initial condition, which we indicate by the index n (26). Equation (10) can be further simplified by making the substitution λ1 = 0 and taking the infinite-time limit, so that for all possible initial conditions.
| (11) |
The conditional probabilities needed to evaluate the two-point and four-point TCFs [Eqs. (5) and (7), respectively] are obtained by constraining Eq. (11) using appropriate boundary conditions. The conditional probability pij(t) is the probability that at time t, the population vector has unit occupancy of the jth macrostate, given that at time zero there was unit occupancy of the ith macrostate. Thus, for example, by applying the initial condition n = i to Eq. (11) with pi(0) = 1 and pj≠i(0) = 0, we may determine the expansion coefficients . The conditional probability pij(t) is then equal to the jth element of the population vector given by Eq. (11), according to
| (12) |
In principle, the above procedure can be used to determine the pij(t)s analytically for all of M2 possible combinations of i, j ∈ {1,2, … , M}, provided the number of states M is relatively small (20). In practice, the pij(t)s are determined numerically by writing Eq. (11) in the eigenbasis of the K matrix and carrying out the similarity transformation pn(t) = U[eλt]U−1pn(0), where the unitary matrix U = [v1, v2, … , vM] and the matrix [eλt] is diagonal with non-zero elements (25). Additional details of this procedure are given in the SI.
We note that in the infinite-time limit Eq. (12) recovers the expected equilibrium population values, . Furthermore, the conditional probabilities pij(t) are each a sum of M – 1 decaying exponential functions of time, with decay constants equal to the eigenvalues of the K matrix. Substitution of Eq. (12) into Eq. (5) shows that the two-point TCF is also a sum of M – 1 exponential terms: , where the s are the relative weights of the collective relaxation processes. Similarly, the four-point TCF can be shown to be a sum of (M – 1)2 terms: (20). Thus, the number of exponential terms that contribute to the two-point and four-point TCFs indicate the minimum number of macrostates M that underlie the observed dynamics.
4. Results and Discussion
For each of the four ss-dsDNA constructs that we studied (see Table 1), we recorded time series of single-molecule photon data streams, each for a duration of thirty seconds (see Sect. 2.3). The results of our analyses indicate that all four of the ss-dsDNA constructs exhibited very similar behaviors, as one might expect when focusing on conformational changes within identical ssDNA sequences only, although the four constructs behave differently at slower time scales in the presence of ssb (gp32) proteins (14, 15). In the following discussion, which deals only with ssDNA conformational changes, we focus on the results of one of the ss-dsDNA constructs [3’-oligo(dT)15]. We summarize our results for all four of the ss-dsDNA constructs that we studied in the SI. These additional data sets will serve as reference states in our subsequent studies of ssDNA-ssb interactions.
We begin by making some qualitative observations. In Fig. 2, we show an example trajectory of the donor D (Cy3, green) and acceptor A (Cy5, red) fluorescence intensities constructed from our raw single-molecule photon correlation measurements of the 3’-Cy3/Cy5-oligo(dT)15-ss-dsDNA construct. The fluorescence intensities are calculated according to ID(A)(t) = ND(A)(t)⁄Tw, where ND(A)(t) is the number of D (A) photons recorded within a finite sampling window Tw = 1 ms beginning at time t. We show an additional example trajectory using the same sampling window Tw = 1 ms, but for a duration of 8.5 s in Fig. S2 of the SI. The D and A fluorescence intensities fluctuate in an anti-correlated manner due to the varying inter-chromophore FRET coupling and indicates that these signals depend largely on conformational changes of the 3’-oligo(dT)15 tail region of the ss-dsDNA construct.
Figure 2.

Example smFRET trajectory for the 3’-Cy3/Cy5-oligo(dT)15-ss-dsDNA construct. The donor D (Cy3, green) and acceptor A (Cy5, red) fluorescence intensities, ID(A)(t) = ND(A)(t)⁄Tw, are shown with sampling window resolution Tw = 1 ms over a duration of ~ 2 seconds. The FRET efficiency EFRET(t) is plotted in the bottom panel (blue). The average Signal-to-Noise Ratio (SNR) per sampling window is .
The D and A intensities are used to calculate the FRET efficiency according to EFRET(t) = IA(t)⁄[ID(t) + IA(t)] (blue). For this particular photon data stream, the mean total signal flux was . We note that the apparent ‘noise’ of the trajectory is due to the measurement uncertainty associated with the finite number of photons sampled during the detection window Tw at a constant low-level flux. We assign the average Signal-to-Noise Ratio to the integrated signal shown in Fig. 2, where is the mean number of detected photons per sampling window. Thus, for Tw = 1 ms the average , which ensures that the EFRET signal is statistically meaningful. Upon further inspection of the EFRET trajectory shown, we see that the majority of values are clustered near EFRET ~ 0.7. Moreover, we observe that infrequent transitions occur to short-lived states with values centered near EFRET ~ 0.5, and even less frequent transitions occur to short-lived states with EFRET < 0.5. In the analysis that follows, we quantify the structural and dynamical information provided by the photon data stream using statistical functions.
We analyzed our raw photon correlation measurements by constructing probability distribution functions (PDFs) using the sampling window Tw = 1 ms, and two-point and four-point time-correlation functions (TCFs) using the sampling window Tw = 10 μs (see Fig. 3). In constructing these functions, we compared the results from individual single-molecule data sets to check for consistency before averaging these together. Typically, ~35 individual single-molecule data sets were used to compute each function. As discussed in further detail below, we performed numerical simulations of the PDFs and TCFs using the master equation approach described in Sect. 3.4. Additional details of the numerical methods we employed are described in the SI.
Figure 3.

Experimentally derived statistical functions constructed from single-molecule photon data streams of the 3’-Cy3/Cy5-oligo(dT)15-ss-dsDNA construct. Fits to these functions are also shown, which are based on a three-macrostate kinetic network model. (A) The normalized histogram of EFRET values using a sampling window Tw = 1 ms and 21 million photons is represented as a sum of three Gaussian macrostates, which are labeled S1 – S3. (B) The two-point time correlation function (TCF) (blue points) is well modeled as a weighted sum of two exponentially decaying terms (red curve) with decay constants that are well-separated in time. (C) The four-point TCF is shown as a three-dimensional rendering (red points), which is overlaid with the theoretical surface predicted by the three-state kinetic model (mesh surface). (D) The four-point TCF in comparison to the theoretical model are shown as two-dimensional contour plots. The experimental data are shown as white contours and the model as a solid color-coded surface. The optimized values obtained from the three-macrostate master equation for the equilibrium PDF and the rate constant parameters are given in Table 2 and Table 3, respectively.
In Fig. 3A, we show a normalized histogram of EFRET values, which was compiled from the data streams of 35 individual single-molecule measurements of the 3’-Cy3/Cy5-oligo(dT)15-ss-dsDNA construct. The EFRET values were calculated by binning the D and A intensities using the sampling window Tw = 1 ms. This value for Tw was chosen because it is roughly equal to the time required for a randomly selected microstate to relax to one of the possible metastable intermediates available to the system. This is, equivalently, the time scale for which the probability to occupy any one macrostate becomes time-independent (see Fig. 4C and related discussion below). Each single-molecule photon data stream had an approximate mean flux , and was collected over an acquisition period of 30 s. Thus, the total number of single-photon measurements used to compile the histogram is approximately ~21 million. The shape of the PDF, which is mostly peaked around EFRET ~ 0.7, but also exhibits a shoulder at EFRET ~ 0.5 and a tail for EFRET < 0.5, is consistent with our qualitative observations of the time-dependent behavior of the EFRET trajectory shown in Fig. 2.
Figure 4.

Theoretical modeling results of microsecond-resolved smFRET measurements of the 3’-Cy3/Cy5-oligo(dT)15-ss-dsDNA construct using the kinetic master equation. (A) Kinetic scheme depicting the elementary reactive steps connecting the three macrostates of the oligo(dT)15 template: ‘compact’ S1, ‘partially-extended’ S2, and ‘highly-extended’ S3. (B) The normalized probability distribution function (PDF) of EFRET values obtained from an optimized fit to the histogram shown in Fig. 3A. The distribution is represented as a sum of the M = 3 Gaussian macrostates with areas equal to the equilibrium probabilities , with i ∈ {1,2,3}. (C) The optimized solution to the master equation is the equilibrium PDF, peq, and the time-dependent conditional probabilities, pij(t), which give the likelihood that the system occupies the jth macrostate at time t if it previously occupied the ith macrostate at time 0, with i, j ∈ {1,2,3}. The equilibrium values are established on the time scale of ~1 ms. (D) The free energy landscape is constructed from the PDF shown in panel B according to the Boltzmann distribution [Eq. (2)]. Intersecting dashed purple lines indicate the entropic contributions to the free energy barriers that separate macrostates. Intersecting solid curves indicate the total free energy barriers calculated from the optimized rate constants according to the Arrhenius equation [Eq. (3)]. The optimized values obtained for the equilibrium PDF are given in Table 2, those for the rate constants are given in Table 3, and the free energy minima and transition barriers are given in Table 4. Estimated enthalpic and entropic contributions to the transition barriers are given in Table 5.
The sub-millisecond dynamics of the system are characterized by the two-point and four-point TCFs. As we discuss in further detail below, the TCFs exhibited two principal decay components (20.3 μs and 272 μs), which are well-separated in time (see Fig. 3B and related discussion below). As discussed in Sect. 3.4, the number of exponential terms that contribute to the TCFs (= M – 1) indicate the minimum number of macrostates M that may underlie the observed dynamics. Based on this observation, we adopted a kinetic network scheme composed of M = 3 macrostates in thermal equilibrium as the simplest possible model to interpret our data. The presence of two well-separated relaxation components in the TCFs is our primary justification for applying this three-macrostate model. We have thus described the PDF as a sum of three Gaussians, as indicated in Fig. 3A. We note that the shapes of the PDFs for all four ss-dsDNA constructs investigated were very similar, as one might expect given that the base sequences and lengths of the four oligo(dT)n templates are nearly identical (see Fig. S3 of the SI). We next discuss the kinetic analyses of these data, which provided optimized parameters for the Gaussian decomposition of the PDFs of the ss-dsDNA constructs (Table S1 of the SI), and we list the results for the 3’-Cy3/Cy5-oligo(dT)15-ss-dsDNA construct in Table 2.
Table 2.
Optimized parameters for the Gaussian components (based on the three-macrostate model) of the FRET probability distribution function (PDF) shown in Fig. 3A. Parameters defined in Eq. (2). Error bars for the mean FRET efficiencies are ~0.005 (see Fig. S11 and Table S10 of the SI).
| Construct | A 1 | 〈EFRET〉1 | σ 1 | A 2 | 〈EFRET〉2 | σ 2 | A 3 | 〈EFRET〉3 | σ 3 | |||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3’oligo(dT)15 | 3.04 | 0.77 | 0.11 | 0.81 | 1.00 | 0.52 | 0.07 | 0.18 | 0.08 | 0.23 | 0.07 | 0.01 |
We calculated the experimental two-point TCFs [defined by Eq. (4)] from the time-dependent photon data streams of our single-molecule measurements (see SI for further details). Similar to the behavior of the PDFs discussed above, we found that the two-point TCFs of the four ss-dsDNA constructs exhibited very similar decay dynamics, which spanned the range of time scales: 1 μs – 1 ms. On the fastest time scale of 1 – 10 μs, the TCFs exhibited a relaxation component on the order of ~3 μs, as shown in Fig. S4 of the SI. We fit the first 20 microseconds of the TCFs to a model mono-exponential decay, and the optimized time constants for each of the ss-dsDNA constructs are listed in Table S2 of the SI. Relaxations on the time scale of ~3 μs have been observed previously in photophysical studies of the Cy3 chromophore, and are known to be associated with excited state intersystem crossing and forward photoisomerization, which undergo the reverse step of ground state recovery within ~10 μs (39–43). This photoisomerization leads to quenching of the Cy3 donor chromophore excited state, which leads to a change in the relative intensities of Cy3 donor and Cy5 acceptor fluorescence in our ss-dsDNA constructs, and thus to a change in the FRET efficiency. We therefore assigned the ~3 μs decay in our current studies to photoisomerization of the Cy3 chromophore, and we assumed that these relatively fast processes contributed to the ‘time-averaged’ background of the signal fluctuations that we observed on tens-of-microseconds and longer time scales, and which we interpreted to be due to structural changes of the oligo(dT)n templates.
In Fig. 3B, the blue data points show the experimental two-point TCF over the range of time scales from 20 μs – 1 ms, which was calculated from the same 35 single-molecule measurements of the 3’-Cy3/Cy5-oligo(dT)15-ss-dsDNA construct used to compile the histogram shown in Fig. 3A. Although the majority of TCFs calculated from the individual single-molecule data streams decayed to zero, as expected according to Eq. (4), a small number of single-molecule data streams resulted in non-zero asymptotic behavior due to variation in measurement quality, so that it was necessary to include a constant offset B in our model fits. Upon modeling the average experimental decay using a tri-exponential function, we obtained very similar values of the relaxation time constants, although the quality of the fit (as determined by the goodness-of-fit parameter R2) did not improve (see Fig. S5 and Table S3 of the SI). From these analyses, we concluded that the TCFs contain two well-separated principal decay components, which justifies our use of the three-macrostate model to interpret simultaneously dynamic and equilibrium properties of the system. Thus, Fig. 3B shows the optimized three-macrostate model fit for the two-point TCF given by , with relaxation time constants [eigenvalues of the rate matrix, Eq. (9)] and , as shown, and the constant B, as discussed above. A comparison between the two-point TCFs and their model fits for all four ss-dsDNA constructs, including plots extending over the full range of time scales (20 μs – 3 s), are shown in Fig. S6 of the SI, and the corresponding model fit parameters are listed in Table S4 of the SI.
We next calculated the four-point TCFs from our experimental data according to Eq. (6). In Fig. 3C, we show a three-dimensional rendering of this function, , where we have set the intermediate time interval τ2 = 0. These data are shown overlaid with a two-dimensional model function, , which is constructed from the optimized solution to the kinetic master equation. In Fig. 3D, we show the experimental and simulated four-point TCFs as a two-dimensional contour plot. We note that the three-dimensional surface of exhibits pronounced convex contour lines (corresponding to negative values of , see Table S5), which indicate the presence of kinetic ‘bottleneck’ intermediates in the reaction pathway that inhibit the rapid exchange of population among the various macrostates of the system (20). We show the four-point TCFs and their model fits to all four ss-dsDNA constructs in Fig. S7 of the SI, and the corresponding model fit parameters are listed in Table S5 of the SI. The combined statistical functions shown in Fig. 3 serve to constrain a multi-parameter optimization of solutions to the master equation.
As stated above, the presence of two decay components in the two-point TCFs implies that there must be at least three conformational macrostates between which the oligo(dT)n templates interconvert. Moreover, the three-state model is consistent with the histogram of EFRET values (see Fig. 3A), in which the underlying Gaussian features, labeled S1 – S3, may be assigned to a ‘compact’ macrostate, a ‘partially-extended’ macrostate, and a ‘highly-extended’ macrostate, respectively. The compact macrostate S1 has the largest mean EFRET value, since it contains chain configurations with relatively short D – A chromophore distances. The partially-extended macrostate S2 and the highly-extended macrostate S3, on the other hand, have smaller mean EFRET values, indicating chain configurations with correspondingly larger D – A separations.
In Fig. 4A, we depict the coupled three-macrostate scheme that we used to interpret our results. The solutions to the M = 3 master equation [Eq. (8)] can be written analytically in terms of the rate constants of elementary reactive steps (20). We performed numerical simulations of Eq. (8) by searching the parameter space of input rate constants and EFRET values, as described in our prior work (14, 20). The solutions to the master equation are the time-dependent conditional probabilities pij(t) [Eq. (12)] and the equilibrium distribution of macrostates [Eq. (11)], which may be used to simulate expressions for the PDF and the two-point and four-point TCFs described in Sect. 3.3. We performed an iterative search of the parameter space in which the statistical functions p(EFRET), and , shown in Fig. 3 for the 3’-Cy3/Cy5-oligo(dT)15-ss-dsDNA construct, were calculated for a given set of input rate constants and mean EFRET values, which were subsequently compared to the experimentally-derived functions. The agreement between simulated and experimental functions was quantified using a linear least squares target function χ2, which was minimized using a multi-parameter regression algorithm to determine an optimized solution to the master equation (see SI for further details). We compared the results of our analyses using linear versus cyclical three-state network models, which showed that the cyclical model is favored. We confirmed that our optimized results represent a global minimum of the parameter space by monitoring the increase of the χ2 error function with variation of the time constant parameters (see Fig. S11 of the SI). The equilibrium populations and kinetic time constants resulting from of our analysis of the 3’-Cy3/Cy5-oligo(dT)15-ss-dsDNA construct are summarized in Table 2 and Table 3, respectively, and those of all four ss-dsDNA constructs are presented in Table S1 and Table S6 of the SI.
Table 3.
Optimized forward and backward time constants corresponding to the elementary reactive steps obtained for the 3’-Cy3-Cy5-oligo(dT)15-ss-dsDNA construct. The time constant parameters are defined in Eq. (9). Error bars were determined from the cross-sections of the error function surfaces , shown in Fig. S11 of the SI.
| Construct | ||||||
|---|---|---|---|---|---|---|
| 3’-oligo(dT)15 | 0.113 ±0.004 | 0.025 ±0.0005 | 104 ±1.85 | 1.85 ±0.03 | 4.01 ±0.03 | 0.325 ±0.03 |
In Fig. 4B, we show the model equilibrium PDF obtained from our optimization analysis of the 3’-Cy3/Cy5-oligo(dT)15-ss-dsDNA construct using the three-macrostate scheme. We have modeled the PDF as a sum of three Gaussian components, with normalized areas equal to the equilibrium populations of the three macrostates (see Table 2). Thus, macrostate S1 is the dominant component with and mean FRET efficiency 〈EFRET〉1 = 0.77 ± 0.01, macrostate S2 is a minority component with and mean FRET efficiency 〈EFRET〉2 = 0.52 ± 0.07, and macrostate S3 is a trace component with and mean FRET efficiency 〈EFRET〉3 = 0.23 ± 0.07. These optimized Gaussian parameters can be used to estimate the relative free energy minima of the macrostates using the Boltzmann relation [Eq. (2)].
In Fig. 4D, we plot the individual macrostate contributions to the free energy surface of the 3’-Cy3/Cy5-oligo(dT)15-ss-dsDNA construct, which are shown as dashed purple curves. In Fig. S8 of the SI, we compare the free energy surfaces for all four ss-dsDNA constructs. The sum of the macrostate contributions is shown as a solid blue curve. The three macrostates are represented by parabolas with minimum values (taken as the ground state), and . Here, the superscript ‘o’ indicates that we have taken the minimum of macrostate S1 as the reference for all other energies. The parabolic dependence of the free energy with respect to the EFRET parameter is due to our assumption of a Gaussian distribution of end-to-end distances of the oligo(dT)n template for a given macrostate. The Gaussian distribution describes approximately the relative probability to observe a microscopic chain configuration with an end-to-end distance (and a corresponding EFRET value) that deviates from the mean value, and thus accounts for the entropic contribution to the free energy penalty associated with chain elongation or compression (10). The Gaussian model is, of course, accurate for long ideal polymer chains, and ignores the effects of excluded volume and other internal chain segment interactions. However, modified Gaussians with non-zero origins can describe chain molecules of finite length, and account for the effects of intramolecular interactions. For the oligo(dT)n template strands, the random walk statistics of chain configurations is additionally influenced by attractive and repulsive interactions between non-adjacent nucleotides and the tendency of adjacent bases to stack locally. The presence of three distinct macrostates, each with a different mean end-to-end distance, suggests that there exist chain configurations of the oligo(dT)n templates with characteristically different local flexibilities, which are (presumably) due to the effects of different distributions of stacked bases. In the context of the Gaussian macrostate model, the points at which the component parabolas (dashed purple curves) cross in the free energy surface approximate the entropic contributions to the transition state barriers for macrostate interconversion. However, the true free energy barriers are likely to include enthalpic contributions associated with the energy needed to rearrange the internal constraints imposed by a given distribution of stacked bases associated with a given macrostate. In this way, the Gaussian macrostate model can be used to estimate the enthalpic contributions to the free energy barriers. We next examine the free energy barriers of activation that result from the kinetic analysis of our data.
In Fig. 4C, we show the time-dependence of the optimized conditional probabilities pij(t) where i, j ∈ {1,2,3}. These functions illustrate the kinetic behavior of the three-macrostate system. The equilibrium populations are the asymptotic values approached by the conditional probabilities in the long-time limit. For example, the diagonal terms pii(t) decay from unity to the equilibrium population value on the time scale , while off-diagonal terms pji(t) increase from zero to the equilibrium population value on the time scale . The relaxation dynamics of all nine conditional probability terms occur primarily on sub-millisecond time scales, such that the equilibrium (time-independent) values are established on the time scale of ~1 ms. The optimized values of the time constants for the 3’-Cy3/Cy5-oligo(dT)15-ss-dsDNA construct are presented in Table 3, and the time constants for all four of the ss-dsDNA constructs are listed in Table S6 of the SI. It is interesting to note that the fastest interconversion processes are between macrostates S1 and S2 ( and ), while the slowest interconversions are between macrostates S1 and S3 ( and ) and between macrostates S2 and S3 ( and ).
From the optimized time constants, we determined the free energies of activation using the Arrhenius relation [Eq. (3)]. In calculating the various values for , we have assumed that the fastest rate constant resulting from our analysis () for the transition S2 → S1 corresponds to the formation of a single TT stacked base pair with transition barrier (44) such that . We make this assumption based on the premise that the underlying differences in the secondary structures of the three macrostates is the mean number of stacked bases, and that interconversion between macrostates involves the formation or disruption of stacked base pairs. The remaining activation barrier heights are measured relative to this value. In Table 4, we list these activation energies for the 3’-Cy3/Cy5-oligo(dT)15-ss-dsDNA construct, and we compare the results of all four ss-dsDNA constructs in Table S7 of the SI.
Table 4.
Optimized free energy minima and transition barriers for the 3’-Cy3/Cy5-oligo(dT)15-ss-dsDNA construct. The S1 macrostate is taken to be the ground state and excited state energies are given by [Eq. (2)] with parameters and σi listed in Table 2. The activation energies are determined by [Eq. (3) of the main text, with kmax ≡ k21e1.7, which assumes that the fastest rate constant of the system corresponds to the formation of a single TT stacked base pair with activation barrier ~1 kcal mol−1 = 1.7 kBT (44)]. The rate constant parameters are listed in Table S6. Energies are listed in units of kBT.
| Construct | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| 3’-oligo(dT)15 | 0 | 1.11 | 3.62 | 3.22 | 2.81 | 10.04 | 9.63 | 7.89 | 7.89 |
We plot the barrier heights as horizontal lines in the free energy diagram shown in Fig. 4D. We note that the transition barrier corresponding to the uphill reaction S1 → S2 is (green curve) exceeds the entropic contribution given by the intersection between the Gaussian components. We thus estimate the enthalpic contribution to the activation barrier for the S1 → S2 transition as . Similarly, the backward transition S1 ← S2 has (by assumption) such that and . The transition barriers corresponding to the forward and backward reactions S2 ⇌ S3 are and , respectively. The absolute barrier heights corresponding to these values (= 7.89 kBT in both cases) are indicated on the free energy diagram in Fig. 4D (orange curve). These barriers exceed the estimated entropic contribution , suggesting that the forward and backward transitions between macrostates S2 and S3 requires of internal redistribution energy. Similarly, the forward and backward transition barriers separating macrostates S1 and S3 are and , respectively. The absolute barrier heights corresponding to these values (10.04 kBT and 9.63 kBT, respectively) are plotted on the free energy surface shown in Fig. 4D as a red curve, and exceed the estimated entropic contribution , suggesting that the enthalpic contribution to the activation energies are and .
We list the estimated enthalpic and entropic contributions to the activation barriers for the 3’-Cy3/Cy5-oligo(dT)15-ss-dsDNA construct in Table 5 and for all four ss-dsDNA constructs in Table S8 of the SI. These values are very similar for the four constructs, as expected. The enthalpic contributions to the forward and backward activation barriers for the S1 ⇌ S2 reaction are within the range ~1 – 1.7 kBT, which follows from our assumption that (44). This value is comparable in magnitude to the enthalpy of base unstacking obtained for TT base pairs (1.04 kcal mol−1 ≈ 1.7 kBT) in thermal denaturation studies of nicked DNA (45). We note that the enthalpic contributions to the forward and backward activation barriers for both the S1 ⇌ S3 and the S2 ⇌ S3 processes are comparable to one another and within the range ~3 – 5 kBT, which is approximately twice the size of the enthalpic barriers for the S1 ⇌ S2 processes.
Table 5.
Estimated transition barrier enthalpies and entropies for the 3’-Cy3/Cy5-oligo(dT)15-ss-dsDNA construct, which are based on the Gaussian macrostate model. The entropic contribution to the transition barrier is defined with the free energy at the point of intersection between the parabolas describing macrostate-i and macrostate-j (dashed purple curves in Fig. 4D). The enthalpy of transition is estimated to be . Enthalpic and entropic contributions to the transition barriers are given in units of kBT.
| Construct | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| 3’-oligo(dT)15 | 1.50 | 6.28 | 4.65 | 1.72 | 1.29 | 3.76 | 3.35 | 3.21 | 3.21 |
5. Conclusions
In this paper we use microsecond-resolved smFRET experiments, recently developed in our laboratory, to begin to characterize conformational fluctuations in the structures of ssDNA at low concentrations in dilute aqueous solution. This is important for studies of protein-DNA interactions because – unlike dsDNA – the interactions stabilizing ssDNA conformations are not cooperative, and therefore conformations that exist at equilibrium, even at trace concentrations, are measurable with our present techniques. These results are likely to be significant from a biological perspective because our approach extends smFRET techniques to the microsecond time regime. By expanding the range of experimentally accessible time scales, it is thus possible to monitor not only the binding and unbinding of protein components of protein-DNA complexes on millisecond time scales and longer, but also to observe directly the interconverting conformational species (on sub-millisecond time scales) of ssDNA that comprise the interaction partners in the initial steps of biologically relevant protein-ssDNA interactions.
As described in the Introduction, and also in previous papers (14, 19, 20, 27), we have been using smFRET methods to study, in particular, the initial steps in the binding, assembly, and translocation of the bacteriophage T4 ssb proteins (gp32) and their cooperatively-bound clusters onto and along ssDNA lattices. Understanding the molecular details of these interactions with both the ssDNA lattices and ss-dsDNA junctions, which are present in the DNA constructs that we have studied, are central to obtaining molecular insight into the regulatory mechanisms of these ssb proteins as they interact with and control the functions of the other components of the T4 replisome – including the helicase and primosome, helicase loader proteins, the DNA polymerases and the processivity clamps and clamp loaders.
In the current paper, we describe microsecond-resolved studies on the ssDNA ‘tails’ of the ss-dsDNA constructs that we have used previously in our studies of the gp32 binding and assembly system (see Fig. 1). We studied short ss oligo-deoxythymidine [oligo(dT)n] templates of varying length (n = 14 and 15 nts) and polarity (3’ versus 5’, Table S1 of the SI) because gp32 binding and assembly onto these ss-dsDNA constructs depends sensitively on these factors. As we have shown, and as one might expect, these differences in the details of the constructs do not significantly change the equilibrium distribution of the template conformations, nor the dynamics of state-to-state interconversion. Nevertheless, it was necessary to investigate all four ss-dsDNA constructs to provide a baseline for the ssb interaction studies that will follow. Thus, the present study enables us to re-examine the ssb-ssDNA interaction systems with the ability to determine how ssb protein binding and assembly is directly coupled to the dynamics of the DNA interaction partners.
The oligo(dT)n templates used in the present study comprised the ssDNA ‘tails’ of ss-dsDNA constructs, which had been labeled at their ends with a Cy3 donor chromophore and a Cy5 acceptor. We analyzed our data by constructing statistical functions, i.e., the equilibrium probability distribution function (PDF) of FRET efficiency values, with time resolution of 1 ms, and the two-point and four-point time correlation functions (TCFs) with time resolution of 10 μs. We found that the TCFs exhibited two well-separated principal decay components with time constants 20.3 μs and 272 μs (Fig. 3B). This observation suggests that the system can be described as a kinetic network of three coupled macrostates in thermal equilibrium. We simulated these functions using a kinetic network model based on a master equation to obtain optimized parameters from which we have reached the following conclusions.
Our results indicate that the oligo(dT)n templates of all four of the ss-dsDNA constructs used are dynamic systems with three distinct conformational macrostates: a ‘compact’ form S1, a ‘partially-extended’ form S2, and a ‘highly-extended’ form S3 (see Fig. 4A). These macrostates are separated in relative stability by just a few thermal energy units: , , and in the case of the 3’-Cy3-Cy5-oligo(dT)15-ss-dsDNA construct (see Fig. 4D and Table 4). The time scales of interconversion between these macrostates range from tens-to-hundreds of microseconds (see Fig. 4C and Table 3). The fastest relaxation time that we observed () corresponds to the transition from the partially-extended macrostate S2 to the compact macrostate S1, which is comparable to the relaxation times found for ssDNA loop closure in the formation of a DNA hairpin composed of an oligo(dT)16 loop (46, 47), and also for single-strand helix-coil transitions in oligoadenylates of various sizes (48).
The relative stabilities and relatively long lifetimes of the three macrostates may be related to the energetic cost of unstacking and stacking adjacent bases. We note that the relative stabilities of the three macrostates are comparable to the enthalpy of base unstacking for a single TT pair (1.04 kcal mol−1 ≈ 1.7 kBT), which was obtained from thermal denaturation studies of the bent-to-stacked equilibrium in nicked DNA (45). From the kinetic rate constants (Table 3), and by assuming that the fastest rate constant resulting from our analysis for the transition S2 → S1 corresponds to the formation of a single TT stacked base pair with transition barrier ~1 kcal mol−1 = 1.7 kBT (44), we determined the free energies of activation (Table 4). By applying the Gaussian macrostate model, we inferred the entropic and enthalpic contributions to the barrier heights (Table 5). The forward and backward enthalpies of activation that separate the highly-extended macrostate S3 from the partially-extended and compact macrostates, S2 and S1, respectively, have values within the range ~3.2 – 3.8 kBT. If we consider the barriers for all four of the ss-dsDNA constructs (see Table S8 of the SI), the range of forward and backward enthalpy values are from 2.9 – 4.8 kBT. The above estimates of the enthalpic contributions to the activation barriers suggest that only a small number of base stacking interactions are involved in these transformations.
The notion that multiple extended conformations of short oligomeric ssDNA templates exist as stable macrostates at equilibrium differs from our previous assumptions about short ssDNA lattices, in which only a single, relatively compact macrostate has been considered to be the primary conformation (14). It is interesting to note that previous workers have concluded from spectroscopic studies of model ssDNA constructs the existence of three distinct macrostates related to base-stacking interactions. Pörschke used sub-microsecond temperature-jump and UV hypochromicity experiments to observe two distinct relaxation processes in oligoadenylates, and inferred the presence of three different molecular states (48). Wu et al. performed time-resolved fluorescence measurements of a fluorescent base analogue-substituted ssDNA decamer and observed three distinct lifetime components, to which they assigned a ‘fully extended and completely unstacked state,’ a ‘loosely associated state,’ and a ‘fully stacked state’ (49). Since these studies were performed by ensemble methods, they provide insights about the average dynamic behavior of the oligomer single-strand (48) and the local environment experienced by a fluorescent base analogue probe, which is substituted within a single-strand (49). In contrast, our current studies are based on single-molecule FRET measurements that are sensitive to the distribution of end-to-end distances, which are affected directly by the tendency of flanking bases to undergo local stacking interactions.
The parameters that we have determined to characterize the free energy surface suggest that a new model for ssDNA secondary structure may be required (see Fig. 5). In order to gain additional insight into the nature of the extended macrostates, we carried out a coordinate transformation (25) of the free energy surface as a function of the FRET efficiency (shown in Fig. 4D) to one as a function of the donor – acceptor separation, which we equate with the end-to-end distance, Ree, of the oligo(dT)15 template (Fig. 5A). In performing this transformation, we used a value of the Förster transfer distance R0 = 56 Å obtained by others, which assumes that the relative orientation of the Cy3 donor and Cy5 acceptor, which are attached to the ss-dsDNA constructs using flexible linkers, is random and that the orientation factor κ2 = 2/3 (32, 33). From the transformed free energy surface, we see that the local minima of the three macrostates, S1 – S3, may be assigned to mean end-to-end distances 〈Ree〉 = 46.5, 56.0 and 67.5 Å, respectively.
Figure 5.

(A) Transformed free energy surface as a function of the end-to-end separation, Ree, of the oligo(dT)15 template. The transformation is given by P(Ree) = P(EFRET) dEFRET⁄dRee with and R0 = 56 Å, and assumes that the relative orientation between the FRET Cy3 donor and Cy5 acceptor is random such that the orientation factor κ2 = 2/3 (32, 33). (B) Schematic microstate configurations of the oligo(dT)15 template exemplifying the three macrostates S1 – S3 described by the free energy surface shown in panel (A). In B-form DNA (upper left), the mean distance (rise per residue) between adjacent bases is 3.4 Å. In the oligo(dT)15 template strands (labeled S1 – S3), adjacent bases are primarily unstacked with a mean inter-base distance of ~4.6 Å. Unstacked bases are stabilized by the gain in entropy associated with the resultant enhanced rotation about the segmental axes. A sequence of unstacked bases forms a contiguous linear segment. The presence of a single stacked base ‘defect site,’ introduces a ‘vertex’ with an accompanying change in the bond vector.
We next consider how the extended macrostates S2 and S3 might play a biological role as intermediates for protein binding mechanisms (50, 51). For example, the T4 ssDNA binding (ssb) protein may be able to access its 7 nt binding site footprint from extended macrostate conformations, S2 and S3, more readily than from the compact macrostate S1. It is known from electron microscopy studies of cooperatively bound ssb nucleoprotein filaments that adjacent nucleotide bases within the complexes are fully unstacked with an average inter-base separation of 4.6 Å (5). This contrasts with the 3.4 Å separation between stacked bases in B-form duplex DNA. The oligo(dT)14 and oligo(dT)15 templates examined in this study were chosen specifically to permit two ssb proteins to bind cooperatively within the final-formed nucleoprotein complexes. In the case of the 3’-Cy3/Cy5-oligo(dT)15-ss-dsDNA construct, the contour length of the ssDNA component within a cooperatively bound (ssb)2-oligo(dT)15 filament is expected to be approximately ~15 × 4.6 Å = 69 Å. Given that the mean end-to-end distance corresponding to macrostate S3 (Ree = 67.5 Å) is only slightly smaller than the contour length of the fully unstacked oligo(dT)15 template strand, we suggest that the extended macrostates do indeed represent potential loading sites for ssb nucleoprotein filament assembly.
Considering these findings, we propose a new model for ssDNA secondary structure, which is depicted schematically in Fig. 5B. We posit that at physiological conditions, an isolated ssDNA template exists at thermal equilibrium with a significant fraction of its flanking bases unstacked. This hypothesis is consistent with the findings of Pollack and co-workers, who combined small angle X-ray scattering of oligo(dT)30 and oligo(dA)30 templates with ensemble conformational analyses (52). In the B-form DNA duplex, the highly ordered secondary structure of the double-helix is stabilized by cooperative short-range attractive interactions (stacking) between flanking bases (depicted as blue purines and red pyrimidines) and the network of Watson-Crick hydrogen bonds between opposing bases of the complementary single-strands. The duplex is marginally destabilized by the Coulomb repulsions between negatively charged phosphates (shown as yellow circles) of the sugar-phosphate backbones, which are partially screened by the condensation layer of positively charged counterions in the immediate aqueous surroundings.
We conjecture that adjacent stacked bases within an isolated segment of ssDNA, which lack the stabilizing influence of the complementary strand of duplex DNA, are only marginally stabilized with respect to thermal fluctuations that disrupt the short-range (solvophobic) attraction between base surfaces (53). Unstacked bases within an isolated segment of ssDNA are energetically favored by the Coulombic repulsion between adjacent phosphates and the gain in entropy associated with enhanced rotational freedom about the segmental axes. We emphasize that the attractive base-stacking interactions are short range, and occur primarily at contact, while the Coulombic repulsion between adjacent phosphates in the backbone is a long-range effect, which dominates and stabilizes the extended forms of ssDNA segments. A sequence of metastable unstacked bases thus produces an essentially linear contiguous segment, which may be disrupted by the spontaneous formation of a stacking interaction between two adjacent bases along the ssDNA chain. The presence of such a vicinal stacked pair of bases introduces a ‘defect site,’ which then induces a change in the bond vector of two adjoining segments, each composed of unstacked bases. In Fig. 5B, we depict three examples of such microstate ssDNA conformations, which may each be assigned to one of the three macrostates S1 – S3, as shown. Each macrostate can be viewed as a stable family of microstates that contains a unique, dynamically equilibrated distribution of stacked base ‘defect sites.’ The principal difference between the underlying distributions of the three macrostates is the mean number of stacked base defect sites, whose presence effectively disrupts the linearity of the oligo(dT)15 template, and thereby reduces the end-to-end distance. The mean number of defect sites is maximized for the most stable macrostate S1 (depicted Fig. 5B as having three defect sites). The ground state S1 minimum represents the end-to-end distance (〈Ree〉 = 46.5) corresponding to balance between stabilizing and de-stabilizing (entropic and enthalpic) interactions, which arise in the presence of the dynamically interconverting stacked base defect sites. We further note that in the context of this model, conformational transformations between any two of the three macrostates, S1 – S3, correspond to the loss or gain of a discrete number of stacked base defect sites, which is fully consistent with the values that we have determined from our analyses for the relative stabilities and transition enthalpy barriers of the macrostates.
The interpretation of these experiments in terms of the three-macrostate model is based on a theoretical analysis in which we have made physically motivated and valid assumptions. However, the three-macrostate model is not intended to represent the only possible interpretation of these data, since more complex models cannot be ruled out. Nevertheless, the model presented herein is intended to represent the simplest interpretation that can self-consistently explain all aspects of the current experiments. The above model for ssDNA secondary structure can provide useful insights for the interpretation of future microsecond-resolved smFRET studies of the T4 bacteriophage ssb nucleoprotein filament assembly mechanism using these same ss-dsDNA constructs. The initial steps of ssb protein-filament assembly involve the association of a ssb (gp32) monomer with an exposed ssDNA template binding site (7 nts), which may form on sub-millisecond time scales as a contiguous segment of unstacked flanking bases. Additional functional steps in which ssb proteins can assemble into cooperatively bound clusters, slide along a ssDNA template strand and disassemble, will be mediated by the nucleation and dissolution of flanking stacked base defect sites, as described by our model. We anticipate that the model can be further tested by performing microsecond-resolved smFRET experiments on ss-dsDNA constructs with ssDNA sequences of purine or of mixed base sequences.
Supplementary Material
Acknowledgements
A.H.M. acknowledges useful discussions with Prof. Marina Guenza. The authors are also grateful to our laboratory colleagues in the Marcus and von Hippel groups for many helpful discussions. This work was supported by grants from the National Institutes of Health General Medical Sciences (GM-15792 to A.H.M. and P.H.v.H.) and the National Science Foundation Chemistry of Life Processes Program (CHE-1608915 to A.H.M.). B.I. and C.S.A. were supported as predoctoral trainees by an NIH-NIGMS Institutional Research Service Award in Molecular Biology and Biophysics (GM-07759). P.H.v.H. is an American Cancer Society Research Professor of Chemistry.
Footnotes
Supporting Information
Supplementary details for materials and methods. Example photon data stream; 9 s duration. Comparison of data and results for the four different Cy3/Cy5-oligo(dT)n-ss-dsDNA constructs (3’ versus 5’, n = 14, 15). Numerical procedures for calculating two-point and four-point TCFs. Instrument response function. Free energy landscapes. Numerical procedures for solving the three-state master equation. Procedure for implementing numerical multivariable optimization. Statistical uncertainty analysis.
References
- 1.Luger K, Dechassa ML, and Tremethick DJ, New insights into nucleosome and chromatin structure: an ordered state or a disorderedaffair? Nature 13, 436–447 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Spies M, and Ha T, Inching over hurdles: How DNA helicases move on crowded lattices. Cell Cycle 9, 1742–1749 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Kozlov AG, Shinn MK, and Lohman TM, Regulation of nearest-neighbor cooperative binding of E. coli SSB protein to DNA. Biophys. J 117, 2120–2140 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.von Hippel PH, and Marcus AH, The many roles of binding cooperativity in the control of DNA replication: Invited New and Notable Commentary related to Kozlov, Shinn and Lohman, Biophys. J, 2019, 117, 2120–40. Biophys. J. 117, 2143–2146 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Delius H, Mantell NJ, and Alberts B, Characterization by electron microscopy of the complex formed between T4 bacteriophage gene 32-protein and DNA. J. Mol. Biol 67, 341–350 (1972). [DOI] [PubMed] [Google Scholar]
- 6.Duane M, Molecular Biophysics: Structures in Motion (Oxford University Press, Oxford, 1993). [Google Scholar]
- 7.Muthukumar M, 50th anniversary perspective: A perspective on polyelectrolyte solutions. Macromolecules 50, 9528–9560 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Laage D, Elsaesser T, and Hynes JT, Water dynamics in the hydration shells of biomolecules. Chem. Revs 117, 10694–10725 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Wales D, Energy Landscapes: Applications to Clusters, Biomolecules and Glasses, Cambridge Molecular Science (Cambridge University Press, Cambridge, 2004). [Google Scholar]
- 10.Rubinstein M, and Colby RH, Polymer Physics (Oxford University Press, Oxford, 2003). [Google Scholar]
- 11.Noé F, Krachtus D, Smith JC, and Fischer S, Transition networks for the comprehensive characterization of complex conformational change in proteins. J. Chem. Theory Comput 2, 840–857 (2006). [DOI] [PubMed] [Google Scholar]
- 12.Chung H-S, Gopich IV, McHale K, Cellmer T, Louis JM, and Eaton WA, Extracting rate coefficiencts from single-molecule photon trajectories and FRET efficiency histograms for a fast-folding protein. J. Phys. Chem. A 115, 3642–3656 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Beyerle ER, Dinpahjooh M, Ji H, von Hippel PH, Marcus AH, and Guenza MG, Dinucleotides as simple models of the base stacking-unstacking component of DNA ‘breathing’ mechanisms. Nucleic Acids Res 49, 1872–1885 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Phelps C, Israels B, Jose D, Marsh MC, von Hippel PH, and Marcus AH, Using microsecond single-molecule FRET to determine the assembly pathways of T4 ssDNA binding protein onto model DNA replication forks. Proc. Nat. Acad. Sci. Plus USA 114, E3612–E3621 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Lee W, Gillies JP, Jose D, von Hippel PH, and Marcus AH, Single-molecule FRET studies of cooperative and non-cooperative binding kinetics of the bacteriophage T4 single-stranded DNA binding protein (gp32) to ssDNA lattices at replication fork junctions. Nucleic Acids Res 44, 10691–10710 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Murphy MC, Rasnik I, Cheng W, Lohman TM, and Ha T, Probing single-stranded DNA conformational flexibility using fluorescence spectroscopy. Biophys. J 86, 2530–2537 (2004). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Ha T, Lohman TM, and Kozlov AG, Single-molecule views of protein movement on single-stranded DNA. Ann. Rev. Biophys 41, 295–319 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Chen H, Meisburger SP, Pabit SA, Sutton JL, Webb WW, and Pollack L, Ionic strength-dependent persistence lengths of single-stranded RNA and DNA. Proc. Nat. Acad. Sci 109, 799–804 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Phelps C, Lee W, Jose D, von Hippel PH, and Marcus AH, Single-molecule FRET and linear dichroism studies of DNA ‘breathing’ and helicase binding at replication fork junctions. Proc Natl Acad Sci U S A 110, 17320 – 17325 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Phelps C, Israels B, Jose D, Marsh MC, von Hippel PH, and Marcus AH, Using multi-order time correlation functions (TCFs) to elucidate biomolecular reaction pathways from microsecond single-molecule fluorescence experiments. J. Phys. Chem. B 120, 13003–13016 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Tamimi A, Landes T, Lavoie J, Raymer MG, and Marcus AH, Fluorescence-detected Fourier transform electronic spectroscopy by phase-tagged photon counting. Opt. Express 28, 25194–25214 (2020). [DOI] [PubMed] [Google Scholar]
- 22.Berg MA, and Darvin JR, Measuring a hidden coordinate: Rate-exchange kinetics from 3D correlation functions. J. Chem. Phys 145, 054119 (2016). [DOI] [PubMed] [Google Scholar]
- 23.Berg MA, and Kaur H, Nonparametric analysis of nonexponential and multidimensional kinetics. I. Quantifying rate dispersion, rate heterogeneity, and exchange dynamics. J. Chem. Phys 146, 054104 (2017). [DOI] [PubMed] [Google Scholar]
- 24.Kryvohuz M, and Mukamel S, Nonlinear response theory in chemical kinetics. J. Chem. Phys 140, 034111 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Beddard G, Applying Maths in the Chemical and Biological Sciences: An example-based approach (Oxford University Press, Oxford, 2009). [Google Scholar]
- 26.Reichl LE, A Modern Course in Statistical Physics (John Wiley & Sons, New York, 1998). [Google Scholar]
- 27.Lee W, Jose D, Phelps C, Marcus AH, and von Hippel PH, A single-molecule view of the assembly pathway, subunit stoichiometry and unwinding activity of the bacteriophage T4 primosome (helicase-primase) complex. Biochemistry 52, 3157–3170 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Joo C, McKinney SA, Nakamura M, Rasnick I, Myong S, and Ha T, Real-time observation of RecA filament dynamics with single monomer resolution. Cell 126, 515–527 (2006). [DOI] [PubMed] [Google Scholar]
- 29.Rasnick I, McKinney SA, and Ha TJ, Nonblinking and long-lasting single-molecule fluorescence imaging. Nature Meths 3, 891–893 (2006). [DOI] [PubMed] [Google Scholar]
- 30.Chandradoss SD, Haagsma AC, Lee YK, Hwang J-H, Nam J-M, and Joo C, Surface passivation for single-molecule protein studies. J. Vis. Exper 86, e50549 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Lakowicz JR, Principles of Fluorescence Spectroscopy (Kluwer Acad. / Plenum, New York, 1999). [Google Scholar]
- 32.Screenivasan R, Shkel IA, Chhabra M, Drennan A, Heitkamp S, Wang H-C, Sridevi MA, Plaskon D, McNerney C, Callies K, et al. , Fluorescence-detected conformational changes in duplex DNA in open complex formation by Escherichia coli RNA polymerase: Upstream wrapping and downstream bending precede clamp opening and insertion of the downstream duplex. Biochemistry 59, 1565–1581 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Screenivasan R, Chhabra I,M, Saecker R, Lingeman E, Poulos M, McCaslin D, Capp MW, Artsimovitch I, and Record MT Jr., Flourescence resonance energy transfer characterization of DNA wrapping in closed and open Escherichia coli RNA polymerase - λPR promoter complexes. Biochemistry 55, 2174–2185 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Chandler D, Introduction to Modern Statistical Mechanics (Oxford University Press, 1987). [Google Scholar]
- 35.Zimmt MB, Peterson KA, and Fayer MD, Short polymer chain statistics and the relationship to end to end electronic excitation transport: Random walks with variable step lengths. Macromolecules 21, 1145–1154 (1988). [Google Scholar]
- 36.Gillespie DT, Stochastic simulation of chemical kinetics. Annu. Rev. Phys. Chem 58, 35–55 (2007). [DOI] [PubMed] [Google Scholar]
- 37.McQuarrie DA, Simon JD, Physical Chemistry: A molecular approach (University Science Books, Sausolito (USA), 1997). [Google Scholar]
- 38.Glasser D, Horn FJM, and Meidan R, Properties of certain zero column-sum matrices with applications to the optimization of chemical reactors. J. Math. Anal. Appl 73, 315–337 (1980). [Google Scholar]
- 39.Kumari N, Cuiba MA, and Levitus M, Photophysical properties of the hemicyanine Dy-630 and its potential as a single-molecule fluorescent probe for biophysical applications. Methods Appl. Fluoresc 8, 015004 (2020). [DOI] [PubMed] [Google Scholar]
- 40.Sanborn ME, Connolly BK, Gurunathan K, and Levitus M, Fluorescence properties and photophysics of the sulgoindocyanine Cy3 linked covalently to DNA. J. Phys. Chem. B 111, 11064–11074 (2007). [DOI] [PubMed] [Google Scholar]
- 41.Jia KYW, Xia A, Li S, Gong F, and Yang G, Characterization of photoinduced isomerization and intersystem crossing of the cyanine dye Cy3. J. Phys. Chem. A 111, 1593–1597 (2007). [DOI] [PubMed] [Google Scholar]
- 42.Ciuba MA, and Levitus M, Manganese-induced triplet blinking and photobleaching of single molecule cyanine dyes. ChemPhysChem 14, 3495–3502 (2013). [DOI] [PubMed] [Google Scholar]
- 43.Kuzmin VA, and Darmanyan AP, Study of sterically hindered short-lived isomers of polymethine dyes by laser photolysis. Chem. Phys. Lett 54, 159–163 (1978). [Google Scholar]
- 44.Jafilan S, Klein L, Hyun C, and Florián J, Intramolecular base stacking of dinucleotide monophosphate anions in aqueous solution. J. Phys. Chem. B 116, 3613–3618 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Protozanova E, Yakovchuk P, and Frank-Kamenetskii MD, Stacked-unstacked equilibrium at the nick site of DNA. J. Mol. Biol 342, 775–785 (2004). [DOI] [PubMed] [Google Scholar]
- 46.Goddard NL, Bonnet G, Krichevsky O, and Libchaber A, Sequence dependent rigidity of single stranded DNA. Phys. Rev. Lett 85, 2400–2403 (2000). [DOI] [PubMed] [Google Scholar]
- 47.Bonnet G, Krichevsky O, and Libchaber A, Kinetics of conformational fluctuations in DNA hairpin-loops. Proc. Nat. Acad. Sci 95, 8602–8606 (1998). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Pörschke D, Molecular states in single-stranded adenylate chains by relaxation analysis. Biopolymers 17, 315–323 (1978). [Google Scholar]
- 49.Wu P, Nordlund TM, Gildea B, McLaughlin LW, Base stacking and unstacking as determined from a DNA decamer containing a fluorescent base. Biochemistry 29, 6508–6514 (1990). [DOI] [PubMed] [Google Scholar]
- 50.von Hippel PH, “On the molecular bases of the speficity of interaction of transcriptional proteins with genome DNA” in Biological Regulation and Development, Goldberger RF, Ed. (Plenum Publishing Corp., New York, 1979), vol. 1, pp. 279–347. [Google Scholar]
- 51.von Hippel PH, Johnson NP, and Marcus AH, 50 years of DNA ‘breathing’: Reflections on old and new approaches. Biopolymers 99, 923–954 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Plumridge A, Meisburger SP, Andresen K, and Pollack L, The impact of base stacking on the conformations and electrostatics of single-stranded DNA. Nucleic Acids Res 45, 3932–3943 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Lowe MJ, and Schellman JA, Solvent effects on dinucleotide conformation. J. Mol. Biol 65, 91–109 (1972). [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
