Skip to main content
Biophysical Journal logoLink to Biophysical Journal
. 2006 Dec 1;92(4):L30–L32. doi: 10.1529/biophysj.106.099572

Sequence-Dependent Twist-Stretch Coupling in DNA

Timothée Lionnet *, Filip Lankaš
PMCID: PMC1783891  PMID: 17142263

Abstract

Recent single-molecule micromanipulation experiments on DNA subject to small distortion revealed positive coupling between DNA stretching and twisting—for instance, DNA elongates when overtwisted. Here we propose a method to calculate the twist-stretch coupling constant specific to a DNA fragment of a given sequence. The method employs a sequence-dependent dinucleotide force field and is based on constrained minimization of the fragment's deformation energy. Using a force field inferred from atomistic molecular dynamics simulations, we obtain the twist-stretch coupling for random sequence to be 0.30 nm/turn, close to experimental values. An exhaustive calculation for all oligomers of nine basepairs yields values between 0.14 and 0.45 nm/turn, positively correlated with the contents of pyrimidine-purine steps in the sequence. Our method is simple to use and allows one to explore the hypothesis that some sequences may be optimized for twist-stretch coupling.


Mechanical properties of DNA play an important role in a range of biological processes such as DNA binding to proteins or DNA packaging in viruses. The helical structure of DNA implies continuum elastic models with extra terms in the quadratic elastic energy function, absent in the case of a homogeneous isotropic rod. In particular, a coupling term involving both twisting and extension is required (1,2). Previous analysis (13) of experimental single-molecule data has suggested that DNA shortens upon overtwisting, in agreement with a simple picture of DNA as a stack of plates connected by rigid backbones. However, two independent micromanipulation studies have recently revealed that kilobase-long DNA molecules under small distorsion, gently pulled to reduce thermal fluctuations and suppress stress-induced buckling, actually elongate when overtwisted (4,5) as well as overwind when stretched (5). Within the framework of linear elasticity, this behavior is described by a single proportionality constant indicating the length change upon the imposed rotation, which has been found to be 0.4 ± 0.2 nm/turn by one study (4) and 0.5 ± 0.1 nm/turn by the other (5). This coupling constant does not depend on the length of the molecule. For shorter molecules, it might be more practical to express it in the units of angstroms per degree (Inline graphic).

These values describe the average property of the long DNA molecules investigated in the experiments. However, it has been proposed (5,6) that proteins may exploit the positive coupling between elongation and overtwisting on a much shorter length scale to recognize binding sites with variable sequence lengths; in case of basepair deletion (or insertion), proteins may recognize the site by simultaneously stretching and overtwisting (or compressing and undertwisting) the DNA. Certain sequences may be optimized for coupled twisting and stretching (5), and the question arises concerning the sequence-dependence of the twist-stretch (TS) coupling. This problem is addressed in this study.

We consider a DNA fragment as a chain of dinucleotide (or basepair (bp)) steps. There are 10 unique dinucleotide sequences. The geometry of each step is described in our model by the six basepair step parameters, namely tilt, roll, twist, shift, slide, and rise (7), defining the translation and rotation of the second basepair in the step with respect to the first one. In the absence of external forces, the parameters adopt their minimum energy values. We assume that the energy associated with the deformation of a basepair step away from its minimum energy geometry has the form of a six-dimensional harmonic potential:

graphic file with name M2.gif (1)

where the index a (Inline graphic) labels the unique step sequence, Inline graphic is the deformation energy, Inline graphic the sequence-specific stiffness matrix, and Inline graphic the six-component column vector containing the deviations of the step parameters from their minimum energy values (T indicates the matrix transpose). We further assume that the total deformation energy of a DNA fragment is a sum of the energies of the individual steps.

Let us look first at the twist-stretch coupling in the individual dinucleotide steps. Suppose that twist ω in excess to the minimum energy twist is imposed on the step. The corresponding deviations of the other parameters can be found by minimizing the energy (1) under the constraint of the imposed twist.

If i is the index of the twist component in w (in the parameter ordering mentioned above, Inline graphic), we have

graphic file with name M8.gif
graphic file with name M9.gif (2)

which yields (we are omitting the superscript a)

graphic file with name M10.gif
graphic file with name M11.gif (3)

The solution to Eq. 3 can be written as Inline graphic, where Inline graphic is the solution to Eq. 3 with Inline graphic. Thus, the change of step parameters is proportional to the imposed excess twist, the proportionality coefficients being the components of Inline graphic. In particular, Inline graphic is the coupling coefficient between twist and the parameter rise, which measures the step extension along the helical axis. Thus, Inline graphic (we will subsequently denote it by d) is the twist-elongation coupling at the level of individual dinucleotide steps.

To calculate its sequence-dependent values, we take the matrices Inline graphic from a dinucleotide force field based on atomistic molecular dynamics simulations (8). These stiffness data have been recently found to perform particularly well in predicting binding affinity in a protein-DNA complex with an important indirect readout component (9). The results are shown in Table 1. Note that values for different steps may differ by an order of magnitude.

TABLE 1.

Twist-rise coupling d (Å·deg−1) and effective twist stiffness α (kcal·mol−1deg−2) as a function of the dinucleotide sequence

CG CA TA AG GG AA GA AT AC GC
d 0.0190 0.0172 0.0078 0.0101 0.0048 0.0042 0.0037 0.0042 0.0020 0.0012
α 0.0173 0.0178 0.0268 0.0307 0.0312 0.0251 0.0218 0.0404 0.0332 0.0261

What is the energy cost of imposing excess twist ω on a basepair step? From Eq. 1 we find

graphic file with name M19.gif (4)

so that the energy is a quadratic function of the excess twist, the effective stiffness constant being Inline graphic. Its sequence-dependent values are summarized in Table 1. Note the difference from the twist stiffness reported earlier (8) where it referred to the diagonal element of K corresponding to twist, i.e., stiffness with respect to twist while the other parameters retain their minimum energy values. Here, in contrast, α refers to the stiffness associated with twisting while the other parameters are free to relax.

Let us now calculate the twist-stretch coupling constant for an arbitrary DNA molecule. We assume that: a), thermal fluctuations are suppressed, b), all the imposed torsion is distributed into twists of the dinucleotide steps, and c), the total length of the molecule is equal to the sum of rises of the dinucleotide steps. These assumptions are plausible for molecules under moderate tension and fragments of small curvature bound to proteins.

Consider first just two steps in a row, with effective twist stiffness constants α1 and α2. The imposed total excess twist ω will be distributed in the step twists ω1 and ω2, ω = ω1 + ω2, in such a way that the energy, Inline graphic, be minimal. Performing the minimization, we find Inline graphic where the total twist stiffness Inline graphic is given by the relation Inline graphic. By induction we obtain the following result: the total twist stiffness Inline graphic for a chain of n steps with twist stiffnesses Inline graphic is given by the relation

graphic file with name M27.gif (5)

Now consider n steps with the total imposed excess twist ω, distributed among the steps: Inline graphic. To find the excess twists Inline graphic of the individual steps, we have to minimize the deformation energy of the chain, Inline graphic, with the constraint Inline graphic. The method of Langrange multipliers implies unconstrained minimization of the function Inline graphic, where Inline graphic. We obtain the following system of n+1 linear equations:

graphic file with name M34.gif
graphic file with name M35.gif (6)

It is obvious by mere inspection that the solution to this system is Inline graphic, where Inline graphic is given by Eq. 5. Thus, the excess twist in a particular step is proportional to the ratio of the total twist stiffness of the whole molecule to the twist stiffness of that step. Consequently, steps of the same type get the same amount of excess twist.

Finally, denoting by Inline graphic the twist-rise coupling of the individual steps, we find that the total change in rise is Inline graphic, or

graphic file with name M40.gif (7)

But according to assumption c, the total change in rise is equal to the elongation of the molecule. Thus, the expression in the parentheses is the desired TS coupling constant for the whole molecule. Note that equal step types contribute equally to the elongation. Assuming that there are Inline graphic steps of type a in the molecule (Inline graphic), we can rewrite Eq. 7 as

graphic file with name M43.gif
graphic file with name M44.gif
graphic file with name M45.gif (8)

where Inline graphic and Inline graphic are the values of α and d for step of type a taken from Table 1. Equations 8 thus enable one to easily calculate the TS coupling Inline graphicof a DNA molecule provided the number of steps of the ten unique types, Inline graphic, is known.

Alternatively we may express Inline graphic in terms of the fraction Inline graphic of the step type a in the sequence with a total of n steps: Inline graphic, so that Eq. 8 now becomes

graphic file with name M53.gif
graphic file with name M54.gif
graphic file with name M55.gif (9)

The TS coupling of a sequence, Inline graphic, is thus computed as a certain kind of weighted average of the TS coupling coefficients of the individual basepair steps, Inline graphic. Note that molecules of different length but identical composition would have the same TS coupling constant.

Let us calculate, as an example, the TS coupling for the tetramer ATAT. We have Inline graphic, Inline graphicand Eqs. 8 yield

graphic file with name M60.gif
graphic file with name M61.gif

For a long alternating polymer (AT)m, Inline graphic, we have Inline graphic, Inline graphic and using Eqs. 9 with Inline graphic (this is of course equivalent to the use of Eqs. 8 with Inline graphic) yields 0.23 nm/turn. Similarly we obtain 0.23 nm/turn also for (GA)m, and 0.43 nm/turn for both (CG)m and (AC)m.

We now calculate the TS coupling for a random sequence. By random we mean that at any position and independently on the rest of the sequence, each of the four nucleotides, A, T, C, G, is present with equal probability, namely 1/4. This implies that any dinucleotide of the form IJ, where I and J can be any of the four, is present with probability 1/16. For dinucleotides that are not self-complementary, one of the 10 unique steps is represented by two different sequences. Thus, the unique steps that are not self-complementary must be taken with a weight of 2 in the calculation, and the self-complementary ones with a weight of 1. Using Eqs. 8 with Inline graphic and other Inline graphic equal to 2, we obtain 0.30 nm/turn as the value of TS coupling for a random sequence.

One may be tempted to calculate the TS coupling in a different way, for instance computing first the arithmetic mean of the stiffness matrices, thus obtaining a sort of “average step”, then using Eq. 3. This procedure, however, seems to lack any rigorous justification and indeed yields, for a random sequence, a value of 0.21 nm/turn, further away from the experimental results. Similarly, a mere arithmetic mean of Inline graphic over a random sequence would yield 0.26 nm/turn.

Equations 8 or 9 allow one to investigate a large pool of sequences. As an example, we performed an exhaustive calculation for all nine bp oligomers (131,072 unique sequences) and got the distribution of TS coupling constants shown in Fig. 1. The values span the range from 0.14 to 0.45 nm/turn and are positively correlated with the contents of pyrimidine-purine steps in the sequence (correlation coefficient 0.76). The distribution is essentially bimodal (the origin of this shape remains to be determined). Similar calculations may serve to explore the hypothesis that some sequences may be optimized for TS coupling (5). Such a task is, however, outside the scope of this study.

FIGURE 1.

FIGURE 1

Distribution of TS coupling constants in the ensemble of all nine bp sequences. The two highest maxima are at 0.27 and 0.33 nm/turn, respectively.

Acknowledgments

We thank David Bensimon, Vincent Croquette, and Richard Lavery for helpful discussions.

This work was partially supported by the Swiss National Science Foundation (F.L.) and Centre National de la Recherche Scientifique, Université Pierre et Marie Curie, Paris 6 (T.L.).

References

  • 1.Kamien, R. D., T. C. Lubensky, P. Nelson, and C. S. O'Hern. 1997. Direct determination of DNA twist-stretch coupling. Europhys. Lett. 38:237–242. [Google Scholar]
  • 2.Marko, J. F. 1997. Stretching must twist DNA. Europhys. Lett. 38:183–188. [Google Scholar]
  • 3.Moroz, J. D., and P. Nelson. 1998. Entropic elasticity of twist-storing polymers. Macromolecules. 31:6333–6347. [Google Scholar]
  • 4.Lionnet, T., S. Joubaud, R. Lavery, D. Bensimon, and V. Croquette. 2006. Wringing out DNA. Phys. Rev. Lett. 96:178102. [DOI] [PubMed] [Google Scholar]
  • 5.Gore, J., Z. Bryant, M. Nollmann, M. U. Le, N. R. Cozzarelli, and C. Bustamante. 2006. DNA overwinds when stretched. Nature. 442:836–839. [DOI] [PubMed] [Google Scholar]
  • 6.Kosikov, K. M., A. A. Gorin, V. B. Zhurkin, and W. K. Olson. 1999. DNA stretching and compression: large-scale simulations of double helical structures. J. Mol. Biol. 289:1301–1326. [DOI] [PubMed] [Google Scholar]
  • 7.Dickerson, R. E., M. Bansal, C. R. Calladine, S. Diekman, W. N. Hunter, O. Kennard, R. Lavery, H. C. M. Nelson, W. K. Olson, W. Saenger, Z. Shakked, H. Sklenar, D. M. Soumpasis, C.-S. Tung, E. von Kitzing, A. H.-J. Wang, and V. B. Zhurkin. 1989. Definitions and nomenclature of nucleic acid structure parameters. J. Mol. Biol. 205:787–791. [DOI] [PubMed] [Google Scholar]
  • 8.Lankas, F., J. Sponer, J. Langowski, and T. E. Cheatham III. 2003. DNA basepair step deformability inferred from molecular dynamics simulations. Biophys. J. 85:2872–2883. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Becker, N., L. Wolff, and R. Everaers. 2006. Indirect readout: detection of optimized subsequences and calculation of relative binding affinities using different DNA elastic potentials. Nucleic Acids Res. 34:5638–5649. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Biophysical Journal are provided here courtesy of The Biophysical Society

RESOURCES