Abstract
We have embedded the hexameric triplet repeats (CAG)6 and (CTG)6 between two (GC)3 domains to produce two 30-mer hairpins with the sequences d[(GC)3(CAG)6(GC)3] and d[(GC)3(CTG)6(GC)3]. This construct reduces the conformational space available to these repetitive DNA sequences. We find that the (CAG)6 and (CTG)6 repeats form stable, ordered, single-stranded structures. These structures are stabilized at 62°C by an average enthalpy per base of 1.38 kcal·mol−1 for the CAG triplet and 2.87 kcal·mol−1 for the CTG triplet, while being entropically destabilized by 3.50 cal·K−1·mol−1 for the CAG triplet and 7.6 cal·K−1·mol−1 for the CTG triplet. Remarkably, these values correspond, respectively, to 1/3 (for CAG) and 2/3 (for CTG) of the enthalpy and entropy per base values associated with Watson–Crick base pairs. We show that the presence of the loop structure kinetically inhibits duplex formation from the two complementary 30-mer hairpins, even though the duplex is the thermodynamically more stable state. Duplex formation, however, does occur at elevated temperatures. We propose that this thermally induced formation of a more stable duplex results from thermal disruption of the single-stranded order, thereby allowing the complementary domains to associate (perhaps via “kissing hairpins”). Our melting profiles show that, once duplex formation has occurred, the hairpin intermediate state cannot be reformed, consistent with our interpretation of kinetically trapped hairpin structures. The duplex formed by the two complementary oligonucleotides does not have any unusual optical or thermodynamic properties. By contrast, the very stable structures formed by the individual single-stranded triplet repeat sequences are thermally and thermodynamically unusual. We discuss this stable, triplet repeat, single-stranded structure and its interconversion with duplex in terms of triplet expansion diseases.
A variety of debilitating human diseases with non-Mendelian inheritance patterns have been traced to highly repetitive DNA sequences. Sequence analysis has shown the disease state to be associated with expansion of repetitive sequences of three nucleotides, most commonly (CAG)n/(CTG)n, and (CCG)n/(CGG)n repeats, copied beyond a critical threshold value. Consequently, these diseases collectively are referred to as triplet expansion diseases (1–4). Even in healthy subjects, three nucleotide repeat domains in chromosomes are highly variable in length but generally do not exceed the threshold value of ≈30 repeats. Various triplet expansion diseases differ in the phenotype of the disease, the location of the expanded DNA domain, and whether the expanded domain is found within gene introns, gene exons, or upstream/downstream of genes in putative regulatory regions (5). The one common feature among the different diseases is that the affected repetitive DNA segment has expanded beyond the repeat number found in the healthy population.
Proposed mechanisms for DNA expansion assume that the structure and dynamics of the repetitive DNA segments themselves play critical roles in the development of the disease (4–7). This assumption is based on the observation that cloned triplet repeat sequences also expand and contract within bacterial hosts (8–13). It has been suggested that the relevant DNA sequences either have unusual properties in the duplex state or adopt unusually stable alternative secondary structures involving single-stranded domains, which alter the course of the “normal” cellular DNA chemistry (14). Initial structural and some thermodynamic studies of model poly/oligonucleotides have indeed indicated that single strands consisting of repeated copies of (CAG)n, (CTG)n, and (CCG)n are able to adopt extensive secondary structures (14–28); however, conformational heterogeneity and/or kinetically trapped metastable states have precluded the determination of reliable thermodynamic data for this important class of DNA sequences (15, 29–33). The absence of reliable thermodynamic data prevents a rational evaluation of the different models and precludes theoretical predictions as to which sequences may be prone to assume structures that may lead to the disease state (34).
To alleviate this shortcoming, we have designed a series of oligonucleotide model systems in which two triplet repeat sequences, (CAG)6 and (CTG)6, are embedded between alternating (GC)3 sequences, thereby, in principle, allowing formation of stable stem/loop (hairpin) structures, as shown in Scheme . The stem portion of the hairpin consists of alternating GC base pairs, which act as a duplex clamp. The loop region contains the single-stranded triplet repeat sequence. The stem restricts the conformational space of the triplet repeat sequences with respect to slipped and misaligned structures and perhaps also prevents the formation of multiple kinetically trapped intermediates. The triplet repeat sequences are free to adopt whatever structure they prefer within the constraints imposed by the hairpin stem. The sequences chosen for this purpose, X[CAG]6X and X[CTG]6X, are complementary to each other, such that our constructs allow us to investigate the thermodynamics of each triplet repeat sequence in isolation and also when combined to form the corresponding duplex, as shown in Scheme . We show below that each of the triplet repeat sequences significantly adds to the overall thermodynamic stability of the hairpins; that the duplex is structurally and thermodynamically indistinguishable from canonical B form DNA; and that formation of the duplex is inhibited at low temperatures (presumably due to loop structure), even though the duplex is the thermodynamically most stable state.
Scheme 1.
Proposed model for the interaction of the two complementary triplet repeat oligonucleotides.
Materials and Methods
Materials.
Oligonucletides were synthesized and purified as previously described (35, 36). Concentrations were determined optically in 10 mM Na cacodylate, pH 7.0, buffer by using a Varian 300 UV/Vis spectrophotometer. Extinction coefficients at 260 nm were determined by phosphate assay (37, 38) in denaturing conditions (80 or 90°C) and were found to be
= 280,000 M−1⋅cm−1 (90°C);
= 308,000 M−1⋅cm−1 (90°C);
= 249,670 M−1⋅cm−1 (80°C), where X = (GC)3. Oligomer stocks were dialyzed extensively against at least two changes of buffer (10 mM Na cacodylate/1 mM Na2EDTA/Y mM NaCl) of the desired ionic strength.
UV Absorption Measurements.
Temperature-dependent UV absorption measurements were conducted and analyzed as previously described by using an Aviv (Lakewood, NJ) DS14 UV/VIS spectrophotometer equipped with a five-cell thermoelectric cell holder (35, 39–41). Unless otherwise indicated, oligonucleotide concentrations were always 1 μM in single strand or duplex.
Circular Dichroism (CD) Spectroscopy.
CD spectra as a function of temperature were recorded by using an Aviv DS61 Spectropolarimeter equipped with a thermoelectric cell holder in a 1-mm path length cell. After subtraction of buffer scans, sample scans were normalized for concentration (42), by using the software provided. Concentrations were 16 μM in single strand and 8 μM in duplex.
Calorimetry.
Excess heat capacity measurements were performed by using a NanoDSCII differential scanning calorimeter (DSC) (CSC, Provo, UT) with nominal cell volumes of 0.3 ml. Dialyzed samples of nominal concentration 50 μM (single strands) or 25 μM (duplex) were placed in the sample cell, whereas the appropriate buffer was placed in the reference cell. Samples were scanned repeatedly at a constant scan rate of 1°C/min in the temperature range 0–105°C. After buffer subtraction, the excess heat capacity data were normalized for DNA concentration and analyzed as previously described (41, 43, 44). ΔCp values were determined from the difference in the pre- and post-transition baseline extrapolated to the Tm of the transition.
Results and Discussion
The X[CAG]6X and X[CTG]6X Oligonucleotides Form Stable Hairpins.
By design, we expect the ordered structure for each triplet repeat 30-mer molecule to be a hairpin with a stem of six alternating purine/pyrimidine GC base pairs and a large loop domain comprised of the triplet repeat sequence (Scheme ). Although a variety of alternative structural models of intermolecular dimers with bulges also are feasible, the observed lack of concentration dependence of the Tm (not shown) for each 30 mer is consistent with these oligonucleotides adopting monomolecular hairpin structures (41). Hairpin formation also is supported by the moderate dependence of the Tm on the ionic strength (Table 1) characteristic of such secondary structures (45, 46). Taken together with the optical melting curve measurements presented below, we interpret the data as indicative of the formation of hairpin structures, consistent with our design objective.
Table 1.
Thermodynamic data for oligonucleotide order–disorder processes in 100 mM Na+/10 mM cacodylate/1 mM EDTA, pH 6.8, buffer
| Oligomer | Tm |
ΔH
|
ΔS
|
ΔCp |
ΔH
|
dTm/ dlog[Na+] |
|---|---|---|---|---|---|---|
| X[TTT]6X | 62.6 | 42.3 | 125.9 | 410 | 50.7 | ND |
| X[CAG]6X | 80.9 | 86.5 | 244.3 | 1,050 | 75.1 | 11.26 |
| X[CTG]6X | 81.5 | 110.2 | 310.7 | 850 | 101.4 | 11.79 |
| X[CAG]6X and X[CTG]6X | 70.7 | −66.7 | −193.9 | ND | (−199.5) | 0.13 |
| 89.2 | 247.2 | 682.2 | ND | 273.4 | 15.05 | |
| X[CAG]6X⋅X[CTG]6X | 89.3 | 278.7 | 768.9 | 1,470 | 244.6 | 15.99 |
ND, not determined.
°C, estimated error is ±0.2°C.
kcal·mol−1, estimated error is ±5%.
cal·mol−1·K−1, estimated error is ±5%.
cal·mol−1·K−1, estimated error is ±15%.
Free energies (ΔG) at any temperature can be calculated from the data in columns 2–5 by using the equation: ΔGT = ΔHTm − TΔSTm − ΔCp(Tm − T) + TΔCpln(Tm/T).
°C, estimated error is ±0.5°C.
Optical Evidence.
We observe clear differences in the CD spectra for each triplet repeat containing oligonucleotide (X[CAG]6X and X[CTG]6X) at low (0°C) and high (95°C) temperatures (Fig. 4A, which is published as supporting information on the PNAS web site, www.pnas.org). Such temperature-dependent differences suggest that these two molecules adopt ordered structures at low temperature (e.g., hairpins), which are disrupted at higher temperatures. Further, we observe substantial differences in the CD spectra of the two triplet repeat containing oligonucleotides when compared with the spectrum of X[TTT]6X, in which the triplet repeat sequences are replaced entirely by thymidine residues. This observation suggests significant differences in the ordered low-temperature structures of the two triplet repeat-containing molecules relative to the molecule with the [TTT]6 domain. Because the [TTT]6 sequence is assumed to exhibit minimal order (47–49), we ascribe the order apparent in the low-temperature CD spectrum of X[TTT]6X to its terminal (GC)3 stem region. We therefore propose that the temperature-dependent differences in CD spectra observed between the two triplet repeat-containing molecules and our control hairpin X[TTT]6X are due to contributions from order in the single-stranded, looped, triplet repeat sequences, as well as to differences in the intrinsic optical properties of the nucleotide bases. Sharp transitions observed near 80°C in the corresponding optical melting curves (Fig. 4B, which is published as supporting information on the PNAS web site) for X[CAG]6X and X[CTG]6X, as compared with the broad transition at 62°C for the X[TTT]6X hairpin, also are consistent with the presence of stabilizing ordered structures in both triplet repeat containing loops. NMR studies will provide additional insight into the detailed structure of the loop order we observe here.
Calorimetric Studies.
Calorimetrically measured excess heat capacity curves for the X[CAG]6X (red) and X[CTG]6X (blue) triplet-containing hairpins and the control X[TTT]6X hairpin (green) are shown in Fig. 1. The corresponding thermodynamic parameters are listed in Table 1. The measured thermodynamic parameters for the proposed hairpin structures refer to the combined contributions from the duplex stem domain and from the nominally single-stranded triplet repeat loop domain. To isolate the thermodynamic contribution of order in the triplet repeat loop regions of these hairpins, we have compared these hairpins to the similarly constructed hairpin, X[TTT]6X. Implicit in the choice of X[TTT]6X as a thermodynamic reference is the assumption that the all-T loop resembles, more than any other base combination, an unstructured loop (47, 49–54). In support of this assumption, we note that the measured enthalpy for X[TTT]6X is in good agreement with the enthalpy predicted by nearest-neighbor data (55, 56) for the hairpin stem. This observation suggests that an all-T loop exhibits no enthalpically stabilized structure. We recognize that this dissection of the thermodynamic parameters is imperfect due to the differences inherent in the junction between the conserved stem and the variable loop regions, including the possibility that the stem may be extended by one GC base pair in the X[CAG]6X and X[CTG]6X hairpins. In practice, any junction effects are subsumed into the loop contribution in our analysis.
Fig 1.
Excess heat capacity vs. temperature profiles (DSC melting curves) of X[CAG]6X (red), X[CTG]6X (blue), and X[TTT]6X (green) in 100 mM Na+ buffer.
The Triplet Repeat Loop Sequences Strongly Stabilize the Hairpin Structures.
Thermal stability.
Inspection of Fig. 1 and Table 1 reveals that the presence of the [CAG]6 and [CTG]6 triplet repeat loop domains results in an increase in thermal stability of the hairpins relative to the X[TTT]6X hairpin by 18.3 and 18.9°C, respectively. This increase in Tm is due to the apparent increase in order in the triplet repeat loop domains relative to the unstructured [TTT]6 loop. These data are consistent with suggestions that triplet repeat sequences can adopt extensive secondary structures (14–28); however, these data alone do not imply anything about the nature of that order.
Thermodynamic stability.
Our DSC measurements reveal that, in addition to an increase in thermal stability (Tm), the presence of either triplet repeat sequence also results in an increase in the thermodynamic stability (ΔG) of the hairpins relative to the X[TTT]6X reference hairpin. Inspection of the data in Table 1 reveals that the observed increases in the stabilities of the two hairpins with triplet repeat loops result from highly favorable enthalpic and partially compensating unfavorable entropic contributions. Because the heat capacity changes associated with the order–disorder transitions of each hairpin are not identical, it is important to compare enthalpy and entropy data at a common reference temperature (57–59). To avoid long extrapolations, we select the Tm of the X[TTT]6X as the thermodynamic reference temperature (62.6°C) for this comparison. These data, which can be derived directly from the data in Table 1, are summarized in Tables 2 and 3, which are published as supporting information on the PNAS web site. Although the Tm values for the two triplet repeat containing hairpins are nearly identical, the enthalpy and entropy changes associated with their order–disorder transitions are markedly different, an important feature that would be missed if one simply compared thermal stabilities (Tm values). As expected for favorably ordered domains, the triplet repeat loops enthalpically stabilize the hairpins by ΔΔHcal = 24.9 kcal⋅mol−1 for the [CAG]6 domain (67.2–42.3) and by ΔΔHcal = 51.7 kcal⋅mol−1 for the [CTG]6 domain (94.0–42.3), relative to the enthalpic stabilization imparted to the X[TTT]6X hairpin by the [TTT]6 loop. Also as expected for ordered domains, the triplet repeat loops entropically destabilize the hairpins by ΔΔScal = 63.1 cal⋅mol−1⋅K−1 for the [CAG]6 domain (189.0–125.9) and by ΔΔScal = 138.0 cal⋅mol−1⋅K−1 for the [CTG]6 domain (263.9–125.9), relative to the entropic destabilization imparted to the X[TTT]6X hairpin by the [TTT]6 loop. Because the [TTT]6 loop is assumed to be highly disordered, its entropic contribution likely represents only loop configurational entropy (50). Thus, the large entropic and enthalpic impacts of the [CAG]6 and [CTG]6 loops argue strongly for significant order in the triplet repeat sequences in their single-stranded states. The magnitudes of these loop impacts are remarkably large, representing ≈1/3 for [CAG]6 and 2/3 for [CTG]6 of the total enthalpy and entropy values expected for the corresponding Watson–Crick base pairs (55, 56). These results demonstrate that single-stranded triplet repeat regions can exhibit unusual stability, with the energetic nature of this order depending on the specific triplet repeat sequence. We use the term “triplet repeat structure” as a generic description for the structures adopted by all triplet repeat sequences in their single-stranded state. It is assumed that this term does not necessarily refer to a unique secondary structure but rather to an ensemble of structures adopted by triplet repeat DNA sequences in solution, without all such sequences necessarily adopting the same ensemble of secondary structures.
Taken together, our data demonstrate that the X[CAG]6X and X[CTG]6X oligonucleotides form stable hairpin structures with significant order in the triplet repeat loop domains. We consider our approach using constrained hairpin structures to derive thermodynamic data for the highly repetitive triplet repeat sequences superior to traditional studies of unconstrained model oligonucleotides. In unconstrained systems, highly repetitive sequences are known to adopt a wide range of secondary structures, including slipped DNA structures, inter- and intramolecular foldback structures, and others (7, 29, 30). Traditional approaches therefore result, at best, in thermodynamic data that are averaged over an often poorly defined distribution of populations. The likelihood of significantly populated alternative conformations is greatly diminished by the combined hairpin/loop approach used here.
Hairpin–Hairpin Interactions.
The X[CAG]6X and X[CTG]6X hairpins do not interact at low temperature.
CD spectra imply that the X[CAG]6X and X[CTG]6X hairpins do not interact at low temperature, despite having 18-base looped domains that are complementary to each other. The CD spectrum of an equimolar mixture of X[CAG]6X and X[CTG]6X at low temperature (before heating) is essentially identical to the calculated spectrum for noninteracting oligonucleotides obtained by summing weighted CD spectra of X[CAG]6X and X[CTG]6X (Fig. 5A, which is published as supporting information on the PNAS web site). These spectra differ significantly from the CD spectrum of the corresponding X[CAG]6X⋅X[CTG]6X duplex. These observations suggest that the two hairpins do not interact with each other at 0°C. In fact, we find such noninteracting behavior persists even up to ≈60°C.
The X[CAG]6X and X[CTG]6X hairpins do interact at elevated temperatures (>60°C) below the global hairpin melting temperature, to form a duplex.
Because the 18-base triplet repeat loop domains in X[CAG]6X and X[CTG]6X are complementary to one another, one might expect loop–loop interactions to occur. Once formed, such an initiation complex, a “kissing hairpin”-like structure (60–64), is expected to “zip up” to form the fully paired duplex, even at low temperatures, as cartooned in Scheme . Indeed, we observe such duplex formation from two hairpins with complementary looped regions when we mix X[TTT]6X with X[AAA]6X (unpublished results). The observation here that a duplex does not form on mixing of X[CAG]6X and X[CTG]6X at temperatures below 60°C suggests that the triplet repeat loops adopt structures that inhibit formation of a viable initiation complex. However, the proposed single-stranded loop structure can be thermally disrupted to form a duplex at a temperature below that required to globally melt the hairpins. We propose that this disruption of local structure allows the complementary looped sequences to associate into a bimolecular “initiation complex” (perhaps a “kissing hairpin”), from which the duplex state can grow. Preliminary kinetic experiments (unpublished observations) are consistent with this cascade of events, as cartooned in Scheme .
Optical evidence of hairpin–hairpin interactions.
Our temperature-dependent CD spectra (Fig. 5B, which is published as supporting information on the PNAS web site) reveal that the X[CAG]6X and X[CTG]6X hairpins interact to form a duplex after heating to 70°C, with an optical signature that differs from that calculated for noninteracting hairpins at 70°C. This observation suggests that, after heating to 70°C, the two oligomers interact and adopt a conformation different from that assumed before heating. We propose that the heating process induces formation of the X[CAG]6X⋅X[CTG]6X duplex from the complementary oligonucleotides. This behavior suggests that the order in the triplet repeat loop, the presumptive origin of inhibition of duplex formation, is disrupted at elevated temperature.
In Fig. 2, we present temperature-dependent optical data consistent with the proposition that the two complementary oligonucleotides X[CAG]6X and X[CTG]6X (red curve) form a duplex at elevated temperature. At temperatures up to ≈60°C, the measured absorbance of the 1:1 mixture of X[CAG]6X and X[CTG]6X is identical to that calculated by addition of the separate melting curves (dashed black curve) for a 1:1 noninteracting hairpin mixture. This observation is consistent with no interactions between the hairpins below 60°C. At higher temperatures, the measured absorbance decreases (red curve), a hypochromic effect, which reflects an increase in order within the DNA consistent with duplex formation. At even higher temperatures, the absorbance increases, a hyperchromic effect, due to the disruption of the duplex. Reheating the same sample results in a curve in which only the duplex to single-strand transition is observed (blue curve). Because the isolated hairpin structures do not reform on cooling, we conclude that the duplex state is the thermodynamically more stable state at low temperature. This conclusion means that the isolated hairpins, when present together, represent a kinetically trapped state.
Fig 2.
Absorbance vs. temperature profiles (UV-melting curves) of a 1:1 mixture of X[CAG]6X with X[CTG]6X in 100 mM Na+ buffer. The red curve is observed on heating for the first time; the blue curve is observed in all subsequent heating cycles. The dashed black line represents the calculated melting curve expected for noninteracting hairpins. Note the unusual hypochromic (negative) absorbance change above 60°C in the red curve indicating the formation of the X[CAG]6X⋅X[CTG]6X duplex.
The duplex-to-disordered single-strand transition depends on DNA concentration and ionic strength. The hairpin to disordered single-strand transitions are concentration independent, while also depending on ionic strength, albeit to a lesser degree than the duplex. By contrast, we find the observed hypochromic transition (duplex formation) to be nearly independent of DNA concentration and ionic strength. Taken together, these observations suggest that the hypochromic transition is dominated by a monomolecular process in which the average charge density of the molecules does not change significantly. Because the formation of duplex is by definition a bimolecular process, and therefore concentration dependent, we conclude that monomolecular processes preliminary to duplex formation overwhelm the contribution of strand association to the duplex formation process.
Based on the CD- and temperature-dependent optical data presented above, we propose the series of thermally induced transitions shown in Scheme between the noninteracting hairpins, the duplex, and the disordered single strands. In addition, we infer one or more intermediate states in the transition between the isolated hairpins and the duplex. In this scheme, we propose as a working model that the initiation complex for duplex formation is a “kissing hairpin” (60, 62). More direct structural and kinetic studies will be required to evaluate this proposed state, as well as to define in detail the nature of the surprisingly stable single-stranded triplet repeat looped structure.
At elevated temperatures, the X[CTG]6X and X[CAG]6X oligonucleotides form an optically and thermodynamically “normal” Watson–Crick duplex.
As noted above, a duplex is formed from the complementary oligonucleotides X[CTG]6X and X[CAG]6X by heating to 95°C and slow cooling to 0°C. The CD spectrum of the resultant X[CTG]6X⋅X[CAG]6X duplex (Fig. 5 A and B) is consistent with that observed for normal, Watson–Crick base-paired B form DNA (65). No anomalies in the shape or position of the spectrum are apparent, thereby implying an absence of any unusual duplex structure imparted by the triplet repeat sequence.
Calorimetric characterization of the X[CTG]6X⋅X[CAG]6X duplex also reveals nothing unusual. The calorimetrically determined enthalpy change for disruption of the X[CTG]6X⋅X[CAG]6X duplex, 278.7 kcal⋅mol−1, is in good agreement with estimates derived from nearest-neighbor data based on polymers, 262.0 kcal⋅mol−1, (55) and a combination of polymers and oligonucleotides, 273.3 kcal⋅mol−1 (56). The measured entropy change of 769 cal⋅mol−1⋅K−1 and the heat capacity change of 1470 cal⋅mol−1⋅K−1 also are in agreement with values expected for a duplex of 80% GC (57–59). Consequently, by all thermodynamic measures, the X[CTG]6X⋅X[CAG]6X duplex behaves like a conventional Watson–Crick duplex and does not show any unusual properties that could be attributed specifically to the triplet repeat sequences. In short, this triplet repeat-containing duplex does not exhibit any anomalous optical or thermodynamic properties. As a result, the unusual properties of the single-stranded triplet repeat domains we report here represent reasonable candidates in efforts to define a biophysical basis for the biological origins of triplet repeat diseases.
Our calorimetric data produce a closed thermodynamic cycle consistent with the proposed model.
We applied calorimetric methods to characterize the individual and overall thermodynamics of the processes we have proposed above. Table 1 lists the thermodynamic data obtained from the first and second calorimetric heating scans for an equimolar mixture of X[CAG]6X and X[CTG]6X. These transition curves are shown in Fig. 3. The first calorimetric heating scan of an equimolar mixture of the two hairpins (red) shows an unusual exothermic (negative) transition, which we assign to the thermally induced duplex formation from the kinetically trapped hairpin starting states. The subsequent endothermic (positive) transition in the red curve corresponds to the more conventional duplex melting event. The second heating scan (blue) shows only the positive duplex melting transition, which we used above to establish the thermodynamic parameters for the duplex. Significantly, the DSC experiment permits measurement of the enthalpy changes untainted by the kinetics of the processes and the non-two-state nature of the duplex to single-strand transition. The near superposition of the only transition in the second scan with the second transition in the first scan is consistent with its assignment to duplex melting.
Fig 3.
Excess heat capacity vs. temperature profiles (DSC-melting curves) of a 1:1 mixture of X[CAG]6X with X[CTG]6X in 100 mM Na+ buffer. The red curve is observed on heating for the first time; the blue curve is observed in all subsequent heating cycles. The dashed black line represents the calculated melting curve expected for noninteracting hairpins. Note the unusual exothermic (negative) heat capacity change above 60°C in the red curve indicating the formation of the X[CAG]6X⋅X[CTG]6X duplex.
We again define a convenient reference temperature to evaluate the thermodynamic parameters for the transitions observed in Fig. 3. To avoid the need for a value for the heat capacity change associated with the exothermic (negative) transition of the first DSC scan (which we cannot directly measure due to peak overlap), we select the Tm for that transition as the reference temperature (57–59). Scheme shows a schematic representation of the processes for interconversion of the three equilibrium thermodynamic states that we define in our model of the interactions of the X[CAG]6X and X[CTG]6X oligonucleotides: the isolated hairpins, the duplex, and the unstructured single strands (coils). The corresponding enthalpy and entropy values for the different transitions at our reference temperature of 70.7°C are shown in Scheme and are listed in Table 4, which is published as supporting information on the PNAS web site. The critical feature, which allows us to close the thermodynamic cycle depicted in Scheme , is that we independently measure the hairpin-to-coil transition enthalpies of each isolated hairpin without interference from any competing hairpin/duplex equilibrium (dashed black curve, Fig. 3). Applying Hess's law to the cycle, we see that A−B = C, where A, B, and C represent the enthalpy changes associated with these transitions. Thus, from the separately measured DSC data for disruption of the two hairpin ordered structures (dashed black) and the DSC of the duplex-to-coil transition (blue curve), we calculate the enthalpy change associated with the transition of the hairpins to the duplex. We find A − B = (75.8 + 100.9) − 251.3 = −74.6 kcal⋅mol−1, in very good agreement with the measured exothermic transition in the first DSC scan (−66.7 kcal⋅mol−1), which we assign to the transition from the isolated hairpins to the duplex. An analogous cycle in terms of entropy also closes, with A − B = (213.6 + 284.3) − 691.3 = −193.4 cal⋅mol−1⋅K−1 for the calculated value and −193.9 cal⋅mol−1⋅K−1 for the measured value.
Scheme 2.
Proposed model for the thermodynamic cycle observed for the triplet repeat oligonucleotides. Pathway A is determined only for the hairpins in isolation, which is indicated by the dashed arrows. The equilibrium for pathway B is shifted so far onto the side of the duplex that the reverse reaction (duplex to hairpin) is never observed, as indicated by the crossed arrow.
The critical observation here is that the isolated hairpins are kinetically trapped in an ordered state from which a significant activation barrier has to be overcome before duplex formation can occur, even though the duplex is the most thermodynamically stable state. In the next section, we discuss the implications of our observations within the context of current models of the biology associated with triplet repeat sequences and the onset of the disease state.
Biological Implications
Our experimental results demonstrate that the triplet repeat sequences CAG and CTG may adopt very stable, ordered, single-stranded structures. By contrast, the duplex formed by the (CAG)6 and (CTG)6 repeat sequences appears quite ordinary by every structural and thermodynamic measure evaluated herein. Although the duplex is the thermodynamically favored state for the studied oligonucleotides, our results also demonstrate that the transition through which the ordered single-stranded states form a duplex is kinetically inhibited. The ordered single-stranded triplet repeat structures represent metastable intermediate states (local energy minima), which once formed, persist in the absence of heating.
Given the hypothesis that DNA structure and dynamics are at least in part responsible for the development of the disease state in triplet repeat diseases (4–7), our experimental results favor models that involve, even if only transiently, single-stranded structure as part of the mechanism. Our results clearly suggest that, when in the duplex state, triplet repeat DNA behaves like conventional duplex DNA and is unlikely to be involved with the development of the disease state. Although it will not occur spontaneously from the duplex state, conditions for the formation of ordered single-stranded triplet repeat structure may exist during the course of normal cellular DNA chemistry (replication, transcription, etc.), when the duplex partially dissociates locally into single-stranded domains. The influence of proteins on the formation of ordered single strands remains to be assessed. However, our data suggest that, once formed, ordered, single-stranded triplet repeat structures can represent local minima. Errors in the passage of the replication fork through the triplet repeat domain (10, 67, 68), faulty repair processes (69, 70), or some other yet-to-be-determined mechanism may lead to expansion of triplet repeat DNA when such single-stranded ordered states are populated. Because the onset of the expansion of the triplet repeat domain may depend on such metastable states, the lifetime as well as the relative thermodynamic stability of the ordered structure must be considered in the regulation of events that allow expansion to occur. This interplay between the thermodynamic stability and the lifetime of the single-stranded structures also should determine the observed threshold values for the onset of the disease state.
Supplementary Material
Acknowledgments
This work was supported by National Institutes of Health Grants GM23509, GM34469, and CA47995 (to K.J.B.).
Abbreviations
CD, circular dichroism
DSC, differential scanning calorimeter
References
- 1.Cummings C. J. & Zoghbi, H. Y. (2000) Annu. Rev. Genomics Hum. Genet. 1, 281-328. [DOI] [PubMed] [Google Scholar]
- 2.Ashley C. T. & Warren, S. T. (1995) Annu. Rev. Genet. 29, 703-728. [DOI] [PubMed] [Google Scholar]
- 3.Timchenko L. T. & Caskey, C. T. (1999) Cell Mol. Life Sci. 55, 1432-1447. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Wells R. D. (1996) J. Biol. Chem. 271, 2875-2878. [DOI] [PubMed] [Google Scholar]
- 5.Mitas M. (1997) Nucleic Acids Res. 25, 2245-2254. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.McMurray C. T. (1999) Proc. Natl. Acad. Sci. USA 96, 1823-1825. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Pearson C. E. & Sinden, R. R. (1998) Curr. Opin. Struct. Biol. 8, 321-330. [DOI] [PubMed] [Google Scholar]
- 8.Razin S. V., Ioudinkova, E. S., Trifonov, E. N. & Scherrer, K. (2001) J. Mol. Biol. 307, 481-486. [DOI] [PubMed] [Google Scholar]
- 9.Bowater R. P., Rosche, W. A., Jaworski, A., Sinden, R. R. & Wells, R. D. (1996) J. Mol. Biol. 264, 82-96. [DOI] [PubMed] [Google Scholar]
- 10.Ohshima K., Montermini, L., Wells, R. D. & Pandolfo, M. (1998) J. Biol. Chem. 273, 14588-14595. [DOI] [PubMed] [Google Scholar]
- 11.Ohshima K. & Wells, R. D. (1997) J. Biol. Chem. 272, 16798-16806. [DOI] [PubMed] [Google Scholar]
- 12.Ohshima K., Kang, S., Larson, J. E. & Wells, R. D. (1996) J. Biol. Chem. 271, 16773-16783. [DOI] [PubMed] [Google Scholar]
- 13.Ohshima K., Kang, S. & Wells, R. D. (1996) J. Biol. Chem. 271, 1853-1856. [DOI] [PubMed] [Google Scholar]
- 14.Pearson C. E. & Sinden, R. R. (1996) Biochemistry 35, 5041-5053. [DOI] [PubMed] [Google Scholar]
- 15.Gacy A. M., Goellner, G., Juranic, N., Macura, S. & McMurray, C. T. (1995) Cell 81, 533-540. [DOI] [PubMed] [Google Scholar]
- 16.Smith G. K., Jie, J., Fox, G. E. & Gao, X. (1995) Nucleic Acids Res. 23, 4303-4311. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.LeProust E. M., Pearson, C. E., Sinden, R. R. & Gao, X. (2000) J. Mol. Biol. 302, 1063-1080. [DOI] [PubMed] [Google Scholar]
- 18.Pearson C. E., Wang, Y. H., Griffith, J. D. & Sinden, R. R. (1998) Nucleic Acids Res. 26, 816-823. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Mitas M., Yu, A., Dill, J., Kamp, T. J., Chambers, E. J. & Haworth, I. S. (1995) Nucleic Acids Res. 23, 1050-1059. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Mitas M., Yu, A., Dill, J. & Haworth, I. S. (1995) Biochemistry 34, 12803-12811. [DOI] [PubMed] [Google Scholar]
- 21.Fry M. & Loeb, L. A. (1994) Proc. Natl. Acad. Sci. USA 91, 4950-4954. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Kettani A., Bouaziz, S., Skripkin, E., Majumdar, A., Wang, W., Jones, R. A. & Patel, D. J. (1999) Struct. Fold. Des. 7, 803-815. [DOI] [PubMed] [Google Scholar]
- 23.Chen X., Mariappan, S. V., Catasti, P., Ratliff, R., Moyzis, R. K., Laayoun, A., Smith, S. S., Bradbury, E. M. & Gupta, G. (1995) Proc. Natl. Acad. Sci. USA 92, 5199-5203. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Ohshima K., Kang, S., Larson, J. E. & Wells, R. D. (1996) J. Biol. Chem. 271, 16784-16791. [DOI] [PubMed] [Google Scholar]
- 25.Mariappan S. V., Catasti, P., Silks, L. A., III, Bradbury, E. M. & Gupta, G. (1999) J. Mol. Biol. 285, 2035-2052. [DOI] [PubMed] [Google Scholar]
- 26.Mariappan S. V., Catasti, P., Chen, X., Ratliff, R., Moyzis, R. K., Bradbury, E. M. & Gupta, G. (1996) Nucleic Acids Res. 24, 784-792. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Mariappan S. V., Silks, L. A., III, Bradbury, E. M. & Gupta, G. (1998) J. Mol. Biol. 283, 111-120. [DOI] [PubMed] [Google Scholar]
- 28.Mariappan S. V., Silks, L. A., III, Chen, X., Springer, P. A., Wu, R., Moyzis, R. K., Bradbury, E. M., Garcia, A. E. & Gupta, G. (1998) J. Biomol. Struct. Dyn. 15, 723-744. [DOI] [PubMed] [Google Scholar]
- 29.Petruska J., Arnheim, N. & Goodman, M. F. (1996) Nucleic Acids Res. 24, 1992-1998. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Petruska J., Hartenstine, M. J. & Goodman, M. F. (1998) J. Biol. Chem. 273, 5204-5210. [DOI] [PubMed] [Google Scholar]
- 31.Hartenstine M. J., Goodman, M. F. & Petruska, J. (2000) J. Biol. Chem. 275, 18382-18390. [DOI] [PubMed] [Google Scholar]
- 32.Gacy A. M. & McMurray, C. T. (1998) Biochemistry 37, 9426-9434. [DOI] [PubMed] [Google Scholar]
- 33.Yu A., Barron, M. D., Romero, R. M., Christy, M., Gold, B., Dai, J., Gray, D. M., Haworth, I. S. & Mitas, M. (1997) Biochemistry 36, 3687-3699. [DOI] [PubMed] [Google Scholar]
- 34.Harvey S. C. (1997) Biochemistry 36, 3047-3049. [DOI] [PubMed] [Google Scholar]
- 35.Völker J., Klump, H. H. & Breslauer, K. J. (2001) Proc. Natl. Acad. Sci. USA 98, 7694-7699. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Gelfand C. A., Plum, G. E., Mielewczyk, S., Remeta, D. P. & Breslauer, K. J. (1999) Proc. Natl. Acad. Sci. USA 96, 6113-6118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Plum G. E., (2000) Current Protocols in Nucleic Acid Chemistry (Wiley, New York), pp. 7.3.1–7.3.17.
- 38.Snell F. D. & Snell, C. T., (1972) Colorimetric Methods of Analysis, Including Some Turbidimetric and Nephelometric Methods (Krieger, Huntington, NY).
- 39.Breslauer K. J. (1995) Methods Enzymol. 259, 221-242. [DOI] [PubMed] [Google Scholar]
- 40.Breslauer K. J. (1994) in Methods in Molecular Biology, ed. Agrawal, S. (Humana, Totowa, NJ), Vol. 26, pp. 347–372. [DOI] [PubMed] [Google Scholar]
- 41.Marky L. A. & Breslauer, K. J. (1987) Biopolymers 26, 1601-1620. [DOI] [PubMed] [Google Scholar]
- 42.Cantor C. R. & Schimmel, P. R., (1980) Techniques for the Study of Biological Structure and Function (Freeman, San Francisco).
- 43.Privalov P. L. & Potekhin, S. A. (1986) Methods Enzymol. 131, 4-51. [DOI] [PubMed] [Google Scholar]
- 44.Privalov G. P. & Privalov, P. L. (2000) Methods Enzymol. 323, 31-62. [DOI] [PubMed] [Google Scholar]
- 45.Olmsted M. C., Anderson, C. F. & Record, M. T., Jr. (1991) Biopolymers 31, 1593-1604. [DOI] [PubMed] [Google Scholar]
- 46.Olmsted M. C. (1996) J. Biomol. Struct. Dyn. 13, 885-902. [DOI] [PubMed] [Google Scholar]
- 47.Senior M. M., Jones, R. A. & Breslauer, K. J. (1988) Proc. Natl. Acad. Sci. USA 85, 6242-6246. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Mills J. B., Vacano, E. & Hagerman, P. J. (1999) J. Mol. Biol. 285, 245-257. [DOI] [PubMed] [Google Scholar]
- 49.Ferrari M. E. & Lohman, T. M. (1994) Biochemistry 33, 12896-12910. [DOI] [PubMed] [Google Scholar]
- 50.Goddard N. L., Bonnet, G., Krichevsky, O. & Libchaber, A. (2000) Phys. Rev. Lett. 85, 2400-2403. [DOI] [PubMed] [Google Scholar]
- 51.Ansari A., Kuznetsov, S. V. & Shen, Y. (2001) Proc. Natl. Acad. Sci. USA 98, 7771-7776. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Ansari A., Shen, Y. & Kuznetsov, S. V. (2002) Phys. Rev. Lett. 88, 069801. [DOI] [PubMed] [Google Scholar]
- 53.Kuznetsov S. V., Shen, Y., Benight, A. S. & Ansari, A. (2001) Biophys. J. 81, 2864-2875. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Bonnet G., Krichevsky, O. & Libchaber, A. (1998) Proc. Natl. Acad. Sci. USA 95, 8602-8606. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Klump H. H. (1988) in Biochemical Thermodynamics: Studies in Modern Thermodynamics, ed. Jones, M. N. (Elsevier, Amsterdam), Vol. 8, pp. xi–403. [Google Scholar]
- 56.Breslauer K. J., Frank, R., Blocker, H. & Marky, L. A. (1986) Proc. Natl. Acad. Sci. USA 83, 3746-3750. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Chalikian T. V., Völker, J., Plum, G. E. & Breslauer, K. J. (1999) Proc. Natl. Acad. Sci. USA 96, 7853-7858. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Rouzina I. & Bloomfield, V. A. (1999) Biophys. J. 77, 3242-3251. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Rouzina I. & Bloomfield, V. A. (1999) Biophys. J. 77, 3252-3255. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Marino J. P., Gregorian, R. S., Jr., Csankovszki, G. & Crothers, D. M. (1995) Science 268, 1448-1454. [DOI] [PubMed] [Google Scholar]
- 61.Barbault F., Huynh Dinh, T., Paoletti, J. & Lanceloti, G. (2002) J. Biomol. Struct. Dyn. 19, 649-658. [DOI] [PubMed] [Google Scholar]
- 62.Tinoco I. (1997) Nucleic Acids Symp. Ser. 36, 49-51. [PubMed] [Google Scholar]
- 63.Kim C. H. & Tinoco, I., Jr. (2000) Proc. Natl. Acad. Sci. USA 97, 9396-9401. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Theilleux-Delalande V., Girard, F., Huynh-Dinh, T., Lancelot, G. & Paoletti, J. (2000) Eur. J. Biochem. 267, 2711-2719. [DOI] [PubMed] [Google Scholar]
- 65.Bush C. A. (1974) in Basic Principles in Nucleic Acid Chemistry, ed. T'so, P. O. P. (Academic, New York), Vol. 2, pp. 91–169. [Google Scholar]
- 66.Plum G. E., Grollman, A. P., Johnson, F. & Breslauer, K. J. (1995) Biochemistry 34, 16148-16160. [DOI] [PubMed] [Google Scholar]
- 67.Kang S., Ohshima, K., Shimizu, M., Amirhaeri, S. & Wells, R. D. (1995) J. Biol. Chem. 270, 27014-27021. [DOI] [PubMed] [Google Scholar]
- 68.Kang S., Jaworski, A., Ohshima, K. & Wells, R. D. (1995) Nat. Genet. 10, 213-218. [DOI] [PubMed] [Google Scholar]
- 69.Jaworski A., Rosche, W. A., Gellibolian, R., Kang, S., Shimizu, M., Bowater, R. P., Sinden, R. R. & Wells, R. D. (1995) Proc. Natl. Acad. Sci. USA 92, 11019-11023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Moore H., Greenwell, P. W., Liu, C. P., Arnheim, N. & Petes, T. D. (1999) Proc. Natl. Acad. Sci. USA 96, 1504-1509. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.








