Skip to main content
Nucleic Acids Research logoLink to Nucleic Acids Research
. 2009 Jul 6;37(16):5559–5567. doi: 10.1093/nar/gkp563

Stability of intramolecular quadruplexes: sequence effects in the central loop

Aurore Guédin 1,2, Patrizia Alberti 1,2, Jean-Louis Mergny 1,2,*
PMCID: PMC2760802  PMID: 19581426

Abstract

Hundreds of thousands of putative quadruplex sequences have been found in the human genome. It is important to understand the rules that govern the stability of these intramolecular structures. In this report, we analysed sequence effects in a 3-base-long central loop, keeping the rest of the quadruplex unchanged. A first series of 36 different sequences were compared; they correspond to the general formula GGGTTTGGGHNHGGGTTTGGG. One clear rule emerged from the comparison of all sequence motifs: the presence of an adenine at the first position of the loop was significantly detrimental to stability. In contrast, adenines have no detrimental effect when present at the second or third position of the loop. Cytosines may either have a stabilizing or destabilizing effect depending on their position. In general, the correlation between the Tm or ΔG° in sodium and potassium was weak. To determine if these sequence effects could be generalized to different quadruplexes, specific loops were tested in different sequence contexts. Analysis of 26 extra sequences confirmed the general destabilizing effect of adenine as the first base of the loop(s). Finally, analysis of some of the sequences by microcalorimetry (DSC) confirmed the differences found between the sequence motifs.

INTRODUCTION

G-quadruplexes are unusual nucleic acids structures which result from the hydrophobic stacking of several quartets; each quartet being a planar association of four guanines held together by eight hydrogen bonds (1–5). Four (or more) tracts of two or more guanines are required to form an intramolecular structure. A cation (typically Na+ or K+) is located between two quartets and forms cation–dipole interactions with eight guanines. Quadruplex-prone regions abound in the human genome (6–8) and the sequence repertoire of sequences compatible with quadruplex formation is far more diverse than initially imagined (9). Intramolecular G-quadruplexes may form at telomeres, oncogene promoter sequences and other biologically relevant regions (10,11). Therefore, it is important to understand the rules that govern the formation of these intramolecular structures and to determine their stabilities. Previous works support the important role played by the nature and length of the loops in quadruplex stability (12–22) and reports suggest a strong influence of loop length on quadruplex stability. However, the stability cannot simply be deduced from total loop length, as elegantly demonstrated by Kumar and Maiti (21).

To understand the contribution of loop sequence, we chose to study model sequences involving three medium loops composed of three nucleotides. This leads to a general sequence of the type GGGTTTGGGHNHGGGTTTGGG and gives a total of 36 (3 × 4 × 3) different sequences, which are presented in Table 1. All sequences were able to form a quadruplex. A detailed quantitative and exhaustive analysis of all sequences allowed to obtain conclusions that we also validated in different sequence contexts.

Table 1.

Sequence of the oligonucleotide used (part 1)

Name Sequence 5′ => 3′ Tma Na+ Tma K+ ΔG°b Na+ (50°C) ΔG°b K+ (62°C)
TTT GGGTTTGGGTTTGGGTTTGGG 49.3 64.1 +0.02 −0.37
TAA GGGTTTGGGTAAGGGTTTGGG 49.9 65.2 −0.07 −0.45
ATA GGGTTTGGGATAGGGTTTGGG 46.3 62.0 +0.52 +0.07
AAT GGGTTTGGGAATGGGTTTGGG 44.8 61.0 +0.71 +0.24
TTC GGGTTTGGGTTCGGGTTTGGG 53.9 64.2 −0.4 −0.4
TCT GGGTTTGGGTCTGGGTTTGGG 47.9 63.2 +0.31 −0.23
CTT GGGTTTGGGCTTGGGTTTGGG 53.4 63.6 −0.46 −0.27
TCC GGGTTTGGGTCCGGGTTTGGG 51.8 62.2 −0.21 −0.11
CTC GGGTTTGGGCTCGGGTTTGGG 53.6 64.0 −0.56 −0.48
CCT GGGTTTGGGCCTGGGTTTGGG 49.5 61.6 +0.2 −0.08
ATT GGGTTTGGGATTGGGTTTGGG 45.8 61.2 +0.65 +0.1
TAT GGGTTTGGGTATGGGTTTGGG 49.9 64.2 +0.02 −0.31
TTA GGGTTTGGGTTAGGGTTTGGG 51.3 64.7 −0.18 −0.51
AAC GGGTTTGGGAACGGGTTTGGG 49.3 60.5 +0.06 +0.28
ACA GGGTTTGGGACAGGGTTTGGG 45.1 60.3 +0.62 +0.28
CAA GGGTTTGGGCAAGGGTTTGGG 50.1 63.0 −0.02 −0.27
ACC GGGTTTGGGACCGGGTTTGGG 48.6 58.1 +0.27 +0.6
CAC GGGTTTGGGCACGGGTTTGGG 53.6 65.0 −0.49 −0.5
CCA GGGTTTGGGCCAGGGTTTGGG 48.4 63.0 +0.20 −0.24
AAA GGGTTTGGGAAAGGGTTTGGG 44.8 60.1 +0.74 +0.32
CCC GGGTTTGGGCCCGGGTTTGGG 49.8 61.8 +0.04 +0.04
ATC GGGTTTGGGATCGGGTTTGGG 49.7 60.0 +0.1 +0.3
TAC GGGTTTGGGTACGGGTTTGGG 53.3 63.7 −0.38 −0.29
TCA GGGTTTGGGTCAGGGTTTGGG 50.0 63.3 +0.09 −0.21
ACT GGGTTTGGGACTGGGTTTGGG 45.9 59.2 +0.65 +0.5
CAT GGGTTTGGGCATGGGTTTGGG 50.8 63.5 −0.09 −0.32
CTA GGGTTTGGGCTAGGGTTTGGG 50.5 63.6 −0.02 −0.34
AGA GGGTTTGGGAGAGGGTTTGGG 45.5 60.2 +0.56 +0.26
AGT GGGTTTGGGAGTGGGTTTGGG 47.3 61.2 +0.39 +0.1
TGA GGGTTTGGGTGAGGGTTTGGG 50.4 64.4 0 −0.35
TGT GGGTTTGGGTGTGGGTTTGGG 52.0 66.0 −0.31 −0.66
TGC GGGTTTGGGTGCGGGTTTGGG 54.0 63.5 −0.58 −0.24
CGC GGGTTTGGGCGCGGGTTTGGG 54.9 64.5 −0.55 −0.43
CGT GGGTTTGGGCGTGGGTTTGGG 53.0 65.9 −0.5 −0.72
AGC GGGTTTGGGAGCGGGTTTGGG 50.0 60.5 −0.03 0.35
CGA GGGTTTGGGCGAGGGTTTGGG 49.8 64.3 +0.03 −0.35

aTm in °C, with a ± 0.5°C precision; average of two to four independent values; determined from the analysis of UV melting profiles at 295 nm and/or 240 nm.

bΔG° in kcal/mol, determined at 50°C in sodium, 62°C in potassium from the analysis of UV melting profiles at 295 nm. Negative values mean that the oligonucleotide is predominantly folded at this temperature.

MATERIALS AND METHODS

Nomenclature, synthesis and purification of oligonucleotide sequences

Oligonucleotides were synthesized by Eurogentec (Seraing, Belgium) at the 40-, 200- or 1000-nmol scale. Several sequences were re-synthesized; inter-lot reproducibility was excellent. Concentrations were estimated using extinction coefficients provided by the manufacturer and calculated with a nearest-neighbour model (23) as described before. Sequences are given in the 5′ to 3′ direction. H corresponds to A, C or T while N = A, C, T or G. We chose to exclude loop sequences that create an ambiguity in G tract definition, such as GNN or NNG, as there would be two different possibilities to select three consecutive guanines in a block of 4 G.

Absorbance measurements

Melting experiments were conducted as previously described (19,24,25) by recording the absorbance at 240 and 295 nm (26,27). Most sequences were tested at least twice at 5 μM strand concentration by two independent experimentators. As the Tm variations studied here are relatively modest, all samples were tested using strictly identical preparation protocols and buffers to maximize reproducibility. Despite all these precautions, the mean difference between two independent Tm determinations was 0.5°C. Hence, differences between two Tms below 0.5°C were not considered significant. All transitions were reversible, indicating that the denaturation curves correspond to a true equilibrium process (28,29). The intramolecular formation of the G-quadruplexes was evaluated by varying concentration in the 5–241 µM range for subset of sequences.

Thermodynamic and statistical analysis

One can extract from the raw absorbance data the fraction of folded oligonucleotide as a function of temperature, assuming linear baselines. One may then perform a van’t Hoff analysis of the melting curves using previously described procedures (30,31). However, most ΔH° values found were very similar and this analysis generally assumes that ΔCp = 0. This conventional assumption of a near-zero value for ΔCp has been invalidated in a number of cases (32), including for quadruplexes (also see the ‘DSC analysis’ section).

Rather than extrapolating ΔG° values to the physiologically relevant temperature (37°C), we chose reference temperatures that are close to the average Tm values (50°C in sodium, 62°C in potassium). Hence, ΔG° values may be directly determined from the folded fraction θ at this temperature, [ΔG° = –RT ln (K)] without any extrapolation or hypothesis on the temperature independency of the ΔH°. As the transition is predominantly intramolecular, K = θ/(1–θ), in which θ (between 0 and 1) is deduced from the absorbance versus temperature profiles, with linear baselines assumption. These ΔG°50°C(Na+) or ΔG°62°C(K+) are of course less biologically relevant than an extrapolated ΔG°37°C but they are determined with a much higher confidence and fewer hypotheses. Comparison of Tm values (linear fits, Student's t-tests) were performed with Kaleidagraph 3.6 software. Error bars correspond to standard deviation values between independent experiments or sequences.

TDS and CD spectra

Thermal difference spectra (TDS) were obtained by difference between the absorbance spectra from unfolded and folded oligonucleotides that were recorded much above and below their Tm (33).

Circular dichroism (CD) spectra were recorded on a JASCO-810 spectropolarimeter as previously described (19).

Gel electrophoresis

For some experiments, formation of G4-DNA was confirmed by non-denaturing PAGE. In that case, oligonucleotides were either 5′ labelled with T4 polynucleotide kinase or directly visualized by UV-shadow (see Supplementary Data). Prior to the incubation, the DNA samples were heated at 90°C for 5 min and slowly cooled (2 h) to room temperature. Samples were incubated at 50 nM or 4 µM strand concentration in Tris/HCl 10 mM pH 7.5 buffer with 100 mM NaCl or KCl. Ten percent sucrose was added just before loading. Oligothymidylate markers (dT15, dT21 or dT30) or double-stranded markers (Dx9: 5′d-GCGTATCGG + 5′d-CCGATACGC; Dx12: 5′d-GCGTGACTTCGG + 5′d-CCGAAGTCACGC) were also loaded on the gel. One should note that the migration of the dTn oligonucleotides does not necessarily correspond to single strands (34): these oligonucleotides were chosen here to provide an internal migration standard, not to identify intramolecular or higher-order structures.

Differential scanning calorimetry

Microcalorimetry experiments were carried out using a Nano DSC-II microcalorimeter as previously described (35,36). Buffer and oligo solutions were carefully degassed prior to their utilization and their thermal profiles were analysed in the 0 to 100°C temperature range at a scan rate of 1°C/min. A minimum of 12 scans (six heating/six cooling) was collected for each experiment (see Supplementary Data for examples of raw data). For all the scans, the oligo versus buffer scan was substracted by the previously performed buffer scan, which allowed us to obtain the best scan shapes. Subtraction of the constructed baseline and calculation of the thermodynamic parameters were carried out using the Cp-calc software (Applied Thermodynamics). The oligonucleotides were dissolved at concentrations ranging from 200 to 241 µM in 10 mM lithium cacodylate buffer at pH 7.2 containing 100 mM KCl or NaCl.

RESULTS

Evidence for quadruplex formation

Our objective is to compare the stability of close sequences under strictly identical conditions. All oligonucleotides (except one, see below) studied here form quadruplexes under both reference conditions (10 mM Lithium cacodylate pH 7.2 supplemented with 100 mM NaCl or KCl, abbreviated hereafter K+ or Na+ conditions). One can follow the typical evolution on temperature of the folded fraction determined at 295 nm in the presence of 100 mM KCl or NaCl (Figure 1A and B). One notices the presence of a cation-dependent conformational change associated with the temperature increase. As expected, stability of the quadruplex was always lower in NaCl and higher in KCl (Table 1). The TDS in K+ and Na+ buffers are shown in Figure 1C and D. These TDS spectra were obtained by difference between the absorbance spectra recorded above and below the observed transition. They exhibit the typical pattern of a G-quadruplex structure with two positive maxima at 240 and 275 nm and a negative minimum around 295 nm (26,33,37). Furthermore, the CD spectra of these structures were in agreement with the formation of quadruplexes (Figure 1E and F) (38,39). Representative examples are shown in this figure. Spectra in sodium (Figure 1E) and in potassium (Figure 1F) are different, suggesting that the folding schemes for all these sequences are different in the two salts; nevertheless, CD spectra do not provide direct evidence for the folding topology of quadruplexes and this conclusion should be treated with caution. Finally, formation of a quadruplex was confirmed by the stability dependence of the structure on the nature of the monocation (Table 1).

Figure 1.

Figure 1.

Spectroscopic data. (A, B) Folded fraction (derived from the absorbance at 295 nm) is plotted as a function of temperature for a selection of three oligonucleotides at two different concentrations: 5 μM (dotted lines; using 1-cm path length quartz cuvettes) and ≈225 μM (full lines; using 0.1-cm path length quartz cuvettes) for the ACT (circles), TCA (squares) and CGC (triangles) sequences (examples of raw melting profiles may be found in the Supplementary Data). Both buffers contained 10 mM lithium cacodylate at pH 7.2; (A) in 100 mM NaCl, (B) in 100 mM KCl. (C, D) Thermal difference spectra result from the difference between the absorbance recorded at 88 ± 2°C and at 4 ± 2°C for the same oligonucleotides (5 μM strand concentration) using 1-cm path length quartz cuvettes. They are normalized (TDSnorm = TDS/max(TDS) over the 220–335-nm wavelength range; (A) in 100 mM NaCl, (B) in 100 mM KCl. (E, F) Circular dichroism (CD) spectra (5 μM strand concentration) were recorded at 25°C using 1-cm path length quartz cuvettes; (A) in 100 mM NaCl, (B) in 100 mM KCl. Only a few experimental points are shown for clarity.

For some sequences, concentration-independent melting temperatures confirmed that the folding process was intramolecular (Figure 1A and B). Non-denaturing gel electrophoresis allowed us to compare the behaviour in sodium and potassium. In the example provided in Figure 2B, (bottom) a single band is observed in potassium for all sequences. Its fast migration is in excellent agreement with the formation of intramolecular complexes, as judged by molecular size markers such as double-stranded sequences or oligothymidylate repeats (left lanes). Migration was not affected by the addition of unlabelled oligonucleotide, indicating that the predominant species was the same at 50 nM or 4 µM strand concentration, in full agreement with intramolecular structures. In sodium, one also observed a single major band, for which concentration-independent migration was also in agreement with intramolecular quadruplexes (Figure 2A). Nevertheless, although intramolecular complexes predominated in all cases, minor bands corresponding to higher-order structures (probably dimers) were also observed in Na+ for some of the sequences with an ANN central loop (i.e. the sequences that form the least stable intramolecular quadruplexes—see below—are more prone to form species of higher molecularity). Interestingly, significant differences in migration between sequences were found in sodium, while migration was nearly insensitive to sequence in potassium (compare top and bottom gels). Analysis of all 36 mers in sodium was also performed at a higher strand concentration (30 μM) by UV-shadow analysis (Figure S2); it confirmed the results with radio-labelled oligomers.

Figure 2.

Figure 2.

Oligonucleotide migration on a non-denaturing gel. PAGE profiles for a selection of nine oligonucleotides at two different concentrations: 0.05 μM (radio-labelled only) and 4 μM. Samples were prepared in a 100 mM NaCl (A) or KCl (B) buffer and loaded on a non-denaturing 15% acrylamide gel supplemented with 20 mM of the corresponding salt and run at 20°C. Migration markers are double-stranded sequences (Dx12 and Dx 9, forming 12 and 9 bp) and ‘single-stranded’ dTn oligomers (15, 21 or 30 nt long).

Analysis of stability

The Tms and ΔG° of the 36 different oligonucleotides are summarized in Table 1. For all sequences, Tm is higher in potassium than in sodium, as for nearly all quadruplexes published so far. In this series, the Tm difference is moderate as the average Na+–K+ Tm difference was 12.7°C. The formation of quadruplex structures, whatever is their type, is clearly enthalpy driven (Table 2). Despite a negative contribution of entropy to stability, all quadruplex structures studied here are stable at physiological temperature, especially when potassium is present. One should also note that the correlation between Tms (Figure 3A) obtained in sodium and potassium is weak (correlation coefficient = 0.69; data not shown). This indicates that sequence dependent effects are somewhat different in sodium and potassium, and that results obtained under a given cation are weakly predictive of what can be expected under another cation. Differences in Tm accurately reflect differences in ΔG° (Figure 3B). A fair correlation may be found between these two parameters. A 1°C difference in Tm roughly translates into a 0.13 (in Na+) to 0.15 (in K+) kcal/mol difference in ΔG°.

Table 2.

Model-dependent versus model-independent thermodynamic parameters

Salt Oligo (Strand) μM UV-melting
DSC: average excess enthalpy/entropy
DSC: deconvolution general model, 1 transitiond
Tma (°C) ΔH°VH (kcal mol–1) ΔS°VH (cal K–1 mol–1) ΔHcalc (kcal mol–1) ΔScalc (cal K–1 mol–1) Tt(°C) ΔHe (kcal mol–1) ΔCpe Tm (°C)
Na+ ACT 223 45 42.0 ± 1.2 132 ± 3 30.1 ± 2.2 94 ± 7 47.3 ± 0.7 35.5 ± 0.7 −0.70 ± 0.14 50.8 ± 0.9
5 45 44.1 139
TCA 241 50 43.5 ± 1.8 135 ± 5 31.0 ± 2.2 95 ± 7 52.0 ± 0.8 39.0 ± 0.6 −0.53 ± 0.11 54.5 ± 0.7
5 49 44.0 136
CGC 225 53.5 44.9 ± 1.2 138 ± 4 28.6 ± 2.3 87 ± 7 56.2 ± 0.7 40.0 ± 0.9 −0.66 ± 0.20 59.2 ± 1.0
5 53 48.4 149
K+ ACT 215 59 (44)b (133)b 36.4 ± 3.8 109 ± 11 62.1 ± 0.5 44.6 ± 1.0 −0.79 ± 0.21 64.9 ± 0.5
5 59 (49)b (148)b
TCA 200 63 (48)b (142)b 39.6 ± 3.5 117 ± 10 66.7 ± 0.3 48.6 ± 0.8 −0.86 ± 0.19 69.0 ± 0.4
5 63 (49)b (146)b
CGC 233 63.5 (43)b (127)b 40.1 ± 3.8 118 ± 11 67.6 ± 0.4 47.9 ± 1.1 −0.86 ± 0.29 69.9 ± 0.6
5 63.5 (53)b (156)b

aThese values (which correspond to a different series of experiments) are in fair agreement with those presented in Table 1.

bΔH°VH and ΔS°VH in KCl are provided for illustration only, as lnK versus 1/T graphs significantly deviate from linearity (see Supplementary Figure S1D for an example). Hence, linear fitting of these graphs is inappropriate.

cInline graphic

where Cpexcess is the excess heat capacity function. Average of six heating and six cooling profiles, respectively.

dThe general transition model directly fits the molar heat capacity Cp (and not the excess heat capacity Cpexcess). It is used for transitions with ΔCp ≠ 0. In this model, ΔCp(T ) is fitted with a second order polynome: ΔCp(T) = a + bT + cT2 = ΔCp(Tm) + b(TTm) + c(T2Tm2). Average of six heating and six cooling profiles, respectively.

eΔH and ΔCp at T = Tm

Figure 3.

Figure 3.

Tm and ΔG° in sodium and potassium. (A) Tm in 100 mM NaCl (with 10 mM lithium cacodylate at pH 7.2) is plotted versus Tm in 100 mM KCl (same buffer). All 36 sequences are shown. Average values in potassium (62.7°C) and sodium (49.8°C) are provided. (B) ΔG°50°C in 100 mM NaCl with 10 mM lithium cacodylate at pH 7.2 (blue crosses) or ΔG°62°C in 100 mM KCl (same buffer; red triangles) is plotted versus Tm, determined under identical conditions. (A linear fit between ΔG° and 1/Tm would be expected if all sequences had identical enthalpies).

DSC analysis

Three sequences were re-synthesized at a large scale (1 μM) in order to perform DSC experiments at 200–241-μM strand concentration. Representative examples of DSC scans (in sodium and potassium) are shown in Figure 4 and full scans are presented in Figure S5. A comparison of the thermodynamic parameters obtained from DSC experiments with a van’t Hoff analysis derived from UV study can be found in Table 2. Calorimetry experiments confirmed the differences in stability between three sequences chosen for in depth analysis (ACT, TCA and CGC):

  • – a representative example of a low stability case (ACT)

  • – a representative example of an intermediate stability case (TCA)

  • – a representative example of a high stability case (CGC)

Figure 4.

Figure 4.

DSC analysis. Cp versus temperature plots for the ACT (circles), TCA (squares) and CGC (triangles) sequences at strand concentration of ≈225 μM. The experimental buffer contained 10 mM lithium cacodylate at pH 7.2 with either 100 mM NaCl (full lines) or 100 mM KCl (dotted lines).

Figure 4 illustrates the DSC profiles for these oligonucleotides. The Tt temperatures (maximum of the Cp versus temperature plots) derived from DSC experiments were in fair agreement with the Tm deduced from UV-melting experiments (Table 2). The non-linear behaviour of ln(K) versus 1/T Arrhenius plots for UV-melting experiments in potassium (an example is provided in Supplementary Figure S1D) prevented us from accurately determining ΔH° and ΔS° in KCl. Hence, a comparison of ΔH°cal and ΔH°VH (and of ΔS°cal and ΔS°VH) could only be done in sodium. Model-independent ΔH°cal values were less negative (by 28–36%) than ΔH°VH for the three oligonucleotides. This less favourable enthalpy was (of course) compensated by a less negative entropy. Deconvolution of the DSC profiles with a general model for a single transition with a non-zero ΔCp provided intermediate thermodynamic values (Table 2).

Extension to other contexts

Some rules emerged from the wealth of data collected here. Nevertheless, one may wonder if these rules apply to this sequence context only, or if they may be generalized to various quadruplex types. Quadruplexes are highly polymorphic (40–42): rules that are valid for a given loop type (lateral, diagonal, propeller) do not necessarily apply to other types. To test this hypothesis, we chose two loops starting with an adenine (ACT and ATT) and compared them with TCA and CGC. These loop sequences were tested in six other quadruplex types (Table 3) with a variable number of quartets, different positions (i.e. when present in the first, central or last loops). Finally we analysed the effect of adenines in loops of different lengths (two or four bases). As shown in this table, the ACT and ATT sequences are less stable than the TCA and CGC motifs in all cases (seven different sequence contexts in sodium versus six in potassium; no comparison could be made in one case due to high stability), demonstrating that the ‘No A as a first base’ rule is general. This rule also applies to loops consisting of two or four bases, as AT and ATTT loops were less stable than TA and TTTA, respectively. On the other hand, the CGC loop is often, but not always (8/11 cases) more stable than the TCA loop (Table 3).

Table 3.

Loop sequence effects in various contexts

Name Sequence 5′ => 3’ Tma Na+ Tma K+ ΔG°b Na+50°C ΔG°b K+62°C
TT GGGTTAGGGTTGGGTTAGGG 46.9 63.9 +0.4 −0.3
AT GGGTTAGGGATGGGTTAGGG 43.8 59.5 +0.8 +0.4
TA GGGTTAGGGTAGGGTTAGGG 55.2 66.2 −0.7 −0.7
TTTT GGGTTAGGGTTTTGGGTTAGGG 58.0 68.2 −1.3 −1.1
ATTT GGGTTAGGGATTTGGGTTAGGG 57.5 68.2 −1.1 −1.3
TTTA GGGTTAGGGTTTAGGGTTAGGG 64.0 71.5 −2.3 −2.1
ACT GGGTTAGGGACTGGGTTAGGG 52.9 62.7 −0.3 −0.1
TCA GGGTTAGGGTCAGGGTTAGGG 57.5 66.1 −1.0 −0.7
CGC GGGTTAGGGCGCGGGTTAGGG 61.4 68.5 −1.4 −1.0
L3ACT GGGTTTGGGTTTGGGACTGGG 46.8 60.5 +0.5 +0.2
L3ATT GGGTTTGGGTTTGGGATTGGG 47.5 61.9 +0.4 0
L3TCA GGGTTTGGGTTTGGGTCAGGG 51.7 67.6 −0.2 −0.9
L3CGC GGGTTTGGGTTTGGGCGCGGG 52.8 65.8* −0.4 −0.6
L1ACT GGGACTGGGTTTGGGTTTGGG 45.5 61.9 +0.6 0
L1ATT GGGATTGGGTTTGGGTTTGGG 46.8 62.4 +0.5 −0.1
L1TCA GGGTCAGGGTTTGGGTTTGGG 51.5 66.7 −0.2 −0.8
L1CGC GGGCGCGGGTTTGGGTTTGGG 51.1* 65.8* −0.1 −0.6
X3ACT GGGACTGGGACTGGGACTGGG 39.1 51.9 +1.5 +1.5
X3TCA GGGTCAGGGTCAGGGTCAGGG 51.6 63.2 −0.2 −0.2
X3CGC GGGCGCGGGCGCGGGCGCGGG c c c c
17ACT GGGTGGGACTGGGTGGG 40.2 >80 +1.1 <−3
17TCA GGGTGGGTCAGGGTGGG 48.1 >80 +0.2 <−3
17CGC GGGTGGGCGCGGGTGGG 50 >80 −0.1 <−3
15ACT GGTTGGACTGGTTGG <12 35.4 >3 >3
15TCA GGTTGGTCAGGTTGG 16 40.0 >3 >3
15TGTd GGTTGGTGTGGTTGG 20 50.2 >3 +1.6
15CGC GGTTGGCGCGGTTGG 26.7 48.6 >3 +1.8

aTm in °C, with a ±0.5°C precision; determined from the analysis of UV melting profiles at 295 nm.

bΔG° in kcal/mol, determined at 50°C in sodium, 62°C in potassium from the analysis of UV melting profiles at 295 nm. In some instance, values could not be accurately determined (stability of the quadruplex is either too low or too high at this temperature) and minimal/maximal values are provided.

cThis GC-rich sequence does not form a quadruplex.

dThrombin aptamer sequence.

*In this sequence context, CGC is slightly less stable than TCA (in contrast with all other samples).

DISCUSSION

In the present study, we analysed the effects of base substitutions in a 3-base-long loop for intramolecular quadruplexes. Although many oligomers adopt relatively similar conformations, the stabilities of these complexes may significantly vary. The comparison of an important number of sequences (a total of 54) in two ionic conditions allowed to draw general rules. The difference found between potassium and sodium is relatively modest for most of the sequences presented here. In comparison, this difference [Tm(K+)Tm(Na+)] is in the same range (8°C) for the human telomeric motif (26) but can reach 36°C for single-base loops sequences (19). These general observations do not explain why some sequences are more sensitive than others to the nature of the cation.

We demonstrate here that overall base composition of the loops is a poor predictor of quadruplex stability. Whatever the parameter considered (total number of purines, or A, T or C in the loops), the correlation with thermal stability or ΔG° is poor, especially in potassium (Supplementary Figures S3 and S4). Nevertheless, some sequence effects may be evidenced: adenine is significantly disfavoured over a pyrimidine base (T or C) in a sodium buffer (correlation coefficient = 0.64). This is in agreement with previous observations (19,43). However, to our surprise, this destabilizing effect mainly applies to the ‘first’ position of the loop (Figure 5A) and was confirmed in loops consisting of 2, 3 or 4 bases. This is not observed when adenine is present at the central or last position.

Figure 5.

Figure 5.

Position effects. Examples of paired comparisons: (A) Effect on Tm of the nature of the ‘first’ base in the loop (A versus C or T). (B) Effect on Tm of the nature of the ‘last’ base in the loop (C versus A or T). Student's paired t-test values are shown. Similar conclusions were reached when looking at ΔG° values (data not shown).

Other less obvious but still significant effects may be evidenced as well. A cytosine is slightly detrimental (by 1–2°C) to quadruplex stability when present at the central position (data not shown). In contrast, the presence of a cytosine at the last position has little effect in potassium, and a significant stabilizing effect in sodium (Figure 5B). These observations allow us to propose consensus sequences for high- or low-stability 3-base-long loops:

‘Most’ stable (Na+): Y  D  C

‘Most’ stable (K+): Y  D  H

‘Least’ stable (Na+): A  C  W

‘Least’ stable (K+): A  C  H

where Y = C or T; H = A, C or T; W = A or T and D = A, G or T. (Note that, due to ambiguity problem, we cannot easily determine the effect of a guanine as the first or last base of a loop.)

The stability of a structure at 37°C (directly related to ΔG°) is obviously more useful for biological purposes than a Tm. Unfortunately, its determination/extrapolation is more difficult for a number of reasons (to name a few: low reproducibility, baseline artefacts, non-zero ΔCp°, large uncertainty in ΔH° determination and non-two-state melting behaviour). For these reasons, we propose to experimentally determine the folded fraction at a fixed temperature close to the average Tm for a series of sequences, which may immediately be translated into a ΔG°. We simply chose reference temperatures of 50 and 62°C close to the average Tm in sodium and potassium, respectively, so that one can determine θ and thus K and ΔG with maximal accuracy. Differences in stabilities (ΔΔG°) may then be determined with good confidence and reproducibility. Obviously, these ΔΔG° are determined at a non-physiological temperature; nevertheless, provided that enthalpies of the various sequences are not too different, one may expect that the conclusions still hold at 37°C. Alternatively, Figure 3B demonstrates a fair correlation between Tm and ΔG°, suggesting that for this series of >50 sequences, a Tm difference of 1°C may be translated into a ΔG° difference of 0.13–0.15 kcal/mol. As the Tm values in sodium span a >10°C temperature range, sequence effects may affect the Kd by a factor of ≈10, showing that sequence effects play a very significant role which was underestimated before.

Our ultimate goal is to establish rules to predict the stability of intramolecular G-quadruplexes based on primary sequence. This approach is complementary to the recent work by Kumar and Maiti (21), who compared the stability of a number of biologically relevant sequences with various loop size and length (total loop length 5–18 bases). In this report, we decided to explore a limited sequence space (only 3-base-long loops are considered) but in a nearly exhaustive fashion, allowing us to evidence relatively subtle effects. Our results also demonstrate that loop composition affect quadruplex structure and thermodynamics, thus making it difficult to draw generalized correlations between loop length and thermodynamic stability (21). The destabilizing effect of adenine as the first base of the loop has not been reported before. Efforts are now being made to understand the structural basis of this destabilization. Therefore, thanks to the data accumulated by several groups, one may build a set of ‘Santa Lucia-like’ table for quadruplexes, in order to propose a predictive algorithm for G4 stability, which may eventually be incorporated in MFOLD. However, the situation will be more complex than for duplexes, as sequences effects cannot be simplified to nearest neighbour approximation. A first prediction algorithm provides a fair Tm estimate in a number of situations (44). Finally, it will be interesting to check at the genome-wide level if loops starting with an adenine are under or over-represented in G4 putative sequences. It is also striking that telomeric motifs from different species never correspond to ‘loops’ starting with an A (i.e. GGGTTA, GGGTTTA, GGGGTT, GGGGTTTT, etc.) as if the least stable quadruplex motif was selected against in telomeric motifs (Tran et al., in preparation).

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online.

FUNDING

INSERM, CNRS and the Muséum National d’Histoire Naturelle. Funding for open access charge: INSERM.

Conflict of interest statement. None declared.

Supplementary Material

[Supplementary Data]
gkp563_index.html (681B, html)

ACKNOWLEDGEMENTS

We thank L. Lacroix, J. Gros, L. L. Guittat, J.F. Riou and C. Trentesaux (Museum, Paris) for helpful discussions.

Footnotes

The authors wish it to be known that, in their opinion, the first two authors should be regarded as joint First Authors.

REFERENCES

  • 1.Neidle S, Balasubramanian S. Quadruplex Nucleic Acids. Cambridge: RSC Biomolecular Sciences; 2006. [Google Scholar]
  • 2.Neidle S, Parkinson GN. The structure of telomeric DNA. Curr. Opin. Struct. Biol. 2003;13:275–283. doi: 10.1016/s0959-440x(03)00072-1. [DOI] [PubMed] [Google Scholar]
  • 3.Burge S, Parkinson GN, Hazel P, Todd AK, Neidle S. Quadruplex DNA: sequence, topology and structure. Nucleic Acids Res. 2006;34:5402–5415. doi: 10.1093/nar/gkl655. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Gellert M, Lipsett MN, Davies DR. Helix formation by guanylic acid. Proc. Natl Acad. Sci. USA. 1962;48:2013–2018. doi: 10.1073/pnas.48.12.2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Bates P, Mergny JL, Yang D. Quartets in G-major. The First International Meeting on Quadruplex DNA. EMBO Rep. 2007;8:1003–1010. doi: 10.1038/sj.embor.7401073. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Todd AK, Johnston M, Neidle S. Highly prevalent putative quadruplex sequence motifs in human DNA. Nucleic Acids Res. 2005;33:2901–2907. doi: 10.1093/nar/gki553. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Huppert JL, Balasubramanian S. Prevalence of quadruplexes in the human genome. Nucleic Acids Res. 2005;33:2908–2916. doi: 10.1093/nar/gki609. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Huppert JL. Hunting G-quadruplexes. Biochimie. 2008;90:1140–1148. doi: 10.1016/j.biochi.2008.01.014. [DOI] [PubMed] [Google Scholar]
  • 9.Bourdoncle A, Estévez-Torres A, Gosse C, Lacroix L, Vekhoff P, Le Saux T, Jullien L, Mergny JL. Quadruplex-based molecular beacons as tunable DNA Probes. J. Am. Chem. Soc. 2006;128:11094–11105. doi: 10.1021/ja0608040. [DOI] [PubMed] [Google Scholar]
  • 10.Patel DJ, Phan AT, Kuryavyi V. Human telomere, oncogenic promoter and 5′-UTR G-quadruplexes: diverse higher order DNA and RNA targets for cancer therapeutics. Nucleic Acids Res. 2007;35:7429–7455. doi: 10.1093/nar/gkm711. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Qin Y, Hurley LH. Structures, folding patterns, and functions of intramolecular DNA G-quadruplexes found in eukaryotic promoter regions. Biochimie. 2008;90:1149–1171. doi: 10.1016/j.biochi.2008.02.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Bugaut A, Balasubramanian S. A sequence-independent study of the influence of short loop lengths on the stability and topology of intramolecular DNA g-quadruplexes. Biochemistry. 2008;47:689–697. doi: 10.1021/bi701873c. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Risitano A, Fox KR. Stability of intramolecular DNA quadruplexes: comparison with DNA duplexes. Biochemistry. 2003;42:6507–6513. doi: 10.1021/bi026997v. [DOI] [PubMed] [Google Scholar]
  • 14.Risitano A, Fox KR. The stability of intramolecular DNA quadruplexes with extended loops forming inter- and intra-loop duplexes. Org. Biomol. Chem. 2003;1:1852–1855. doi: 10.1039/b302251j. [DOI] [PubMed] [Google Scholar]
  • 15.Hazel P, Huppert J, Balasubramanian S, Neidle S. Loop-length-dependent folding of G-quadruplexes. J. Am. Chem. Soc. 2004;126:16405–16415. doi: 10.1021/ja045154j. [DOI] [PubMed] [Google Scholar]
  • 16.Risitano A, Fox KR. Influence of loop size on the stability of intramolecular DNA quadruplexes. Nucleic Acids Res. 2004;32:2598–2606. doi: 10.1093/nar/gkh598. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Guo Q, Lu M, Kallenbach NR. Effect of thymine tract length on the structure and stability of model telomeric sequences. Biochemistry. 1993;32:3596–3603. doi: 10.1021/bi00065a010. [DOI] [PubMed] [Google Scholar]
  • 18.Cevec M, Plavec J. Role of loop residues and cations on the formation and stability of dimeric DNA G-quadruplexes. Biochemistry. 2005;44:15238–15246. doi: 10.1021/bi0514414. [DOI] [PubMed] [Google Scholar]
  • 19.Guédin A, De Cian A, Gros J, Lacroix L, Mergny JL. Sequence effects in single base loops for quadruplexes. Biochimie. 2008;90:686–696. doi: 10.1016/j.biochi.2008.01.009. [DOI] [PubMed] [Google Scholar]
  • 20.Kumar N, Sahoo B, Varun KAS, Maiti S, Maiti S. Effect of loop length variation on quadruplex-Watson Crick duplex competition. Nucleic Acids Res. 2008;33:4433–4442. doi: 10.1093/nar/gkn402. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Kumar N, Maiti S. A thermodynamic overview of naturally occurring intramolecular DNA quadruplexes. Nucleic Acids Res. 2008;36:5610–5622. doi: 10.1093/nar/gkn543. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Vorlickova M, Bednarova K, Kejnovska I, Kypr J. Intramolecular and intermolecular guanine quadruplexes of DNA in aqueous salt and ethanol solutions. Biopolymers. 2007;86:1–10. doi: 10.1002/bip.20672. [DOI] [PubMed] [Google Scholar]
  • 23.Cantor CR, Warshaw MM, Shapiro H. Oligonucleotide interactions. 3. Circular dichroism studies of the conformation of deoxyoligonucleotides. Biopolymers. 1970;9:1059–1077. doi: 10.1002/bip.1970.360090909. [DOI] [PubMed] [Google Scholar]
  • 24.Mergny JL, Lacroix L. Analysis of thermal melting curves. Oligonucleotides. 2003;13:515–537. doi: 10.1089/154545703322860825. [DOI] [PubMed] [Google Scholar]
  • 25.Mergny JL, Lacroix L. UV melting of G-quadruplexes. Curr. Protoc. Nucleic Acids Chem. 2009;17:17.11.11–17.11.15. doi: 10.1002/0471142700.nc1701s37. [DOI] [PubMed] [Google Scholar]
  • 26.Mergny JL, Phan AT, Lacroix L. Following G-quartet formation by UV-spectroscopy. FEBS Lett. 1998;435:74–78. doi: 10.1016/s0014-5793(98)01043-6. [DOI] [PubMed] [Google Scholar]
  • 27.Saccà B, Lacroix L, Mergny JL. The effect of chemical modifications on the thermal stability of different G-quadruplexes-forming oligonucleotides. Nucleic Acids Res. 2005;33:1182–1192. doi: 10.1093/nar/gki257. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Rachwal PA, Findlow S, Werner JM, Brown T, Fox KR. Intramolecular DNA quadruplexes with different arrangements of short and long loops. Nucleic Acids Res. 2007;35:4214–4222. doi: 10.1093/nar/gkm316. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Gros J, Rosu F, Amrane S, De Cian A, Gabelica V, Lacroix L, Mergny JL. Guanines are a quartet's best friend: impact of base substitutions on the kinetics and stability of tetramolecular quadruplexes. Nucleic Acids Res. 2007;35:3064–3075. doi: 10.1093/nar/gkm111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Breslauer KJ. In: Methods in Molecular Biology, Vol. 26, Protocols for Oligonucleotide Conjugates. Agrawal S, editor. Totowa, NJ: Humana Press Inc.; 1994. pp. 347–372. [Google Scholar]
  • 31.Breslauer KJ. In: Energetics of Biological Macromolecules. Johnson ML, Ackers GK, editors. Vol. 259. San Diego, CA: Academic Press Inc.; 1995. pp. 221–242. [Google Scholar]
  • 32.Chalikian TV, Volker J, Plum GE, Breslauer KJ. A more unified picture for the thermodynamics of nucleic acid duplex melting: a characterization by calorimetric and volumetric techniques. Proc. Natl Acad. Sci. USA. 1999;96:7853–7858. doi: 10.1073/pnas.96.14.7853. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Mergny JL, Li J, Lacroix L, Amrane S, Chaires JB. Thermal difference spectra: a specific signature for nucleic acid structures. Nucleic Acids Res. 2005;33:e138. doi: 10.1093/nar/gni134. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Kejnovska I, Kypr J, Vorlickova M. Oligo(dT) is not a correct native PAGE marker for single-stranded DNA. Biochem. Biophys. Res. Commun. 2007;353:776–779. doi: 10.1016/j.bbrc.2006.12.093. [DOI] [PubMed] [Google Scholar]
  • 35.Amrane S, Saccà B, Mills M, Chauhan M, Klump HH, Mergny JL. Length-dependent energetics of (CTG)n and (CAG)n trinucleotide repeats. Nucleic Acids Res. 2005;33:4065–4077. doi: 10.1093/nar/gki716. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Amrane S, Mergny JL. Length and pH-dependent energetics of (CCG)(n) and (CGG)(n) trinucleotide repeats. Biochimie. 2006;88:1125–1134. doi: 10.1016/j.biochi.2006.03.007. [DOI] [PubMed] [Google Scholar]
  • 37.Petraccone L, Erra E, Randazzo A, Giancola C. Energetic aspects of locked nucleic acids quadruplex association and dissociation. Biopolymers. 2006;83:584–594. doi: 10.1002/bip.20591. [DOI] [PubMed] [Google Scholar]
  • 38.Gray DM, Wen JD, Gray CW, Repges R, Repges C, Raabe G, Fleischhauer J. Measured and calculated CD spectra of G-quartets stacked with the same or opposite polarities. Chirality. 2007;20:431–440. doi: 10.1002/chir.20455. [DOI] [PubMed] [Google Scholar]
  • 39.Paramasivan S, Rujan I, Bolton PH. Circular dichroism of quadruplex DNAs: applications to structure, cation effects and ligand binding. Methods. 2007;43:324–331. doi: 10.1016/j.ymeth.2007.02.009. [DOI] [PubMed] [Google Scholar]
  • 40.Dai J, Carver M, Yang D. Polymorphism of human telomeric quadruplex structures. Biochimie. 2008;90:1172–1183. doi: 10.1016/j.biochi.2008.02.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Shirude PS, Balasubramanian S. Single molecule conformational analysis of DNA G-quadruplexes. Biochimie. 2008;90:1197–1206. doi: 10.1016/j.biochi.2008.01.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Neidle S, Parkinson GN. Quadruplex DNA crystal structures and drug design. Biochimie. 2008;90:1184–1196. doi: 10.1016/j.biochi.2008.03.003. [DOI] [PubMed] [Google Scholar]
  • 43.Rachwal PA, Brown T, Fox KR. Sequence effects of single base loops in intramolecular quadruplex DNA. FEBS Lett. 2007;581:1657–1660. doi: 10.1016/j.febslet.2007.03.040. [DOI] [PubMed] [Google Scholar]
  • 44.Stegle O, Payet L, Mergny JL, Huppert J. Predicting and understanding the stability of G-quadruplexes. Bioinformatics. 2009;25:i374–i382. doi: 10.1093/bioinformatics/btp210. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

[Supplementary Data]
gkp563_index.html (681B, html)
gkp563_1.pdf (2.4MB, pdf)

Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press

RESOURCES