Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2009 Dec 10.
Published in final edited form as: J Am Chem Soc. 2008 Dec 10;130(49):16530–16532. doi: 10.1021/ja8075567

The Tail of the Telomere

Luigi Petraccone 1,2, John O Trent 1, Jonathan B Chaires 1,*
PMCID: PMC2636551  NIHMSID: NIHMS86570  PMID: 19049455

Abstract

The structure of a higher-order G-quadruplex structure for human telomeric DNA is presented. The structure was determined by a novel integrated approach in which molecular dynamics simulations were used to produce a stable structure, from which specific experimentally accessible properties were predicted. These properties were tested by sedimentation velocity and steady-state fluorescence measurements. The structure that emerges is a dimeric structure with two quadruplex units, each with a different structure. The interface between the quadruplex units is stabilized by specific stacking interactions of loop nucleotides. The interface is a unique structure and a unique target for drug design.


Telomeres are structures at the ends of chromosomes that protect chromosomal DNA from degradation, fusion, or recombination1,2. Human telomeric DNA consists of long repeats of the sequence 5'-TTAGGG. Several kilobases of that sequence are paired with a complementary strand to form duplex DNA, but approximately 200 nucleotides of the sequence remain unpaired and forms a single-stranded overhang. The telomere serves as a biological clock, and shortens with age. In immortalized cells and in cancer cells, however, telomerase is activated to maintain the length of the telomere3,4. Specific proteins interact with the overhang and regulate telomerase activity. Repeats of the 5'-TTAGGG sequence can spontaneously form intramolecular quadruplex structures in solution, with a variety of folded morphologies. There is strong evidence that quadruplex structures exist in vivo57, and recent evidence suggests that quadruplexes form in telomeric DNA at specific times in the cell cycle7,8.

Understanding telomere biology is hampered by the lack of knowledge of the structure of the DNA in the full length overhang. The structure of oligonucleotides containing approximately four repeats of the 5'-TTAGGG sequence have been well characterized under a variety of solution conditions911. Little is known, however, about the structure of longer sequences of the telomere repeat found in the overhang. No high-resolution structural data are available for these longer sequences, and such sequences may indeed be difficult to study by the usual NMR or crystallographic methods. The structure and stability of the full length overhang must be known in order to fully understand the interactions of the DNA with telomerase and other telomeric proteins and to understand the changes in telomere structure throughout the cell cycle.

We report here the initial results from a novel strategy that integrates molecular dynamics simulations with experimental validation to obtain realistic structural models for higher-order telomeric DNA. High-resolution quadruplex structures obtained for the four-repeat sequence were used to construct multimeric structures containing several quadruplex units. We focus here on dimeric structures containing two folded quadruplex units. These constructs were refined by molecular dynamics simulations to obtain stable structures which were then used to predict experimentally accessible hydrodynamic properties using the HYDROPRO algorithm12,13 and testable fluorescence properties of specific adenine residues. The experimental results eliminated several possible structures and narrowed the choice to one most probable structure in solution. A similar attempt to build multimeric telomeric DNA structures was recently reported14, but that study was less comprehensive in its exploration of possible structures, and lacked any experimental validation.

Several different models of dimeric quadruplexes with the (TTAGGG)8TT sequence were built by incorporating the available high-resolution structures of single quadruplexes reported in K+ solution. There are two known high-resolution NMR structures for the human telomeric sequence in K+ solution, hybrid 1 and hybrid 21518. The structures of the three G-quartets are similar in each; both consist of 2 lateral loops and one double chain reversal loop. The difference is that in hybrid 1 the double chain reversal loop is the first with respect to the 5’-end whereas in the hybrid 2 it is the last loop. There also exists a crystal structure of the human telomere sequence which has a unique symmetrical propeller-type structure with all parallel G-quartets and 3 double chain reversal loops19. This structure seems not to be the predominate form in solution20. Figure S1A (Supplementary Material) shows these structures. Using these monomeric motifs as building blocks we built five dimeric models that incorporated different combinations of structures, as shown in Figure S1B (Supplementary Material). These are named Hybrid-11 (formed by two hybrid 1 quadruplexes), Hybrid-22 (formed by two hybrid 2 quadruplexes), Hybrid-12 (formed by an hybrid 1 quadruplex at 5’-end and an hybrid 2 quadruplex at the 3’-end), Hybrid-21 (formed by an hybrid 2 quadruplex at 5’-end and an hybrid 1 quadruplex at the 3’-end) and all-propeller (formed by two propeller-type quadruplexes). In the building process, we either maximized or minimized the initial interactions between the two quadruplex units in each model to examine stacked or unstacked possibilities. We refined the initial structures using molecular dynamics. After an equilibration period of 250 ps, models were subjected to an unrestrained molecular dynamics trajectory of 10 ns duration (see Supplementary Material for details). In all of the simulations, the integrity of the individual quadruplexes was preserved throughout the simulation and the major conformational changes were in the relative motions of the quadruplex units. A detailed analysis of the trajectories will be reported elsewhere (in preparation). Here we focus on the main conformational features shown by the simulated models.

Of the five combinations, the optimal interactions and most stable structures were the stacked Hybrid-12 and the propeller models. The propeller model has the ideal geometry to form stable stacking between the two G-tetrad cores, and these stacking interactions were maintained over the whole simulation time. This indicates a very stable conformation, in agreement with a reported computational study on this particular dimer model14. In the Hybrid-12 model, there are strong interactions between the third loop of the hybrid 1 quadruplex unit and the second loop of the hybrid 2 quadruplex unit. In particular, the A-15 residue forms very stable stacking interactions with T-38 and T-25 residues for about 90% of the total simulation time (Figure 1A). Weaker interactions between the two quadruplex units were found in the initially stacked Hybrid-21 and Hybrid-11 models, and in the Hybrid-22 model all interfacial interactions were lost during the first nanosecond of the trajectory. These three less stable models also predicted fluorescence and circular dichroism properties that were inconsistent with experimentally observed behavior, as will be described in detail in a subsequent publication.

Figure 1.

Figure 1

A) Average structure from MD simulation of the Hybrid-12 model. dT and dA residues, in the junction region between the quadruplex units, are shown in red and blue, respectively. B) CD spectra of quadruplex structures. The (TTAGGG)8TT spectrum is shown in green, while the hybrid 1 and 2 monomer spectra are shown in black and red, respectively. The sum of the monomer spectra is shown by the blue line.

Figure 1B shows circular dichroism spectra of (TTAGGG)8TT together with the spectra of the (TTAGGG)4 and (TTAGGG)4TT sequences reported to form hybrid-1 and hybrid-2, respectively. The CD spectrum of the dimer shows a peak at 290 nm, a major shoulder at 268 nm a minor shoulder at 250 nm and a small negative peak at 240 nm. The shape of this spectrum is consistent with the reported spectra of hybrid-type structures21. This observed spectrum of the dimer sequence in solution is inconsistent with a propeller structure, which is predicted to have a spectrum with a maximum at 260 nm. Hybrid 1 and hybrid 2 quadruplexes have subtly different CD spectra (Figure 1).

The CD spectrum of the dimer closely matches the sum of the CD spectra of the hybrid 1 and hybrid 2 monomers. A more complete comparison of the calculated CD spectra for different models is shown in Figure S2 (Supplementary Material). No other model matches as well as the one shown in Figure 1.

The CD spectrum observed in Figure 1 is consistent with previously reported spectra obtained on long telomere repeat sequences2224, and all of these spectra are inconsistent with existence of a parallel-stranded structure in solution.

There are striking differences in loop topologies between the Hybrid-12 and propeller models. Figure 2 shows calculated relative solvent accessible surface areas for loop adenines for the two models. In the propeller model with all double-chain reversal loops, all adenines have similar accessibilities. In contrast, loop adenines show diverse accessibilities in the Hybrid-12 model. Differences in adenine loop residue accessibilities were probed using the fluorescent base analog 2-aminopurine. Single substitutions were made for each adenine within the dimer sequence Figure S3) and these were studied by steady-state fluorescence emission and collisional quenching experiments using acrylamide. The results are shown in Figure 3. It is readily apparent that the loop adenines are in diverse environments, and have widely different solvent accessibilities. These qualitative results are inconsistent with the predictions of the propeller model. Figure 2 shows the correlation between calculated solvent accessible surface areas of loop adenines and accessibility to the acrylamide quencher measured experimentally. There is good agreement between the predicted and actual behavior for the Hybrid-12 model, indicating that this structure can plausibly exist in solution. Such is not the case for the propeller model.

Figure 2.

Figure 2

Correlation of solvent accesible surface areas and expermental collisonal quenching results. The behavior of the experimentally determined F0/F value for the oligonucleotides containing 2-aminopurine (green line) is compared with the calculated solvent accesable surface area of the dA residues over the MD trajectories of the Hybrid-12 (black line) and all-propeller (red line) models. Each plot was normalized with respect to the highest SASA or F0/F value.

Figure 3.

Figure 3

The results of fluorescence intensity measurements (plots on the left) and fluorescence quenching studies using acrylamide (plots on the right) are shown for oligonucleotides containing 2-aminopurine. The structure of the Hybrid-12 model is also shown (in the middle). The colors indicate the position of the 2-aminopurine substitution as specified in the sequence written on the top. Each experiment was performed in 10 mM potassium phosphate, 100 mM KCl, 0.1 mM EDTA at pH 7.0.

Finally, the predicted and observed distributions of sedimentation coefficients for the Hybrid-12 model are shown in Figure S4 (Supplementary Material). The calculated and observed sedimentation coefficients differ by less than 0.1S, fully consistent with the Hybrid-12 structure. A more complete discussion of the sedimentation properties of all of the constructed models is in preparation.

Our simulations and experiments show that the most plausible structure for telomeric DNA containing eight 5'-TTAGGG repeats is one in which two folded quadruplex units exist, but each with a different folding motif. The two units are not fully independent, and are stabilized by specific interactions at the interface. The interface region forms a unique structure, one that may present a unique target for ligands. We find that while a propeller dimeric structure is computed to be stable, its predicted properties are inconsistent with what is experimentally observed. It is unlikely to be a major species in solution.

Supplementary Material

1_si_001. Supporting Information Available.

Supplementary figures, materials and methods. This materials is available free of charge via the Internet at http://pubs.acs.org.

Acknowledgements

Supported by NIH grants CA35635 and GM077422. We thank Dr. Robert Gray for helpful comments and Dr. Danzhou Yang for providing coordinate files.

References

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1_si_001. Supporting Information Available.

Supplementary figures, materials and methods. This materials is available free of charge via the Internet at http://pubs.acs.org.

RESOURCES