Abstract
DNA G-hairpins are potential key structures participating in folding of human telomeric guanine quadruplexes (GQ). We examined their properties by standard MD simulations starting from the folded state and long T-REMD starting from the unfolded state, accumulating ∼130 μs of atomistic simulations. Antiparallel G-hairpins should spontaneously form in all stages of the folding to support lateral and diagonal loops, with sub-μs scale rearrangements between them. We found no clear predisposition for direct folding into specific GQ topologies with specific syn/anti patterns. Our key prediction stemming from the T-REMD is that an ideal unfolded ensemble of the full GQ sequence populates all 4096 syn/anti combinations of its four G-stretches. The simulations can propose idealized folding pathways but we explain that such few-state pathways may be misleading. In the context of the available experimental data, the simulations strongly suggest that the GQ folding could be best understood by the kinetic partitioning mechanism with a set of deep competing minima on the folding landscape, with only a small fraction of molecules directly folding to the native fold. The landscape should further include non-specific collapse processes where the molecules move via diffusion and consecutive random rare transitions, which could, e.g. structure the propeller loops.
INTRODUCTION
Telomeres are terminal regions of linear eukaryotic chromosomes. Their main function is to maintain genomic stability and facilitate chromosome replication. Telomeric DNA in vertebrates comprises of double-stranded DNA consisting of tandem repeat sequence d(GGGTTA).d(CCCTAA), terminating in a single-stranded 150–250 nt long 3′-terminal G-rich overhang (1,2). Telomeres are shortened during every DNA replication (3,4) and their loss may eventually trigger apoptosis (5). A specialized enzyme, telomerase, helps to maintain telomere length (6). However, excessive telomerase activity may lead to cell immortality (7); cells in more than 80% of cancers have indeed been shown to overexpress telomerase (8). The telomeric G-rich single-stranded overhang is capable of forming non-canonical evolutionary conserved structures called G-quadruplexes (GQ) (9), recently visualized in vivo (10,11), which may downregulate telomerase activity (12). Therefore, stabilization of telomeric GQs by small ligands may help in cancer treatment (8,13). GQ-forming sequences are widespread also in other parts of the genome and there is increasing evidence that formation of GQ structures may be a powerful way to regulate gene expression (14–19).
GQs are formed by stacking of guanine quartets. The quartet is a planar assembly of four cis Watson–Crick/Hoogsteen-paired (20) (shortly Hoogsteen-paired or cWH) guanines. Four carbonyl oxygens of each quartet aim towards GQ centre and form a central channel with a negative electrostatic potential. Monovalent cations are required to reside inside the channel to counterbalance the negative potential, and thus stabilize the GQ architecture. Mutual orientation of G-strands in quadruplexes is either parallel or antiparallel, and connecting loops can be lateral (edgewise), diagonal or propeller (double-chain reversal), giving rise to parallel, antiparallel and hybrid GQ topologies. Antiparallel orientation of two strands requires that any two cWH base-paired guanines have different χ angle orientation, one being in syn conformation and the other in anti. Parallel strands impose the same χ angle orientation for base-paired guanines. This leads to interdependence between the syn/anti patterns in the GQ stems and the overall GQ topologies (21,22). Human telomeric sequence is well known for its structural polymorphism. Free energy computations suggest that polymorphism is a feature inherent to DNA GQs having odd numbers of guanines in the G-tracts, unless their topology is specifically dictated by their loops (23,24). The dominant GQ topology in a given experiment depends on exact nucleotide sequence including the flanking nucleotides, nature of stabilizing cation, DNA strand concentration, presence of cosolvents and perhaps on other factors (25–37). Six different topologies of human telomeric GQ have been elucidated by X-ray and NMR studies (30–36,38,39).
There are currently intense experimental efforts to understand the elusive process of GQ folding. Although earlier studies suggested that at least some GQ sequences may fold on a time scale of dozens to hundreds of milliseconds (40,41), most recent ensemble experiments indicate that the process of folding takes hours or even days (42,43). This is consistent with the latest single molecule experiments which discovered conformational states with similar lifetimes (44,45). The basic limitation of experimental investigations is that most of them do not allow confident determination of the structures that are populated in the course of the folding process, and thus plausible structural suggestions are sometimes based on intuition. The order parameters that are currently used to describe the folding processes may be insufficient for the full characterization of folding. Some of them, e.g. derived from CD spectra, may occasionally be ambiguous (46). For example, although CD is commonly used to detect major changes of the GQ folded ensemble dominated by the basket-type GQ upon the Na+ → K+ replacement (47), Raman spectroscopy (48) and NMR (49) revealed that sometimes the most populated fold may remain unaltered despite a profound change of the CD signal. Different and sometimes even kinetically unrelated ensembles may in principle overlap in the measured signals (50). Recently, first time-resolved NMR study of GQ folding intermediates for the human telomeric sequence emerged. The time resolution allowed to identify two competing long-lived structures corresponding to the hybrid-2 (off-pathway intermediate) and hybrid-1 (thermodynamic minimum) folds (43).
In fact, when considering all the currently available experimental data and their interpretation, the picture of GQ folding is far from being unambiguous. This may partially reflect limitations of the specific techniques. However, consistent interpretation of the experimental data may also be hampered by the fact that different experiments capture the folding process under different physical conditions, resulting, e.g. in genuine differences between ensemble and single molecule experiments. The unfolded (denatured) state ensemble, which significantly influences the folding process (45,51), depends on the specific physical process of unfolding. Thermally denatured, chemically denatured and low-entropy force-denatured ensembles are indeed not identical (51,52) and may give rise to diverse folding pathways.
Because of the genuine limitations of the experimental techniques, there have been increasing efforts to complement the experiments by modern computational techniques (53–62). The computational methods can provide in-depth insights into those parts of the folding process which are not within the resolution limits of the experiments (43). Among the available techniques, molecular dynamics (MD) simulation approaches are very promising, since they allow an optimal compromise between accuracy and computational efficiency. Standard MD studies attempting to mimic real thermal dynamics of DNA molecules are however limited by their μs time scale (63). This allows studying the properties of different structures (i.e. their associated local free energy basins) that can participate in the folding, but does not allow direct characterization of the full folding process. MD studies were performed for many GQ native structures (54,64–67), potential four-stranded intermediates with strand slippage (58,60) and G-triplexes (42,57,61). Simulation methods that enhance sampling can achieve larger conformational changes and even to certain extent sample the space between folded and denatured species. However, any acceleration of barrier crossing is associated with some drawbacks and may distort the picture of the folding process (63). The induced folding or unfolding might follow a pathway that depends on the exact protocol adopted to enhance sampling, which may complicate comparison with experiments made using specific control parameters and setups that are not equivalent to those in the computations (63,68). Nevertheless, even with enhanced sampling methods, it is unlikely we can achieve a complete folding analysis for systems that fold on such a long time scale as the GQs. Besides that, all theoretical studies are certainly affected by the approximations inherent to the simple molecular mechanical (force field) model (63,69).
The DNA GQ folding has been sometimes likened to funnel-like folding of small fast-folding proteins or RNAs, perhaps in light of some earlier experiments indicating fast folding. However, we suggest that there is a substantial difference between the principles of small protein and GQ folding. Short proteins have folding times in hundreds of μs to ms (51,70,71). Their sequences have been optimized to eliminate kinetic traps from their free energy surfaces, reducing their ruggedness to be around kT. On the contrary, the fact that the very short GQ-forming DNA sequences fold on very long time scales indicates that GQ conformational surface is extremely rugged with deep competing basins of attractions (i.e. the substates) impeding fast spontaneous folding into the native basin of attraction (43,50). GQs have very limited sequence freedom for tuning the folding speed. Further, folding of short proteins is directed by formation of local secondary structure native contacts (71). This contrasts GQ DNA, where the most critical part of the folding may involve formation of the native consecutive ion-stabilized quartets. The quartets are based on non-local interactions and, in a given fold, must have a specific combination of syn and anti nucleotides (21,22). A decisive role of local contacts (which would include formation of the loop topologies) for the GQ folding is also not consistent with the fact that many GQs fold to diverse topologies upon subtle changes of the environment or flanking sequences. This indicates presence of several or even many competing basins of attraction. Then only a small fraction of the molecules reaches directly the thermodynamically most stable ensemble. The remaining molecules would be first trapped in competing substates, i.e. misfolded, resulting in extremely multi-pathway folding process. The native and most significant competing basins of attraction may be interchanged upon changing the experimental conditions, which may explain why the folding topology dominant in the final thermodynamic equilibrium is so sensitive to subtle changes of the folding conditions. Most molecules likely initially locate different (competing) folds and may need to unfold to make another folding attempt. This is consistent with our earlier simulation studies identifying numerous stable potential intermediates. It has been visualized experimentally for the folding of the human telomeric hybrid-1 type GQ for which the hybrid-2 structure has been shown as a major competing substate (43). The authors conclude that transitions between the two folds appear to be realized via the unfolded ensemble and note that the folding process can be affected by other substates which are below the resolution of the experiment. The number of substates that can be revealed by a given technique depends on its structural and time resolution.
Earlier, we studied potential four- and three-stranded architectures that may participate in GQ folding pathways (60,61). In this paper we complete the investigations by an extended set of simulations of potential double-stranded G-hairpin structures. G-hairpins are relevant not only to the earliest stages of the folding pathways (the hairpins were suggested to fold on the μs to ms time scale) (42,72), but also to various later phases of folding as the G-hairpins may be important parts of the ensembles during inter-conversions among different triplex and GQ arrangements (40,41,43,56,72–74). Only limited experimental and computational effort has been paid to G-hairpins so far (56,72–74).
We report three sets of investigations. First, a set of standard simulations is initiated assuming structures of the hairpins with cWH GG pairing with syn-anti orientations of the guanines corresponding to either different known GQ topologies or suggested GQ folding pathways. The aim is to characterize the conformational space associated with the native-like basins of the G-hairpins. The second set of standard simulations is initiated from unfolded single strand, in an attempt to initiate formation of a G-hairpin from a straight single strand. Finally, we use an extended temperature replica-exchange MD (T-REMD) (75) enhanced-sampling simulation to obtain a full-scale analysis of the conformational space of the G-hairpins that is independent of the starting structures.
MATERIALS AND METHODS
Antiparallel hairpins with lateral (edgewise) loops
We have examined three DNA sequences, differing in their 3′-terminal base: d[AGGGTTAGGG], d[AGGGTTAGGGT] and d[AGGGTTAGGGA]. The choice was based on the sequences of three known GQ NMR structures, basket-type 143D d[A(GGGTTA)3GGG] (31), 3 + 1 hybrid-1-type 2GKU d[TT(GGGTTA)3GGGA] (38) and 3 + 1 hybrid-2-type 2JPZ d[TTA(GGGTTA)3GGGTT] (32). For the sake of simplicity, we do not list the flanking bases while the TTA loop is marked as ‘—’ in the G-hairpin abbreviations used in the following text.
In antiparallel 5′-GGG—GGG-3′ hairpins, the first G of the first G-stretch pairs with the last G of the second G-stretch (both in bold), etc. Since the guanines are cWH paired, the antiparallel arrangement implies that if one G in a base pair adopts anti orientation of its χ angle, the other G in the pair must adopt syn orientation (or vice versa) (21,22). Thus, if we assign specific order of syn (s) and anti (a) residues (henceforth ‘s/a pattern’) to one G-stretch, the other adopts the inverted s/a pattern. Since there are three Gs per every G-stretch, there are 23 = 8 possible s/a patterns for an antiparallel hairpin. We studied four of them: (i) 5′-asa—sas-3′, (ii) 5′-saa—ssa-3′, (iii) 5′-ssa—saa-3′ and (iv) 5′-sas—asa-3′ (Figure 1A). These four combinations maximize the number of syn-anti and anti-anti combinations of two consecutive Gs (the dinucleotide steps, 5′-GpG-3′), which are energetically favourable when participating in complete GQ stems (23,24). The other four possible s/a patterns have not been simulated due to the following reasons. The 5′-aaa—sss-3′ and 5′-sss—aaa-3′ patterns possess two unfavourable syn-syn steps (23,24). In addition, due to lack of any alternation of syn and anti in one G-stretch, such structures would be prone to vertical strand slippage (60,61). The 5′-ass—aas-3′ and 5′-aas—ass-3′ contain three unfavourable syn-syn and anti-syn steps (23,24).
Depending on the overall GQ topology, an antiparallel lateral loop hairpin can adopt either a narrow groove or a wide groove shape (Supporting Supplementary Figure S1A–H) (21,22). We prepared both variants of the hairpins for all four studied s/a patterns, which means 4 × 2 = 8 hairpin variants (Figure 1A). When in addition considering the variants of the 3′-terminal bases (see above), we had altogether 24 starting structures. Eight of them could be directly prepared from the experimental structures (Supplementary Figure S1A–H), using the first frames of the PDB files. The remaining structures were prepared manually by creating the missing 3′-terminal nucleotide and/or adjusting the s/a pattern. The built-up structures were subjected to extended equilibration to avoid possible instability introduced by the modelling procedure (see the Equilibration Protocol in the Supporting Information).
Parallel hairpins with propeller (double-chain reversal) loops
Propeller loop occurs when the two strands are parallel, which implicates that each base pair contains nucleotides with the same χ angle (21,22). The first G of the first G-stretch pairs with the first G of the second G-stretch, etc. We have taken three starting structures: (i) the first loop from 2GKU with the adjacent G-stretches with one T at both termini (d[TGGGTTAGGGT] with 5′-saa—saa-3′ pattern; Figure 1B and Supplementary Figure S1i) (38), (ii) the last loop from 2JPZ with the adjacent G-stretches with one terminal residue at both termini (d[AGGGTTAGGGT]; 5′-saa—saa-3′; Figure 1B and Supplementary Figure S1j) (32), and (iii) the first loop from the parallel stranded X-ray structure 1KF1with the adjacent G-stretches with one terminal residue at both termini (d[AGGGTTAGGGT]; 5′-aaa—aaa-3′; Figure 1B and Supplementary Figure S1k) (34).
We have further examined parallel hairpin with single nucleotide propeller loop, namely, the last loop from the 3 + 1 hybrid structure of the Bcl-2 promoter GQ 2F8U with the adjacent G-stretches and one 5′-terminal T (d[TGGGCGGG]; 5′-saa-saa-3′; Figure 1B and Supplementary Figure S1l) (76). All propeller loops are associated with medium grooves (Supplementary Figure S1i–l) (21,22).
Note that the term ‘parallel hairpin’ is in the literature used also for a hairpin with topology resembling the antiparallel hairpin with a lateral loop, but with 5′-5′ or 3′-3′ chemical bond in the loop region. When referring to parallel hairpins in this article, we strictly mean a structure with parallel G-stretches linked by a propeller loop.
Duplexes without loops
To separate the effects of the loops, we extracted antiparallel wide and narrow groove duplexes and parallel medium groove duplex d[GGG]2 from the 2GKU (38) structure by deleting the loops and all flanking residues. The antiparallel duplexes had the 5′-saa-3′/5′-ssa-3′ pattern, while the parallel duplex had the 5′-saa-3′/5′-saa-3′ pattern. The removal of loops introduces symmetry for antiparallel arrangements, so 5′-saa-3′/5′-ssa-3′ is equivalent to 5′-ssa-3′/5′-saa-3′.
Unfolded single stranded chains
A single stranded d[GGGTTAGGG] B-DNA-like all-anti helix with the 5′-aaa—aaa-3′ pattern was modelled using the NAB module of AMBER (77). Then, three other s/a patterns were prepared manually by flipping Gs into syn orientation, targeting the distribution of syn and anti Gs of consecutive G-stretches in the 2GKU (38) and 2JPZ (32) GQs, namely 5′-saa—saa-3′ (to facilitate folding of the parallel hairpins with propeller loop), 5′-ssa—saa-3′ and 5′-saa—ssa-3′ (to facilitate folding of anti-parallel hairpins). We also built a d[AGGGTTAGGG] single stranded helix with 5′-aaa—aaa-3′ pattern and from that we prepared a 5′-asa—sas-3′ variant, corresponding to the lateral loops in the basket GQ 143D (31); the 5′-terminal A prevented syn-orientation of the first G (23,24).
Triplex and quadruplex systems
In order to test a specific hypothesis about folding of the basket structure which emerged from our interim results and our previous triplex study (61), we performed additional simulations of three hypothetical triplex intermediates T1, T2 and T3 with cWH paired Gs and of a hypothetical chair-type GQ marked as Q1 (Figure 1C). We built up the d[A(GGGTTA)2GGG] triplex T1 with the 5′-sas—asa—sas-3′ pattern and lateral (wide) → lateral (narrow) loop (groove) order, d[A(GGGTTA)2GGGT] T2 with 5′-asa—sas—asa-3′ pattern and lateral (narrow) → lateral (wide) loop (groove) order, and d[A(GGGTTA)2GGGT] T3 with 5′-asa—sas—asa-3′ pattern and lateral (narrow) → diagonal loop (groove) order. The GQ Q1 had the d[A(GGGTTA)3GGG] sequence, the 5′-asa—sas—asa—sas-3′ pattern identical to the 143D GQ and lateral (narrow) → lateral (wide) → lateral (narrow) loop order. The T1 fold was built based on the 2GKU structure by extracting the triplex with lateral (wide) → lateral (narrow) loop order and manual adjustment of the s/a pattern. The T2, T3 and Q1 folds were built from scratch, geometries of lateral loops were taken from the 2GKU. Cations were manually placed in their channels and all systems were subjected to prolonged equilibration (see Supporting Information). We performed three independent simulations of T1 and T2 and nine of T3 (Supplementary Table S1). The Q1 GQ was simulated under standard conditions for 500 ns. The final structure was then utilized for two independent unfolding no-salt simulations. In no-salt simulations, no ions are present in the simulation box, including the GQ channel. Selected structures observed in one of the no-salt simulations were taken as starting structures for new standard simulations with ions (refolding simulations) (Supplementary Table S1). Goal of this procedure was to observe possible structural rearrangements between Q1 and the triplex structures. No-salt simulations represent an efficient tool to investigate likely rearrangements of GQ in early stages of unfolding and the subsequent refolding simulations aim to suggest movements that are relevant to late stages of folding, as described elsewhere (60,78).
Standard MD simulation
All molecules were solvated by a truncated octahedral box of waters with minimal distance between the solute and the box border of 10 Å. We have used either TIP3P (79) or SPC/E (80) water models. These two water models differ somewhat in their kinetic properties (81), but this should not have any substantial systematic effect on our results, as confirmed by them. Thus, when a given system is simulated with both water models, all simulations are considered as equivalent. Solute molecules were first neutralized by counterions and then ∼0.15 M excess salt was added, either NaCl or KCl. TIP3P-adapted and SPC/E-adapted Joung & Cheatham ion parameters were used (82). In some simulations, Amber-adapted Aqvist Na+ (83) with Smith&Dang Cl− (84) ions in TIP3P water or Dang K+ (85) with Smith&Dang Cl− ions in SPC/E water were also used, in order to eliminate a possibility that some detected ion binding sites are due to specific ion parameters (86). The no-salt simulations of the Q1 molecule, one in TIP3P and one in SPC/E water box, were performed without presence of any salt. The residual charge was neutralized adding a homogenous compensating background.
Although the type of cation may affect final thermodynamic equilibrium of diverse folds in experiments, we can consider properties of K+ and Na+ ions as sufficiently similar in MD simulations, as discussed in ref. (78). Additional discussion about how to correctly compare simulations with experimental techniques can be found in ref. (87). The fact that Na+ and K+ can sometimes stabilize different folds under the condition of thermodynamic equilibrium does not mean that the other fold is entirely disrupted. Both folds are likely present with both cations, though the cations may modulate their relative free energies in such a way that one of them becomes undetectable. However, it does not significantly affect the kinetic stabilities and dynamics of both topologies on the μs time scale of the simulations (60). In other words, cation differences do not affect computations that do not directly investigate the free energy difference or transitions between the two topologies and with respect to the unfolded state. Thus, we do not expect any dramatic effect of the ion type or specific ion parameters on our results. When addressing specific properties where we suspected that the type of ion could affect our conclusions, we made simulations with both Na+ and K+. Solvation, addition of ions and manual adjustment of s/a patterns were done in the xLEaP module of AMBER (77).
Parmbsc0 (88) + parmχOL4 (89) + parmϵζOL1 (90) version (bsc0χOL4ϵζOL1) of the Cornell et al. force field (91,92) was used in almost all simulations (for parameters, see AmberTools15 May 2015 update or http://fch.upol.cz/en/ff_ol/index.php). The bsc0 modifies the α/γ balance to basically stabilize DNA simulations. The χOL4 improves the balance of syn/anti nucleotide conformers and ϵζOL1 modifies the backbone ϵ and ζ dihedrals. The χOL4 and ϵζOL1 adjustments improve simulations of GQs as well as B-DNA (89,90). In few simulations of the parallel stranded G-hairpins other force field combinations were also employed to test force field sensitivity of the results: bsc0χOL4, bsc0ϵζOL1, bsc0 alone or parm99 (93). The bsc0 alone was used for six standard simulations of unfolded single stranded chains, since they were done at the beginning of the project. We did not repeat these oldest simulations with the new bsc0χOL4ϵζOL1 version since their basic outcome was then confirmed by the extended T-REMD simulation using bsc0χOL4ϵζOL1. See (63) for overview of currently available nucleic acids force fields.
Electrostatic interactions were calculated using the particle mesh Ewald method (PME) (94,95), non-bonded cutoff was set to 9 Å. Temperature was held at 300 K and pressure at 1 atm using the Berendsen weak-coupling thermostat and barostat (96). The SHAKE algorithm was applied to all bonds involving hydrogens (97), and the integration step was set to 2 fs. The molecules were equilibrated as described in the Supporting Information. Production phase of the simulations was performed with GPU version of pmemd module of AMBER 12 (77). Trajectories were processed in the ptraj module of AMBER and visualized in VMD.
T-REMD simulation
In order to enhance sampling of the d[GGGTTAGGG] single strand, we employed T-REMD (75). As starting structures, we utilized unfolded ssDNAs differing in orientations of the glycosidic torsions of all six Gs. Namely, we constructed all 64 possible s/a variants (considering six Gs: 26 = 64) and these were taken as input structures for the T-REMD calculation involving 64 replicas. We checked that syn ↔ anti transitions were sufficiently sampled at all guanines and all replicas, so the final population of s/a variants was not biased by their distribution in the starting structures (Supplementary Figure S2). Such choice of different starting structures should result in better convergence compared to the usage of one uniform starting structure for all replicas, though we in no case claim that our results are quantitatively converged. The TTA residues were in anti orientation in all starting structures.
The starting topology and coordinates of d[GGGTTAGGG] ssDNA were prepared using the tLEaP module of AMBER 12 (77). All structures were solvated using rectangular box with a 10 Å thick layer of SPC/E water molecules surrounding the solute (80). The T-REMD simulation was performed in ∼0.15 M NaCl salt excess (84,98). All replicas shared the same number of water molecules, numbers of ions and size of the simulation box. All structures were minimized and equilibrated using equilibration protocol as described in Supporting Information. The T-REMD simulation was carried out using the AMBER suite of programs with bsc0χOL4ϵζOL1 (88–90). The temperatures of all 64 replicas spanned the range of 278–444.9 K, achieving exchange rates between 22 % and 28 %. The T-REMD simulation was performed at constant volume using the NVT ensemble in each replica, using PME with 1 Å grid spacing and 10 Å real-space cutoff. Since using the weak coupling approach (96) in combination with T-REMD can lead to artefacts (99), we used Langevin dynamics with friction coefficient of 2 ps−1 as a thermostat in all replicas. Exchanges were attempted every 10 ps. Each replica was simulated to 1 μs, accumulating thus 64 μs in total. The trajectories were sorted either to follow continuous trajectories or to follow a given temperature in the ladder.
RESULTS AND DISCUSSION
Antiparallel G-hairpins can readily participate in folding of GQs
We have carried out a series of standard 200-ns-long simulations of 24 different lateral-loop antiparallel G-hairpins using the bsc0χOL4ϵζOL1 DNA AMBER force field, in order to investigate their basic structural stability. We have carried out 6 independent simulations for each hairpin, resulting in 144 simulations with aggregate time 28.8 μs. Although we cannot claim that it provides a converged statistics, it should be sufficient to eliminate the most random results. Each simulation was carried out with different combination of water model/ion parameters. However, we consider all six simulations as equivalent, as we noticed no systematic effect of the solvent parameters on the simulation outcome. The simulation time scale was chosen to observe a representative spectrum of structural transitions and should be sufficient to decide if a given hairpin is intrinsically stable enough (having sufficient life-time) to directly participate in GQ folding pathways. This part of our work essentially monitors the unfolding rate of the folded G-hairpins. Note that the life-time of the G-hairpins in the simulations may be underestimated by the force field and thus we estimate rather the bottom border of their lifetimes (61). Nevertheless, our results should correctly reflect their relative structural stability order.
The simulations suggest that antiparallel G-hairpins are stable enough to participate in the folding pathways of GQs. At the same time, the simulations reveal exceptionally rich dynamics including entirely stable simulations, various local perturbations, transitions between narrow and wide groove G-hairpins, spontaneous formation of diagonal loop G-hairpins, strand slippage movements, complete unfolding events and even rare re-folding events. The dynamics is sensitive to the flanking nucleotides.
The basic results are summarized in Tables 1–3. We divide the simulations into three groups: (i) Simulations with at least two initial cWH GG base pairs present at the simulation end. This means either completely stable structures or simulations with a transient reversible visit of geometries different from the starting one. This group of simulations is referred to as stable simulations further in the text. (ii) Simulations with perturbed (compared to the start) structures at the end, nevertheless still keeping an overall G-hairpin-like structure; these include diagonal G-hairpins with trans Watson–Crick/Watson–Crick (tWW) (20) base pairs, hairpins that underwent narrow → wide groove transition (or vice versa), locally misfolded hairpins with strand slippage, etc. (iii) The last group includes disrupted (i.e. unstable, unfolded) structures, usually unwound ss-helices or coils. The first group of structures would be competent to promote folding to the desired topology within the context of the full sequence while the second group would need only minor rearrangements to do so. The third group is unsuitable for any immediate folding attempt. 73, 38 and 33 simulations were classified as stable, perturbed and unstable, respectively.
Table 1. Outcome of all 48 simulations (200 ns each) of the antiparallel d[AGGGTTAGGG] hairpin with the bsc0χOL4ϵζOL1 force field.
Groove width | s/a pattern | Water and ionic parametersa | |||||
---|---|---|---|---|---|---|---|
5′-asa–-sas-3′ | 5′-saa–-ssa-3′ | 5′-ssa–-saa-3′ | 5′-sas—asa-3′ | ||||
Narrow | 3 O | 1,2,3 tW* | 3O*;1,2,3 tW* | P: 1 O | TIP3P | NaCl | J&C |
3 O* | 2 tW | 3 tW* | TIP3P | KCl | J&C | ||
3 O* | 2 tW | TIP3P | NaCl | Aq. | |||
3 O* | U: S5 | SPC/E | NaCl | J&C | |||
3 O | SPC/E | KCl | Aq. | ||||
P: 1,2,3 tW | 2,3 tW* | U: 1,2,3 O | SPC/E | KCl | Dang | ||
Wide | U: S5 | U: S5 | U: 1,2,3 O*; S5 | U: 1,2,3 O | TIP3P | NaCl | J&C |
P: S5 | 1,2,3 tW | U: 1,2,3 O | P: 2,3 O* | TIP3P | KCl | J&C | |
3 O | U: 1,2,3 tW | See text | U: 2,3 tW*; 1,2,3 O | TIP3P | NaCl | Aq. | |
U: S5 | P: 1,2,3 tW | U: 1,2,3 O | SPC/E | NaCl | J&C | ||
U: 1,2,3 O | U: 1,2,3 O | SPC/E | KCl | Aq. | |||
U: 2,3 O; 1 tW | P: 1 O; 2,3 tW | P: G; 1 O | SPC/E | KCl | Dang |
Final structures in the 12 × 4 matrix are classified in the following way: no text or regular font text mean that the final structure resembles the starting structure with at least two native cWH GG pairs at the end of the simulation (loss of one base pair is tolerated). ‘P:’ stands for any other hairpin-like structure at the end (perturbed structures, see the text). ‘U:’ means entire disruption of the hairpin, e.g. unwinding to ss-helix or coil formation. Characteristic structural rearrangements in the simulations are described by the following symbols: ‘O’ means opening (loss) of a particular GG pair (1–closing the loop, 2–middle, 3–terminal pair), ‘tW’ means transition of the GG pair from cWH to tWW geometry, ‘S5’ and ‘S3’ mean strand slippage towards the 5′- or 3′-terminus with base pairing shifted by one base (Figure 3 depicts example of a structure with S5 slippage), ‘G’ means conversion from initial narrow groove to wide groove or vice versa (see Figure 2). ‘*’ denotes that the particular rearrangement was reversible and the initial cWH pairing was restored. Rearrangements shown for the unstable (U:) trajectories are always temporary, i.e. we mark those seen before the entire loss of the structure. If more structural changes occurred, they are divided by ‘;’ and ordered chronologically.
aJ&C: TIP3P-adapted or SPC/E-adapted Joung & Cheatham parameters; Aq: Amber-adapted Aquist Na+ with Dang & Smith Cl- parameters; Dang: Dang K+ with Dang & Smith Cl- paremeters.
Table 3. Outcome of 48 simulations (200 ns each) of the antiparallel d[AGGGTTAGGGA] hairpin.
Groove width | s/a pattern | Water and ionic parametersa | |||||
---|---|---|---|---|---|---|---|
5′-asa–-sas-3′ | 5′-saa–-ssa-3′ | 5′-ssa–-saa-3′ | 5′-sas—asa-3′ | ||||
Narrow | TIP3P | NaCl | J&C | ||||
3 O | U: 2,3 O | U: 3 O; 1,2,3 tW | U: 1,2,3 tW | TIP3P | KCl | J&C | |
P: 1*,2,3tW; 3O* | U: 1,2,3 tW; G | 1,2,3 tW* | U: 1,2,3 tW | TIP3P | NaCl | Aq. | |
3 O | SPC/E | NaCl | J&C | ||||
3 O | P: 2,3 O | SPC/E | KCl | Aq. | |||
3 O | P: 1,2,3 tW | 2,3 tW* | U: See text | SPC/E | KCl | Dang | |
Wide | 3 O | 1,2,3 tW* | P: S3 | U: 1 O | TIP3P | NaCl | J&C |
3 O | P: 1,2,3 tW | S3*; 1*,2*,3O | TIP3P | KCl | J&C | ||
U: 2,3 O | P: 1,2,3 tW | P: S3 | P: S5 | TIP3P | NaCl | Aq. | |
3 O | P: 1,2 O | SPC/E | NaCl | J&C | |||
3 O | P: 3 O; S3* | P: 1 O | SPC/E | KCl | Aq. | ||
U: 3 O | U: S3 | U: 1 O | SPC/E | KCl | Dang |
Length of the simulations is 200 ns. See the legend of the Table 1 for further details.
aJ&C: TIP3P-adapted or SPC/E-adapted Joung & Cheatham parameters; Aq: Amber-adapted Aquist Na+ with Dang & Smith Cl- parameters; Dang: Dang K+ with Dang & Smith Cl- paremeters.
d[AGGGTTAGGG] antiparallel G-hairpin
Table 1 summarizes the simulations of the d[AGGGTTAGGG] hairpin. Its narrow groove variant displayed remarkable stability for the naturally occurring 5′-asa—sas-3′, 5′-saa—ssa-3′ and 5′-ssa—saa-3′ patterns, known from existing GQ atomistic structures. 17 simulations were stable, 1 simulation perturbed and none was unstable. Nevertheless, the structures were not rigid and exhibited rich local dynamics. The ‘non-native’ 5′-sas—asa-3′ pattern was less stable, with 2 simulations entirely disrupted and 1 simulation perturbed.
The wide groove d[AGGGTTAGGG] hairpin was visibly less stable for all four studied s/a patterns. The 5′-saa—ssa-3′ pattern had a tendency to form tWW GG base pairs. Still, 2 of its 6 simulations resulted in a complete structure loss and 1 into a modest perturbation. The remaining three s/a patterns entirely unfolded in 3–5 of the 6 individual simulations (Table 1).
d[AGGGTTAGGGT] antiparallel G-hairpin
Among the narrow groove hairpins, only the 5′-asa—sas-3′ pattern remained completely stable in all 6 simulations (Table 2). The other narrow hairpin s/a patterns showed tendency to change the cWH GG pairing into the tWW diagonal G-hairpin pairing, in 12 out of 18 simulations. In 3 simulations of the ‘non-native’ 5′-sas—asa-3′ pattern, the rearrangements proceeded further into the wide groove G-hairpin (Figure 2), i.e. we evidenced a spontaneous rearrangement of the structure from narrow to wide G-hairpin through a diagonal hairpin. Important factor affecting the simulations was a formation of the terminal canonical cWW AT base pair between the first and last nucleotide of the whole sequence. This event always preceded the transition from cWH into tWW GG pairing; the stable 5′-asa—sas-3′ narrow groove pattern never formed the terminal AT base pair. The role of the AT base pair is a nice example how a specific interaction with non-G bases can modulate structural dynamics of G-hairpins at the atomistic level, in this particular case the balance between narrow groove, diagonal and wide groove antiparallel hairpins.
Table 2. Outcome of all 48 simulations (200 ns each) of the antiparallel d[AGGGTTAGGGT] hairpin.
Groove width | s/a pattern | Water and ionic parametersa | |||||
---|---|---|---|---|---|---|---|
5′-asa–-sas-3′ | 5′-saa—ssa-3′ | 5′-ssa—saa-3′ | 5′-sas—asa-3′ | ||||
Narrow | P: 2,3 tW; 1 O | P: 2,3 tW; G* | P: 1,2,3 tW | TIP3P | NaCl | J&C | |
P: 1,2,3 tW | P: 1,2,3 tW | P: 1,2,3 tW; G* | TIP3P | KCl | J&C | ||
P: 2,3 tW | P: 1,2,3 tW | P: G; 1 O | TIP3P | NaCl | Aq. | ||
SPC/E | NaCl | J&C | |||||
P: 2,3 tW | SPC/E | KCl | Aq. | ||||
P: 1,2,3 tW | P: G; 1 O | SPC/E | KCl | Dang | |||
Wide | U: 1,2 tW | P: 1,2,3 tW | U: S5*; 1,2,3 O | TIP3P | NaCl | J&C | |
U: S5 | P: S5 | U: 1,2,3 O | TIP3P | KCl | J&C | ||
3 O | P: 1,2,3 tW | U: S5 | P: S5 | TIP3P | NaCl | Aq. | |
3 O | U: S5 | SPC/E | NaCl | J&C | |||
U: S5 | P: S5 | P: 1 O | SPC/E | KCl | Aq. | ||
U: 1,2,3 O | P: S5 | P: 1,2 O | SPC/E | KCl | Dang |
Length of the simulations is 200 ns. See the legend of the Table 1 for further details.
aJ&C: TIP3P-adapted or SPC/E-adapted Joung & Cheatham parameters; Aq: Amber-adapted Aquist Na+ with Dang & Smith Cl− parameters; Dang: Dang K+ with Dang & Smith Cl− paremeters.
For wide groove d[AGGGTTAGGGT] hairpins, strand slippage was observed in 9 out of 24 simulations. The structures slipped towards the 5′-end can be stabilized by formation of a base pair between the first G of the second G-stretch (G8) and the first T of the loop (T5) (Figure 3) or a base-phosphate interaction between amino group of G8 and the T6 phosphate group. Out of the four s/a patterns, the 5′-saa—ssa-3′ variant was the most stable, with no unfolding and only one case of strand slippage. The patterns 5′-asa—sas-3′ (the most stable one in the narrow variant, see above) and 5′-sas—asa-3′ were the least stable (Table 2).
d[AGGGTTAGGGA] antiparallel G-hairpin
The narrow groove 5′-asa—sas-3′ and 5′-ssa—saa-3′ patterns were the most stable, each of them in 5 simulations (Table 3). They formed stacking of the terminal As on each other and on the terminal GG pair. The other two s/a patterns were perturbed or stable to approximately the same extent. Formation of the diagonal loop with tWW GG pairing was observed in 8 simulations, almost evenly distributed between the s/a patterns. One large-scale refolding event after initial structure loss was observed (see below). The wide groove hairpins exhibited roughly equal stability as in case of the d[AGGGTTAGGGT] hairpin. The 5′-saa—ssa-3′ pattern never unfolded, with formation of tWW GG pairs in half of the simulations. The other three s/a patterns unfolded in two simulations each.
Set of interactions stabilizing the 5′-xxa—sxx-3′ lateral-loop patterns
The simulations suggest that, regardless of the flanking residues, the non-native 5′-sas—asa-3′ lateral loop pattern suffers from opening of the GG base pair adjacent to the loop (the closing pair) while the native patterns are stable. In case of the narrow hairpins, it can be explained as follows. If the first G after the loop is syn-oriented, it usually forms intra-nucleotide hydrogen bond between its amino group and its phosphate group (Figure 4). This can stabilize all 5′-xxa—sxx-3′ patterns, but none of the 5′-xxs—axx-3′ patterns. This hydrogen bond is usually present in full GQs with narrow grooves (e.g. in 143D, 2MBJ, 2HY9, 2GKU and 2JSM) and can be considered as a native GQ interaction. Such intra-nucleotide base-phosphate interaction sometimes appears, albeit with much lower population, also within the second or terminal syn G in the second G-stretch.
Further stabilization of the 5′-xxa—sxx-3′ patterns may come from two additional hydrogen bonds of the first T, namely between T(O4) and the anti-oriented G(N2) preceding the loop, and between the T(O2) and the loop A(N6) (Supporting Supplementary Figure S3). These two H-bonds are probably relatively weak because they are not fully linear. They are rather rarely seen in the experimental structures, e.g. in 143D and 2JSM. This may be due to the fact that the A is, in fully folded GQs, often paired with bases from other loops. Nevertheless, the shape of the lateral loop in our stable G-hairpin simulations is very similar to the shape of the loops associated with narrow grooves of almost all human telomeric GQs experimental structures, except of 2JPZ. Perhaps, the weak T(O4)–G(N2) and T(O2)–A(N6) hydrogen bonds seen in our G-hairpin simulations are disrupted upon formation of more stable native interactions in later GQ folding stages. The structure of the loop is also supported by stacking of the middle T with the A (Supplementary Figure S3), which is seen also in complete GQs. Thus, some ‘native’ G-hairpins may be weakly favoured during the folding and some of their native interactions may locally influence the folding free energy landscape in the early stages of folding. Such specific molecular interactions may contribute to the GQ topology rules, see (22) for a recent review.
Cation binding in antiparallel G-hairpins
The simulations reveal notable cation binding sites around the G-hairpins. Two sites are associated with the -sa- GpG steps: a carbonyl binding site in both narrow and wide hairpins, equivalent to the channel binding site between two quartets in GQs, and a narrow groove binding site, where the cation is located between phosphates in the narrow groove (Figure 4). We have already observed the latter binding site in G-triplex intermediates (61) and in long simulations of GQs (unpublished data). Occupation of both sites by Na+ is roughly equal when the -sa- GpG step is adjacent to the loop. The narrow groove binding site is weaker when the -sa- GpG step involves the terminal GG pair, perhaps due to greater fluctuations of the terminal nucleotides affecting spatial orientation of the phosphate groups. Potassium cations were attracted to the carbonyl binding site more than to the narrow groove site, with Dang parameters leading to the weakest groove binding. This behaviour could be attributed to smaller radii of the Na+ compared with K+. Nonetheless, the cation binding does not seem to be critical for stability of narrow hairpins. We observed stable narrow hairpins in the no-salt simulation of the hypothetical Q1 chair GQ reported below.
No significant groove binding was observed for wide hairpins and diagonal hairpins with tWW base pairs, if formed in the simulations. All antiparallel hairpin geometries, narrow, wide, diagonal and even misfolded with slipped strands could temporarily bind cations in the loop region, depending on the actual spatial arrangement of bases in the loop. In summary, the simulations did not find any major difference between NaCl and KCl simulations which could explain the experimentally known effect of the ions on the equilibrium GQ structures. However, this result can be affected by the short time scale of our simulations and inability of the force field to capture the difference. In addition, it is possible that the effect is not deducible from properties of isolated G-hairpins. Perhaps, the subtle differences between the Na+ and K+ when bound to the narrow groove and carbonyl binding sites may potentially contribute to the sensitivity of the thermodynamically preferred GQ topologies to the type of the ion. However, quantifying this issue would require different types of computations and would be exceptionally difficult.
Why is the first G in a narrow groove following a lateral loop usually syn?
As noted above, antiparallel hairpins with the naturally occurring s/a patterns were found to be more stable than the ‘non-native’ variant. This might indicate that they are better poised to serve as intermediates in GQ folding pathways (40,41,56,72,73). Possibly, their higher intrinsic stability may even be transferred to the full GQ structures. The observations of cation binding support previous calculations predicting the binding site on the carbonyl side of -sa- GpG steps to be preferred over the -aa-, -ss- and -as- steps (56). It is known that the first nucleotide after narrow lateral loop in GQs is usually syn (22). We speculate that it could be explained by the potential of the -sa- GpG steps to bind cations in narrow groove, the above-explained capability of syn-oriented Gs to form internal hydrogen bond between the amino and phosphate groups and perhaps formation of the two above-described weaker H-bonds within the loop.
Diagonal antiparallel G-hairpins are viable intermediates
Diagonal hairpins are antiparallel hairpins with tWW GG base pairs. Although all our simulations started with lateral G-hairpins with cWH GG base pairs, we observed spontaneous sampling of the diagonal G-hairpins in many simulations (Tables 1–3). Detailed population of different hairpins is given in the Supporting Information Appendix Tables SA1–SA24. The diagonal G-hairpins compete mainly with the wide groove G-hairpins, though we have also seen simulations where diagonal hairpins served as intermediates in the interconversion between narrow and wide groove hairpins (Figure 2); both directions were sampled in the whole simulation set. Thus, there has been no need to start additional simulations using the diagonal tWW G-hairpin arrangement. We conclude that antiparallel G-hairpins can readily support formation of narrow lateral, wide lateral and diagonal loops, allowing easy rearrangements between them. All these G-hairpins can easily participate in GQ folding and rearrangements. Role of diagonal hairpins in folding of the hybrid GQ has been suggested (43).
Parallel G-hairpins with propeller loops are entirely unstable in simulations
Out of the 19 individual 200 ns simulations of parallel G-hairpins with TTA loops, 18 unfolded; 9 even within the first 10 ns. The behaviour was independent of the used force field version (Supplementary Table S2). The most common unfolding pathway started with counter-rotation of the strands into structures with roughly perpendicular G-stretches (Figure 5). Then, it proceeded into various coil or ss-helix geometries. Once significantly unfolded, the parallel hairpin never reformed. All five simulations of the studied single-nucleotide propeller loop G-hairpin also unfolded.
When combining the present results with earlier studies of other systems (54,60,64,78,100), we see the following simulation behaviour of parallel G-hairpins. They are entirely stable in simulations of full GQs stabilized by the channel ions, even in absence of the bulk ions (78). However, they are very unstable in all other situations, namely, in simulations of GQs lacking the channel ions (54,60), in G-triplexes even in excess salt conditions (61) and in isolation (the present work). This indicates that the propeller loop structures are held by the GQ core and do not contribute to the stability of the GQ structure. This simulation picture indirectly contrasts with the abundance of the propeller loops in known experimental structures of GQs. The single nucleotide propeller loop is actually supposed to be one of the most stable loops (101–105).
In the whole simulation set we observed one transformation of a parallel G-hairpin with TTA propeller loop and one with the single nucleotide propeller loop into antiparallel arrangements. These transitions were not accompanied by any syn ↔ anti flips and thus did not result in perfect G-hairpins. The 5′-saa—saa-3′ hairpin formed a slipped structure with two cWH GG base pairs. Such antiparallel hairpin could straightforwardly serve as an intermediate in formation of GQs with two quartets. The 5′-saa-saa-3′ hairpin adopted an antiparallel structure stabilized by tSS-like (trans sugar edge/sugar edge) GG pair closing the loop, tWH GG base pair in the middle and terminal tWW GG pair (Supplementary Figure S4). These two simulations directly visualize that the parallel structures are less stable than the antiparallel arrangements.
The reasons of the instability of the parallel G-hairpins in simulations are unclear. One option is that it is a genuine property of these hairpins. However, then their common presence in the experimental structures would be non-trivial, and their formation could be one of the bottlenecks of the folding attempts of the individual molecules. Alternatively, some approximations of the force field could under-stabilize the propeller loops in simulations. It could be caused by inaccurate description of the backbone substates needed for the chain-reversal topology (100) or of the interactions between the closely spaced phosphate groups. Although the DNA backbone dihedral potentials have been extensively optimized in the past, the simple pair-additive DNA force field model remains imperfect, as demonstrated by comparison with quantum chemical benchmarks (24,106). Our ability to tune the force field by simple refinements of the essentially unphysical dihedral potentials is quite limited (63).
Absence of loops introduces instability
Simulations of the cWH paired d[GGG]2 duplexes without any loop and flanking residues are unstable, regardless of being parallel or antiparallel (Supplementary Table S3). Perhaps, instability of the cWH d[GGG]2 duplexes is promoted by introduction of another DNA terminus by the loop excision, which increases susceptibility of such a short duplex to unfold due to simulation end effects (107). Nevertheless, instability of d[GGG]2 duplexes does not pose any problem to construct the intramolecular GQ folding pathways. Further details are described in Supporting Information. A plausible model of formation of intermolecular GQs lacking loops does not require cWH paired duplexes as intermediates (58).
Single stranded helices tend to form misfolded antiparallel hairpins in standard simulations
Four standard simulations of d[GGGTTAGGG] and two of d[AGGGTTAGGG] starting from the unfolded (straight) single stranded helix with different s/a patterns were done (Supplementary Table S4). They exhibited a tendency to bend towards the antiparallel-hairpin-like arrangements. The simulations resulted into a variety of misfolded structures with strand slippage. None of them was a properly folded hairpin with three GG pairs and a TTA loop, even though some of the helices started with manually pre-ordered self-complementary s/a GQ pattern. Nevertheless, the simulation with 5′-saa—ssa-3′ pattern attempted a move towards the right G-hairpin. It has formed the lateral TTA loop with the correct adjacent cWH GG base pair. However, further progression of the folding was blocked by stacking and base-phosphate interactions of the other Gs (Supplementary Figure S5). This is an atomistic example how non-native interactions enhance ruggedness of the folding landscape. The initial pre-folding event occurred at 900 ns. We have prolonged the simulation to 1500 ns, but no further development has been seen. Further details about the simulations are given in Supporting Information.
Rare refolding events observed in simulations of the antiparallel G-hairpins
As explained above, a few dozens of simulations of antiparallel G-hairpins resulted into a complete unfolding of the initial structure. We observed two cases of spontaneous re-folding, when essentially the starting conformation has been re-established after entire unfolding. Such simulations provide unique atomistic insights into the potential G-hairpin folding pathways and are in detail described in Supporting Information. However, we still caution that both re-folding events might have been affected by the structural memory effect and the molecules might have been bouncing back from the transition state ensemble without reaching a true unfolded state ensemble (108).
T-REMD simulation of d[GGGTTAGGG] reveals enormous richness of its conformational space with a clear trend to form antiparallel hairpin-like structures
Since standard simulations were not efficient to study folding of the G-hairpins, we attempted an enhanced sampling method. We have selected the temperature-accelerated T-REMD approach, a robust method often considered as benchmark for small systems, such as DNA hairpin loops with canonical B-DNA stem (109,110), UNCG and GNRA RNA hairpin loops (111,112) and RNA tetranucleotides (113,114). Although the T-REMD method does not allow derivation of kinetics of the folding, the reference replica at a given temperature should give the correct thermodynamics (relative population of different species), provided the simulation is converged.
Our T-REMD run simulated 64 independent replicas, each for 1 μs. It has been initiated from a straight ssDNA helix, with each replica initially adopting one of the 64 possible s/a patterns of the G-stretches, to sample distinct regions of the conformational space since the beginning of the simulation. We carefully monitored all 64 replicas at all temperatures, however, we decided to not report all populated structures in detail. It would mean to deal with hundreds of different structures, which is not necessary for the purpose of the present work.
Although the simulation was certainly still far from been rigorously converged, we have achieved broad sampling. The fully paired antiparallel G-hairpins were spontaneously reachable, but their population was only about 0.01% (see below). It may reflect complexity of the conformational space (i.e. the T-REMD was still too short), some underestimation of the stability of the ideally folded G-hairpins by the force field (61) or real absence of such structures as dominantly populated species in the equilibrium ensemble. It will be interesting to see if future experiments directly visualize formation of well-paired G-hairpins or rather reveal a complex mixture of diverse structures similar to that predicted by the T-REMD run. Until today, the most direct evidence of formation of stable G-hairpins came from DNA origami experiments by Sugiyama's group (73) and NMR experiments by Plavec's group (115). The first method, however, did not have atomistic resolution while the NMR study predicted dimerization of diagonal hairpins for the Oxytricha nova sequence d[G4T4G4]. The longer G-tracts and the dimerization may bring additional thermodynamic stabilization to the G-hairpins.
Nevertheless, the most important result was that our T-REMD easily reached structures resembling antiparallel G-hairpins, except that they were not fully paired (see below). These structures actually formed ∼17% of population of the reference 278 K replica. Therefore, a broad spectrum of structures that can support lateral and diagonal loops is easily accessible. On the other hand, we have seen absolutely no tendency to form any structures that could lead to parallel G-hairpins with propeller loops; not a single such fluctuation was detected. All these results were consistent with the preceding parts of the paper.
Formation of rather compact bent anti-parallel conformations was a hallmark of all replicas, with a slightly higher propensity of high-temperature replicas to adopt straight conformations. In order to monitor the global bending, we measured the end-to-end distance, i.e. the distance between centres of mass of the terminal guanines. Figure 6 summarizes the population of the bent and straight conformations in the reference 278 K replica as well as the populations summed over all temperatures. The minor population of the end-to-end distance of 28–30 Å corresponding to the straight single strand conformation (region III in Figure 6) was well separated from the other geometries. The other structures were various types of bent conformations. They included conformations with stacked terminal Gs, either similar to the GQ-like hairpin helix with stacked rather than base-paired terminal guanines or a ring-like conformation with two bends and stacked terminal bases (region I in Figure 6), different types of base-pairing of the terminal guanines (region II in Figure 6) and conformations, where the terminal Gs did not directly interact (also region II in Figure 6) such as structures with inter-strand stacking in hairpin-like conformation with shifted base pairing within the stem. Further analysis (see Supporting Information) revealed high propensity of bending in the TTA region at low temperatures, while at high temperatures the preference was less pronounced (Figure 7). Formation of the tWW or cWH GG pair (both wide and narrow groove geometry) adjacent to the loop was found to be the most frequent out of all possible GG base pairs in the reference replica (Supplementary Table S5), with 17.3% population. All such structures can be considered as the first step towards forming the antiparallel G-hairpins, although further zipping to two and three base pairs was seen with only ∼0.16% and ∼0.01% population, respectively.
Our T-REMD achieved sufficient amount of syn ↔ anti transitions (Supplementary Figure S2), in contrast to the standard simulations. REMD accelerated the sampling by temperature, so at higher temperatures, it is more likely to overcome the potential energy (enthalpic) barriers and achieve syn ↔ anti transitions. The total population of s:a orientations was almost uniformly ∼40:60 for G2, G3, G7, G8 and G9. The 5′-terminal guanine G1 revealed swapped preference favouring syn orientation in rate ∼80:20, stabilized by a 5′-OH…N3 intramolecular hydrogen bond (23,24). Using these populations, we can calculate idealized populations of all 64 s/a patterns and compare them with the actual T-REMD results. Idealized populations would be achieved when the guanines sample the syn ↔ anti transitions independently. We indeed found that with increasing temperature of the replicas, the observed s/a patterns converged to the ideal populations. So, at high temperatures the s/a orientations of Gs were mutually independent and mostly driven by entropy, guaranteeing proper sampling of the pool of all s/a combinations. On the other hand, we found several s/a patterns that were significantly more populated at low temperatures compared to the idealized populations, e.g. 5′-sss—asa-3′, 5′-sas—saa-3′, 5′-saa—saa-3′ and 5′-sas—asa-3′ (Supplementary Table S6). Interestingly, only the last s/a pattern could form an antiparallel hairpin compatible with a three-quartet GQ. We analyzed clusters of the base-pairing interactions for these four s/a patterns and we indeed found that the anomalously well populated s/a patterns could be explained by specific interactions and structures (Supplementary Figure S6). We describe further details of the T-REMD simulation in Supporting Information.
Hypothetical idealized basket-type GQ folding pathway
In the above paragraphs we emphasized that the simulations suggest that GQ folding is an extremely multi-pathway process over exceptionally rugged free energy surface. In this paragraph we show that the simulation results can be used to suggest idealized (simplified, few-state) pathways in a similar manner as when interpreting experiments. We construct a pathway from single strand to three-quartet basket-type topology lacking the problematic propeller loops. We simulated set of triplex structures and a hypothetical three-quartet chair-type GQ, marked as Q1. Q1 has never been observed in atomistic experiments, but it may be populated during the folding. The model is summarized in the Figure 8. We have simulated Q1, T1, T2 and T3 structures. T4 has been spontaneously formed. 143D is the observed GQ and properties of structures H1–H24 can be deduced from the simulations of the G-hairpins. Q2 is a hypothetical basket variant with opposite strand progression compared to 143D and we try to explain why it is not formed.
The simulations of T1 displayed tendency to change the lateral wide loop into a diagonal loop. After 2 µs, two simulations revealed two triads rearranged in such a way that one loop shifted from the initial lateral wide into diagonal arrangement. The triad adjacent to the loop adopted an intermediate triangle geometry (61) in one simulation and did not change in the other simulation (Supplementary Figure S7). This transition is consistent with our previous triplex study (61). T2 remained stable in two simulations and in one simulation one G-stretch detached while the remaining narrow groove hairpin stayed stable. In contrast, T3 exhibited instability of its diagonal loop in all nine simulations. In three of them, one G-stretch unwound so that the narrow groove hairpin remained. In the remaining six simulations the triplex partly or fully converted its diagonal loop into lateral wide loop, leading to the T3 → T2 conversion. As the A1–T11 base pair adjacent to the diagonal loop in the starting structure could destabilize the diagonal loop, we have manually disrupted the base pair in six simulations. However, the T3 diagonal loop remained unstable. We speculate that the T3 triplex is less stable due to a less favourable topology of the backbone, as in T3 the TTA loop starts from the narrow groove while in T4 (and 143D) it emerges from the wide groove; these two arrangements are topologically different. Although we cannot rule out some other reasons of T3 instability, it seems that the T3 triplex is less stable than the other triplexes and tends to convert to T2.
Since the hypothetical Q1 chair GQ is stable in standard simulations with full ion binding, we applied the unfolding (denaturing) no-salt simulation (60,78). In the first simulation the Q1 was split into a combination of two narrow hairpins H12. The second simulation resulted into a triplex-like intermediate. We then tried several standard excess salt simulations, in order to stabilize the perturbed structures. Four of them continued to unfold, one refolded back into Q1 and one converted into the T4 structure, i.e. in a direction towards the 143D basket.
When combining these results with the data for the antiparallel G-hairpins, we construct the following scenario for folding of the basket 143D GQ (Figure 8) while avoiding the alternative Q2 structure. The sequence of the GQ 143D is d[A(GGGTTA)3GGG] with the 5′-asa—sas—asa—sas-3′ pattern. Considering stability of the narrow hairpins H1 and H2 in the simulations of d[AGGGTTAGGGT] and d[AGGGTTAGGG] with the 5′-asa—sas-3′ pattern, we suggest that these two G-hairpins are stable enough to allow further folding with a sufficient probability. If both hairpins are formed simultaneously, they can merge together to form the chair-like GQ Q1. When only one of the two hairpins is present, it can bind an additional G-stretch with a compatible s/a pattern to form either T1 or T2 triplex. Contribution of wide hairpins of these two sequences and s/a patterns to folding is less likely due to their low stability, revealed in the first part of this study. Thus, we speculate that all intermediates containing the wide hairpin variant, including H13 and H24, are less probable. Because the T3 triplex is unstable and spontaneously changes into T2, neither T3 nor H24 can proceed to the GQ Q2. Q2 actually contains the same unstable lateral narrow → diagonal loop order which makes T3 unstable. On the other hand T1 eventually changes its lateral wide loop into diagonal, giving rise to T4. T4 can then pick up the last free G-stretch to form the stable 143D GQ. T1 and T2 can interconvert via unfolding or via the Q1 GQ, so T2 either gradually unfolds or forms Q1 and then follows the track of T1 (Figure 8). The model explains why 143D is formed more likely than Q2.
Comment on propeller loops
Idealized folding pathways of the hybrid GQ topologies proposed in the literature usually form the propeller loop at the very end (41,42,54,56,116), which is in accord with their instability observed in simulations here and earlier (54,60,61). These pathways begin with formation of antiparallel hairpins and here we show that G-hairpins with s/a patterns corresponding to the hybrid GQs (wide groove 5′-saa—ssa-3′ and narrow groove 5′-ssa—saa-3′) are stable enough to participate in the folding process. More problematic could be formation of the all-parallel GQ with three propeller loops. Perhaps, it could be structured using a mechanism suggested for intermolecular GQs, i.e. initial coil-like ensemble of four strands, formation of first ion-stabilized quartet, and subsequent structuring of the molecule via strand slippages (58).
Comment on the Oxytricha nova GQ
Common models of GQ folding assume consecutive additions of G-strands. Likewise, most of our calculations investigated possibilities of such scenarios. However, recent experiments by Plavec's group suggested yet another possibility, which can further complicate the folding landscape. They have suggested a prefolded four-stranded structure in absence of the ions with two weakly interacting diagonal hairpins, which quickly converts into the GQ after adding ions. This has been proposed based on an NMR study for dimeric Oxytricha nova GQ d[G4T4G4]2 (115). Such intermediate also appeared in our preliminary simulation of no-salt induced unfolding of this GQ (data not shown). It would be exceptionally interesting to clarify if analogous prefolded structures may form also for systems having less than four quartets.
Limitations of idealized folding pathways
The folding model depicted in the Figure 8 is simplified. Its entirely unrealistic prerequisite is that the s/a patterns of the approaching components are always the native ones. This simplification is also assumed in similar pathways presented in the literature (42,56). The H1 → T1 → T4 → 143D pathway without inclusion of any other stable intermediates is likely part of the real folding process, but ignores interference from off-pathway intermediates with different s/a patterns, including alternative three- and two-quartet GQs. Complexity of folding emerges even for our simple model not allowing for any changes of the s/a pattern. The alternative H2 → T2 → Q1 → T1 pathway contains the potential intermediate Q1, a chair GQ, which likely would be a long living intermediate slowing down the overall native GQ folding rate as a deep competing basin of attraction, i.e. a stable misfolded state.
As stated elsewhere, it is likely that instead of pure intermediates, such as H1 or T1, complex ensembles of structures are present during folding (43,117,118). MD simulations show atomistic models of many stable misfolded hairpins, triplexes and GQs (60,61). It indicates that the real process is slowed down by numerous intermediate structures with variable populations and lifetimes. It is easy to imagine a misfolded GQ which needs to unbind one G-stretch, flip syn and anti orientation of guanines and undergo another folding attempt, which might again lead to another misfolded GQ. Transitions between different GQ folds may require movements through the unfolded ensemble (43). Here we wish to point out that the differentiation between what is unfolded and what is misfolded structure may depend on the time and structural resolution of a given method. Let us assume binding of an unbound G-stretch to a triplex, in order to form a specific three-quartet GQ. If we assume that the s/a pattern of the unbound G-stretch is independent of the neighbouring nucleotides, then the probability of having a G-stretch with the s/a pattern complementing the triplex is 1/8, since three Gs can adopt 23 = 8 possible s/a patterns. The non-native s/a patterns may lead to formation of a stable two-quartet GQ with strand slippage. Moreover, even with the correct s/a pattern the G-stretch approaching the triplex must land correctly, i.e. every G must approach a G-triad; otherwise locally misfolded structures can form. The G-strand approach is likely affected by the connecting loop. Also locally misfolded G-hairpins can misfold the whole structures, going from a misfolded hairpin through a misfolded triplex to a misfolded GQ. This adds complexity in a form of other parallel pathways. Therefore, any idealized pathway shows only one possibility and in the best case could represent a ‘master pathway’, along which other misfolded intermediates occur. Taking into account the slowness of the GQ folding, we suggest that only a tiny fraction of the individual molecules reaches the native basin (fold) directly via the master pathway (51,119). Majority of them likely become trapped in different folds rather than staying truly unstructured and need to unfold before taking another folding attempt.
The idealized folding pathway proposed above also does not take into account a non-specific collapse (51), in which denatured oligonucleotides form a ‘coil’ not resembling the GQ and then slowly by diffusion and random rare transitions progress towards the native or misfolded GQ. In fact, it is even possible that most of the individual oligonucleotide strands undergo some non-specific collapse, rather than following a more-or-less well defined idealized folding pathways with a series of distinct intermediates. Our simulations of four-stranded and triplex intermediates located numerous stable (on the simulation time scale) structures stabilized by non-GQ interactions such as structures with perpendicular hydrogen-bonded G-strands, pairing between loop nucleotides and G-stretches, etc. (60,61). Since these two mechanisms may coexist, we can imagine a scenario in which relatively fast folding pathways exchange intermediates with the slow non-specific processes, allowing diffusion between the different major conformational basins. The non-specific diffusive processes could help in structuring the propeller loops.
Our last assumption when constructing the pathway was that all the G-hairpins have similar folding rate constants, so their populations are proportionate to their unfolding rates modelled by their relative lifetimes in the simulations. If this is not the case, then even at first sight less stable hairpins might participate in the folding process, provided their folding rate constants are high enough to keep their sufficient concentrations.
CONCLUSION
Free energy surface of GQ-forming sequences is rugged enough to allow separation of distinct energy minima and studying the corresponding structures independently from the others. G-hairpins were suggested to be the simplest such minima significantly participating in the GQ folding pathways (42,56). We examined their properties by standard MD simulations starting from a diverse set of folded G-hairpins with three cWH base pairs and long T-REMD simulation starting from the unfolded state.
We show that antiparallel G-hairpins can easily form in the earliest stages of human telomeric GQ folding to support lateral (both narrow and wide groove) and diagonal loops, with almost barrier-less rearrangements between them (Figure 2). The experimentally estimated microseconds to millisecond folding range (40–42,72) of G-hairpins is consistent with the simulations. The G-hairpins can also participate in many structural transitions in the later stages of the folding.
Despite a clear trend to form the antiparallel structures, there is no predisposition to fold into three-base-pair hairpins that would correspond to direct folding into specific GQ topologies with specific syn/anti patterns. Further, rather than forming the optimal three-base-pair G-hairpins, the T-REMD simulation reveals a rich zoo of structures, mixture of hairpins, slipped structures, entirely misfolded structures not resembling hairpins, etc. (Figure 6). Nevertheless, antiparallel structures plausible to move towards various G-triplexes and GQs are sufficiently populated and easily accessible.
Our key result is that an ideal unfolded ensemble of d[GGGTTAGGG] populates all 64 possible s/a patterns of the two G-stretches. Thus we predict that an ideal unfolded ensemble of a full human telomeric GQ sequence should populate all 4096 s/a combinations of its four G-stretches. Out of the 4096 s/a patterns, 2336 are compatible with some three- or a two-quartet GQ; the algorithm is given in the Supporting Information and disregards GQs with G-bulges (120). We suggest that a fully unfolded GQ-forming sequence initially folds into a rich spectrum of structures with no selection of the native s/a GQ pattern in the very early stages of folding. There is no sign of any straightforward funnel-like mechanism towards a single GQ topology inherent to the human telomeric sequence. The rich spectrum of various misfolded G-hairpin structures with non-native interactions further supports the view of multi-pathway nature of the GQ folding with the final GQ topology becoming thermodynamically prevalent in a very slow process of countless folding-unfolding events. It is consistent with recent experiments and observed time scales (43), but not consistent with simple single-pathway models.
The complexity of the G-hairpin conformational space gives just a flavour of the intricacy of the conformational space of the full GQ sequence when having unlimited time resolution. This supports the idea of very rough free energy landscape, with only a tiny fraction of structures directly folding towards the native GQ via a fast folding track. Note that even for simple systems such as DNA hairpins with canonical base pairs, a slow folding kinetics compared to ideal semi-flexible polymer was proposed (109,121,122). GQ folding should be incomparably more complex. The GQ folding could be consistent with the kinetic partitioning mechanism (KPM) with a set of deep competing minima on the folding landscape which are well separated from each other, with the native state being one of them (45,119). The number of resolved substates may depend on the resolution of the method. The folding landscape should also include phases of non-specific collapse (NSC) in which the oligonucleotides form a ‘coil’ not resembling the GQ and then slowly by diffusion and random rare transitions progress towards the native or misfolded GQs. The KPM and NSC mechanisms likely coexist and the real folding process can involve a very complex spectrum of structures with diverse populations and lifetimes, from milliseconds to days. The position of the global minimum can be modulated by details of the sequence and specific experimental conditions.
The simulations demonstrate how the flanking nucleotides affect behaviour of G-hairpins, which may be related to influence of flanking nucleotides on the GQ folds in experiments. Narrow groove hairpins seem to be particularly stable if their first G after the loop is syn-oriented because of formation of several stabilizing interactions (Figure 4 and Supplementary Figure S3). Our data are in good agreement with the experimentally known GQ s/a patterns and groove widths they adopt (22), indicating that at least some of the factors affecting the lifetimes of the G-hairpins may be already local native interactions tuning the final relative stabilities of different GQ folds.
In contrast to the antiparallel G-hairpins, parallel G-hairpins with propeller loops are very unstable in the simulations. It indicates that formation of propeller loops during GQ folding is not straightforward, at least compared to the lateral and diagonal loops. We tentatively suggest that the propeller loops could be structured by diffusion and series of consecutive random rare transitions after a non-specific collapse of the full GQ-forming sequence. For this model, however, we do not have a direct simulation support yet.
The present work has also limitations that are inherent to the MD method. Besides the simulation time, the results are affected by the approximate nature of the force field. Two problems could specifically affect quantitative results of this study, although they are unlikely to influence its basic conclusions. First, the technique may underestimate the stability of the cWH GG base-paired G-hairpin stems (61). In this case the population and stability of the fully paired antiparallel G-hairpins would be larger than seen in our simulations, with a smaller role of competing structures. This would further strengthen our basic conclusions. Second, the force field may destabilize the propeller loops. This likely would not change the conclusion that the formation of propeller loops is more difficult relatively to the lateral and diagonal loops, but the propeller loops could be more accessible compared to the present simulation data.
MD simulations of potential intermediates can be used to suggest idealized folding pathways of GQs, similar to those proposed based on experiments (41,42,116,123). Figure 8 shows such pathway for the basket form of the human telomeric GQ. However, if the GQ folding is a multi-pathway process with only a tiny fraction of the individual molecules folding directly to the native fold, such simplified pathways would not reflect the complexity of the real folding landscapes.
We suggest that combination of smart experimental techniques with sophisticated computational approaches is needed to better understand the GQ folding. Atomistic MD provides interesting insights into selected aspects of the GQ folding. However, complexity of the process will likely require combining MD with other approaches, such as high-resolution coarse-grained modelling (124), systematic conformational searches (125) and perhaps other methods. The folding pathways may vary for different GQs and for different experimental conditions having different unfolded ensembles (45). Thus, while we may be able to understand the basic principles of GQ folding, it is less likely that there exists a uniform mechanism valid for all sequences and folding conditions. The intrinsic GQ folding landscapes can be modulated by other molecules including proteins and, when a GQ-folding sequence needs to be first freed from a double helix, also by the kinetics of the duplex melting and release of the G-stretches. These factors may determine which structures are populated on the biochemically relevant time scales. Computational approaches could be used to study the effect of the unfolded ensemble on the folding.
Supplementary Material
SUPPLEMENTARY DATA
Supplementary Data are available at NAR Online.
FUNDING
Czech Science Foundation [P208/11/1822]; project ‘CEITEC–Central European Institute of Technology’ [CZ.1.05/1.1.00/02.0068]; Ministry of Education, Youth and Sports of the Czech Republic [project No. LO1305]; Student Project [IGA_PrF_2015_027 of Palacký University to P.K, P.B., M.O.]; career development grant from the European Organization for Molecular Biology [IG2535] and Marie-Curie Re-integration (ECOPOD) [to L.T.]; European Research Council [S-RNA-S, 306662 to G.B]; Praemium Academiae [to J.S.]. Funding for open access charge: Czech Science Foundation [P208/11/1822].
Conflict of interest statement. None declared.
REFERENCES
- 1.Moyzis R.K., Buckingham J.M., Cram L.S., Dani M., Deaven L.L., Jones M.D., Meyne J., Ratliff R.L., Wu J.R. A Highly Conserved Repetitive DNA Sequence, (TTAGGG)n, Present at the Telomeres of Human Chromosomes. Proc. Natl. Acad. Sci. U.S.A. 1988;85:6622–6626. doi: 10.1073/pnas.85.18.6622. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Wright W.E., Tesmer V.M., Huffman K.E., Levene S.D., Shay J.W. Normal Human Chromosomes Have Long G-rich Telomeric Overhangs at One End. Genes Dev. 1997;11:2801–2809. doi: 10.1101/gad.11.21.2801. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Allsopp R.C., Vaziri H., Patterson C., Goldstein S., Younglai E.V., Futcher A.B., Greider C.W., Harley C.B. Telomere Length Predicts Replicative Capacity of Human Fibroblasts. Proc. Natl. Acad. Sci. U.S.A. 1992;89:10114–10118. doi: 10.1073/pnas.89.21.10114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Engelhardt M., Martens U.M. The Implication of Telomerase Activity and Telomere Stability for Replicative Aging and Cellular Immortality (Review) Oncol. Rep. 1998;5:1043–1052. doi: 10.3892/or.5.5.1043. [DOI] [PubMed] [Google Scholar]
- 5.Aubert G., Lansdorp P.M. Telomeres and Aging. Physiol. Rev. 2008;88:557–579. doi: 10.1152/physrev.00026.2007. [DOI] [PubMed] [Google Scholar]
- 6.Greider C.W., Blackburn E.H. Identification of a Specific Telomere Terminal Transferase Activity in Tetrahymena Extracts. Cell. 1985;43:405–413. doi: 10.1016/0092-8674(85)90170-9. [DOI] [PubMed] [Google Scholar]
- 7.Kim N., Piatyszek M., Prowse K., Harley C., West M., Ho P., Coviello G., Wright W., Weinrich S., Shay J. Specific Association of Human Telomerase Activity with Immortal Cells and Cancer. Science. 1994;266:2011–2015. doi: 10.1126/science.7605428. [DOI] [PubMed] [Google Scholar]
- 8.Neidle S. Human Telomeric G-quadruplex: The Current Status of Telomeric G-quadruplexes as Therapeutic Targets in Human Cancer. Febs J. 2010;277:1118–1125. doi: 10.1111/j.1742-4658.2009.07463.x. [DOI] [PubMed] [Google Scholar]
- 9.Skolakova P., Foldynova-Trantirkova S., Bednarova K., Fiala R., Vorlickova M., Trantirek L. Unique C. elegans Telomeric Overhang Structures Reveal the Evolutionarily Conserved Properties of Telomeric DNA. Nucleic Acids Res. 2015;43:4733–4745. doi: 10.1093/nar/gkv296. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Biffi G., Tannahill D., McCafferty J., Balasubramanian S. Quantitative Visualization of DNA G-quadruplex Structures in Human Cells. Nat. Chem. 2013;5:182–186. doi: 10.1038/nchem.1548. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Lam E.Y.N., Beraldi D., Tannahill D., Balasubramanian S. G-quadruplex Structures Are Stable and Detectable in Human Genomic DNA. Nat. Commun. 2013;4:1796. doi: 10.1038/ncomms2792. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Zahler A.M., Williamson J.R., Cech T.R., Prescott D.M. Inhibition of Telomerase by G-quartet DMA Structures. Nature. 1991;350:718–720. doi: 10.1038/350718a0. [DOI] [PubMed] [Google Scholar]
- 13.Balasubramanian S., Neidle S. G-quadruplex Nucleic Acids as Therapeutic Targets. Curr. Opin. Chem. Biol. 2009;13:345–353. doi: 10.1016/j.cbpa.2009.04.637. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Huppert J.L., Balasubramanian S. G-quadruplexes in Promoters Throughout the Human Genome. Nucleic Acids Res. 2007;35:406–413. doi: 10.1093/nar/gkl1057. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Qin M., Chen Z., Luo Q., Wen Y., Zhang N., Jiang H., Yang H. Two-Quartet G-Quadruplexes Formed by DNA Sequences Containing Four Contiguous GG Runs. J. Phys. Chem. B. 2015;119:3706–3713. doi: 10.1021/jp512914t. [DOI] [PubMed] [Google Scholar]
- 16.Huppert J.L. Four-stranded nucleic acids: structure, function and targeting of G-quadruplexes. Chem. Soc. Rev. 2008;37:1375–1384. doi: 10.1039/b702491f. [DOI] [PubMed] [Google Scholar]
- 17.Duan X.-L., Liu N.-N., Yang Y.-T., Li H.-H., Li M., Dou S.-X., Xi X.-G. G-Quadruplexes Significantly Stimulate Pif1 Helicase-catalyzed Duplex DNA Unwinding. J. Biol. Chem. 2015;290:7722–7735. doi: 10.1074/jbc.M114.628008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Cayrou C., Coulombe P., Puy A., Rialle S., Kaplan N., Segal E., Méchali M. New Insights into Replication Origin Characteristics in Metazoans. Cell Cycle. 2012;11:658–667. doi: 10.4161/cc.11.4.19097. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Paeschke K., Bochman M.L., Garcia P.D., Cejka P., Friedman K.L., Kowalczykowski S.C., Zakian V.A. Pif1 Family Helicases Suppress Genome Instability at G-Quadruplex Motifs. Nature. 2013;497:458–462. doi: 10.1038/nature12149. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Leontis N.B., Westhof E. Geometric nomenclature and classification of RNA base pairs. RNA. 2001;7:499–512. doi: 10.1017/s1355838201002515. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.da Silva M.W. Geometric Formalism for DNA Quadruplex Folding. Chem. Eur. J. 2007;13:9738–9745. doi: 10.1002/chem.200701255. [DOI] [PubMed] [Google Scholar]
- 22.Karsisiotis A.I., O'Kane C., da Silva M.W. DNA Quadruplex Folding Formalism - A Tutorial on Quadruplex Topologies. Methods. 2013;64:28–35. doi: 10.1016/j.ymeth.2013.06.004. [DOI] [PubMed] [Google Scholar]
- 23.Cang X.H., Sponer J., Cheatham T.E. Explaining the Varied Glycosidic Conformational, G-Tract Length and Sequence Preferences for Anti-Parallel G-Quadruplexes. Nucleic Acids Res. 2011;39:4499–4512. doi: 10.1093/nar/gkr031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Sponer J., Mladek A., Spackova N., Cang X.H., Cheatham T.E., Grimme S. Relative Stability of Different DNA Guanine Quadruplex Stem Topologies Derived Using Large-Scale Quantum-Chemical Computations. J. Am. Chem. Soc. 2013;135:9785–9796. doi: 10.1021/ja402525c. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Renciuk D., Kejnovska I., Skolakova P., Bednarova K., Motlova J., Vorlickova M. Arrangements of Human Telomere DNA Quadruplex in Physiologically Relevant K +Solutions. Nucleic Acids Res. 2009;37:6625–6634. doi: 10.1093/nar/gkp701. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Buscaglia R., Miller M.C., Dean W.L., Gray R.D., Lane A.N., Trent J.O., Chaires J.B. Polyethylene Glycol Binding Alters Human Telomere G-quadruplex Structure by Conformational Selection. Nucleic Acids Res. 2013;41:7934–7946. doi: 10.1093/nar/gkt440. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Haensel R., Loehr F., Foldynova-Trantirkova S., Bamberg E., Trantirek L., Doetsch V. The Parallel G-quadruplex Structure of Vertebrate Telomeric Repeat Sequences Is Not the Preferred Folding Topology under Physiological Conditions. Nucleic Acids Res. 2011;39:5768–5775. doi: 10.1093/nar/gkr174. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Haensel R., Loehr F., Trantirek L., Doetsch V. High-Resolution Insight into G-Overhang Architecture. J. Am. Chem. Soc. 2013;135:2816–2824. doi: 10.1021/ja312403b. [DOI] [PubMed] [Google Scholar]
- 29.Kejnovska I., Vorlickova M., Brazdova M., Sagi J. Stability of Human Telomere Quadruplexes at High DNA Concentrations. Biopolymers. 2014;101:428–438. doi: 10.1002/bip.22400. [DOI] [PubMed] [Google Scholar]
- 30.Lim K.W., Amrane S., Bouaziz S., Xu W., Mu Y., Patel D.J., Luu K.N., Phan A.T. Structure of the Human Telomere in K+ Solution: A Stable Basket-Type G-Quadruplex with Only Two G-Tetrad Layers. J. Am. Chem. Soc. 2009;131:4301–4309. doi: 10.1021/ja807503g. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Wang Y., Patel D.J. Solution Structure of the Human Telomeric Repeat d[AG(3)(T(2)AG(3))3] G-tetraplex. Structure. 1993;1:263–282. doi: 10.1016/0969-2126(93)90015-9. [DOI] [PubMed] [Google Scholar]
- 32.Dai J., Carver M., Punchihewa C., Jones R.A., Yang D. Structure of the Hybrid-2 type Intramolecular Human Telomeric G-quadruplex in K+ Solution: Insights into Structure Polymorphism of the Human Telomeric Sequence. Nucleic Acids Res. 2007;35:4927–4940. doi: 10.1093/nar/gkm522. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Dai J., Punchihewa C., Ambrus A., Chen D., Jones R.A., Yang D. Structure of the Intramolecular Human Telomeric G-quadruplex in Potassium Solution: A Novel Adenine Triple Formation. Nucleic Acids Res. 2007;35:2440–2450. doi: 10.1093/nar/gkm009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Parkinson G.N., Lee M.P.H., Neidle S. Crystal Structure of Parallel Quadruplexes from Human Telomeric DNA. Nature. 2002;417:876–880. doi: 10.1038/nature755. [DOI] [PubMed] [Google Scholar]
- 35.Ambrus A., Chen D., Dai J.X., Bialis T., Jones R.A., Yang D.Z. Human Telomeric Sequence Forms a Hybrid-Type Intramolecular G-Quadruplex Structure with Mixed Parallel/Antiparallel Strands in Potassium Solution. Nucleic Acids Res. 2006;34:2723–2735. doi: 10.1093/nar/gkl348. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Lim K.W., Ng V.C.M., Martín-Pintado N., Heddi B., Phan A.T. Structure of the Human Telomere in Na+ Solution: An Antiparallel (2 + 2) G-quadruplex Scaffold Reveals Additional Diversity. Nucleic Acids Res. 2013;41:10556–10562. doi: 10.1093/nar/gkt771. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Abu-Ghazalah R.M., Rutledge S., Lau L.W.Y., Dubins D.N., Macgregor R.B., Helmy A.S. Concentration-Dependent Structural Transitions of Human Telomeric DNA Sequences. Biochemistry. 2012;51:7357–7366. doi: 10.1021/bi300689t. [DOI] [PubMed] [Google Scholar]
- 38.Luu K.N., Phan A.T., Kuryavyi V., Lacroix L., Patel D.J. Structure of the Human Telomere in K+ Solution: An Intramolecular (3 + 1) G-quadruplex Scaffold. J. Am. Chem. Soc. 2006;128:9963–9970. doi: 10.1021/ja062791w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Phan A.T., Kuryavyi V., Luu K.N., Patel D.J. Structure of Two Intramolecular G-quadruplexes Formed by Natural Human Telomere Sequences in K+ Solution. Nucleic Acids Res. 2007;35:6517–6525. doi: 10.1093/nar/gkm706. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Gray R.D., Chaires J.B. Kinetics and Mechanism of K(+)- and Na(+)-induced Folding of Models of Human Telomeric DNA into G-quadruplex Structures. Nucleic Acids Res. 2008;36:4191–4203. doi: 10.1093/nar/gkn379. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Zhang A.Y.Q., Balasubramanian S. The Kinetics and Folding Pathways of Intramolecular G-Quadruplex Nucleic Acids. J. Am. Chem. Soc. 2012;134:19297–19308. doi: 10.1021/ja309851t. [DOI] [PubMed] [Google Scholar]
- 42.Gray R.D., Trent J.O., Chaires J.B. Folding and Unfolding Pathways of the Human Telomeric G-quadruplex. J. Mol. Biol. 2014;426:1629–1650. doi: 10.1016/j.jmb.2014.01.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Bessi I., Jonker H.R., Richter C., Schwalbe H. Involvement of Long-Lived Intermediate States in the Complex Folding Pathway of the Human Telomeric G-Quadruplex. Angew. Chem., Int. Ed. 2015;54:8444–8448. doi: 10.1002/anie.201502286. [DOI] [PubMed] [Google Scholar]
- 44.You H., Zeng X., Xu Y., Lim C.J., Efremov A.K., Phan A.T., Yan J. Dynamics and Stability of Polymorphic Human Telomeric G-quadruplex under Tension. Nucleic Acids Res. 2014;42:8789–8795. doi: 10.1093/nar/gku581. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Long X., Stone M.D. Kinetic Partitioning Modulates Human Telomere DNA G-Quadruplex Structural Polymorphism. PLoS One. 2013;8:e83420. doi: 10.1371/journal.pone.0083420. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Vorlickova M., Kejnovska I., Sagi J., Renciuk D., Bednarova K., Motlova J., Kypr J. Circular Dichroism and Guanine Quadruplexes. Methods. 2012;57:64–75. doi: 10.1016/j.ymeth.2012.03.011. [DOI] [PubMed] [Google Scholar]
- 47.Gray R.D., Li J., Chaires J.B. Energetics and Kinetics of a Conformational Switch in G-Quadruplex DNA. J. Phys. Chem. B. 2009;113:2676–2683. doi: 10.1021/jp809578f. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Palacky J., Vorlickova M., Kejnovska I., Mojzes P. Polymorphism of Human Telomeric Quadruplex Structure Controlled by DNA Concentration: A Raman Study. Nucleic Acids Res. 2013;41:1005–1016. doi: 10.1093/nar/gks1135. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Wang Z.-F., Li M.-H., Hsu S.-T.D., Chang T.-C. Structural Basis of Sodium–Potassium Exchange of a Human Telomeric DNA Quadruplex Without Topological Conversion. Nucleic Acids Res. 2014;42:4723–4733. doi: 10.1093/nar/gku083. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Lee J.Y., Okumus B., Kim D.S., Ha T. Extreme Conformational Diversity in Human Telomeric DNA. Proc. Natl. Acad. Sci. U.S.A. 2005;102:18938–18943. doi: 10.1073/pnas.0506144102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Thirumalai D., O'Brien E.P., Morrison G., Hyeon C. Theoretical Perspectives on Protein Folding. Annu. Rev. Biophys. 2010;39:159–183. doi: 10.1146/annurev-biophys-051309-103835. [DOI] [PubMed] [Google Scholar]
- 52.Narayanan R., Zhu L., Velmurugu Y., Roca J., Kuznetsov S.V., Prehna G., Lapidus L.J., Ansari A. Exploring the Energy Landscape of Nucleic Acid Hairpins Using Laser Temperature-Jump and Microfluidic Mixing. J. Am. Chem. Soc. 2012;134:18952–18963. doi: 10.1021/ja301218e. [DOI] [PubMed] [Google Scholar]
- 53.Li H., Cao E.H., Gisler T. Force-induced Unfolding of Human Telomeric G-quadruplex: A Steered Molecular Dynamics Simulation Study. Biochem. Biophys. Res. Commun. 2009;379:70–75. doi: 10.1016/j.bbrc.2008.12.006. [DOI] [PubMed] [Google Scholar]
- 54.Bian Y., Tan C., Wang J., Sheng Y., Zhang J., Wang W. Atomistic Picture for the Folding Pathway of a Hybrid-1 Type Human Telomeric DNA G-quadruplex. PLoS Comput. Biol. 2014;10:e1003562. doi: 10.1371/journal.pcbi.1003562. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Koirala D., Mashimo T., Sannohe Y., Yu Z.B., Mao H.B., Sugiyama H. Intramolecular Folding in Three Tandem Guanine Repeats of Human Telomeric DNA. Chem. Commun. 2012;48:2006–2008. doi: 10.1039/c2cc16752b. [DOI] [PubMed] [Google Scholar]
- 56.Mashimo T., Yagi H., Sannohe Y., Rajendran A., Sugiyama H. Folding Pathways of Human Telomeric Type-1 and Type-2 G-quadruplex Structures. J. Am. Chem. Soc. 2010;132:14910–14918. doi: 10.1021/ja105806u. [DOI] [PubMed] [Google Scholar]
- 57.Limongelli V., De Tito S., Cerofolini L., Fragai M., Pagano B., Trotta R., Cosconati S., Marinelli L., Novellino E., Bertini I., et al. The G-Triplex DNA. Angew. Chem., Int. Ed. 2013;52:2269–2273. doi: 10.1002/anie.201206522. [DOI] [PubMed] [Google Scholar]
- 58.Stefl R., Cheatham T.E., Spackova N., Fadrna E., Berger I., Koca J., Sponer J. Formation Pathways of a Guanine-Quadruplex DNA Revealed by Molecular Dynamics and Thermodynamic Analysis of the Substates. Biophys. J. 2003;85:1787–1804. doi: 10.1016/S0006-3495(03)74608-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Yang C., Jang S., Pak Y. Multiple Stepwise Pattern for Potential of Mean Force in Unfolding the Thrombin Binding Aptamer in Complex with Sr2+ J. Chem. Phys. 2011;135:225104. doi: 10.1063/1.3669424. [DOI] [PubMed] [Google Scholar]
- 60.Stadlbauer P., Krepl M., Cheatham T.E., Koca J., Sponer J. Structural Dynamics of Possible Late-Stage Intermediates in Folding of Quadruplex DNA Studied by Molecular Simulations. Nucleic Acids Res. 2013;41:7128–7143. doi: 10.1093/nar/gkt412. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Stadlbauer P., Trantirek L., Cheatham T.E., Koca J., Sponer J. Triplex Intermediates in Folding of Human Telomeric Quadruplexes Probed by Microsecond-scale Molecular Dynamics Simulations. Biochimie. 2014;105:22–35. doi: 10.1016/j.biochi.2014.07.009. [DOI] [PubMed] [Google Scholar]
- 62.Kim E., Yang C., Pak Y. Free-energy Landscape of a Thrombin-binding DNA Aptamer in Aqueous Environment. J. Chem. Theory Comput. 2012;8:4845–4851. doi: 10.1021/ct300714u. [DOI] [PubMed] [Google Scholar]
- 63.Sponer J., Banas P., Jurecka P., Zgarbova M., Kuehrova P., Havrila M., Krepl M., Stadlbauer P., Otyepka M. Molecular Dynamics Simulations of Nucleic Acids. From Tetranucleotides to the Ribosome. J. Phys. Chem. Lett. 2014;5:1771–1782. doi: 10.1021/jz500557y. [DOI] [PubMed] [Google Scholar]
- 64.Islam B., Sgobba M., Laughton C., Orozco M., Sponer J., Neidle S., Haider S. Conformational Dynamics of the Human Propeller Telomeric DNA Quadruplex on a Microsecond Time Scale. Nucleic Acids Res. 2013;41:2723–2735. doi: 10.1093/nar/gks1331. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Wei D., Husby J., Neidle S. Flexibility and Structural Conservation in a c-KIT G-quadruplex. Nucleic Acids Res. 2015;43:629–644. doi: 10.1093/nar/gku1282. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Reshetnikov R., Golovin A., Spiridonova V., Kopylov A., Sponer J. Structural Dynamics of Thrombin-Binding DNA Aptamer d(GGTTGGTGTGGTTGG) Quadruplex DNA Studied by Large-scale Explicit Solvent Simulations. J. Chem. Theory Comput. 2010;6:3003–3014. doi: 10.1021/ct100253m. [DOI] [PubMed] [Google Scholar]
- 67.Ghosh S., Jana J., Kar R.K., Chatterjee S., Dasgupta D. Plant Alkaloid Chelerythrine Induced Aggregation of Human Telomere Sequence-A Unique Mode of Association between a Small Molecule and a Quadruplex. Biochemistry. 2015;54:974–986. doi: 10.1021/bi501117x. [DOI] [PubMed] [Google Scholar]
- 68.Abrams C., Bussi G. Enhanced Sampling in Molecular Dynamics Using Metadynamics, Replica-Exchange, and Temperature-Acceleration. Entropy. 2013;16:163–199. [Google Scholar]
- 69.Iacovelli F., Falconi M. Decoding the Conformation-linked Functional Properties of Nucleic Acids by the Use of Computational Tools. Febs J. 2015;282:3298–3310. doi: 10.1111/febs.13315. [DOI] [PubMed] [Google Scholar]
- 70.Lindorff-Larsen K., Piana S., Dror R.O., Shaw D.E. How Fast-Folding Proteins Fold. Science. 2011;334:517–520. doi: 10.1126/science.1208351. [DOI] [PubMed] [Google Scholar]
- 71.Best R.B., Hummer G., Eaton W.A. Native Contacts Determine Protein Folding Mechanisms in Atomistic Simulations. Proc. Natl. Acad. Sci. U.S.A. 2013;110:17874–17879. doi: 10.1073/pnas.1311599110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Li Y., Liu C., Feng X., Xu Y., Liu B.-F. Ultrafast Microfluidic Mixer for Tracking the Early Folding Kinetics of Human Telomere G-Quadruplex. Anal. Chem. 2014;86:4333–4339. doi: 10.1021/ac500112d. [DOI] [PubMed] [Google Scholar]
- 73.Rajendran A., Endo M., Hidaka K., Sugiyama H. Direct and Single-Molecule Visualization of the Solution-State Structures of G-Hairpin and G-Triplex Intermediates. Angew. Chem. 2014;126:4191–4196. doi: 10.1002/anie.201308903. [DOI] [PubMed] [Google Scholar]
- 74.Rajendran A., Endo M., Hidaka K., Teulade-Fichou M.-P., Mergny J.-L., Sugiyama H. Small Molecule Binding to a G-hairpin and a G-triplex: A New Insight into Anticancer Drug Design Targeting G-rich Regions. Chem. Commun. 2015;51:9181–9184. doi: 10.1039/c5cc01678a. [DOI] [PubMed] [Google Scholar]
- 75.Sugita Y., Okamoto Y. Replica-Exchange Molecular Dynamics Method for Protein Folding. Chem. Phys. Lett. 1999;314:141–151. [Google Scholar]
- 76.Dai J., Chen D., Jones R.A., Hurley L.H., Yang D. NMR Solution Structure of the Major G-quadruplex Structure Formed in the Human BCL2 Promoter Region. Nucleic Acids Res. 2006;34:5133–5144. doi: 10.1093/nar/gkl610. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Case D.A.D., Cheatham T.T.E., III, Simmerling C.L., Wang J.D., R.E.R.E. Luo R., Walker R., Zhang W., Merz K.M.R., Hayik S.S., Roitberg A., et al. AMBER 12. San Francisco: University of California; 2012. [Google Scholar]
- 78.Islam B., Stadlbauer P., Krepl M., Koca J., Neidle S., Haider S., Sponer J. Extended Molecular Dynamics of a c-kit Promoter Quadruplex. Nucleic Acids Res. 2015 doi: 10.1093/nar/gkv785. doi:10.1093/nar/gkv785. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Jorgensen W.L., Chandrasekhar J., Madura J.D., Impey R.W., Klein M.L. Comparison of Simple Potential Functions for Simulating Liquid Water. J. Chem. Phys. 1983;79:926–935. [Google Scholar]
- 80.Berendsen H.J.C., Grigera J.R., Straatsma T.P. The Missing Term in Effective Pair Potentials. J. Phys. Chem. 1987;91:6269–6271. [Google Scholar]
- 81.Florova P., Sklenovsky P., Banas P., Otyepka M. Explicit Water Models Affect the Specific Solvation and Dynamics of Unfolded Peptides While the Conformational Behavior and Flexibility of Folded Peptides Remain Intact. J. Chem. Theory Comput. 2010;6:3569–3579. doi: 10.1021/ct1003687. [DOI] [PubMed] [Google Scholar]
- 82.Joung I.S., Cheatham T.E. Determination of Alkali and Halide Monovalent Ion Parameters for Use In Explicitly Solvated Biomolecular Simulations. J. Phys. Chem. B. 2008;112:9020–9041. doi: 10.1021/jp8001614. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Aqvist J. Ion Water Interaction Potentials Derived from Free-Energy Perturbation Simulations. J. Phys. Chem. 1990;94:8021–8024. [Google Scholar]
- 84.Smith D.E., Dang L.X. Computer Simulations of NaCl Association in Polarizable Water. J. Chem. Phys. 1994;100:3757–3766. [Google Scholar]
- 85.Dang L.X., Kollman P.A. Free Energy of Association of the K + 18-crown-6 complex in Water - A New Molecular Dynamics Study. J. Phys. Chem. 1995;99:55–58. [Google Scholar]
- 86.Noy A., Soteras I., Luque F.J., Orozco M. The Impact of Monovalent Ion Force Field Model in Nucleic Acids Simulations. Phys. Chem. Chem. Phys. 2009;11:10596–10607. doi: 10.1039/b912067j. [DOI] [PubMed] [Google Scholar]
- 87.Sponer J., Cang X.H., Cheatham T.E. Molecular Dynamics Simulations of G-DNA and Perspectives on the Simulation of Nucleic Acid Structures. Methods. 2012;57:25–39. doi: 10.1016/j.ymeth.2012.04.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Perez A., Marchan I., Svozil D., Sponer J., Cheatham T.E., Laughton C.A., Orozco M. Refinenement of the AMBER Force Field for Nucleic Acids: Improving the Description of Alpha/Gamma Conformers. Biophys. J. 2007;92:3817–3829. doi: 10.1529/biophysj.106.097782. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Krepl M., Zgarbova M., Stadlbauer P., Otyepka M., Banas P., Koca J., Cheatham T.E., Jurecka P., Sponer J. Reference Simulations of Noncanonical Nucleic Acids with Different Chi Variants of the AMBER Force Field: Quadruplex DNA, Quadruplex RNA, and Z-DNA. J. Chem. Theory Comput. 2012;8:2506–2520. doi: 10.1021/ct300275s. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Zgarbova M., Luque F.J., Sponer J., Cheatham T.E., Otyepka M., Jurecka P. Toward Improved Description of DNA Backbone: Revisiting Epsilon and Zeta Torsion Force Field Parameters. J. Chem. Theory Comput. 2013;9:2339–2354. doi: 10.1021/ct400154j. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Cornell W.D., Cieplak P., Bayly C.I., Gould I.R., Merz K.M., Ferguson D.M., Spellmeyer D.C., Fox T., Caldwell J.W., Kollman P.A. A Second Generation Force Field for the Simulation of Proteins, Nucleic Acids, and Organic Molecules. J. Am. Chem. Soc. 1995;117:5179–5197. [Google Scholar]
- 92.Cieplak P., Cornell W.D., Bayly C., Kollman P.A. Application of the Multimolecule and Multiconformational RESP Methodology to Biopolymers - Charge Derivation for DNA, RNA, and Proteins. J. Comput. Chem. 1995;16:1357–1377. [Google Scholar]
- 93.Wang J.M., Cieplak P., Kollman P.A. How Well Does a Restrained Electrostatic Potential (RESP) Model Perform in Calculating Conformational Energies of Organic and Biological Molecules. J. Comput. Chem. 2000;21:1049–1074. [Google Scholar]
- 94.Darden T., York D., Pedersen L. Particle Mesh Ewald - An N.log(N) Method for Ewald Sums in Large Systems. J. Chem. Phys. 1993;98:10089–10092. [Google Scholar]
- 95.Essmann U., Perera L., Berkowitz M.L., Darden T., Lee H., Pedersen L.G. A Smooth Particle Mesh Ewald Method. J. Chem. Phys. 1995;103:8577–8593. [Google Scholar]
- 96.Berendsen H.J.C., Postma J.P.M., Vangunsteren W.F., Dinola A., Haak J.R. Molecular-Dynamics with Coupling to an External Bath. J. Chem. Phys. 1984;81:3684–3690. [Google Scholar]
- 97.Ryckaert J.P., Ciccotti G., Berendsen H.J.C. Numerical Integration of Cartesian Equations of Motion of a System with Constraints - Molecular Dynamics of N-alkans. J. Comput. Phys. 1977;23:327–341. [Google Scholar]
- 98.Dang L.X. Mechanism and Thermodynamics of Ion Selectivity in Aqueous Solutions of 18-Crown-6 Ether: A Molecular Dynamics Study. J. Am. Chem. Soc. 1995;117:6954–6960. [Google Scholar]
- 99.Rosta E., Buchete N.-V., Hummer G. Thermostat Artifacts in Replica Exchange Molecular Dynamics Simulations. J. Chem. Theory Comput. 2009;5:1393–1399. doi: 10.1021/ct800557h. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 100.Fadrna E., Spackova N., Sarzynska J., Koca J., Orozco M., Cheatham T.E., Kulinski T., Sponer J. Single Stranded Loops of Quadruplex DNA as Key Benchmark for Testing Nucleic Acids Force Fields. J. Chem. Theory Comput. 2009;5:2514–2530. doi: 10.1021/ct900200k. [DOI] [PubMed] [Google Scholar]
- 101.Rachwal P.A., Brown T., Fox K.R. Sequence Effects of Single Base Loops in Intramolecular Quadruplex DNA. Febs Lett. 2007;581:1657–1660. doi: 10.1016/j.febslet.2007.03.040. [DOI] [PubMed] [Google Scholar]
- 102.Smargiasso N., Rosu F., Hsia W., Colson P., Baker E.S., Bowers M.T., De Pauw E., Gabelica V. G-quadruplex DNA Assemblies: Loop Length, Cation Identity, and Multimer Formation. J. Am. Chem. Soc. 2008;130:10208–10216. doi: 10.1021/ja801535e. [DOI] [PubMed] [Google Scholar]
- 103.Rachwal P.A., Brown T., Fox K.R. Effect of G-tract Length on the Topology and Stability of Intramolecular DNA Quadruplexes. Biochemistry. 2007;46:3036–3044. doi: 10.1021/bi062118j. [DOI] [PubMed] [Google Scholar]
- 104.Rachwal P.A., Findlow I.S., Werner J.M., Brown T., Fox K.R. Intramolecular DNA Quadruplexes with Different Arrangements of Short and Long Loops. Nucleic Acids Res. 2007;35:4214–4222. doi: 10.1093/nar/gkm316. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 105.Bugaut A., Balasubramanian S. A Sequence-Independent Study of the Influence of Short Loop Lengths on the Stability and Topology of Intramolecular DNA G-quadruplexes. Biochemistry. 2008;47:689–697. doi: 10.1021/bi701873c. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 106.Mladek A., Krepl M., Svozil D., Cech P., Otyepka M., Banas P., Zgarbova M., Jurecka P., Sponer J. Benchmark Quantum-chemical Calculations on a Complete Set of Rotameric Families of the DNA Sugar-phosphate Backbone and Their Comparison with Modern Density Functional Theory. Phys. Chem. Chem. Phys. 2013;15:7295–7310. doi: 10.1039/c3cp44383c. [DOI] [PubMed] [Google Scholar]
- 107.Zgarbova M., Otyepka M., Sponer J., Lankas F., Jurecka P. Base Pair Fraying in Molecular Dynamics Simulations of DNA and RNA. J. Chem. Theory Comput. 2014;10:3177–3189. doi: 10.1021/ct500120v. [DOI] [PubMed] [Google Scholar]
- 108.Best R.B., Hummer G. Reaction Coordinates and Rates from Transition Paths. Proc. Natl. Acad. Sci. U.S.A. 2005;102:6732–6737. doi: 10.1073/pnas.0408098102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 109.Portella G., Orozco M. Multiple Routes to Characterize the Folding of a Small DNA Hairpin. Angew. Chem., Int. Ed. 2010;49:7673–7676. doi: 10.1002/anie.201003816. [DOI] [PubMed] [Google Scholar]
- 110.Kannan S., Zacharias M. Folding of a DNA Hairpin Loop Structure in Explicit Solvent Using Replica-Exchange Molecular Dynamics Simulations. Biophys. J. 2007;93:3218–3228. doi: 10.1529/biophysj.107.108019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 111.Kuehrova P., Banas P., Best R.B., Sponer J., Otyepka M. Computer Folding of RNA Tetraloops? Are We There Yet. J. Chem. Theory Comput. 2013;9:2115–2125. doi: 10.1021/ct301086z. [DOI] [PubMed] [Google Scholar]
- 112.Chen A.A., Garcia A.E. High-Resolution Reversible Folding of Hyperstable RNA Tetraloops Using Molecular Dynamics Simulations. Proc. Natl. Acad. Sci. U.S.A. 2013;110:16820–16825. doi: 10.1073/pnas.1309392110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 113.Bergonzo C., Henriksen N.M., Roe D.R., Swails J.M., Roitberg A.E., Cheatham T.E. Multidimensional Replica Exchange Molecular Dynamics Yields a Converged Ensemble of an RNA Tetranucleotide. J. Chem. Theory Comput. 2013;10:492–499. doi: 10.1021/ct400862k. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 114.Bergonzo C., Henriksen N.M., Roe D.R., Cheatham T.E. Highly Sampled Tetranucleotide and Tetraloop Motifs Enable Evaluation of Common RNA Force Fields. RNA. 2015;21:1578–1590. doi: 10.1261/rna.051102.115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 115.Ceru S., Sket P., Prislan I., Lah J., Plavec J. A New Pathway of DNA G-Quadruplex Formation. Angew. Chem., Int. Ed. 2014;53:4881–4884. doi: 10.1002/anie.201400531. [DOI] [PubMed] [Google Scholar]
- 116.Li W., Hou X.-M., Wang P.-Y., Xi X.-G., Li M. Direct Measurement of Sequential Folding Pathway and Energy Landscape of Human Telomeric G-quadruplex Structures. J. Am. Chem. Soc. 2013;135:6423–6426. doi: 10.1021/ja4019176. [DOI] [PubMed] [Google Scholar]
- 117.Gray R.D., Buscaglia R., Chaires J.B. Populated Intermediates in the Thermal Unfolding of the Human Telomeric Quadruplex. J. Am. Chem. Soc. 2012;134:16834–16844. doi: 10.1021/ja307543z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 118.Marchand A., Ferreira R., Tateishi-Karimata H., Miyoshi D., Sugimoto N., Gabelica V. Sequence and Solvent Effects on Telomeric DNA Bimolecular G-Quadruplex Folding Kinetics. J. Phys. Chem. B. 2013;117:12391–12401. doi: 10.1021/jp406857s. [DOI] [PubMed] [Google Scholar]
- 119.Thirumalai D., Woodson S.A. Kinetics of Folding of Proteins and RNA. Acc. Chem. Res. 1996;29:433–439. [Google Scholar]
- 120.Mukundan V.T., Phan A.T. Bulges in G-Quadruplexes: Broadening the Definition of G-Quadruplex-Forming Sequences. J. Am. Chem. Soc. 2013;135:5017–5028. doi: 10.1021/ja310251r. [DOI] [PubMed] [Google Scholar]
- 121.Ansari A., Kuznetsov S.V., Shen Y. Configurational Diffusion Down a Folding Funnel Describes the Dynamics of DNA Hairpins. Proc. Natl. Acad. Sci. U.S.A. 2001;98:7771–7776. doi: 10.1073/pnas.131477798. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 122.Jung J., Van Orden A. A Three-State Mechanism for DNA Hairpin Folding Characterized by Multiparameter Fluorescence Fluctuation Spectroscopy. J. Am. Chem. Soc. 2006;128:1240–1249. doi: 10.1021/ja0560736. [DOI] [PubMed] [Google Scholar]
- 123.Boncina M., Lah J., Prislan I., Vesnaver G. Energetic Basis of Human Telomeric DNA Folding into G-quadruplex Structures. J. Am. Chem. Soc. 2012;134:9657–9663. doi: 10.1021/ja300605n. [DOI] [PubMed] [Google Scholar]
- 124.Cragnolini T., Derreumaux P., Pasquali S. Coarse-Grained Simulations of RNA and DNA Duplexes. J. Phys. Chem. B. 2013;117:8047–8060. doi: 10.1021/jp400786b. [DOI] [PubMed] [Google Scholar]
- 125.Chakraborty D., Collepardo-Guevara R., Wales D.J. Energy Landscapes, Folding Mechanisms, and Kinetics of RNA Tetraloop Hairpins. J. Am. Chem. Soc. 2014;136:18052–18061. doi: 10.1021/ja5100756. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.