Abstract
Due to the long-range nature of high-order interactions between distal components in a biomolecule, transition dynamics of tertiary structures is often too complex to profile using conventional methods. Inspired by the exploded view in mechanical drawing, here, we used laser tweezers to mechanically dissect high-order DNA structures into two constituting G-quadruplexes in the promoter of the human telomerase reverse transcriptase (hTERT) gene. Assisted with click-chemistry coupling, we sandwiched one G-quadruplex with two dsDNA handles while leaving the other unit free. Mechanical unfolding through these handles revealed transition dynamics of the targeted quadruplex in a native environment, which is named as native mechanical segmentation (NMS). Comparison between unfolding of an NMS construct and that of truncated G-quadruplex constructs revealed a quadruplex–quadruplex interaction with 2 kcal/mol stabilization energy. After mechanically targeting the two G-quadruplexes together, the same interaction was observed during the first unfolding step. The unfolding then proceeded through disrupting the weaker G-quadruplex at the 5′-end, followed by the stronger G-quadruplex at the 3′-end via various intermediates. Such a pecking order in unfolding well reflects the hierarchical nature of nucleic acid structures. With surgery-like precisions, we anticipate this NMS approach offers unprecedented perspective to decipher dynamic transitions in complex biomacromolecules.
INTRODUCTION
Unlike proteins or RNAs, which demonstrate abundant examples of tertiary interactions, higher order DNA structures are not well known until G-quadruplexes start to attract research attention after their in vivo formation and biological activities have been recognized (1,2). A G-quadruplex comprises G-quartets stacking into a columnar structure through cation coordination and π–π interactions. Each G-quartet consists of four guanines interconnected to each other by Hoogsteen bonds (3).
A G-quadruplex requires at least four G-tracts to fold. Computerized search has indicated plenty of G-quadruplex hosting regions in human genome (4,5). In many regions, multiple G-tracts exist, which allows the folding of more than one G-quadruplex (6–9). This brings an opportunity for neighboring G-quadruplexes to interact. Indeed, experiments have confirmed quadruplex–quadruplex interaction (QQI) in adjacent G-quadruplexes (6,9–12). Such a discovery is of high significance to develop drugs with desired G-quadruplex specificity.
Due to the biological activities demonstrated by G-quadruplex, it becomes a viable approach to regulate these biological functions via G-quadruplex interacting ligands. It has been successfully demonstrated that ligands can differentiate between G-quadruplex structures and duplex DNA (13,14), which reduces side effects originated from non-discriminative binding of ligands to the DNA duplex. However, ligand specificity among different G-quadruplexes is so far problematic to achieve. The interaction between G-quadruplex units is expected to generate new targets through which specific ligands can be designed.
Nevertheless, it is rather challenging to identify and interrogate tertiary interactions, such as QQI, in macromolecules. First, tertiary interactions are of long-range in nature. Unlike the localized primary or secondary structures, long-range interaction involves structural components located in distance. Pinpoint these interactions require comprehensive knowledge of both local and global architectures of a macromolecule. Second, due to the very nature of tertiary interactions, macromolecules hosting these interactions are often large. Conventional structural determination methods such as X-ray and nuclear magnetic resonance have compromised capacity to probe large molecules.
Inspired by exploded drawing of components in a mechanical device, here, we developed a new approach to localize and characterize tertiary interactions in biomacromolecules by breaking down the entire structure into individual components with an exploded perspective. As a proof of concept, we demonstrated this method in a DNA sequence located upstream (–22 to –90) of the transcription start site of the human telomerase reverse transcriptase (hTERT) gene, or hTERT1–12.
This region consists of 12 G-tracts capable of forming two G-quadruplex structures in the 1st–4th and 5th–12th G-tracts, respectively (Figure 1). G-quadruplexes in the hTERT promoter are proposed to control the expression of the telomerase (10,15), which has elevated concentration in majority of cancer cells. The potential interaction between the two G-quadruplexes may offer another level of modulation for the telomerase expression. Although such a tertiary interaction has been proposed (10), no direct evidence has been presented.
Since higher order QQI has been proposed to occur between two G-quadruplexes (6,10,11), our strategy dissects the entire DNA structure into individual G-quadruplexes followed by separate characterization of each quadruplex unit. To probe the high-order communication between quadruplexes, the dissection should avoid disrupting tertiary interactions. The truncation strategy in which a part of the structure is deleted is, therefore, not appropriate. Instead, we rely on click chemistry (16,17) to achieve this goal. By selectively attaching a terminal alkyne group to a specific residue between the two hTERT G-quadruplexes, we were able to introduce a dsDNA handle with a terminal azide group in this region through the Cu(I)-catalyzed azide-alkyne cycloaddition (CuAAC) reaction (16,17). Next, another DNA handle can be attached to either the 5′- or 3′-end of the hTERT sequence that contains the full 12 G-tracts. As a result, a desired structural segment that contains one G-quadruplex can be mechanically targeted by applying force via the two DNA handles in a laser-tweezers instrument. Since the rest of the structure does not experience any mechanical force, this strategy only investigates the desired structural segment while keeping the remainder of the structure mechanically secluded. We name this strategy as native mechanical segmentation (NMS) to reflect the surgery-like precision for the mechanical dissection of a native biomolecule.
With this strategy, we found small unfolding transitions consistent with the disruption of a higher order interaction between the two hTERT G-quadruplexes in the DNA construct in which either the 1–4 or the 5–12 G-tracts are mechanically targeted. The same interaction also existed in the construct that has all 12 G-tracts mechanically targeted. Scrutinizing on the two mechanically segmented DNA constructs suggests that the higher order interaction is likely due to the stacking between the two G-quadruplexes with a stabilizing free energy of 2.0 ± 0.4 kcal/mol. In addition, we revealed that unfolding pathways of the whole structure can be pieced together by those presented in each G-quadruplex, confirming the hierarchical organization of tertiary DNA structures. Although complex DNA structures have been investigated here through dichotomizing individual structural domains with an exploded mechano-analytical perspective (XMAP), the XMAP approach can be readily extended to interrogate other biomacromolecules such as RNAs and proteins.
MATERIALS AND METHODS
Materials
All DNA fragments unless specifically mentioned were purchased from Integrated DNA Technologies (Coralville, IA, USA). The DNA fragments were purified by 8% polyacrylamide gel electrophoresis. The alkyne modified DNA sequence used in the NMS1–4 and NMS5–12 constructs was purchased from Baseclick GmbH, Germany. Chemicals used in this work were purchased from VWR (West Chester, PA, USA) with purity >95% and used directly without further purification. All restriction enzymes were purchased from New England Biolabs Inc. (Ipswich, MA, USA). Streptavidin coated polystyrene particles (1.87 μm diameter) and digoxigenin-antibody coated polystyrene particles (2.1 μm diameter) were purchased from Spherotech Inc. (Lake Forest, IL, USA).
Circular dichroism (CD) measurements
Circular dichroism spectrum was measured using Jasco-810 spectropolarimeter (JASCO Inc., Easton, MD, USA). The oligonucleotide (ODN 186, sequence see Supplementary Table S1) sample solution (5 μM) was prepared in a 10 mM Tris buffer (pH 7.4) with 100 mM KCl. The spectrum was recorded at 25°C in the wavelength range of 200–360 nm using a quartz cuvette with 1 mm path length at a scanning speed of 50 nm/min. Three measurements were recorded for the DNA sample and averaged. Prior to the CD measurement, the sample was denatured at 95°C for 10 min and cooled to room temperature in 2 h. CD spectrum of ODN 186 showed peaks at 210 nm and 265 nm, a shoulder at 290 nm and a valley at 240 nm (Supplementary Figure S1). These features were identical with those from the wild type sequence (10) (see Figure 4A in reference (10)), confirming conformations of G-quadruplexes were not altered after the alkyne modification of the C21 residue.
Synthesis of DNA constructs for single-molecule analysis
All the DNA constructs synthesized for this project comprise of a sequence of interest sandwiched between two long dsDNA handles. One of the dsDNA handles (2028-bp in length) was modified with biotin at the 5′-end (introduced through a biotinylated polymerase chain reaction (PCR) primer using a plasmid pBR322 template (site 629th–2961st)) and the other handle was 2690-bp in length with multiple copies of digoxigenin introduced by terminal transferase at the 3′-end of the plasmid pEGFP fragment (site 620th–2663rd, purified from SacI and EagI restriction enzyme digestions) (18). These modifications allowed the attachment of the DNA construct to the two types of protein coated polystyrene particles (see Materials) through biotin-streptavidin linkage and digoxigenin-anti-digoxigenin linkage, respectively.
The hTERT1–12, truncated1–4 and truncated1–4 + 7-nt constructs were prepared following the procedure shown in Supplementary Figure S2. The sequence of interest (SOI) in each of these constructs was flanked by two 26-nt sequences (see Supplementary Table S1 for the sequence). Two different oligonucleotides, which were complementary to the flanking regions of the SOI, were annealed, followed by the ligation with the two long dsDNA handles. In the case of the NMS1–4 or the NMS5–12 construct, the end that was not labeled with biotin in the 2028-bp dsDNA handle was modified with an azide group during the PCR preparation using a 5′- azide modified primer (site 629th–2961th of the plasmid pBR322 was used as the PCR template).
The hTERT1–12 fragment with a terminal alkyne group labeled at the residue C21 (ODN 186, see Supplementary Table S1 for the sequence: the modified cytosine is marked with an asterisk) was used in the NMS1–4 or the NMS5–12 construct. The 3′-end of the hTERT 1–12 G-tracts region was connected to a 29-nt flanking sequence. For the synthesis of the NMS5–12 construct, the oligonucleotides complementary to the flanking sequence at the 3′-end were annealed at 96°C for 10 min, followed by slow cooling to 25°C for about 5 h. During annealing, an EagI restriction site was generated at the 3′-end of the fragment, which was used to ligate with the 2690-bp dsDNA handle prepared above (similar to step B in Supplementary Figure S3 and the gel image corresponding to this step was shown in Supplementary Figure S4B). Following the DNA ligation, the azide-modified 2028-bp handle with biotin labeled at the other end was linked to the alkyne group at the C21 position of the hTERT1–12 fragment through the CuAAC reaction (similar to step C in Supplementary Figure S3; see Supplementary Figure S4D for the gel image of the reaction) (16,17). For the click chemistry coupling, the respective DNA sequences were mixed in the click reaction medium (1:2 ratio of 0.1 M CuBr / 0.1 M Tris(benzyltriazolylmethyl)amine in 3:1 DMSO/t-BuOH) and incubated overnight in absence of light. As the reaction mixture contained copper ions, 25 mM EDTA was added to remove the free ions after the click reaction. The DNA sample was purified by ethanol precipitation prior to single-molecule mechanochemical analysis.
For the synthesis of the NMS1–4 construct, the same hTERT1–12 fragment whose C21 was labeled with the terminal alkyne (ODN 186, see Supplementary Table S1) was used to attach two dsDNA handles that flanked the hTERT 1–4 G-tracts. This task required a flanking sequence on the 5′-end of the hTERT1–12 fragment. To introduce this sequence (ODN 187.1, see Supplementary Table S1 for the sequence), we developed a new synthetic strategy, KELPS (Kinetic Enrichment of Ligation via Product Selection, see main text and Supplementary Figure S3). First, a splint ligation (19) by T4 DNA ligase was carried out in which a DNA splint (SSd003.2) was hybridized to the 5′-end of the ODN 186 and 3′-end of the ODN 187.1 (step A1 in Supplementary Figure S3). To remove the DNA splint (SSd003.2) that was partially hybridized with the G-rich region in the hTERT1–12 fragment, we used another oligonucleotide (SSd003.3). This sequence was designed to have a toehold section at the 5′-end that can recognize the 5′-end flanking sequence of the hTERT1–12 while generating an EagI overhang. However, it would not interfere with the G-rich region of the hTERT1–12 fragment. To fully replace the oligonucleotide SSd003.2, 50 times excess of the SSd003.3 was used at 96°C for 10 min, followed by immediate cooling to 4°C (step A2 in Supplementary Figure S3). After replacement reaction, the above mentioned 2690-bp DNA handle was ligated to the modified hTERT1–12 fragment (step A3) through the newly generated flanking sequence at the 5′-end (step B in Supplementary Figure S3; refer Supplementary Figure S4A for gel image). Finally, the NMS1–4 construct was obtained by coupling of the azide-modified 2028-bp DNA handle to the alkyne modified hTERT-2690-bp DNA complex through the CuAAC reaction discussed above (step C in Supplementary Figure S3; gel image corresponding to this click reaction was shown in Supplementary Figure S4B). After the click reaction, the reaction mixtures were treated with ethylenediaminetetraacetic acid (EDTA) in equimolar ratio to Cu (II) ions added during the click reaction to remove the copper. Later, this solution was centrifuged at 1300 rpm for 5 min to collect the supernatant. The supernatant containing the DNA was purified by ethanol precipitation. The pellet was then directly used for the mechanical unfolding experiments after reconstitution with desired buffer. Although there were reactants in the pellet (see Supplementary Figure S4C,D), they did not affect mechanical unfolding experiments since they could not be tethered between the two optically trapped beads, which requires the labeling of both biotin or digoxigenin at the two ends of a DNA construct.
Single-molecule force ramping experiments
The detailed description of the home-made dual-trap optical tweezers instrument for single-molecule force spectroscopy experiments has been reported elsewhere (20). Briefly, a laser beam from a single laser source (1064 nm, CW mode, 4 W, BL-106C, Spectra-physics) was split into two polarized beams using a polarized beam splitter. The foci of the two laser beams served as two optical traps. The movement of that optical trap was controlled by a steerable mirror (Nano-MTA, Mad City Labs, Madison, WI, USA) at a back focal plane of the focusing objective (Nikon CFI0 Plan-Apochromat 60x, NA 1.2, water immersion, working distance ≈320 μm). The positions of the two laser traps were detected using position-sensitive photodetectors (DL100, Pacific Silicon Sensor) and the force exerted on the traps were calibrated by thermal motion measurement. A three-channel microfluidic chamber was used for the single-molecule assays. The middle channel serving as the reaction center was connected to the top or bottom channel via a micrometer-sized glass pipette (King Precison Glass Inc., Claremont, CA, USA). The streptavidin and digoxigenin antibody coated polystyrene beads were introduced to the central channel via the pipettes from the top and bottom channels, respectively.
The synthesized DNA construct modified with either biotin or digoxigenin on each end was incubated with digoxigenin-antibody coated polystyrene particles (2.1 μm, Spherotech Inc.) for 1 h. During the incubation, the linkage between the digoxigenin in the DNA construct and the digoxigenin-antibody on the bead surface was established. A DNA tethered bead and a streptavidin coated polystyrene particle were trapped in the two laser foci, respectively. Upon moving one of the laser foci toward the other, the biotin-modified end of the DNA was linked to the streptavidin coated bead, accomplishing the tether of the DNA to the two trapped polystyrene beads. The force-ramping experiments were then carried out by moving the two traps toward, or away from, each other. The loading rate of the stretching and relaxing a DNA tether was kept at 5.5 pN/s. The single-molecule nature was determined by a characteristic saw-tooth plateau observed at 65 pN on stretching the DNA tethered between the two beads. All single-molecule analysis reported was performed in 10 mM Tris buffers (pH 7.4) supplemented with 100 mM K+ at room temperature (23°C).
Determination of unfolding pathways
Stretching of the hTERT DNA constructs generated more than one feature in most of the F-X traces. The folded structure corresponding to each unfolding event was assigned based on the change-in-contour-length (ΔL, see below and Supporting Information for calculation) and the order of occurrence during stretching. For accurate assignment of different species, only the molecules showing the ΔL for the fully folded structure were selected for analyses (68 nts or 25 nm for the hTERT1–12 construct and 48 nts or 18 nm for the NMS5–12 construct). About 11% and 31% of the total traces showed unfolding of the fully folded structure to fully unfolded sequence in one step for the hTERT1–12 and NMS5–12 constructs respectively. To assign the unfolding intermediates, first, each feature was categorized into different ΔL bins with a bin width of 4 nm. This value was determined based on 3 standard deviations of the ΔL measurements. The categorization was followed by ranking each feature in a trace according to the order of occurrence. Many traces contained features with almost identical ΔL between two transitions. By this ranking method, respective folded structures and their orders of occurrence can be deduced after converting the ΔL of a particular transition to the number of nucleotides using Equation S1 shown in Supporting Information. The population of each folded state was determined by the total number of transitions determined for that state (see Supplementary Table S2 for detailed statistics). The percentage of each transition in an unfolding pathway was determined by the total number of transitions originated from that particular state.
RESULTS AND DISCUSSION
The exploded mechano-analytical perspective (XMAP) for DNA tertiary structures
It has been proposed that the two G-quadruplexes formed in the hTERT 1–12 sequence may interact through the stacking of terminal G-quartets from individual G-quadruplex units (10). Such stacking brings two G-quadruplexes closer, it is therefore anticipated that the oligonucleotides in the inter-quadruplex region should remain in a compact form. Removal of the tertiary interaction between the two G-quadruplexes would stretch out this region, generating an observable transition in force-extension (F-X) curves. Our exploded mechano-analytical perspective (XMAP) on the tertiary interaction is designed to focus on this region (Figure 1). First, we plan to dissect this region into two segments by modifying an inter-quadruplex cytosine (position C21, see Table 1) with a terminal alkyne (17). An azide-labeled dsDNA handle can then be attached to the cytosine residue through Cu(I)-catalyzed click chemistry reaction (16). Depending on the specific G-quadruplex (Figure 1B) for mechanical targeting, we can attach another dsDNA handle either to the 5′- or 3′-end of the hTERT 1–12 template. A total of three hTERT DNA constructs will be generated (Figure 1). Sequence hTERT1–12 has the two dsDNA handles flanking the entire 1st–12th hTERT G-tracts; sequence NMS1–4 has the two handles flanking the 1st–4th G-tracts; while the NMS5–12 construct has the dsDNA handles sandwiching the 5th–12th G-tracts. Mechanical unfolding on each of the three sequences can then be carried out and discussed in the following sections.
Table 1. Sequence of interest from hTERT 1–12 region studied under the various constructs used in this study.
DNA construct | Sequences of interest from the hTERT 1–12 region |
---|---|
hTERT 1–12 | 5′-G1G2G3G4A5G6G7G8G9C10T11G12G13G14A15G16G17G18C19C20C21G22G23A24G25G26G27G28G29 C30T31G32G33G34C35C36G37G38G39G40A41C42C43C44G45G46G47A48G49G50G51G52T53C54G55G56G57 A58C59G60G61G62G63C64G65G66G67G68 |
NMS 5–12 | 5′-G1G2G3G4A5G6G7G8G9C10T11G12G13G14A15G16G17G18C19C20C*21G22G23A24G25G26G27G28 G29C30T31G32G33G34C35C36G37G38G39G40A41C42C43C44G45G46G47A48G49G50G51G52T53C54G55G56 G57A58C59G60G61G62G63C64G65G66G67G68 |
NMS 1–4 | 5′-G1G2G3G4A5G6G7G8G9C10T11G12G13G14A15G16G17G18C19C20C*21G22G23A24G25G26G27G28G29 C30T31G32G33G34C35C36G37G38G39G40A41C42C43C44G45G46G47A48G49G50G51G52T53C54G55G56 G57A58C59G60G61G62G63C64G65G66G67G68 |
Truncated 1–4 | 5′- G1G2G3G4A5G6G7G8G9C10T11G12G13G14A15G16G17G18C19C20C21 |
Truncated 1–4 + 7-nt | 5′-G1G2G3G4A5G6G7G8G9C10T11G12G13G14A15G16G17G18C19C20C21G22G23A24G25 |
Note: The sequence in bold is the region sandwiched between the two dsDNA handles. The complete DNA sequence along with the flanking regions used for the synthesis of the various constructs are given in Supplementary Table S1. The cytosine with an asterisk (C*21) is modified by an alkyne group for the attachment of an azide-modified dsDNA handle.
Native mechanical segmentation targeting the 5th–12th G-tracts (NMS5–12) in the hTERT promoter
Since a truncated hTERT sequence that contains 5–12 G-tracts has been characterized by mechanical unfolding approaches previously (18), we first applied native mechanical segmentation on the G-quadruplex contained in this section (the NMS5–12 construct, see Table 1) for a direct comparison between the two approaches. As discussed above, two dsDNA handles were introduced at the C21 position using the click chemistry coupling and at the 3′-end (position 68) by enzymatic ligation, respectively (see Materials and Methods for details). These two duplex handles were then attached to the two optically trapped polystyrene particles through digoxigenin-antibody and biotin-streptavidin affinity interactions (Figure 1). Force ramping experiments (18) were carried out by moving one of the optically trapped beads using a motorized mirror (20). During this process, the tension inside the mechanically targeted segment increased. When the tension was greater than the mechanical stability of a folded structure in the segment, the folded species unraveled. The sudden unraveling produced an unfolding transition with a rupture force (Frupture), which reflects the mechanical stability. The transition was also characterized by a change in contour length (ΔL), which corresponds to the size of the folded structure. By contrast, the region outside the segment enclosed by the dsDNA handles did not experience any tension, and therefore, structures in this mechanically isolated region remained intact.
Analyses on the unfolding patterns of this construct revealed multiple structures in the hTERT5–12 segment (Figure 2A). The change in contour length (ΔL) corresponding to each unfolding event was measured (see Supplementary Figure S6 for the overall ΔL histogram). To identify predominant species, we performed Population Deconvolution at Nanometer resolution (PoDNano, see Supporting Information) (21). First, we converted each force-extension curve to an ΔL-F plot, which allowed us to retrieve unfolding transitions with associated noise (standard error) in ΔL. Each ΔL was then expanded with a Gaussian kernel whose width was determined by the standard error. Combination of these Gaussian kernels then led us to identify predominant ΔL populations (Figure 2D) from 5000 resampling processes. Previously, such a procedure allowed deconvoluting ΔL populations as small as 0.47 nm in difference (21). Using Equation S1 described in Supporting Information, the number of nucleotides contained in predominate species can be calculated from ΔL measurements.
Since mechanical unfolding is a continuous process starting from the fully folded to fully unfolded states in a particular molecular segment, we were able to establish not only possible intermediates but also specific unfolding pathways by using this method (see Materials and Methods). As shown in Figure 2B–D, we identified five major folded populations containing 15, 26, 32, 42 and 47 nts during the unfolding of the NMS5–12. These species can convert into each other, forming a rather complex network of transition pathways. The 47-nt species matched the size of a G-quadruplex that employed 5th, 6th, 11th and 12th G-tracts with a 26-nt hairpin in the middle loop (Figure 1). The same structure has been observed previously (10,18). Evidence of the same G-quadruplex conformation also came from the CD spectrum of the alkyne modified hTERT 1–12 fragment, which displayed identical signatures with those in the wild type sequence (210 and 265 nm peaks, a 240 nm trough and a 290 nm shoulder, see Supplementary Figure S1) (10). Compared with the previous investigation on a truncated hTERT5–12 sequence (18), the intermediates with 15, 26 and 32 nucleotides were also observed here (see right insets of Figure 2D for possible structures). Close inspection on the unfolding pathways involving these three intermediates revealed the similarity between the native mechanical segmentation and the truncated mechanical unfolding. For example, in the truncated hTERT5–12 template, the 15-nt hairpin (similar to the I-3 in the right inset of Figure 2D) acts as a seed for the folding of the G-quadruplex in which the hairpin serves as one of the loops; the 26-nt (I-2) represents a longer hairpin in the same G-quadruplex loop; while the 32-nt (I-1) is likely an extended hairpin structure that contains three Hoogsteen guanine base pairs in the original G-quadruplex (G32G33G34/G61G62G63, see Table 1 for the sequence) (18). Based on these similarities, we assigned the 3 species (15, 26 and 32 nt, see I-1 to I-3 in Figures 2D and 3, and Supplementary Figure S6) in the NMS5–12 sequence as hairpin intermediates found in the truncated hTERT5–12 template. The 42-nt species is rather unique here. Since the entire structure should contain 47 nucleotides in total (48 nts if the last G (G68) in the 3′-end participates in the G-quadruplex, see Table 1), the observation of the 42-nt species suggests an intermediate with a similar size to the folded G-quadruplex in the NMS5–12. It is possible that the difference of the 5 nucleotides may stem from the inter-quadruplex region due to the higher-order interaction between the two G-quadruplexes (Figures 2A and 3, and Supplementary Figure S6) which was observed in the force range from 10–42 pN.
Native mechanical segmentation targeting the 1st–4th G-tracts (NMS1–4) in the hTERT promoter
To confirm that the 5-nt feature observed above is due to the disruption of the higher-order QQI under mechanical stress, we performed native mechanical segmentation on the G-quadruplex formed in the hTERT 1–4 G-tracts (the NMS1–4 construct). To mechanically target this 5′-end G-quadruplex, we designed a new strategy, Kinetic Enrichment of Ligation via Product Selection (KELPS, details see Materials and Methods and Supplementary Figure S3), to attach a dsDNA handle to the 5′-end of the same ODN 186 sequence used for the NMS5–12 construct preparation (Supplementary Table S1). In this method, we first ligated the alkyne modified ODN 186 with a single-stranded DNA (ODN 187.1, Supplementary Table S1) at the 5′-end using splint ligation (splint DNA: SSd003.2, see Supplementary Table S1) (9,19). Next, the splint DNA was replaced by another ssDNA (SSd003.3, see Supplementary Table S1) in which a toehold (22) section was introduced at the 5′-end. As a result of the hybridization with the second ssDNA, an adhesive overhang was introduced for the hTERT1–12 template, allowing ligation with the dsDNA handle by T4 DNA ligase. This synthesis exploited two kinetic features, a toehold strategy and a concentration based enhancement of the Watson–Crick hybridization reaction. The final covalent ligation with the dsDNA handle made this kinetic selection irreversible, achieving the enrichment of desired product. Since no purification is necessary, such a method is rather efficient in the modification of long DNA sequences, which are synthetically challenging. After attachment of the first dsDNA handle, click chemistry reaction (16) was then used to attach a second dsDNA handle to the C21 residue of the ODN186, accomplishing the NMS1–4 construct preparation.
The results of the force ramping experiments on the NMS1–4 construct were shown in Figure 4. Although most F-X curves depicted only one sudden rupture event, some traces exhibited two-step unfolding events (see red trace in Figure 4A). Statistical analyses on the ΔL histogram (Figure 4A-iii) showed three significant populations centered at 4.9 nm, 6.3 nm and 7.5 nm, respectively. Calculation revealed these populations contained 14, 18 and 21 nucleotides, respectively (see Equation S1 in Supporting Information). While the 18-nt species has the number of nucleotides consistent with those involved in the fully folded G-quadruplex in the 1–4 G-tracts, the 14-nt species is likely an intermediate that may assume a G-triplex conformation (23,24). In comparison, ΔL histogram of the structures formed in the truncated sequence, 5′-GGG GAG GGG CTG GGA GGG CCC (or truncated1–4), also showed these two populations (Figure 4B-iii), therefore validating the assignment of their respective structures.
High-order interaction between the two neighboring G-quadruplexes
The population with ΔL = 7.5 nm in the NMS1–4 construct is rather unique. This population did not appear in the truncated1–4 sequence, suggesting the contribution of the 3′-end G-quadruplex (hTERT 5–12 G-tracts) to the presence of this feature. The ΔL difference between this species and that of the 5′-end G-quadruplex (7.5 – 6.3 = 1.2 nm) is consistent with the length of the inter-quadruplex region (3 nucleotides, see Equation S1 for calculation) between the 5′-end G-quadruplex (hTERT 1–4 G-tracts) and the alkyne-modified C21 residue. Therefore, the ΔL = 7.5 nm population could stem from the simultaneous unfolding of the tertiary QQI and the 5′-end G-quadruplex in the NMS1–4 construct. Occasionally, the unfolding of these two structural elements was sequential (see red data in Figure 4A(ii) and (iii)), supporting this assignment. Further support came from the unfolding of a control construct, 5′-GGG GAG GGG CTG GGA GGG CCC GGA G, which contained the complete inter-quadruplex sequence at its 3′-end (underlined, see ‘Truncated1–4 + 7-nt’ in Table 1). Similar to the truncated1–4 construct, we again found the 7.5 nm feature was missing (compare Figure 4B-iii and C-iii). This indicated that the inter-quadruplex sequence itself did not fold into any structure without the presence of the G-quadruplex at the 3′-end of the hTERT1–12 fragment.
It is likely that the 7-nt bridging segment may assume a looped-out conformation when there is high-order interaction, such as stacking (25), between the two hTERT G-quadruplexes. Unfolding of either the 5′-end or the 3′-end G-quadruplex during the native mechanical segmentation involves a common residue, C21, in the inter-quadruplex region. Under mechanical tension, the looped-out bridge region will release seven nucleotides as tertiary QQI becomes disrupted. Given the stacking interaction between the terminal G-quartets of the two G-quadruplexes, the NMS1–4 and NMS5–12 constructs should produce rupture features of 3 nts and 5 nts (the C21 residue is counted in both cases) from the 7-nt inter-quadruplex region when the 5′-end (1–4 G-tracts) and the 3′-end (5–12 G-tracts) G-quadruplexes are unfolded, respectively (see Equation S1 in Supporting Information for calculation). Actual observation of these two expected features (the 1.2 nm species in Figure 4A [the first transition in the red trace] and the 1.7 nm species in Figure 2C,D [transition from a* to a]), therefore, provided a strong support for this type of stacking interaction between the two hTERT G-quadruplexes.
Consistent with the tertiary interactions between the two G-quadruplexes, we observed many cooperative unfolding events in which both the high-order interaction and the G-quadruplex in the mechanically targeted segment unfolded simultaneously. For example, in the 5′-end G-quadruplex (NMS1–4), the cooperative unfolding constituted 30% of all events (Figure 4A) whereas it was 31% in the 3′-end G-quadruplex (NMS5–12, Figure 3). Previously, these cooperative unfolding events have been ascribed to the high-order interactions in proteins (26).
The assignment of the tertiary interaction in the NMS1–4 construct allowed us to calculate the change in free energy of the higher order structure (ΔGtertiary) by converting the unfolding work of the tertiary interaction using the Jarzynski non-equilibrium equality equation (Equation S2 in Supporting Information) (17,27). Since change in free energy is a state function which does not depend on specific unfolding pathways, unfolding work of either the NMS1–4 or NMS5–12 construct can be used to retrieve ΔGtertiary (17). The NMS5–12, however, contained unfolding patterns too complex to deconvolute accurately. Therefore, we proceeded to obtain ΔGtertiary from the unfolding pattern of the NMS1–4 construct. We first calculated the change in free energy of unfolding G-quadruplex alone (ΔGquadruplex, the 6.3 nm population in Figure 4A) and that of unfolding G-quadruplex and tertiary structure together (ΔGquadruplex+tertiary, the 7.5 nm population in Figure 4A). Due to the state function nature, the ΔGtertiary was determined as the difference between these two values, which exhibited a value of 2.0 ± 0.4 kcal/mol (see Supplementary Figure S5 and Supporting Information). Compared with the 6 kcal/mol stabilization energy between the 26-nt DNA hairpin and the main G-quadruplex in the truncated hTERT5–12 sequence (mainly H-bonding and van der Waals force, see reference (18)), the tertiary interaction observed here between the two G-quadruplexes (mainly π–π interactions) is significantly weaker. This result is consistent with recent finding that loop–loop interaction mediated by H-bonding and van der Waals force is stronger than the π–π interactions in human telomeric G-quadruplex (28).
Unfolding pathways of the species in the full hTERT1–12 sequence
If the stacking interaction indeed exists between the two G-quadruplexes, unfolding of the entire hTERT 1–12 sequence should reveal this interaction as well. To test this prediction, a DNA construct was synthesized by sandwiching the full hTERT 1–12 G-tracts sequence (68 nts, see Table 1) between two dsDNA handles (see Materials and Methods and Supplementary Figure S2 for preparation). The DNA construct was then subject to the force ramping procedures, which produced multiple unfolding events in most F-X traces (Figure 5). By measuring ΔL, the intermediates that lead to specific unfolding events can be analyzed. However, there is uncertainty in the assignment of these intermediates as molecules can be broken prematurely during stretching or they may not be fully refolded during the incubation before subsequent force-ramping cycles. To account for these uncertainties, only the molecules that showed both fully folded and fully unfolded traces in any of the repetitive F-X curves were used for analysis (see Supplementary Figure S6 for overall ΔL histograms).
Except for a few cases (see green trace in Figure 5B and C), majority of the curves showed individual transitions with ΔL = 2.5 nm. This number is equivalent to 7 nucleotides (see Equation S1 for calculation), which is consistent with those contained in the inter-quadruplex region. Such a result is in agreement with the disruption of the stacking interaction between the two G-quadruplexes. It is notable that the high-order interaction almost always appeared as the first event during the stretching of the DNA construct in a wider force range from 10–50 pN. In the case of the green trace in Figure 5B and C, we observed an 8 nm ΔL during the first transition event. Such a transition suggests simultaneous disruption of the QQI and the unfolding of the G-quadruplex from the hTERT 1–4 G-tracts (the expected ΔL is 8.4 nm, see Equation S1 for calculation). This result implied that the G-quadruplex from the 5′-end G-quadruplex (hTERT 1–4 G-tracts) was weaker than that formed in the 5–12 G-tracts.
Based on the size (ΔL) of the intermediates and the order of occurrence of these rupture events (see Materials and Methods and Supporting Information), next, we compiled the unfolding pathways of completely folded structures in the hTERT1–12 construct (Figure 6). All the transitions observed can be ascribed to the unfolding through either the 5′-end (hTERT 1–4 G-tracts) or the 3′-end (hTERT 5–12 G-tracts) G-quadruplex. During the unfolding through the 3′-end G-quadruplex, we observed three types of intermediates, I-3′/I-3, I-2′/I-2 and I-1′/I-1 (Figure 6B and Supplementary Figure S8). The sizes of these species were similar to those found in the native mechanical segmentation of the NMS5–12 construct (Figures 2 and 3) as well as those in the truncated hTERT5–12 sequence (18), which validated our pathway assignments (Figure 6A). Detailed analysis revealed that 11% of all unfolding events were direct unfolding from fully folded structure (a*) to fully unfolded population (d). Such cooperative unfolding confirmed the presence of the higher order interaction between the two hTERT G-quadruplexes (26). For the rest of the molecules, majority (65%) preferred to unfold through the G-quadruplex formed in the 1–4 G-tracts, followed by the unfolding of the hTERT 5–12 G-quadruplex. In contrast, only 24% unfolded with a reverse order (unfolding of the 3′-end G-quadruplex followed by the 5′-end G-quadruplex). This is likely due to the stronger mechanical stability of the 3′-end G-quadruplex, a result consistent with the above observation in which it was easier to unfold the higher order interaction and the 5′-end G-quadruplex together, rather than that with the 3′-end G-quadruplex together. The stronger mechanical stability in the 3′-end G-quadruplex was likely contributed from the tertiary interaction between the 26-nt hairpin loop and the G-quadruplex in the hTERT 5–12 G-tracts, which provided an additional stability of 6 kcal/mole (18). The hierarchical unfolding of the two G-quadruplex structures in the complex DNA structure confirmed the finding (29) that due to limited long range interactions between different structures in nucleic acids, the folding/unfolding pathways in RNA/DNA structures are hierarchical, which is markedly different from the cooperative transition pathways exhibited in proteins.
It is noteworthy that the unfolding pattern in the hTERT1–12 sequence did not support the reported model (30) that three stacking G-quadruplexes are formed in this DNA fragment. Calculations according to this model revealed that unfolding transitions with ΔL of 6.2 nm (the 5′-end G-quadruplex), 8.6 nm (the middle G-quadruplex) and 7.0 nm (the 3′-end G-quadruplex) should be observed for the three stacking G-quadruplexes counting from the 5′ end, respectively (see Supplementary Table S3). Although the 6.2 and 7.0 nm species were present, the 8.6 nm population in the full hTERT1–12 sequence decreased (see arrows in Supplementary Figure S6), implying that formation of three G-quadruplexes is not likely in the hTERT1–12 promoter. In addition, if the stacking model is correct, it is expected to form 8.6 nm and 7.0 nm G-quadruplex species in the NMS5–12 construct. Consistent with the observation in the hTERT1–12, however, the 8.6 nm G-quadruplex was absent in the NMS5–12 construct (Supplementary Figure S6), which again indicates that stacking of the 3 consecutive G-quadruplexes is not likely.
CONCLUSION
In this work, we developed a new mechanical unfolding strategy to analyze higher order interactions in biomacromolecules. Using click chemistry, we were able to mechanically target a desired domain in a biomolecule while leaving the rest of the structure free of mechanical stress. Similar to surgical operations, such a strategy allowed revealing the property of different structural domains in a native environment with a highly precise manner. Our results clearly indicated the existence of tertiary interaction between the two G-quadruplexes formed in the hTERT promoter fragment. The tertiary interaction occurs through the π–π interactions between the two G-quartets at the inter-quadruplex region and provides 2 kcal/mol in stabilization energy. In addition, we found the unfolding pathways of a full structure can be reconstituted from those of individual G-quadruplex units, confirming the hierarchical nature of nucleic acid structures. Analogous to the exploded representation in the mechanical drawing, by employing a generic approach of click chemistry, we anticipate this strategy can provide an exploded mechano-analytical perspective (XMAP) not only to nucleic acid tertiary structures but also to long range interactions in proteins.
Supplementary Material
SUPPLEMENTARY DATA
Supplementary Data are available at NAR Online.
FUNDING
NSF [CHE-1026532, CHE-1415883]. Funding for open access charge: NSF [CHE-1415883].
Conflict of interest statement. None declared.
REFERENCES
- 1.Biffi G., Tannahill D., McCafferty J., Balasubramanian S. Quantitative visualization of DNA G-quadruplex structures in human cells. Nat. Chem. 2013;5:182–186. doi: 10.1038/nchem.1548. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Siddiqui-Jain A., Grand C.L., Bearss D.J., Hurley L.H. Direct evidence for a G-quadruplex in a promoter region and its targeting with a small molecule to repress c-MYC transcription. Proc. Natl. Acad. Sci. U.S.A. 2002;99:11593–11598. doi: 10.1073/pnas.182256799. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Williamson J.R. G-quartet structures in telomeric DNA. Annu. Rev. Biophys. Biomol. Struct. 1994;23:703–730. doi: 10.1146/annurev.bb.23.060194.003415. [DOI] [PubMed] [Google Scholar]
- 4.Huppert J.L., Balasubramanian S. Prevalence of quadruplexes in the human genome. Nucleic Acids Res. 2005;33:2908–2916. doi: 10.1093/nar/gki609. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Huppert J.L., Balasubramanian S. G-quadruplexes in promoters throughout the human genome. Nucleic Acids Res. 2007;35:406–413. doi: 10.1093/nar/gkl1057. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Schonhoft J.D., Bajracharya R., Dhakal S., Yu Z., Mao H., Basu S. Direct experimental evidence for quadruplex-quadruplex interaction within the human ILPR. Nucleic Acids Res. 2009;37:3310–3320. doi: 10.1093/nar/gkp181. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Amrane S., Adrian M., Heddi B., Serero A., Nicolas A., Mergny J.-L., Phan A.T. Formation of pearl-necklace monomorphic G-quadruplexes in the human CEB25 minisatellite. J. Am. Chem. Soc. 2012;134:5807–5816. doi: 10.1021/ja208993r. [DOI] [PubMed] [Google Scholar]
- 8.Yu H., Gu X., Nakano S.-i., Miyoshi D., Sugimoto N. Beads-on-a-string structure of long telomeric DNAs under molecular crowding conditions. J. Am. Chem. Soc. 2012;134:20060–20069. doi: 10.1021/ja305384c. [DOI] [PubMed] [Google Scholar]
- 9.Abraham Punnoose J., Cui Y., Koirala D., Yangyuoru P.M., Ghimire C., Shrestha P., Mao H. Interaction of G-quadruplexes in the full-length 3′human telomeric overhang. J. Am. Chem. Soc. 2014;136:18062–18069. doi: 10.1021/ja510079u. [DOI] [PubMed] [Google Scholar]
- 10.Palumbo S.L., Ebbinghaus S.W., Hurley L.H. Formation of a unique end-to-end stacked pair of G-quadruplexes in the hTERT core promoter with implications for inhibition of telomerase by G-quadruplex-interactive ligands. J. Am. Chem. Soc. 2009;131:10878–10891. doi: 10.1021/ja902281d. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Petraccone L., Spink C., Trent J.O., Garbett N.C., Mekmaysy C.S., Giancola C., Chaires J.B. Structure and stability of higher-order human telomeric quadruplexes. J. Am. Chem. Soc. 2011;133:20951–20961. doi: 10.1021/ja209192a. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Petraccone L. In: Quadruplex Nucleic Acids. Chaires JB, Graves D., editors. Vol. 330. Berlin Heidelberg: Springer; 2013. pp. 23–46. [Google Scholar]
- 13.Riou J.F., Guittat L., Mailliet P., Laoui A., Renou E., Petitgenet O., Mégnin-Chanet F., Hélène C., Mergny J.L. Cell senescence and telomere shortening induced by a new series of specific G-quadruplex DNA ligands. Proc. Natl. Acad. Sci. U.S.A. 2002;99:2672–2677. doi: 10.1073/pnas.052698099. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Reed J.E., Arnal A.A., Neidle S., Vilar R. Stabilization of G-quadruplex DNA and inhibition of telomerase activity by square-planar nickel(II) complexes. J. Am. Chem. Soc. 2006;128:5992–5993. doi: 10.1021/ja058509n. [DOI] [PubMed] [Google Scholar]
- 15.Wright W.E., Piatyszek M.A., Rainey W.E., Byrd W., Shay J.W. Telomerase activity in human germline and embryonic tissues and cells. Dev. Genet. 1996;18:173–179. doi: 10.1002/(SICI)1520-6408(1996)18:2<173::AID-DVG10>3.0.CO;2-3. [DOI] [PubMed] [Google Scholar]
- 16.Kolb H.C., Finn M.G., Sharpless K.B. Click chemistry: diverse chemical function from a few good reactions. Angew. Chem. Int. Ed. 2001;40:2004–2021. doi: 10.1002/1521-3773(20010601)40:11<2004::AID-ANIE2004>3.0.CO;2-5. [DOI] [PubMed] [Google Scholar]
- 17.Yu Z., Koirala D., Cui Y., Easterling L.F., Zhao Y., Mao H. Click chemistry assisted single-molecule fingerprinting reveals a 3D biomolecular folding funnel. J. Am. Chem. Soc. 2012;134:12338–12341. doi: 10.1021/ja303218s. [DOI] [PubMed] [Google Scholar]
- 18.Yu Z., Gaerig V., Cui Y., Kang H., Gokhale V., Zhao Y., Hurley L.H., Mao H. Tertiary DNA structure in the single-stranded hTERT promoter fragment unfolds and refolds by parallel pathways via cooperative or sequential events. J. Am. Chem. Soc. 2012;134:5157–5164. doi: 10.1021/ja210399h. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Moore M.J., Query C.C. Joining of RNAs by splinted ligation. Methods Enzymol. 2000;317:109–123. doi: 10.1016/s0076-6879(00)17009-0. [DOI] [PubMed] [Google Scholar]
- 20.Luchette P., Abiy N., Mao H. Microanalysis of clouding process at the single droplet level. Sens. Actuators B Chem. 2007;128:154–160. [Google Scholar]
- 21.Yu Z., Mao H. Non-B DNA structures show diverse conformations and complex transition kinetics comparable to RNA or proteins ― a perspective from mechanical unfolding and refolding experiments. Chem. Rec. 2013;13:102–116. doi: 10.1002/tcr.201200021. [DOI] [PubMed] [Google Scholar]
- 22.Zhang D.Y., Winfree E. Control of DNA strand displacement kinetics using toehold exchange. J. Am. Chem. Soc. 2009;131:17303–17314. doi: 10.1021/ja906987s. [DOI] [PubMed] [Google Scholar]
- 23.Koirala D., Mashimo T., Sannohe Y., Yu Z., Mao H., Sugiyama H. Intramolecular folding in three tandem guanine repeats of human telomeric DNA. Chem. Commun. 2012;48:2006–2008. doi: 10.1039/c2cc16752b. [DOI] [PubMed] [Google Scholar]
- 24.Limongelli V., De Tito S., Cerofolini L., Fragai M., Pagano B., Trotta R., Cosconati S., Marinelli L., Novellino E., Bertini I., et al. The G-Triplex DNA. Angew. Chem. Int. Ed. Engl. 2013;52:2269–2273. doi: 10.1002/anie.201206522. [DOI] [PubMed] [Google Scholar]
- 25.Adrian M., Ang D.J., Lech C.J., Heddi B., Nicolas A., Phan A.T. Structure and conformational dynamics of a stacked dimeric G-quadruplex formed by the human CEB1 minisatellite. J. Am. Chem. Soc. 2014;136:6297–6305. doi: 10.1021/ja4125274. [DOI] [PubMed] [Google Scholar]
- 26.Yu Z., Cui Y., Selvam S., Ghimire C., Mao H. Dissecting cooperative communications in protein with a high-throughput single-molecular scalpel. Chemphyschem. 2015;16:223–232. doi: 10.1002/cphc.201402443. [DOI] [PubMed] [Google Scholar]
- 27.Jarzynski C. Nonequilibrium equality for free energy differences. Phys. Rev. Lett. 1997;78:2690–2693. [Google Scholar]
- 28.Ghimire C., Park S., Iida K., Yangyuoru P., Otomo H., Yu Z., Nagasawa K., Sugiyama H., Mao H. Direct quantification of loop interaction and π–π stacking for G-quadruplex stability at the submolecular level. J. Am. Chem. Soc. 2014;136:15537–15544. doi: 10.1021/ja503585h. [DOI] [PubMed] [Google Scholar]
- 29.Li P.T., Vieregg J., Tinoco I. Jr. How RNA unfolds and refolds. Annu. Rev. Biochem. 2008;77:77–100. doi: 10.1146/annurev.biochem.77.061206.174353. [DOI] [PubMed] [Google Scholar]
- 30.Chaires J.B., Trent J.O., Gray R.D., Dean W.L., Buscaglia R., Thomas S.D., Miller D.M. An improved model for the hTERT promoter quadruplex. PLoS One. 2014;9:e115580. doi: 10.1371/journal.pone.0115580. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.