Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2012 Dec 28;7(12):e52994. doi: 10.1371/journal.pone.0052994

Differential Targeting of Unpaired Bases within Duplex DNA by the Natural Compound Clerocidin: A Valuable Tool to Dissect DNA Secondary Structure

Matteo Nadai 1, Giorgio Palù 1, Manlio Palumbo 2, Sara N Richter 1,*
Editor: Heidar-Ali Tajmir-Riahi3
PMCID: PMC3532440  PMID: 23285245

Abstract

Non-canonical DNA structures have been postulated to mediate protein-nucleic acid interactions and to function as intermediates in the generation of frame-shift mutations when errors in DNA replication occur, which result in a variety of diseases and cancers. Compounds capable of binding to non-canonical DNA conformations may thus have significant diagnostic and therapeutic potential. Clerocidin is a natural diterpenoid which has been shown to selectively react with single-stranded bases without targeting the double helix. Here we performed a comprehensive analysis on several non-canonical DNA secondary structures, namely mismatches, nicks, bulges, hairpins, with sequence variations in both the single-stranded region and the double-stranded flanking segment. By analysis of clerocidin reactivity, we were able to identify the exposed reactive residues which provided information on both the secondary structure and the accessibility of the non-paired sites. Mismatches longer than 1 base were necessary to be reached by clerocidin reactive groups, while 1-base nicks were promptly targeted by clerocidin; in hairpins, clerocidin reactivity increased with the length of the hairpin loop, while, interestingly, reactivity towards bulges reached a maximum in 3-base-long bulges and declined in longer bulges. Electrophoretic mobility shift analysis demonstrated that bulges longer than 3 bases (i.e. 5- and 7-bases) folded or stacked on the duplex region therefore being less accessible by the compound. Clerocidin thus represents a new valuable diagnostic tool to dissect DNA secondary structures.

Introduction

Nucleic acids are highly polymorphic: depending on the sequences and environmental conditions they may exist in a variety of secondary structures such as duplexes, triplexes, tetraplexes, bulges, hairpins, loops [1], [2]. Such non-canonical structures in nucleic acids are of general biological significance: they have been postulated to mediate protein-nucleic acid interactions, either by contacting protein residues directly or by producing a distinct tertiary structure to which the protein binds [3], and to function as intermediates in the generation of frame-shift mutations when errors in DNA replication occur [4], [5]. In particular, extra-helical bases are thought to be implicated in nucleic acid non-canonical functions [6]. An essentially unlimited combination of secondary structural elements has been extensively described in RNA, where the single-stranded (ss) nucleic acid folds back on itself; however, DNA can also produce complex secondary structures during replication and recombination [7]. A shift of the reading frame during template-dependent DNA synthesis can lead to the addition or deletion of one or more nucleotide residues (nts) in the newly synthesized DNA, ensuing in bulged or mismatched structures. Bulged bases derived from replicative errors are considered the first step of frame-shift mutagenesis [6], resulting in a variety of diseases and cancers (e.g., myotonic dystrophy, Huntington's disease, Friederich's ataxia, and fragile X syndrome).

In general, compounds capable of binding to non-canonical conformations of the DNA could have significant therapeutic potential. Several derivatives with unrelated structures have been reported to individually target sequence-specific bulges [8], [9], [10], mismatches [11], [12] and loops [13]. However, it is not clear whether a particular disease is linked to only one sequence-specific DNA conformation; therefore, compounds able to universally target DNA unconventional structures within a duplex section of DNA could be appealing from both a therapeutic and diagnostic point of view.

Clerocidin (CL) (Fig. 1A) is a natural product isolated from Oidiodendron truncatum, initially described as a gyrase inhibitor [14], [15], [16]. CL was subsequently shown to target DNA ss regions while being unreactive towards the double-helix: CL directly reacts with three-base DNA bulges, with different mechanisms depending on the exposed nucleotide. In particular, CL electrophilic groups (i.e. a strained epoxy ring and an α-ketoaldehyde function in equilibrium with its hemi-acetalic form) target i) the nucleophilic N7 of guanine (G) inducing spontaneous depurination and DNA strand cleavage [17], [18], ii) the NH2 and N3 of cytosine (C) with formation of a stable condensed ring system, which is degraded to induce DNA cleavage only after hot alkali treatment [19], and iii) the NH2 and N1 of adenine (A) to generate an adduct that degrades upon alkali but does not result in DNA strand scission [20]. Due to lack of strong nucleophilic sites, thymine (T) does not react with CL. The bulky diterpenoid portion of CL modulates the accessibility of the epoxide and α-ketoaldehyde reactive groups towards the DNA.

Figure 1. Reagents used in this study.

Figure 1

A) Chemical structure of CL. B) Schematic representation of the single-stranded (ss) regions of the oligonucleotides used, subdivided according to the secondary structure category. Double-stranded regions flanking the ss moiety are shown, because CL reactivity was assayed and compared towards oligonucleotides with both G/C and A/T-rich flanking regions. Arrows indicate the position of CL alkylation and cleavage. The size of the arrows corresponds to the degree of reactivity.

Here we sought to investigate the ability of CL to target bases embedded in different DNA conformational environments, such as mismatched bases and nicked DNA, loops and hairpins. Our results showed that CL was able to react with most ss structures within a duplex DNA; however, the number of ss bases was important to determine the accessibility of the compound to the reactive site. Therefore CL, besides being able to target a wide range of ss structures in a double helix setting, can also be used as a tool to evaluate site accessibility and folding of ss areas of the DNA within a double helix environment.

Materials and Methods

Clerocidin and Oligonucleotides

CL was a gift of Leo Pharmaceutical Products (Ballerup, Denmark). Molar extinction coefficients were experimentally determined to be 11818 M−1 cm−1 for CL. Working drug solutions were obtained by diluting fresh stocks in the appropriate buffer. T4 polynucleotide kinase was purchased from Fermentas (Burlington, Canada). [γ-32P]ATP was from Perkin Elmer (MA, USA), while all oligonucleotides were from Sigma Aldrich (St Louis, MO, USA). Oligonucleotides used in this study are reported in Table 1.

Table 1. Oligonucleotides used in this study.

Oligonucleotide
Type Name Sequence Abbreviated description
Mismatch G 1 CTATTGCTTTATTTGTAACCATTATAAGCT MM G/T
1 rev AGCTTATAATGGTTATAAATAAAGCAATAG
1 CTATTGCTTTATTTGTAACCATTATAAGCT MM G/A
1a rev AGCTTATAATGGTTAAAAATAAAGCAATAG
1 CTATTGCTTTATTTGTAACCATTATAAGCT MM TG/TC
2 rev AGCTTATAATGGTTATCAATAAAGCAATAG
1 CTATTGCTTTATTTGTAACCATTATAAGCT MM TGT/GTC
3 rev AGCTTATAATGGTTGTCAATAAAGCAATAG
Mismatch C 4 CTATTGCTTTATTTCTAACCATTATAGCT MM C/T
1 rev AGCTTATAATGGTTATAAATAAAGCAATAG
5 CTATTGCTTTATTTCTAACCATTATAAGCT MM C/A
1a rev AGCTTATAATGGTTAAAAATAAAGCAATAG
Bulge G A/T rich 1 CTATTGCTTTATTTGTAACCATTATAAGCT B1 G A/T rich
1b rev AGCTTATAATGGTTAAAATAAAGCAATAG
1 CTATTGCTTTATTTGTAACCATTATAAGCT B2 G A/T rich
1c rev AGCTTATAATGGTTAAATAAAGCAATAG
1 CTATTGCTTTATTTGTAACCATTATAAGCT B3 G A/T rich
1d rev AGCTTATAATGGTTAATAAAGCAATAG
6 CTATTGCTTTATTTTGTTAACCATTATAAGCT B5 G A/T rich
1d rev AGCTTATAATGGTTAATAAAGCAATAG
7 CTATTGCTTTATTTTTGTTTAACCATTATAAGCT B7 G A/T rich
1d rev AGCTTATAATGGTTAATAAAGCAATAG
Bulge G G/C rich 8 CTATTGCTTTACGCGCGCCCATTATAAGCT B1 G G/C rich
8 rev AGCTTATAATGGGCGGCGTAAAGCAATAG
9 CTATTGCTTTCGCTGCGCCCATTATAAGCT B2 G G/C rich
9 rev AGCTTATAATGGGCGGCGTAAAGCAATAG
10 CTATTGCTTTCGCTGTCGCCATTATAAGCT B3 G G/C rich
10 rev AGCTTATAATGGCGGCGAAAGCAATAG
11 CTATTGCTTTCGCTTGTTCGCCATTATAAGCT B5 G G/C rich
10 rev AGCTTATAATGGCGGCGAAAGCAATAG
12 CTATTGCTTTCGCTTTGTTTCGCCATTATAAGCT B7 G G/C rich
10 rev AGCTTATAATGGCGGCGAAAGCAATAG
Bulge C A/T rich 5 CTATTGCTTTATTTCTAACCATTATAAGCT B1 C A/T rich
1b rev AGCTTATAATGGTTAAAATAAAGCAATAG
1 CTATTGCTTTATTTCTAACCATTATAAGCT B2 C A/T rich
1c rev AGCTTATAATGGTTAAATAAAGCAATAG
5 CTATTGCTTTATTTCTAACCATTATAAGCT B3 C A/T rich
1d rev AGCTTATAATGGTTAATAAAGCAATAG
13 CTATTGCTTTATTTTCTTAACCATTATAAGCT B5 C A/T rich
1d rev AGCTTATAATGGTTAATAAAGCAATAG
14 CTATTGCTTTATTTTTCTTTAACCATTATAAGCT B7 C A/T rich
1d rev AGCTTATAATGGTTAATAAAGCAATAG
Bulge C G/C rich 17 CTATTGCTTTACGCCCGCCCATTATAAGCT B1 C G/C rich
8 rev AGCTTATAATGGGCGGCGTAAAGCAATAG
18 CTATTGCTTTCGCTCCGCCCATTATAAGCT B2 C G/C rich
9 rev AGCTTATAATGGGCGGCGAAAGCAATAG
19 CTATTGCTTTCGCTCTCGCCATTATAAGCT B3 C G/C rich
10 rev AGCTTATAATGGCGGCGAAAGCAATAG
20 CTATTGCTTTCGCTTCTTCGCCATTATAAGCT B5 C G/C rich
10 rev AGCTTATAATGGCGGCGAAAGCAATAG
21 CTATTGCTTTCGCTTTCTTTCGCCATTATAAGCT B7 C G/C rich
10 rev AGCTTATAATGGCGGCGAAAGCAATAG
Nick G A/T rich 1 CTATTGCTTTATTTGTAACCATTATAAGCT N1 G A/T rich
11 rev AGCTTATAATGGTTA
12 rev AAATAAAGCAATAG
1 CTATTGCTTTATTTGTAACCATTATAAGCT N2 G A/T rich
11 rev ATTGGTAATATTCGA
13 rev AATAAAGCAATAG
1 CTATTGCTTTATTTGTAACCATTATAAGCT N3 G A/T rich
14 rev AGCTTATAATGGTT
13 rev GATAACGAAATAA
Nick G G/C rich 8 CTATTGCTTTACGCGCGCCCATTATAAGCT N1 G G/C rich
15 rev AGCTTATAATGGGCG
16 rev GCGTAAAGCAATAG
22 CTATTGCTTTCCGTGCGCCCATTATAAGCT N2 G G/C rich
17 rev GCGGGTAATATTCGA
18 rev CGGAAAGCAATAG
23 CTATTGCTTTCCGTGTGCCCATTATAAGCT N3 G G/C rich
19 rev AGCTTATAATGGGC
18 rev GATAACGAAAGGC
Nick C A/T rich 24 CTATTGCTTTATTTCTAACCATTATAAGCT N1 C A/T rich
11 rev ATTGGTAATATTCGA
12 rev GATAACGAAATAAA
24 CTATTGCTTTATTTCTAACCATTATAAGCT N2 C A/T rich
11 rev ATTGGTAATATTCGA
13 rev GATAACGAAATAA
24 CTATTGCTTTATTTCTAACCATTATAAGCT N3 C A/T rich
14 rev TTGGTAATATTCGA
13 rev GATAACGAAATAA
Nick C G/C rich 17 CTATTGCTTTACGCCCGCCCATTATAAGCT N1 C G/C rich
15 rev -GCGGGTAATATTCGA
16 rev GATAACGAAATGCG-
25 CTATTGCTTTCCGTCCGCCCATTATAAGCT N2 C G/C rich
17 rev GCGGGTAATATTCGA
18 rev GATAACGAAAGGC
26 CTATTGCTTTCCGTCTGCCCATTATAAGCT N3 C G/C rich
19 rev CGGGTAATATTCGA
18 rev GATAACGAAAGGC
Hairpin T 27 GCTGCAGCTGGCTTTTTGCCAGCTGCAGC H5 T C/G
28 GCTGCAGCTGGTTTTTTGCCAGCTGCAGC H5 T T/G
29 GCTGCAGCTGGCTTTTTACCAGCTGCAGC H5 T C/A
30 GCTGCAGCTGGCTTTTTTTGCCAGCTGCAGC H7 T C/G
31 GCTGCAGCTGGCTTTTTTTTTGCCAGCTGCAGC H9 T C/G
Hairpin G 37 GCTGCAGCTGGCTGTGCCAGCTGCAGC H3 G
38 GCTGCAGCTGGCTTGTTGCCAGCTGCAGC H5 G
39 GCTGCAGCTGGCTTTGTTTGCCAGCTGCAGC H7 G
40 GCTGCAGCTGGCTTTTGTTTTGCCAGCTGCAGC H11 G
Hairpin C 41 GCTGCAGCTGGCTCTGCCAGCTGCAGC H3 C
42 GCTGCAGCTGGCTTCTTGCCAGCTGCAGC H5 C
43 GCTGCAGCTGGCTTTCTTTGCCAGCTGCAGC H7 C
44 GCTGCAGCTGGCTTTTCTTTTGCCAGCTGCAGC H11 C
Bulge EMSA 49 GAAGCTCGCGCGCCAGAAG B1 G short
49 rev CTTCTGGCGGCGAGCTTC
48 GAGCTCGCTGCGCCAGAAG B2 G short
48 rev CTTCTGGCGGCGAGCTC
47 GAGCTCGCTGTCGCCAGAG B3 G short
47 rev CTCTGGCGGCGAGCTC
46 AGCTCGCTTGTTCGCCAGA B5 G short
46 rev TCTGGCGGCGAGCT
45 GCTCGCTTTGTTTCGCCAG B7 G short
45 rev CTGGCGGCGAGC
50 GAAGCTCGCGCGCCAGAAG B ds short
50 rev CTTCTGGCGCGCGAGCTTC
Hairpin EMSA 54 CGGCATGCGTGGCTGTGCCACGCATGCCG H3 G short
53 CGGCAGCGTGGCTTGTTGCCACGCTGCCG H5 G short
52 CGCAGCGTGGCTTTGTTTGCCACGCTGCG H7 G short
51 CGCAGCTGGCTTTTGTTTTGCCAGCTGCG H9 G short
55 CGGCATGCGTGGCT H ds short
55 rev AGCCACGCATGCCG

Alkylation by Clerocidin of Single Stranded Bases in a Duplex Environment

All oligonucleotides were gel-purified before use and prepared in desalted/lyophilised form. Forward oligonucleotides were 5′-end-labelled with [γ- 32P]ATP by T4 polynucleotide kinase and were subsequently purified by MicroSpin G-25 columns (Amersham Biosciences, Europe). Ss-duplex substrates were obtained by annealing the labelled forward oligonucleotide with equimolar amounts of the reverse, partially complement cold oligonucleotide (Table 1). Drug reactions with the labelled ss-duplex (5 pmol/sample) were performed at 37°C in 50 mM phosphate buffer, pH 7.4. These conditions were selected to maintain the stability of the target oligonucleotide structures and to minimize the possible competition of buffer molecules for drug alkylation [21]. At the indicated time intervals, samples were precipitated with ethanol to eliminate non-reacted drug, then resuspended and either kept on ice, or treated at 90°C for 30 min with 1 M piperidine to complete strand scission according to the Maxam and Gilbert protocol [22]. Samples were then lyophilised, resuspended in formamide gel loading buffer, and heated at 95°C for 3 min. Reaction products were analyzed on 20% denaturing polyacrylamide gels. Gels were visualized by phosphorimaging analysis (Typhoon FLA 9000, GE Healthcare, Europe).

Electrophoretic Mobility Shift Assay

Labelled ss-duplex substrates, obtained by annealing the labelled forward oligonucleotide with equimolar amounts of the reverse, partially complement cold oligonucleotide (Table 1), were loaded on 12% non denaturing polyacrylamide gels, run for ∼ 7 hours at 75 volts. Gels were visualized by phosphorimaging.

Results

Oligo Design and Result Reading

CL (Fig. 1A) was tested against a series of oligonucleotides containing ss regions embedded in a ds environment. Only one single reactive base (either G or C) was positioned in each case in the ss portion. The unreactive T was used when additional ss bases were necessary to obtain the final conformation. The ss motif was flanked alternatively by A/T rich or G/C rich sequences to analyse their effect on the accessibility of the reactive ss site (Table 1).

Reactions were analysed both before and after hot piperidine treatment. Before alkali, alkylation at G bases was normally detected by two bands: one migrating slower than the full-length oligo, indicating alkylation with no DNA cleavage, and one migrating slower than the corresponding G base obtained with the Maxam & Gilbert sequencing reaction, indicating DNA cleavage at G with maintenance of the alkylation adduct. At C, just the band running slower than the full-length oligo was obtained. After piperidine, alkylation at both G and C was manifested as a cleavage band which migrated as the corresponding base obtained in the marker lane, indicating cleavage at G and C and loss of the alkylating CL molecule.

For quantification purposes, the cleavage band obtained after piperidine treatment, which totals up the effect of alkylation and cleavage by CL, was analysed.

Mismatches

Mismatches form when one or more bases in the forward and reverse strand do not complement. They derive from mis-incorporation of bases that may occur during DNA replication or recombination, or during repairing of DNA damage.

CL was made react with 4 oligonucleotides containing one G or one C base mismatched with T or A, and two oligonucleotides containing TG or TGT bases mismatched with CT or CTG, respectively (Table 1 and Figure 1B). Reaction with the mismatched TG and TGT induced cleavage at the ss G base (before piperidine: symbols ¤ in lanes 5 and 7; after piperidine: asterisks in lanes 6 and 8, Figure 2). These effects increased at higher CL concentration (compare lanes 5 and 7, 6 and 8, Figure 2). Contrary, when the oligonucleotides contained a single mismatched base, no reaction could be observed both before and after piperidine treatment (Figure 2 and Figure 1B for summary).

Figure 2. CL footprinting of mismatched oligonucleotides.

Figure 2

Oligonucleotides 5, and 1 were heat denaturated and folded in the presence of the appropriate complementary sequences (1a rev, 2 rev, 3 rev, respectively, Table 1) to obtain MM C/A, MM TG/TC and MM TGT/GTC oligonucleotides. The folded oligonucleotides were incubated with increasing concentrations (50–100 µM) of CL for 24 h at 37°C. After reaction, samples were precipitated and either kept on ice or treated with hot piperidine and lyophilized (samples indicated by the symbol P) and loaded on a 20% denaturing polyacrylamide gel. The symbol § indicates CL/full-length DNA adducts which migrate slower than the full-length DNA. The symbol ¤ indicates bands that correspond to the oligonucleotide alkylated and cleaved by CL. CL is still bound to the cleaved oligonucleotide, thus the cleavage band runs slower than the corresponding band in the Maxam and Gilbert marker lane (M lanes). The symbol * indicates bands that correspond to the oligonucleotide alkylated and cleaved by CL, with loss of CL. Position of alkylation is evinced by comparison of cleavage bands after piperidine treatment and the Maxam and Gilbert marker lane. Oligonucleotide sequences are indicated on the left of the corresponding marker lane (M lanes). Base numbering has been assigned in the 5 prime→3 prime direction.

Nicks

A nick is a discontinuity in a double stranded DNA molecule where there is no phosphodiester bond between adjacent nucleotides of one strand, typically achieved through damage or enzyme action. Nicks usually release torsion in the strand.

Oligonucleotides containing 1, 2 or 3 nicked, non-constrained bases were formed by annealing the forward strand with two partially complementary reverse strands. Each nick contained either one G or C flanked by ss T bases, flanked by A/T- or G/C-rich ds regions (Table 1 and Fig. 1B). Cleavage was modest and comparable between 1-, 2- or 3-base nicks (asterisks, lanes 6, Fig. 3A and B and data not shown). No difference in cleavage intensity was observed between G and C nicked bases and between A/T- and G/C-rich flanking sequences. However, ds Gs close to the nicked portion were cleaved at a higher extent (÷ symbol for slower running bands before piperidine treatment and # symbol for cleavage bands after piperidine, lanes 5 and 6, Fig. 3A and Fig. 1B for summary).

Figure 3. CL footprinting of nicked oligonucleotides.

Figure 3

A) Oligonucleotides 17, 25 and 26 were heat denaturated and folded in the presence of the appropriate complementary sequences (15 rev-16 rev, 17 rev-18 rev, 19 rev-18 rev, respectively, Table 1). B) Oligonucleotide 24 was heat denaturated and folded in the presence of the appropriate complementary sequences (12 rev-12 rev, 11 rev-13 rev, 14 rev-13 rev, Table 1) to obtain the nicked C A/T rich oligonucleotides shown above the gel. The folded oligonucleotides were incubated with CL (100 µM) for 24 h at 37°C. After reaction, samples were precipitated and either kept on ice or treated with hot piperidine and lyophilized (samples indicated by the symbol P) and loaded on a 20% denaturing polyacrylamide gel. The symbol § indicates CL/full-length DNA adducts which migrate slower than the full-length DNA. The symbol ¤ indicates bands that correspond to the oligonucleotide alkylated and cleaved by CL. The symbol * indicates bands that correspond to the oligonucleotide alkylated and cleaved by CL, with loss of CL. The symbols ÷ and # indicate bases in the ds region flanking the nicked moiety that are alkylated and cleaved by CL (÷), with loss of CL (#). Position of alkylation is evinced by comparison of cleavage bands after piperidine treatment and the Maxam and Gilbert marker lane. Oligonucleotide sequences are indicated aside of the corresponding marker lane (M lanes). Base numbering has been assigned in the 5 prime→3 prime direction.

Bulges

Bulges are formed when bases in one strand have no pairing partner in the opposite strand. They may be created in DNA during recombination between imperfectly homologous sequences and they may exert a role in protein recognition.

Bulges were formed in oligonucleotides containing 1, 2, 3, 5 or 7 non-complemented bases. Each bulge contained either one G or C flanked by ss T bases, adjacent to A/T- or G/C-rich ds regions (Table 1 and Fig. 1B).

After reaction with CL, alkylation could be observed before piperidine (slower migrating bands compared to the full-length oligonucleotides and ss G or C marker) and after hot alkali (cleavage bands corresponding to ss G or C) (asterisks, Fig. 4). In the case of bulged Gs flanked by A/T rich regions (Fig. 4A), the amount of cleaved ss G was very poor with 1- and 7-base bulges, while was 3-fold higher with 2-, 3-, 5-base bulges. With bulged Gs flanked by G/C rich ds segments (Fig. 4B), again reaction was extremely poor at 1- and 7-base bulges, incremented by 2-folds with 2- and 5-base bulges, and was maximum with 3-base bulges. With bulged Cs flanked by A/T or G/C rich regions (Fig. 4C and data not shown), the higher cleavage was observed with 3-base bulges, followed by 2-, 5-base bulges; reaction at 7-base bulge was very modest, while no reaction was observed at 1-base bulge (Fig. 1B for summary).

Figure 4. CL footprinting of bulged oligonucleotides.

Figure 4

A) Oligonucleotides 1, 6 and 7 were heat denaturated and folded in the presence of the appropriate complementary sequences (1b rev, 1c rev, 1d rev, Table 1) to obtain the bulged G A/T rich oligonucleotides shown above the gel. B) Oligonucleotides 8, 9, 10, 11 and 12 were heat denaturated and folded in the presence of the appropriate complementary sequences (8 rev, 9 rev, 10 rev, Table 1) to the bulged G G/C rich oligonucleotides shown above the gel. C) Oligonucleotides 5, 1, 13, and 14, were heat denaturated and folded in the presence of the appropriate complementary sequences (1b rev, 1c rev, 1d rev, Table 1) to obtain the bulged C A/T rich oligonucleotides shown above the gel. The folded oligonucleotides were incubated with increasing concentrations (50–100 µM) of CL for 24 h at 37°C. After reaction, samples were precipitated and either kept on ice or treated with hot piperidine and lyophilized (samples indicated by the symbol P) and loaded on a 20% denaturing polyacrylamide gel. The symbol § indicates CL/full-length DNA adducts which migrate slower than the full-length DNA. The symbol ¤ indicates bands that correspond to the oligonucleotide alkylated and cleaved by CL. CL is still bound to the cleaved oligonucleotide, thus the cleavage band runs slower than the corresponding band in the Maxam and Gilbert marker lane (M lanes). The symbol * indicates bands that correspond to the oligonucleotide alkylated and cleaved by CL, with loss of CL. Position of alkylation is evinced by comparison of cleavage bands after piperidine treatment and the Maxam and Gilbert marker lane. Oligonucleotide sequences are indicated on the left of the corresponding marker lane (M lanes). Base numbering has been assigned in the 5 prime→3 prime direction.

Hairpins

Hairpins occur when two regions of the same strand, usually complementary in nucleotide sequence when read in opposite directions, base-pair to form a double helix that ends in an unpaired loop. Hairpins were designed with 3, 5, 7, 9 ss bases. Each loop contained either one G or C flanked by ss T bases, adjacent to G/C rich complementary strands (Table 1 and Fig. 1B). Alkylation at the exposed G or C base was observed in both cases, both prior to and after treatment with hot piperidine, only in loops larger than 3 bases, i.e. with 5, 7 and 9 bases, and CL effects were more evident in the 9 base-loop (asterisks, Fig. 5A and B). Interestingly, however, two adjacent Gs in the ds region were moderately cleaved in the 5-, 7- and 9-base hairpins (¤ symbols, lanes 6, Fig. 5A and B), while their supposedly complementing C bases were not affected by CL alkylation.

Figure 5. CL footprinting of hairpin oligonucleotides.

Figure 5

A) Oligonucleotides 37, 38, 39 and 40, B) oligonucleotides 41, 42, 43 and 44 and C) oligonucleotides 27, 30, 31, 29 and 28 (Table 1) were heat denaturated and folded to obtain the hairpin G, C or T oligonucleotides shown above the gels. The folded oligonucleotides were incubated with CL (100 µM) for 24 h at 37°C. After reaction, samples were precipitated and either kept on ice or treated with hot piperidine and lyophilized (samples indicated by the symbol P) and loaded on a 20% denaturing polyacrylamide gel. The symbol * indicates bands that correspond to the oligonucleotide alkylated and cleaved by CL, without or with loss of CL, at the G or C base exposed in the hairpin region. The symbol ¤ indicates bands that correspond to the oligonucleotide alkylated and cleaved by CL, without or with loss of CL, at bases in the ds stem region of the oligonucleotide. Position of alkylation is evinced by comparison of cleavage bands after piperidine treatment and the Maxam and Gilbert marker lane. Oligonucleotide sequences are indicated on the left of the corresponding marker lane (M lanes). Base numbering has been assigned in the 5 prime→3 prime direction.

Oligonucleotides with loops formed by all Ts were next assayed (Fig. 5C). As expected, no cleavage in the T segment was observed. However, cleavage at the two adjacent Gs in the supposedly ds region was still observed in the 7- and 9-base hairpins (asterisks, lanes 5 and 6, Fig. 5C). The effect of the type of ss bases close to the ds region was thus analysed by assaying oligos with 7-base hairpins formed by CTTTTTA or TTTTTTG ss bases (Fig. 5C). We observed a very strong cleavage at the two adjacent ds Gs in the case of the CTTTTTA hairpin. A less intense effect was detected with the TTTTTTG segment. In this case, however, cleavage was also obtained at the ss G sequence (¤ symbols, lanes 5 and 6, Fig. 5C).

Long Bulged Regions are Less Sterically Hindered

By testing secondary structures with ss segments of different length we were able to evince that CL reacts at ss region longer than 1 base and in general its reactivity increments with the length of the ss portion. However, one main exception occurred with bulged structures, where the optimal ss length for CL reactivity was 3-bp, whereas lower reactivity was observed either decrementing (1 or 2 bp) or incrementing (5 or 7 bp) the bulge length. To understand why oligonucleotides with more than 3 bulged bases were less susceptible to CL cleavage, bulged sequences of the same overall length but with a different number of bulged bases (Table 1) were tested by electrophoretic mobility shift assay (EMSA). As shown in Fig. 6, electrophoretic mobility of 1-, 2-, 3-bulged oligonucleotides was similar to that of the control ds oligo (compare lanes 1–3 with ds, left side), slightly increasing with the bulge length. Strikingly, the mobility of the 5- and 7-bulged oligonucleotides dramatically incremented and it remained only mildly lower than that of the control ss oligo (compare lanes 5 and 7 with ss, left side). In contrast, the number of non-paired bases in hairpin oligonucleotides only slightly influenced the oligonucleotide electrophoretic mobility, which barely decreased with increasing hairpin length (Fig. 6, right side).

Figure 6. EMSA analysis of bulge and hairpin oligonucleotides.

Figure 6

Oligonucleotides 50, 49, 48, 47, 46 and 45 were annealed to the appropriate complementary oligonucleotides (50 rev, 49 rev, 48 rev, 47 rev, 46 rev and 45 rev, Table 1) to form ds, 1-, 2-, 3-, 5-, and 7-base bulged sequences, respectively. Oligonucleotides 54, 53, 52 and 51 were folded to form 3-, 5-, 7-, 9-base hairpin sequences. Ds and ss oligonucleotides with the same length as bulge or hairpin sequences were employed as controls of migration and they are indicated by the arrows aside the gel images.

Discussion and Conclusions

Non-canonical nucleic acid structures have been postulated to mediate protein-nucleic acid interactions and frameshift mutations, some of which may result in a variety of diseases and cancers. Therefore, both recognition and elucidation of the conformation of unusual secondary structures would be of the utmost importance to predict unexpected biological effects generating at the genomic level.

The natural compound CL had been previously shown to be able to detect and induce cleavage in partially ss regions within supercoiled plasmids, whereas it resulted completely inert versus ds nucleic acids [17], [18]. Here we showed that CL could detect the presence of non-paired sequences, when at least one ss non-T base was available, in a number of non-canonical DNA conformations. In addition, the degree of CL reactivity towards DNA bases indicated the accessibility of ss-sites, therefore providing a practical and simple tool to dissect unconventional DNA structures.

In the case of bulges, site accessibility of ss nucleotides varied depending on the length of the bulge itself. By using CL, we were able to demonstrate that the highest base accessibility was obtained when 3-bulged bases were present, indicating that TGT or TCT bulges, with both A/T- or G/C-rich flanking ds sequences, protruded from the double-helix. Accessibility was reduced both in shorter (2 and 1 bases) and, unexpectedly, in longer (5 and 7 bases) bulges. Longer bulges were shown to be less sterically hindered by EMSA; therefore they likely fold back on themselves or stack onto the double-helix, hindering access to reactive groups. These data complement previous analysis on bulge conformations. As a general rule, it was reported that in the case of one-base-bulges, the bulged purines stacked into the duplex [23], [24], [25], [26], whereas bulged pyrimidines were either stacked in or looped out into solution, depending upon the temperature and the flanking sequence [27], [28], [29], [30]. In the case of three- and five-nucleotide DNA bulges, three and five unpaired A bases were found to be mostly stacked into the helix continuously with the flanking DNA and to induce a local kink in the DNA molecule [31], [32]. Unfortunately, only data on bulged A bases were reported for three- and five-base bulges and no detailed structural information is available for different sequences or longer DNA bulges. The data presented here add information on this subject, showing that local folding in the considered sequences occurs only starting from 5-base bulges.

In the case of mismatches, the absence of reactivity towards CL demonstrated that one mismatched base was mostly buried within the double-helix; on the opposite, two mismatched bases was the minimal necessary condition to allow for extra-helical positioning of the non-paired nucleotides. In contrast, when DNA strands were interrupted (nicks), even one non-paired base on the intact strand was effectively exposed and thus available to react with CL. The degree of CL reactivity towards 1 to 3 non-paired nucleosides in nicks did not change, indicating similar exposure of the ss bases. Interestingly, however, some complemented bases close to site of DNA interruption became available for reaction probably due to breathing of the end region of the double-helix.

Opposite to bulges, reactive bases in the loop region of hairpins became more accessible when increasing the length of the loops itself. However, reaction with CL proved that ds bases adjacent to the looped regions did not perfectly pair, thus being accessible for reaction. Availability depended on the nature of the bases within the loop; therefore it is conceivable that non-Watson&Crick base pairing takes place within loop bases. In addition, we demonstrated that hairpins as long as 9 bases did not decrease reactivity towards CL, in contrast to what observed with bulges. Likely, the different relative position of the ss moiety on the ds segment facilitates stacking in the bulges, while hampers folding in the hairpins.

Importantly, in our case we did not find remarkable differences in the reactivity towards ss nucleotides when flanking sequences were either A/T- or G/C-rich, indicating that possible interaction of the extruded ss bases with the adjacent double-helix does not depend on the nature of the bases. Finally, by including either G or C in the ss regions we confirmed the possibility of CL to discriminate between bases, when reactions are visualized both before and after hot piperidine treatment. Differential reactivity towards A can also be achieved, as previously demonstrated [20].

Few high-resolution data are available for non-canonical DNA structures and, in general, data collected so far demonstrated that each sequence determines its own peculiar secondary structure. For this reason is has been difficult to develop compounds that broadly target unusual DNA structures without affecting ds regions. Nakatani’s and Teulade-Fichou’s groups have recently reported compounds able to recognize sequence-specific mismatched DNA and hairpins [13], [33], [34], [35], [36], [37]. No structure-activity relationship can be drawn from these very diverse chemicals, which do not share structural similarities to CL. However, the activities of these compounds and CL are also divergent: the formers target sequence-specific DNA conformations, the latter recognizes all DNA conformations that allow for the presence of single-stranded regions. While the sequence-specific compounds might be useful to treat genetic defects caused by a specific non-canonical DNA conformation, CL can help avoiding the use expensive and cumbersome molecular techniques to detect unusual DNA conformations, which are not readily predictable from sequence data.

CL is a small natural molecule that combines electrophilicity and bulkiness. This modulates the extent of alkylation (and cleavage) of non-canonical DNA conformations. In fact, it allows i) detecting ss regions in a double stranded environment, ii) discriminating between DNA bases within a ss region, iii) reacting to different extents with a given base (except T) as a function of accessibility of the target unpaired nucleotides, iv) easy localization of the target site by sequencing gels.

Since CL is a natural product isolated from a fungus, availability could be a problem. However, total synthesis of CL has been reported [38] and a structural analogue, in which the diterpenoid moiety is replaced by a naphthalene ring while preserving base selectivity and reactivity, can be easily synthesized [18], [19], [20].

Therefore, CL (along with its naphthalene derivative) represents a new valuable tool to localize and monitor unpaired structures in a DNA double helix context.

Funding Statement

This work was supported by the Italian Ministry of University and Research [FIRB-Ideas RBID082ATK_001] and by the University of Padua. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1. Belmont P, Constant J-F, Demeunynck M (2001) Nucleic acid conformation diversity: from structure to function and regulation. Chem Soc Rev 30: 70–81. [Google Scholar]
  • 2. Svozil D, Kalina J, Omelka M, Schneider B (2008) DNA conformations and their sequence preferences. Nucleic Acids Res 36: 3690–3706. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Kamashev D, Balandina A, Mazur AK, Arimondo PB, Rouviere-Yaniv J (2008) HU binds and folds single-stranded DNA. Nucleic Acids Res 36: 1026–1036. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Streisinger G, Okada Y, Emrich J, Newton J, Tsugita A, et al. (1966) Frameshift mutations and the genetic code. This paper is dedicated to Professor Theodosius Dobzhansky on the occasion of his 66th birthday. Cold Spring Harb Symp Quant Biol 31: 77–84. [DOI] [PubMed] [Google Scholar]
  • 5. Tippin B, Kobayashi S, Bertram JG, Goodman MF (2004) To slip or skip, visualizing frameshift mutation dynamics for error-prone DNA polymerases. J Biol Chem 279: 45360–45368. [DOI] [PubMed] [Google Scholar]
  • 6. Baase WA, Jose D, Ponedel BC, von Hippel PH, Johnson NP (2009) DNA models of trinucleotide frameshift deletions: the formation of loops and bulges at the primer-template junction. Nucleic Acids Res 37: 1682–1689. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Smith GR (2008) Meeting DNA palindromes head-to-head. Genes Dev 22: 2612–2620. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Jones GB, Lin Y, Ma D, Xiao Z, Hwang GS, et al. (2008) Congeners of the enediyne neocarzinostatin chromophore: designed agents for bulged nucleic acid targets. Curr Top Med Chem 8: 436–447. [DOI] [PubMed] [Google Scholar]
  • 9. Nakatani K, Okamoto A, Saito I (1999) Specific Alkylation of Guanine Opposite to a Single Nucleotide Bulge: A Chemical Probe for the Bulged Structure of DNA. Angew Chem Int Ed Engl 38: 3378–3381. [PubMed] [Google Scholar]
  • 10.Ong HC, Arambula JF, Rao Ramisetty S, Baranger AM, Zimmerman SC (2009) Molecular recognition of a thymine bulge by a high affinity, deazaguanine-based hydrogen-bonding ligand. Chem Commun (Camb): 668–670. [DOI] [PubMed]
  • 11. Nakatani K, Sando S, Kumasawa H, Kikuchi J, Saito I (2001) Recognition of guanine-guanine mismatches by the dimeric form of 2-amino-1,8-naphthyridine. J Am Chem Soc 123: 12650–12657. [DOI] [PubMed] [Google Scholar]
  • 12. Pierre VC, Kaiser JT, Barton JK (2007) Insights into finding a mismatch through the structure of a mispaired DNA bound by a rhodium intercalator. Proc Natl Acad Sci U S A 104: 429–434. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Hong C, Hagihara M, Nakatani K (2011) Ligand-assisted complex formation of two DNA hairpin loops. Angew Chem Int Ed Engl 50: 4390–4393. [DOI] [PubMed] [Google Scholar]
  • 14. Andersen N, Rasmussen P (1984) The costitution of Clerocidin. A new antibiotic isolated from Oidiodendron truncatum. Tetrahedron Lett 25: 465–468. [Google Scholar]
  • 15. Andersen NR, Lorck HO, Rasmussen PR (1983) Fermentation, isolation and characterization of antibiotic PR-1350. J Antibiot (Tokyo) 36: 753–760. [DOI] [PubMed] [Google Scholar]
  • 16. Andersen NR, Lorck HO, Rasmussen PR (1984) The relative and absolute con®guration of clerocidin and its cometabolites. Tetrahedron Lett 25: 469–472. [Google Scholar]
  • 17. Gatto B, Richter S, Moro S, Capranico G, Palumbo M (2001) The topoisomerase II poison clerocidin alkylates non-paired guanines of DNA: implications for irreversible stimulation of DNA cleavage. Nucleic Acids Res 29: 4224–4230. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Richter S, Gatto B, Fabris D, Takao K, Kobayashi S, et al. (2003) Clerocidin alkylates DNA through its epoxide function: evidence for a fine tuned mechanism of action. Nucleic Acids Res 31: 5149–5156. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Richter SN, Menegazzo I, Fabris D, Palumbo M (2004) Concerted bis-alkylating reactivity of clerocidin towards unpaired cytosine residues in DNA. Nucleic Acids Res 32: 5658–5667. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Richter SN, Menegazzo I, Nadai M, Moro S, Palumbo M (2009) Reactivity of clerocidin towards adenine: implications for base-modulated DNA damage. Org Biomol Chem 7: 976–985. [DOI] [PubMed] [Google Scholar]
  • 21. Richter S, Fabris D, Binaschi M, Gatto B, Capranico G, et al. (2004) Effects of common buffer systems on drug activity: the case of clerocidin. Chem Res Toxicol 17: 492–501. [DOI] [PubMed] [Google Scholar]
  • 22. Maxam AM, Gilbert W (1980) Sequencing end-labeled DNA with base-specific chemical cleavages. Methods Enzymol 65: 499–560. [DOI] [PubMed] [Google Scholar]
  • 23. Patel DJ, Kozlowski SA, Marky LA, Rice JA, Broka C, et al. (1982) Extra adenosine stacks into the self-complementary d(CGCAGAATTCGCG) duplex in solution. Biochemistry 21: 445–451. [DOI] [PubMed] [Google Scholar]
  • 24. Woodson SA, Crothers DM (1988) Preferential location of bulged guanosine internal to a G.C tract by 1H NMR. Biochemistry 27: 436–445. [DOI] [PubMed] [Google Scholar]
  • 25. Nikonowicz E, Roongta V, Jones CR, Gorenstein DG (1989) Two-dimensional 1H and 31P NMR spectra and restrained molecular dynamics structure of an extrahelical adenosine tridecamer oligodeoxyribonucleotide duplex. Biochemistry 28: 8714–8725. [DOI] [PubMed] [Google Scholar]
  • 26. Nikonowicz EP, Meadows RP, Gorenstein DG (1990) NMR structural refinement of an extrahelical adenosine tridecamer d(CGCAGAATTCGCG)2 via a hybrid relaxation matrix procedure. Biochemistry 29: 4193–4204. [DOI] [PubMed] [Google Scholar]
  • 27. van den Hoogen YT, van Beuzekom AA, de Vroom E, van der Marel GA, van Boom JH, et al. (1988) Bulge-out structures in the single-stranded trimer AUA and in the duplex (CUGGUGCGG).(CCGCCCAG). A model-building and NMR study. Nucleic Acids Res 16: 5013–5030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Kalnik MW, Norman DG, Zagorski MG, Swann PF, Patel DJ (1989) Conformational transitions in cytidine bulge-containing deoxytridecanucleotide duplexes: extra cytidine equilibrates between looped out (low temperature) and stacked (elevated temperature) conformations in solution. Biochemistry 28: 294–303. [DOI] [PubMed] [Google Scholar]
  • 29. Kalnik MW, Norman DG, Li BF, Swann PF, Patel DJ (1990) Conformational transitions in thymidine bulge-containing deoxytridecanucleotide duplexes. Role of flanking sequence and temperature in modulating the equilibrium between looped out and stacked thymidine bulge states. J Biol Chem 265: 636–647. [PubMed] [Google Scholar]
  • 30. Morden KM, Gunn BM, Maskos K (1990) NMR studies of a deoxyribodecanucleotide containing an extrahelical thymidine surrounded by an oligo(dA).oligo(dT) tract. Biochemistry 29: 8835–8845. [DOI] [PubMed] [Google Scholar]
  • 31. Rosen MA, Live D, Patel DJ (1992) Comparative NMR study of A(n)-bulge loops in DNA duplexes: intrahelical stacking of A, A-A, and A-A-A bulge loops. Biochemistry 31: 4004–4014. [DOI] [PubMed] [Google Scholar]
  • 32. Dornberger U, Hillisch A, Gollmick FA, Fritzsche H, Diekmann S (1999) Solution structure of a five-adenine bulge loop within a DNA duplex. Biochemistry 38: 12860–12868. [DOI] [PubMed] [Google Scholar]
  • 33. Dohno C, Kohyama I, Hong C, Nakatani K (2012) Naphthyridine tetramer with a pre-organized structure for 1:1 binding to a CGG/CGG sequence. Nucleic Acids Res 40: 2771–2781. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Jourdan M, Granzhan A, Guillot R, Dumy P, Teulade-Fichou MP (2012) Double threading through DNA: NMR structural study of a bis-naphthalene macrocycle bound to a thymine-thymine mismatch. Nucleic Acids Res 40: 5115–5128. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Granzhan A, Largy E, Saettel N, Teulade-Fichou MP (2010) Macrocyclic DNA-mismatch-binding ligands: structural determinants of selectivity. Chemistry 16: 878–889. [DOI] [PubMed] [Google Scholar]
  • 36. Bahr M, Gabelica V, Granzhan A, Teulade-Fichou MP, Weinhold E (2008) Selective recognition of pyrimidine-pyrimidine DNA mismatches by distance-constrained macrocyclic bis-intercalators. Nucleic Acids Res 36: 5000–5012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Peng T, Nakatani K (2005) Binding of naphthyridine carbamate dimer to the (CGG)n repeat results in the disruption of the G-C base pairing. Angew Chem Int Ed Engl 44: 7280–7283. [DOI] [PubMed] [Google Scholar]
  • 38. Xiang AX, Watson DA, Ling T, Theodorakis EA (1998) Total Synthesis of Clerocidin via a Novel, Enantioselective Homoallenylboration Methodology. J Org Chem 63: 6774–6775. [DOI] [PubMed] [Google Scholar]

Articles from PLoS ONE are provided here courtesy of PLOS

RESOURCES