Several genomic disorders are caused by an excessive number of DNA triplet repeats.
Abstract
Several genomic disorders are caused by an excessive number of DNA triplet repeats. We developed a DNA-templated reaction in which product formation occurs only when the number of repeats exceeds a threshold indicative for the outbreak of Chorea Huntington. The combined use of native chemical PNA ligation and auxiliary DNA probes enabled reactions on templates obtained from human genomic DNA.
Nucleic acid controlled chemical reactions are emerging as useful tools for nucleic acid diagnosis and imaging.1–6 Reactive probes offer advantages to the commonly used non-reactive hybridization probes.7 For example, DNA/RNA-controlled reactions proceed with very high sequence specificity – at the single nucleotide level if desired – because adjacent and simultaneous annealing of two probe molecules is required to trigger the chemical reaction. Furthermore, chemical reactions can occur with turnover in template, which offers opportunities for the amplified detection of product signals.7,8 While significant efforts have been invested into the development of new chemistries applied in challenging environments such as in cell lysates, during the polymerase chain reaction and within living cells, comparatively little attention has been paid to the nature of the targeted nucleic acid template itself.9–18 A number of genes contain repeat sequences. Several disorders such as Huntington disease, myotonic dystrophy and spinocerebellar ataxia are caused by an excessive number of trinucleotide repeats.19–23 For example, the expansion of triplet repeats in the human IT15 gene leads to Chorea Huntington Disease (HD).24,25 Alleles with less than 26 repeats are considered normal and are stable, but alleles that contain 27–36 triplet repeats may expand during meiosis, hence leading to increased repeats in the next generation. Eventually, alleles with more than 36 triplet repeats lead to HD pathology.26,27
We envisioned that a nucleic acid directed reaction could be designed to provide product required that the number of repeat sequences exceeds a critical threshold value. This idea was based on the notion that the number of triplet repeats in the template should correlate with the effective concentration of template available for binding of the reactive probes.
Our design of a reaction system that discriminates between repeat sequence DNA of varied length involves the native chemical ligation of peptide nucleic acid (PNA)-based probes (Scheme 1).28–32 PNA is a DNA analogue that has high affinity for complementary DNA.33–35 This facilitates hybridization and reactions on folded nucleic acid targets (such as long DNA repeat sequences). The method requires two reactive PNA probes, one of which is equipped with a C-terminal thioester moiety (I) while the other bears an N-terminal cysteine residue (II). Hybridization with the complementary DNA sequence brings the reactive groups into proximity and triggers the native chemical ligation, whereas no reaction should occur in the absence of template.30
Scheme 1. Principle of a native chemical ligation of PNA probes (I and II), in which formation of product III is indicative for a critical length of a triplet repeat DNA template.
In the initial phase of probe development, we compared a probe set directed against the CAG repeats in the Huntington coding strand with a probe set targeted against the CTG repeats of the corresponding non-coding strand. The latter was favoured by us owing to difficulties in the synthesis of (CTG)n containing PNA probes. Next we investigated whether templated reactions occur on repeat sequence templates. The CAG-containing PNA thioester 1 and cysteinyl-PNA 2 were allowed to react in the absence and presence of a size-matched synthetic CTG-repeat DNA template (Fig. 1a). The formation of product 3 was monitored by reversed phase-ultra high performance chromatography (RP-UPLC, Fig. 1b). This enabled the analysis of picomol amounts within short time (4 minutes). During the optimization of the native chemical ligation between 1 and 2 we noticed a surprisingly high background reaction in absence of template (Fig. S37†). To avoid probe–probe interactions the reaction temperature was increased to 50 °C. We next added 200 nM CTG-repeat DNA templates of varied length. This concentration is within the range of amplicons obtained after PCR. Of note, the rates of native chemical PNA ligation proved critically dependent on the number of repeats (Fig. 1c). The positive correlation between product yield and repeat length extended until 20 CTG repeats (Fig. 1d). Then, at 24 CTG repeats the efficiency of the PNA ligation suddenly dropped. Increases of probe concentration to 5 μM did not lead to an improvement. Melt experiments (Table S1†) suggested that the template adopted a superstructure that had significantly higher stability ((CTG)20, T M = 43 °C; (CTG)32, T M = 66 °C) than the complexes formed between the reactive PNA probes and the template (1·(CTG)4, T M = 39 °C; 2·(CTG)4, T M = 38 °C). The PNA probes were elongated by three nucleotides to increase the template affinity. However, the rather large extent of background ligation was problematic (Fig. S38†).
Fig. 1. (a) DNA templated native chemical PNA ligation between 1 and 2 at equimolar probe/template ratio; (b) UPLC analysis of aliquots withdrawn from the reaction mixture after the specified reaction times; (c) time course of reactions in presence of synthetic ssDNA varying in triplet repeat length; (d) fractional increase in product yield after 10 min, 30 min or 80 min. Conditions: 1 μM reactive PNA probes in buffer (10 mM NaH2PO4, 150 mM NaCl, 10 mM MesNa, pH 7.5), 200 nM template when added, 50 °C. R = (CH2)2SO3H.
We assumed that the use of aminomodified PNA should offer a solution to the problem of PNA–PNA interaction and, at the same time, confer the required increase of the template affinity. According to this, charge repulsion would help reduce PNA–PNA interactions whereas charge attraction would foster the formation of PNA–DNA template complexes. For that purpose, the aminoethyllysine cytidine building block c* was prepared (ESI†) and incorporated into the CAG-containing PNA thioester probe 4 (Fig. 2a). The cysteinyl-PNA 5 was equipped with a C-terminal lysine. The extent of background reaction between the aminomodified reactive probes 4 and 5 in absence of template was very small (Fig. S37†) and the scope of templated reactions was extended to sequences longer than 20 repeats. The yield of the templated ligation between 4 and 5 continued to increase until a length of 32 triplet repeats (Fig. 2b and S39†). This corresponds to the length, which may lead to the Huntington phenotype in offspring. However, the ligation efficiency was severely affected, again, when the DNA length exceeded 32 repeats (Fig. 2c). The addition of denaturating agents such as DMSO, glycerol, betaine or formamide allowed increases of product formation (Fig. S41 and S42†). But still, the fractional increase in ligation rate did not surpass a length threshold of 32 repeats.
Fig. 2. (a) Native chemical ligation reaction between aminomodified PNA probes 4 and 5; (b) time course of product 6 formed in absence or presence of 100 nM DNA templates varying in the number of triplet repeats; (c) template dependence of yields obtained after 10 min, 30 min or 80 min. (Conditions: see Fig. 1, R = (CH2)2SO3H).
In previous ligation chemistries, auxiliary strands have been added to prevent reannealing of a DNA template strand produced by PCR.11,30,36 However, intramolecular hybridization is a very efficient process and the length requirements for oligonucleotides that block the formation of superstructures in the Huntington template were unknown. To avoid slippage along the template, we designed a blocker strand that is equipped with a 5′-terminal hexanucleotide sequence which anchored the CAG repeat sequence to the 3′-flanking region of the CTG repeat tract. The so-called partial blocker DNA strands 7 and 8 offered 12 or 28 CAG repeats for binding interactions with the CTG repeat part of the Huntington non-coding strand (Fig. 3a). The partial blockers 7 or 8 were added to the 4 + 5 ligation. Of note, the ligations proceeded remarkably well on DNA templates containing 44 triplet repeats (Fig. 3b). No product was formed when the template strand contained less triplet repeats than the partial blocker. Reactions performed in presence of the short blocker 7 provided higher ligation yields than reactions in presence of the long blocker 8. This can be attributed to the increased number of template bases available for annealing of reactive probes. These results showed that a DNA-directed chemical reaction can distinguish between healthy and pathological DNA length.
Fig. 3. (a) Principle of native chemical PNA ligation aided by partial DNA blockers 7 and 8 and (b) time course of the product formation triggered by DNA templates varying in triplet repeats length for reaction systems involving 7 or 8. Conditions: 1 μM reactive PNA probes in buffer (10 mM NaH2PO4, 150 mM NaCl, 10 mM MesNa, pH 7.5), 100 nM DNA template, 100 nM partial blocker, R = (CH2)2SO3H.
To test the reaction system under realistic conditions we analyzed human genomic DNA templates. Samples from a Chorea Huntington patient (purchased from the Coriell Institute) and human wild-type DNA were subjected to PCR, which was used to amplify the trinucleotide repeat segment of the IT15 gene. This was achieved by using a biotin-modified forward primer and an Alexa700-labeled reverse primer and adapting a known PCR method.37,38 The amplified DNA was captured by streptavidin-magnetic beads and NaOH treatment was used to release single stranded DNA templates (ESI†). The yield of PCR and the concentration of template strands were determined via fluorescence measurements. The reactive probes and partial blocker 7 were added to the PCR-DNA single strand. Aliquots were withdrawn after 0–180 min reaction time and analyzed via UPLC (Fig. 4). The DNA template obtained through amplification of Huntington Disease genomic DNA (HD-DNA) triggered the formation of ligation product 6 (Fig. 4b). The ligation product was detectable after 10 min reaction time (10% yield) and reached 32% in 180 minutes. By contrast, no product was formed when the reaction system involved the wild-type DNA template (Fig. 4c). The analysis was repeated with partial blocker 8 (Fig. S44 and S45†). Again, no product was formed when the template was produced from wild-type genomic DNA, whereas UPLC analysis revealed the time-dependent synthesis of ligation product 6 in reactions including the HD-DNA template. We wish to note that the signal detected via RP-UPLC is robust and can be obtained nearly as rapidly as by a fluorometric read-out. Based on peak height the signal-to-noise on a genomic DNA template is 46 (after 10 min reaction time).
Fig. 4. (a) Principle of native chemical PNA ligation on templates obtained via PCR from human genomic DNA. UPLC traces (b and c) and time course (d) of reactions performed on DNA from (b) a Chorea Huntington patient (HD-DNA) or (c) a healthy individual (WT-DNA). Conditions: see Fig. 3.
The data shows that the PNA-based reaction system responds to the length of repeat sequences in a way that enables ligation when a certain threshold is surpassed. The combined use of reactive probes and auxiliary probes provided a clear yes/no response to the diagnosis problem. The comparison with data obtained with synthetic DNA templates suggested that the HD-DNA has more than 28 repeats. The rate profile resembles reactions with >40 repeat sequences (Fig. S44 and S45†).
The reported work represents the first attempt to assess the length of triplet repeat DNA by means of a DNA-templated reaction. The PNA native chemical ligation provided detectable product after 10 min reaction time and the UPLC analysis was obtained within 4 minutes. Therefore, the time required for analysis by PNA ligation is shorter than the time required for the commonly used analysis of PCR products by gel electrophoresis. However, the need for single stranded DNA template adds capture and denaturation steps to the PCR protocol and, thereby, increases the total analysis time. Sequencing methods could, in principle, provide a more direct means of counting the number of triplet repeats. Yet, long read lengths are required in order to avoid ambiguities in alignment. The resulting need for the separation of the differently sized DNA amplicons calls for electrophoretic methods, which also are time consuming. A reduction of the analysis time required in our method should be feasible. We propose the use of PNA-based invaders such as the G-clamp modified γPNA39 introduced by Ly as this would make the capture/denaturation step redundant.
Conclusions
We identified a reaction system, which allowed the discrimination between Chorea Huntington DNA and wild-type DNA. The reaction system relied on reactive PNA probes, which contained thioester and cysteinyl units required to drive a native chemical ligation reaction. Of note, aggregation of repeat sequence PNA and the formation of template superstructures presented a formidable challenge. A solution to the problem of false positive ligation and folding of large repeat templates was provided by the combined use of amino-modified PNA probes and auxiliary, non-reactive DNA hybridization probes. Under these conditions, the native chemical PNA ligation proceeded at high rate and afforded detectable signals after 10 min reaction time provided that the template was longer than the auxiliary probe (partial blocker). The length-reactivity relationship is within a range that allows the characterization of various polyglutamine diseases.
Acknowledgments
This work was supported by Deutsche Forschungsgesellschaft (DFG).
Footnotes
References
- Gorska K., Winssinger N. Angew. Chem., Int. Ed. 2013;52:6820–6843. doi: 10.1002/anie.201208460. [DOI] [PubMed] [Google Scholar]
- Li X., Liu D. R. Angew. Chem., Int. Ed. 2004;43:4848–4870. doi: 10.1002/anie.200400656. [DOI] [PubMed] [Google Scholar]
- Gryaznov S. M., Schultz R., Chaturvedi S. K., Letsinger R. L. Nucleic Acids Res. 1994;22:2366–2369. doi: 10.1093/nar/22.12.2366. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Silverman A. P., Kool E. T. Chem. Rev. 2006;106:3775–3789. doi: 10.1021/cr050057+. [DOI] [PubMed] [Google Scholar]
- Martí A. A., Jockusch S., Stevens N., Ju J., Turro N. J. Acc. Chem. Res. 2007;40:402–409. doi: 10.1021/ar600013q. [DOI] [PubMed] [Google Scholar]
- Shibata A., Abe H., Ito Y. Molecules. 2010;17:2446–2463. doi: 10.3390/molecules17032446. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Grossmann T. N., Strohbach A., Seitz O. ChemBioChem. 2008;9:2185–2192. doi: 10.1002/cbic.200800290. [DOI] [PubMed] [Google Scholar]
- Michaelis J., Roloff A., Seitz O. Org. Biomol. Chem. 2014;12:2821. doi: 10.1039/c4ob00096j. [DOI] [PubMed] [Google Scholar]
- Harcourt E. M., Kool E. T. Nucleic Acids Res. 2012;40:e65. doi: 10.1093/nar/gkr1313. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Roloff A., Seitz O. Bioorg. Med. Chem. 2013;21:3458–3464. doi: 10.1016/j.bmc.2013.04.064. [DOI] [PubMed] [Google Scholar]
- Xu Y., Karalkar N. B., Kool E. T. Nat. Biotechnol. 2001;19:148–152. doi: 10.1038/84414. [DOI] [PubMed] [Google Scholar]
- Sando S., Abe H., Kool E. T. J. Am. Chem. Soc. 2004;126:1081–1087. doi: 10.1021/ja038665z. [DOI] [PubMed] [Google Scholar]
- Silverman A. P., Kool E. T. Trends Biotechnol. 2005;23:225–230. doi: 10.1016/j.tibtech.2005.03.007. [DOI] [PubMed] [Google Scholar]
- Abe H., Kool E. T. Proc. Natl. Acad. Sci. U. S. A. 2006;103:263–268. doi: 10.1073/pnas.0509938103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Franzini R. M., Kool E. T. J. Am. Chem. Soc. 2009;131:16021–16023. doi: 10.1021/ja904138v. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gorska K., Keklikoglou I., Tschulena U., Winssinger N. Chem. Sci. 2011;2:1969–1975. [Google Scholar]
- Sadhu K. K., Winssinger N. Chem.–Eur. J. 2013;19:8182–8189. doi: 10.1002/chem.201300060. [DOI] [PubMed] [Google Scholar]
- Saneyoshi H., Ito Y., Abe H. J. Am. Chem. Soc. 2013;135:13632–13635. doi: 10.1021/ja406724k. [DOI] [PubMed] [Google Scholar]
- LaSpada A. R., Wilson E. M., Lubahn D. B., Harding A. E., Fischbeck K. H. Nature. 1991;352:77–79. doi: 10.1038/352077a0. [DOI] [PubMed] [Google Scholar]
- Mahadevan M., Tsilfidis C., Sabourin L., Shutler G., Amemiya C., Jansen G., Neville C., Narang M., Barceló J., O'Hoy K., Leblond S., Earle-Macdonald J., deJong P. J., Wieringa B., Korneluk R. G. Science. 1992;255:1253–1255. doi: 10.1126/science.1546325. [DOI] [PubMed] [Google Scholar]
- Brook J. D., McCurrach M. E., Harley G. G., Buckler A. J., Church D., Aburatani H., Hunter K., Stanton V. P., Thirion J.-P., hudson T., Sohn R., Zemelman B., Snell R. G., Rundle S. A., Crow S., Davies J., Shelbourne P., Buxton J., Jones C., Juvonen V., Johnson K., Harper P. S., Shaw D. J., Housman D. E. Cell. 1992;68:799–808. doi: 10.1016/0092-8674(92)90154-5. [DOI] [PubMed] [Google Scholar]
- Tsilfidis C., McKenzie A. E., Mettler G., Barcelo J., Korneluk R. G. Nat. Genet. 1992;1:192–195. doi: 10.1038/ng0692-192. [DOI] [PubMed] [Google Scholar]
- Fu Y. H., Pizzuti D. P., Pieretti M., Sutcliffe J. S., Richards S., Verkerk A. J. M. H., Holden J. J. A., Fenwick Jr. R. G., Warren S. T., Oostra B. A., Nelson D. L., Caskey C. T. Cell. 1991;67:1047–1058. doi: 10.1016/0092-8674(91)90283-5. [DOI] [PubMed] [Google Scholar]
- Margolis L., Ross C. A. Clin. Chem. 2003;49:1726–1732. doi: 10.1373/49.10.1726. [DOI] [PubMed] [Google Scholar]
- The Huntington's Disease Collaborative Research Group Cell. 1993;72:971–983. doi: 10.1016/0092-8674(93)90585-e. [DOI] [PubMed] [Google Scholar]
- Teo C. R. L., Wang W., Law H. Y., Lee C. G., Chong S. S. Clin. Chem. 2008;54:964–972. doi: 10.1373/clinchem.2007.096503. [DOI] [PubMed] [Google Scholar]
- Walker F. O. J.-Lancet. 2007;369:218–228. doi: 10.1016/S0140-6736(07)60111-1. [DOI] [PubMed] [Google Scholar]
- Mattes A., Seitz O. Chem. Commun. 2001:2050–2051. doi: 10.1039/b106109g. [DOI] [PubMed] [Google Scholar]
- Dose C., Seitz O. Org. Biomol. Chem. 2004;2:59. doi: 10.1039/b309235f. [DOI] [PubMed] [Google Scholar]
- Ficht S., Mattes A., Seitz O. J. Am. Chem. Soc. 2004;126:9970–9981. doi: 10.1021/ja048845o. [DOI] [PubMed] [Google Scholar]
- Dose C., Ficht S., Seitz O. Angew. Chem., Int. Ed. 2006;45:5369–5373. doi: 10.1002/anie.200600464. [DOI] [PubMed] [Google Scholar]
- Dose C., Seitz O. Bioorg. Med. Chem. 2008;16:65–77. doi: 10.1016/j.bmc.2007.04.059. [DOI] [PubMed] [Google Scholar]
- Nielsen P. E., Egholm M., Berg R. H., Buchardt O. Science. 1991;254:1497–1500. doi: 10.1126/science.1962210. [DOI] [PubMed] [Google Scholar]
- Egholm M., Buchardt O., Christensen L., Behrens C., Freier S. M., Driver D. A., Berg R. H., Kim S. K., Norden B., Nielsen P. E. Nature. 1993;365:566–568. doi: 10.1038/365566a0. [DOI] [PubMed] [Google Scholar]
- Wittung P., Nielsen P. E., Buchardt O., Egholm M., Norden B. Nature. 1994;1994:561–563. doi: 10.1038/368561a0. [DOI] [PubMed] [Google Scholar]
- Roloff A., Seitz O. Chem. Sci. 2012;4:432. [Google Scholar]
- Goldberg Y. P., Andrew S. E., Clarke L. A., Hayden M. R. Hum. Mol. Genet. 1993;2:635–636. doi: 10.1093/hmg/2.6.635. [DOI] [PubMed] [Google Scholar]
- Riess O., Noerremoelle A., Soerensen S. A., Epplen J. T. Hum. Mol. Genet. 1993;2:637–638. doi: 10.1093/hmg/2.6.637. [DOI] [PubMed] [Google Scholar]
- Chenna V., Rapireddy S., Sahu B., Austin C., Pedrosso E., Ly D. H. ChemBioChem. 2008;9:2388–2391. doi: 10.1002/cbic.200800441. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.





