Abstract
Huntington's disease (HD) is caused by a CAG repeat expansion that is unstable upon germ-line transmission and exhibits mosaicism in somatic tissues. We show that region-specific CAG repeat mosaicism profiles are conserved between several mouse models of HD and therefore develop in a predetermined manner. Furthermore, we demonstrate that these synchronous, radical changes in CAG repeat size occur in terminally differentiated neurons. In HD this ongoing mutation of the repeat continuously generates genetically distinct neuronal populations in the adult brain of mouse models and HD patients. The neuronal population of the striatum is particularly distinguished by a high rate of CAG repeat allele instability and expression driving the repeat upwards and would be expected to enhance its toxicity. In both mice and humans, neurons are distinguished from nonneuronal cells by expression of MSH3, which provides a permissive environment for genetic instability independent of pathology. The neuronal mutations described here accumulate to generate genetically discrete populations of cells in the absence of selection. This is in contrast to the traditional view in which genetically discrete cellular populations are generated by the sequence of random variation, selection, and clonal proliferation. We are unaware of any previous demonstration that mutations can occur in terminally differentiated neurons and provide a proof of principle that, dependent on a specific set of conditions, functional DNA polymorphisms can be produced in adult neurons.
Keywords: CAG repeat instability, differentiated neuron, Huntington's disease, mismatch repair, somatic mutation
Tandem DNA repeats are mutational hotspots in the genome for which variability in length can produce physiological phenotypic differences that may contribute to rapid evolution of species-typical traits (1, 2) and on expansion are known to cause a large number of diseases (3). Instability of expanded repeats, a change in repeat number, occurs on intergenerational transmission and in somatic tissues and can be observed as mosaicism (heterogeneity of the repeat length in the tissue). Because repeat length variability can produce phenotypic diversity, somatic instability of tandem repeats can be a source of cellular diversity both physiologically and pathologically.
Huntington's disease (HD) is caused by a CAG repeat expansion for which mosaicism in the somatic tissues of HD mouse models (4–7) and in HD patient brains (8, 9) has been widely reported. Observations that the repeat continues to be unstable in adult mouse brains led to a hypothesis that somatic instability may contribute to the disease process and that the repeats could be unstable in nondividing neurons (5, 9). Previous studies have demonstrated divergence in the CAG repeat length between brain cell types. In dentato-rubral pallido-luysian atrophy post mortem brains, granule cells were found to have shorter repeats than Purkinje cells and both were shorter than those in glia (10, 11). In HD patients and a knockin HD mouse model, neuronal alleles, on average, were found to be longer than glial alleles (12). However, in all of these studies the median differences between the neuronal and nonneuronal cell populations were small, usually not exceeding more than five repeat units.
Therefore, CAG repeat mosaicism in neuronal populations in dentato-rubral pallido-luysian atrophy and HD postmortem brains has been shown, but, crucially, it has not been established when and where these diverse alleles are generated: in replicating neuronal progenitor cells or in nonreplicating terminally differentiated neurons. Similarly, there has been no indication of the causes that predispose cells to differential rates of CAG instability. In this study we set out to determine whether the instability leading to CAG repeat mosaicism has occurred in terminally differentiated neurons and to understand how differences in instability rates arise between cell populations.
Results
Signatures of Somatic CAG Instability in the Brain.
The R6/1 mouse model of HD expresses exon 1 of the human HD gene with ≈120 CAGs (13), and we have previously reported that the degree of somatic instability that occurs in this mouse model varies between brain regions and peripheral tissues (5). We sought to establish baseline CAG repeat size distribution profiles and their variability between individual mice (Fig. 1). Consistent with previous findings, mosaicism developed with age and was highly specific to brain region. Our multiple comparisons demonstrate that patterns of CAG repeat distribution are remarkably reproducible between individuals, implying that the mutation behaves in a predetermined manner. Each distribution contained two major components: one that changes little with time (conservative component) and a multimodal component that increases in size with age (radical component). The multimodality can be inferred from the appearance of numerous peaks within traces (Figs. 1–3). In the striatum and brainstem, the radical component is apparent at 3 months of age and expands with time at mode-specific rates. The tissue profiles differ by the relative proportion and position of the radical components and their modal composition (Figs. 1–3). We then conducted a similar detailed analysis of CAG repeat instability in brain regions of the R6/2 human exon 1 model (13) (≈210 CAGs) and the HdhQ150 knockin model in which ≈150 CAGs have been knockin to the mouse HD gene (Hdh) (14, 15) [supporting information (SI) Fig. 6]. Comparison of repeat profiles among the three models reveals that the relative rate of CAG repeat instability and the modality of the CAG repeat size distribution for each brain region are conserved, generating a distinctive pattern or “signature” (Figs. 1 and 2).
We next examined CAG repeat distributions in the mRNA transcribed from the R6/1 locus at 9 months of age. We found that the ranges but not the profiles of the CAG distributions are well matched between DNA and RNA (Fig. 3). The expression levels of the CAG repeat-bearing transcripts are mode-specific. In the striatum the impact of instability on the gene product is particularly pronounced, because this mode specificity acts to reverse the polarity of the profile between the DNA and RNA traces with repeats of the radical component being more highly expressed. The predominant mode in the RNA has a peak of 160 CAG repeats, which represents an increase of ≈36% over the peak of the conservative component. Expression disequilibrium between alleles and the high rate of instability in the striatum would cumulatively drive the pathogenic length of the CAG repeat in the HD gene product upwards and be expected to enhance its toxicity.
Differential Distribution of the Radical and Conservative Components Between Brain Cell Types.
We observed that CAG repeat size distributions are composed of distinct populations of alleles, each with a differential level of expression. It has been documented that the level of HD gene expression is higher in neurons than in glia (16). It would, therefore, be consistent for striatal neurons to contain the radical component of instability. We used laser microdissection to isolate neuronal and nonneuronal populations and determined the associated CAG repeat distributions in the striata of R6/1 mice (Fig. 4A). Remarkably, at 6 months of age, the neuronal traces are composed mostly of the radical component. By 9 months, the entire neuronal distribution has migrated, while preserving its profile, as the alleles expand in a synchronous manner. The nonneuronal traces are composed mainly of the conservative component with a minor radical component that migrates with age at a rate similar to that of the neuronal distribution.
CAG Repeats Are Unstable in Terminally Differentiated Neurons.
The progressive synchronized increase in the neuronal CAG repeat size suggests that the mutation is ongoing in postmitotic neurons in the absence of DNA replication and cell division. However, evidence of neurogenesis has been reported in rodent brain (17). To establish how the samples from which our data were derived relate to the total population of striatal neurons we evaluated our sampling methods. If our traces reflect the total population then they will be reproducible in independent PCRs. If they are not representative, they will differ according to a Poisson distribution. To test this, the PCR amplification of each dissected neuronal specimen was replicated (SI Fig. 7). The CAG profiles corresponded to the radical component and were reproducible within samples and between mice. This reproducibility testifies that the traces are representative of the CAG repeats of the total neuronal population in the striatum. For the expansion of the neuronal CAG alleles between the ages of 6 and 9 months to be explained by neurogenesis, most neurons would have to have divided during this period. However, it has been estimated that in the rat and mouse adult brain one neuron is produced each day for every 2,000 existing neurons (17), i.e., 0.05% per day. Over 90 days this would amount to only ≈4.5% of the total population of neurons. Reports of decreased neurogenesis in R6/1 mice (18) supports our conclusion that we have characterized nondividing cells and that we present direct evidence for CAG repeat instability in postmitotic neurons.
Cellular Distribution of the CAG Mosaicism in HD Patient Brains.
A similar dissociation of the radical and conservative components of the CAG repeat distribution between the cell types can be observed in HD patient striata (Fig. 4B). The median CAG repeat lengths showed significant differences between neuronal and nonneuronal populations: HC104, neuronal 76.5, nonneuronal 54; HC73, neuronal 69, nonneuronal 51 (Kolmogorov–Smirnov Z test, two-tailed comparison of neuronal and nonneuronal distributions: HC104, Z = 3.253, P < 0.001; HC73, Z = 2.619, P < 0.001). Although the progression of repeat instability with age is difficult to establish in patients, it is reasonable to propose that, as in mouse, instability occurs in postmitotic neurons.
Cell Specificity of MSH3 Distributions Correlates with Instability Rates and Is Conserved Between Humans and Mice.
The factors that cause the cell-specific differences in CAG repeat instability are unknown. Development of instability has been tightly linked to the mismatch repair pathway (7, 19–21). MSH2 expression is necessary for the development of instability in HD mice (7, 19), and its complex with MSH3 and not with MSH6 is required for instability (20). Because nothing is known about the cellular distribution of MSH3 in the brain, we performed immunohistochemistry and confocal microscopy to examine this in both human and mouse. Our colocalization studies detected MSH3 in the neuronal cells but not in the majority of nonneuronal cells of both mouse and human striatum (Fig. 5 and SI Fig. 8). Because cells in which MSH3 is absent are unable to modulate CAG repeat size with age in adulthood (20), the neuronal expression of this protein accounts for cell specificity of expansion in the striatum. Conservation of the colocalization of the radical component of CAG instability and MSH3 in neurons of mice and humans suggests that the observed mosaicisms are generated by the same mechanism. By extension, the neuronal mosaicism observed in humans is likely to have developed postmitotically.
Discussion
We have shown that specific brain regions exhibit characteristic signatures of CAG repeat length distribution across different mouse models of HD. We established that these signatures evolve as a function of the differential rates of CAG instability occurring in the range of cell populations of which each brain region is composed. We found CAG repeats in terminally differentiated neurons of the adult mouse striatum to expand in synchrony between cells. Given the known dependence of CAG repeat instability on the mismatch repair system, our demonstration that repeats are somatically unstable in the cells where MSH3 is present at least partially explains how these cell-specific differences arise.
The length of the CAG repeat is proportional to the severity of the phenotype in model systems of HD (14, 22) and is negatively correlated to age of onset in HD patients (23). Because expression of the mutant allele is necessary for disease (24) the demonstration that somatically expanded alleles are expressed must link CAG instability to pathology. We show that striatum is distinguished from cortex and brainstem by a higher repeat length in mRNA as a result of high levels of CAG instability and neuronal expression levels. Although the functional consequence of this instability has not been quantified, the median difference in CAG size between neuronal and nonneuronal populations in human HD striatum would produce a difference of several decades in age of onset if analogous repeat lengths were inherited. We do not suggest that somatic CAG repeat instability is necessary for disease, because pathogenicity of the HD gene depends on many factors including CAG repeat length, gene expression level, huntingtin processing, and cellular context (a broad term encompassing protein folding and degradation networks among other cellular processes). Instead, we propose that instability acts as a significant disease modifier. In support of this, the rate of somatic instability has been shown to correlate with huntingtin accumulation in neuronal nuclei (7) although other factors clearly also contribute to this process (25).
We have found that striatal cell populations exhibit differential synchronized rates of CAG repeat instability and that therefore these must be driven by intrinsic cell-specific transacting factors. We have shown that MSH3 distribution contributes to this cell specificity. It has previously been demonstrated that MSH3 is absolutely necessary for instability in vivo (20) and that therefore the presence of MSH3 in neurons and its absence from most nonneuronal cells is highly significant. However, the wide variation in CAG instability between different brain regions predicts that the instability rate is governed by the mismatch repair system acting in conjunction with other factors. Oxidative damage is a possible candidate because this has been shown to modify instability rates (26). Recent experiments in Drosophila proposed that CAG instability is linked to pathogenesis (27). Our finding, that the CAG repeat signatures are comparable between R6/1 and HdhQ150 mice at 9 months of age, when R6/1 mice are at end-stage disease and the HdhQ150 knockins remain unaffected (15), contradicts this prediction and indicates that instability occurs as a consequence of age-related events that are little influenced by pathogenic cellular phenotypes.
We have shown that DNA hypermutation can take place in postmitotic neurons in vivo. Segregation and accumulation of functional alleles to produce a genetically distinct population of cells has been known to occur through selection and clonal proliferation (e.g., cancer or adaptive immune response). Here we show that it can occur through synchronous mutation in a significant population of postmitotic neurons. As a result of these phenomena, we show that cells in the brain diverge into genetically distinct groups as determined by the length of the CAG repeat. In the traditional view, random variation is produced throughout the genome between different cells. Even in adaptive immune response, random variations are produced within a region and are filtered through selection and clonal proliferation. For expansion of the neuronal CAG alleles to be synchronized, this mutation must be a deterministic response to a set of conditions that occur in separate cells. However, this is not adaptive (28), because the mutation is unlikely to optimize gene function in response to a selective environment.
We suggest that DNA may also be somatically unstable at other hypermutable tandem DNA repeat sequences and possibly other mutational hotspots in terminally differentiated cells. Such loci would have to have high mutation rates similar to those of the CAG repeats described here. It is known that variation in certain tandem DNA repeats can influence behavioral (2) and morphological (1) phenotypes. Therefore, it might be possible to identify hypermutable repeat loci that would contribute to diversification of characteristics in nondividing cells, thereby generating phenotypic variability outside of the disease setting.
Methods
Source of Mouse and Human Tissue.
Hemizygous R6/1 and R6/2 mice (13) were bred and reared in our colony by backcrossing R6 males to (CBA×C57BL/6) F1 females (B6CBAF1/OlaHsd; Harlan Olac) as previously described (29). HdhQ150 heterozygotes (original nomenclature: CHL2) on a mixed C57BL/6/129Ola (HM1 ES cells) background were generated at the University of Alabama (14). Frozen human brain tissue was from the New Zealand Neurological Foundation Brain Bank (University of Auckland). The study was approved by the St. Thomas's Hospital Research Ethics Committee. HD brains: HC104, male, age 40, grade 3 pathology, 18/51 CAG; HC73, male, age 47, grade 2 pathology, 19/49 CAG.
DNA Extraction, RNA Extraction, and CAG PCR Amplification.
DNA was prepared by phenol/chloroform extraction except for the laser-dissected material, which was extracted by boiling in 0.25 M KCl/0.05 M DTT for 10 min. RNA was prepared by using the RNeasy Mini Kit (Qiagen) and reverse transcribed. The detailed conditions used to amplify the CAG repeat from R6/1, R6/2, and HdhQ150 mouse DNA and human DNA are described in SI Methods.
Repeat Sizing by ABI 377.
PCR amplification conditions were optimized to detect both small-pool and conventional PCR by using the ABI377 sequencer (SI Fig. 9). The protocol is detailed in SI Methods.
Laser Microdissection.
Isolation of cellular populations was performed by using pressure-assisted laser microdissection. Frozen brain sections (15 μm) were mounted onto PEN1 membrane-covered slides (pressure-assisted laser microdissection). Sections were fixed in 90% ethanol and immunostained with NeuN antibody (1:200; Chemicon) as previously described (15) except that both primary and secondary antibody incubations were at room temperature for 20 min. Sections were washed and counterstained with methyl green. Conventional light microscopy (×40 objective) was used to collect the cell population of interest after all other cells had been ablated from the section.
Immunohistochemistry and Confocal Microscopy.
Immunohistochemistry for confocal microscopy was as previously described (30). Primary antibody dilutions were MSH3 (pAb, 1:50; Santa Cruz Biotechnology) and NeuN (mAb, 1:200; Chemicon), and fluorescent secondary antibodies (Molecular Probes) were Alexa Fluor 488 donkey anti-goat (1:1,000) and Texas red goat anti-mouse (1:400). Nuclei were visualized by using TO-PRO-3 (Molecular Probes).
Small-Pool PCR.
Small-pool PCR allows an exact description of the distribution of repeat lengths because their frequency can be measured directly by counting each single band as a single allele of a particular size (31). Each reaction contains a very small amount of template DNA so that individual alleles can be distinguished. We demonstrated that amplification of the CAG repeats from the striatum of an R6/1 mouse aged 9 months using both bulk PCR and small-pool PCR generated comparable repeat distributions (SI Fig. 10).
Supplementary Material
ACKNOWLEDGMENTS.
This work was supported by the Medical Research Council and the Wellcome Trust (Grants 60360 and 66270).
Footnotes
The authors declare no conflict of interest.
This article contains supporting information online at www.pnas.org/cgi/content/full/0800048105/DC1.
References
- 1.Fondon JW, III, Garner HR. Molecular origins of rapid and continuous morphological evolution. Proc Natl Acad Sci USA. 2004;101:18058–18063. doi: 10.1073/pnas.0408118101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Hammock EA, Young LJ. Microsatellite instability generates diversity in brain and sociobehavioral traits. Science. 2005;308:1630–1634. doi: 10.1126/science.1111427. [DOI] [PubMed] [Google Scholar]
- 3.Wells RD, Ashizawa T. Genetic Instabilities and Neurological Diseases. San Diego: Elsevier; 2006. [Google Scholar]
- 4.Bates GP, Mangiarini L, Mahal A, Davies SW. Transgenic models of Huntington's disease. Hum Mol Genet. 1997;6:1633–1637. doi: 10.1093/hmg/6.10.1633. [DOI] [PubMed] [Google Scholar]
- 5.Mangiarini L, et al. Instability of highly expanded CAG repeats in mice transgenic for the Huntington's disease mutation. Nat Genet. 1997;15:197–200. doi: 10.1038/ng0297-197. [DOI] [PubMed] [Google Scholar]
- 6.Kennedy L, Shelbourne PF. Dramatic mutation instability in HD mouse striatum: Does polyglutamine load contribute to cell-specific vulnerability in Huntington's disease? Hum Mol Genet. 2000;9:2539–2544. doi: 10.1093/hmg/9.17.2539. [DOI] [PubMed] [Google Scholar]
- 7.Wheeler VC, et al. Mismatch repair gene Msh2 modifies the timing of early disease in Hdh(Q111) striatum. Hum Mol Genet. 2003;12:273–281. doi: 10.1093/hmg/ddg056. [DOI] [PubMed] [Google Scholar]
- 8.Telenius H, et al. Somatic and gonadal mosaicism of the Huntington disease gene CAG repeat in brain and sperm. Nat Genet. 1994;6:409–414. doi: 10.1038/ng0494-409. and erratum (1994) 7:113. [DOI] [PubMed] [Google Scholar]
- 9.Kennedy L, et al. Dramatic tissue-specific mutation length increases are an early molecular event in Huntington disease pathogenesis. Hum Mol Genet. 2003;12:3359–3367. doi: 10.1093/hmg/ddg352. [DOI] [PubMed] [Google Scholar]
- 10.Watanabe H, et al. Differential somatic CAG repeat instability in variable brain cell lineage in dentatorubral pallidoluysian atrophy (DRPLA): A laser-captured microdissection (LCM)-based analysis. Hum Genet. 2000;107:452–457. doi: 10.1007/s004390000400. [DOI] [PubMed] [Google Scholar]
- 11.Hashida H, et al. Single cell analysis of CAG repeat in brains of dentatorubral-pallidoluysian atrophy (DRPLA). J Neurol Sci. 2001;190:87–93. doi: 10.1016/s0022-510x(01)00596-2. [DOI] [PubMed] [Google Scholar]
- 12.Shelbourne PF, et al. Triplet repeat mutation length gains correlate with cell-type specific vulnerability in Huntington disease brain. Hum Mol Genet. 2007;16:1133–1142. doi: 10.1093/hmg/ddm054. [DOI] [PubMed] [Google Scholar]
- 13.Mangiarini L, et al. Exon 1 of the HD gene with an expanded CAG repeat is sufficient to cause a progressive neurological phenotype in transgenic mice. Cell. 1996;87:493–506. doi: 10.1016/s0092-8674(00)81369-0. [DOI] [PubMed] [Google Scholar]
- 14.Lin CH, et al. Neurological abnormalities in a knock-in mouse model of Huntington's disease. Hum Mol Genet. 2001;10:137–144. doi: 10.1093/hmg/10.2.137. [DOI] [PubMed] [Google Scholar]
- 15.Woodman B, et al. The Hdh(Q150/Q150) knock-in mouse model of HD and the R6/2 exon 1 model develop comparable and widespread molecular phenotypes. Brain Res Bull. 2007;72:83–97. doi: 10.1016/j.brainresbull.2006.11.004. [DOI] [PubMed] [Google Scholar]
- 16.Landwehrmeyer GB, et al. Huntington's disease gene: Regional and cellular expression in brain of normal and affected individuals. Ann Neurol. 1995;37:218–230. doi: 10.1002/ana.410370213. [DOI] [PubMed] [Google Scholar]
- 17.Gage FH. Mammalian neural stem cells. Science. 2000;287:1433–1438. doi: 10.1126/science.287.5457.1433. [DOI] [PubMed] [Google Scholar]
- 18.Lazic SE, et al. Decreased hippocampal cell proliferation in R6/1 Huntington's mice. NeuroReport. 2004;15:811–813. doi: 10.1097/00001756-200404090-00014. [DOI] [PubMed] [Google Scholar]
- 19.Manley K, Shirley TL, Flaherty L, Messer A. Msh2 deficiency prevents in vivo somatic instability of the CAG repeat in Huntington disease transgenic mice. Nat Genet. 1999;23:471–473. doi: 10.1038/70598. [DOI] [PubMed] [Google Scholar]
- 20.van den Broek WJ, et al. Somatic expansion behaviour of the (CTG)n repeat in myotonic dystrophy knock-in mice is differentially affected by Msh3 and Msh6 mismatch-repair proteins. Hum Mol Genet. 2002;11:191–198. doi: 10.1093/hmg/11.2.191. [DOI] [PubMed] [Google Scholar]
- 21.Owen BA, et al. (CAG)(n)-hairpin DNA binds to Msh2-Msh3 and changes properties of mismatch recognition. Nat Struct Mol Biol. 2005;12:663–670. doi: 10.1038/nsmb965. [DOI] [PubMed] [Google Scholar]
- 22.Morley JF, Brignull HR, Weyers JJ, Morimoto RI. The threshold for polyglutamine-expansion protein aggregation and cellular toxicity is dynamic and influenced by aging in Caenorhabditis elegans. Proc Natl Acad Sci USA. 2002;99:10417–10422. doi: 10.1073/pnas.152161099. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Myers RH, Marans KS, MacDonald ME. In: Genetic Instabilities and Hereditary Neurological Diseases. Wells RD, Warren ST, editors. San Diego: Academic; 1998. pp. 301–323. [Google Scholar]
- 24.Yamamoto A, Lucas JJ, Hen R. Reversal of neuropathology and motor dysfunction in a conditional model of Huntington's disease. Cell. 2000;101:57–66. doi: 10.1016/S0092-8674(00)80623-6. [DOI] [PubMed] [Google Scholar]
- 25.Lloret A, et al. Genetic background modifies nuclear mutant huntingtin accumulation and HD CAG repeat instability in Huntington's disease knock-in mice. Hum Mol Genet. 2006;15:2015–2024. doi: 10.1093/hmg/ddl125. [DOI] [PubMed] [Google Scholar]
- 26.Kovtun IV, et al. OGG1 initiates age-dependent CAG trinucleotide expansion in somatic cells. Nature. 2007;447:447–452. doi: 10.1038/nature05778. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Jung J, Bonini N. CREB-binding protein modulates repeat instability in a Drosophila model for polyQ disease. Science. 2007;315:1857–1859. doi: 10.1126/science.1139517. [DOI] [PubMed] [Google Scholar]
- 28.Cairns J, Overbaugh J, Miller S. The origin of mutants. Nature. 1988;335:142–145. doi: 10.1038/335142a0. [DOI] [PubMed] [Google Scholar]
- 29.Hockly E, Woodman B, Mahal A, Lewis CM, Bates G. Standardization and statistical approaches to therapeutic trials in the R6/2 mouse. Brain Res Bull. 2003;61:469–479. doi: 10.1016/s0361-9230(03)00185-0. [DOI] [PubMed] [Google Scholar]
- 30.Smith DL, et al. Inhibition of polyglutamine aggregation in R6/2 HD brain slices—Complex dose-response profiles. Neurobiol Dis. 2001;8:1017–1026. doi: 10.1006/nbdi.2001.0438. [DOI] [PubMed] [Google Scholar]
- 31.Jeffreys AJ, et al. Complex gene conversion events in germline mutation at human minisatellites. Nat Genet. 1994;6:136–145. doi: 10.1038/ng0294-136. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.