Abstract
The chromosome 9p21 amyotrophic lateral sclerosis-frontotemporal dementia (ALS-FTD) locus contains one of the last major unidentified autosomal dominant genes underlying these common neurodegenerative diseases. We have previously shown that a founder haplotype, covering the MOBKL2b, IFNK and C9ORF72 genes, is present in the majority of cases linked to this region. Here we show that there is a large hexanucleotide (GGGGCC) repeat expansion in the first intron of C9ORF72 on the affected haplotype. This repeat expansion segregates perfectly with disease in the Finnish population, underlying 46.0% of familial ALS and 21.1% of sporadic ALS in that population. Taken together with the D90A SOD1 mutation, 87% of familial ALS in Finland is now explained by a simple monogenic cause. The repeat expansion is also present in one third of familial ALS cases of outbred European descent making it the most common genetic cause of these fatal neurodegenerative diseases identified to date.
Introduction
Amyotrophic lateral sclerosis (ALS, OMIM #105400) is a fatal neurodegenerative disease characterized clinically by progressive paralysis leading to death from respiratory failure, typically within two to three years of symptom onset (Rowland and Shneider, 2001). ALS is the third most common neurodegenerative disease in the Western World (Hirtz et al., 2007), and there are currently no effective therapies. Approximately 5% of cases are familial in nature, whereas the bulk of patients diagnosed with the disease are classified as sporadic as they appear to occur randomly throughout the population (Chiò et al., 2008). There is growing recognition, based on clinical, genetic, and epidemiological data, that ALS and frontotemporal dementia (FTD, OMIM #600274) represent an overlapping continuum of disease, characterized pathologically by the presence of TDP-43 positive inclusions throughout the central nervous system (Lillo and Hodges, 2009; Neumann et al., 2006).
To date, a number of genes have been discovered as causative for classical familial ALS, namely SOD1, TARDBP, FUS, OPTN and VCP (Johnson et al., 2010; Kwiatkowski et al., 2009; Maruyama et al., 2010; Rosen et al., 1993; Sreedharan et al., 2008; Vance et al., 2009). These genes cumulatively account for ~ 25% of familial cases, indicating that other causative genes remain to be identified. Each new gene implicated in the etiology of ALS or FTD provides fundamental insights into the cellular mechanisms underlying neuron degeneration, as well as facilitating disease modeling and the design and testing of targeted therapeutics; thus, the identification of new genes that cause ALS or FTD is of great significance.
Linkage analysis of kindreds involving multiple cases of ALS, FTD and ALS-FTD had suggested that there was an important locus for the disease on the short arm of chromosome 9 (Boxer et al., 2011; Morita et al., 2006; Pearson et al., 2011; Vance et al., 2006). Using a genome-wide association (GWAs) approach, we recently reported that this locus on chromosome 9p21 accounted for nearly half of familial ALS and nearly one quarter of all ALS cases in a cohort of 405 Finnish patients and 497 control samples (Laaksovirta et al., 2010). This association signal had previously been reported by van Es and colleagues (van Es et al., 2009), and a meta-analysis involving 4,312 cases and 8,425 controls confirmed that chromosome 9p21 was the major signal for ALS (Shatunov et al., 2010). A recent GWAs for FTD also identified this locus (Van Deerlin et al., 2010). Analysis in the Finnish population narrowed the association to a 232 kilobase (kb) block of linkage disequilibrium, and allowed the identification of a founder haplotype that increased risk of disease by over twenty-fold. The associated haplotype appears to be the same in all European-ancestry populations, and several families previously shown to have genetic linkage to the chromosome 9p21 region also share this risk haplotype (Mok et al., 2011).
We have previously identified an ALS-FTD family from the UK and an apparently unrelated ALS-FTD family from the Netherlands that showed positive linkage to the chromosome 9p21 region (Mok et al., 2011; Pearson et al., 2011). Using these families and the Finnish ALS cases that had previously been used to identify the chromosome 9p21 association signal, we undertook a methodical assessment of the region using next-generation sequencing technology in an attempt to identify the genetic lesion responsible for disease.
Results
We undertook massively parallel, next-generation, deep re-sequencing of the chromosome 9p21 region in (a) DNA that had been flow-sorted enriched for chromosome 9 obtained from an affected member of the GWENT#1 kindred (IV-3, Figure 1A; Coriell ID ND06769) and from a neurologically normal control (ND11463); and (b) DNA that had been enriched for the target region using custom oligonucleotide baits obtained from 3 cases and 5 unaffected members of the DUTCH#1 kindred (V-1, V-3 & V-14, and V-2, V4, V5, VI-1 & spouse of V-1; Figure 1B).
Analysis of the GWENT#1 sequence data revealed eight novel variants within the 232kb block of linkage disequilibrium containing the previously identified association signal that were not described as polymorphisms in either the 1000 Genomes (April 2009 release) or the dbSNP (build 32) online databases. Six of these variants were located within a 30 base pair (bp) region. When the individual sequence reads within this region were examined and manually re-aligned, they indicated the presence of a hexanucleotide repeat expansion GGGGCC located 63bp centromeric to the first exon of the long transcript of C9ORF72 (RefSeq accession number = NM_018325.2; GenBank accession number = GI:209863035) in the affected cases that was not present in the control samples (see Figure S1 available online for individual reads). The repeat expansion also lies within the first intron of the other major transcript of C9ORF72 (RefSeq accession number = NM_145005.4; GenBank accession number = GI:209863036; Figure 2A).
We next used a repeat-primed PCR method to screen case and control samples for the presence of the GGGGCC hexanucleotide repeat expansion (see Figure 2B and Experimental Procedures for detailed explanation) (Kobayashi et al., 2011; Warner et al., 1996). The nature of the repeat-primed PCR assay means that it can detect a maximum of ~60 repeats, and thus the repeat length in a sample carrying the expansion could be far greater than the estimation provided by this technique. Despite this, the assay is an accurate and rapid system that allows samples to be categorized into those that carry a pathogenic repeat expansion (greater than 30 repeats) and those that carry only wild-type alleles (less than 20 repeats). The frequency distribution of the GGGGCC hexanucleotide repeat expansion lengths in ALS cases and control samples based on the repeat-primed PCR assay is shown in Figure 3.
Using the repeat-primed PCR method, we confirmed that the expanded hexanucleotide repeat was present in the affected members of the GWENT#1 and DUTCH#1 kindreds (IV-3, IV-5, IV-7 & IV-8 in GWENT#1 and V-1, V-3, V-14 & V-15 in DUTCH#1, Figure 1A and 1B), and that the expansion was absent from asymptomatic family members (III-1, III-9 & IV-1 in GWENT#1 and V-2, V-8, V-9 & VI-1 in DUTCH#1). In the Finnish cohort of 402 ALS cases and 478 controls, repeat-primed PCR analysis showed the hexanucleotide repeat to be expanded in 113 (28.1%) cases and 2 (0.4%) controls (Fisher’s exact test p-value for allelic association = 8.1×10−38; OR = 78.0, 95% CI = 19.2 – 316.8). Overall, 52 (46.4%) of the Finnish familial ALS cases had the expansion (p-value = 3.7×10−37; OR = 140.9, 95% CI = 34.0 – 583.9), and 61 (21.0%) of the sporadic cases had the expansion (p-value = 1.7×10−24; OR = 56.1, 95% CI 13.6 – 230.2). The average number of repeats detected by the PCR assay in the Finnish cases carrying the expansion was 53 (range, 30 to 71) compared to an average of 2 (range, 0 to 22) repeats observed in the 476 controls that did not carry the expansion, thereby allowing for robust classification of samples (see Figure 3A & B).
Of the 113 familial and sporadic Finnish cases that carried the hexanucleotide repeat expansion, two-thirds (n = 76, 67.3%) carried the previously identified chromosome 9p21 founder risk haplotype (Laaksovirta et al., 2010). In contrast, only one of the Finnish controls samples that carried the expansion also carried the risk haplotype.
For confirmation of the repeat expansion and to estimate its size, fluorescence in situ hybridization (FISH) was performed in an affected member of the GWENT#1 kindred (IV-3, Figure 1A, ND06769), in a case from the NINDS0760 pedigree (III-1, Figure 1E), and in neurologically normal controls (ND11463, ND08559, ND03052, and ND03053). These experiments used a fluorescently-labeled oligonucleotide probe consisting of three GGGGCC repeats (Haaf et al., 1996). All metaphases of the cases showed a strong hybridization signal to a single chromosome – 9p21 – consisting of a discrete dot on each sister chromatid (Figure 2C). Fluorescence was not detected in any metaphases of the control samples. These experiments indicated that the expansion was at least 1.5kb (representing ~ 250 GGGGCC repeats) in size, which is the minimum detectable size of a repeat using this technique (Liehr, 2009). Additional experimental approaches, such as southern blotting, will be needed to determine the true repeat length with greater precision.
Our data clearly showed the importance of the hexanucleotide repeat expansion within the Finnish ALS population and in families linked to the chromosome 9p21 region. To further determine the frequency of the hexanucleotide expansion in outbred European populations, we screened a cohort of 268 familial ALS probands from North America (n = 198), Germany (n = 41) and Italy (n = 29) using repeat-primed PCR. 102 (38.1%) of these cases carried the same hexanucleotide GGGGCC repeat expansion within C9ORF72 (Figure 3C). Within this dataset, we identified three additional multi-generational families where the presence of the repeat expansion segregated perfectly with disease within the kindred (Figure 1C, 1D and 1E). In contrast, the repeat expansion was not detected in 262 US Caucasian controls, 83 Italian controls and 64 German controls (total number of control chromosomes = 818, average number of repeats = 3, range 0 – 18, Figure 3D). An additional series of 300 anonymous African and Asian samples that are part of the Human Gene Diversity Panel (Cann et al., 2002) were included in the mutational analysis as controls to evaluate the genetic variability of the C9ORF72 hexanucleotide repeat expansion in non-Caucasian populations. None of these samples carried more than 15 GGGGCC repeats (average number of repeats = 3, range = 0 – 15).
Given the genetic and clinical overlap between ALS and FTD, as well as the co-occurrence of ALS and FTD within families linked to the chromosome 9p21 locus, we tested the hypothesis that the hexanucleotide repeat expansion may underlie a proportion of FTD cases by measuring its occurrence in a cohort of 75 Finnish FTD cases using the same repeat-primed PCR method. The percentage of these FTD cases carrying the repeat expansion was comparable to that of the Finnish ALS cohort (n = 22, representing 29.3% of the cohort), and the GGGGCC repeat expansion was highly associated with FTD in the Finnish population (Fisher’s exact test p-value based on 75 Finnish FTD cases and 478 Finnish controls = 4.3×10−18; OR = 82.0, 95% CI 19.1 – 352.8). Six of the Finnish FTD cases carrying the repeat expansion presented with progressive non-fluent aphasia, and the remaining 16 patients had clinical features consistent with behavioral-variant FTD. In addition, 8 (36.4%) of these Finnish FTD patients had a personal or family history of ALS.
Using expression arrays, C9ORF72 RNA was detected across multiple CNS tissues obtained from neuropathologically normal individuals including spinal cord, with the highest expression level observed within the cerebellum (Figure 4). Real-time RT-PCR analysis of expression in frontal cortex tissue obtained from patients and controls did not find any conclusive change in RNA levels, and produced inconsistent results across different labs and different samples (see Figure S2 available online for preliminary data). Immunocytochemistry using an antibody that recognizes both human and mouse C9ORF72 (Santa Cruz Biotechnology, Inc.) found the protein to be predominantly localized within the nucleus in human control fibroblast cell lines and in the mouse motor neuron NSC-34 cell line (Figure 5). Furthermore, C9ORF72 protein levels appeared to be reduced in fibroblast cell lines derived from ALS patients relative to controls, with relatively more cytoplasmic staining in cases compared to controls. However, these data must be considered to be preliminary, as firm conclusions cannot be drawn based on a small number of samples. Furthermore, the Santa Cruz C9ORF72 antibody used for these experiments is largely uncharacterized: preliminary data suggests that siRNA knockdown of C9ORF72 mRNA results in a mild reduction of C9ORF72 protein by western blot in H4 and T98G cells, though a similar effect was also seen using siRNA allstar control (see Figure S3 online). At this stage, the question of whether the pathogenic expansion is associated with a decrease in protein expression remains unresolved. Future experiments requiring generation of more specific antibodies and more quantitative approaches will be needed to definitively determine the localization of the different C9ORF72 isoforms in different tissues and at various stages of disease progression.
Discussion
In this paper, we used next-generation sequencing technology to identify a hexanucleotide repeat expansion within the C9ORF72 gene as the cause of chromosome 9p21-linked ALS-FTD, and subsequently confirmed the presence of this large expansion in a substantial proportion of familial ALS and FTD cases. Overall, the hexanucleotide repeat expansion was found in nearly one half of Finnish familial ALS cases and in more than one third of familial ALS cases of wider European ancestry. Our data indicate that the repeat expansion is more than twice as common as mutations in the SOD1 gene as a cause of familial ALS (Chiò et al., 2008), and more than three times as common as TARDBP, FUS, OPTN and VCP mutations combined (Johnson et al., 2010; Mackenzie et al., 2010; Maruyama et al., 2010). Taken together with the D90A SOD1 mutation, our data show that nearly 90% of familial ALS in Finland is now explained by a simple monogenic cause.
We present five pieces of genetic data demonstrating that the hexanucleotide repeat expansion is pathogenic for neurodegeneration. First, the hexanucleotide expansion segregated with disease within two multi-generational kindreds that have been convincingly linked to the region (Pearson et al., 2011). Second, the hexanucleotide expansion was highly associated with disease in the same cohort of ALS cases and controls that was used to identify the chromosome 9p21 region within the Finnish population. Furthermore, the association signal based on the presence or absence of the expansion was many times greater than that indicated by the surrounding SNPs (p-value based on expansion = 8.1×10−38 versus 9.11×10−11 based on the most associated SNP rs3849942 in the initial Finnish ALS GWAs) (Laaksovirta et al., 2010). Third, the hexanucleotide repeat expansion was not found in 409 population-matched control subjects or in 300 diverse population samples screened in our laboratory. Fourth, we found that a large proportion of apparently unrelated familial ALS and FTD cases carried the same hexanucleotide repeat expansion within C9ORF72. Within this cohort of European-ancestry familial samples, we identified three additional multi-generational families within which the repeat expansion segregated perfectly with disease. Fifth, FISH analysis demonstrated that the repeat expansion is large in size (at least 1.5kb to be visualized by this technique, Figure 2C), and such long expansions are typically pathogenic (Kobayashi et al., 2011). Finally, another group independently discovered the same genetic mutation to be the cause of chromosome 9p21-linked FTD/ALS (DeJesus-Hernandez, 2011).
Our data indicate that both ALS and FTD phenotypes are associated with the C9ORF72 GGGGCC hexanucleotide repeat expansion. Several members of the GWENT#1 and DUTCH#1 pedigrees manifested clinical signs of isolated motor neuron dysfunction or isolated cognitive decline, whereas other affected members developed mixed ALS-FTD symptomatology over the course of their illness (Pearson et al., 2011). It is interesting to note that the frequency of the repeat expansion was almost identical in our ALS and FTD case cohorts, suggesting that carriers of the mutant allele are equally at risk for both forms of neurodegeneration. Our data support the notion that the observed clinical and pathological overlap between ALS and FTD forms of neurodegeneration may be driven in large part by the C9ORF72 hexanucleotide repeat expansion.
The identification of the cause of chromosome 9p21-linked neurodegeneration allows for future screening of population-based cohorts to further unravel the overlap between ALS and FTD, and to identify additional genetic and environmental factors that push an individual’s symptoms towards one end of the ALS/FTD clinical spectrum. Some early observations may already be made: among our Finnish FTD cohort, we identified several patients carrying the pathogenic repeat expansion who presented with non-fluent progressive aphasia. This suggests that the difficulties with speech production that are commonly observed in ALS patients may in some cases be partially attributable to cortical degeneration in addition to tongue and bulbar musculature weakness secondary to hypoglossal motor neuron degeneration. It is also interesting to note that age of symptom onset varied widely in patients carrying the pathogenic hexanucleotide expansion including some individuals who developed weakness in their ninth decade of life. The genetic and/or environmental factors underlying this variability remain to be determined.
Our development of a rapid, reliable method of screening individuals for the repeat expansion will have immediate clinical utility by allowing early identification of ALS patients at increased risk of cognitive impairment, and of FTD cases at increased risk of progressive paralysis. In the longer term, the identification of the genetic lesion underlying chromosome 9p21-linked ALS and FTD, together with the observed high frequency in these patient populations, makes it an ideal target for drug development aimed at amelioration of the disease process.
Broadly speaking, pathogenic repeat expansions are thought to cause disease through haploinsufficiency, in which expression or splicing of the target gene is perturbed, or through the generation of abnormal amounts of toxic RNA that disrupt normal cellular pathways. We favor the second as a mechanism in chromosome 9 FTD/ALS, given the large size of the expansion visualized by FISH and its non-coding localization within the C9ORF72 gene. RNA generated from such pathogenic repeat expansions are thought to disrupt transcription by sequestering normal RNA and proteins involved in transcription regulation (Wojciechowska and Krzyzosiak, 2011), and disruption of RNA metabolism has already been implicated in the pathogenesis of ALS associated with mutations in TDP-43 and FUS (Lagier-Tourenne et al., 2010). However, knowing the pattern of distribution of C9ORF72 expression is likely to be key in understanding cell vulnerability and local expression of the hexanucleotide repeat expansion, which is likely influenced by the promoter of the C9ORF72 gene. We did not find consistent differences in expression between cases and controls. This may represent the true biological effect of the GGGGCC hexanucleotide repeat expansion on C9ORF72 expression, or alternatively may reflect the small number of samples analyzed or tissue-to-tissue variation in expression of this gene. More definitive experimental testing is required in the future to resolve the effect of the hexanucleotide repeat expansion on C9ORF72 expression, as well as additional molecular biology investigation to understand the precise mechanism by which the hexanucleotide repeat may disrupt RNA metabolism.
An important aspect of understanding a pathogenic repeat expansion focuses on its stability. Preliminary evidence suggests that the C9ORF72 hexanucleotide repeat expansion may be unstable. First, minor anticipation has been noted in pedigrees that originally identified the locus with earlier generations being relatively unaffected by disease, perhaps reflecting expanding repeat number over successive generations (Vance et al., 2006). Interestingly, anticipation was not observed within the five families in which we found the hexanucleotide repeat expansion (see Figure 1). Second, although there was strong concordance between the presence of the chromosome 9p21 founder risk haplotype and the presence of the hexanucleotide expansion in an individual, the expansion was also present in ALS cases that did not carry this haplotype. These data are consistent with the expansion occurring on multiple occasions on multiple haplotype backgrounds. Taken together, these observations suggest that the C9ORF72 repeat region has some degree of instability. This instability may be particularly relevant for sporadic ALS, where the apparent random nature of the disease in the community could be a consequence of stochastic expansion in the number of repeats. It is noteworthy that a sizeable proportion of the Finnish ALS cases that carried the repeat expansion was clinically classified as sporadic.
In summary, our data demonstrate that a massive hexanucleotide repeat expansion within C9ORF72 is the cause of chromosome 9p21-linked ALS, FTD and ALS-FTD. Furthermore, this expansion accounts for an unprecedented proportion of ALS cases in Finland and in familial ALS cases of European ancestry, and provides additional evidence supporting the role of disrupted RNA metabolism as a cause of neurodegeneration.
Experimental Procedures
Patients and material
We studied a four-generation Welsh family (GWENT#1) in which 9 individuals had been diagnosed with ALS and/or FTD, and were known to share the chromosome 9p21 risk haplotype. The pedigree of this family is shown in Figure 1A, and the clinical features have been previously reported (Pearson et al., 2011). DNA samples were available from four individuals of generations IV who had been diagnosed with ALS and/or FTD. Flow-sorting of chromosome 9 was performed on lymphoblastoid cell lines from an affected case ND06769 (IV-3, Figure 1A) and a neurologically normal population control ND11463 at Chrombios GmbH (www.chrombios.com) using a FACS Vantage cell sorter (BD Biosciences, Franklin Lakes, NJ, USA).
We also analyzed an apparently unrelated six-generation Dutch ALS/FTD family (DUTCH#1, Figure 1B), in which linkage and haplotype analysis showed significant linkage to a 61Mb region on chromosome 9p21 spanning from rs10732345 to rs7035160 and containing 524 genes and predicted transcripts. Genomic regions from all exons and exon-intron boundaries, 5′ UTRs, 3′ UTRs, ~650 bp of upstream promoter regions, sno/miRNA loci, and conserved regions were captured using SureSelect target enrichment technology (Agilent Inc., Santa Clara CA, USA). In total, 43,142 unique baits were used for these experiments covering a total of 2.58 MB in the chromosome 9p FTD/ALS locus (c9FTD/ALS).
For subsequent mutational screening of the GGGGCC hexanucleotide repeat expansion, we used DNA from 402 Finnish ALS cases and 478 Finnish neurologically normal individuals that had previously been used to identify the chromosome 9p21 association signal (Laaksovirta et al., 2010). An additional 268 DNA samples were obtained from affected probands in unrelated ALS families (198 US cases, 41 German cases, and 29 Italian cases), and from 75 Finnish individuals who had presented with isolated FTD. Control samples consisted of 262 neurologically normal US individuals obtained from the NINDS repository at Coriell, 64 neurologically normal German individuals, and 83 neurologically normal Italian individuals. An additional series of 300 anonymous African and Asian samples that are part of the Human Gene Diversity Panel (Cann et al., 2002) were included in the mutational analysis as controls to evaluate the genetic variability of the repeat expansion in non-Caucasian populations. Demographics and clinical features of these samples are summarized in Table 1 and in Laaksovirta et al, 2010. Appropriate institutional review boards approved the study.
Table 1.
European-descent familial ALS case (n = 268)* | European-descent controls (n = 409)§ | Finnish FTD cases (n = 75) | Finnish ALS cases (n = 402)# | Finnish controls (n = 478)¶ | ALS cases with expansion (n = 215)+ | |
---|---|---|---|---|---|---|
Mean age (range) | 56.5 (15 – 90) | 44.1 (4 – 91) | 58.4 (38 – 79) | 56.8 (18 – 85) | 88.6 (85 – 101) | 56.6 (35 – 80) |
Male (%) | 144 (53.7%) | 165 (40.4%) | 34 (45.3%) | 199 (49.5%) | 96 (20.1%) | 112 (52.1%) |
Familial (%) | 268 (100%) | - | 27 (36.0%) | 112 (27.9%) | - | 154 (71.6%) |
Site of symptom onset: | ||||||
Bulbar-onset (%) | 53 (26.4%) | - | - | 94 (27.8%) | - | 58 (32.6%) |
Spinal-onset (%) | 148 (73.6%) | - | - | 244 (72.2%) | - | 120 (67.4%) |
Behavior variant FTD (%) | - | - | 48 (64.0%) | - | - | - |
Progressive non-fluent | ||||||
aphasia (%) | - | - | 20 (26.7%) | - | - | - |
Semantic dementia (%) | - | - | 7 (9.3%) | - | - | - |
Mean age represents age at symptom onset for cases and age at sample collection for controls;
data missing for age at onset (n = 12), and site of onset (n = 67);
data missing for age at sampling (n = 1) and gender (n = 1);
data missing for age at onset (n = 29), site of onset (n = 64) and familial status (n =1);
data missing for age at collection (n = 6);
data missing for age at onset (n = 18) and site of onset (n = 37).
Next-generation Sequencing
Paired-end sequencing was performed on a next-generation HiSeq2000 sequencer according to the manufacturer’s protocol (Illumina Inc., San Diego, CA, USA). This generated 56.7 gigabases of alignable sequence data for the control sample ND11463 (mean read depth for the chromosome 9 region 27,367,278 to 27,599,746 bp = 42.2) and 114.4 gigabases for the case sample ND06769 (mean read depth = 170.4). Sequence alignment and variant calling were performed against the reference human genome (UCSC hg 18). Sequencing reads were aligned using BWA (Li and Durbin, 2009). Sorting, indexing, read duplicate removal and merging of BAM files was preformed with Picard (http://picard.sourceforge.net). The Genome Analysis Toolkit was used to perform base quality score recalibration and to call variants (McKenna et al., 2010). SNPs identified within CEU individuals from the 1000 Genomes project (April 2009 release, www.1000genomes.org) or in dbSNP (http://www.ncbi.nlm.nih.gov/projects/SNP/, Build 132) were excluded. The remaining variants were annotated to RefSeq transcripts and protein coding variants prioritized for examination.
Repeat-primed PCR
Repeat-primed PCR was performed as follows: 100ng of genomic DNA were used as template in a final volume of 28ul containing 14ul of FastStart PCR Master Mix (Roche Applied Science, Indianapolis, IN, USA), and a final concentration of 0.18mM 7-deaza-dGTP (New England Biolabs Inc., Ipswich, MA, USA), 1x Q-Solution (Qiagen Inc., Valencia, CA, USA), 7% DMSO (Qiagen), 0.9mM MgCl2 (Qiagen), 0.7uM reverse primer consisting of ~ four GGGGCC repeats with an anchor tail, 1.4uM 6FAM-fluorescent labeled forward primer located 280bp telomeric to the repeat sequence, and 1.4uM anchor primer corresponding to the anchor tail of the reverse primer (sequences available online in Supplemental Experimental Procedures) (Kobayashi et al., 2011; Warner et al., 1996). A touchdown PCR cycling program was used where the annealing temperature was gradually lowered from 70°C to 56°C in 2°C increments with a 3-minute extension time for each cycle.
The repeat-primed PCR is designed so that the reverse primer binds at different points within the repeat expansion to produce multiple amplicons of incrementally larger size. The lower concentration of this primer in the reaction means that it is exhausted during the initial PCR cycles, after which the anchor primer is preferentially used as the reverse primer. Fragment length analysis was performed on an ABI 3730xl genetic analyzer (Applied Biosystems Inc., Foster City, CA, USA), and data analyzed using GeneScan software (version 4, ABI). Repeat expansions produce a characteristic sawtooth pattern with a 6-bp periodicity (Figure 2B).
Statistical analysis
Our previous GWAS data suggested no significant population stratification within the Finnish population (Laaksovirta et al., 2010). Therefore, association testing was performed using the Fisher’s exact test as implemented within the PLINK software toolkit (version 1.7) (Purcell et al., 2007).
FISH analysis
Metaphase and interphase FISH analysis of lymphoblastoid cell lines ND06769 (case IV-3 from GWENT#1, Figure 1A), ND08554 (case II-2 from NINDS0760, Figure 1E), ND11463 (control), ND11417 (control), ND08559 (unaffected spouse II-3 from NINDS0760), ND03052 (unaffected relative IV-1 from GWENT#1) and ND03053 (unaffected relative III-9 from GWENT#1), as well as a fibroblast cell line (Finnish sample ALS50)) was performed using Alexa fluor 488-labeled GGCCCCGGCCCCGGCCCCGGCC oligonucleotide probe (Eurofins MWG operon, Hunstville, AL, USA) designed against the repeat expansion. The hybridization was performed in low stringency conditions with 50% Formamide/2xSSC/10% Dextran Sulphate co-denaturation of the slide/probe, 1-hour hybridization at 37°C, followed by a 2-minute wash in 0.4xSSC/0.3% Tween 20 at room temperature. Slides were counterstained with DAPI. FISH signals were scored with a Zeiss epifluorescence microscope Zeiss Axio Imager-2 (Carl Zeiss Microimaging LLC, Thornwood, NY, USA) equipped with a DAPI/FITC/Rhodamine single band pass filters (Semrock, Rochester, NY) using 40–60x objectives.
RNA expression
Expression profiling on Affymetrix GeneChip Human Exon 1.0 ST Arrays (Affymetrix, UK) was performed on CNS tissue obtained from 137 neurologically normal individuals at AROS Applied Biotechnology AS company laboratories (http://www.arosab.com/) (Trabzuni et al., 2011). Gene-level expression was calculated for C9ORF72 based on the median signal of probe 3202421. Date of array hybridization and brain bank were included as co-factors to eliminate batch effects.
For RT-PCR, RNA was extracted from brain tissue using Trizol (Invitrogen, Paisley, UK), and first strand cDNA synthesized using random primers using the Superscript II cDNA Synthesis Kit (Invitrogen). Real-time PCR analyses for C9ORF72 and GAPDH were performed using the ABI 7900 Sequence Detection System instrument and software (Applied Biosystems). Samples were amplified in quadruplicate in 10ul volumes using the Power SYBR-green master mix (Applied Biosystems), and 10pM of each forward and reverse primer (see Supplemental Experimental Procedures online for primer sequences), using Applied Biosystems standard cycling conditions for real time PCR (initial denaturation at 95°C for 10 minutes, followed by 40 cycles of 95°C for 15 seconds, 60°C for 1 minute).
Immunocytochemistry
Cells were fixed with ice-cold methanol for 2 min and blocked with 10% FBS for 30 min at 37°C. Primary antibody (anti-C9ORF72 antibody by Santa Cruz, 1:30) and secondary antibody (Alexa488-conjugated anti-rabbit antibody by Invitrogen, 1:200) were diluted in 5% FBS and incubated at 37°C for 3 hours or 30 minutes, respectively. The cells were then treated with 5 μg/ml of Alexa633–conjugated wheat germ agglutinin (Invitrogen) in PBS for 10 min at room temperature (to detect cellular membranes), followed by incubation with 2 μg/ml propidium iodide (Invitrogen) in PBS for 3 minutes (to stain the nuclei). The cells were imaged with a TCS SP2 confocal microscope (Leica).
Supplementary Material
Acknowledgments
This work was supported in part by the Intramural Research Programs of the NIH, National Institute on Aging (Z01-AG000949-02), and NINDS. The work was also supported by the Packard Center for ALS Research at Hopkins (BJT), the ALS Association (BJT, AC), Microsoft Research (BJT, PJT), Ontario Research Fund (ER), Hersenstichting Nederland Fellowship project B08.03 and the Neuroscience Campus Amsterdam (JS-S), the UK MND Association (HM, JH, RWO), The Medical Research Council UK (JH, SPB), the Wellcome Trust (JH), the Helsinki University Central Hospital, the Finnish Academy (PJT), the Finnish Medical Society Duodecim, Kuopio University (PJT), the Italian Health Ministry (Ricerca Sanitaria Finalizzata 2007, to AC), Fondazione Vialli e Mauro ONLUS (AC), Federazione Italiana Giuoco Calcio (AC, MS, BJT) and Compagnia di San Paolo (AC, GR), the European Community’s Health Seventh Framework Programme (FP7/2007-2013) under grant agreement 259867 (AC), the Muscular Dystrophy Association (MB, JW), the Emory Woodruff Health Sciences Center (MB, JW), EVO grants from Oulu University Hospital (AMR) and the Finnish Medical Foundation (AMR). DNA samples for this study were obtained in part from the NINDS repository at the Coriell Cell Repositories (http://www.coriell.org/), and the National Cell Repository for Alzheimer’s Disease (http://ncrad.iu.edu). We thank the DNA extraction and storage facility of the National Institute for Health and Welfare/FIMM, Helsinki, Finland and Dr Tuomo Polvikoski, Institute for Ageing and Health, Campus for Ageing and Vitality, Newcastle University, Newcastle upon Tyne, UK for their help in extraction of DNA from ALS patients. We also thank Nayia Nicolaou for her assistance. J.R. is Director of the Packard Center for ALS Research at Hopkins, and D Heckerman is an employee of Microsoft Research. None of the other authors have any conflicts of interest. We thank the patients and research subjects who contributed samples for this study.
Footnotes
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
References
- Boxer AL, Mackenzie IR, Boeve BF, Baker M, Seeley WW, Crook R, Feldman H, Hsiung GY, Rutherford N, Laluz V, et al. Clinical, neuroimaging and neuropathological features of a new chromosome 9p-linked FTD-ALS family. J Neurol Neurosurg Psychiatry. 2011;82:196–203. doi: 10.1136/jnnp.2009.204081. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cann HM, de Toma C, Cazes L, Legrand MF, Morel V, Piouffre L, Bodmer J, Bodmer WF, Bonne-Tamir B, Cambon-Thomsen A, et al. A human genome diversity cell line panel. Science. 2002;296:261–262. doi: 10.1126/science.296.5566.261b. [DOI] [PubMed] [Google Scholar]
- Chiò A, Traynor BJ, Lombardo F, Fimognari M, Calvo A, Ghiglione P, Mutani R, Restagno G. Prevalence of SOD1 mutations in the Italian ALS population. Neurology. 2008;70:533–537. doi: 10.1212/01.wnl.0000299187.90432.3f. [DOI] [PubMed] [Google Scholar]
- DeBose-Boyd RA, Brown MS, Li WP, Nohturfft A, Goldstein JL, Espenshade PJ. Transport-dependent proteolysis of SREBP: relocation of site-1 protease from Golgi to ER obviates the need for SREBP transport to Golgi. Cell. 1999;99:703–712. doi: 10.1016/s0092-8674(00)81668-2. [DOI] [PubMed] [Google Scholar]
- DeJesus-Hernandez M. Expanded GGGGCC hexanucleotide repeat in non-coding region of C9ORF72 causes chromosome 9p-linked frontotemporal dementia and amyotrophic lateral sclerosis. Neuron. 2011 doi: 10.1016/j.neuron.2011.09.011. In Press. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Haaf T, Sirugo G, Kidd KK, Ward DC. Chromosomal localization of long trinucleotide repeats in the human genome by fluorescence in situ hybridization. Nat Genet. 1996;12:183–185. doi: 10.1038/ng0296-183. [DOI] [PubMed] [Google Scholar]
- Hirtz D, Thurman DJ, Gwinn-Hardy K, Mohamed M, Chaudhuri AR, Zalutsky R. How common are the “common” neurologic disorders? Neurology. 2007;68:326–337. doi: 10.1212/01.wnl.0000252807.38124.a3. [DOI] [PubMed] [Google Scholar]
- Johnson JO, Mandrioli J, Benatar M, Abramzon Y, Van Deerlin VM, Trojanowski JQ, Gibbs JR, Brunetti M, Gronka S, Wuu J, et al. Exome sequencing reveals VCP mutations as a cause of familial ALS. Neuron. 2010;68:857–864. doi: 10.1016/j.neuron.2010.11.036. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kobayashi H, Abe K, Matsuura T, Ikeda Y, Hitomi T, Akechi Y, Habu T, Liu W, Okuda H, Koizumi A. Expansion of Intronic GGCCTG Hexanucleotide Repeat in NOP56 Causes SCA36, a Type of Spinocerebellar Ataxia Accompanied by Motor Neuron Involvement. Am J Hum Genet. 2011;89:121–130. doi: 10.1016/j.ajhg.2011.05.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kwiatkowski TJ, Jr, Bosco DA, Leclerc AL, Tamrazian E, Vanderburg CR, Russ C, Davis A, Gilchrist J, Kasarskis EJ, Munsat T, et al. Mutations in the FUS/TLS gene on chromosome 16 cause familial amyotrophic lateral sclerosis. Science. 2009;323:1205–1208. doi: 10.1126/science.1166066. [DOI] [PubMed] [Google Scholar]
- Laaksovirta H, Peuralinna T, Schymick JC, Scholz SW, Lai SL, Myllykangas L, Sulkava R, Jansson L, Hernandez DG, Gibbs JR, et al. Chromosome 9p21 in amyotrophic lateral sclerosis in Finland: a genome-wide association study. Lancet Neurol. 2010;9:978–985. doi: 10.1016/S1474-4422(10)70184-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lagier-Tourenne C, Polymenidou M, Cleveland DW. TDP-43 and FUS/TLS: emerging roles in RNA processing and neurodegeneration. Hum Mol Genet. 2010;19:R46–64. doi: 10.1093/hmg/ddq137. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liehr T, editor. Fluorescence In Situ Hybridization (FISH) application guide. Berlin: Springer-Verlag; 2009. [Google Scholar]
- Lillo P, Hodges JR. Frontotemporal dementia and motor neurone disease: overlapping clinic-pathological disorders. J Clin Neurosci. 2009;16:1131–1135. doi: 10.1016/j.jocn.2009.03.005. [DOI] [PubMed] [Google Scholar]
- Mackenzie IR, Rademakers R, Neumann M. TDP-43 and FUS in amyotrophic lateral sclerosis and frontotemporal dementia. Lancet Neurol. 2010;9:995–1007. doi: 10.1016/S1474-4422(10)70195-2. [DOI] [PubMed] [Google Scholar]
- Maruyama H, Morino H, Ito H, Izumi Y, Kato H, Watanabe Y, Kinoshita Y, Kamada M, Nodera H, Suzuki H, et al. Mutations of optineurin in amyotrophic lateral sclerosis. Nature. 2010;465:223–226. doi: 10.1038/nature08971. [DOI] [PubMed] [Google Scholar]
- McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, Garimella K, Altshuler D, Gabriel S, Daly M, DePristo MA. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20:1297–1303. doi: 10.1101/gr.107524.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mok K, Traynor B, Schymick J, Tienari P, Laaksovirta H, Peuralinna T, Myllykangas L, Chio A, Shatunov A, Boeve B, et al. The chromosome 9 ALS and FTD locus is probably derived from a single founder. Neurobiol Aging. 2011 doi: 10.1016/j.neurobiolaging.2011.08.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Morita M, Al-Chalabi A, Andersen PM, Hosler B, Sapp P, Englund E, Mitchell JE, Habgood JJ, de Belleroche J, Xi J, et al. A locus on chromosome 9p confers susceptibility to ALS and frontotemporal dementia. Neurology. 2006;66:839–844. doi: 10.1212/01.wnl.0000200048.53766.b4. [DOI] [PubMed] [Google Scholar]
- Neumann M, Sampathu DM, Kwong LK, Truax AC, Micsenyi MC, Chou TT, Bruce J, Schuck T, Grossman M, Clark CM, et al. Ubiquitinated TDP-43 in frontotemporal lobar degeneration and amyotrophic lateral sclerosis. Science. 2006;314:130–133. doi: 10.1126/science.1134108. [DOI] [PubMed] [Google Scholar]
- Pearson JP, Williams NM, Majounie E, Waite A, Stott J, Newsway V, Murray A, Hernandez D, Guerreiro R, Singleton AB, et al. Familial frontotemporal dementia with amyotrophic lateral sclerosis and a shared haplotype on chromosome 9p. J Neurol. 2011;258:647–655. doi: 10.1007/s00415-010-5815-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, Maller J, Sklar P, de Bakker PI, Daly MJ, Sham PC. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81:559–575. doi: 10.1086/519795. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rosen DR, Siddique T, Patterson D, Figlewicz DA, Sapp P, Hentati A, Donaldson D, Goto J, O’Regan JP, Deng HX, et al. Mutations in Cu/Zn superoxide dismutase gene are associated with familial amyotrophic lateral sclerosis. Nature. 1993;362:59–62. doi: 10.1038/362059a0. [DOI] [PubMed] [Google Scholar]
- Rowland LP, Shneider NA. Amyotrophic lateral sclerosis. N Engl J Med. 2001;344:1688–1700. doi: 10.1056/NEJM200105313442207. [DOI] [PubMed] [Google Scholar]
- Shatunov A, Mok K, Newhouse S, Weale ME, Smith B, Vance C, Johnson L, Veldink JH, van Es MA, van den Berg LH, et al. Chromosome 9p21 in sporadic amyotrophic lateral sclerosis in the UK and seven other countries: a genome-wide association study. Lancet Neurol. 2010;9:986–994. doi: 10.1016/S1474-4422(10)70197-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sreedharan J, Blair IP, Tripathi VB, Hu X, Vance C, Rogelj B, Ackerley S, Durnall JC, Williams KL, Buratti E, et al. TDP-43 mutations in familial and sporadic amyotrophic lateral sclerosis. Science. 2008;319:1668–1672. doi: 10.1126/science.1154584. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Trabzuni D, Ryten M, Walker R, Smith C, Imran S, Ramasamy A, Weale ME, Hardy J. Quality control parameters on a large dataset of regionally-dissected human control brains for whole genome expression studies. J Neurochem. 2011 doi: 10.1111/j.1471-4159.2011.07432.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Van Deerlin VM, Sleiman PM, Martinez-Lage M, Chen-Plotkin A, Wang LS, Graff-Radford NR, Dickson DW, Rademakers R, Boeve BF, Grossman M, et al. Common variants at 7p21 are associated with frontotemporal lobar degeneration with TDP-43 inclusions. Nat Genet. 2010;42:234–239. doi: 10.1038/ng.536. [DOI] [PMC free article] [PubMed] [Google Scholar]
- van Es MA, Veldink JH, Saris CG, Blauw HM, van Vught PW, Birve A, Lemmens R, Schelhaas HJ, Groen EJ, Huisman MH, et al. Genome-wide association study identifies 19p13.3 (UNC13A) and 9p21.2 as susceptibility loci for sporadic amyotrophic lateral sclerosis. Nat Genet. 2009;41:1083–1087. doi: 10.1038/ng.442. [DOI] [PubMed] [Google Scholar]
- Vance C, Al-Chalabi A, Ruddy D, Smith BN, Hu X, Sreedharan J, Siddique T, Schelhaas HJ, Kusters B, Troost D, et al. Familial amyotrophic lateral sclerosis with frontotemporal dementia is linked to a locus on chromosome 9p13.2–21.3. Brain. 2006;129:868–876. doi: 10.1093/brain/awl030. [DOI] [PubMed] [Google Scholar]
- Vance C, Rogelj B, Hortobagyi T, De Vos KJ, Nishimura AL, Sreedharan J, Hu X, Smith B, Ruddy D, Wright P, et al. Mutations in FUS, an RNA processing protein, cause familial amyotrophic lateral sclerosis type 6. Science. 2009;323:1208–1211. doi: 10.1126/science.1165942. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Warner JP, Barron LH, Goudie D, Kelly K, Dow D, Fitzpatrick DR, Brock DJ. A general method for the detection of large CAG repeat expansions by fluorescent PCR. J Med Genet. 1996;33:1022–1026. doi: 10.1136/jmg.33.12.1022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wojciechowska M, Krzyzosiak WJ. Cellular toxicity of expanded RNA repeats: focus on RNA foci. Hum Mol Genet. 2011;20:3811–3821. doi: 10.1093/hmg/ddr299. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.