Abstract
Apart from amyloid β deposition and tau neurofibrillary tangles, Alzheimer disease (AD) is a neurodegenerative disorder characterized by neuronal loss and astrocytosis in the cerebral cortex. The goal of this study is to investigate genetic factors associated with the neuronal proportion in health and disease. To identify cell-autonomous genetic variants associated with neuronal proportion in cortical tissues, we inferred cellular population structure from bulk RNA-seq derived from 1,536 individuals. We identified the variant rs1990621 located in the TMEM106B gene region as significantly associated with neuronal proportion (p-value = 6.40×10−07) and replicated this finding in an independent dataset (p-value= 7.41×10−04) surpassing the genome-wide threshold in the meta-analyses (p-value = 9.42×10−09). This variant is in high LD with the TMEM106B non-synonymous variant p.T185S (rs3173615; r2 = 0.98) which was previously identified as a protective variant for Frontotemporal lobar degeneration (FTLD). We stratified the samples by disease status, and discovered that this variant modulates neuronal proportion not only in AD cases but also several neurovegetative diseases and in elderly cognitively healthy controls. Furthermore, we did not find a significant association in younger controls or schizophrenia patients, suggesting that this variant might increase neuronal survival or confer resilience to the neurodegenerative process. The single variant and gene-based analyses also identified an overall genetic association between neuronal proportion, AD and FTLD risk. These results suggest that common pathways are implicated in these neurodegenerative diseases, that implicate neuronal survival. In summary, we identified a protective variant in the TMEM106B gene that may have a neuronal protection effect against general aging, independent of disease status, which could help elucidate the relationship between aging and neuronal survival in the presence or absence of neurodegenerative disorders. Our findings suggest that TMEM106B could be a potential target for neuronal protection therapies to ameliorate cognitive and functional deficits.
Keywords: neurodegeneration, cortex, GWAS, QTL, complex traits, deconvolution
INTRODUCTION
Alzheimer disease (AD) is characterized by the presence of extracellular amyloid plaques and intracellular deposits of tau protein that form tangles [36]. AD is characterized by brain volume loss, and neuronal death [37]. Multiple studies have shown that neuronal loss and synapse dysfunction precede cognitive deficits in AD, occurring even 15–20 years before clinical onset [5].
Previous studies from our lab have shown that there are genetic variants associated with neuronal proportion. We developed, optimized and validated an in-silico digital deconvolution approach to infer brain cell proportion from RNA-Seq data [56]. As expected, we found that AD cases present with lower neuronal proportion than cognitively healthy controls. We also demonstrated that individuals that carry the rare APP, PSEN1 or PSEN2 pathogenic mutations and the common APOE ε4 variants present significantly lower neuronal proportion. The goal of this study is to identify common genetic variants associated with neuronal proportion in very large cohorts. Using neuronal proportion as a quantitative trait for genetic studies provides several advantages over classical genetic studies, as quantitative traits provide more power than regular case-control studies [17, 20–22, 42, 65, 75]. This phenotype (neuronal proportion), similar to other AD-related endophenotypes such as cerebrospinal fluid (CSF) tau and Aβ levels is closer to the underlying biology than case-control status, and provides a biological model of the diseases [17, 21, 22, 26, 35]. We have successfully used quantitative endophenotypes (CSF protein levels [20, 21, 42, 65, 75]) to identify novel genetic variants and genes implicated in AD.
Previous studies have already leveraged neuronal proportion derived from digital deconvolved RNA-seq data to identify genes implicated in disease. In a recently published study by the PsychENCODE consortium [83], a very similar approach to the deconvolution method we previously reported [56] was used to infer cell type composition from RNA-Seq obtained from brain tissue in psychiatric disorder cohorts. This study includes of participants from all age groups, but the median age was 51 years old. This study found that common variants in the FZD9 gene were associated with neuronal proportion [83]. Interestingly, variants in FZD9 are associated with Williams syndrome, a developmental disorder associated with mild to moderate intellectual disabilities with learning deficits and cardiovascular problems [12].
These results suggest that using neuronal proportion as a quantitate trait for genetic studies can identify variants and genes implicated in disease. However, the PsychENCODE study was focused on psychiatric disorders and included relatively young individuals. No similar studies have been performed in neurodegenerative disorders. In this study, we utilized neuronal proportions inferred from our deconvolution method [56] to perform cell type quantitative trait loci (cQTL) analysis in 2,008 brain samples derived from 1,536 individuals enriched for AD cases to search for potential new loci that are associated with neurodegenerative disorders.
METHODS
Study participants
Our analysis includes participants from seven studies with a total sample size of 1,536 (Table 1). Five of these studies are mainly focused on neurodegenerative disorders (mean age at death = 82.76 ± 9.86) that include brains affected by Alzheimer’s disease (AD; n = 640), Frontotemporal lobar degeneration (FTLD; n = 11), Progressive Supranuclear Palsy (PSP; n = 75), and pathological aging (PA; n = 28). PA is defined as a form of cerebral amyloidosis in older people, who have widespread extracellular amyloid-beta (Aβ) senile plaque deposits in the setting of limited neurofibrillary tau pathology [58].
Table 1.
Demographic information for cohorts included in this study.
| N | Region | Age | % Male | RIN | TIN | Control | AD | FTLD | PSP | PA | SCZ | BP | Unknown | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Discovery | ||||||||||||||
| ROSMAP | 484 | 1 | 86.7 ± 4.48 | 35.3 | 7.17 ±0.93 | 73.7 ±4.82 | 100 | 317 | 0 | 0 | 0 | 0 | 0 | 67 |
| Replication | ||||||||||||||
| Mayo | 235 | 1 | 80.4 ± 8.24 | 49.8 | 8.23 ± 0.82 | 77.7± 5.99 | 54 | 78 | 0 | 75 | 28 | 0 | 0 | 0 |
| MSSM | 206 | 4 | 84.1 ± 7.30 | 36.9 | 6.62 ± 1.65 | 76.4 ±2.51 | 45 | 161 | 0 | 0 | 0 | 0 | 0 | 0 |
| Knight-ADRC | 96 | 1 | 83.0 ± 12.50 | 44.8 | 6.49 ± 1.19 | 79.4 ±2.00 | 12 | 68 | 11 | 0 | 0 | 0 | 0 | 5 |
| DIAN | 15 | 1 | 50.9 ± 7.08 | 73.3 | 5.55 ± 1.09 | 78.9 ±0.99 | 0 | 15 | 0 | 0 | 0 | 0 | 0 | 0 |
| GTEx | 118 | 3 | 57.9 ± 10.10 | 67.8 | 7.00 ± 0.85 | 74.0 ±2.86 | 114 | 1 | 0 | 0 | 0 | 0 | 0 | 3 |
| CommonMind | 382 | 1 | 65.6 ± 17.50 | 61.8 | 7.68 ± 0.89 | 50.1 ±7.08 | 163 | 0 | 0 | 0 | 0 | 189 | 30 | 0 |
| Replication Total | 1,052 | 388 | 323 | 11 | 75 | 28 | 189 | 30 | 8 | |||||
| Merged Total | 1,536 | 488 | 640 | 11 | 75 | 28 | 189 | 30 | 75 | |||||
N: sample size; Region: number of cortical brain regions involved in the analysis; % Male: percentage of male participants in the cohorts; RIN: RNA integrity number; TIN: transcript integrity number; AD: Alzheimer’s Disease; FTLD: Frontotemporal lobar degeneration; PSP: progressive supranuclear palsy; PA: pathological aging; SCZ: schizophrenia; BP: bipolar disease; Unknown: other unknown dementia or no diagnosis information. Age, RIN and TIN are shown as mean ± standard deviation.
These samples come from the Mayo Clinic (Mayo), Mount Sinai School of Medicine (MSSM), the Charles F. and Joanne Knight Alzheimer’s Disease Research Center (Knight ADRC), the Dominantly Inherited Alzheimer Network (DIAN), and the Religious Orders Study and Memory and Aging Project (ROSMAP) studies. In addition, we analyzed brains from the CommonMind (CMC; mean age at death = 65.6 ± 17.5) and GTEx (mean age at death = 57.9 ± 10.1) studies, whose age at death is significantly lower than the neurodegenerative aging cohorts (p-value < 4.41 × 10−67). Apart from neurodegenerative disorders, this study also includes samples from individuals with schizophrenia (SCZ; n = 189), bipolar disorders (BP; n = 30), and cognitively healthy controls (n = 488) (Table 1, Online Resource Table 1).
Additionally, two studies, MSSM and GTEx, contain data from more than one tissue per participant. In our analyses of the MSSM samples, we used RNA-Seq data generated from the anterior prefrontal cortex (BA10), superior temporal gyrus (BA22), ventral anterior cingulate cortex (BA24), parahippocampal gyrus (BA36), and inferior frontal gyrus (BA44). For the GTEx Study we used RNA-Seq data derived from the Brodmann area (BA) BA24 (anterior cingulate cortex), BA9 (frontal cortex), and general cortical regions (CTX). See Online Resource Table 2 and each study reference(s) for more data collection and generation specifications.
Standard protocol approvals, registrations and patient consents
The protocols for the DIAN and Knight-ADRC studies have been approved by the institutional review board of Washington University in St. Louis. The protocol for the Mayo dataset was approved by the Mayo Clinic Institutional Review Board (IRB). All neuropsychological, diagnostic and autopsy protocols for the MSSM dataset were approved by the Mount Sinai and JJ Peters VA Medical Center Institutional Review Boards. The religious orders study and the memory and aging project of ROSMAP were approved by the IRB of Rush University Medical Center. The NIH Common Fund’s GTEx program protocol was reviewed by Chesapeake Research Review Inc., Roswell Park Cancer Institute’s Office of Research Subject Protection, and the institutional review board of the University of Pennsylvania. Within the CommonMind consortium, the MSSM sample protocol was approved by the Icahn School of Medicine at Mount Sinai IRB; the Pitt sample protocol was approved by the University of Pittsburgh’s Committee for the Oversight of Research involving the Dead and the IRB for Biomedical Research; the Penn sample protocol was approved by the Committee on Studies Involving Human Beings at the University of Pennsylvania. All participants were recruited with informed consent for research use.
Data collection and generation
Cortical tissues from different regions of post-mortem brains were collected (Online Resource Table 2). RNA was extracted from lysed tissues and libraries prepared for next-generation sequencing. Ribosomal RNA constitutes 80%−90% of total RNA, but it is not the target of this study. To focus on mRNA quantification, rRNA was either depleted or the mRNA was captured during RNA-Seq library preparation. The DIAN [56], Knight ADRC [56], MSSM [85], and CommonMind [30] studies used an rRNA-depletion approach to remove ribosomal RNA. The Mayo [1], ROSMAP [7, 8], and GTEx [6, 15] studies used a poly-A enrichment approach to enrich for mRNA. DNA was also collected and sequenced to obtain genotype information. RNA-Seq paired with genotype data for each participant was either sequenced at Washington University for the DIAN and Knight-ADRC studies or downloaded from public databases for all the other studies.
Data QC and preprocessing
Genetic Data
Stringent quality control (QC) steps were applied to all genotyping array or sequence data. The minimum call rate for single nucleotide polymorphisms (SNPs) and individuals was 98%, and autosomal SNPs not in Hardy-Weinberg equilibrium (p-value < 1×10−06) were excluded. X-chromosome SNPs were analyzed to verify gender identification. Unanticipated duplicates and cryptic relatedness (PiHat ≥ 0.25) among samples were tested by genome-wide estimates of pairwise identity-by-descent. PLINK1.9 [13] was used to calculate principal components. The 1000 Genomes Project Phase 3 data (October 2014), SHAPEIT v2.r837 [19], and IMPUTE2 v2.3.2 [38] were used for phasing and imputation. Individual genotypes imputed with a probability < 0.90 were set to missing and imputed genotypes with probability ≥ 0.90 were analyzed as fully observed. Genotyped and imputed variants with MAF < 0.02 or IMPUTE2 information score < 0.30 were excluded. WGS QC was performed by filtering out reads with sequencing depth DP < 6 and quality GQ < 20, followed by similar QC approaches as described above for genotyping data. After the QC, all studies including imputed genotypes and WGS data were merged into a binary file using PLINK for downstream analysis. In order to identify related individuals, principal component analyses (PCA) for population stratification and identity by descent (IBD) analyses, were performed on the merged binary files using PLINK to retain unrelated participants of European ancestry and to ensure sample identity (Online Resource Figure 1 and 2).
Expression Data
FastQC was applied to RNA-Seq data to examine various aspects of sequencing quality [2]. Outlier samples with high rRNA content (> 10.9%) were removed from the study. The remaining samples were aligned to the human GRCh37 primary assembly using STAR with 2-Pass Basic mode (ver 2.5.4b) [23]. Alignment metrics were ascertained by applying Picard CollectRnaSeqMetrics [10] including reads bias, coverage, ribosomal contents, coding bases, etc. Transcript integrity number (TIN) was calculated directly from post-sequencing results to model the RNA degradation by measuring mRNA integrity directly [84]. We calculated the TIN for each library using RSeQC tin.py [84] (ver 2.6.5). RNA-Seq coding genes and transcript expression was quantified using Salmon transcript expression quantification (ver 0.7.2) with the GENCODE Homo sapiens GRCh37.75 reference genome [64].
We inferred neuronal proportions from RNA-Seq gene expression quantification using the same digital deconvolution method and panel reported by Li et al. [56]. Briefly, we first assembled and curated a cell-type specific reference panel to model the transcriptomic signature of neurons, astrocytes, oligodendrocytes and microglia, and selected a set of gene markers to capture the distinctive expression profile of these cell types. Gene markers for the four major cell types include STMN2, SYN1, SYT1, GAD1, CCK for neuron; GFAP, ALDH1L1, AQP4, GJA1, SOX9 for astrocyte; MOG, MOBP, SOX10, GPR37 for oligodendrocyte; TLR2, CX3CR1, IL1A for microglia. Using the reference panel and the method population-specific expression analysis (PSEA, also named meanProfile in the CellMix implementation [33]) and semi-supervised non-negative matrix factorization (ssNMF, also named ssFrobenius in the CellMix implementation), we calculated the cellular composition from the RNA-Seq data. This method estimates the proportion of neurons, astrocytes, oligodendrocytes and microglia in a specific sample. The proportions of these four cell types for a specific sample always add up to 1. Based on our previous studies, a decrease of neuronal proportion is associated with an increase of astrocytes although this is not perfectly correlated, indicating that the information of each cell proportion is unique and captures cell-type specific profile.
Outlier values for each cell-type proportion were removed from each study or from each brain region if the study included more than one region. Outliers are defined as cell-type proportion values that fall below the first quantile minus 1.5 × IQR (interquartile range) or above the third quantile plus 1.5 × IQR. If a subject contains an outlier proportion in any of the four major cell types from any of the two independent deconvolution methods, this subject would be removed from the analysis. For example, 39 subjects were removed from the ROSMAP study as they were outliers in cell deconvolution analyses. These 39 subjects were removed from 523 ROSMAP samples during QC (Online Resource Table 1 before QC), for demographics before QC), and 484 samples were included in the neuronal proportion QTL analysis (Table 1 for demographics after QC). This way we can ensure that the cell-type proportion phenotypes are validated by two independent deconvolution methods in the context of all four major CNS cell types, and any spurious deconvolution results will be excluded from downstream analysis. After removing outliers, neuronal distributions were zero centered for each cortical region and study by subtracting the mean value (Online Resource Figure 3).
Data analysis
We performed a three-stage genome-wide association study (GWAS) analysis: discovery, replication and meta-analysis. For the discovery phase, we used the ROSMAP dataset (n = 484). For the replication phase, we combined the MSSM, GTEx, CMC, WU (including Knight ADRC and DIAN), and Mayo studies (n = 1,052).
For all analyses, we applied a linear regression model using normalized neuronal proportions including age, sex, PC1 and PC2 derived from GWAS data to account for population stratification, and median TIN as covariates. TIN is a measure of RNA integrity. Association analyses were performed using the Meta-Tissue QTL software [34, 76] (http://genetics.cs.ucla.edu/metatissue/install.html). Meta-Tissue is the software developed by GTEx, designed to take into account the heterogeneity of multiple tissues. Meta-Tissue is designed to handle a wide array of tissues, from skin, to liver, to brain tissue donated from a diverse set of individuals. In this study we are using different brain regions, but all of them are cortical regions, therefore the heterogeneity of this study is lower than that in GTEx study in general. This method was used because MSSM and GTEx contain subjects with multiple cortical tissues. The Meta-Tissue QTL allows for the performance of association analyses when multiple samples from the same individual are included. The Meta-Tissue [76] processing pipeline calls two main functions, Meta TissueMM [76] followed by Metasoft [34]. Meta TissueMM applies a mixed-model to account for the heterogeneity of multiple tissue QTL effects. Metasoft performs a meta-analysis while providing a more accurate random effect p-value for multiple tissue analyses and an m-value based on Bayesian inference to indicate how likely a locus is to be a QTL in each tissue. Genome-wide association results were also visualized as Manhattan plots. For the signal of interest, we generated a PM-Plot that integrates evidence from both frequentist (p-value) and Bayesian (m-value) sides to interpret the heterogeneity of multi-tissue QTL effects. Variants with m-values greater than 0.9 are predicted to have an effect and are colored red. Variants with m-values less than 0.1 are predicted not to have an effect and are colored blue. All the other studies with m-values between 0.1 and 0.9 are predicted to have an ambiguous effect and are colored green [34].
For the meta-analysis, both discovery and replication studies were combined using the MetaTissue [76] method described above. Forest plots for each dataset were generated using the visualization function provided by Metasoft [34].
Because our study includes brain samples with different neurodegenerative diseases, we also analyzed each disease separately, including AD (n = 639), other neurodegenerative diseases (n = 103; PSP and PA), schizophrenia (n = 189), young subjects (Age at death < 65; n = 277) and cognitively normal elderly individuals (n = 199). Disease subcategories with less than 20 subjects were removed from the analysis to avoid false results due to a small sample size. Similar data preparation and analysis pipelines were employed as documented above. Results were depicted as Manhattan plots using R’s (ver 3.4.3) qqman package [78] (ver 0.1.4).
Genetic variance estimation
The Genome-wide Complex Trait Analysis (GCTA) v1.25.2 tool was used to estimate the proportion of phenotypic variance explained by the common (MAF > 0.02) imputed and genotyped autosomal variants. The restricted maximum likelihood (REML) analysis was performed on the normalized neuronal proportion values adjusted for age, sex, median TIN, and the first two principal components as covariates. Key variants rs1990621 (TMEM106B) and rs429358 (APOE) among others that were significantly associated with neuronal proportion were analyzed individually to determine the amount of variance contributed by these variants to the observed neuronal proportion phenotype using R’s linear regression function.
Functional annotation
All variants below the threshold for suggestive significance (p-value < 1×10−05) were uploaded to FUMA (v1.3.3d) [86] to annotate significant SNPs with GWAS catalog (e91_r2018-02-06) and ANNOVAR (updated 2017-07-17). Gene-based analysis was also performed by MAGMA (v1.06) [18] implemented in FUMA. MAGMA is an algorithm to perform gene and gene-set analysis, based on GWAS summary statistics. This gene-set analysis uses a regression structure to allow generalization analysis of continuous properties of genes and simultaneous analysis of multiple gene-sets and other gene properties, and provides a p-value for each gene in the genome, taking into account all the independent SNPs in the gene region. Tissue specific gene expression of related genes was performed with data from publicly available databases, Genotype-Tissue Expression (GTEx) Analysis V7 [6] and the Brain eQTL Almanac (Braineac) [67] to determine the tissue(s) in which the genes were most highly expressed.
RESULTS
Study Design
We performed a three-stage study design: discovery, replication and meta-analysis. The discovery phase included RNA-seq data from 484 subjects (ROSMAP). The other six studies were collapsed into replication dataset with 1,052 subjects (Table 1). To our knowledge, this is the largest study (n = 1,536) that analyzes genetic architecture underlying neuronal proportion in aging brains (Figure 1, and Table 1). The GTEx and MSSM studies include brain RNA-seq data from multiple cortical regions. For this reason we utilized the Meta-Tissue software [76], which was specifically designed for multi-tissue QTL analysis and performed a mixed model analysis with random effects that account for correlated measurements from multi-tissue individuals. To attain the largest available sample size for this study, the discovery and replication sets were merged to perform a multi-tissue QTL meta-analysis in a search for additional signals hidden in the previously separated discovery or replication analyses due to lack of power. After merged analysis, the cohorts were stratified into four major disease status groups (AD, control, schizophrenia, and other non-AD neurodegenerative disorders) to explore how different disease strata could impact the results.
Figure 1. Study Design.
RNA-Seq and paired genotype or whole genome sequencing (WGS) data were accessed and preprocessed for downstream analysis. Genotype data quality was ensured based on our quality control criteria and imputed as needed. WGS and imputed genotypes were merged, followed by principal component analysis (PCA) and identity by descent (IBD) to select unrelated subjects of European ancestry. RNA-Seq data quality was checked with FastQC and aligned to human GRCh37 primary assembly with STAR, from which transcript integrity number (TIN) was inferred with RSeQC to account for RNA integrity variation that we later incorporated into the analysis. Gene expression was quantified from unaligned RNA-Seq with the psuedo-aligner Salmon for deconvolution. Cell type composition comprised of the four major CNS cell types were inferred by performing deconvolution on gene expression quantification results. Using cell type proportions as quantitative traits, we identified loci in the TMEM106B gene region associated with neuronal proportion in our assembled dataset. CMC: CommonMind Consorsium; GTEx: The Genotype-Tissue Expression; Mayo: the Mayo Clinic; MSSM: Mount Sinai School of Medicine; Knight ADRC: the Charles F. and Joanne Knight Alzheimer’s Disease Research Center; DIAN: the Dominantly Inherited Alzheimer Network; ROSMAP: the Religious Orders Study and Memory and Aging Project; TCX: temporal cortex; PAR: parietal cortex; CTX: cortex; FCX: frontal cortex; DLPFC: dorsal lateral prefrontal cortex. BA9: dorsal lateral prefrontal cortex; BA10: Anterior prefrontal cortex; BA22: superior temporal gyrus; BA24: ventral anterior cingulate cortex; BA36: parahippocampal gyrus; BA44: inferior frontal gyrus.
We performed functional annotation to identify the variants and genes that drive the association with neuronal proportion. Single variants were annotated by ANNOVAR and other gene mapping tools integrated by FUMA to identify functional genes or pathways. In addition, neuronal proportion heritability contributed by the target variants was analyzed using genome-wide complex trait analysis (GCTA). To determine the overlap between the genetic architecture and other complex traits, including AD, FTLD, and schizophrenia, we analyzed if the SNPs and genes associated with these diseases were also associated with neuronal proportion.
TMEM106B variants associated with neuronal proportion
Using the normalized neuronal proportion as a quantitative trait in the discovery dataset (n = 484), we identified 1,340 loci that have p-values < 1.0×10−03; and 22 loci with p-values < 1.0×10−05, including: LIMCH1, GLRA1/ATOX1, DOCK2, FOXO3, TMEM106B, DLC1, PRKCA, RNF152, and CCDC102B (Online Resource Table 3, Online Resource Figure 4ab). One of the most significant SNPs, rs1990621 (β = 0.3; p-value = 6.40×10−07) is located in the TMEM106B gene region and was one of the few loci with p-value around 1.0×10−07 (Table 2, Figure 2e, and Online Resource Table 3).
Table 2.
SNPs passing genome-wide suggestive level in final meta-analysis merging discovery and replication data.
| SNP | Chr. | BP | A1 | A2 | MAF | Discovery | Replication | Meta-analyses | ||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Effect | P-value | Effect | P-value | Effect | P-value | % Variance | ||||||
| rs6750387 | 2 | 76,967,335 | G | A | 0.381 | −0.068 | 2.91×10−01 | −0.049 | 4.80×10−07 | −0.056 | 9.23×10−07 | 0.17% |
| rs665133 | 3 | 6,676,306 | C | T | 0.029 | 0.425 | 2.33×10−02 | 0.364 | 7.26×10−06 | 0.376 | 9.50×10−07 | 0.83% |
| rs6908749 | 6 | 111,962,944 | A | G | 0.073 | 0.180 | 1.73×10−01 | 0.042 | 8.08×10−06 | 0.084 | 7.43×10−06 | 0.02% |
| rs6925190 | 6 | 153,744,859 | A | T | 0.120 | −0.305 | 3.10×10−03 | −0.260 | 4.99×10−05 | −0.270 | 8.03×10−07 | 1.03% |
| rs1990621 | 7 | 12,283,873 | G | C | 0.466 | 0.300 | 6.40×10−07 | 0.130 | 7.41×10−04 | 0.160 | 9.42×10−09 | 2.28% |
| rs11239562 | 10 | 46,063,198 | G | A | 0.438 | −0.121 | 5.02×10−02 | −0.174 | 1.74×10−05 | −0.158 | 3.97×10−06 | 1.38% |
| rs112142762 | 12 | 14,384,935 | C | T | 0.035 | −0.173 | 2.97×10−01 | 0.183 | 3.53×10−06 | 0.069 | 8.11×10−06 | 0.01% |
| rs78937948 | 12 | 81,558,976 | G | C | 0.055 | 0.504 | 2.52×10−03 | 0.391 | 1.01×10−04 | 0.424 | 1.25×10−06 | 1.24% |
| rs441407 | 13 | 86,379,248 | T | C | 0.275 | −0.168 | 2.18×10−02 | −0.195 | 2.88×10−05 | −0.187 | 2.27×10−06 | 1.42% |
| rs283815 | 19 | 45,390,333 | G | A | 0.250 | −0.151 | 5.86×10−02 | −0.229 | 3.36×10−06 | −0.214 | 1.94×10−06 | 0.91% |
| rs429358 | 19 | 45,411,941 | C | T | 0.151 | −0.126 | 1.79×10−01 | −0.258 | 8.15×10−06 | −0.235 | 1.22×10−05 | 1.27% |
| rs73487196 | 22 | 48,793,169 | T | C | 0.071 | 0.453 | 7.42×10−03 | 0.374 | 1.75×10−04 | 0.398 | 6.62×10−06 | 0.62% |
Statistical analyses were performed with both PLINK and Meta-Tissue for the Discovery dataset and with Meta-Tissue for the Replication and Meta-analyses Nominally, significant p-values are in bold for the discovery and replication datasets. Only genome-wide significant p-values are in bold for the meta-analyses
Figure 2. The single variant and gene-based multi-tissue meta-analyses identified TMEM106B and APOE as genes associated with neuronal proportion.
All panels in this figure were produced using meta-analysis results from Meta-Tissue analytic pipeline. a) Manhattan plot showing the SNP-based genome-wide significant hit located in chromosome 7 with other suggestive SNP hits labeled. rs1990621, located in the chromosome 7 TMEM106B gene region, was significantly associated with neuronal proportion in meta-analyses. b) QQ plot of the SNP-based analysis. c) Manhattan plot showing the gene- based genome-wide significant hit located in chromosome 7 with other suggestive genes. d) QQ plot of the gene-based analysis. e) Local plot showing the zoomed-in view of the hit in chromosome 7 with the target SNP rs1990622 labeled with dark purple. The top leading SNP is rs1990621. Nearby SNPs were also mainly located in the TMEM106B gene region and color coded with their linkage disequilibrium (LD) r2 thresholds. f) Local plot showing the zoomed-in view of the hit in chromosome 19 with the target SNP rs2075650 labeled with dark purple. The top three leading SNPs are rs283815, rs769449, and rs429358. Nearby SNPs were also mainly located in the TOMM40/APOE gene region and color coded with their LD r2 thresholds. One gene omitted in this region is SNRPD2.
We used the replication cohorts (n = 1,052) to determine if any of these loci showed consistent association with neuronal proportion (Online Resource Table 4). The association of rs1990621 with neuronal proportion was in the same direction in all cohorts and also in all regions in the studies with multiple regions, except in two regions: BA24 and CTX, from the GTEx samples. This result indicates that there is some tissue heterogeneity in the effect of the association across the studies we analyzed (Figure 3a). For this reason, we decided to use a statistical framework that can analyze all of these cohorts and regions together while accounting for tissue heterogeneity. Meta-Tissue calculates a m-value [34] to predict if a variant has a tissue-specific significant effect. The m-value is similar to the posterior probability of an association based on the Bayes factor [34], but allows for the performance of meta-analyses on different datasets and regions accounting for heterogeneity. It is able to detect whether a specific variant has a significant effect in a study overall. Using this approach, rs1990621 in the TMEM106B gene region showed a significant association in the same direction in the replication cohort (rs1990621; β = 0.13; p-value = 7.41×10−04; Figure 3c, Table 2, and Online Resource Figure 4cd). This analysis included only individuals not related to those from the discovery dataset, therefore this represents an independent replication of this signal. None of the other 21 loci with suggestive p-value showed a nominal association in the replication dataset (Online Resource Table 4).
Figure 3. Meta-Tissue analysis results of rs1990621.
a) Forest plot showing the p-value and confidence interval for rs1990621 for each tissue site of each dataset that is included in the Meta-Tissue meta-analysis. Summary random effects were depicted at the bottom as RE Summary. b) Based on Meta-Tissue meta-analysis, PM-Plot of rs1990621 is plotted while combining both p-value (y axis) and m-value (x axis). Red dots indicate that the variant is predicted to have an effect in that particular dataset, blue dots mean that the variant is predicted to not have an effect, and green dots represent ambiguous predictions. c) Forest plot p-value and confidence interval for rs1990621 for discovery, replication, and merged datasets. d) Forest plot p-value and confidence interval for rs1990621 when splitting the merged dataset into four main disease categories. CMC: CommonMind Consorsium; GTEx: The Genotype-Tissue Expression; Mayo: the Mayo Clinic; MSSM: Mount Sinai School of Medicine; Knight ADRC: the Charles F. and Joanne Knight Alzheimer’s Disease Research Center; DIAN: the Dominantly Inherited Alzheimer Network; ROSMAP: the Religious Orders Study and Memory and Aging Project; TCX: temporal cortex; PAR: parietal cortex; CTX: cortex; FCX: frontal cortex; DLPFC: dorsal lateral prefrontal cortex. BA9: dorsal lateral prefrontal cortex; BA10: Anterior prefrontal cortex; BA22: superior temporal gyrus; BA24: ventral anterior cingulate cortex; BA36: parahippocampal gyrus; BA44: inferior frontal gyrus; AD: Alzheimer’s Disease; SCZ: schizophrenia; Other: other non-AD neurodegenerative disorders.
We used the same approach to perform a meta-analysis of the discovery and replication datasets, including multiple brain regions. In the multi-tissue meta-analyses including all samples (n = 1,536), rs1990621 passed the genome-wide threshold for significance (p-value = 9.42×10−09; Online Resource Table 5 and 6). The minor allele of rs1990621 was associated with increased neuronal proportion (β = 0.16; Figure 3ab, Online Resource Table 6). No other loci passed the GWAS threshold but there were eleven other loci that passed the suggestive threshold (Table 2 and Online Resource Table 5).
To determine the joint effects of genetic markers, genome-wide gene-based analyses were performed with MAGMA, based on the multi-tissue meta-analyses results. MAGMA provides a p-value for each gene by taking into account all the independent SNPs in a specific gene region. TMEM106B (p-value = 2.96×10−08) was the only gene that passed the genome-wide significance threshold, followed by APOE (p-value = 3.2×10−05), and APOE which is the most important genetic risk factor for sporadic AD (Figure 2cd, and Online Resource Table 7). However, a total of 27 genes showed a p-value < 1×10−03 (Online Resource Table 7). Among them, PDE11A [32, 44, 45] and PRND [39, 71], which have been associated with AD; and KIAA1586 [24, 49], which has been previously associated with schizophrenia.
Disease-specific analyses: TMEM106B and neuronal proportion
TMEM106B is a known risk locus for FTLD [16, 29, 60, 80]. To determine if the association of TMEM106B with neuronal proportion was driven by FTLD or other diseases involved in this study, we stratified the cohorts by disease type to determine if the association of rs1990621 was driven by a single disease or multiple diseases. Samples were stratified by the following categories: AD (n = 639), non-AD neurodegenerative disorders (n = 103), schizophrenia (n = 189), young (n = 277) and old (n = 199) cognitively healthy control individuals. The non-AD neurodegenerative disorder category also includes other neurodegenerative diseases, mainly PSP and PA.
Rs1990621 in TMEM106B showed a strong association with neuronal proportion in AD (β = 0.26; p-value = 1.95×10−07), and cognitively normal elderly individuals (β = 0.27; p-value = 1.34×10−02), but not in the young controls (Figure 3d, and Online Resource Table 8). Rs1990621 was also associated with neuronal proportion in the non-AD dementia category, with a higher effect size (β = 0.45; p-value = 8.19×10−04) than any other categories. These analyses indicate that the association of rs1990621 in TMEM106B with neuronal proportion is not unique to AD, but can be also found in other neurodegenerative diseases. The fact that this association is found in the normal elderly individuals with an effect size similar to that of the AD group also indicates that the association is not driven by pathological neuronal death.
In addition, no association was found in the in schizophrenic cohort (β = 0.01; p-value = 9.32×10−01) or cognitively normal young individuals (β = 0.05; p-value = 5.61×10−01; Figure 3d, and Online Resource Table 8). The samples included in the schizophrenic cohort and GTEx were significantly younger than the AD or non-AD neurodegeneration cohorts (p-value < 7.82 ×10−24). This suggests that rs1990621 is associated with a protective mechanism mainly in neurons of aging brains and neurodegenerative diseases.
Functional annotation
The variant rs1990621 is located in the TMEM106B gene region. Although the CADD score and RegulomeDB score for this variant are not high enough to suggest any functional consequences (Online Resource Figure 5bc), this variant is in high LD with rs1990622 (r2 = 0.98), a TMEM106B variant previously identified to be associated with FTLD risk [80], particularly in granulin (GRN) mutation carriers [16, 29]. Rs1990621 is also in high LD with rs1990620 (r2 = 0.99), which is a cis-eQTL for TMEM106B [31]. Another variant in LD with rs1990621 is rs3173615 (r2 = 0.98), which is located in the exon 6 of TMEM106B (dark blue dot in Online Resource Figure 5b), and produces a nonsynonymous amino acid change in TMEM106B: p.T185S. Previous studies indicate that the rs3173615 coding variant or rs1990620, is the functional variant driving the association in the FTLD risk GWAS [31, 60]. Our study finds the same signal related to this nonsynonymous change in TMEM106B. TMEM106B is expressed in neurons and microglia, with the highest protein expression detected in the late endosome/lysosomal compartments of neurons [9, 52, 72, 74]. The nonsynonymous SNP rs3173615 p.T185S affects TMEM106B protein levels by increasing TMEM106B protein degradation [9, 14, 60].
Estimation of neuronal proportion variance explained by associated genetic loci
To determine the proportion of variance in neuronal proportion explained by the common genetic loci, we analyzed all of the tested genotyped and imputed autosomal common variants (MAF > 0.02) using one-stage joint analysis. The approach we took is GCTA [90], which fits the effects of all the common SNPs as random effects in a mixed linear model. Then SNP variances are estimated based on the genetic relationship matrix between individuals. After correcting for age, sex, median TIN and the first two principal components, approximately 50.87% (p-value = 3.73×10−03) of the variability in neuronal proportion was explained by common variants. The top SNP, rs1990621, accounts for 2.28% of the variability in neuronal proportion (Table 2); the SNP rs429358 in APOE gene region accounts for 1.27% of neuronal proportion variability. The top 12 loci in the meta-analyses, that include APOE and TMEM106B explain 11.18% of the total variation in neuronal proportion. This result indicates that a large proportion of the genetic hereditability is not explained by the top 12 loci and that that many genetic loci are yet to be discovered.
The impact of other neurodegenerative risk loci on neuronal proportion
To investigate what other AD or FTLD variants might have an effect on neuronal proportion in QTL analysis, we analyzed the 38 genome-wide significant loci for AD [51], FTLD [27], and FTLD with TDP pathology [80]. Among those, only common variants (MAF > 0.02) located in the TMEM106B and APOE gene were associated with neuronal proportion. Both rs1990622 (TMEM106B; Figure 2e) and rs2075650 (TOMM40/APOE; Figure 2f) were found to be associated with FTLD (Table 3). The top signals in the APOE region are rs283815, rs769449, and rs429358 with p-values < 1.22×10−05. Note that rs429358 is one of the two SNPs that determines APOE isoforms. APOE ε4 and ε 2 alleles, coded by rs429358(C) and rs7412(C), confer the largest effect for AD risk. We observed that the C allele of rs429358 was associated with decreased neuronal proportion significantly, but no significant association was observed between rs7412 and neuronal proportion. Our gene-based analysis also indicates that there is an overlap in the genetic architecture of neuronal proportion between AD and FTLD, as the top two genes were TMEM106B (p-value = 2.96×10−08) and APOE (p-value = 3.2×10−05; Figure 2cd).
Table 3.
Neuronal proportion cQTL p-values were reported for variants previously identified in AD risk (by Lambert et al.), FTLD risk (by Ferrari et al.), and FTLD-TDP risk (by Van Deerlin et al.) studies.
| cQTL | AD risk | FTLD risk | FTLD-TDP risk | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| SNP | Gene | Minor | MAF | SNP proxy | Effect | p-value | OR | p-value | OR | p-value | OR | p-value |
| rs6656401 | CR1 | A | 0.157 | - | −0.09 | 5.59×10−02 | 1.18 | 5.7×10−24 | - | - | - | - |
| rs730482 | BIN1 | T | 0.327 | rs6733839 | −0.03 | 3.73×10−01 | 1.22 | 6.9×10−44 | - | - | - | - |
| rs35349669 | INPP5D | T | 0.426 | - | −2.33×10−03 | 9.80×10−01 | 1.08 | 3.2×10−08 | - | - | - | - |
| rs190982 | MEF2C | G | 0.353 | - | 3.08×10−04 | 5.88×10−01 | 0.93 | 3.2×10−08 | - | - | - | - |
| rs9271192 | HLA-DRB5–HLA-DRB1 | C | 0.268 | - | −0.04 | 4.31×10−01 | 1.11 | 2.9×10−12 | - | - | - | - |
| rs10948363 | CD2AP | G | 0.249 | - | −0.01 | 8.62×10−01 | 1.1 | 5.2×10−11 | - | - | - | - |
| rs2718058 | NME8 | G | 0.386 | - | 3.76×10−04 | 9.99×10−01 | 0.93 | 4.8×10−09 | - | - | - | - |
| rs1476679 | ZCWPW1 | G | 0.266 | - | −0.02 | 7.71×10−01 | 0.91 | 5.6×10−10 | - | - | - | - |
| rs11771145 | EPHA1 | A | 0.371 | - | 0 | 9.67×10−01 | 0.9 | 1.1×10−13 | - | - | - | - |
| rs28834970 | PTK2B | C | 0.332 | - | −0.07 | 6.38×10−02 | 1.1 | 7.4×10−14 | - | - | - | - |
| rs1532277 | CLU | T | 0.362 | rs9331896 | 1.42×10−03 | 9.92×10−01 | 0.86 | 2.8×10−25 | - | - | - | - |
| rs10838725 | CELF1 | C | 0.275 | - | −0.07 | 8.60×10−02 | 1.08 | 1.1×10−08 | - | - | - | - |
| rs7124974 | MS4A6A | T | 0.365 | rs983392 | −0.05 | 2.05×10−01 | 0.9 | 6.1×10−16 | - | - | - | - |
| rs10792832 | PICALM | A | 0.340 | - | 0.01 | 4.50×10−01 | 0.87 | 9.3×10−26 | - | - | - | - |
| rs11218343 | SORL1 | C | 0.045 | - | 0.12 | 1.35×10−01 | 0.77 | 9.7×10−15 | - | - | - | - |
| rs17125944 | FERMT2 | G | 0.084 | - | −0.1 | 1.71×10−01 | 1.14 | 7.9×10−09 | - | - | - | - |
| rs10498633 | SLC24A4-RIN3 | A | 0.213 | 0.01 | 8.32×10−01 | 0.91 | 5.5×10−09 | - | - | - | - | |
| rs1460595 | DSG2 | A | 0.038 | rs8093731 | −0.05 | 3.81×10−01 | 0.73 | 1.0×10−04 | - | - | - | - |
| rs4147929 | ABCA7 | A | 0.159 | 0.04 | 4.78×10−01 | 1.15 | 1.1×10−15 | - | - | - | - | |
| rs429358 | APOE | C | 0.151 | - | −0.24 | 1.22×10−05 | 1.35 | 6.7×10−536 | - | - | - | - |
| rs3865444 | CD33 | A | 0.287 | - | 0.05 | 2.02×10−01 | 0.94 | 3.0×10−06 | - | - | - | - |
| rs7274581 | CASS4 | G | 0.105 | - | 0.02 | 3.01×10−01 | 0.88 | 2.5×10−08 | - | - | - | - |
| rs1980493 | BTNL2 | G | 0.144 | - | −0.02 | 7.83×10−01 | - | - | 0.77 | 1.57×10−08 | - | - |
| rs9268856 | HLA-DRB5 | A | 0.270 | - | −0.02 | 8.12×10−01 | - | - | 0.81 | 5.51×10−09 | - | - |
| rs9268877 | HLA-DRB5 | A | 0.427 | - | −0.05 | 1.91×10−01 | - | - | 1.21 | 1.05×10−08 | - | - |
| rs1020004 | TMEM106B | G | 0.315 | - | 0.1 | 1.35×10−03 | - | - | 1.03 | 4.59×10−01 | 0.60 | 5.00×10−11 |
| rs6966915 | TMEM106B | A | 0.461 | 0.16 | 1.24×10−08 | - | - | 1.07 | 1.21×10−01 | 0.61 | 1.63×10−11 | |
| rs1990622 | TMEM106B | G | 0.467 | - | 0.16 | 1.44×10−08 | - | 1.08 | 7.88×10−02 | 0.61 | 1.08×10−11 | |
| rs10128715 | RAB38/CTSC | A | 0.122 | rs74977128 | 0.03 | 7.29×10−01 | - | - | 1.81 | 3.06×10−08 | - | - |
| rs302668 | RAB38 | C | 0.308 | - | −0.08 | 4.57×10−02 | - | - | 0.81 | 2.44×10−07 | - | - |
| rs302665 | RAB38 | G | 0.246 | rs302652 | −0.07 | 3.65×10−02 | 0.73 | 2.02×1008 | - | - | ||
| rs2075650 | APOE | G | 0.141 | - | −0.2 | 2.11×10−04 | 1.30 | 8.81×1007 | - | - | ||
Minor: minor allele of the SNP; MAF: minor allele frequency in our genomic data; SNP proxy: the original SNP reported in previous studies but not present in our genomic data, and is replaced by the closest SNP in LD found in our data; OR: odds ratio reported in previous studies.
DISCUSSION
We have assembled the largest dataset to date to analyze the genetic architecture underlying aging brains. In this study, we analyzed RNA-seq data from more than 2,008 brain samples (1,536 unique individuals). Using neuronal proportion as a quantitative trait, we found genetic variants associated with neuronal proportion. Our study identified a protective variant, rs1990621, in TMEM106B, which was associated with increased neuronal proportion (p-value = 6.40×10−07). This association was replicated in an independent dataset (p-value = 7.41×10−04) showing a significant genome-wide p-value in the meta-analyses (p-value = 9.42×10−09). In this study we used multiple datasets that include different cortical regions, and different overall characteristics (for example, RNA library preparation, TIN, post-mortem interval, and age of death, between other variables (see Table 1)). This will lead to heterogeneity in the study and lower the statistical power of the analyses. It is possible that additional variants that influence neuronal proportion would require larger studies or studies with less heterogeneity in order to identify them. However, the fact that the rs1990621 replicated in a completely independent dataset indicates that this association is very robust and that this is in fact a real signal.
In addition, previous studies have already linked TMEM106B and rs1990621 with aging [69]. Rhinn et al, identified rs1990621 as the most significant signal for differential aging. Differential aging was calculated as the difference between the chronological and biological age. Although there is some overlap between the datasets used in Rhinn et al, mainly ROSMAP, most of the datasets used in Rihnn et al and the present study are different. Therefore, the fact that the same gene was identified and validated using different datasets and different phenotypes further supports the role of TMEM106B in neurodegeneration. Rhinn et al., also found that the effect of the TMEM106B variant started around 65 years old, which support our findings in which we observed an association of this variant in elderly, but not in young controls. At the same time, this study significantly extends the results from Rihnn et al. Firstly, the Rihnn et al study is focused on aging while this study is focused on neurodegenerative disease, especially Alzheimer Disease. We demonstrate that the TMEM106B is associated with neuronal proportion in AD, which is also supported by the Jun et al study, that reported that TMEM106B is associated with AD risk when taking into account APOE [41]. At the same time, this study goes beyond AD, and also indicates the TMEM106B is involved in other neurodegenerative diseases, such as PSP and FTLD. These findings point to a common underlying mechanism in these neurodegenerative diseases, which could have large implications in the field.
Because the top signal, rs1990621, is in LD with rs1990622, which is known to be associated with FTLD with TDP-43 inclusions [80], and our dataset includes FTLD individuals, we performed additional analyses to determine if this association was driven by individuals with FTLD or other neurodegenerative diseases such as PSP or PA. As expected, the strength of the association was larger in the group that included the FTLD samples (β = 0.45; p-value = 8.19×10−04), but the association of TMEM106B with neuronal proportion was also found in AD (β = 0.26; p-value = 1.95×10−07). More importantly, the effect size of TMEM106B with neuronal proportion is similar in AD and in elderly cognitively normal individuals (β = 0.27; p-value = 1.34×10−02). This effect was not observed in a younger schizophrenia cohort with a mean age of death less than 65 years old (β = 0.01; p-value = 9.32×10−01), or in the younger cognitively normal individuals (β = 0.05; p-value = 5.61×10−01). These results indicate that rs1990621 in TMEM106B can be considered as a protective variant for neurodegeneration in general, not only for FTLD, as it also leads to a higher neuronal proportion in aging brains in general. The protective effect of TMEM106B is also found in healthy brains and could explain the lower risk of developing neurodegenerative diseases in individuals carrying these variants. As we did not see any association with the younger controls, it would be more likely that TMEM106B would be associated with better neuronal survival under challenging conditions, like aging, although additional studies are needed to test this hypothesis. On a similar note, it is important to recall that TMEM106B, was initially reported to be associated with FTLD [80], which is characterized by TDP-43 pathology. TDP-43 pathology can be also found in around 40% of the pathologically diagnosed AD cases [77], and increases with healthy aging [43, 79, 88]. Therefore, it is possible that individuals with the TMEM106B protective allele would develop less TDP-43 pathology and would be protected from neurodegenerative diseases. These results suggest a common pathway involving TMEM106B on aging brains in the presence or absence of neurodegenerative pathology that may contribute to cognitive resilience and neuronal protection. In addition, the second most significant signal in our gene-based analyses was APOE (p-value = 3.2×10−05). The rs429358 variant that codes for the APOE ε4 isoform also showed a suggestive p-value for neuronal proportion (p-value = 1.22×10−05). This observation suggests that there is an overlap in the genetic architecture of neuronal proportion with AD, neurodegeneration and aging.
Based on previous studies [16, 31, 69, 80] and our functional annotation analyses, it is very clear that the functional gene that drives the association with neuronal proportion is TMEM106B. There are two SNPs that implicate TMEM106B that are proposed to be the functional variant underlying the association. The first one is a protein coding variant (rs3173615) in high LD with rs1990621 (r2 = 0.98), which leads to protein isoforms (p.T185S). The minor allele of rs1990622, which has a protective effect in FTLD [80], is in-phase with the minor allele of rs1990621, which is associated with increased neuronal proportion in our analysis. The protective role of the rs3173615 variant in TMEM106B is associated with attenuated cognitive deficits or better cognitive performance in ALS [81], hippocampal sclerosis [57], presymptomatic FTLD [66], aging individuals with various neuropathological burdens [89] and in the absence of known brain disease [69]. The S185 allele is protective and is associated with faster protein degradation than the common allele T185 [9, 14, 60]. The second one, rs1990620, is also in LD with rs1990621, and might also be the functional variant as it is a cis-eQTL for TMEM106B [31]. TMEM106B overexpression results in enlarged lysosomes and lysosomal dysfunction [9, 91]. It has also been shown that TMEM106B may interact with GRN in lysosomes [60]. Taken together, these results suggest that genetic variants regulate both neurodegeneration risk and the TMEM106B transcript and protein level in a cell autonomous manner.
Previous GWAS for AD risk performed with the International Genomics of Alzheimer’s Project (IGAP) data, stratified by APOE genotype, showed that AD risk is significantly influenced by the interaction between APOE and TMEM106B [41]. Together with our observation of cellular composition QTL, these results suggest that TMEM106B and APOE may play a role in affecting AD risk/vulnerability by affecting the cellular composition balance between neurons and astrocytes, and modulating endosomal and lysosomal function.
TMEM106B seems to affect risk for neurodegenerative disorders by regulating the endosomal/lysosomal pathway and playing a role in vulnerability to neurodegenerative disorders. Continuous lysosomal turnover of cellular contents through endocytosis and autophagy is crucial for neuronal survival [62]. Impaired lysosomal function reduces lysosomal degradative efficiency, which leads to abnormal build-up of toxic components in the cell. An impaired lysosomal system has been associated with normal aging [11] and a broad range of neurodegenerative disorders [55], including AD [61], PD [3, 70, 87], Huntington disease [28, 82], FTLD [54], ALS [28], Niemann-Pick disease type C [46, 63], Creutzfeldt-Jakob disease [53], Charcot-Marie Tooth disease type 2B [73], Neuronal ceroid lipofuscinoses (Batten disease) [47, 48], autosomal dominant hereditary spastic paraplegia [68], Chediak-Higashi syndrome [50], inclusion body myositis [4], and osteopetrosis [40]. During stress, the autophagylysosome pathway facilitates survival through clearance of damaged molecules and mobilization of intracellular stores of energy and nutrients. Abundant evidence supports the autophagy-lysosome pathway as a critical regulator of lifespan. The autophagy-lysosome activity declines with age and lysosome impairment predisposes one to assorted age-related diseases, including neurodegeneration.
Although each neurodegenerative disorder has its own characteristic proteopathy, the boundaries of protein pathology distribution are never clear-cut across different disorders. In fact, co-pathology or nonspecific proteopathy have been observed in most autopsies of neurodegenerative disorders, such as TDP-43 discussed above, Lewy bodies, α-synuclein and others [25]. Our observation of the association of variants in a lysosomal gene, TMEM106B, with neuronal proportion in aging cohorts supports the involvement of the lysosomal pathway in a common biological mechanism underlying a broad range of neurodegenerative disorders or aging processes in general that contribute to neuronal cell death. It also suggests that modulating the lysosomal function by regulating TMEM106B levels could contribute to the neuroprotective effect observed here. Lysosome-mediated degradation of TMEM106B could be a novel neuroprotective molecular target for aging and neurodegenerative diseases.
Our study has demonstrated the utility of using cell-type composition as a quantitative trait to identify genes associated with changes in cell populations. This approach is particularly powerful for complex disorders that involve considerable changes in cellular composition, for example, neurodegenerative disorders, as well as normal changes during developmental or aging processes. The development of recent single-cell studies will greatly increase the resolution of these methods and will help to advance our knowledge of cellular population changes [59]. With more detailed fine-mapping of cellular composition from single-cell studies and machine learning algorithms, bulk RNA-Seq deconvolution will more accurately capture cellular fraction changes in the samples, such as different types of neurons or different states of astrocytes or microglia. Regarding scalability, this bulk RNA-Seq deconvolution-based QTL is preferable because it is unclear if single-cell sequencing technology can be used on large numbers of subjects due to its current cost. Thus, currently bulk-RNA deconvolution approaches are preferable for carrying out cell-type composition QTL analysis. We predict that with larger sample sizes more hidden signals will be revealed as statistical power increases.
In conclusion, we identified a protective variant, rs1990621, in TMEM106B that is associated with increased neuronal proportion through bulk RNA-Seq deconvolution and neuronal proportion QTL analysis. This observation also confirms previous findings of the protective variant rs1990622 in FTLD risk, which is in high LD with rs19990621 [80]. In addition, we observed a significant association with increased neuronal proportions in the APOE region. This suggests the potential involvement of both APOE and TMEM106B in neuronal protection mechanisms underlying neurodegenerative and normal aging processes, and supports the previous observations of interactions between these two genes [41] in AD cohorts. With larger sample sizes and higher deconvolution resolution, this approach will reveal more biologically relevant and novel loci associated with changes in cellular composition to allow better interpretation of transcriptomic results in the context of both disease etiology and healthy aging. TMEM106B regulates lysosomal function, therefore it is possible that TMEM106B-related lysosomal changes might be involved in a common pathway underlying neuronal death and astrocytosis in neurodegenerative disorders and normal aging cohorts, and may be a potential target for neuronal protection therapies.
Supplementary Material
Acknowledgments:
We thank all the participants and their families, as well as the many institutions and their staff that provided support for the studies involved in this collaboration. We also thank Dr. Jae Hoon Sul for his help with the Meta-Tissue analysis applied in this study.
Funding: This work was supported by grants from the National Institutes of Health (R01AG044546, P01AG003991, RF1AG053303, R01AG058501, U01AG058922 K01AG046374, and R01HL119813), the Alzheimer’s Association (NIRG-11–200110, BAND-14–338165, AARG-16–441560 and BFG-15–362540). BAB is supported by 2018 pilot funding from the Hope Center for Neurological Disorders and the Danforth Foundation Challenge at Washington University. The recruitment and clinical characterization of research participants at Washington University were supported by NIH P50 AG05681, P01 AG03991, and P01 AG026276.
This work was supported by access to equipment made possible by the Hope Center for Neurological Disorders and the Departments of Neurology and Psychiatry at Washington University School of Medicine.
DIAN: Data collection and sharing for this project was supported by The Dominantly Inherited Alzheimer’s Network (DIAN, UF1AG032438) funded by the National Institute on Aging (NIA), the German Center for Neurodegenerative Diseases (DZNE), and the Raul Carrea Institute for Neurological Research (FLENI). Partial support was provided by the Research and Development Grants for Dementia from the Japan Agency for Medical Research and Development, AMED, and the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI). This manuscript has been reviewed by DIAN Study investigators for scientific content and consistency of data interpretation with previous DIAN Study publications. We acknowledge the altruism of the participants and their families and contributions of the DIAN research and support staff at each of the participating sites for their contributions to this study.
Mayo RNAseq: Study data were provided by the following sources: The Mayo Clinic Alzheimer’s Disease Genetic Studies, led by Dr. Nilufer Taner and Dr. Steven G. Younkin, Mayo Clinic, Jacksonville, FL using samples from the Mayo Clinic Study of Aging, the Mayo Clinic Alzheimer’s Disease Research Center, and the Mayo Clinic Brain Bank. Data collection was supported through funding by NIA grants P50 AG016574, R01 AG032990, U01 AG046139, R01 AG018023, U01 AG006576, U01 AG006786, R01 AG025711, R01 AG017216, R01 AG003949, NINDS grant R01 NS080820, CurePSP Foundation, and support from Mayo Foundation. Study data includes samples collected through the Sun Health Research Institute Brain and Body Donation Program of Sun City, Arizona. The Brain and Body Donation Program is supported by the National Institute of Neurological Disorders and Stroke (U24 NS072026 National Brain and Tissue Resource for Parkinson’s Disease and Related Disorders), the National Institute on Aging (P30 AG19610 Arizona Alzheimer’s Disease Core Center), the Arizona Department of Health Services (contract 211002, Arizona Alzheimer’s Research Center), the Arizona Biomedical Research Commission (contracts 4001, 0011, 05–901 and 1001 to the Arizona Parkinson’s Disease Consortium) and the Michael J. Fox Foundation for Parkinson’s Research.
MSBB: These data were generated from postmortem brain tissue collected through the Mount Sinai VA Medical Center Brain Bank and were provided by Dr. Eric Schadt from Mount Sinai School of Medicine.
Footnotes
Publisher's Disclaimer: This Author Accepted Manuscript is a PDF file of an unedited peer-reviewed manuscript that has been accepted for publication but has not been copyedited or corrected. The official version of record that is published in the journal is kept up to date and so may therefore differ from this version.
Competing interests: CC receives research support from: Biogen, EISAI, Alector and Parabon. The funders of the study had no role in the collection, analysis, interpretation of data; or in the writing of the report; or in the decision to submit the paper for publication. CC is a member of the advisory board of ADx Healthcare and Vivid Genomics.
Data availability
The meta-analysis results can be accessed interactively through our Online Neurodegenerative Trait Integrative Multi-Omics Explorer (ONTIME): http://ngi.pub:5000/pheno/Neuron_QTL Knight-ADRC: https://www.synapse.org/#!Synapse:syn12181323
According to the data request terms, DIAN data are available upon request: http://dian.wustl.edu
Mayo: https://www.synapse.org/#!Synapse:syn5550404
MSSM: https://www.synapse.org/#!Synapse:syn3157743
ROSMAP: https://www.synapse.org/#!Synapse:syn3219045
CommonMind: https://www.synapse.org/#!Synapse:syn2759792
GTEx: https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000424.v7.p2
REFERENCES
- 1.Allen M, Carrasquillo MM, Funk C, Heavner BD, Zou F, Younkin CS et al. (2016) Human whole genome genotype and transcriptome data for Alzheimer’s and other neurodegenerative diseases. Sci Data 3: 160089 Doi 10.1038/sdata.2016.89 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Andrews S (2010) FastQC: a quality control tool for high throughput sequence data.
- 3.Anglade P, Vyas S, Javoy-Agid F, Herrero MT, Michel PP, Marquez J et al. (1997) Apoptosis and autophagy in nigral neurons of patients with Parkinson’s disease. Histology and histopathology 12: 25–31 [PubMed] [Google Scholar]
- 4.Askanas V, Engel WK (2001) Inclusion-body myositis: newest concepts of pathogenesis and relation to aging and Alzheimer disease. J Neuropathol Exp Neurol 60: 1–14 [DOI] [PubMed] [Google Scholar]
- 5.Bateman RJ, Xiong C, Benzinger TL, Fagan AM, Goate A, Fox NC et al. (2012) Clinical and biomarker changes in dominantly inherited Alzheimer’s disease. New England Journal of Medicine 367: 795–804 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Battle A, Brown CD, Engelhardt BE, Montgomery SB (2017) Genetic effects on gene expression across human tissues. Nature 550: 204–213 Doi 10.1038/nature24277 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Bennett DA, Schneider JA, Arvanitakis Z, Wilson RS (2012) Overview and findings from the religious orders study. Curr Alzheimer Res 9: 628–645 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Bennett DA, Schneider JA, Buchman AS, Barnes LL, Boyle PA, Wilson RS (2012) Overview and findings from the rush Memory and Aging Project. Curr Alzheimer Res 9: 646–663 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Brady OA, Zheng Y, Murphy K, Huang M, Hu F (2013) The frontotemporal lobar degeneration risk factor, TMEM106B, regulates lysosomal morphology and function. Hum Mol Genet 22: 685–695 Doi 10.1093/hmg/dds475 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Broad Institute The Picard Pipeline. http://broadinstitute.github.io/picard/
- 11.Carmona-Gutierrez D, Hughes AL, Madeo F, Ruckenstuhl C (2016) The crucial impact of lysosomes in aging and longevity. Ageing research reviews 32: 2–12 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Chailangkarn T, Trujillo CA, Freitas BC, Hrvoj-Mihic B, Herai RH, Yu DX et al. (2016) A human neurodevelopmental model for Williams syndrome. Nature 536: 338–343 Doi 10.1038/nature19067 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Chang CC, Chow CC, Tellier LC, Vattikuti S, Purcell SM, Lee JJ (2015) Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4: 7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Chen-Plotkin AS, Unger TL, Gallagher MD, Bill E, Kwong LK, Volpicelli-Daley L et al. (2012) TMEM106B, the risk gene for frontotemporal dementia, is regulated by the microRNA-132/212 cluster and affects progranulin pathways. J Neurosci 32: 11213–11227 Doi 10.1523/jneurosci.0521-12.2012 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Consortium G (2013) The Genotype-Tissue Expression (GTEx) project. Nat Genet 45: 580–585 Doi 10.1038/ng.2653 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Cruchaga C, Graff C, Chiang HH, Wang J, Hinrichs AL, Spiegel N et al. (2011) Association of TMEM106B gene polymorphism with age at onset in granulin mutation carriers and plasma granulin protein levels. Arch Neurol 68: 581–586 Doi 10.1001/archneurol.2010.350 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Cruchaga C, Kauwe JS, Harari O, Jin SC, Cai Y, Karch CM et al. (2013) GWAS of cerebrospinal fluid tau levels identifies risk variants for Alzheimer’s disease. Neuron 78: 256–268 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.de Leeuw CA, Mooij JM, Heskes T, Posthuma D (2015) MAGMA: generalized gene-set analysis of GWAS data. Plos Comput Biol 11: e1004219 Doi 10.1371/journal.pcbi.1004219 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Delaneau O, Marchini J (2014) Integrating sequence and array data to create an improved 1000 Genomes Project haplotype reference panel. Nature communications 5: 3934 Doi 10.1038/ncomms4934 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Deming Y, Filipello F, Cignarella F, Cantoni C, Hsu S, Mikesell R et al. (2018) The MS4A gene cluster is a key regulator of soluble TREM2 and Alzheimer disease risk. bioRxiv: 352179 Doi 10.1101/352179 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Deming Y, Li Z, Kapoor M, Harari O, Del-Aguila JL, Black K et al. (2017) Genome-wide association study identifies four novel loci associated with Alzheimer’s endophenotypes and disease modifiers. Acta neuropathologica 133: 839–856 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Deming Y, Xia J, Cai Y, Lord J, Del-Aguila JL, Fernandez MV et al. (2016) Genetic studies of plasma analytes identify novel potential biomarkers for several complex traits. Scientific Reports 6: Doi 10.1038/srep18092 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S et al. (2013) STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29: 15–21 Doi 10.1093/bioinformatics/bts635 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Duan J, Sanders AR, Moy W, Drigalenko EI, Brown EC, Freda J et al. (2015) Transcriptome outlier analysis implicates schizophrenia susceptibility genes and enriches putatively functional rare genetic variants. Human molecular genetics 24: 4674–4685 Doi 10.1093/hmg/ddv199 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Espay AJ, Vizcarra JA, Marsili L, Lang AE, Simon DK, Merola A et al. (2019) Revisiting protein aggregation as pathogenic in sporadic Parkinson and Alzheimer diseases. Neurology 92: 329–337 Doi 10.1212/wnl.0000000000006926 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Fagan AM, Roe CM, Xiong C, Mintun MA, Morris JC, Holtzman DM (2007) Cerebrospinal fluid tau/beta-amyloid(42) ratio as a prediction of cognitive decline in nondemented older adults. Arch Neurol 64: 343–349 Doi 10.1001/archneur.64.3.noc60123 [DOI] [PubMed] [Google Scholar]
- 27.Ferrari R, Hernandez DG, Nalls MA, Rohrer JD, Ramasamy A, Kwok JB et al. (2014) Frontotemporal dementia and its subtypes: a genome-wide association study. Lancet Neurol 13: 686–699 Doi 10.1016/s1474-4422(14)70065-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Filimonenko M, Stuffers S, Raiborg C, Yamamoto A, Malerod L, Fisher EM et al. (2007) Functional multivesicular bodies are required for autophagic clearance of protein aggregates associated with neurodegenerative disease. J Cell Biol 179: 485–500 Doi 10.1083/jcb.200702115 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Finch N, Carrasquillo MM, Baker M, Rutherford NJ, Coppola G, Dejesus-Hernandez M et al. (2011) TMEM106B regulates progranulin levels and the penetrance of FTLD in GRN mutation carriers. Neurology 76: 467–474 Doi 10.1212/WNL.0b013e31820a0e3b [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Fromer M, Roussos P, Sieberts SK, Johnson JS, Kavanagh DH, Perumal TM et al. (2016) Gene expression elucidates functional impact of polygenic risk for schizophrenia. Nat Neurosci 19: 1442–1453 Doi 10.1038/nn.4399 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Gallagher MD, Posavi M, Huang P, Unger TL, Berlyand Y, Gruenewald AL et al. (2017) A Dementia-Associated Risk Variant near TMEM106B Alters Chromatin Architecture and Gene Expression. Am J Hum Genet 101: 643–663 Doi 10.1016/j.ajhg.2017.09.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Garcia-Osta A, Cuadrado-Tejedor M, Garcia-Barroso C, Oyarzabal J, Franco R (2012) Phosphodiesterases as therapeutic targets for Alzheimer’s disease. ACS chemical neuroscience 3: 832–844 Doi 10.1021/cn3000907 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Gaujoux R, Seoighe C (2013) CellMix: a comprehensive toolbox for gene expression deconvolution. Bioinformatics 29: 2211–2212 Doi 10.1093/bioinformatics/btt351 [DOI] [PubMed] [Google Scholar]
- 34.Han B, Eskin E (2012) Interpreting meta-analyses of genome-wide association studies. PLoS Genet 8: e1002555 Doi 10.1371/journal.pgen.1002555 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Harari O, Cruchaga C, Kauwe JS, Ainscough BJ, Bales K, Pickering EH et al. (2014) Phosphorylated tau-Aβ42 ratio as a continuous trait for biomarker discovery for early-stage Alzheimer’s disease in multiplex immunoassay panels of cerebrospinal fluid. Biol Psychiatry 75: 723–731 Doi 10.1016/j.biopsych.2013.11.032 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Hardy J, Selkoe DJ (2002) The amyloid hypothesis of Alzheimer’s disease: progress and problems on the road to therapeutics. science 297: 353–356 [DOI] [PubMed] [Google Scholar]
- 37.Holtzman DM, Morris JC, Goate AM (2011) Alzheimer’s disease: the challenge of the second century. Sci Transl Med 3: 77sr71–77sr71 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Howie B, Fuchsberger C, Stephens M, Marchini J, Abecasis GR (2012) Fast and accurate genotype imputation in genome-wide association studies through pre-phasing. Nat Genet 44: 955–959 Doi 10.1038/ng.2354 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Infante J, Llorca J, Rodero L, Palacio E, Berciano J, Combarros O (2002) Polymorphism at codon 174 of the prion-like protein gene is not associated with sporadic Alzheimer’s disease. Neuroscience letters 332: 213–215 [DOI] [PubMed] [Google Scholar]
- 40.Jentsch TJ (2007) Chloride and the endosomal-lysosomal pathway: emerging roles of CLC chloride transporters. The Journal of physiology 578: 633–640 Doi 10.1113/jphysiol.2006.124719 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Jun G, Ibrahim-Verbaas CA, Vronskaya M, Lambert JC, Chung J, Naj AC et al. (2016) A novel Alzheimer disease locus located near the gene encoding tau protein. Mol Psychiatry 21: 108–117 Doi 10.1038/mp.2015.23 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Kauwe JS, Bailey MH, Ridge PG, Perry R, Wadsworth ME, Hoyt KL et al. (2014) Genome-wide association study of CSF levels of 59 Alzheimer’s disease candidate proteins: significant associations with proteins involved in amyloid processing and inflammation. PLoS genetics 10: e1004758. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Keage HA, Hunter S, Matthews FE, Ince PG, Hodges J, Hokkanen SR et al. (2014) TDP-43 pathology in the population: prevalence and associations with dementia and age. J Alzheimers Dis 42: 641–650 Doi 10.3233/jad-132351 [DOI] [PubMed] [Google Scholar]
- 44.Kelly MP (2017) A Role for Phosphodiesterase 11A (PDE11A) in the Formation of Social Memories and the Stabilization of Mood. Advances in neurobiology 17: 201–230 Doi 10.1007/978-3-319-58811-7_8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Kelly MP, Adamowicz W, Bove S, Hartman AJ, Mariga A, Pathak G et al. (2014) Select 3’,5’-cyclic nucleotide phosphodiesterases exhibit altered expression in the aged rodent brain. Cellular signalling 26: 383–397 Doi 10.1016/j.cellsig.2013.10.007 [DOI] [PubMed] [Google Scholar]
- 46.Ko DC, Milenkovic L, Beier SM, Manuel H, Buchanan J, Scott MP (2005) Cell-autonomous death of cerebellar purkinje neurons with autophagy in Niemann-Pick type C disease. PLoS Genet 1: 81–95 Doi 10.1371/journal.pgen.0010007 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Koike M, Shibata M, Ohsawa Y, Nakanishi H, Koga T, Kametaka S et al. (2003) Involvement of two different cell death pathways in retinal atrophy of cathepsin D-deficient mice. Molecular and cellular neurosciences 22: 146–161 [DOI] [PubMed] [Google Scholar]
- 48.Koike M, Shibata M, Waguri S, Yoshimura K, Tanida I, Kominami E et al. (2005) Participation of autophagy in storage of lysosomes in neurons from mouse models of neuronal ceroidlipofuscinoses (Batten disease). Am J Pathol 167: 1713–1728 Doi 10.1016/s0002-9440(10)61253-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Kushima I, Aleksic B, Nakatochi M, Shimamura T, Okada T, Uno Y et al. (2018) Comparative Analyses of Copy-Number Variation in Autism Spectrum Disorder and Schizophrenia Reveal Etiological Overlap and Biological Insights. Cell reports 24: 2838–2856 Doi 10.1016/j.celrep.2018.08.022 [DOI] [PubMed] [Google Scholar]
- 50.Kypri E, Schmauch C, Maniak M, De Lozanne A (2007) The BEACH protein LvsB is localized on lysosomes and postlysosomes and limits their fusion with early endosomes. Traffic (Copenhagen, Denmark) 8: 774–783 Doi 10.1111/j.1600-0854.2007.00567.x [DOI] [PubMed] [Google Scholar]
- 51.Lambert JC, Ibrahim-Verbaas CA, Harold D, Naj AC, Sims R, Bellenguez C et al. (2013) Meta-analysis of 74,046 individuals identifies 11 new susceptibility loci for Alzheimer’s disease. Nat Genet 45: 1452–1458 Doi 10.1038/ng.2802 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Lang CM, Fellerer K, Schwenk BM, Kuhn PH, Kremmer E, Edbauer D et al. (2012) Membrane orientation and subcellular localization of transmembrane protein 106B (TMEM106B), a major risk factor for frontotemporal lobar degeneration. J Biol Chem 287: 19355–19365 Doi 10.1074/jbc.M112.365098 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Laszlo L, Lowe J, Self T, Kenward N, Landon M, McBride T et al. (1992) Lysosomes as key organelles in the pathogenesis of prion encephalopathies. J Pathol 166: 333–341 Doi 10.1002/path.1711660404 [DOI] [PubMed] [Google Scholar]
- 54.Lee JA, Beigneux A, Ahmad ST, Young SG, Gao FB (2007) ESCRT-III dysfunction causes autophagosome accumulation and neurodegeneration. Curr Biol 17: 1561–1567 Doi 10.1016/j.cub.2007.07.029 [DOI] [PubMed] [Google Scholar]
- 55.Leidal AM, Levine B, Debnath J (2018) Autophagy and the cell biology of age-related disease. Nat Cell Biol 20: 1338–1348 Doi 10.1038/s41556-018-0235-8 [DOI] [PubMed] [Google Scholar]
- 56.Li Z, Del-Aguila JL, Dube U, Budde J, Martinez R, Black K et al. (2018) Genetic variants associated with Alzheimer’s disease confer different cerebral cortex cell-type population structure. Genome Med 10: 43 Doi 10.1186/s13073-018-0551-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Murray ME, Cannon A, Graff-Radford NR, Liesinger AM, Rutherford NJ, Ross OA et al. (2014) Differential clinicopathologic and genetic features of late-onset amnestic dementias. Acta Neuropathol 128: 411–421 Doi 10.1007/s00401-014-1302-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Murray ME, Dickson DW (2014) Is pathological aging a successful resistance against amyloid-beta or preclinical Alzheimer’s disease? Alzheimers Res Ther 6: 24 Doi 10.1186/alzrt254 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Newman AM, Steen CB, Liu CL, Gentles AJ, Chaudhuri AA, Scherer F et al. (2019) Determining cell type abundance and expression from bulk tissues with digital cytometry. Nature biotechnology: 1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Nicholson AM, Finch NA, Wojtas A, Baker MC, Perkerson RB 3rd, Castanedes-Casey M et al. (2013) TMEM106B p.T185S regulates TMEM106B protein levels: implications for frontotemporal dementia. J Neurochem 126: 781–791 Doi 10.1111/jnc.12329 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Nixon RA (2005) Endosome function and dysfunction in Alzheimer’s disease and other neurodegenerative diseases. Neurobiol Aging 26: 373–382 Doi 10.1016/j.neurobiolaging.2004.09.018 [DOI] [PubMed] [Google Scholar]
- 62.Nixon RA, Yang DS, Lee JH (2008) Neurodegenerative lysosomal disorders: a continuum from development to late age. Autophagy 4: 590–599 [DOI] [PubMed] [Google Scholar]
- 63.Pacheco CD, Kunkel R, Lieberman AP (2007) Autophagy in Niemann-Pick C disease is dependent upon Beclin-1 and responsive to lipid trafficking defects. Hum Mol Genet 16: 1495–1503 Doi 10.1093/hmg/ddm100 [DOI] [PubMed] [Google Scholar]
- 64.Patro R, Duggal G, Love MI, Irizarry RA, Kingsford C (2017) Salmon provides fast and bias-aware quantification of transcript expression. Nat Methods 14: 417–419 Doi 10.1038/nmeth.4197 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Piccio L, Deming Y, Del-Aguila JL, Ghezzi L, Holtzman DM, Fagan AM et al. (2016) Cerebrospinal fluid soluble TREM2 is higher in Alzheimer disease and associated with mutation status. Acta Neuropathol 131: 925–933 Doi 10.1007/s00401-016-1533-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Premi E, Grassi M, van Swieten J, Galimberti D, Graff C, Masellis M et al. (2017) Cognitive reserve and TMEM106B genotype modulate brain damage in presymptomatic frontotemporal dementia: a GENFI study. Brain 140: 1784–1791 Doi 10.1093/brain/awx103 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Ramasamy A, Trabzuni D, Guelfi S, Varghese V, Smith C, Walker R et al. (2014) Genetic variability in the regulation of gene expression in ten regions of the human brain. Nat Neurosci 17: 1418–1428 Doi 10.1038/nn.3801 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Reid E, Connell J, Edwards TL, Duley S, Brown SE, Sanderson CM (2005) The hereditary spastic paraplegia protein spastin interacts with the ESCRT-III complex-associated endosomal protein CHMP1B. Hum Mol Genet 14: 19–38 Doi 10.1093/hmg/ddi003 [DOI] [PubMed] [Google Scholar]
- 69.Rhinn H, Abeliovich A (2017) Differential Aging Analysis in Human Cerebral Cortex Identifies Variants in TMEM106B and GRN that Regulate Aging Phenotypes. Cell Syst 4: 404–415.e405 Doi 10.1016/j.cels.2017.02.009 [DOI] [PubMed] [Google Scholar]
- 70.Sarkar S, Davies JE, Huang Z, Tunnacliffe A, Rubinsztein DC (2007) Trehalose, a novel mTOR-independent autophagy enhancer, accelerates the clearance of mutant huntingtin and alpha-synuclein. J Biol Chem 282: 5641–5652 Doi 10.1074/jbc.M609532200 [DOI] [PubMed] [Google Scholar]
- 71.Schroder B, Franz B, Hempfling P, Selbert M, Jurgens T, Kretzschmar HA et al. (2001) Polymorphisms within the prion-like protein gene (Prnd) and their implications in human prion diseases, Alzheimer’s disease and other neurological disorders. Human genetics 109: 319–325 Doi 10.1007/s004390100591 [DOI] [PubMed] [Google Scholar]
- 72.Schwenk BM, Lang CM, Hogl S, Tahirovic S, Orozco D, Rentzsch K et al. (2014) The FTLD risk factor TMEM106B and MAP6 control dendritic trafficking of lysosomes. Embo j 33: 450–467 Doi 10.1002/embj.201385857 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Shirk AJ, Anderson SK, Hashemi SH, Chance PF, Bennett CL (2005) SIMPLE interacts with NEDD4 and TSG101: evidence for a role in lysosomal sorting and implications for Charcot-Marie-Tooth disease. J Neurosci Res 82: 43–50 Doi 10.1002/jnr.20628 [DOI] [PubMed] [Google Scholar]
- 74.Stagi M, Klein ZA, Gould TJ, Bewersdorf J, Strittmatter SM (2014) Lysosome size, motility and stress response regulated by fronto-temporal dementia modifier TMEM106B. Molecular and cellular neurosciences 61: 226–240 Doi 10.1016/j.mcn.2014.07.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Suárez-Calvet M, Capell A, Caballero MÁA, Morenas-Rodríguez E, Fellerer K, Franzmeier N et al. (2018) CSF progranulin increases in the course of Alzheimer’s disease and is associated with sTREM2, neurodegeneration and cognitive decline. EMBO molecular medicine 10: e9712. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Sul JH, Han B, Ye C, Choi T, Eskin E (2013) Effectively identifying eQTLs from multiple tissues by combining mixed model and meta-analytic approaches. PLoS Genet 9: e1003491 Doi 10.1371/journal.pgen.1003491 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Toledo JB, Cairns NJ, Da X, Chen K, Carter D, Fleisher A et al. (2013) Clinical and multimodal biomarker correlates of ADNI neuropathological findings. Acta Neuropathol Commun 1: 65 Doi 10.1186/2051-5960-1-65 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Turner SD (2014) qqman: an R package for visualizing GWAS results using Q-Q and manhattan plots. bioRxiv: 005165 Doi 10.1101/005165 [DOI] [Google Scholar]
- 79.Uchino A, Takao M, Hatsuta H, Sumikura H, Nakano Y, Nogami A et al. (2015) Incidence and extent of TDP-43 accumulation in aging human brain. Acta Neuropathol Commun 3: 35 Doi 10.1186/s40478-015-0215-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Van Deerlin VM, Sleiman PM, Martinez-Lage M, Chen-Plotkin A, Wang LS, Graff-Radford NR et al. (2010) Common variants at 7p21 are associated with frontotemporal lobar degeneration with TDP-43 inclusions. Nature genetics 42: 234–239 Doi 10.1038/ng.536 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Vass R, Ashbridge E, Geser F, Hu WT, Grossman M, Clay-Falcone D et al. (2011) Risk genotypes at TMEM106B are associated with cognitive impairment in amyotrophic lateral sclerosis. Acta Neuropathol 121: 373–380 Doi 10.1007/s00401-010-0782-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Vonsattel JP, DiFiglia M (1998) Huntington disease. J Neuropathol Exp Neurol 57: 369–384 [DOI] [PubMed] [Google Scholar]
- 83.Wang D, Liu S, Warrell J, Won H, Shi X, Navarro FCP et al. (2018) Comprehensive functional genomic resource and integrative model for the human brain. Science 362: Doi 10.1126/science.aat8464 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Wang L, Nie J, Sicotte H, Li Y, Eckel-Passow JE, Dasari S et al. (2016) Measure transcript integrity using RNA-seq data. BMC Bioinformatics 17: 58 Doi 10.1186/s12859-016-0922-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Wang M, Beckmann ND, Roussos P, Wang E, Zhou X, Wang Q et al. (2018) The Mount Sinai cohort of large-scale genomic, transcriptomic and proteomic data in Alzheimer’s disease. Sci Data 5: 180185 Doi 10.1038/sdata.2018.185 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Watanabe K, Taskesen E, van Bochoven A, Posthuma D (2017) Functional mapping and annotation of genetic associations with FUMA. Nature communications 8: 1826 Doi 10.1038/s41467-017-01261-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Webb JL, Ravikumar B, Atkins J, Skepper JN, Rubinsztein DC (2003) Alpha-Synuclein is degraded by both autophagy and the proteasome. J Biol Chem 278: 25009–25013 Doi 10.1074/jbc.M300227200 [DOI] [PubMed] [Google Scholar]
- 88.Wennberg AM, Whitwell JL, Tosakulwong N, Weigand SD, Murray ME, Machulda MM et al. (2019) The influence of tau, amyloid, alpha-synuclein, TDP-43, and vascular pathology in clinically normal elderly individuals. Neurobiol Aging 77: 26–36 Doi 10.1016/j.neurobiolaging.2019.01.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.White CC, Yang HS, Yu L, Chibnik LB, Dawe RJ, Yang J et al. (2017) Identification of genes associated with dissociation of cognitive performance and neuropathological burden: Multistep analysis of genetic, epigenetic, and transcriptional data. PLoS medicine 14: e1002287 Doi 10.1371/journal.pmed.1002287 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Yang J, Lee SH, Goddard ME, Visscher PM (2011) GCTA: a tool for genome-wide complex trait analysis. Am J Hum Genet 88: 76–82 Doi 10.1016/j.ajhg.2010.11.011 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Zhou X, Sun L, Brady OA, Murphy KA, Hu F (2017) Elevated TMEM106B levels exaggerate lipofuscin accumulation and lysosomal dysfunction in aged mice with progranulin deficiency. Acta Neuropathologica Communications 5: 9 Doi 10.1186/s40478-017-0412-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.



