Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2021 May 20;16(5):e0250278. doi: 10.1371/journal.pone.0250278

Molecular subtyping of Alzheimer’s disease with consensus non-negative matrix factorization

Chunlei Zheng 1, Rong Xu 1,2,*
Editor: Hemant K Paudel3
PMCID: PMC8136734  PMID: 34014928

Abstract

Alzheimer’s disease (AD) is a heterogeneous disease and exhibits diverse clinical presentations and disease progression. Some pathological and anatomical subtypes have been proposed. However, these subtypes provide a limited mechanistic understanding for AD. Leveraging gene expression data of 222 AD patients from The Religious Orders Study and Memory and Aging Project (ROSMAP) Study, we identified two AD molecular subtypes (synaptic type and inflammatory type) using consensus non-negative matrix factorization (NMF). Synaptic type is characterized by disrupted synaptic vesicle priming and recycling and synaptic plasticity. Inflammatory type is characterized by disrupted IL2, interferon alpha and gamma pathways. The two AD molecular subtypes were validated using independent data from Gene Expression Omnibus. We further demonstrated that the two molecular subtypes are associated with APOE genotypes, with synaptic type more prevalent in AD patients with E3E4 genotype and inflammatory type more prevalent in AD patients with E3E3 genotype (p = 0.031). In addition, two molecular subtypes are differentially represented in male and female AD, with synaptic type more prevalent in male and inflammatory type in female patients (p = 0.051). Identification of AD molecular subtypes has potential in facilitating disease mechanism understanding, clinical trial design, drug discovery, and precision medicine for AD.

Introduction

Alzheimer’s disease (AD) is the most common neurodegenerative disease in elderly population, characterized by pathological extracellular deposition of beta-amyloid (Aβ) peptides and intracellular tau protein fibers in the brain [1]. AD is a heterogenous and multifactorial disease, with diverse clinical presentations in different affected brain areas (left and right cerebral hemispheres as well as anterior-posterior axis) [25], different phenotypes (dysexecutive, amnesic and aphasic) [6, 7], and different rates of disease progression [8]. Recent studies suggested that Aβ aggregates in different biochemical composition [9]. Defining subtypes of AD is important for disease mechanism understanding, clinical trial design, drug discovery, and personalized treatments.

Neuroimaging, beta amyloid and tau have been used for AD subtyping [913], however, subtypes identified based on image analysis and beta amyloid offer limited mechanistic understanding into AD pathophysiology. High-throughput genomic data has greatly expanded our understanding for disease mechanism of AD. Genome-wide association studies (GWAS) have initially identified over 20 loci for late-onset AD [14, 15]. A recent approach called genome-wide association-by-proxy (GWAX) using larger sample size has further expanded the susceptibility loci of AD to 40 [1618]. Several pathways or molecular networks involved in AD were identified using gene expression data [19, 20]. In addition, advanced machine learning and statistical methods have used genomic data to classify AD from normal and mild cognitive impairment (MCI) or predicting MCI to AD conversion [2124]. However, genomic data have not been used for AD subtyping.

The Religious Orders Study and Memory and Aging Project (ROSMAP) is a longitudinal clinical-pathologic cohort study of aging and AD [25]. Currently, around 2,500 individuals were involved in this study and genomic data from 642 participants are available. In this study, we leveraged these valuable data for AD molecular subtyping using non-negative matrix factorization (NMF) clustering method. It has been shown that NMF-based classification is accurate and robust for clustering of genomic data as compared to other methods [26]. NMF has been used in cancer molecular subtyping [27, 28]. In this study, we applied NMF to identify molecular subtypes of AD using gene expression data from ROSMAP and validated the AD molecular subtype in independent datasets. We also investigated the association of AD molecular subtype with patient demographic, clinical and APOE status variables.

Materials and methods

The overall methods were illustrated in Fig 1. The Religious Orders Study and Memory and Aging Project (ROSMAP) was used as the discovery dataset. First, we applied consensus matrix-based NMF into ROSMAP to identify AD molecular subtypes. Second, subtype analysis was performed to identify signature genes and enriched pathways for each molecular subtype. Third, we validated these molecular subtypes in independent datasets (GEO). Finally, we investigated the association of AD molecular subtype with available demographic and clinical variables, and APOE genotype.

Fig 1. Overview of the methods.

Fig 1

NMF: non-negative matrix factorization.

ROSMAP data

ROSMAP contains 222 participants with clinical consensus diagnosis of AD at time of death. Raw gene expression data from frontal cortex and corresponding clinical data were downloaded from synapse.org (syn3219045). Raw count data were normalized and processed according to commonly used procedure described in edgeR (version: 3.28.0) [29, 30]. Data were first normalized by sequencing library size. Non-expressed genes, defined as count per million less than 5 in 80% of samples, were then filtered out, resulting in 12281 genes. To obtain a robust classifier and also reduce the number of genes for NMF-based clustering, we experimented with the different cutoffs ranging from top 10% to 40% (1228 to 4912 genes) based on their interquartile range (IQR) for clustering. While the obtained results were very similar, we presented the clustering result using the top 20% cutoff (2456 genes).

Consensus NMF for AD molecular subtyping

Non-negative matrix factorization

Among different variants of NMF, we employed divergence-based algorithm proposed by Lee and Seung [31] due to its simplicity and robustness [26, 31]. Briefly, given a gene expression matrix A of size n × m (n genes and m samples) and desired number of clusters k, the NMF decomposes A into two non-negative matrices W (n × k) and H (k × m) (Fig 2).

Fig 2. Non-negative matrix factorization procedure.

Fig 2

W and H matrix are computed using iterative method to minimize the following cost function.

D=ij(AijlogAij(WH)ij-Aij+WHij)

In each iteration, W and H are updated using following multiplicative updating rules,

HauHauiWiaAiu/(WH)iukWka
WiaWiauHauAiu/(WH)iuvWav

Cluster membership of each sample is assigned based on the row index of maximal number in the column of H matrix.

Consensus-matrix based model selection

We used consensus matrix-base model selection strategy to select best number of clusters [26]. For a given number of clusters K, NMF groups the samples into K clusters. A total of 40 NMF runs were employed to construct the consensus matrix C (n × n). Each element of consensus matrix represents the probability that two samples cluster together. Then, the cophenetic correlation coefficient ρk was computed as the Pearson correlation of the distance matrix between samples induced by the consensus matrix, i.e., IC, and the distance matrix induced by the hierarchical clustering of IC. ρk measures how faithfully a dendrogram preserves the pairwise distances in the consensus matrix and was calculated using the cophenet function in the scikit-learn library [32]. The best clustering is based on the value of ρk.

Identification of molecular subtype-specific signatures

To identify molecular subtype-specific signatures, we first computed the silhouette for each sample using following equation.

s(i)=b(i)-a(i)max{a(i),b(i)}

Where a(i) and b(i) were computed as following,

a(i)=1|Ci|-1jCi,ijd(i,j)
b(i)=minki1|Ck|jCkd(i,j)

a(i) is the mean distance of a sample to all other samples in the same cluster. It measures how well a sample is assigned to its own cluster. The smaller the value is, the better the assignment is. b(i) is the smallest mean distance of a sample to all samples in any other cluster. |Ci| is the number of samples in its own cluster, |Ck| is the number of samples in any other cluster, and d(i, j) is the distance of two samples computed with Euclidean distance.

The silhouette is a measure of how similar a sample is to its own cluster compared to other clusters. After removing outlier samples with negative silhouette width from each subtype, we applied statistical package edgeR (version: 3.28.0) to obtain pairwise differentially expressed genes (DEGs) between molecular subtypes. To facilitate downstream analysis of molecular subtypes, we used fold change of 1.5 and false discover rate (FDR) of 0.05 as cutoffs. We define the gene signature of each subtype as DEGs that have the highest value in each molecular subtype.

Pathway enrichment analysis

A Bioconductor package clusterProfiler (Version 3.14.3) [33] was used to perform pathway enrichment analysis for each identified molecular subtype. ClusterProfile is a statistical package that integrates several ontologies, including Gene Ontology, Disease Ontology, and KEGG pathway, to perform over-representation analysis and gene set enrichment analysis.

Validation of AD molecular subtype in independent datasets

Two independent datasets from Gene Expression Omnibus (GSE44770, GSE118553) were used for validation of AD molecular subtypes. GSE44770 includes gene expression data from frontal cortex of 230 subjects, 128 of which are late-onset Alzheimer´s disease (LOAD) patients. GSE118553 includes gene expression data from frontal cortex of 112 subjects, including 52 AD patients. We used normalized data from GSE44770 and GSE118553 to validate molecular subtypes identified based on ROSMAP data.

Since ground truth of clusters in a dataset is unknown, there are no quantitative method to formally validate clusters in an independent dataset. Therefore, visualization is suggested as a valid approach [34]. A discovery by signature gene strategy proposed by other studies was used for this validation [27, 28]. The basic idea of this strategy is that using the signature gene from the discovery dataset to cluster a new dataset to see if the signature gene expression shows similar patterns with the discovery dataset. It includes three steps. First, signature genes were projected onto normalized independent dataset and consensus NMF clustering was used to identify number of clusters. Second, molecular subtype identity was assigned using signature genes. Third, a heatmap of signature gene expression was then generated to visualize the molecular subtype. In addition, we performed pathway enrichment analysis to further confirm the molecular subtypes in independent datasets.

Correlation of AD molecular subtype with patient demographics, clinicopathology, and APOE genotype

We examined the demographic distributions of AD molecular subtype, including age, sex, race and education, and assessed the associations of AD molecular subtype with APOE genotype and clinical variables, including Braak stage, The Consortium to Establish a Registry for Alzheimer’s Disease (CERAD) diagnosis, and Mini-Mental State Examination (MMSE) score. The Braak stage is a semiquantitative measure of severity of neurofibrillary tangle (NFT) pathology [35, 36]. Braak stages I and II indicate NFTs confined mainly to the entorhinal region of the brain. Braak stages III and IV indicate involvement of limbic regions such as the hippocampus. Braak stages V and VI indicate moderate to severe neocortical involvement. CERAD score is a semiquantitative measure of neurotic plaques [37]. Based on semiquantitative estimates of neurotic plaque density, a neuropathologic diagnosis was made of no AD, possible AD, probable AD, or definite AD. MMSE test is a 30-point questionnaire that is used extensively in clinical and research settings to measure cognitive impairment.

For categorical variables, including Braak stage, CERAD, and APOE, Fisher’s exact test was used to assess their associations with AD molecular subtype. For continuous variables, such as MMSE and education, student’s t-test was used. All statistical analysis was performed using R (version: 3.6.2). Significance level was defined as p value less than 0.05.

Ethics statement

This is a secondary research use for ROSMAP data and patient information is not identifiable. The IRB at Case Western Reserve University determined that the proposed activity is not research involving human subjects and IRB review and approval is not required (STUDY20190935). Therefore, patient consent is not applicable or not required.

Results

AD consists of two molecular subtypes

We used consensus NMF to cluster gene expression data of 222 AD patients from ROSMAP. Compared with three and four clusters, consensus matrix from two clusters are more stable (Fig 3A–3C). In addition, cophenetic correlation coefficient drops when we assign the data into three subtypes (Fig 3D). These evidences indicate that patient data can be best represented by two distinct subtypes. We obtained 403 differentially expressed genes between these two molecular subtypes as signature genes using 197 core samples with positive silhouette score (Fig 4A). We can see the distinct pattern of signature gene expression in these two subtypes (Fig 4B).

Fig 3. NMF-based clustering of gene expression data from 222 AD patients in ROSMAP.

Fig 3

(A-C) Consensus matrices for 2, 3 and 4 clusters respectively. (D) Plot of cophenetic correlation coefficient against the number of clusters.

Fig 4. Signature genes in each molecular subtype.

Fig 4

(A) Silhouette score for each sample (B) Heatmap for signature gene expression in each molecular subtype. Gene expression is represented as normalized value.

We named the molecular subtypes according to signature genes up-regulated in each cluster. For synaptic type, highly expressed genes are associated with synapse function, such as SNAP25, RAB3A, VAMP1, SYNJ1, and STXBP1. A total of 37 pathways were significantly enriched and 23 of 37 (62.2%) are related to synapse function (S1 Table). The top 10 enriched pathways of this subtype are shown in Table 1. We can see that synaptic type is characterized by dysfunction of synapse, including synaptic vesicle priming and recycling, and neurotransmitter secretion (Table 1).

Table 1. Top 10 enriched pathways in the synaptic type of AD.

PATHWAY P value (adjusted) Fold enriched
Synaptic vesicle cycle 4.1E-04 5.63
Vesicle-mediated transport in synapse 4.1E-04 5.36
Synaptic vesicle priming 1.4E-03 21.32
Synaptic vesicle recycling 1.4E-03 8.98
Calcium ion regulated exocytosis 1.4E-03 5.81
Synaptic vesicle endocytosis 3.0E-03 9.33
Presynaptic endocytosis 3.0E-03 9.33
Neurotransmitter secretion 3.3E-03 5.03
Signal release from synapse 3.2E-03 5.03
Signal release 1.1E-02 2.92

For inflammatory type, highly expressed genes are related to inflammatory pathways, such as BST2, GBP4, IFI44L, IFITM2, IFITM3, IL4R, IRF, MT2A, PSMB9, and TXNIP. A total of 3 pathways were significantly enriched using the signature genes. This subtype is characterized with dysfunction of inflammatory responses, including interferon alpha (IFN-α), interferon gamma (INF-γ) and IL2 pathways (Table 2).

Table 2. Enriched pathways in the inflammatory type of AD.

PATHWAY P value (adjusted) Fold enriched
Interferon alpha response 4.3E-05 7.83
Interferon gamma response 1.4E-03 4.22
IL2-Stat5 signaling 2.1E-02 3.37

AD molecular subtypes were validated in independent datasets

We validated the two AD molecular subtypes using two independent datasets from GEO (GSE44770, n = 128 and GSE118553, n = 40). Using consensus NMF, we identified clusters based on these two independent datasets from GEO (Figs 5 and 6). Majority of samples have positive silhouette scores (Figs 5E and 6E), indicating that samples are well classified using signature genes we obtained from ROSMAP. We can see distinct patterns for signature gene expression in these two clusters, indicating that these two clusters represent the same molecular subtypes from ROSMAP (Figs 5E and 6F).

Fig 5. Molecular subtype validation in GEO dataset (GSE44770).

Fig 5

(A-C) Consensus matrices for 2, 3 and 4 clusters respectively. (D) Plot of cophenetic correlation coefficient against the number of clusters. (E) Silhouette distance for each sample. (F) Heatmap for signature gene expression.

Fig 6. Molecular subtype validation in GEO dataset (GSE118553).

Fig 6

(A-C) Consensus matrices for 2, 3 and 4 clusters respectively. (D) Plot of cophenetic correlation coefficient against the number of clusters. (E) Silhouette distance for each sample. (F) Heatmap for signature gene expression.

To further validate the AD molecular subtypes in these two datasets, we performed pathway enrichment for each cluster in each dataset. For GSE44770 dataset, a total of 30 pathways were significantly enriched in first cluster (S2 Table). Seven of them exactly occur in enriched pathways of ROSMAP-based synaptic type AD and ten additional pathways are related to synaptic function, indicating that this cluster is a synaptic type. Ten pathways were enriched in second cluster and all three enriched pathways in ROSMAP-based inflammatory AD occur in this cluster, indicating that this cluster is an inflammatory type. Similar results were obtained in GSE118553 dataset. A total of ten pathways and two pathways were significantly enriched in each cluster respectively (S3 Table). In the first cluster, four of ten pathways are overlapped with the enriched pathways of ROSMAP-based synaptic type AD and five other pathways are related to synaptic function. In the second cluster, two enriched pathways are overlapped with the enriched pathways in ROSMAP-based inflammatory subtype.

Association analyses of AD molecular subtype with patient demographics, clinicopathology, and APOE genotype

We investigated whether AD subtypes are associated with demographic and clinical variables using the core samples from ROSMAP dataset (197 patients). The distributions of AD molecular subtype in demographic variables, including age, race and education, show no significant difference (Table 3). Interestingly, we noticed that synaptic type AD is more prevalent than inflammatory type in male patients (p = 0.051). Several measurements for AD severity are available in ROSMAP, including AD Braak stage, CREAD score and MMSE score. We didn’t see significant associations of AD molecular subtype with these variables (Table 3). This result suggests that AD molecular subtype might be not related to AD severity, but caution should be taken when explaining this result due to small sample size. ROSMAP also includes APOE genotype, the most important genetic risk factor for late-onset AD. A significant association of AD molecular subtype with APOE was observed (p = 0.031). We can see that synaptic type AD is more prevalent in patients with E3E4 genotype and inflammatory type AD is more prevalent in patients with E3E3 genotype (Table 3).

Table 3. Association of AD molecular subtype with demographic, clinical variables and APOE genotype in the ROSMAP dataset.

Synaptic type (Num. of Patients) Inflammatory type (Num. of Patients) p a
Age
< 65 0 0 1.0
65–80 5 4
> 80 101 87
Sex
Female 65 68 0.051
Male 41 23
Race
White 104 90 1.0
Black 2 1
Education 16.70 16.21 0.98
Braak stage
I 4 2 0.88
II 4 2
III 20 17
IV 35 29
V 41 37
VI 2 4
CREAD score
Definite 48 50 0.47
Probable 44 28
Possible 5 5
No AD 9 8
MMSE 13.84 12.23 0.19
APOE
E2E2 0 1 0.031
E2E3 12 9
E2E4 5 2
E3E3 46 55
E3E4 43 22
E4E4 0 2

a For categorical variables, including Braak stage, CREAD score and APOE, p value was computed using Fisher’s exact test. For continuous variables, including Education and MMSE, the p value was computed using student’s t-test.

We then examined whether these associations can also be observed in the two validation datasets. Although we didn’t see a significant association of sex with molecular subtype, we observed that the synaptic type is more prevalent in male patients than in females in both datasets. In the GSE44770, 37 of 60 (61.7%) are synaptic type in male patients, while it is 33 of 66 (50.0%) in female patients. In the GSE118553, the prevalence of synaptic type in male and female patients are 10 of 14 (71.4%) and 17 of 25 (68.0%) respectively. Due to the lack of APOE genotype in these two datasets, we were unable to investigate the association of APOE with molecular subtype (Table 4).

Table 4. Association of AD molecular subtype with age and sex in the two validation datasets.

Synaptic type (Num. of Patients) Inflammatory type (Num. of Patients) p
GSE44770 Age
 < 65 6 4 0.963
 65–80 30 23
 > 80 34 29
Sex
 Female 33 33 0.212
 Male 37 23
GSE118553 Age
 < 65 0 1 0.495
 65–80 9 4
 > 80 18 7
Sex
 Female 17 8 1
 Male 10 4

Discussion

In this study, we applied non-negative matrix factorization combined with consensus matrix-based cluster selection and identified two molecular subtypes based on gene expression data of AD. Synaptic type is characterized by dysfunction of synaptic pathways. Substantial loss of neurons and synapses is a hallmark in late stage AD. Recent studies also show synaptic dysfunction was observed in mild cognitive impairment patients [3840], suggesting that synaptic dysfunction is a fundamental mechanism of AD. On the other hand, inflammatory type is enriched with over-activation of IL-2, IFN-α, and IFN-γ pathways. The central role of inflammation in AD development is recently established [4143]. A sustained inflammatory response, mediated by over-activation of microglia and other immune cells, has been demonstrated to exacerbate both amyloid and tau pathology [42]. Roy ER et al reported that IFN-α response drives neuroinflammation and grossly upregulated in AD [44]. A recent study links IL-2 pathway to amyloid pathology of AD [45]. All these evidences demonstrated that inflammation represents another mechanism of AD. Therefore, the two AD molecular subtypes we identified reflect inherent molecular mechanism of AD. Interestingly, two studies reported that microglia are involved in synaptic pruning and plays a role in pathological remodeling of neuronal circuits [46, 47], indicating that two molecular processes may be related.

GWAS has identified more than 40 genes/loci as the genetic risk factors of AD, which greatly expands our mechanistic understanding of the etiology of AD. While some of these genes/loci have been mapped to Aβ pathology, including amyloid precursor protein (APP) metabolism, Aβ aggregation, clearance, toxicity, and Tau pathology, a large amount of these genes is related to non-Aβ and -Tau pathways [48]. Lambert et al suggested that a common mechanism, i.e., focal adhesion pathway, may link Aβ and tau pathology and ultimately lead to synapse dysfunction. A shift from Aβ-centered hypothesis to synapse-centered hypothesis has emerged [48, 49]. Here, we used gene expression data to define two molecular subtypes of AD and enriched pathways high-lighten synapse dysfunction, which supports this synapse-centered hypothesis. Furthermore, our study implies two mechanisms for synaptic dysfunction. One is the aberrant synaptic pathways themselves, such as synaptic vesicle endocytosis and exocytosis. Another is the indirect mechanism through immune system dysfunction, which may affect Aβ clearance and synaptic pruning.

Using available patient clinical information, we evaluated their associations with molecular subtypes. We didn’t find significant correlation of molecular subtype with severity of cognitive impairment. However, we were unable to control potential confounders due to very limited information available in the dataset. We show that AD molecular subtype is significantly associated with APOE genotype. APOE has three alleles, including E2, E3 and E4. APOE4 is the main genetic determinant for late-onset AD and individual with APOE4 significantly increases the risk of AD [50, 51]. While some studies show APOE4 promotes AD by interaction with Aβ, especially it hinders Aβ clearance [52], other studies link APOE4 with synaptic function, such as synapse recycling [53]. In this study, we observed that synaptic type of AD is more common in patients with E3E4 genotype. Although APOE is not in the list of signature genes, it may regulate synaptic function by interacting with downstream molecules including APOE receptor in the brain. This observation further supports synaptic mechanism of APOE4 in AD development.

We observed that inflammatory type of AD is more prevalent in women. On the other hand, synaptic type of AD is more prevalent in men. Sex differences in both synaptic plasticity and inflammatory response have been observed [54, 55]. Females often have strong both innate and adaptive immune responses [55]. This results in faster clearance of pathogens in females than males, but also contributes to increased susceptibility to inflammatory diseases in females, such as systemic lupus erythematosus and multiple sclerosis [56]. Since inflammation plays a central role in AD development, females are more likely to develop inflammatory type AD than males. Sex difference in dendrite spine density (DSD) in the hippocampus has been observed in animal models decades ago, which is regulated by steroid hormones and environmental stress. The female rats have double of DSD than males and DSD experienced dramatic changes during the estrous cycle [57, 58]. This structural change in the hippocampus was also observed in human women during the menstrual cycle [59]. Many animal studies showed that increased spine density is associated with memory enhancement [60]. Compared to females, males have lower DSD in the hippocampus. Besides, no periodic fluctuation of hormone in males may lead to less synapse plasticity of hippocampal neurons due to lack of “practicing”. We hypothesize that lower DSD and possibly less synapse plasticity may make males more vulnerable to hippocampus damage, which may explain why synaptic type AD is more common in males.

Identification of AD molecular subtype has an implication for better design in clinical trials. Currently, clinical trials for AD are based on different cognitive groups from mild, moderate, and severe AD. However, most of this symptom-based clinical trials for AD fails, reflecting a lack of mechanistic understanding of AD. A recent clinical trial about a monoclonal antibody solanezumab failed the phase III trials for mild to moderate AD [61], but later it was found that it has benefits for a subgroup of patients with mild symptoms [62], supporting that patient subgrouping is important. Molecular subtyping of AD patients provides an attracting strategy for patient stratification in clinical trials. We prospect that including molecular subtype in clinical trial may contribute to discover personalized treatments for AD.

One limitation of this study is that molecular subtyping is based on gene expression data from post-mortem brain tissue, which limits its clinical usage. Nevertheless, identified molecular subtypes will help to understand the mechanism of AD. In the future, developing a practical molecular subtyping system for AD is demanded. Proteomic data from cerebrospinal fluid and genotype data from blood could be useful for such purpose.

Conclusions

In this study, we reported the first gene expression-based molecular subtyping of AD. Using consensus NMF, we identified two robust molecular subtypes-synaptic type and inflammatory type-that represent two fundamental mechanisms of AD. These molecular subtypes are associated with APOE genotype and exhibit sex difference in distribution. Identification of molecular subtypes may have an implication in better clinical trial design and personalized medicine for AD.

Supporting information

S1 Table. Pathways enriched in each cluster from ROSMAP.

(XLSX)

S2 Table. Pathways enriched in in each cluster from GSE44770.

(XLSX)

S3 Table. Pathways enriched in in each cluster from GSE118553.

(XLSX)

Acknowledgments

The results published here are in part based on data obtained from the AD Knowledge Portal (https://adknowledgeportal.org). Study data were provided by the Rush Alzheimer’s Disease Center, Rush University Medical Center, Chicago. Additional phenotypic data can be requested at www.radc.rush.edu. We thank the staff of the Rush Alzheimer’s Disease Center.

Data Availability

Data are available from Dryad: https://doi.org/10.5061/dryad.ttdz08kxq.

Funding Statement

Funding for our research projects was provided by NIH (nih.gov) grants to RX, including DP2HD084068, R01 AG057557-01, R01 AG061388-01 and R56 AG062272-01, and American Cancer Society (cancer.org) grant (RSG-16-049-01-MPC to R. Xu). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Blennow K, de Leon MJ, Zetterberg H. Alzheimer’s disease. Lancet. 2006; 368:387–403. 10.1016/S0140-6736(06)69113-7 [DOI] [PubMed] [Google Scholar]
  • 2.Kramer JH, Miller BL. Alzheimer’s disease and its focal variants. Semin Neurol 2000;20: 447–454. 10.1055/s-2000-13177 [DOI] [PubMed] [Google Scholar]
  • 3.Johnson JK, Head E, Kim R, Starr A, Cotman CW. Clinical and pathological evidence for a frontal variant of Alzheimer disease. Arch Neurol 1999; 56:1233–1239. 10.1001/archneur.56.10.1233 [DOI] [PubMed] [Google Scholar]
  • 4.Butters MA, Lopez OL, Becker JT. Focal temporal lobe dysfunction in probable Alzheimer’s disease predicts a slow rate of cognitive decline. Neurology 1996; 46:687–692. 10.1212/wnl.46.3.687 [DOI] [PubMed] [Google Scholar]
  • 5.Tang-Wai DF, Graff-Radford NR, Boeve BF, et al. Clinical, genetic, and neuropathologic characteristics of posterior cortical atrophy. Neurology 2004; 63:1168–1174. 10.1212/01.wnl.0000140289.18472.15 [DOI] [PubMed] [Google Scholar]
  • 6.Dickerson BC, Wolk DA. Dysexecutive versus amnesic phenotypes of very mild Alzheimer’s disease are associated with distinct clinical, genetic and cortical thinning characteristics. J Neurol Neurosurg Psychiatry 2011; 82:45–51 10.1136/jnnp.2009.199505 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Gefen T, Gasho K, Rademaker A, et al. Clinically concordant variations of Alzheimer pathology in aphasic versus amnestic dementia. Brain 2012; 135:1554–1565. 10.1093/brain/aws076 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Abu-Rumeileh S, Capellari S, Parchi P. Rapidly Progressive Alzheimer’s Disease: Contributions to Clinical Pathological Definition and Diagnosis. J Alzheimers Dis. 2018;63:887–897. 10.3233/JAD-171181 [DOI] [PubMed] [Google Scholar]
  • 9.Di Fede G, Catania M, Maderna E, Ghidoni R, Benussi L, Tonoli E, et al. Molecular subtypes of Alzheimer’s disease. Sci Rep. 2018; 8:3269. 10.1038/s41598-018-21641-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Dujardin S, Commins C, Lathuiliere A, Beerepoot P, Fernandes AR, Kamath TV, et al. Tau molecular diversity contributes to clinical heterogeneity in Alzheimer’s disease. Nat Med. 2020. 10.1038/s41591-020-0938-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Noh Y, Jeon S, Lee JM, Seo SW, Kim GH, Cho H, et al. Anatomical heterogeneity of Alzheimer disease: based on cortical thickness on MRIs. Neurology. 2014; 83:1936–44. 10.1212/WNL.0000000000001003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Na HK, Kang DR, Kim S, Seo SW, Heilman KM, Noh Y, et al. Malignant progression in parietal-dominant atrophy subtype of Alzheimer’s disease occurs independent of onset age. Neurobiol Aging. 2016; 47:149–156 10.1016/j.neurobiolaging.2016.08.001 [DOI] [PubMed] [Google Scholar]
  • 13.Nettiksimmons J, DeCarli C, Landau S, Beckett L; Alzheimer’s Disease Neuroimaging Initiative. Biological heterogeneity in ADNI amnestic mild cognitive impairment. Alzheimers Dement. 2014; 10:511–521.e1. 10.1016/j.jalz.2013.09.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Lambert JC, Ibrahim-Verbaas CA, Harold D, Naj AC, Sims R, Bellenguez C, et al. Meta-analysis of 74,046 individuals identifies 11 new susceptibility loci for Alzheimer’s disease. Nat Genet. 2013; 45:1452–8. 10.1038/ng.2802 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Rosenthal SL, Kamboh MI. Late-Onset Alzheimer’s Disease Genes and the Potentially Implicated Pathways. Curr Genet Med Rep. 2014; 2:85–101. 10.1007/s40142-014-0034-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Jansen IE, Savage JE, Watanabe K, Bryois J, Williams DM, Steinberg S, et al. Genome-wide meta-analysis identifies new loci and functional pathways influencing Alzheimer’s disease risk. Nat Genet. 2019; 51:404–413. 10.1038/s41588-018-0311-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Marioni RE, Harris SE, Zhang Q, McRae AF, Hagenaars SP, Hill WD, et al. Transl Psychiatry. 2018; 8:99. 10.1038/s41398-018-0150-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Kunkle BW, Grenier-Boley B, Sims R, Bis JC, Damotte V, Naj AC, et al. Genetic meta-analysis of diagnosed Alzheimer’s disease identifies new risk loci and implicates Aβ, tau, immunity and lipid processing. Nat Genet. 2019; 51:414–430. 10.1038/s41588-019-0358-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Zhang B, Gaiteri C, Bodea LG, Wang Z, McElwee J, Podtelezhnikov AA, et al. Integrated systems approach identifies genetic nodes and networks in late-onset Alzheimer’s disease. Cell. 2013; 153:707–20. 10.1016/j.cell.2013.03.030 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Wang M, Roussos P, McKenzie A, Zhou X, Kajiwara Y, Brennand KJ, et al. Integrative network analysis of nineteen brain regions identifies molecular signatures and networks underlying selective regional vulnerability to Alzheimer’s disease. Genome Med. 2016; 8:104. 10.1186/s13073-016-0355-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Li H, Leurgans S, Elm J, Gebregziabher M. Statistical Methodology for Multiclass Classifications: Applications to Dementia. J Alzheimers Dis. 2019; 68:173–186. 10.3233/JAD-180580 [DOI] [PubMed] [Google Scholar]
  • 22.Alexiou A, Mantzavinos VD, Greig NH, Kamal MA. A Bayesian Model for the Prediction and Early Diagnosis of Alzheimer’s Disease. Front Aging Neurosci. 2017; 9:77. 10.3389/fnagi.2017.00077 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Khanna S, Domingo-Fernández D, Iyappan A, Emon MA, Hofmann-Apitius M, Fröhlich H. Using Multi-Scale Genetic, Neuroimaging and Clinical Data for Predicting Alzheimer’s Disease and Reconstruction of Relevant Biological Mechanisms. Sci Rep. 2018; 8:11173. 10.1038/s41598-018-29433-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.De Velasco Oriol J, Vallejo EE, Estrada K, Taméz Peña JG, Disease Neuroimaging Initiative TA. Benchmarking machine learning models for late-onset alzheimer’s disease prediction from genomic data. BMC Bioinformatics. 2019; 20:709. 10.1186/s12859-019-3158-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Bennett DA, Schneider JA, Arvanitakis Z, Wilson RS. Overview and findings from the religious orders study. Curr Alzheimer Res. 2012; 9:628–45. 10.2174/156720512801322573 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Brunet JP, Tamayo P, Golub TR, Mesirov JP. Metagenes and molecular pattern discovery using matrix factorization. Proc Natl Acad Sci U S A. 2004; 101:4164–9 10.1073/pnas.0308531101 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Verhaak RG, Hoadley KA, Purdom E, Wang V, Qi Y, Wilkerson MD, et al. Integrated genomic analysis identifies clinically relevant subtypes of glioblastoma characterized by abnormalities in PDGFRA, IDH1, EGFR, and NF1. Cancer Cell. 2010; 17:98–110. 10.1016/j.ccr.2009.12.020 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Sadanandam A, Lyssiotis CA, Homicsko K, Collisson EA, Gibb WJ, Wullschleger S, et al. A colorectal cancer classification system that associates cellular phenotype and responses to therapy. Nat Med. 2013; 19:619–25. 10.1038/nm.3175 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Robinson MD, McCarthy DJ and Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010; 26:139–140 10.1093/bioinformatics/btp616 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.McCarthy DJ, Chen Y and Smyth GK. Differential expression analysis of multifactor RNA-Seq experiments with respect to biological variation. Nucleic Acids Research. 2012; 40:4288–4297 10.1093/nar/gks042 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Lee DD and Seung HS. Algorithms for non-negative matrix factorization. Advances in Neural Information Processing Systems 13 (NIPS 2000).
  • 32.Pedregosa F, Varoquaux Gael, Gramfort A, Michel V, Thirion B, Grisel O, et al. Scikit-learn: Machine learning in Python. Journal of machine learning research. 2011; 12:2825–30. [Google Scholar]
  • 33.Yu G, Wang L, Han Y and He Q. “clusterProfiler: an R package for comparing biological themes among gene clusters.” OMICS: A Journal of Integrative Biology. 2012; 16:284–287. 10.1089/omi.2011.0118 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Halkiki M, Batistakis Y and Vazigiannis M. On Clustering Validation Techniques. Journal of Intelligent Information Systems. 2001; 17:107–145. [Google Scholar]
  • 35.Bennett DA, Schneider JA, Arvanitakis Z, Kelly JF, Aggarwal NT, Shah RC, et al. Neuropathology of older persons without cognitive impairment from two community-based studies. Neurology 2006; 66: 1837–44 10.1212/01.wnl.0000219668.47116.e6 [DOI] [PubMed] [Google Scholar]
  • 36.Braak H, Braak E. Neuropathological staging of Alzheimer-related changes. Acta neuropathologica 1991; 82:239–59 10.1007/BF00308809 [DOI] [PubMed] [Google Scholar]
  • 37.Mirra SS, Heyman A, McKeel D, Sumi SM, Crain BJ, Brownlee LM, et al. Consortium to Establish a Registry for Alzheimer’s Disease (CERAD) Part II. Standardization of the neuropathologic assessment of Alzheimer’s disease. Neurology 1991; 41: 479 10.1212/wnl.41.4.479 [DOI] [PubMed] [Google Scholar]
  • 38.Scheff SW, Price DA, Schmitt FA, Mufson EJ. Hippocampal synaptic loss in early Alzheimer’s disease and mild cognitive impairment. Neurobiol Aging. 2006; 27:1372–84. 10.1016/j.neurobiolaging.2005.09.012 [DOI] [PubMed] [Google Scholar]
  • 39.Scheff SW, Price DA, Schmitt FA, DeKosky ST, Mufson EJ. Synaptic alterations in CA1 in mild Alzheimer disease and mild cognitive impairment. Neurology. 2007; 68:1501–8. 10.1212/01.wnl.0000260698.46517.8f [DOI] [PubMed] [Google Scholar]
  • 40.Counts SE, Alldred MJ, Che S, Ginsberg SD, Mufson EJ. Synaptic gene dysregulation within hippocampal CA1 pyramidal neurons in mild cognitive impairment. Neuropharmacology. 2014; 79:172–9. 10.1016/j.neuropharm.2013.10.018 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Bolós M, Perea JR, Avila J. Alzheimer’s disease as an inflammatory disease. Biomol Concepts. 2017; 8:37–43. 10.1515/bmc-2016-0029 [DOI] [PubMed] [Google Scholar]
  • 42.Kinney JW, Bemiller SM, Murtishaw AS, Leisgang AM, Salazar AM, Lamb BT. Inflammation as a central mechanism in Alzheimer’s disease. Alzheimers Dement (NY). 2018; 4:575–590. 10.1016/j.trci.2018.06.014 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Ginhoux F, Lim S, Hoeffel G, Low D, Huber T. Origin and differentiation of microglia. Front Cell Neurosci. 2013; 7:45. 10.3389/fncel.2013.00045 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Roy ER, Wang B, Wan YW, Chiu G, Cole A, Yin Z, et al. Type I interferon response drives neuroinflammation and synapse loss in Alzheimer disease. J Clin Invest. 2020. 10.1172/JCI133737 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Alves S, Churlaud G, Audrain M, Michaelsen-Preusse K, Fol R, Souchet B, et al. Interleukin-2 improves amyloid pathology, synaptic failure and memory in Alzheimer’s disease mice. Brain. 2017; 140:826–842. 10.1093/brain/aww330 [DOI] [PubMed] [Google Scholar]
  • 46.Paolicelli RC, Bolasco G, Pagani F, Maggi L, Scianni M, Panzanelli P, et al. Synaptic pruning by microglia is necessary for normal brain development. Science. 2011; 333:1456–1458. 10.1126/science.1202529 [DOI] [PubMed] [Google Scholar]
  • 47.Schafer DP, Lehrman EK, Kautzman AG, Koyama R, Mardinly AR, Yamasaki R,et al. Microglia sculpt postnatal neural circuits in an activity and complement-dependen manner. Neuron. 2012; 74:691–705. 10.1016/j.neuron.2012.03.026 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Dourlen P, Kilinc D, Malmanche N, Chapuis J, Lambert JC. The new genetic landscape of Alzheimer’s disease: from amyloid cascade to genetically driven synaptic failure hypothesis? Acta Neuropathol. 2019; 138:221–236. 10.1007/s00401-019-02004-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Bellenguez C, Grenier-Boley B, Lambert JC. Genetics of Alzheimer’s disease: where we are, and where we are going. Curr Opin Neurobiol. 2020; 6:40–48. [DOI] [PubMed] [Google Scholar]
  • 50.Corder EH. Saunders AM. Strittmatter WJ, et al. Gene dose of apolipoprotein E type 4 allele and the risk of Alzheimer’s disease in late onset families. Science. 1993; 261:921–3. 10.1126/science.8346443 [DOI] [PubMed] [Google Scholar]
  • 51.Farrer LA. Cupples LA. Haines JL, et al. Effects of age, sex, and ethnicity on the association between apolipoprotein E genotype and Alzheimer disease. A meta-analysis. APOE and Alzheimer Disease Meta Analysis Consortium. JAMA. 1997; 278:1349–56 [PubMed] [Google Scholar]
  • 52.Castellano JM, Kim J, Stewart FR, Jiang H, DeMattos RB, Patterson BW, et al. Human apoE isoforms differentially regulate brain amyloid-β peptide clearance. Sci Transl Med. 2011; 3:89ra57 10.1126/scitranslmed.3002156 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Lane-Donovan C, Herz J. ApoE, ApoE Receptors, and the Synapse in Alzheimer’s Disease. Trends Endocrinol Metab. 2017; 28:273–284. 10.1016/j.tem.2016.12.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Hyer MM, Phillips LL, Neigh GN. Sex Differences in Synaptic Plasticity: Hormones and Beyond. Front Mol Neurosci. 2018; 11:266. 10.3389/fnmol.2018.00266 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Klein SL, Flanagan KL. Sex differences in immune responses. Nat Rev Immunol. 2016;16(10):626–38. 10.1038/nri.2016.90 [DOI] [PubMed] [Google Scholar]
  • 56.Fairweather D, Frisancho-Kiss S, Rose NR. Sex differences in autoimmune disease from a pathological perspective. Am J Pathol. 2008; 173:600–609. 10.2353/ajpath.2008.071008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Woolley CS, Gould E, Frankfurt M, McEwen BS. Naturally occurring fluctuation in dendritic spine density on adult hippocampal pyramidal neurons. J Neurosci. 1990;10:4035–9. 10.1523/JNEUROSCI.10-12-04035.1990 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Shors TJ, Chua C, Falduto J. Sex differences and opposite effects of stresson dendritic spine density in the male versus female hippocampus. J Neurosci. 2001; 21:6292–7. 10.1523/JNEUROSCI.21-16-06292.2001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Protopopescu X, Butler T, Pan H, Root J, Altemus M, Polanecsky M, McEwen B, Silbersweig D, Stern E. Hippocampal structural changes across the menstrual cycle. Hippocampus. 2008; 18:985–8. 10.1002/hipo.20468 [DOI] [PubMed] [Google Scholar]
  • 60.Farrell MR, Gruene TM, Shansky RM. The influence of stress and gonadal hormones on neuronal structure and function. Horm Behav. 2015; 76:118–124. 10.1016/j.yhbeh.2015.03.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Doody RS. Thomas RG. Farlow M, et al. Phase 3 trials of solanezumab for mild-to-moderate Alzheimer’s disease. N Engl J Med. 2014; 370:311–21 10.1056/NEJMoa1312889 [DOI] [PubMed] [Google Scholar]
  • 62.Siemers ER. Sundell KL. Carlson C, et al. Phase 3 solanezumab trials: Secondary outcomes in mild Alzheimer’s disease patients. Alzheimers Dement. 2016; 12:110–20. 10.1016/j.jalz.2015.06.1893 [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 Table. Pathways enriched in each cluster from ROSMAP.

(XLSX)

S2 Table. Pathways enriched in in each cluster from GSE44770.

(XLSX)

S3 Table. Pathways enriched in in each cluster from GSE118553.

(XLSX)

Data Availability Statement

Data are available from Dryad: https://doi.org/10.5061/dryad.ttdz08kxq.


Articles from PLoS ONE are provided here courtesy of PLOS

RESOURCES