Skip to main content
NPJ Aging logoLink to NPJ Aging
. 2022 Nov 4;8(1):15. doi: 10.1038/s41514-022-00096-9

Identification of potential blood biomarkers for early diagnosis of Alzheimer’s disease through immune landscape analysis

Daichi Shigemizu 1,2,, Shintaro Akiyama 1, Risa Mitsumori 1, Shumpei Niida 3, Kouichi Ozaki 1,2,
PMCID: PMC9636153  PMID: 36333348

Abstract

Mild cognitive impairment (MCI) is a clinical precursor of Alzheimer’s disease (AD). Recent genetic studies have reported on associations between AD risk genes and immunity. Here, we obtained samples and data from 317 AD, 432 MCI, and 107 cognitively normal (CN) subjects and investigated immune-cell type composition and immune clonal diversity of T-cell receptor (TRA, TRB, TRG, and TRD) and B-cell receptor (IGH, IGK, and IGL) repertoires through bulk RNA sequencing. We found the proportions of plasma cells, γδ T cells, neutrophils, and B cells were significantly different and the diversities of IGH, IGK, and TRA were significantly small with AD progression. We then identified a differentially expressed gene, WDR37, in terms of risk of MCI-to-AD conversion. Our prognosis prediction model using the potential blood-based biomarkers for early AD diagnosis, which combined two immune repertoires (IGK and TRA), WDR37, and clinical information, successfully classified MCI patients into two groups, low and high, in terms of risk of MCI-to-AD conversion (log-rank test P = 2.57e-3). It achieved a concordance index of 0.694 in a discovery cohort and of 0.643 in an independent validation cohort. We believe that further investigation, using larger sample sizes, will lead to practical clinical use in the near future.

Subject terms: Neural ageing, Biomarkers, Risk factors, Alzheimer's disease

Introduction

Mild cognitive impairment (MCI) is an intermediate stage of cognitive impairment between normal aging and dementia and is associated with an increased risk of developing clinically probable Alzheimer’s disease (AD)13. The annual conversion rate from MCI to AD is approximately 10% to 15%4, and most MCI patients convert to AD within 5 years from diagnosis (i.e., they are MCI converters: MCI-C)5,6; however, some MCI patients remain stable or convert back to being cognitively normal (CN) (i.e., they are MCI non-converters: MCI-NC)7. There are four drugs approved by the US Food and Drug Administration (FDA) for the management of cognitive impairment and dysfunction in symptomatic AD (three cholinesterase inhibitors: donepezil, rivastigmine, and galanthamine, and a glutamate regulator: memantine), although they can only help lessen symptoms, such as memory loss and confusion8. Currently the FDA has only approved aducanumab as a therapy drug for AD9. However, the current best strategy is to delay disease progression to reduce the number of patients who ultimately develop AD10. Therefore, promising biomarkers for early detection of MCI-C are urgently required because the accurate prediction of MCI-to-AD conversion would enable earlier interventions for MCI-C patients, which could lead to a reduction in MCI patients at high risk for converting to AD.

The most common multifactorial neurodegenerative disease, AD is induced by a complex interaction between genetic and environmental factors. The heritability of AD—that is, the presence of genetic risk factors—is estimated to be 60% to 80%11. A large number of genetic factors undoubtedly contribute to the etiopathogenesis and progression of AD, and some of them have been identified via whole-genome sequencing analyses12,13 and genome-wide association studies1416. The AD risk genes are implicated in the immune response (CLU, CR1, CD33, EPHA1, MS4A4E/MS4A6A, ABCA7, PTK2B, TREM2, and TREML2), endocytosis (BIN1, PICALM, and CD2AP), and lipid processing (APOE, ABCA7, and SORL1)17,18. Thus, a large percentage of the detected AD risk genes are associated with immunity. In addition, recent clinical observations have revealed that neutrophils, essential for executing the acute inflammatory response, contribute to AD pathogenesis and cognitive impairment19,20. Therefore, there is consensus that the immune system is intimately involved in AD pathology, but it has yet to be established which components of the immune system actively contribute to its development.

The human adaptive immune system provides protection against an enormous variety of pathogens. The protection is mediated by receptors on the surfaces of T cells (TCRs) and B cells (BCRs). The complementarity determining region 3 (CDR3) of TCRs and BCRs is the main determinant of specificity for antigen recognition21. The diversity of the TCR and BCR repertoires is established during development through recombination of variable (V), diversity (D), and joining (J) genes (VDJ recombination) and gene insertion/deletion22,23. Previous studies have reported that the repertoire diversity plays a critical role in several diseases, including cancers24, autoimmune diseases25, and neurodegenerative diseases26.

High-throughput next-generation sequencing platforms enable a comprehensive assessment of the TCR and BCR repertoires, and various methodologies have been developed for the analysis of TCR and BCR repertoires from bulk RNA sequencing (RNA-seq) data (e.g., MiXCR27 and TRUST428). Here, we performed large-scale RNA-seq transcriptome analyses using a large number of samples from persons with AD or MCI or who were CN, to detect blood-based biomarkers for early AD diagnosis. We investigated the differences in immune-cell type composition and immune clonal diversity of TCR and BCR repertoires using bulk RNA-seq data of samples with the three phenotypes (AD, MCI, and CN). We further characterized differentially expressed genes (DEGs) in the RNA-seq data between MCI-C and MCI-NC samples (i.e., prospective data). Prognosis prediction models were applied to the prospective data based on clinical information (age, sex, and APOE ε4 genotypes) and combinations of the immune-related biomarker candidates. Our final prognosis prediction model—composed of two immune repertoires (IGK and TRA), one DEG (WDR37), and clinical information—successfully classified the MCI patients in an independent validation cohort into two groups, high and low, in terms of risk of MCI-to-AD conversion. We believe that further investigation, using a larger sample size, will contribute to future practical clinical use in healthcare.

Results

RNA-sequencing data

The study included 856 samples: 317 AD, 432 MCI, and 107 CN. The average ages of the individuals from whom the AD, MCI, and CN samples were obtained were 79.3 years (SD = 5.6 years), 76.9 years (SD = 6.2 years), and 70.8 years (SD = 5.7 years), respectively, and the percentages of male subjects were 31, 42 and 51, respectively. RNA-seq analysis was performed on all samples by using the Illumina NovaSeq6000 platform. Averages of 42.9, 42.8, and 44.9 million raw read sequences were obtained from the AD, MCI, and CN samples, respectively, of which >99.5% were high quality (>Q20). After low-quality read sequences and trimmed reads with adaptor sequences were discarded, >42.3 million reads of cleaned data remained, of which >82.5% were uniquely mapped to the human reference genome (GRCh37) in the three phenotype groups (Table 1).

Table 1.

Statistical summary of mapping resultsa.

Characteristic AD MCI CN
Raw reads, n 42,851,964 ± 11,727,451 42,755,728 ± 12,595,261 44,883,385 ± 10,635,281
Q20, % 99.60 ± 0.47 99.53 ± 0.58 99.62 ± 0.44
Cleaned raw reads, n 42,378,025 ± 11,785,689 42,442,969 ± 12,532,294 44,241,452 ± 10,721,761
Unique mapped reads, % 82.54 ± 3.95 82.51 ± 3.85 82.77 ± 3.79
Multiple mapped reads, % 13.90 ± 3.68 14.05 ± 3.38 13.38 ± 3.49

AD Alzheimer’s disease, MCI mild cognitive impairment, CN cognitively normal.

aAll values are mean ± SD.

Immune-cell type composition

We used the RNA-seq data to compare cell-type distribution among the AD, MCI, and CN samples. CIBERSORT29 estimated the relative proportions (as TPM) of 12 major types of immune cells (see Methods) in each sample. Statistically significant differences in cell-type proportions among the three phenotypes were assessed with the Jonckheere–Terpstra trend test. Of the 12 immune-cell types, four showed statistically significant differences among the three phenotypes at a false discovery rate (FDR) < 0.05. The proportions of plasma cells, γδ T cells, and neutrophils were significantly increased in AD progression (plasma cells, 0.026; γδ T cells, 0.034; neutrophils, 0.0024; Fig. 1) and the proportion of B cells was significantly decreased in AD progression (B cells, 0.0048; Fig. 1). We also examined the differential composition of immune-cell type among the three phenotypes by using MCP-counter30, but CIBERSORT detected more significant immune-cell types than MCP-counter (T cells, 0.002; B lineage, 0.016; Supplementary Fig. 1).

Fig. 1. Proportions of the 12 major immune-cell types among samples from patients in each group (AD, MCI, and CN).

Fig. 1

Comparison of cell types among samples from patients with Alzheimer’s disease (AD), mild cognitive impairment (MCI), and normal cognition (CN) (*FDR < 0.05, Jonckheere–Terpstra trend test). Data are represented as box and whisker plots, depicting minimum, lower quartile (Q1), median (Q2), upper quartile (Q3), and maximum values.

Clonal diversity of T-cell receptor and B-cell receptor repertoires

The adaptive immune system is organized into two classes of specialized lymphocytes, T cells and B cells, which generate repertoires of T-cell receptors (TCRs) and B-cell receptors (BCRs) with sufficient diversity to recognize the universe of potential pathogens. We investigated if there were differences in proportions and diversities of TCR and BCR repertoires both at older ages and with AD progression. TRUST4 software28 estimated the proportions and diversities of 4 TCR (TRA, TRB, TRG, and TRD) and 3 BCR (IGH, IGK, and IGL) repertoires from bulk RNA-seq in each sample (see Methods). A linear regression model was used to identify statistically significant increases or decreases in the proportion and clonal diversity of TCR and BCR repertoires among ages and among phenotypes. Although no statistically significant differences were seen in the proportions of the TCR and BCR repertoires at older ages (Fig. 2a) or with AD progression (Fig. 2b), statistically significant decreases were observed in the diversities of their repertoires both at older ages (Fig. 3a) and with AD progression (Fig. 4a). The diversities of TCR repertories were significantly smaller at older ages in both sexes (Fig. 3b), whereas those of BCR repertories were significantly smaller at older ages only in male subjects at an FDR < 0.05 (IGH, 0.01; IGK, 5 × 10−4; IGL, 0.02; Fig. 3c). These results indicated that age and sex were associated with the diversities of TCR and BCR repertoires. Therefore, when assessing the associations of the diversities of their repertoires in patients with AD, we used a linear regression with adjustment for age and sex, and we found that the diversities of IGH, IGK, and TRA were statistically significantly smaller in those with AD progression at an FDR < 0.05 (IGH, 0.006; IGK, 0.016; TRA, 0.003; Fig. 4b, c).

Fig. 2. Proportion of TCR and BCR repertoires among ages and among phenotypes.

Fig. 2

Radar chart showing the proportions of the TCR and BCR repertories among ages (a) and among phenotypes (b).

Fig. 3. Clonal diversity of TCR and BCR repertoires among ages.

Fig. 3

(a) Radar chart showing a difference in clonal diversity of the TCR and BCR repertories among ages. (b, c) Differences in clonal diversity of the TCR (b) and BCR (c) repertoires among ages. A linear regression model was used to identify statistically significant increases or decreases in the clonal diversity of TCR and BCR repertoires among ages for each sex. The diversities of TCR repertories were significantly smaller at older ages in both sexes (b), whereas those of BCR repertories were significantly smaller at older ages only in male subjects at an adjusted P < 0.05 (c). Pearson correlation coefficient is represented by R.

Fig. 4. Clonal diversity of TCR and BCR repertories among phenotypes.

Fig. 4

(a) Radar chart showing a difference in clonal diversity of the TCR and BCR repertories among phenotypes. Difference in clonal diversity of the TCR (b) and BCR (c) repertoires among phenotypes. A linear regression model with adjustment for age and sex was used to identify statistically significant increases or decreases in the clonal diversity of TCR and BCR repertoires among phenotypes. The diversities of IGH, IGK, and TRA were statistically significantly smaller in those with AD progression at an adjusted P < 0.05 (b, c). Data are represented as box and whisker plots, depicting minimum, lower quartile (Q1), median (Q2), upper quartile (Q3), and maximum values (b, c).

Identification of differentially expressed genes

Of the 432 MCI patients whose samples were used in RNA-seq analysis, 145 were followed for at least 6 months and up to 6 years (mean ± SD, 896.0 ± 468.2 days), during which time 52 (35.86%) converted to AD (i.e., were classified as MCI-C), and the remaining 93 MCI patients (64.14%) were classified as MCI-NC. The average ages of the MCI-C and MCI-NC subjects were 76.4 years (SD = 5.6 years) and 76.9 years (SD = 6.3 years), and the percentages of male subjects were 37 and 42, respectively (Table 2). To identify biomarker candidates for early AD diagnosis, we examined DEGs between MCI-C and MCI-NC samples from 19,702 genes with a threshold of counts per million reads mapped (CPM) > 1 in more than one-fourth of all sequenced samples using the ‘exactTest’ function in edgeR. Although no DEGs were statistically different between groups (i.e., FDR < 0.05 and fold change > 1.2), we focused on two candidate genes, with Entrez Gene IDs, showing suggestive associations (i.e., FDR < 0.2 and fold change > 1.2: SPCS1, FC = 1.28, P = 1.43 × 10−5, FDR = 0.11; WDR37, FC = 1.30, P = 9.56 × 10−5, FDR = 0.11). The expression of these candidate genes was then validated in brain tissues as well as peripheral blood mononuclear cells from the Human Protein Atlas database31, which provides quantitative transcriptomics at the tissue and organ level and is publicly accessible at http://www.proteinatlas.org (Fig. 5a). We also examined the differential composition of immune-cell type and clonal diversity of TCR and BCR between MCI-C and MCI-NC samples, but no statistically significant differences were observed between the groups in the composition of immune-cell types and clonal diversity of TCRs and BCRs at a false discovery rate (FDR) < 0.05 (Supplementary Table 1).

Table 2.

Summary of sample characteristics for MCI-C and MCI-NC.

Characteristic MCI-C MCI-NC All
Number of samples 52 93 145
Male, % (n) 37 (19) 42 (39) 40 (58)
Age, y, mean ± SD 76.4 ± 5.6 76.9 ± 6.3 76.7 ± 6.1
APOE ε4 genotypes, n (No. of patients) 0 (29), 1 (20), 2 (3) 0 (65), 1 (26), 2 (2) 0 (94), 1 (46), 2 (5)
Follow-up, mean ± SD 913.5 ± 433.4 886.3 ± 488.6 896.0 ± 468.2

APOE apolipoprotein E, MCI, mild cognitive impairment, MCI-C converter from MCI to Alzheimer’s disease, MCI-NC nonconverter from MCI to Alzheimer’s disease.

Fig. 5. Genes differentially expressed between MCI-C and MCI-NC.

Fig. 5

(a) The expression of two DEGs in brain tissues was validated in the Human Protein Atlas database. The x-axis represents the consensus normalized expression (nTPM) values. (b) DEGs were validated by using qRT-PCR (n = 20; 10 MCI-C and 10 MCI-NC). (c) High-density lipoprotein (HDL) cholesterol, low-density lipoprotein (LDL) cholesterol, non-HDL cholesterol, total cholesterol (T-CHO), and triglycerides (TG) were measured in routine blood tests. Their differences were examined between MCI-C and MCI-NC samples (n = 145; 52 MCI-C and 93 MCI-NC). Welch’s t-test was used to identify statistically significant difference between MCI-C and MCI-NC samples in the two DEGs and five items measured in routine blood tests (b, c). Data are represented as box and whisker plots, depicting minimum, lower quartile (Q1), median (Q2), upper quartile (Q3), and maximum values (b, c).

To validate our RNA-seq results, we used quantitative RT-PCR (qRT-PCR) analysis and evaluated the candidate genes detected (SPCS1 and WDR37) by comparing the expression of 10 randomly selected MCI-C samples to those of 10 randomly selected MCI-NC samples. The SPCS1 gene was not validated by the qRT-PCR (Welch’s t-test P = 0.75), whereas qRT-PCR showed a modest association for the WDR37 gene (Welch’s t-test P = 0.063, Fig. 5b). Because WDR37 may play a key role in cholesterol biosynthesis32, we investigated high-density lipoprotein (HDL) cholesterol, low-density lipoprotein cholesterol, non-HDL cholesterol, total cholesterol, and triglycerides measured in routine blood tests, but no statistically significant differences in these markers were found between MCI-C and MCI-NC samples (Fig. 5c). We further assessed associations between genetic variants on the WDR37 gene (chr10:1102325 to 1178312) and MCI-to-AD conversion by using whole-genome sequencing data of 10 MCI-C and 17 MCI-NC samples, downloaded from the National Center for Geriatrics and Gerontology (NCGG) Biobank database. Statistically significant differences in the 271 genetic variants detected, 245 single nucleotide variants (SNVs) and 26 insertions/deletions (indels) between MCI-C and MCI-NC samples were assessed with the Fisher’s exact test, but no statistically significant associations were observed between MCI-C and MCI-NC samples in the genetic variants (Supplementary Table 2).

Prognosis prediction model construction

We examined blood-based biomarker candidates for early AD diagnosis through immune-cell composition, immune repertoire, and DEG analyses. In order to detect potential biomarkers from the candidates, we attempted to establish prognosis prediction models using clinical information (age, sex, and APOE ε4 genotype) and a combination of candidate markers (i.e., plasma cells, γδ T cells, neutrophils, and B cells from immune-cell compositional analyses; IGH, IGK, and TRA from the immune repertoire analyses; and WDR37 from the DEG analyses). Prognosis prediction models were applied to a discovery cohort of 73 subjects (26 MCI-C and 47 MCI-NC). Four-fifths of the entire discovery cohort data was used for the model construction with a Cox proportional hazard model. The remaining fifth of the entire discovery cohort was used for the evaluation of the adjusted model. We used the average C-index to detect potential biomarkers. The highest C-index was observed in the fivefold cross-validation of the discovery cohort when three candidates (IGK, TRA, and WDR37) were used. Our final prognosis prediction model was constructed from these three potential biomarkers and clinical information using the entire discovery cohort. The adjusted model was then evaluated on a validation cohort of 72 subjects (26 MCI-C and 46 MCI-NC), which was completely independent from the discovery cohort. Our final model achieved a C-index of 0.694 in the discovery cohort and of 0.643 in the validation cohort (Fig. 6).

Fig. 6. Kaplan–Meier curves of survival without conversion to AD produced by the prediction models.

Fig. 6

We calculated a prognostic index for each subject by applying three biomarkers (IGK, TRA, and WDR37) and clinical factors to our prognosis prediction model. (a) Based on the prognostic index, we divided the samples of the discovery cohort into high- (red) and low-risk (blue) groups. The optimal cutoff values were detected by using the minimum P-value from the log-rank test and comparing the differences in survival without MCI-to-AD conversion as determined by Kaplan-Meier curves (optimal cutoff = –0.0969, minimum P = 1.94 × 10−4). (b) The adjusted model was then evaluated on the validation cohort (log-rank test P = 2.57 × 10−3).

We further calculated a prognostic index assigned to each patient by applying the three potential biomarkers and clinical information to our prognosis prediction model. Based on the prognostic index, we divided samples of the discovery cohort into high- and low-risk groups. The optimal cutoff value was detected by using the minimum log-rank trend test P-value and comparing the differences in MCI-to-AD conversion–free survival as determined by Kaplan-Meier curves (optimal cutoff = -0.0969, minimum P = 1.94 × 10−4, Fig. 6a). This adjusted model was then able to successfully classify MCI patients in the validation cohort into low and high groups in terms of risk of MCI-to-AD conversion (log-rank test P = 2.57 × 10−3, Fig. 6b).

Discussion

Although blood-based biomarkers for early diagnosis have been examined in many diseases, including AD3335, there have been no robust and reliable blood biomarkers discovered that are used in routine clinical practice for AD so far. As a powerful approach to detect blood-based biomarkers, next-generation RNA-seq in human peripheral blood mononuclear cells allows a comprehensive analysis of the entire transcriptome, but many of the previous studies used only a small number of samples, especially for AD.

Most MCI patients convert to AD within 5 years of diagnosis (MCI-C)5,6; however, some MCI patients remain stable or convert back to being cognitively normal (MCI-NC)7. If accurate biomarkers that can classify MCI patients into low and high risk of MCI-to-AD conversion exist, it would enable early and targeted interventions for MCI-C patients, which could lead to a reduction in MCI patients at high risk for converting to AD. To detect potential blood-based biomarkers that could be used for early AD diagnosis, we performed comprehensive RNA-seq analysis with a large number of samples. We investigated the differences in immune-cell type composition and clonal diversity of TCR and BCR repertoires through bulk RNA-seq data from patients with AD, MCI, and CN. Of 12 major immune-cell types, four (B cells, plasma cells, γδ T cells, and neutrophils) had a significantly higher or lower cell-type proportion among the three phenotypes. It has been reported that neutrophils contribute to AD pathogenesis and cognitive impairment19,20, and the proportion of neutrophils might have the potential to be a blood biomarker of early AD diagnosis. However, the proportions of the remaining three cell types were very low in all samples, making it difficult to determine whether they were truly associated with the AD progression.

Age has been associated with a decrease in diversity of the TCR and BCR repertoires36,37, and our RNA-seq data analyses confirmed this. However, our data further showed that the diversity of BCR repertories was significantly lower at older ages only in males, whereas that of TCR repertoires showed statistically significant decreases at older ages in both sexes. In addition, we found that the diversity of IGH, IGK, and TRA was significantly lower in those diagnosed with AD, and a DEG (WDR37) showed a suggestive difference between MCI-C and MCI-NC samples. These results support the idea that the immune system could be used effectively for early prediction of AD diagnosis.

To identify potential biomarkers from the candidate immune-related biomarkers, we attempted to establish prognosis prediction models by using clinical information and a combination of the candidates described above. Our final prognosis prediction model achieved the highest C-index of 0.643 in the validation cohort (completely independent from the discovery cohort) when three candidates (IGK, TRA, and WDR37) were used.

Our study provides new information on the changes in BCR diversity associated with AD. Xu et al. previously reported that the diversity of TCR repertoires was significantly lower in samples from AD patients than in samples from CN individuals, as analyzed with single-cell RNA-seq26. The remaining potential biomarker, the WDR37 gene, encodes a member of the WD40-repeat protein family and is broadly expressed in neurons of adult brains. Recent studies have shown that WDR37 might underlie a syndromic neurological disorder38. Reis et al. have reported that WDR37 may play a key role in cholesterol biosynthesis32, and pervasive abnormality in this system is associated with AD39. These results provide compelling evidence that our findings could be potential biomarkers for early detection of MCI-to-AD conversion.

We proposed an MCI-to-AD conversion prediction model based on a Cox proportional hazard method through immune landscape analysis. Although our prediction model using only biomarkers identified in this study might be insufficient to early diagnosis of AD, our findings can serve as potential biomarkers for predicting disease prognosis. In previous studies, we reported that 24 miR-eQTLs (the relationship between SNPs and miRNA expression) and three clinical factors (age, sex, and APOE4 alleles) successfully classified MCI patients into low and high risk of MCI-to-AD conversion13. Moreover, recent survival analyses have showed that plasma phosphorylated tau (i.e. P-tau181 and P-tau217) and neurofilament light (NfL) could be effective biomarkers for separating MCI patients who converted to AD from those that did not4042, though the AUCs were modest (P-tau181 = 0.77, NfL = 0.62)43. Therefore, we believe that omics analyses, using additional data including genetic variations (single nucleotide polymorphisms: SNPs, and insertions and deletions: indels) and mRNA and miRNA expressions, plasma P-tau and NfL, as well as immune-cell type composition and immune clonal diversity detected in this immune landscape analysis, will contribute to further improvement of the prognosis prediction model. As these biomarkers might also enable prediction of not only MCI-to-AD conversion, but also Amyloid/Tau/Neurodegeneration (ATN) pathology group, proposed as a means of evidencing the biological state of AD, further investigation using a larger number of samples will be required.

The main limitation of the current analyses is that it is difficult to collect many MCI-C and MCI-NC, so that our prediction model based on immune repertoires was constructed using a limited sample size. In the future, we will perform further investigations with a larger sample size and will validate the effectiveness of this classifier.

Our final prediction model based on immune repertoires successfully classified MCI patients as having low or high risk of MCI-to-AD conversion and achieved a high C-index on an independent validation cohort. Our findings showed that changes in the immune system, and specifically the immune repertoires associated with AD, could contribute to early prediction of conversion to AD. Accurate prediction of MCI-to-AD conversion would enable earlier intervention for MCI patients at high risk, potentially reducing conversion to AD.

Methods

Ethics approval and consent to participate

This study protocol was approved by the ethics committee of the NCGG and was done following the guidelines from the Helsinki Declaration. The design and performance of the current study involving human subjects were clearly described in a research protocol. All participation in the Biobank is voluntary, and all donors completed informed consent in writing before registering with the NCGG Biobank, and no direct or indirect identifiers have been used in reporting this manuscript.

Sample collection

We obtained 856 blood samples with measured mRNA expression and their associated clinical data from the NCGG Biobank. Of the samples, 317 were from patients with AD, 432 were from patients with MCI, and 107 were from CN donors. Of the 432 patients with MCI, 145 were followed for more than half a year, and 52 of these 145 patients converted to AD (i.e., were MCI-C), and the remaining 93 patients remained stable with MCI (i.e., were MCI-NC). The patients from whom the AD and MCI samples were obtained were diagnosed with probable or possible AD by using the criteria of the National Institute on Aging Alzheimer’s Association workgroups1,2. Only samples from patients with probable AD were used as AD samples in this study. The CN samples were obtained from patients who had subjective cognitive complaints but normal cognition on a neuropsychological assessment. The diagnosis of all samples was conducted based on medical history, physical examination and diagnostic tests, neurological examination, neuropsychological tests and brain imaging with magnetic resonance imaging or computerized tomography by experts including neurologists, psychiatrists, geriatricians or a neurosurgeon, all of which are experts in dementia and familiar with its diagnostic criteria. Comprehensive neuropsychological tests included Mini-Mental State Examination (MMSE), Alzheimer’s Disease Assessment Scale Cognitive Component Japanese version, Logical Memory I and II from the Wechsler Memory Scale–Revised, frontal assessment battery, Raven’s colored progressive matrices and Geriatric Depression Scale. All samples were from men and women who were ≥60 years old at the time of testing. All CN samples had a MMSE score of >23.

cDNA library preparation and RNA sequencing

Buffy coat samples were isolated from whole blood following the standard operating procedure of the NCGG Biobank44. Only high-quality samples with an RNA Integrity Number (RIN) ≥ 6.0 were used to construct the sequencing library. Sequencing libraries were prepared with 500 μg of total RNA for each sample by using Illumina TruSeq Stranded Total RNA with Ribo-Zero Globin and IDT for Illumina TruSeq UD Indexes in accordance with the manufacturer’s instructions (Illumina, San Diego, CA). The libraries were subsequently sequenced by using the Illumina NovaSeq6000 platform with paired-end reads of 151 bp in accordance with the manufacturer’s instructions.

RNA-sequencing data analysis

All RNA-seq data were downloaded from the NCGG Biobank database. The quality of the read sequences (fastq files) was assessed with FastQC (ver. 0.11.7) and Cutadapt (ver. 1.16)20. The remaining clean, sequenced reads were mapped to the human reference genome (GRCh37) with STAR45 (ver. 2.5.2b). Read counts for each gene were calculated with the featureCounts program46 from the subread package (ver. 1.6.6) to generate expression levels. The read counts from each sample were combined into a count file, on which differential expression analysis was performed with edgeR47 (ver. 3.18.1). The ‘caclNormFactors’ function in edgeR was used to obtain TMM (trimmed mean of M-values) normalization factors to account for library sizes. The ‘exactTest’ function in edgeR was applied to obtain DEGs between MCI-C and MCI-NC samples.

Proportion of immune-cell types

Cell-type quantification methods can be conceptually distinguished into deconvolution-based approaches and marker-gene-based approaches. We used CIBERSORT29 from the deconvolution-based approaches and MCP-counter30 from the marker-gene-based approaches. After we used STAR45 to align the RNA-seq reads to the human reference genome, quantification in transcripts per million (TPM) was performed with RSEM48 (ver. 1.3.0). The TPMs were suitable for use with the CIBERSORT (ver. 1.0.1) and MCP-counter (ver. 1.2.0). CIBERSORT estimated the proportions of 22 immune-cell types, and we further categorized the 22 cell types into 12 major cell types20 by summing the proportions. MCP-counter allowed robust quantification of the absolute abundance of 10 immune and stromal cell populations.

Detection of immune receptor repertoires

Immune receptor repertoires in T cells and B cells from RNA-seq data were detected using TRUST4 software28 (v1.0.5), in which the inferred CDR3 clonotypes included αβ/γδ TCRs (TRA, TRB, TRG, and TRD) and BCRs (IGH, IGK, and IGL). The clonal diversity of TCRs and BCRs was estimated by using an inverse Simpson index and was calculated by using VDJtools49 (v1.2.1). A linear regression model was used to identify statistically significant increases or decreases in the proportion and clonal diversity of TCR and BCR repertoires among ages and among phenotypes.

Prognosis prediction model construction

All data were strictly separated into a discovery cohort and a validation cohort. Four-fifths of the entire discovery cohort was used to construct a prognosis prediction model based on a Cox proportional hazard model by using a combination of candidate biomarkers and clinical information (age, sex, and APOE ε4 genotypes). The adjusted model was then evaluated using the remaining fifth of the discovery cohort. This process was repeated five times (5-fold cross validation). On the basis of the average concordance index (C-index), we determined the optimal combination of candidate biomarkers for model construction. The final model was constructed with the entire discovery cohort.

Using a combination of candidate biomarkers and clinical information, we calculated a prognostic index for each sample using the discovery cohort. We classified the samples into two groups (high and low risk) according to an optimal cutoff value of the prognostic index50. The optimal cutoff value was defined as the minimum log-rank trend test P-value when differences between high- and low-risk groups were compared in the discovery cohort. The optimal cutoff value was used for the validation of our prognosis prediction model. Kaplan–Meier curves were constructed to illustrate differences in MCI-to-AD conversion–free survival. The log-rank test was used to compare the different conditions. A P-value of 0.05 or less was considered statistically significant.

Verification of quantitative RT-PCR assay

cDNA was synthesized by using a PrimeScript II 1st Strand cDNA Synthesis Kit (Takara Bio, Shiga, Japan). qRT-PCR analysis was performed by using TB Green Premix Ex Taq II (Takara Bio, Shiga, Japan) and the Quantstudio7 Flex Real-Time PCR System (Thermo Fisher, Waltham, MA). The following commercially available PCR primers (forward and reverse, 5ʹ to 3ʹ) were used for gene expression analysis: SPCS1 (CCAGGCTGCTGACACTTCCT, GAGCATTAGGTGGTTTTAGCTCTTATCTG), and WDR37 (CTGCATCAGCCGATCACACG, CGTATCTCCAGATATGAGCAGTCTG). Human beta-2-microglobulin (hB2M) was preselected as a reference gene for normalization of target gene expression levels. Gene expression levels from qRT-PCR were calculated relative to hB2M by using the semiquantitative method51. Gene expression levels were obtained for 10 MCI-C and 10 MCI-NC randomly selected samples. This experiment was independently performed three times for each gene.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Supplementary information

Supplementary Information (123.5KB, pdf)
Reporting Summary (1.5MB, pdf)

Acknowledgements

We thank the NCGG Biobank for providing the study materials, clinical information, and technical support. This study was supported by grants from AMED (Grant Numbers JP18kk0205009 and JP18kk0205012 to S.N., Grant Numbers JP21dk0207045 and JP22dk0207060 to S.N. and K.O., Grant Numbers JP21de0107002 and JP21dk0207052 to S.N. and D.S., and Grant Number JP21km0405501 to K.O.); Research Funding for Longevity Sciences from the NCGG (21-23 to K.O. and 21-24 to D.S.); JSPS KAKENHI (Grant Number JP21H02470 to D.S.); The Hori Science and Arts Foundation (to D.S.); The Chukyo Longevity Medical Research and Promotion Foundation (to D.S.) and a grant from the Japanese Ministry of Health, Labour, and Welfare for Research on Dementia (to K.O.). We also thank ELSS editors for English corrections.

Author contributions

D.S. developed the method and performed the analyses; R.M. and K.O. performed the mRNA expression experiments and the qRT-PCR validation of gene expression; S.A. provided technical assistance; S.N. contributed to data acquisition and the analyses; D.S. wrote the manuscript; D.S. and K.O. organized this work. All authors contributed to and approved the final manuscript.

Data availability

The data that support the findings of this study are available from NBDC (National Bioscience Database Center) website under controlled access (https://humandbs.biosciencedbc.jp/en/). The NBDC number is hum0215, and the JGA (Japanese Genotype-phenotype Archive) accession number is JGAS000532.

Code availability

We used open source program languages R (version 3.4.3) and Ruby (version 2.4.0) to analyze data and create plots. Code is available upon request from the corresponding authors.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Daichi Shigemizu, Email: daichi@ncgg.go.jp.

Kouichi Ozaki, Email: ozakikk@ncgg.go.jp.

Supplementary information

The online version contains supplementary material available at 10.1038/s41514-022-00096-9.

References

  • 1.Albert MS, et al. The diagnosis of mild cognitive impairment due to Alzheimer’s disease: recommendations from the National Institute on Aging-Alzheimer’s Association workgroups on diagnostic guidelines for Alzheimer’s disease. Alzheimers Dement. 2011;7:270–279. doi: 10.1016/j.jalz.2011.03.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.McKhann GM, et al. The diagnosis of dementia due to Alzheimer’s disease: recommendations from the National Institute on Aging-Alzheimer’s Association workgroups on diagnostic guidelines for Alzheimer’s disease. Alzheimers Dement. 2011;7:263–269. doi: 10.1016/j.jalz.2011.03.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Sperling RA, et al. Toward defining the preclinical stages of Alzheimer’s disease: recommendations from the National Institute on Aging-Alzheimer’s Association workgroups on diagnostic guidelines for Alzheimer’s disease. Alzheimers Dement. 2011;7:280–292. doi: 10.1016/j.jalz.2011.03.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Petersen RC, et al. Mild cognitive impairment: ten years later. Arch. Neurol. 2009;66:1447–1455. doi: 10.1001/archneurol.2009.266. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Moradi E, Pepe A, Gaser C, Huttunen H, Tohka J. Alzheimer’s Disease Neuroimaging I. Machine learning framework for early MRI-based Alzheimer’s conversion prediction in MCI subjects. Neuroimage. 2015;104:398–412. doi: 10.1016/j.neuroimage.2014.10.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Petersen RC, et al. Practice guideline update summary: Mild cognitive impairment: Report of the Guideline Development, Dissemination, and Implementation Subcommittee of the American Academy of Neurology. Neurology. 2018;90:126–135. doi: 10.1212/WNL.0000000000004826. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Lovell MA. A potential role for alterations of zinc and zinc transport proteins in the progression of Alzheimer’s disease. J. Alzheimers Dis. 2009;16:471–483. doi: 10.3233/JAD-2009-0992. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Long JM, Holtzman DM. Alzheimer Disease: An Update on Pathobiology and Treatment Strategies. Cell. 2019;179:312–339. doi: 10.1016/j.cell.2019.09.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Yang P, Sun F. Aducanumab: The first targeted Alzheimer’s therapy. Drug. Discov. Ther. 2021;15:166–168. doi: 10.5582/ddt.2021.01061. [DOI] [PubMed] [Google Scholar]
  • 10.Kim DH, et al. Genetic markers for diagnosis and pathogenesis of Alzheimer’s disease. Gene. 2014;545:185–193. doi: 10.1016/j.gene.2014.05.031. [DOI] [PubMed] [Google Scholar]
  • 11.Gatz M, et al. Role of genes and environments for explaining Alzheimer disease. Arch. Gen. Psychiatry. 2006;63:168–174. doi: 10.1001/archpsyc.63.2.168. [DOI] [PubMed] [Google Scholar]
  • 12.Asanomi Y, et al. A functional variant of SHARPIN confers increased risk of late-onset Alzheimer’s disease. J. Hum. Genet. 2022;67:203–208. doi: 10.1038/s10038-021-00987-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Shigemizu D, et al. Whole-genome sequencing reveals novel ethnicity-specific rare variants associated with Alzheimer’s disease. Mol. Psychiatry. 2022;27:2554–2562. doi: 10.1038/s41380-022-01483-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Jansen IE, et al. Genome-wide meta-analysis identifies new loci and functional pathways influencing Alzheimer’s disease risk. Nat. Genet. 2019;51:404–413. doi: 10.1038/s41588-018-0311-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Kunkle BW, et al. Genetic meta-analysis of diagnosed Alzheimer’s disease identifies new risk loci and implicates Abeta, tau, immunity and lipid processing. Nat. Genet. 2019;51:414–430. doi: 10.1038/s41588-019-0358-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Shigemizu D, et al. Ethnic and trans-ethnic genome-wide association studies identify new loci influencing Japanese Alzheimer’s disease risk. Transl. Psychiatry. 2021;11:151. doi: 10.1038/s41398-021-01272-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Villegas-Llerena C, Phillips A, Garcia-Reitboeck P, Hardy J, Pocock JM. Microglial genes regulating neuroinflammation in the progression of Alzheimer’s disease. Curr. Opin. Neurobiol. 2016;36:74–81. doi: 10.1016/j.conb.2015.10.004. [DOI] [PubMed] [Google Scholar]
  • 18.Chan G, et al. CD33 modulates TREM2: convergence of Alzheimer loci. Nat. Neurosci. 2015;18:1556–1558. doi: 10.1038/nn.4126. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Zenaro E, et al. Neutrophils promote Alzheimer’s disease-like pathology and cognitive decline via LFA-1 integrin. Nat. Med. 2015;21:880–886. doi: 10.1038/nm.3913. [DOI] [PubMed] [Google Scholar]
  • 20.Shigemizu D, et al. Identification of potential blood biomarkers for early diagnosis of Alzheimer’s disease through RNA sequencing analysis. Alzheimers Res. Ther. 2020;12:87. doi: 10.1186/s13195-020-00654-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Six A, et al. The past, present, and future of immune repertoire biology - the rise of next-generation repertoire analysis. Front. Immunol. 2013;4:413. doi: 10.3389/fimmu.2013.00413. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Tonegawa S. Somatic generation of antibody diversity. Nature. 1983;302:575–581. doi: 10.1038/302575a0. [DOI] [PubMed] [Google Scholar]
  • 23.Li A, et al. Utilization of Ig heavy chain variable, diversity, and joining gene segments in children with B-lineage acute lymphoblastic leukemia: implications for the mechanisms of VDJ recombination and for pathogenesis. Blood. 2004;103:4602–4609. doi: 10.1182/blood-2003-11-3857. [DOI] [PubMed] [Google Scholar]
  • 24.Nakahara Y, et al. Clinical significance of peripheral TCR and BCR repertoire diversity in EGFR/ALK wild-type NSCLC treated with anti-PD-1 antibody. Cancer Immunol. Immunother. 2021;70:2881–2892. doi: 10.1007/s00262-021-02900-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Zheng F, et al. Immune cell and TCR/BCR repertoire profiling in systemic lupus erythematosus patients by single-cell sequencing. Aging (Albany NY) 2021;13:24432–24448. doi: 10.18632/aging.203695. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Xu H, Jia J. Single-Cell RNA Sequencing of Peripheral Blood Reveals Immune Cell Signatures in Alzheimer’s Disease. Front. Immunol. 2021;12:645666. doi: 10.3389/fimmu.2021.645666. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Bolotin DA, et al. Antigen receptor repertoire profiling from RNA-seq data. Nat. Biotechnol. 2017;35:908–911. doi: 10.1038/nbt.3979. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Song L, et al. TRUST4: immune repertoire reconstruction from bulk and single-cell RNA-seq data. Nat. Methods. 2021;18:627–630. doi: 10.1038/s41592-021-01142-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Chen B, Khodadoust MS, Liu CL, Newman AM, Alizadeh AA. Profiling Tumor Infiltrating Immune Cells with CIBERSORT. Methods Mol. Biol. 2018;1711:243–259. doi: 10.1007/978-1-4939-7493-1_12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Becht E, et al. Estimating the population abundance of tissue-infiltrating immune and stromal cell populations using gene expression. Genome Biol. 2016;17:218. doi: 10.1186/s13059-016-1070-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Uhlen M, et al. Proteomics. Tissue-based map of the human proteome. Science. 2015;347:1260419. doi: 10.1126/science.1260419. [DOI] [PubMed] [Google Scholar]
  • 32.Reis LM, et al. De Novo Missense Variants in WDR37 Cause a Severe Multisystemic Syndrome. Am. J. Hum. Genet. 2019;105:425–433. doi: 10.1016/j.ajhg.2019.06.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Long J, Pan G, Ifeachor E, Belshaw R, Li X. Discovery of Novel Biomarkers for Alzheimer’s Disease from Blood. Dis Markers. 2016;2016:4250480. doi: 10.1155/2016/4250480. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Rai N, et al. Relative Alterations in Blood-Based Levels of Sestrin in Alzheimer’s Disease and Mild Cognitive Impairment Patients. J. Alzheimers Dis. 2016;54:1147–1155. doi: 10.3233/JAD-160479. [DOI] [PubMed] [Google Scholar]
  • 35.San Segundo-Acosta P, et al. Identification of Alzheimer’s Disease Autoantibodies and Their Target Biomarkers by Phage Microarrays. J. Proteome. Res. 2019;18:2940–2953. doi: 10.1021/acs.jproteome.9b00258. [DOI] [PubMed] [Google Scholar]
  • 36.Dunn-Walters DK, Ademokun AA. B cell repertoire and ageing. Curr. Opin. Immunol. 2010;22:514–520. doi: 10.1016/j.coi.2010.04.009. [DOI] [PubMed] [Google Scholar]
  • 37.Qi Q, et al. Diversity and clonal selection in the human T-cell repertoire. Proc. Natl. Acad. Sci. USA. 2014;111:13139–13144. doi: 10.1073/pnas.1409155111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Kanca O, et al. De Novo Variants in WDR37 Are Associated with Epilepsy, Colobomas, Dysmorphism, Developmental Delay, Intellectual Disability, and Cerebellar Hypoplasia. Am. J. Hum. Genet. 2019;105:413–424. doi: 10.1016/j.ajhg.2019.06.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Varma VR, et al. Abnormal brain cholesterol homeostasis in Alzheimer’s disease-a targeted metabolomic and transcriptomic study. NPJ Aging Mech. Dis. 2021;7:11. doi: 10.1038/s41514-021-00064-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Janelidze S, et al. Plasma P-tau181 in Alzheimer’s disease: relationship to other biomarkers, differential diagnosis, neuropathology and longitudinal progression to Alzheimer’s dementia. Nat. Med. 2020;26:379–386. doi: 10.1038/s41591-020-0755-1. [DOI] [PubMed] [Google Scholar]
  • 41.Lantero Rodriguez J, et al. Plasma p-tau181 accurately predicts Alzheimer’s disease pathology at least 8 years prior to post-mortem and improves the clinical characterisation of cognitive decline. Acta. Neuropathol. 2020;140:267–278. doi: 10.1007/s00401-020-02195-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Mattsson-Carlgren N, et al. Longitudinal plasma p-tau217 is increased in early stages of Alzheimer’s disease. Brain. 2020;143:3234–3241. doi: 10.1093/brain/awaa286. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Simren J, et al. The diagnostic and prognostic capabilities of plasma biomarkers in Alzheimer’s disease. Alzheimers Dement. 2021;17:1145–1156. doi: 10.1002/alz.12283. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Shigemizu D, et al. Risk prediction models for dementia constructed by supervised principal component analysis using miRNA expression data. Commun. Biol. 2019;2:77. doi: 10.1038/s42003-019-0324-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Dobin A, et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29:15–21. doi: 10.1093/bioinformatics/bts635. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Liao Y, Smyth GK, Shi W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics. 2014;30:923–930. doi: 10.1093/bioinformatics/btt656. [DOI] [PubMed] [Google Scholar]
  • 47.Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26:139–140. doi: 10.1093/bioinformatics/btp616. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Li B, Dewey CN. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinformatics. 2011;12:323. doi: 10.1186/1471-2105-12-323. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Shugay M, et al. VDJtools: Unifying Post-analysis of T Cell Receptor Repertoires. PLoS Comput Biol. 2015;11:e1004503. doi: 10.1371/journal.pcbi.1004503. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Shigemizu D, et al. Prognosis prediction model for conversion from mild cognitive impairment to Alzheimer’s disease created by integrative analysis of multi-omics data. Alzheimers Res. Ther. 2020;12:145. doi: 10.1186/s13195-020-00716-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Marone M, Mozzetti S, De Ritis D, Pierelli L, Scambia G. Semiquantitative RT-PCR analysis to assess the expression levels of multiple transcripts from the same sample. Biol. Proced. Online. 2001;3:19–25. doi: 10.1251/bpo20. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Information (123.5KB, pdf)
Reporting Summary (1.5MB, pdf)

Data Availability Statement

The data that support the findings of this study are available from NBDC (National Bioscience Database Center) website under controlled access (https://humandbs.biosciencedbc.jp/en/). The NBDC number is hum0215, and the JGA (Japanese Genotype-phenotype Archive) accession number is JGAS000532.

We used open source program languages R (version 3.4.3) and Ruby (version 2.4.0) to analyze data and create plots. Code is available upon request from the corresponding authors.


Articles from NPJ Aging are provided here courtesy of Nature Publishing Group

RESOURCES