Skip to main content

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

medRxiv logoLink to medRxiv
[Preprint]. 2025 Sep 27:2025.09.24.25336555. [Version 1] doi: 10.1101/2025.09.24.25336555

A multi-ancestry polygenic risk score for Alzheimer disease is associated with cognitive decline, hippocampal atrophy and neuropathological hallmarks in diverse populations

Nuzulul Kurniansyah 1,2, Shinya Tasaki 3, Habbibur Rehman 2, Congcong Zhu 2, John Farrell 2, Richard Sherva 2,4, Richard Hauger 5,6,7,8, Victoria C Merritt 5,6,7, Matthew Panizzon 6,8, Rui Zhang 4, J Michael Gaziano 9,10, Jungsoo Gim 11,12,13,14, Kunho Lee 11,12,13,15, Dong Yong Lee 16, Kwansik Nho 17, Ricardo A Vialle 3, Shubhabrata Mukherjee 18, Emily H Trittschuh 19,20, Annie J Lee 21,22,23, Adam M Brickman 24, Carlos Cruchaga 25,26, Shannon Risacher 17,27, Douglas N Greve 28, Paul Crane 18; VA Million Veteran Program; Alzheimer’s Disease Genetics Consortium; Alzheimer Disease Sequencing Project, Eden Martin 29, William Bush 30, Richard Mayeux 21,22,23,31, Jonathan L Haines 30, Margaret A Pericak-Vance 29, Mark Logue 2,4,32,33, David A Bennett 3, Lisa L Barnes 3, Andrew Saykin 17, Timothy Hohman 34,35, Li-San Wang 36, Gerard C Schellenberg 36, Ting Fang Alvin Ang 37,38, Rhoda Au 2,37,38,39,40,41,42, Jesse Mez 39,40,41, Kathryn L Lunetta 33,39, Xiaoling Zhang 1,2,33,*, Lindsay A Farrer 1,2,33,39,40,41,42,43,*
PMCID: PMC12486013  PMID: 41040715

Abstract

Alzheimer disease (AD) has a strong genetic basis, yet previously derived polygenic risk scores (PRS) are heavily weighted by the APOE locus and perform inconsistently across diverse ancestries. We developed an APOE-independent multi-ancestry AD PRS using genome-wide association study summary statistics from cohorts in the United States, Europe and East Asia that were applied to European ancestry (EA), African American (AA), Caribbean Hispanic (CH), and East Asian cohorts from the Alzheimer’s Disease Genetics Consortium. PRS performance was evaluated in the multi-ancestry Alzheimer’s Disease Sequencing Project (ADSP) dataset and validated in several additional multi-ancestry cohorts. The PRS was significantly associated with AD in the ADSP EA, AA, CH, and Native American Hispanic groups with adjusted odds ratios (ORs) between 1.14 and 1.52 per standard deviation of the PRS. PRS performance was validated in the replication cohorts (ORs 1.21–1.65). The PRS was also associated with poorer memory, executive function, and language performance; greater AD-related neuropathological burden (including CERAD, Braak stage, and Thal phase scores); reduced hippocampal volume; lower CSF Aβ42; and elevated total tau and phosphorylated tau (p-tau), with stronger p-tau associations observed in women. Longitudinal analyses revealed that individuals in the highest PRS decile exhibited the steepest cognitive decline, particularly among those who progressed to AD. Our findings demonstrate the utility of an ancestry-aware and APOE-independent PRS for advancing understanding of the genetic basis of AD across diverse populations. Associations observed with early biological and cognitive changes and potential sex-specific differences support the incorporation of a PRS in clinical trials and personalized intervention and prevention strategies.


Late-onset Alzheimer disease (AD) is a progressive disorder characterized by cognitive decline and memory impairment, affecting an estimated 6.9 million people in the United States1. The clinical course of AD unfolds over many years, often beginning with subtle cognitive changes before progressing to overt dementia. Although the rate of decline varies across individuals, growing evidence suggests that genetic factors play a central role in shaping both the onset and trajectory of cognitive deterioration2. While the APOE ε4 allele remains the strongest known genetic risk factor3,4, genome-wide association studies (GWAS) have identified numerous additional common and rare variants contributing to polygenic susceptibility primarily in individuals of European ancestry (EA)5,6 and to a lesser extent in other populations7,8.

Polygenic risk scores (PRS) aggregate the effects of multiple genetic variants into a single quantitative measure of inherited susceptibility, each of which is weighted based on GWAS summary statistics9. A critical challenge in the application of PRS lies in the underrepresentation of diverse genetic ancestries in AD GWAS datasets, which include predominantly white individuals of European ancestry (EA). Although AD PRS and genetic risk scores derived from European ancestry (EA) cohorts show predictive value in several non-European populations, their performance varies by ancestry, with notably reduced predictive value in some underrepresented groups1012. The availability of GWAS data from diverse populations, including African Americans (AA)7,13, Caribbean Hispanics (CH)14, and East Asian (EAS) 15,16 provides an opportunity to enhance the transferability and accuracy of PRS across multiple ancestries. This expanded genetic representation is crucial for developing more robust and universally applicable PRS models, which can improve risk prediction.

AD PRSs have been associated with cerebrospinal fluid (CSF) biomarkers, neuropathological features, hippocampal atrophy, and cognitive decline1721 However, these findings have largely been derived from studies in EA populations, limiting their relevance to other populations. A broader evaluation of PRS in ancestrally diverse settings is essential to determine its clinical utility and potential role in early detection11. We developed a multi-ancestry AD PRS using GWAS summary statistics from multiple population ancestries. A weighted PRS summation was derived from the multi-ancestry datasets assembled by the Alzheimer’s Disease Genetics Consortium (ADGC) and evaluated for performance with and without variants in the APOE region across diverse ancestries represented in the AD Sequencing Project (ADSP). To ensure robustness, the PRS was validated in independent cohorts, including the multi-ancestry All of Us (AoU), EA participants from the Framingham Heart Study (FHS), the Korean Brain Aging Study for the Early Diagnosis and Prediction of AD (KBASE) Study, and AA participants from the Religious Orders Study and Memory Aging Project (ROSMAP), Minority Aging Research Study (MARS), and Rush Alzheimer’s Disease Center (collectively referred to as Rush). We examined its association with CSF and plasma biomarkers, a brain MRI measure of hippocampal volume, neuropathological traits, and longitudinal trajectories of cognitive decline to assess whether genetic factors influence early manifestations of AD and the degree to which these effects differ by sex and ancestry.

Results

Overall study design for the development and evaluation of a multi-ancestry AD PRS

Figure 1 provides an overview of the study. We first derived several PRS using multiple AD GWAS datasets including EA, AA, and EAS individuals that were constructed using clumping and thresholding via PRSice2 and Bayesian shrinkage methods using LDPred2, PRS-CS, and PRS-CSx. We applied three strategies to develop a robust multi-ancestry AD PRS, including (1) combining GWAS results from all datasets by meta-analysis first, then constructing the PRS, (2) weighted summation with weights derived from the ADGC dataset, and (3) unweighted summation. PRS performance was evaluated in the ADSP dataset for AD risk prediction, selecting the best-performing PRS based on the largest odds ratio (OR), highest area under the curve (AUC), and lowest p-value. To further refine the most robust PRS, we recalculated PRS weights after removing SNPs within 1 Mb of the APOE coding region to eliminate the confounding effect of APOE genotype and applying the same methods. The final APOE region-excluded PRS was validated in independent cohorts, including AoU version 7, KBASE, FHS, and Rush. We then examined associations of the AD PRS with AD biomarkers, cognitive function, hippocampal volume, and neuropathological traits.

Figure 1. The overview of the study.

Figure 1.

Development and evaluation of AD PRS. The process involves: (a) PRS development using clumping-thresholding and Bayesian shrinkage approaches; (b) multi-ancestry PRS evaluation using meta-analysis, unweighted PRS summation, and weighted PRS summation with weights derived from ADGC; (c) evaluation of PRS performance in the ADSP dataset; (d) refinement of the best-performing PRS by excluding the APOE region, followed by validation of this PRS in independent datasets; (e) Evaluation of the association of the AD PRS with AD-related phenotypes.

Multi-ancestry PRS strongly associated with AD risk in diverse ancestry groups

Evaluation of population substructure among 36,362 individuals in the ADSP Release 4 dataset by principal component (PC) analysis identified six ancestry clusters shown in Supplementary Figure 1: European (EA, N=16,840), African (AA, N=6,532), Caribbean Hispanic (CH, N=5,317), Native American Hispanic (NAH, N=4,722), South Asian (SA, N=2,772), and East Asian (EAS, N=178). Ancestry inference estimates highlight the genetic diversity and composition within each major ancestry group (Supplementary Figure 2). Individuals within the EA, AA, NAH, SA, and EAS clusters appear relatively homogeneous, whereas significant genetic admixture was observed in the CH cluster. After excluding individuals with missing AD status, age or sex, 26,874 individuals of diverse ancestral backgrounds remained for analysis (Supplementary Table 1).

We compared the performance of a multi-ancestry AD PRS, constructed using GWAS summary statistics from ancestrally diverse populations in the ADSP dataset. Three different PRS construction strategies were evaluated, including one based on PRS weight summation described in the Methods that was derived from the ADGC dataset (Supplementary Table 2). The PRS-CS weighted summation method consistently outperformed other strategies, both with and without adjustment for APOE ɛ4 carrier status (Supplementary Figure 3), and this PRS construction was employed in subsequent analyses. To assess the specific impact of variants in the APOE region on PRS performance, we recalculated the PRS after removing SNPs within 1 Mb of the APOE coding region from the training data using the PRS-CS weighted summation approach. The PRS distributions among APOE ε4 carriers and non-carriers were nearly identical in all ancestries, whereas on average, ε4 carriers in all groups had a higher PRS when APOE region SNPs were included in the PRS (Figure 2A). These findings suggest the disproportionate effect of APOE genotype on PRS compared to the collective contribution of SNPs in other portions of the genome.

Figure 2. AD PRS characteristics and behavior.

Figure 2.

A. Distribution of AD PRS calculated with and without the APOE region by ancestry and ε4 carrier status. B. Association of AD PRS excluded APOE region (per 1 SD increase) with AD in the ADSP (stratified by ancestry), FHS, and RADC cohorts. Analyses were adjusted for age, APOE ε4 and ε2, sex, and the first 10 principal components of ancestry. In FHS, models were further adjusted for study cohort and family ID as random effects. Both FHS and RADC models also adjusted for education. In all participants ADSP and AoU, models included additional adjustment for ancestry group. AUC: Area under the curve, CI: confidence interval, OR: Odds ratio; SD: standard deviation

Next, we evaluated the performance of the multi-ancestry AD PRS with and without SNPs in the APOE region across ADSP ancestry groups. The PRS including APOE region SNPs was significantly associated with AD in all ancestry groups except SA and EAS populations due to small sample sizes. The performance of the PRS was attenuated after adjusting for APOE ε4 and ε2 carrier status (Supplementary Figure 4). PRS performance remained robust even after excluding the APOE region, with models adjusted for APOE ε4 and ε2 carrier status showing slightly stronger associations than unadjusted models (Supplementary Figure 5). When including APOE ε4 and ε2 carrier status in the model, the AD PRS excluding the APOE region was significantly associated with AD (OR per 1 standard deviation [SD] increase = 1.39, 95% CI: 1.34–1.44, P = 7.06 × 10−71, AUC = 0.78) in the total sample (Figure 2B). Among ancestry groups, the strongest association was observed in EAs (OR = 1.52, 95% CI: 1.44–1.60, P = 4.66×10−52, AUC = 0.77), followed by the CH (OR = 1.34, 95% CI: 1.22–1.46, P = 7.69×10−11, AUC = 0.69), AA (OR = 1.21, 95% CI: 1.11–1.32, P = 1.04×10−5, AUC = 0.70), and NAH (OR = 1.14, 95% CI: 1.02–1.27, P = 2.40×10−2, AUC = 0.69) groups. No significant associations were observed for the SA or EAS groups.

The PRS excluding the APOE region was selected for subsequent analyses of AD and related outcomes because it provided a robust measure of genetic risk, captured ancestry-specific effects, and minimized the influence of the large effect of APOE genotype on the overall score. Further analysis showed that the association between PRS and AD risk was similar in men and women across all ancestry groups, except in the NAH group in which the PRS was not significantly associated with AD in men (Supplementary Figure 6). Examination within APOE genotype in the overall sample revealed that the association of the AD PRS was significant in all genotype groups, but the OR for ε2 carriers (1.16) was substantially smaller than the ORs for the ε3/ε3 (1.37), ε3/ε4 (1.45) and ε4/ε4 (1.52) groups (Supplementary Figure 7). These patterns were most evident in the EA group, while effect sizes were attenuated in the AA group. In the CH sample, the strength of association was similar across ε2/ε2+ε2/ε3, ε3/ε3, and ε3/ε4 genotypes (ORs = 1.33–1.39). The numbers of ε4 homozygotes in this group and NAH ε2 and ε4 carriers was too small to draw reliable conclusions.

AD PRS validation in the independent cohorts

We validated the robust performance of the AD PRS in three independent cohorts: KBASE which includes 425 South Koreans (Supplementary Table 1), AoU (version 7) which includes 71,549 individuals from diverse ancestries (Supplementary Table 3), Rush comprising 897 AA participants (Supplementary Table 4), and FHS which includes 4,080 EA participants (Supplementary Table 5). The AD PRS was significantly associated with AD (per 1 SD increase) in the AoU (OR = 1.21, 95% CI: 1.06–1.38, P = 3.77 × 10−3, AUC = 0.62), Rush (OR = 1.35, 95% CI: 1.05–1.73, P = 1.92 × 10−2, AUC = 0.60), KBASE (OR = 1.65, 95% CI: 1.17–2.34, P = 4.72 × 10−3, AUC = 0.66), and FHS (OR = 1.35, 95% CI: 1.19–1.54, P = 2.95 × 10−6, AUC = 0.79) cohorts (Figure 2B). Further analysis to evaluate sex differences revealed that in the FHS dataset the AD PRS was significantly associated with AD in women (OR = 1.42, 95% CI: 1.23–1.65, P =2.22 × 10−6), but not men (OR = 1.15, 95% CI: 0.94–1.41, P =0.17). However, no sex-specific association was observed in the larger ADSP cohort (Supplementary Figure 6). The AD PRS was not significant in either men or women in the Rush cohort, likely because of small sample sizes (Supplementary Figure 8).

To contextualize the performance of our multi-ancestry AD PRS, we compared it to a PRS that we calculated using previously published AD-specific variant weights from the polygenic score (PGS) catalog and recent literature. This allowed us to assess whether our approach improves prediction across diverse populations. We restricted comparisons to scores explicitly developed for AD. For consistency, we downloaded variant weights (excluding the APOE region) and applied them to the ADSP and KBASE datasets to construct PRSs and directly compare effect sizes. Because all published scores were derived from GWAS that included European samples from ADGC and ADSP, we focused our comparisons on non-European ancestry groups in ADSP (AA, CH, and NAH) to avoid sample overlap. We tested the performance of each PRS within ancestry groups in ADSP and in KBASE using logistic regression models adjusted for age, sex, APOE ε4/ε2 status, and the top ten PCs. In the ADSP dataset, our PRS showed the strongest association with AD in the AA and CH groups, but was less predictive in the NAH group compared to the score by Lourida et al22. (Supplementary Table 6). Our PRS consistently outperformed all other scores in the Korean KBASE dataset (Supplementary Table 7).

Association of AD PRS with cerebrospinal fluid (CSF) and plasma AD biomarkers

We assessed the association of AD PRS with amyloid-β 42 (Aβ42), total Tau (t-Tau), and phosphorylated Tau 181 (pTau181) measured in CSF obtained from 1,389 ADSP participants (49.7% men) whose characteristics are shown in Supplementary Table 8. All biomarker values were harmonized and standardized across cohorts prior to analysis. A 1 SD higher AD PRS was significantly associated with reduced Aβ42 level (β = −0.14, 95% CI: −0.20 to −0.08, P = 1.03 × 10−6), increased pTau181 (β = 0.14, 95% CI: 0.07 to 0.20, P = 2.03 × 10−5) and increased t-Tau (β = 0.18, 95% CI: 0.12 to 0.24, P = 1.79 × 10−8). The association with pTau181 was significant only in women (β = 0.21, 95% CI: 0.12 to 0.30, P = 2.76 × 10−6) (Figure 3A).

Figure 3. Associations of the AD PRS with AD, CSF biomarkers and AT classification in the ADSP dataset.

Figure 3.

A. Association of the AD PRS and CSF biomarkers (Aβ42, pTau, and Tau) stratified by sex. Beta coefficients represent the change in standardized CSF biomarker level per 1 SD increase in the AD PRS. Analysis models included covariates for age at CSF biomarker measurement, sex (unless stratified), education, APOE ε4 and ε2 status, ancestry group and the first 10 principal components of ancestry. B. Predicted probabilities for each AT category are shown across PRS values, with other covariates fixed at their observed values. Models were adjusted for the same covariates as in panel A. The AT classification categorizes biomarkers as follows: A+/T+ (amyloid and tau pathology present), A–/T+ (tau pathology without amyloid) ,A+/T− (amyloid pathology without tau involvement), and A–/T− (neither amyloid nor tau pathology present). CSF: cerebrospinal fluid; CI: confidence interval, SD: standard deviation.

We further investigated these associations by modeling amyloid and tau (AT) classification, where A+T+ represents amyloid and tau positivity consistent with AD pathology; A+T− reflects amyloid positivity without tau accumulation; A−T+ suggests non-AD tauopathies; and A−T− indicates no AD-related pathology. AT classification was modeled as the outcome in a multinomial logistic regression with AD PRS as a continuous predictor, adjusting for age, APOE ε4 and ε2 carrier status, and the top ten ancestry PCs. In the overall sample, higher PRS was associated with a higher predicted probability of A+T+ and a lower probability of A−T−, indicating a graded increase in biomarker-defined AD pathology across the PRS distribution (Figure 3B). In sex-stratified models, men had a higher predicted probability of A+T+ at lower PRS values, whereas women showed a steeper increase in A+T+ probability across the PRS range, suggesting that women may experience a more pronounced shift toward biomarker-positive AD pathology as genetic risk increases.

We also examined the association of AD PRS with plasma p-tau181 in the Framingham Heart Study (FHS; n = 1,680) and longitudinal plasma p-tau217 in the Alzheimer’s Disease Neuroimaging Initiative (ADNI; n = 301) cohorts (Supplementary Tables 8 and 9). A 1 SD higher AD PRS was associated with elevated p-tau181 levels (β = 0.10, 95% CI: 0.02−0.17, P = 0.01), an effect that was evident in women (β = 0.11, 95% CI: 0.01−0.20, P = 0.03) but not men (β = 0.05, 95% CI: −0.08 to 0.18, P = 0.45). A higher PRS was similarly associated with elevated p-tau217 (β = 0.20, 95% CI: 0.05−0.35, P = 8.9 × 10−3) which was evident in women (β = 0.33, 95% CI: 0.10−0.57, P = 7.6 × 10−3) but not men (β = 0.02, 95% CI: −0.17 to 0.22, P = 0.82) (Supplementary Figure 9).

Association of AD PRS with neuropathological traits

We assessed the association of the AD PRS (per 1 SD increase) with AD-related neuropathological traits among ADSP participants (Supplementary Table 10). Neuropathological traits were modeled as ordered categorical outcomes, preserving the full range of each scale. A 1 SD higher AD PRS was significantly associated with greater severity across all traits: CERAD score for neuritic plaque burden (OR = 1.37, 95% CI: 1.25–1.49, P = 3.95×10−12), amyloid deposition measured by Thal phase (OR = 1.38, 95% CI: 1.20–1.58, P = 6.90×10−6), Braak stage (OR = 1.41, 95% CI: 1.30– 1.53, P = 7.15×10−15), and AD neuropathologic change (ADNC) score (OR = 1.52, 95% CI: 1.30–1.76, P = 6.80×10−8) (Supplementary Figure 10).

Association of AD PRS with cognitive performance

We assessed the association of the AD PRS with co-calibrated, harmonized and standardized scores for memory, executive function, and language, derived from longitudinal cognitive assessments across multiple time points in ADSP, KBASE, and FHS participants (Supplementary Table 11). The derivation of cognitive domain scores is detailed in the Methods. Each 1 SD increase in AD PRS was significantly associated with poorer performance across all time points for memory (β = −0.17, 95% CI: −0.19 to −0.57, P = 1.44 × 10−68), executive function (β = −0.11, 95% CI: −0.13 to −0.10, P = 1.01 × 10−33), and language (β = −0.12, 95% CI: −0.14 to −0.10, P = 7.01 × 10−39). Associations of AD PRS with cognitive performance scores were significant in the EA, AA, and CH groups but not in the NAH group (Figure 4, Supplementary Figures 11 and 12). Similar trends were observed in FHS participants for all cognitive domains but for memory only in KBASE participants. Further examination of the association of the AD PRS with episodic memory and global cognition in Rush AA participants (Supplementary Table 4) showed that a 1 SD higher AD PRS was significantly associated with lower episodic memory (β = −0.20, 95% CI: −0.31 to −0.10, P = 1.62 × 10−4) and global cognition (β = −0.12, 95% CI: −0.21 to −0.04, P = 5.99 × 10−3) scores (Supplementary Figure 13).

Figure 4. Association of AD PRS with memory score.

Figure 4.

Beta coefficients represent the decrease in standardized memory scores per 1 SD increase in the AD PRS. Analyses were adjusted for age at exam, sex (unless stratified), education, ancestry (in the combined ADSP dataset), APOE ε4 and ε2 carrier status, and the first 10 principal components of ancestry. In the FHS dataset, models also included covariates for generation cohort and family ID as random effects. CI: confidence interval, SD: standard deviation.

Sex-stratified analyses conducted in the ADSP and FHS datasets showed that the association of a 1 SD higher AD PRS was more strongly associated with lower memory performance in women than men, particularly among EA participants as indicated by a meta-analysis of results from both datasets in women (β = −0.13, 95% CI: −0.15 to −0.11, P = 5.86 × 10−32) compared to men (β = −0.09, 95% CI: −0.11 to −0.06, P = 4.68 × 10−14) (Figure 4). In contrast, associations of AD PRS with executive function and language scores were similar between sexes across all ancestries (Supplementary Figures 11 and 12). Sex-stratified analyses in the Rush and KBASE cohorts yielded comparable results (Supplementary Figure 13).

AD PRS predicts age-related cognitive decline

We estimated trajectories of cognitive performance between ages 60 and 80 using generalized additive mixed models (GAMMs), modeling age as a smooth term and stratifying individuals by AD PRS percentiles (<10%, 10–50%, 50–90%, >90%). Predicted scores for performance in memory, executive function, and language domains were derived from longitudinal cognitive assessments in ADSP participants. Details of the derivation of cognitive domain scores and modeling procedures are described in the Methods section. As shown in Figure 5, among all ADSP participants, individuals in the highest PRS stratum (>90%) had lower memory performance from age 60 onward, with the performance gap relative to lower-risk groups widening with age. Similar but less pronounced differences were observed for executive function and language. These findings suggest that higher AD polygenic risk is associated with lower cognitive performance in later life, particularly in memory, with largely parallel rates of age-related decline. Further analysis of cognitive trajectories by sex and revealed no significant differences in the slope or level of decline across PRS strata between males and females in the overall sample (Supplementary Figure 14).

Figure 5. Age-related and longitudinal trajectories of cognitive performance by AD PRS in ADSP participants.

Figure 5.

Analyses included 32,167 participants in the ADSP and were adjusted for sex, education, APOE ε4 and ε2 status, ancestry group, and the first 10 principal components of ancestry. A. Age-related trajectories for memory, executive function, and language across PRS groups (<10%, 10–50%, 50–90%, and >90%). Solid lines depict predicted cognitive scores by age, with shaded ribbons indicating 95% confidence intervals. B. Longitudinal trajectories in the same domains, stratified by AD status. Solid lines depict predicted scores over time, with shaded ribbons indicating 95% confidence intervals. Models included main effects and all two-way and three-way interactions between PRS group, AD status, and time since baseline.

To test whether AD polygenic risk influences cognitive decline differentially by diagnostic status, we fit linear mixed-effects models including PRS group (low, moderate, high), AD status (AD vs. non-AD), time since first visit, and their interactions. Individuals who developed AD exhibited significantly steeper decline across memory, executive function, and language domains compared to those who remained cognitively normal (Figure 5B). Among AD cases, we observed subtle trends toward faster memory decline in those with a higher PRS, particularly in the years preceding clinical diagnosis. However, differences across PRS groups were modest and diminished after diagnosis. For executive function and language, PRS-related effects were minimal. Cognitive trajectories among non-AD participants declined slowly and remained largely parallel across PRS groups. Individuals in the high PRS group consistently exhibited lower cognitive scores, particularly in the memory domain. These results indicate that the AD PRS primarily captures differences in cognitive performance that emerge before clinical diagnosis, rather than predicting the rate of decline after diagnosis.

AD PRS is associated with hippocampal atrophy

We examined the association of AD PRS with hippocampal volume using longitudinal FreeSurfer-derived structural MRI data from ADSP (n= 6,568) and FHS (n= 970) participants ages 60 and older (Supplementary Table 12). Hippocampal volume was adjusted for head size by normalizing to intracranial volume and subsequently logtransformed. Effect sizes reflect the change in log-transformed hippocampal volume normalized to intracranial volume, per one standard deviation increase in AD PRS. Higher AD PRS was significantly associated with reduced hippocampal volume across all time points in both cohorts (ADSP: β = −0.021, 95% CI: −0.028 to −0.013, P = 1.37 × 10−7; FHS: β = −0.017, 95% CI: −0.029 to −0.005, P = 5.23 × 10−3). This association was observed separately in both women and men in ADSP (women: β = −0.024, 95% CI: −0.035 to −0.014, P = 3.14 × 10−6; men: β = −0.016, 95% CI: −0.028 to −0.005, P = 6.60 × 10−3), whereas in FHS the association was evident only in women (β = −0.019, 95% CI: −0.035 to −0.003, P = 2.33 × 10−2) and not men (β = −0.017, 95% CI: −0.036 to 0.002, P = 0.073) (Supplementary Figure 15).

Discussion

We developed and validated a PRS that captures the influence of common genetic variation for and is highly predictive of AD across diverse ancestries. Although APOE ε4 is the strongest common genetic risk factor for AD, our PRS excludes variants in the APOE region, which thus enables assessment and highlights the contribution of polygenic risk independent of APOE genotype. The score was significantly associated with clinical AD diagnosis, cognitive performance, hippocampal volume, neuropathological measures of AD severity, and CSF and plasma AD biomarkers.

This study addresses critical challenges in developing and generalizing an AD PRS across ancestrally diverse populations. Rather than relying solely on European-ancestry GWAS, we integrated summary statistics from diverse ancestries and employed ancestry-aware modeling strategies. Our multi-ancestry PRS outperformed comparative scores from the PGS Catalog and the recently published EA PRS by Nicolas et al.12 in AA, CH, and EAS groups within the ADSP and KBASE datasets. The observed PRS signal in the CH group likely reflects shared African and European components represented in the training data, enabling more accurate tagging of risk variants through overlapping LD structure23. In contrast, the strongest AD PRS association in the NAH group was observed with the EA-derived PRS by Lourida et al22, a counterintuitive result likely driven by residual European admixture and coincidental LD alignment, rather than genuine cross-population transferability, particularly given the absence of NAH GWAS data2427. Our findings underscore the potential improvement in performance of a PRS derived from multi-ancestry populations when applied to underrepresented populations, despite imbalance sample sizes.28. However, the diminished performance in Native American–enriched groups highlight ongoing limitations when entire ancestries are excluded from the construction of the PRS. Prior studies have shown that differences in LD structure, allele frequencies, and effect size distributions across populations limit PRS portability26,27. Improving risk prediction in admixed and underrepresented populations will require expanded GWAS representation and analytical approaches that explicitly model ancestry-specific genomic architecture.

The best-performing PRS included variants in the APOE region but its association with AD was markedly attenuated after adjustment for APOE ε4 and ε2 genotype, confirming that the signal was largely APOE driven. To capture risk independent of APOE, we constructed a PRS excluding and modeled APOE genotype separately; it remained significantly associated with AD and showed comparable effect sizes in ε4 carriers and non-carriers, demonstrating additional polygenic risk beyond APOE genotype. Excluding this region may also reduce ancestry-related variability linked to local ancestry, cis- and trans-modifiers, and vascular burden2933. These findings support integrating both APOE genotype and an APOE-excluded PRS for more precise risk stratification, enabling early identification even among ε4 non carriers. Translation into clinical practice, however, will require addressing variability in PRS performance across ancestries34, limited integration of genetic data into clinical workflows, and the absence of standardized thresholds for clinical interpretation35.

Understanding the downstream effects of genetic liability measures is critical for developing biologically informed and clinically relevant tools for risk assessment. In this study, the AD PRS was associated with multiple domains of AD pathophysiology, including neuropathology, CSF and plasma biomarkers, cognition, and hippocampal volume. Significant associations were observed with CERAD plaque burden, Thal phase, Braak stage, and overall AD-related neuropathological severity, aligning with established staging frameworks and supporting the PRS as a marker of cumulative disease burden36. Our AD PRS was also linked to lower CSF Aβ42 and higher t-tau, pTau181 and pTau 217 concentrations, consistent with early amyloid accumulation and downstream tau pathology characteristic of the pre-clinical stage of AD37. Using AT classification based on CSF amyloid and tau status, higher PRS was associated with a graded pattern across biomarker profiles, with increasing PRS corresponding to a higher predicted probability of A+T+ and a lower probability of A–T. These associations extended to plasma biomarkers, with significant effects for pTau181 in FHS38,39 and, importantly, in ADNI for pTau217, which is a blood-based biomarker recently cleared by the FDA for clinical use40. Although modest in sample size, the convergence of genetic liability with this clinically validated biomarker underscores the translational potential of integrating PRS with scalable blood-based assays to identify high-risk individuals even before symptoms emerge, warranting extension in larger, more diverse populations.

The clinical relevance of these biological associations is supported by cognitive performance analyses. AD PRS was associated with poorer memory, executive function, and language scores across ADSP ancestry groups, although these effects were attenuated or absent in smaller cohorts including FHS, KBASE and Caribbean and Native American Hispanics. Trajectory modeling showed that individuals in the highest PRS decile had lower memory performance from age 60 onward, with the gap relative to lower-PRS groups widening with age. Executive function and language showed similar but weaker patterns. These effects were most apparent among individuals who developed AD, with group differences evident before diagnosis, consistent with prior studies linking AD PRS to cognitive performance differences in the prodromal phase4143. In cognitively normal participants declined slowly and in parallel across PRS groups, though those with high PRS consistently performed worse, particularly in memory. Among individuals with AD, PRS had only modest effects prior to diagnosis and did not predict the rate of decline once the disease was present. Together, these findings suggest that the AD PRS captures enduring differences in cognitive reserve rather than accelerating decline after onset, highlighting its potential utility in identifying subtle, subthreshold cognitive differences that could inform early intervention44,45. Future studies integrating AD PRS with longitudinal cognitive performance, neuroimaging, and biomarker data will be essential to refine individualized risk prediction and guide precision prevention strategies.

Our results also suggest that the impact of aggregate genetic risk on AD biomarker levels and cognitive performance varies by sex. In ADSP, PRS associations with clinical diagnosis were similar in men and women, whereas in FHS stronger effects were seen in women, a difference likely reflecting study design, with FHS capturing incident cases prospectively and ADSP relying on cross-sectional, predominantly prevalent cases where survival or diagnostic bias may obscure sex-related effects46. Across both CSF and plasma biomarkers, the association between AD PRS and tau pathology was stronger in women than in men. This effect was most evident for CSF p-tau181 and extended to plasma p-tau181 and p-tau217. Predicted probability curves from AT classification demonstrated a steeper rise in A+T+ probability as PRS increased, further suggests that women may be more sensitive to developing combined amyloid and tau pathology. Although associations with hippocampal volume were modest, the direction and magnitude of effects were generally stronger among women47, though further investigation in larger cohorts is warranted. In cross-sectional analyses, the association of AD PRS with memory performance was more pronounced in women, particularly within the EA group, consistent with a previous study showing significant genetic correlations with 20 memory tests in women compared to only two in men48. We did not detect sex differences in longitudinal memory decline, which may reflect limited power or cognitive trajectory convergence in later disease stages. These observed sex-specific patterns likely reflect a combination of biological and environmental influences, including hormonal changes, X-linked genetic factors, and differences in vascular, metabolic, and lifestyle exposures between men and women4952.

A key strength of this study is the development and rigorous evaluation of an APOE genotype-independent PRS for AD. We constructed the weighted PRS using a systematically tested framework based on PRS-CS, incorporating ancestry-specific summary statistics from non-proxy GWAS spanning large EA cohorts and the largest available non-EA datasets to date. Summation weights were derived from ADGC data to enhance performance across ancestries. The resulting PRS was evaluated in clinically confirmed AD cases from the multi-ancestry ADSP dataset and further validated in the FHS, KBASE, and Rush cohorts, with consistent performance also observed in the multi-ancestry AoU dataset. Our findings from genetic risk modeling with clinical diagnosis, cognitive trajectories, a structural MRI measure of hippocampal volume, and CSF and plasma AD biomarkers demonstrate the utility of an ancestry-aware, APOE-independent PRS for individualized risk stratification and provide new insights into the genetic architecture and early manifestations of AD across diverse populations.

Despite these advancements, our study has several notable limitations. Non-European populations were underrepresented in both discovery and validation datasets, and the absence of ancestry-specific GWAS for key groups such as Native American and Caribbean populations may limit the generalizability of some findings. Additionally, reliance on ICD codes for case ascertainment in the MVP, FinnGen, and AoU datasets may have constrained diagnostic accuracy. Several datasets that were used to assess AD-related phenotypes, particularly among non-European ancestry groups, had modest statistical power to detect ancestry- or sex-specific effects. Nevertheless, these findings represent a critical step toward understanding AD across diverse populations and underscore the need for larger and more ancestrally representative datasets in both discovery and testing efforts. Furthermore, while we observed sex differences in the association of the AD PRS with tau pathology and memory performance, the underlying biological mechanisms remain unclear. Future studies incorporating exposures, such as hormone replacement therapy among post-menopausal women, may help elucidate the molecular pathways driving these sex-specific patterns.

In summary, this study demonstrates the utility of an APOE genotype-independent ancestry-aware polygenic risk score for AD, with consistent associations across cognitive, imaging, biomarker, and neuropathological features. Our findings highlight the value of integrating genetic risk into early detection and monitoring strategies and underscore the importance of expanding diverse, well-characterized cohorts to advance precision medicine approaches in Alzheimer’s disease.

Online methods

Study Participants

Alzheimer’s Disease Sequencing Project (ADSP)

We obtained whole genome sequencing (WGS) and phenotypic data included in the ADSP release 4 dataset, obtained from 36,361 individuals drawn from multiple diverse ancestry cohorts. Study protocols for all cohorts were reviewed and approved by the appropriate institutional review boards. Details of these cohorts and the generation and processing of WGS data were previously described in other studies5355.

The Alzheimer’s Disease Genetics Consortium (ADGC)

The Alzheimer’s Disease Genetics Consortium (ADGC) assembled genotype and phenotypic data for > 56,000 individuals from 61 cohorts56. For this analysis, we identified and removed overlapping individuals between the ADGC and the ADSP datasets, as detailed in Supplementary Note 1. We also excluded individuals who were included in the GWAS training dataset, including those from the Japanese cohort. We then restricted the analysis to individuals who were less related than third-degree relatives (kinship coefficient < 0.04), resulting in a final dataset of 32,016 unrelated individuals from 28 cohorts and diverse populations: European ancestry (EA, n = 27,231), African American (AA, n = 4,227), Caribbean Hispanic (CH, n = 264), and East Asian (EAS, n = 294). Detailed descriptions of the ADGC cohorts, including participant recruitment, diagnostic procedures, SNP imputation and data processing are reported elsewhere5,6,5658. Further details of the genotyping and quality controls are provided in Supplementary Note 1. Study protocols for all cohorts were reviewed and approved by the appropriate institutional review boards

Framingham Heart Study (FHS)

The Framingham Heart Study (FHS) is a prospective, multigeneration cohort study that has collected comprehensive data on cardiovascular, neurological, and other disease outcomes, blood chemistry data, and lifestyle risk factors since 194859. The original cohort enrolled a random sample of 5,209 participants from Framingham, Massachusetts who were monitored through biennial exams60. In 1971, the offspring cohort, consisting of 5,214 children of the original cohort and their spouses, was added and examined every 4 to 6 years6163. In 2002, the Third Generation Cohort, comprising 4,095 individuals who are the grandchildren of the Original Cohort, was established64. Participants underwent detailed cognitive, brain MRI, and neurological evaluations, with dementia diagnoses determined through a consensus process including neurologists and neuropsychologists. Surveillance methods for dementia have been published previously61,62 .The study protocol was approved by the Boston University Institutional Review Board, followed the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) reporting guideline, and was monitored by a National Heart, Lung, and Blood Institute Observational Study Monitoring Board. Additional details on genotyping, imputation, and quality control are provided in Supplementary Note 2.

Korean Brain Aging Study for the Early Diagnosis and Prediction of AD (KBASE)

KBASE is a prospective cohort study initiated at Seoul National University in 2014 that was designed following the protocol of the Alzheimer’s Disease Neuroimaging Initiative (ADNI). The cohort includes well-characterized individuals, comprising cognitively normal adults ages 20 to 90 years, as well as older adults diagnosed with mild cognitive impairment (MCI) or AD. Participants underwent longitudinal assessments capturing clinical, cognitive, and lifestyle information. WGS and phenotype data were obtained through the National Institute on Aging Genetics of Alzheimer’s Disease Data Storage Site NIAGADS; https://dss.niagads.org/datasets/ng00067/). Study protocols were approved by the relevant institutional review boards. Additional details on genotyping and quality control procedures are provided in Supplementary Note 3.

Rush University Medical Center (Rush) cohorts

We utilized data from four Rush cohorts, which are well-characterized, longitudinal, clinical-pathologic studies designed to investigate aging, cognitive decline, and dementia in diverse populations. The Religious Orders Study (ROS) and Rush Memory and Aging Project (MAP)65 are ongoing epidemiologic studies enrolling older individuals without known dementia at baseline, with annual clinical follow-ups and high autopsy rates. The Minority Aging Research Study (MARS)66 and African American Core of the Rush Alzheimer’s Disease Center (AAc)67 specifically focus on non-Hispanic Black participants. All four cohorts follow a harmonized study design, ensuring consistency in clinical assessments, neuropathological examinations, and cognitive evaluations across participants. This standardization allows for robust comparisons and integrated analyses of aging and neurodegenerative disease processes. For this study, we analyzed data from 897 Black participants across these cohorts, including 541 from MARS, 214 from AAc, 80 from MAP, and 62 from ROS. Through these harmonized assessments, our study provides a comprehensive and systematic evaluation of aging and neurodegenerative disease in Black individuals. The study is funded by the National Institute on Aging and approved by the Institutional Review Board of Rush University Medical Center. Further details on genotyping, imputation, and quality control are provided in Supplementary Note 4

All of Us (AoU)

The All of Us (AoU) Research Program is a large, longitudinal study aimed at enrolling over one million U.S. participants, with a focus on diverse populations for studies of the biological, clinical, social, and environmental determinants of health and disease (All of Us Research Program Investigators et al., 2019). Health data are collected from electronic medical records (EMRs) and participant surveys, which are publicly accessible via the program’s website. Eligible participants are U.S. residents aged 18 years or older who provide informed consent. Study protocols were reviewed and approved by the All of Us Institutional Review Board (IRB).

We accessed the AoU version 7 dataset (08/09/2024), which includes whole-genome sequencing (WGS) data from 245,394 individuals. Additional details on WGS and quality control procedures are provided in Supplementary Note 5. Alzheimer’s Disease (AD) phenotypes were defined following the Centralized Interactive Phenomics Resource (CIPHER) guidelines, specifically the classification developed by the Million Veteran Program (MVP) Cognitive Decline and Dementia During Aging Working Group68. AD cases were identified using ICD-9 code 333.0 and ICD-10 codes G30.1, G30.8, and G30.9. Controls were restricted to individuals aged 65 years or older without a diagnosis of dementia themselves and without a family history of AD, as determined by responses to the “Personal and Family Health History” survey question: “Including yourself, who in your family has had dementia (includes Alzheimer’s, vascular, etc.)?”. The age at diagnosis was recorded as the onset age and the current age was used for controls. After excluding related individuals and controls younger than 65 years to mitigate potential survival bias, 71,549 unrelated participants remained for analysis.

Whole genome-sequencing (WGS) and quality control in the ADSP dataset

We obtained whole-genome sequencing (WGS) and phenotype Release 4 data from NIAGADS (https://dss.niagads.org/datasets/ng00067/) for ADSP participants. Information about sequencing protocols, variant calling, and quality control (QC) procedures for ADSP data are available elsewhere (https://www.adgenomics.org/data/). We excluded individuals with a diagnosis of MCI and variants with minor allele frequency (MAF) ≤ 0.01 or missing genotype rates > 1% for the purpose of constructing the PRS. Population ancestry was determined from clusters derived by PCA as described in Supplementary Note 6. Details for assessing relatedness and population substructure using the GENESIS R package are provided in Supplementary Note 7.

PRS development

AD PRSs were calculated using GWAS summary statistics from datasets non-overlapping with each other or with the testing data set including data from EADB Stage I (excluding UKBB)6, FinnGen69, MVP7, and Japanese15 and Korean GWAS16. GWAS results from the relatively small Japanese and South Korean datasets were combined using GWAMA version 2.170. We constructed an AD PRS using multiple methods in order to identify the PRS with the best predictive performance across multi-ancestry populations (1) PRSice2: We utilized the Clumping and Thresholding method in PRSice271 following our previous approach72. Ancestry-specific subsets of the ADSP dataset served as the reference panel for estimating linkage disequilibrium (LD) for each ancestry. PRSs were constructed for each GWAS dataset using clumping parameters (R2) of 0.1, 0.2, 0.3, distances of 100 Kb, 250 Kb, 500 Kb, and p-value thresholds ranging from 5×10−8 to 0.5. The best-performing PRS was identified using the coefficient of variation approach, where the ADSP dataset was split into five random, unrelated subsets, and the PRS minimizing the coefficient of variation across effect sizes was selected. (2) LDPred2: PRSs were developed using the auto model in the bigsnpr R package version 1.9.1173 and approximately 1.4 million SNPs from HapMap3+74. The ADSP WGS dataset served as the reference panel for LD estimation and weight calibration. After computing the weights, PRSs were calculated without clumping parameters using PRSice2. (3) PRS-CS: PRS-CS is a Bayesian polygenic risk-scoring method that calculates a PRS from GWAS summary statistics. Ancestry appropriate portions of the 1000 Genome projects reference panel, including HapMap3 SNPs, were used for estimating LD75 in each of the GWAS datasets. Default PRS-CS parameters were applied and PRSs were computed in the ADSP dataset using PRSice2 without clumping. (4) PRS-CSx: We employed PRS-CSx software, which integrates summary statistics from multiple ancestries using the default parameters76. Each GWAS was assigned an ancestry-matched reference panel from 1KG. Since PRS-CSx requires a single dataset per ancestry, GWAS results from the EADB and FinnGen European ancestry datasets were combined by meta-analysis using GWAMA v2.170. Each iteration of PRS-CSx yielded three ancestry-specific PRS weights. PRSs were then computed in the ADSP dataset using PRSice2 without applying clumping parameters. To facilitate the comparison of effect sizes across multi-ancestry analyses, each PRS was scaled and centered to have a mean of 0 and a SD of 1 in the total ADSP sample.

Training of PRS summation weights using ADGC data

We identified and excluded from the ADGC dataset 33,402 individuals who overlapped the ADSP WGS dataset and individuals diagnosed with MCI, yielding an independent sample of 32,016 individuals. PRS weights were derived and computed for all ADGC participants using the same set of SNPs employed in the development of the PRS in the ADSP dataset. This process was conducted separately for each ADGC dataset (PRSice2, PRS-CS, PRS-CSx, and LDPred2). The PRS weights were scaled using the PRS means and SDs derived from the ADSP dataset77. The association of AD with the PRS derived from each GWAS (i.e., EADB, FinnGen, MVP, and EAS) was evaluated for each method using a logistic regression model including covariates for age (either onset of AD symptoms or age at last exam), sex, ancestry, cohort or study (to account for batch effects), the first 10 PCs to adjust for population stratification, and carrier status for the APOE ε4 and ε2 alleles. The effect estimates from the PRSs for each GWAS (w1 from EADB, w2 from FinnGen, w3 from MVP, and w4 from EAS) were derived separately for each PRS method and used as weights for the respective PRSs. These weighted effect estimates were subsequently used to calculate the PRS summation.

PRS evaluation in the ADSP dataset

We developed a multi-ancestry PRS by comparing its performance among three strategies used for its construction: (1) Meta-analysis: Combining GWAS summary statistics from all ADSP WGS datasets using GWAMA v2.1 (Mägi & Morris, 2010) and creating PRSs using PRSice2 and LDPred2; (2) Unweighted PRS summation: PRS1 + PRS2 + PRS3 + PRS4, applicable across all methods; (3) Weighted PRS summation: w1 × PRS1 + w2 × PRS2 + w3 × PRS3 + w4 × PRS4, applicable across all methods. To simplify the PRS summation combinations into a single PRS weight, we applied the “PRSsum Simple” approach using our previous method (https://github.com/nkurniansyah/PRSsum_Simple)78. The best-performing PRS was identified by comparing odds ratios (ORs), p-values and area under the curve (AUC) from tests of its association with AD. We further trained the best-performing PRS by excluding SNPs within a 1 Mb window of APOE. Summation weights were derived from the ADGC dataset, and the mean and SD of the final PRS calculated in the ADSP cohort are reported in Supplementary Table 13.

PRS validation in independent datasets

The performance of the AD PRS, constructed excluding the APOE region and adjusted for APOE ε4 and ε2 carrier status, was validated in the AoU version 7, FHS, KBASE, and Rush cohorts using the same SNP weights from the best-performing PRS in the ADSP evaluation. For weighted PRS summation, we applied PRS combination weights derived from the ADGC dataset to calculate PRS in these cohorts. To ensure consistency across cohorts, the PRS was scaled using the mean and SD from the ADSP dataset77. To evaluate the performance of our multi-ancestry PRS, we benchmarked it against previously published AD-specific variant weights curated in the PGS Catalog6,10,18,22,7994, as well as a recently published PRS12. Notably, variant weights in all of these studies were derived from GWAS conducted in EA individuals only, many of which included individuals overlapping the ADGC and ADSP datasets. To avoid bias due to sample overlap, benchmarking analyses were restricted to non-European ancestry groups including AA, CH and NAH ADSP participants, and Korean KBASE participants. Published variant weights (excluding the APOE region) were downloaded and uniformly applied to individual-level genotype data for ADSP and KBASE participants to construct the AD PRS. Association of the PRS with AD was tested separately in each ancestry group using logistic regression models adjusted for age, sex, dosage of APOE ε4 and ε2, and 10 PCs. Study accession numbers and corresponding publications are provided in Supplementary Tables 6 and 7.

AD-related traits

We examined the association between AD PRS and AD-related phenotypes using data from multiple cohorts listed in Supplementary Table 14.

AD Biomarkers:

Cerebrospinal fluid (CSF) measures of amyloid beta (Aß), total Tau, and phosphorylated Tau 181 (pTau181) obtained from 1,389 ADSP participants were standardized and harmonized using Z-score normalization to ensure comparability95,96. pTau181 was measured in plasma from 1,689 FHS participants derived from blood samples obtained at Exam 9 using the Quanterix Simoa pTau-181 Advantage V2 kit on EDTA plasma samples39. Plasma p-tau217 concentrations were measured longitudinally in ADNI participants using the Fujirebio Lumipulse G1200 automated chemiluminescent enzyme immunoassay, a validated platform for high-throughput quantification of AD biomarkers97,98.

Neuropathological traits:

Measures of amyloid plaque severity (CERAD score), neurofibrillary degeneration (Braak stage), and Thal phase were obtained from brain tissue donated by 2,862 ADSP participants. The CERAD score is a semi-quantitative measure of neuritic plaque density, based on established guidelines from the Consortium to Establish a Registry for Alzheimer’s Disease (CERAD)99,100. Braak staging characterizes the distribution of neurofibrillary tangles (NFTs) across six stages: stages I-II indicate NFTs in the entorhinal cortex, III-IV involve limbic regions such as the hippocampus, and V-VI reflect neocortical involvement101. It consolidates Braak NFTs stages into four levels of severity, providing a standardized assessment of AD-related tau pathology progression102. Thal phasing describes beta-amyloid deposition across five phases, progressing from neocortical to widespread deposition103. The Alzheimer’s Disease Neuropathologic Change (ADNC) score of AD severity determined using modified NIA-Reagan criteria was categorized as low, intermediate, or high, based on CERAD and Braak scores102.

Cognitive domain measures:

Co-calibrated factor scores for several cognitive domains (executive function, language, and memory) were generated using bifactor structural equation models described elsewhere104106. These cognitive performance measures are comparable across individuals and datasets regardless of the cognitive battery administered. Scores with a standard error (SE) > 0.6 and individuals younger than 60 years were excluded. The total sample included 59,906 ADSP, 1,762 KBASE, and 8,465 FHS participants with genetic data and harmonized longitudinal cognitive domain scores. In addition, episodic memory and global cognitive function scores were available for 895 Rush participants and were derived using established protocols107.

Neuroimaging data:

Longitudinal MRI measures of hippocampal volume were obtained for 1,945 ADSP and 970 FHS participants older than 60 years. ADSP T1-weighted scans were processed using the automated FreeSurfer v6.0 pipeline (https://surfer.nmr.mgh.harvard.edu), following standard recon-all procedures for cross-sectional data. Detailed processing protocols have been described previously108. Visual quality control followed the ENIGMA protocol (https://enigma.ini.usc.edu/protocols/imaging-protocols/), in which ten trained raters scored cortical surface parcellations on a standardized 3-point scale; scans rated 2 (acceptable) or 3 (full pass) were retained. To harmonize imaging data across scanners, sites, and field strengths, longitudinal ComBat was applied, preserving biological variance due to age, sex, race/ethnicity, and diagnostic status109.

FHS structural weighted MRI scans were processed using FreeSurfer 7.4.1. Participant confidentiality was ensured using MiDeFace, which replaces identifiable facial features with an average face template (“MIDEFACE” label) while minimally altering non-facial structures (https://surfer.nmr.mgh.harvard.edu/fswiki/MiDeFace). As part of the quality assurance (QA) pipeline, all scans were manually inspected to verify the presence of the MIDEFACE label and characteristic facial ridges, indicating that the 3D reconstruction was non-identifiable. Quality was scored on a 1–10 scale; scans rated ≥6 were retained, while those scoring ≤5 were excluded due to inadequate quality or excessive correction requirements. Total hippocampal volume was calculated as the sum of left and right hippocampal volumes. All volumes were normalized to intracranial volume to account for head size differences and subsequently log-transformed for statistical analysis110,111.

Statistical analysis

We evaluated the association of AD PRS with AD using models adjusted for age, sex, ancestry, and 10 PCs, and kinship matrix to account for relatedness among subjects implemented in the GENESIS R package (version 2.16.1). Additional models included APOE ε4 and ε2 status. The AoU and Rush cohorts were analyzed using logistic regression models, while generalized linear mixed models with a logit link function accounted for family structure in the FHS dataset. Analyses of Rush and FHS data also adjusted for years of education. The area under the curve (AUC) for each model was calculated using the pROC R package (version 1.18.5)112, restricting analyses to unrelated individuals to prevent bias from familial correlations and maintain independent observation. Continuous AD related-traits (e.g. cognitive domain score, biomarker measures and hippocampal volume) were analyzed using linear regression and linear mixed-effects models (LMMs) via the lme4 package (v1.1–29), and ordinal traits (e.g., neuropathological staging) were assessed using ordinal logistic regression with the polr function from the MASS package (v7.3–60). AT classification (A−T−, A+T−, A−T+, A+T+) was modeled using multinomial logistic regression implemented in the VGAM R package (v1.1–6)113, with continuous AD PRS as the predictor, adjusting for age at biomarker measurement, education, APOE ε4 and ε2 status, and 10 PCs. Predicted probabilities were estimated across the PRS distribution and averaged over the study population. Cognitive domain results from ADSP and FHS datasets were combined by meta-analysis using fixed-effects models in the meta R package (v8.0–1)114.

To examine the influence of AD PRS on age-related changes in memory performance, we fit generalized additive mixed models (GAMMs), which extend linear mixed models by allowing nonlinear age effects while accounting for random effects, using the gamm4 R package version 0.2–6115. Age was centered at the sample median and modeled as a penalized spline with 5 basis functions (k = 5) to allow nonlinear trajectories. Random intercepts and slopes accounted for repeated measures. Analyses included individuals ages 60–80 years with ≥2 cognitive assessments. AD PRS was modeled categorically using empirical quantiles (<10%, 10–50%, 50–90%, >90%), with an age × PRS group interaction term to assess differential trajectories. Marginal predictions and 95% confidence intervals were extracted using the ggeffects R package. To evaluate whether cognitive decline differed by PRS and AD status, we additionally fit LMMs, which allow for interpretable estimation of interaction effects and enable direct comparison of cognitive decline slopes across groups. Due to limited sample size among AD cases, AD PRS was modeled categorically using tertiles (low, moderate, high) to ensure stable group comparisons. The model included fixed effects for age at first visit, dementia diagnosis, PRS group, time since baseline, and their two- and three-way interactions:

factorscoreij=β0+β1ageat1stvisiti+β2diagnosisi+β3PRSi+β4timeij+β5(PRSi×timeij)+β6(PRSi×diagnosisi)+β7(PRSi×diagnosisi×timeij)+covariates+ui+ij

Where ui denotes participant-specific random intercepts and slopes. Both models adjusted for the same covariates: sex, education, APOE ε2/ε4 status, ancestry, and 10 PCs. Visualization of marginal predictions with 95% confidence intervals was performed using the ggeffects R package.

Supplementary Material

Supplement 1
media-1.pdf (4.5MB, pdf)

Acknowledgments

This study was supported by National Institute on Aging grants R01-AG048927, U01-AG058654, U01-AG062602, P30-AG072978, U19-AG068753, U01-AG072577, R01-AG080810, and U01-AG082665; National Science Foundation grant DMS/NIGMS-2347698. NK was supported by the National Institute of General Medical Sciences grant T32-GM150533. ML was supported by MVP000 as well as MVP grant (MVP015/VA BLR&D I01BX004192) and the continuation grant (MVP040/BLR&D I01BX005749). This publication does not represent the views of the Department of Veteran Affairs or the United States Government. Acknowledgments for the Alzheimer’s Disease Sequencing Project (ADSP), Alzheimer’s Disease Genetics Consortium (ADGC) and Million Veterans Program (MVP) datasets are provided in Supplementary Notes 8, 9 and 10. The Korean Brain Aging Study for the Early Diagnosis and Prediction of Alzheimer’s disease (KBASE) is supported by a grant from Ministry of Science, ICT and Future Planning (Grant No: NRF-2014M3C7A1046042). This work was also supported by the Framingham Heart Study’s National Heart, Lung, and Blood Institute contract N01-HC-25195. We express our gratitude to the staff and participants of the Framingham Heart Study.

The authors thank the participants of the Minority Aging Research Study, the Rush Memory and Aging Project, the Religious Order Study, and the Rush Clinical Core for their invaluable contributions. We thank Charlene Gamboa, MPH; Tracy Colvin, MPH; Tracey Nowakowski, MS; and Theresa Jenkins for study recruitment and coordination, and John Gibbons, MS and Greg Klein for data management, and the staff of the Rush Alzheimer’s Disease Center. This research was supported by National Institute on Aging Grants P30AG10161, P30AG72975, R01AG17917. R01 AG015819, U01 AG072572, U01 AG046152, and R01AG22018 and the Illinois Department of Public Health.

The All of Us Research Program is supported by the National Institutes of Health, Office of the Director: Regional Medical Centers: 1 OT2 OD026549; 1 OT2 OD026554; 1 OT2 OD026557; 1 OT2 OD026556; 1 OT2 OD026550; 1 OT2 OD 026552; 1 OT2 OD026553; 1 OT2 OD026548; 1 OT2 OD026551; 1 OT2 OD026555; IAA #: AOD 16037; Federally Qualified Health Centers: HHSN 263201600085U; Data and Research Center: 5 U2C OD023196; Biobank: 1 U24 OD023121; The Participant Center: U24 OD023176; Participant Technology Systems Center: 1 U24 OD023163; Communications and Engagement: 3 OT2 OD023205; 3 OT2 OD023206; and Community Partners: 1 OT2 OD025277; 3 OT2 OD025315; 1 OT2 OD025337; 1 OT2 OD025276. The All of Us Research Program would not be possible without the partnership of its participants.

We thank Dr. Takeshi Ikeuchi for use of GWAS data from the Japanese Genetic Study Consortium for Alzheimer’s disease (JGSCAD) provided to the Alzheimer’s Disease Genetics Consortium and the European Alzheimer’s Disease Biobank (EADB) for providing GWAS summary statistics. Further information about the EADB is available at https://eadb.eu/. We also acknowledge the participants and investigators of the FinnGen study

Competing interests

CC is a member of the scientific advisory board of Circular Genomics and owns stock and serves on the scientific advisory board of ADmit and Alamar. He also consults for Sanofi, NovoNordisk, and Owkin, and received research support from GSK, Danaher and EISAI. AJS serves on advisory boards for Siemens Medical Solutions, Eisai Pharmaceuticals and Novo Nordisk, on a monitoring board and external advisory committees for NIH, and as Editor-in-Chief for Brain, Imaging and Behavior. AJS also received equipment from Avid Radiopharmaceuticals and Gates Ventures, and in-kind contribution of proteomics assays from Sanofi. TJH serves as a consultant for Circular Genomics and on the editorial board of multiple journals, owns stock in Vivid Genomics, and received travel support from the Alzheimer’s Association. L-SW received honoraria for several lectures. GDS received an honorarium for serving on an external advisory board. RA received grants from NIH, the Alzheimer’s Disease Data Initiative, Gates Ventures, American Heart Association and Chosun University; consulting fees from Novo Nordisk, Signant Health and GSK; and equipment and materials from Eli Lilly/Avid, Robert Thomas, OpenAI and Linus Health. JM received honoraria from the Concussion Legacy Foundation and Imperial College London. LAF received support from NIH grants and an honorarium for serving as a journal editor, and is a scientific advisor for ALZAI Health. None of the other authors have conflicts of interest to disclose.

Data availability statement

AD PRS variants and weights will be deposited in the PGS Catalog upon paper acceptance. These data will also be available on the GitHub repository (www.github.com/nkurniansyah/Alzheimer_PRS). Summary statistics for ADRD in the Million Veteran Program African-American dataset have been deposited in dbGaP under study accession phs001672. Summary statistics of the EADB GWAS (without UK biobank) have been deposited to the European Bioinformatics Institute GWAS Catalog (https://www.ebi.ac.uk/gwas/) under accession no. GCST90565439. Summary statistics for AD in the FinnGen European release 10 are available at the FinnGen website(www.finngen.fi/en/access_results). Summary statistics for AD in the Korean GWAS are available from the National Institute on Aging Genetics of Alzheimer’s Disease Storage Site (NIAGADS) (www.niagads.org). Summary statistics for AD in the Japanese GWAS can be accessed at the Database Center for Life Science (DBCLS)(www.humandbs.dbcls.jp/en/). Genotyping and phenotypic data for the Alzheimer’s Disease Sequencing Project (ADSP) are available through the NIAGADS under NG00067 – ADSP Umbrella. Genotyping and phenotypic data for the Alzheimer’s Disease Genetics Consortium can be requested from NIAGADS (www.dss.niagads.org/studies/sa000003). Genotyping and phenotypic data for the Rush cohorts (ROSMAP/MARS/ AAc) can be requested through the Rush Alzheimer’s Disease Center (www.radc.rush.edu). Genotyping and phenotypic data for the Framingham Heart Study (FHS) are available in dbGaP and/or through application to the FHS Research Committee (www.framinghamheartstudy.org/fhs-for-researchers). Data from the NIH All of Us study can be accessed through institutional agreements for researchers who meet eligibility requirements for confidential data access. Interested researchers must complete the registration process via the All of Us Researcher Workbench (www.researchallofus.org/register). For further inquiries, researchers can contact All of Us Researcher Workbench Support at support@researchallofus.org.

Code availability

Scripts used to perform analyses described in the paper and code to construct multi-ancestry AD PRS are available in the GitHub repository (https://github.com/nkurniansyah/Alzheimer_PRS) and the Zenodo repository (will be provided upon paper acceptance).

References

  • 1.Alzheimer’s Association. 2024 Alzheimer’s disease facts and figures. Alzheimers Dement 20, 3708–3821 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Roe J. M. et al. Brain change trajectories in healthy adults correlate with Alzheimer’s related genetic variation and memory decline across life. Nat. Commun. 15, 10651 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Corder E. H. et al. Gene dose of apolipoprotein E type 4 allele and the risk of Alzheimer’s disease in late onset families. Science 261, 921–923 (1993). [DOI] [PubMed] [Google Scholar]
  • 4.Farrer L. A. et al. Effects of age, sex, and ethnicity on the association between apolipoprotein E genotype and Alzheimer disease. A meta-analysis. APOE and Alzheimer Disease Meta Analysis Consortium. JAMA 278, 1349–1356 (1997). [PubMed] [Google Scholar]
  • 5.Kunkle B. W. et al. Genetic meta-analysis of diagnosed Alzheimer’s disease identifies new risk loci and implicates Aβ, tau, immunity and lipid processing. Nat. Genet. 51, 414–430 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Bellenguez C. et al. New insights into the genetic etiology of Alzheimer’s disease and related dementias. Nat. Genet. 54, 412–436 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Sherva R. et al. African ancestry GWAS of dementia in a large military cohort identifies significant risk loci. Mol. Psychiatry 28, 1293–1302 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Kang M. et al. Genome-wide pleiotropy analysis of longitudinal blood pressure and harmonized cognitive performance measures. medRxiv (2025) doi: 10.1101/2025.02.11.25322014. [DOI] [Google Scholar]
  • 9.Choi S. W., Mak T. S.-H. & O’Reilly P. F. Tutorial: a guide to performing polygenic risk score analyses. Nat. Protoc. 15, 2759–2772 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Sofer T. et al. A polygenic risk score for Alzheimer’s disease constructed using APOE-region variants has stronger association than APOE alleles with mild cognitive impairment in Hispanic/Latino adults in the U.S. Alzheimers Res Ther 15, 146 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Clark K., Leung Y. Y., Lee W.-P., Voight B. & Wang L.-S. Polygenic risk scores in alzheimer’s disease genetics: methodology, applications, inclusion, and diversity. J Alzheimers Dis 89, 1–12 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Nicolas A. et al. Transferability of European-derived Alzheimer’s disease polygenic risk scores across multiancestry populations. Nat. Genet. 57, 1598–1610 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Kunkle B. W. et al. Novel Alzheimer Disease Risk Loci and Pathways in African American Individuals Using the African Genome Resources Panel: A Meta-analysis. JAMA Neurol. 78, 102–113 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Sariya S. et al. Polygenic risk score for alzheimer’s disease in caribbean hispanics. Ann. Neurol. 90, 366–376 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Shigemizu D. et al. Ethnic and trans-ethnic genome-wide association studies identify new loci influencing Japanese Alzheimer’s disease risk. Transl. Psychiatry 11, 151 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Kang M. et al. Whole-genome sequencing study in Koreans identifies novel loci for Alzheimer’s disease. Alzheimers Dement (2024) doi: 10.1002/alz.14128. [DOI] [Google Scholar]
  • 17.Bellou E. et al. Benchmarking Alzheimer’s disease prediction: personalised risk assessment using polygenic risk scores across various methodologies and genome-wide studies. Alzheimers Res Ther 17, 6 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Jung S.-H. et al. Transferability of Alzheimer Disease Polygenic Risk Score Across Populations and Its Association With Alzheimer Disease-Related Phenotypes. JAMA Netw. Open 5, e2247162 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Yang H.-S. et al. Cell-type-specific Alzheimer’s disease polygenic risk scores are associated with distinct disease processes in Alzheimer’s disease. Nat. Commun. 14, 7659 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Chandler H. L., Hodgetts C. J., Caseras X., Murphy K. & Lancaster T. M. Polygenic risk for Alzheimer’s disease shapes hippocampal scene-selectivity. Neuropsychopharmacology 45, 1171–1178 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Tasaki S., Gaiteri C., Mostafavi S., De Jager P. L. & Bennett D. A. The molecular and neuropathological consequences of genetic risk for alzheimer’s dementia. Front. Neurosci. 12, 699 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Lourida I. et al. Association of lifestyle and genetic risk with incidence of dementia. JAMA 322, 430–437 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Miao J. et al. Quantifying portable genetic effects and improving cross-ancestry genetic prediction with GWAS summary statistics. Nat. Commun. 14, 832 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Marnetto D. et al. Ancestry deconvolution and partial polygenic score can improve susceptibility predictions in recently admixed individuals. Nat. Commun. 11, 1628 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Mostafavi H. et al. Variable prediction accuracy of polygenic scores within an ancestry group. eLife 9, (2020). [Google Scholar]
  • 26.Martin A. R. et al. Clinical use of current polygenic risk scores may exacerbate health disparities. Nat. Genet. 51, 584–591 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Duncan L. et al. Analysis of polygenic risk score usage and performance in diverse human populations. Nat. Commun. 10, 3328 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Majara L. et al. Low and differential polygenic score generalizability among African populations due largely to genetic diversity. HGG Adv. 4, 100184 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Luchsinger J. A. et al. Aggregation of vascular risk factors and risk of incident Alzheimer disease. Neurology 65, 545–551 (2005). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Rajabli F. et al. Ancestral origin of ApoE ε4 Alzheimer disease risk in Puerto Rican and African American populations. PLoS Genet. 14, e1007791 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Choi K. Y. et al. APOE Promoter Polymorphism-219T/G is an Effect Modifier of the Influence of APOE ε4 on Alzheimer’s Disease Risk in a Multiracial Sample. J. Clin. Med. 8, (2019). [Google Scholar]
  • 32.Tang S. et al. Novel Variance-Component TWAS method for studying complex human diseases with applications to Alzheimer’s dementia. PLoS Genet. 17, e1009482 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Aravena J. M. et al. Beneficial Effect of Societal Factors on APOE-ε2 and ε4 Carriers’ Brain Health: A Systematic Review. J. Gerontol. A Biol. Sci. Med. Sci. 79, (2024). [Google Scholar]
  • 34.Araújo D. S. & Wheeler H. E. Genetic and environmental variation impact transferability of polygenic risk scores. Cell Rep. Med. 3, 100687 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Hao L. et al. Development of a clinical polygenic risk score assay and reporting workflow. Nat. Med. 28, 1006–1013 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Spencer B. E., Jennings R. G., Fan C. C. & Brewer J. B. Assessment of genetic risk for improved clinical-neuropathological correlations. Acta Neuropathol. Commun. 8, 160 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Campbell M. R. et al. P-tau/Aβ42 and Aβ42/40 ratios in CSF are equally predictive of amyloid PET status. Alzheimers Dement (Amst) 13, e12190 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Pilotto A. et al. Differences Between Plasma and Cerebrospinal Fluid p-tau181 and p-tau231 in Early Alzheimer’s Disease. J Alzheimers Dis 87, 991–997 (2022). [DOI] [PubMed] [Google Scholar]
  • 39.McGrath E. R. et al. Blood phosphorylated tau 181 as a biomarker for amyloid burden on brain PET in cognitively healthy adults. J Alzheimers Dis 87, 1517–1526 (2022). [DOI] [PubMed] [Google Scholar]
  • 40.Hu S., Yu H. & Gao J. The pTau217/Aβ1–42 plasma ratio: The first FDA-cleared blood biomarker test for diagnosis of Alzheimer’s disease. Drug Discov. Ther. 19, 208–209 (2025). [DOI] [PubMed] [Google Scholar]
  • 41.Vasiljevic E. et al. Cognitive trajectories diverge by genetic risk in a preclinical longitudinal cohort. Alzheimers Dement 19, 3108–3118 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Porter T. et al. A polygenic risk score derived from episodic memory weighted genetic variants is associated with cognitive decline in preclinical alzheimer’s disease. Front. Aging Neurosci. 10, 423 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Kauppi K., Rönnlund M., Nordin Adolfsson A., Pudas S. & Adolfsson R. Effects of polygenic risk for Alzheimer’s disease on rate of cognitive decline in normal aging. Transl. Psychiatry 10, 250 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Ritchie S. J. et al. Polygenic predictors of age-related decline in cognitive ability. Mol. Psychiatry 25, 2584–2598 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Gao X.-L., Wang J., Wang Y.-J. & Bu X.-L. Combining multiple factors to predict alzheimer’s disease. Neurosci. Bull. 38, 969–972 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Anderson C. D. et al. The effect of survival bias on case-control genetic association studies of highly lethal diseases. Circ. Cardiovasc. Genet. 4, 188–196 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Yagi S. & Galea L. A. M. Sex differences in hippocampal cognition and neurogenesis. Neuropsychopharmacology 44, 200–213 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Eissman J. M. et al. Sex-specific genetic architecture of late-life memory performance. Alzheimers Dement 20, 1250–1267 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Saleh R. N. M., Hornberger M., Ritchie C. W. & Minihane A. M. Hormone replacement therapy is associated with improved cognition and larger brain volumes in at-risk APOE4 women: results from the European Prevention of Alzheimer’s Disease (EPAD) cohort. Alzheimers Res Ther 15, 10 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Arenaza-Urquijo E. M. et al. Sex and gender differences in cognitive resilience to aging and Alzheimer’s disease. Alzheimers Dement 20, 5695–5719 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Guo L., Zhong M. B., Zhang L., Zhang B. & Cai D. Sex differences in alzheimer’s disease: insights from the multiomics landscape. Biol. Psychiatry 91, 61–71 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Rahman A. et al. Sex and gender driven modifiers of alzheimer’s: the role for estrogenic control across age, race, medical, and lifestyle risks. Front. Aging Neurosci. 11, 315 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Wang Y. et al. Key variants via the Alzheimer’s Disease Sequencing Project whole genome sequence data. Alzheimers Dement 20, 3290–3304 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Tsapanou A. et al. Data from a cross-sectional study on Apolipoprotein E (APOE-ε4) and snoring/sleep apnea in non-demented older adults. Data Brief 5, 351–353 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Naj A. C. et al. Quality control and integration of genotypes from two calling pipelines for whole genome sequence data in the Alzheimer’s disease sequencing project. Genomics 111, 808–818 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Rajabli F. et al. Multi-ancestry genome-wide meta-analysis of 56,241 individuals identifies LRRC4C, LHX5-AS1 and nominates ancestry-specific loci PTPRK, GRB14, and KIAA0825 as novel risk loci for Alzheimer’s disease: the Alzheimer’s Disease Genetics Consortium. medRxiv (2023) doi: 10.1101/2023.07.06.23292311. [DOI] [Google Scholar]
  • 57.Jun G. R. et al. Transethnic genome-wide scan identifies novel Alzheimer’s disease loci. Alzheimers Dement 13, 727–738 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Naj A. C. et al. Common variants at MS4A4/MS4A6E, CD2AP, CD33 and EPHA1 are associated with late-onset Alzheimer’s disease. Nat. Genet. 43, 436–441 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Tsao C. W. & Vasan R. S. Cohort Profile: The Framingham Heart Study (FHS): overview of milestones in cardiovascular epidemiology. Int. J. Epidemiol. 44, 1800–1813 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Dawber T. R., Meadors G. F. & Moore F. E. Epidemiological approaches to heart disease: the Framingham Study. Am J Public Health Nations Health 41, 279–281 (1951). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Satizabal C. L. et al. Incidence of Dementia over Three Decades in the Framingham Heart Study. N. Engl. J. Med. 374, 523–532 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Archer D. B. et al. Longitudinal change in memory performance as a strong endophenotype for Alzheimer’s disease. Alzheimers Dement 20, 1268–1283 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Feinleib M., Kannel W. B., Garrison R. J., McNamara P. M. & Castelli W. P. The Framingham Offspring Study. Design and preliminary data. Prev. Med. 4, 518–525 (1975). [DOI] [PubMed] [Google Scholar]
  • 64.Splansky G. L. et al. The Third Generation Cohort of the National Heart, Lung, and Blood Institute’s Framingham Heart Study: design, recruitment, and initial examination. Am. J. Epidemiol. 165, 1328–1335 (2007). [DOI] [PubMed] [Google Scholar]
  • 65.Bennett D. A. et al. Religious orders study and rush memory and aging project. J Alzheimers Dis 64, S161–S189 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Barnes L. L., Shah R. C., Aggarwal N. T., Bennett D. A. & Schneider J. A. The Minority Aging Research Study: ongoing efforts to obtain brain donation in African Americans without dementia. Curr. Alzheimer Res. 9, 734–745 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Schneider J. A., Aggarwal N. T., Barnes L., Boyle P. & Bennett D. A. The neuropathology of older persons with and without dementia from community versus clinic cohorts. J Alzheimers Dis 18, 691–701 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Logue M. W. et al. Alzheimer’s disease and related dementias among aging veterans: Examining gene-by-environment interactions with post-traumatic stress disorder and traumatic brain injury. Alzheimers Dement 19, 2549–2559 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Kurki M. I. et al. FinnGen provides genetic insights from a well-phenotyped isolated population. Nature 613, 508–518 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Mägi R. & Morris A. P. GWAMA: software for genome-wide association meta-analysis. BMC Bioinformatics 11, 288 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Choi S. W. & O’Reilly P. F. PRSice-2: Polygenic Risk Score software for biobank-scale data. Gigascience 8, (2019). [Google Scholar]
  • 72.Kurniansyah N. et al. A multi-ethnic polygenic risk score is associated with hypertension prevalence and progression throughout adulthood. Nat. Commun. 13, 3549 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Privé F., Arbel J. & Vilhjálmsson B. J. LDpred2: better, faster, stronger. Bioinformatics 36, 5424–5431 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Privé F., Albiñana C., Arbel J., Pasaniuc B. & Vilhjálmsson B. J. Inferring disease architecture and predictive ability with LDpred2-auto. Am. J. Hum. Genet. 110, 2042–2055 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Ge T., Chen C.-Y., Ni Y., Feng Y.-C. A. & Smoller J. W. Polygenic prediction via Bayesian regression and continuous shrinkage priors. Nat. Commun. 10, 1776 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Ruan Y. et al. Improving polygenic prediction in ancestrally diverse populations. Nat. Genet. 54, 573–580 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Kurniansyah N. et al. Evaluating the use of blood pressure polygenic risk scores across race/ethnic background groups. Nat. Commun. 14, 3202 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Kurniansyah N. et al. Polygenic scores for obstructive sleep apnoea reveal pathways contributing to cardiovascular disease. EBioMedicine 117, 105790 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.van der Lee S. J. et al. The effect of APOE and other common genetic variants on the onset of Alzheimer’s disease and dementia: a community-based cohort study. Lancet Neurol. 17, 434–444 (2018). [DOI] [PubMed] [Google Scholar]
  • 80.Leonenko G. et al. Polygenic risk and hazard scores for Alzheimer’s disease prediction. Ann. Clin. Transl. Neurol. 6, 456–465 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Ebenau J. L. et al. Risk of dementia in APOE ε4 carriers is mitigated by a polygenic risk score. Alzheimers Dement (Amst) 13, e12229 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Zimmerman S. C. et al. Association of Genetic Variants Linked to Late-Onset Alzheimer Disease With Cognitive Test Performance by Midlife. JAMA Netw. Open 5, e225491 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Xicota L. et al. Association of APOE-Independent Alzheimer Disease Polygenic Risk Score With Brain Amyloid Deposition in Asymptomatic Older Adults. Neurology 99, e462–e475 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Petrican R., Paine A. L., Escott-Price V. & Shelton K. H. Overlapping brain correlates of superior cognition among children at genetic risk for Alzheimer’s disease and/or major depressive disorder. Sci. Rep. 13, 984 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Mukadam N., Giannakopoulou O., Bass N., Kuchenbaecker K. & McQuillin A. Genetic risk scores and dementia risk across different ethnic groups in UK Biobank. PLoS ONE 17, e0277378 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Green R. E. et al. Investigating associations between blood metabolites, later life brain imaging measures, and genetic risk for Alzheimer’s disease. Alzheimers Res Ther 15, 38 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Lake J. et al. Multi-ancestry meta-analysis and fine-mapping in Alzheimer’s disease. Mol. Psychiatry 28, 3121–3132 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Tomassen J. et al. Amyloid-β and APOE genotype predict memory decline in cognitively unimpaired older individuals independently of Alzheimer’s disease polygenic risk score. BMC Neurol. 22, 484 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Sleiman P. M. et al. Trans-ethnic genomic informed risk assessment for Alzheimer’s disease: An International Hundred K+ Cohorts Consortium study. Alzheimers Dement 19, 5765–5772 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Lawingco T. et al. Genetic variants in glutamate-, Aβ-, and tau-related pathways determine polygenic risk for Alzheimer’s disease. Neurobiol. Aging 101, 299.e13–299.e21 (2021). [Google Scholar]
  • 91.Chouraki V. et al. Evaluation of a genetic risk score to improve risk prediction for alzheimer’s disease. J Alzheimers Dis 53, 921–932 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.Desikan R. S. et al. Genetic assessment of age-associated Alzheimer disease risk: Development and validation of a polygenic hazard score. PLoS Med. 14, e1002258 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.Tosto G. et al. Polygenic risk scores in familial Alzheimer disease. Neurology 88, 1180–1186 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.Najar J. et al. Polygenic risk scores for Alzheimer’s disease are related to dementia risk in APOE ɛ4 negatives. Alzheimers Dement (Amst) 13, e12142 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95.Timsina J. et al. Harmonization of CSF and imaging biomarkers for Alzheimer’s disease biomarkers: need and practical applications for genetics studies and preclinical classification. BioRxiv (2023) doi: 10.1101/2023.05.24.542118. [DOI] [Google Scholar]
  • 96.Deming Y. et al. Genome-wide association study identifies four novel loci associated with Alzheimer’s endophenotypes and disease modifiers. Acta Neuropathol. 133, 839–856 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Shaw L. M. et al. ADNI Biomarker Core: A review of progress since 2004 and future challenges. Alzheimers Dement 21, e14264 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98.Arranz J. et al. Diagnostic performance of plasma pTau217, pTau181, Aβ1–42 and Aβ1–40 in the LUMIPULSE automated platform for the detection of Alzheimer disease. Alzheimers Res Ther 16, 139 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99.Mirra S. S. et al. The Consortium to Establish a Registry for Alzheimer’s Disease (CERAD). Part II. Standardization of the neuropathologic assessment of Alzheimer’s disease. Neurology 41, 479–486 (1991). [DOI] [PubMed] [Google Scholar]
  • 100.Bennett D. A. et al. Neuropathology of older persons without cognitive impairment from two community-based studies. Neurology 66, 1837–1844 (2006). [DOI] [PubMed] [Google Scholar]
  • 101.Braak H. & Braak E. Neuropathological stageing of Alzheimer-related changes. Acta Neuropathol. 82, 239–259 (1991). [DOI] [PubMed] [Google Scholar]
  • 102.Hyman B. T. et al. National Institute on Aging-Alzheimer’s Association guidelines for the neuropathologic assessment of Alzheimer’s disease. Alzheimers Dement 8, 1–13 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 103.Thal D. R., Rüb U., Orantes M. & Braak H. Phases of A beta-deposition in the human brain and its relevance for the development of AD. Neurology 58, 1791–1800 (2002). [DOI] [PubMed] [Google Scholar]
  • 104.Mukherjee S. et al. Cognitive domain harmonization and cocalibration in studies of older adults. Neuropsychology 37, 409–423 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 105.Kang M. et al. A genome-wide search for pleiotropy in more than 100,000 harmonized longitudinal cognitive domain scores. Mol. Neurodegener. 18, 40 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 106.Scollard P. et al. Ceiling effects and differential measurement precision across calibrated cognitive scores in the Framingham Study. Neuropsychology 37, 383–397 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 107.Wilson R. S. et al. Temporal course and pathologic basis of unawareness of memory loss in dementia. Neurology 85, 984–991 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 108.Fischl B. FreeSurfer. Neuroimage 62, 774–781 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 109.Beer J. C. et al. Longitudinal ComBat: A method for harmonizing longitudinal multi-scanner imaging data. Neuroimage 220, 117129 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 110.Weissman D. G., Hatzenbuehler M. L., Cikara M., Barch D. M. & McLaughlin K. A. State-level macro-economic factors moderate the association of low income with brain structure and mental health in U.S. children. Nat. Commun. 14, 2085 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 111.Vijayakumar A. & Vijayakumar A. Comparison of hippocampal volume in dementia subtypes. ISRN Radiol. 2013, 174524 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 112.Robin X. et al. pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics 12, 77 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 113.Yee T. W. Vector generalized linear and additive models. (Springer; New York, 2015). doi: 10.1007/978-1-4939-2818-7. [DOI] [Google Scholar]
  • 114.Schwarzer G., Carpenter J. R. & Rücker G. Meta-Analysis with R. (Springer International Publishing, 2015). doi: 10.1007/978-3-319-21416-0. [DOI] [Google Scholar]
  • 115.Wood S. N. Generalized Additive Models: An Introduction with R. (Chapman and Hall/CRC, 2017). doi: 10.1201/9781315370279. [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplement 1
media-1.pdf (4.5MB, pdf)

Data Availability Statement

AD PRS variants and weights will be deposited in the PGS Catalog upon paper acceptance. These data will also be available on the GitHub repository (www.github.com/nkurniansyah/Alzheimer_PRS). Summary statistics for ADRD in the Million Veteran Program African-American dataset have been deposited in dbGaP under study accession phs001672. Summary statistics of the EADB GWAS (without UK biobank) have been deposited to the European Bioinformatics Institute GWAS Catalog (https://www.ebi.ac.uk/gwas/) under accession no. GCST90565439. Summary statistics for AD in the FinnGen European release 10 are available at the FinnGen website(www.finngen.fi/en/access_results). Summary statistics for AD in the Korean GWAS are available from the National Institute on Aging Genetics of Alzheimer’s Disease Storage Site (NIAGADS) (www.niagads.org). Summary statistics for AD in the Japanese GWAS can be accessed at the Database Center for Life Science (DBCLS)(www.humandbs.dbcls.jp/en/). Genotyping and phenotypic data for the Alzheimer’s Disease Sequencing Project (ADSP) are available through the NIAGADS under NG00067 – ADSP Umbrella. Genotyping and phenotypic data for the Alzheimer’s Disease Genetics Consortium can be requested from NIAGADS (www.dss.niagads.org/studies/sa000003). Genotyping and phenotypic data for the Rush cohorts (ROSMAP/MARS/ AAc) can be requested through the Rush Alzheimer’s Disease Center (www.radc.rush.edu). Genotyping and phenotypic data for the Framingham Heart Study (FHS) are available in dbGaP and/or through application to the FHS Research Committee (www.framinghamheartstudy.org/fhs-for-researchers). Data from the NIH All of Us study can be accessed through institutional agreements for researchers who meet eligibility requirements for confidential data access. Interested researchers must complete the registration process via the All of Us Researcher Workbench (www.researchallofus.org/register). For further inquiries, researchers can contact All of Us Researcher Workbench Support at support@researchallofus.org.

Scripts used to perform analyses described in the paper and code to construct multi-ancestry AD PRS are available in the GitHub repository (https://github.com/nkurniansyah/Alzheimer_PRS) and the Zenodo repository (will be provided upon paper acceptance).


Articles from medRxiv are provided here courtesy of Cold Spring Harbor Laboratory Preprints

RESOURCES