Skip to main content
PLOS One logoLink to PLOS One
. 2022 Dec 7;17(12):e0277378. doi: 10.1371/journal.pone.0277378

Genetic risk scores and dementia risk across different ethnic groups in UK Biobank

Naaheed Mukadam 1,*, Olga Giannakopoulou 1, Nick Bass 1, Karoline Kuchenbaecker 1, Andrew McQuillin 1
Editor: Jeong-Sun Seo2
PMCID: PMC9728885  PMID: 36477264

Abstract

Background

Genetic Risk Scores (GRS) for predicting dementia risk have mostly been used in people of European ancestry with limited testing in other ancestry groups.

Methods

We conducted a logistic regression with all-cause dementia as the outcome and z-standardised GRS as the exposure across diverse ethnic groups.

Findings

There was variation in frequency of APOE alleles across ethnic groups. Per standard deviation (SD) increase in z-GRS including APOE, the odds ratio (OR) for dementia was 1.73 (95%CI 1.69–1.77). Z-GRS excluding APOE also increased dementia risk (OR 1.21 per SD increase, 95% CI 1.18–1.24) and there was no evidence that ethnicity modified this association. Prediction of secondary outcomes was less robust in those not of European ancestry when APOE was excluded from the GRS.

Interpretation

z-GRS derived from studies in people of European ancestry can be used to quantify genetic risk in people from more diverse ancestry groups. Urgent work is needed to include people from diverse ancestries in future genetic risk studies to make this field more inclusive.

Background

Dementia has a significant impact on those living with it, their families and society in general. It currently costs the global economy $1 trillion annually and these costs are likely to increase as life expectancy increases [1]. There is therefore an urgent need to understand mechanisms of disease development and to develop preventative strategies. The most common genetic risk factor for dementia is Apolipoprotein E (APOE), with the ε2 allele being protective against Alzheimer’s dementia and the ε4 allele greatly increasing risk [2]. The APOE gene is also associated with the risk of vascular [3], Lewy Body [4] and other forms of dementia [5]. Genetic risk scores (GRS) are a way of summing the genetic risk that an individual carries for a particular disease. They are developed following Genome-Wide Association Studies (GWAS) that test the association between genetic variants–usually Single Nucleotide Polymorphisms (SNP) and phenotypes. They have become increasingly important in quantifying risk for dementia [6, 7] with good ability to predict both dementia and mild cognitive impairment [8] in people of European origin. In general, scores which use data from the whole genome are referred to as polygenic risk scores and those which use only SNP with genome wide significance are referred to as Genetic Risk Scores (GRS) [9]. Being able to quantify the genetic risk for a disease is important in being able to predict who is at greater risk and to potentially intervene earlier to prevent the disorder. However, around 80% of GWAS are restricted to people of European origin [10, 11] despite the fact that only 11.5% of the world population is of European origin [12]. Using GRS from GWAS conducted in European origin samples to predict disease status in diverse ethnic groups in general could be less accurate than predicting phenotypes in those of European origin [13] due to differences in allele frequencies and patterns of linkage disequilibrium, potentially leading to widening health inequalities. Although conducting GWAS in diverse groups could lead to identification of new disease variants and would be more equitable, this requires tens of thousands of cases and controls and is resource intensive [14].

Previous large GWAS in European origin participants have identified a number of SNPs associated with dementia, with the SNPs in turn being part of genes identified as important in brain amyloid clearance or the immune system [15]. GRS constructed using these risk alleles, including APOE have been used to predict dementia and are correlated with cognitive test scores, cortical thickness [16] and hippocampal [17, 18] and amygdala [17] volume as well as atrophy in basal forebrain and temporal atrophy [19]. So far, this quantification has been limited to people of European origin with some promise in validating these genetic risk scores in a relatively small sample of East Asians [20]. Another study found less robust prediction of dementia propensity scores in non-Hispanic Black participants in the USA compared with non-Hispanic White participants using a genetic risk score incorporating APOE and nine other SNPs [21]. Established genetic risk factors for dementia from GWAS in people of European ancestry were associated with learning and delayed recall in a sample of around 900 South Asians [22] but have not been tested against a clinical diagnosis of dementia. Assumptions that GRS developed in European origin participants will not be valid for predicting dementia in people from diverse ancestries has meant that people from diverse ancestries have been excluded from investigations of interactions between genetic risk and environmental risk factors [23] and cognitive trajectories in later life [24], all with the potential to improve dementia prevention and treatments.

In this study we aimed to establish how well GRS for dementia developed in European ancestry participants could predict dementia and other associated factors in the two main minority ethnic groups in the UK. We reasoned that if ethnicity did not modify the association between GRS and dementia risk then this could lead to greater inclusion of people from diverse ethnic groups in future epidemiological and interventional studies.

Our aims were to:

  1. Establish frequency of APOE variants across ethnic groups

  2. Assess the effect of APOE on risk of dementia and test for an interaction between ethnic group and APOE on dementia risk

  3. Assess how well Genetic Risk Score (GRS) based on data from people of European origin predicts dementia risk in a diverse group and test whether ethnic group modifies this association

  4. Assess association of GRS with other traits linked to dementia, including cognition

Methods

Sample

The UK Biobank is a population-based cohort of more than 500,000 UK-based participants, with differing socioeconomic and ethnic characteristics [25], all aged 40 and over at registration from 2006 to 2010. Participants completed demographic and health questionnaires, interviews and blood, urine and saliva tests. UK Biobank also links participants to their existing health records, such as those from general practice (GP), hospitals and those collected centrally (cancer and death statistics for instance). The UK Biobank team assert that all procedures contributing to this work comply with the ethical standards of the relevant national and institutional committees on human experimentation and with the Helsinki Declaration of 1975, as revised in 2008. The UK Biobank received approval from the National Information Governance Board for Health and Social Care and the National Health Service North West Multicentre Research Ethics Committee. All participants provided informed consent through electronic signature at baseline assessment. We obtained internal ethical approval from UK Biobank for our proposal (ref 40055).

Genotyping and quality control procedures

Genome-wide genotyping was performed on all UK Biobank participants using the UK BiLEVE or UK Biobank Axiom Array. Approximately 850,000 variants were directly measured, with > 90million variants imputed using the Haplotype Reference Consortium and UK10K + 1000 Genomes reference panels [26]. UK Biobank conducted initial quality control procedures and flagged samples that were outliers in terms of heterozygosity or missing rates. They also provided genetic principal components values generated from all valid samples and a list of samples with sex discrepancies. We conducted quality control procedures for European and non-European samples separately, including estimates of relatedness between individuals. We excluded those with sex discrepancies and people who had a kinship coefficient of 0.0625 or greater (equivalent to a third degree relative or closer) [27] with another participant. We excluded SNPs with a Minor Allele Frequency (MAF) of <1% and Info score of <0.9.

APOE

We used two SNPs rs429358 and rs7412 as markers for APOE [28]. These SNPs were directly genotyped in UK Biobank. We used combinations of the polymorphisms at these two SNPs to determine APOE alleles [29] as described in S1 Table in S1 File.

Genetic risk score

We extracted genotypes for SNPs that have previously shown an association with dementia risk in a large Genome Wide Association Study in European ancestry individuals [15]. We specifically only included SNPs that showed an association with genome wide significance (p<5x10-8) across both the training and validation datasets, in a previous study which did not include UK Biobank [15]. For each individual in the UK Biobank, we calculated a GRS by multiplying the genotype dosage of each risk allele for each variant (except APOE SNPs) by its respective weight (log odds ratio (OR) for dementia), and then summing across all variants and dividing by number of variants [30] (see supplementary text for formula) using the—score function in Plink 2.0 [31]. APOE genotype was calculated as detailed above and number of ε2/ε4 was multiplied by the log OR for dementia associated with ε2/ε4 respectively. This was then added to the GRS calculated for other SNPs. The weighting for each SNP including APOE was as reported by Lambert and colleagues [15]. Details of SNPs and log odds ratios used is in S2 Table in S1 File. The genetic risk scores were then z-standardized across the whole sample and this was used as our main exposure variable. We repeated all analyses using a z-standardised GRS calculated from all SNPs but excluding APOE.

Main outcome

We determined the diagnosis of dementia from self-report or linked electronic health data in the UK Biobank [32]. The positive predictive values (PPVs) for the diagnosis of all-cause dementia were 86.8% for primary care, 87.3% for hospital admissions and 80.0% for mortality data respectively and 82.5% across all datasets when compared against blinded clinician diagnosis from notes using standardised diagnostic criteria [33]. This indicates that all-cause dementia as identified within UK Biobank is a valid diagnosis. We chose all-cause rather than Alzheimer’s disease specifically because the number of dementia cases specified as Alzheimer’s was relatively low and most dementia types share common genetic risk, particularly APOE, as outlined above. ICD-10 codes used to identify dementia cases are as described previously [32] and included F00, F01, F02, F03 and G30 subcategories.

Secondary outcomes

Family history of dementia: Participants self-reported whether their father or mother had been diagnosed with dementia. We combined information from these questions to designate whether participants had a positive family history of dementia, scoring zero for no family history, one for one parent and two if both parents had dementia.

Reaction time: Participants were shown two cards at a time on the touchscreen and instructed to press the button on the button box as quickly as possible when the symbols on the cards match. The exercise involved 12 pairs of cards. Mean time taken to correctly identify matches was calculated by UK Biobank, using data from all rounds in which both cards matched.

Brain volume: Over 40,000 UK Biobank participants have had three structural brain MRI scans; T1, T2 fluid attenuation inversion recovery (FLAIR) and susceptibility-weighted MRI (swMRI). T1 scans allow precise volumetric measures of the whole brain, as well as specific cortical and subcortical regions. An automated processing pipeline for brain image analysis and quality control was established for UKB at the University of Oxford’s Wellcome Centre for Integrative Neuroimaging (WIN/FMRIB) [34]. This pipeline is primarily based around FSL (FMRIB’s Software Library), and other packages such as FreeSurfer [35]. We used hippocampal and amygdala volume in mm3 (combining volumes for left and right structures) as calculated from T1 images carried out at visit 2(2014), using Automatic Segmentation (ASEG) [36], as two of our secondary outcome measures, adjusting for total brain volume which was normalised for head size as well as age and sex.

Ethnicity and genetic ancestry

All participants self-identified their ethnicity using the UK Census categories. For this study we combined ethnicities into White (British and Irish); South Asian (Asian Bangladeshi, Indian, Pakistani); or Black or Black British (Black African, Black Caribbean, Black British, Black Other), based on research showing clustering of low frequency genetic variants in people originating in the same geographic location [37]. We chose self-defined ethnicity for our primary analysis as this is more easily available for most studies and is a practical choice for grouping. We focused on these three ethnic groups as they were the largest in the sample and in the UK overall. Only those with a valid ethnicity in their records were included in our analyses as this was our main exposure variable of interest. We repeated all analyses using genetically defined European, South Asian and African ancestry. We conducted principal component (PC) analysis in order to classify participants into European, African, South Asian and admixed-ancestry groups. First, we estimated PCs for a subset of non-European ancestry participants using PC-AiR implemented in the GENESIS package, that has been optimized for admixed-ancestry datasets [38]. We assigned the samples into ancestry groups, based on visual inspection of the PCs as described previously [39]. Further details of this PC analysis are also shown in the supplementary material.

Covariates

We considered the following variables based on data which participants gave at baseline, as the association between genotype and dementia might be confounded by these.

Age–We adjusted for baseline age in all models.

Sex–participants self-identified as male or female and we adjusted for sex in all models.

The first 10 genetic principal components, as calculated separately by UK Biobank for European samples and by our group, for non-European ancestry participants, was included in all models testing the association between genetic ancestry and dementia to adjust for genetic variability.

Statistical analysis

We used Stata 16.0 for all statistical analyses. First, we summarised age, sex, APOE variants, other genetic variants of interest, brain volume measures and mean GRS across ethnic groups.

To identify main effects of exposures, for self-defined ethnicity regression analyses, we first ran regression models with ε2 and ε4 alleles as exposures with age, sex and ethnicity as covariates. We then repeated the analyses, including an interaction term for ethnicity and ε2/ε4 alleles respectively. To assess how well GRS predicted dementia across self-defined ethnic groups, we conducted a logistic regression with all cause dementia as the outcome and z-standardised GRS as the exposure, including age, sex and ethnic group as covariates. We then tested how well z-GRS predicted family history of dementia using logistic regression, adjusting for age, sex and ethnic group. To test for the association between z-GRS and cognition we used linear regression with reaction time as the outcome variable and z-GRS as exposure, adjusting for age, sex and ethnicity. We also tested the association between z-GRS score and amygdala and hippocampal volumes, using linear regression, adjusting for age, sex, ethnicity and total brain volume. We repeated these analyses, including an interaction term between z-GRS and ethnicity. We then conducted all these analyses using z-GRS excluding APOE as the exposure, conducting analyses to identify main effects then repeating analyses and including interaction terms between z-GRS and ethnicity. All analyses were then repeated using genetically defined ancestry groups and including the first 10 principal components derived for each ancestry group separately as well as age, sex and genetic ancestry groups as covariates in all models. We tested interaction effects between self-defined ethnicity/genetically determined ancestry and APOE allele status and z-GRS in turn.

Results

There were 502,516 participants in the whole sample, of whom 487,395 had valid genotyping data. After excluding outliers for heterozygosity (n = 115) and individuals who were related to others in the sample (81,889 White, 699 South Asian and 318 Black participants), and 25,903 people of other ethnicities, we included 364,879 self-defined White British, 6665 South Asian and 6927 Black participants. Seventeen of the twenty-one SNPs of interest were directly genotyped in UK Biobank and the remaining were imputed. All SNPs had an Info score of over 0.9 and a Minor Allele Frequency (MAF) of >1%, with similar values for each in those of European and non-European ancestry (S3 Table in S1 File). Overall, 4,729 (1.3%) White British, 73 (1.1%) South Asian and 87 (1.3%) of Black participants were diagnosed with dementia.

Baseline characteristics are shown in Table 1. Black and South Asian participants were on average younger than the White participants and the proportion of women was highest in the Black group compared to the White and South Asian groups. Self-reported family history of dementia was much lower in the South Asian and Black participants compared to the White participants.

Table 1. Characteristics of included participants by self-defined ethnicity.

Characteristic White British (N = 364,879) South Asian (N = 6,665) Black (N = 6,927)
Age (mean, SD) 56.8 (8.0) 53.4 (8.5) 51.9 (8.1)
N(%) female 196,247 (53.8) 3,054 (45.8) 3,904 (56.4)
Family history of dementia N(%) None 318,191 (87.2) 6,357 (95.4) 6,519 (94.1)
One parent 44,789 (12.3) 292 (4.4) 394 (5.7)
Both parents 1,899 (0.5) 16 (0.2) 14 (0.2)
ε2 allele N(%) 0 317,779 (87.1) 6,154 (92.3) 5,852 (84.5)
1 44,802 (12.3) 501 (7.5) 992 (14.3)
2 2,298 (0.6) 10 (0.2) 83 (1.2)
ε4 allele N(%) 0 260,152 (71.3) 5,440 (81.6) 4,101 (59.2)
1 96,006 (26.3) 1,160 (17.4) 2,457 (35.5)
2 8,721 (2.4) 65 (1.0) 369 (5.3)
Reaction time in seconds—mean(SD) 556 (114) 614 (157) 632 (177)
Dementia cases N(%) 4,729 (1.3) 73 (1.1) 87 (1.3)
Mean z-standardised GRS score including APOE (SD) 0.00(1.00 -0.15 (0.79) 0.23 (1.19)
Mean z-standardised GRS score excluding APOE (SD) 0.01 (1.00) 0.11 (0.97) -0.50 (0.83)
Imaging sub-sample N = 32,626 N = 328 N = 243
Hippocampal volume—left plus right(Mean(SD) in mm3) 8066.6 (805.0) 7953.4 (849.2) 7956.1 (746.1)
Amygdala volume—left plus right(Mean(SD) in mm3) 3289.7 (426.0) 3292.0 (468.2) 3252.9 (398.1)
Brain volume, normalised for head size (Mean(SD) in mm3*106) 1.5(0.1) 1.5(0.1) 1.5(0.1)

APOE frequencies

South Asian participants had the lowest frequency of ε2 alleles (7.5% heterozygous and 0.2% homozygous carriers) and Black participants had the highest frequency (14.3% heterozygous and 1.2% homozygous carriers) compared to the White participants (12.3% heterozygous and 0.6% homozygous carriers). The ε4 allele was also most common in Black participants (35.5% heterozygous; 5.3% homozygous carriers vs 17.4% and 1.0% in South Asians and 26.3% and 2.4% in White participants). Mean z-standardised GRS (z-GRS) score including APOE was lower in South Asians and higher in Black participants and the score excluding APOE was lower in South Asian and Black participants compared to White participants.

Regression analyses

Results from the regression analyses for genetic risk factors are shown in Table 2. Logistic regression showed that the ε2 allele roughly halved dementia risk but not in a dose dependent way (OR for one ε2 allele 0.54 and for two 0.56). There was no evidence to support an interaction effect between ethnicity and ε2 allele in terms of dementia risk (p-values for interaction >0.05 for both South Asian and Black participants). One ε4 allele more than doubled dementia risk (OR 2.59, 95% CI 2.44–2.75) and two ε4 alleles increased risk by more than eight-fold (OR 8.47, 95% CI 7.68–9.34). There was no evidence to support an interaction between ethnicity and ε4 (p value for interaction >0.05 in both minority ethnic groups).

Table 2. Dementia risk associated with genotype by self-defined ethnicity.

    Main effect  
Exposure   OR* LCI UCI p value Interaction effect, South Asian participants, OR (95%CI, p-value) Interaction effect, Black participants, OR (95%CI, p-value)
ε2 alleles 0 Reference OR 1.46 (0.58–3.66, 0.336) OR 0.95 (0.47–1.95, 0.896)
  1 0.54 0.48 0.60 <0.0001
  2 0.56 0.35 0.89 0.014
ε4 alleles 0 Reference OR 0.92 (0.60–1.39, 0.676) OR 0.99 (0.72–1.37, 0.974)
  1 2.59 2.44 2.75 <0.0001
  2 8.47 7.68 9.34 <0.0001
z-GRS with APOE—effect per SD increase   1.73 1.69 1.77 <0.0001 OR 0.97 (0.76–1.20, 0.687) OR 0.99 (0.83–1.17, 0.864)
z-GRS without APOE–effect per SD increase   1.21 1.18 1.24 <0.0001 OR 0.82 (0.65–1.05, 0.113) OR 0.94 (0.72–1.23, 0.620)

*Odds ratio of dementia, adjusted for age, sex and ethnic group. SD = Standard deviation LCI = Lower confidence interval UCI = Upper Confidence interval. Interaction tested between ethnicity and APOE allele or z-GRS association with dementia.

Per standard deviation (SD) increase in z-GRS including APOE, the odds ratio for dementia was 1.73 (95%CI 1.69–1.77) with no evidence for an interaction effect with ethnic group. Z-GRS which excluded APOE also increased risk of dementia (OR per SD increase 1.21, 95%CI 1.18–1.24) with no evidence for an interaction effect.

Table 3 shows results of secondary outcome regression analyses. Z-GRS including APOE was associated with family history of dementia (OR 1.32 per SD increase, 95% CI 1.31–1.33) with no evidence for interaction with South Asian ethnicity (p = 0.086) and a slightly smaller effect in people who self-defined as Black (p = 0.013). Each increase of one standard deviation of z-GRS increased reaction time by 0.14 seconds but this was not statistically significant (p = 0.44). There was a trend for each standard deviation increase of z-GRS to decrease hippocampal and amygdala volume, but confidence intervals were wide and there was no evidence of an interaction with ethnic group for any of the analyses. Associations with secondary outcomes were much smaller using the z-GRS which excluded APOE, but with no evidence of an interaction effect in minority ethnic groups (S4 Table in S1 File).

Table 3. Regression analyses for secondary outcomes with z-GRS including APOE as exposure.

Dependent variable Main effect Interaction effect, South Asian participants, Coeff/OR (95%CI, p-value) Interaction effect, Black participants, Coeff/OR (95%CI, p-value)
Coeff/OR* LCI UCI p-value
Family history of dementia 1.32 1.31 1.33 <0.0001 0.89 (0.78–1.02, 0.086) 0.90 (0.83–0.98, 0.013)
Reaction time 0.14 -0.21 0.49 0.44 1.58 (-1.92 to 5.08, 0.377) 1.24 (-1.06 to 3.52, 0.291)
Hippocampal volume -7.20 -14.89 0.48 0.07 -8.69 (-101.54 to 84.15, 0.854) 10.92 (-64.04 to 85.85, 0.775)
Amygdala volume -2.39 -6.33 1.54 0.23 -28.56 (-76.14 to19.03, 0.240) -1.58 (-39.99 to 36.83, 0.936)

*Coefficient for linear regression and Odds ratio for logistic regression after age, sex and ethnic group adjustment (and brain volume for hippocampal and amygdala volume regressions), LCI = Lower confidence interval, UCI = Upper confidence interval. Interaction tested between self-defined ethnic group and exposure and dementia risk

When participants were grouped by genetic ancestry, we included 368,277 White, 7869 South Asian and 8413 Black participants. Baseline characteristics and results of regression analyses when participants were grouped by genetic rather than self-defined ethnicity, were similar to main analyses, as shown in the supplementary material (S5-S7 Tables in S1 File).

Discussion

To our knowledge this is the first investigation of the association between genetic risk score developed in people of European ancestry and dementia diagnosis across different ethnic groups in the UK, including minority ethnic groups. We find that APOE alleles distribution varies by ethnic groups, but the alleles have a similar association with dementia across all ethnic groups. This contrasts with previous findings that the APOE ε4 allele is not associated with dementia in African Americans [40] and older Nigerians [41, 42] although others have found a significant association in African Americans [43]. We replicate findings that APOE impact on dementia risk is not affected by global ancestry [44]. Our estimates of odds ratios associated with the APOE ε4 alleles is very similar to previous findings, indicating validity [15]. Distribution of ε2 and ε4 alleles is similar to other surveys, although there is great variability within ethnic groups [45]. However, our findings, particularly of higher ε4 frequency in those of Black ethnicity suggests this genetic risk should be more widely considered in these populations as this could have public health implications. GRS including APOE predicted dementia in all ethnic groups and was also associated with family history of dementia. There was also some indication of association with secondary outcomes, but these analyses had less power. GRS excluding APOE was also associated with dementia but with smaller coefficients and no evidence for modification of effect by ethnic group. Accuracy of genetic risk and its association with dementia is improved in all groups by including APOE.

Those from minority ethnic groups were less likely to report family history of dementia. This may be due to under-diagnosis of dementia in their parents, due to bias within the healthcare system or reluctance of people from minority ethnic people to seek a dementia diagnosis or to report it if given a diagnosis [46]. A greater proportion of Black participants had the ε4 allele compared to the White and South Asian participants, so even though the association of this allele with dementia risk was similar to the White population, Black people had a higher genetic burden, which could explain previous findings of an increased risk of dementia in Black participants in the UK Biobank [47].

Mean z-GRS varied by ethnic group. However, differences in average GRS caused by population differences in allele frequencies are expected [48] and do not necessarily mean there are corresponding group differences in risk. Our results align with a previous study showing that allele frequencies of key SNPs associated with dementia are different in South Asians compared to the European origin populations from which GRS are derived [22]. The same study also found that a relatively small percentage of the variance of cognitive scores in South Asian participants was explained when using GRS that did not include APOE, which also supports our findings for secondary outcome measures.

The strengths of this study are the large sample, including a relatively large number of people of self-defined South Asian and Black ethnicity. However, we acknowledge that the ratio of White to minority ethnic participants was high. Follow up time was almost 15 years and the total number of dementia cases was high, allowing for robust estimates. However, there was a relatively small number of dementia cases in minority ethnic groups and it is possible interaction analyses were therefore underpowered. In addition, not all participants in the sample had linked hospital and primary care records so some diagnoses of dementia may have been missed. The number of people with brain scans was relatively low which means we had reduced power to detect an association between z-GRS and hippocampal and amygdala volumes. We also acknowledge that self-defined ethnicity may be associated with other societal factors that could influence associations with dementia.

Historically, people of non-European ancestry have been excluded from genetic studies as including diverse groups is more methodologically and computationally demanding and it was thought that the difference in patterns of linkage disequilibrium and allele frequencies may lead to population stratification and bias [49]. This has meant that most advances in understanding the genetics of dementia has come from those of European origin and people from other ethnic groups have been excluded. A lack of disease modifying treatments and growing evidence for dementia prevention [1] have meant there is increasing interest in recruiting those with higher genetic risk of dementia for investigation of disease mechanisms and targets for prevention and treatment. People from minority ethnic groups tend to be under-represented in dementia trials [50]. Not only does this mean we have limited understanding of whether treatments are effective in everyone, but it also misses the opportunity to study diverse mechanisms of disease. To advance our understanding of this complex disorder, it is imperative that we quantify genetic risk and link this with phenotypes in a diverse range of people.

Overall, our study shows that GRS for dementia generated from GWAS in European origin samples is associated with dementia diagnosis in those from diverse ancestries even when the GRS did not include APOE. This indicates that all variants with strong evidence of association with dementia discovered in those of European ancestry can be used to quantify genetic risk in those from more diverse ancestries and exclusion of people from diverse ancestries from future dementia risk studies is not justifiable. Previous work has shown that disease prediction is improved when multi-ethnic samples are used to develop GRS [51]. Studying genetic risk of diseases in diverse-ancestry groups also has the potential to identify new genetic risk loci and further our understanding of disease [52]. It is therefore imperative that future GWAS include people from diverse backgrounds in order to make this field more inclusive.

Supporting information

S1 Fig

(TIF)

S2 Fig

(TIF)

S3 Fig

(TIF)

S1 File

(DOCX)

S2 File

(PDF)

Data Availability

The data that support the findings of this study (UKB project ref 40055) contain sensitive and potentially identifying information and therefore not suitable for sharing. This is due to the terms of the UK Biobank data use. Further details on the study are available from the corresponding author, (NM) upon reasonable request. For access to the data please contact access@ukbiobank.ac.uk.

Funding Statement

NM is funded by an Alzheimer’s Society fellowship (AS-SF-18b-001). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Livingston G. et al. , "Dementia prevention, intervention, and care: 2020 report of the Lancet Commission," The Lancet, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Farrer L. A. et al. , "Effects of age, sex, and ethnicity on the association between apolipoprotein E genotype and Alzheimer disease: a meta-analysis," Jama, vol. 278, no. 16, pp. 1349–1356, 1997. [PubMed] [Google Scholar]
  • 3.Chuang Y.-F. et al. , "Association between APOE ε4 allele and vascular dementia: the Cache County study," Dementia and geriatric cognitive disorders, vol. 29, no. 3, pp. 248–253, 2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Guerreiro R. et al. , "Investigating the genetic architecture of dementia with Lewy bodies: a two-stage genome-wide association study," The Lancet Neurology, vol. 17, no. 1, pp. 64–74, 2018. doi: 10.1016/S1474-4422(17)30400-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Huang X., Chen P., Kaufer D. I., Tröster A. I., and Poole C., "Apolipoprotein E and dementia in Parkinson disease: a meta-analysis," Archives of neurology, vol. 63, no. 2, pp. 189–193, 2006. doi: 10.1001/archneur.63.2.189 [DOI] [PubMed] [Google Scholar]
  • 6.Desikan R. S. et al. , "Genetic assessment of age-associated Alzheimer disease risk: Development and validation of a polygenic hazard score," PLoS Medicine, vol. 14, no. 3, p. e1002258, 2017. doi: 10.1371/journal.pmed.1002258 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Escott-Price V., Shoai M., Pither R., Williams J., and Hardy J., "Polygenic score prediction captures nearly all common genetic risk for Alzheimer’s disease," Neurobiology of aging, vol. 49, pp. 214–2e7, 2017. doi: 10.1016/j.neurobiolaging.2016.07.018 [DOI] [PubMed] [Google Scholar]
  • 8.Logue M. W. et al. , "Use of an Alzheimer’s disease polygenic risk score to identify mild cognitive impairment in adults in their 50s," Molecular psychiatry, vol. 24, no. 3, pp. 421–430, 2019. doi: 10.1038/s41380-018-0030-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Khera A. V. et al. , "Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations," Nature genetics, vol. 50, no. 9, pp. 1219–1224, 2018. doi: 10.1038/s41588-018-0183-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Popejoy A. B. and Fullerton S. M., "Genomics is failing on diversity," Nature News, vol. 538, no. 7624, p. 161, 2016. doi: 10.1038/538161a [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Sirugo G., Williams S. M., and Tishkoff S. A., "The missing diversity in human genetic studies," Cell, vol. 177, no. 1, pp. 26–31, 2019. doi: 10.1016/j.cell.2019.02.048 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.U. N., "World Population Prospects: the 2017 revision," United Nations, Department of Economic and Social Affairs, Population Division, New York, 2017. [Online]. Available: https://esa.un.org/unpd/wpp/Publications/ [Google Scholar]
  • 13.Martin A. R., Kanai M., Kamatani Y., Okada Y., Neale B. M., and Daly M. J., "Current clinical use of polygenic scores will risk exacerbating health disparities," Nature genetics, vol. 51, no. 4, p. 584, 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Bien S. A. et al. , "The future of genomic studies must be globally representative: perspectives from PAGE," Annual review of genomics and human genetics, vol. 20, pp. 181–200, 2019. doi: 10.1146/annurev-genom-091416-035517 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Lambert J. C. et al. , "Meta-analysis of 74,046 individuals identifies 11 new susceptibility loci for Alzheimer’s disease," Nature genetics, vol. 45, no. 12, pp. 1452–1458, 2013. doi: 10.1038/ng.2802 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Sabuncu M. R. et al. , "The association between a polygenic Alzheimer score and cortical thickness in clinically normal subjects," Cerebral cortex, vol. 22, no. 11, pp. 2653–2661, 2012. doi: 10.1093/cercor/bhr348 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Lupton M. K. et al. , "The effect of increased genetic risk for Alzheimer’s disease on hippocampal and amygdala volume," Neurobiology of aging, vol. 40, pp. 68–77, 2016. doi: 10.1016/j.neurobiolaging.2015.12.023 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Walhovd K. B. et al. , "Genetic risk for Alzheimer disease predicts hippocampal volume through the human lifespan," Neurology Genetics, vol. 6, no. 5, 2020. doi: 10.1212/NXG.0000000000000506 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Weiner M. W. et al. , "Recent publications from the Alzheimer’s Disease Neuroimaging Initiative: Reviewing progress toward improved AD clinical trials," Alzheimer’s & Dementia, vol. 13, no. 4, pp. e1–e85, 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Zhou X. et al. , "Genetic and polygenic risk score analysis for Alzheimer’s disease in the Chinese population," Alzheimer’s & Dementia: Diagnosis, Assessment & Disease Monitoring, vol. 12, no. 1, p. e12074, 2020. doi: 10.1002/dad2.12074 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Marden J. R., Walter S., Tchetgen Tchetgen E. J., Kawachi I., and Glymour M. M., "Validation of a polygenic risk score for dementia in black and white individuals," Brain and behavior, vol. 4, no. 5, pp. 687–697, 2014. doi: 10.1002/brb3.248 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Smith J. A. et al. , "Association Between Episodic Memory and Genetic Risk Factors for Alzheimer’s Disease in South Asians from the Longitudinal Aging Study in India–Diagnostic Assessment of Dementia (LASI‐DAD)," Journal of the American Geriatrics Society, vol. 68, pp. S45–S53, 2020. doi: 10.1111/jgs.16735 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Lourida I. et al. , "Association of lifestyle and genetic risk with incidence of dementia," Jama, vol. 322, no. 5, pp. 430–437, 2019. doi: 10.1001/jama.2019.9879 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Riaz M. et al. , "Effect of APOE and a polygenic risk score on incident dementia and cognitive decline in a healthy older population," Aging cell, vol. 20, no. 6, p. e13384, 2021. doi: 10.1111/acel.13384 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Sudlow C. et al. , "UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age," Plos med, vol. 12, no. 3, p. e1001779, 2015. doi: 10.1371/journal.pmed.1001779 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Bycroft C. et al. , "The UK Biobank resource with deep phenotyping and genomic data," Nature, vol. 562, no. 7726, pp. 203–209, 2018. doi: 10.1038/s41586-018-0579-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Thompson E., "The estimation of pairwise relationships," Annals of human genetics, vol. 39, no. 2, pp. 173–188, 1975. doi: 10.1111/j.1469-1809.1975.tb00120.x [DOI] [PubMed] [Google Scholar]
  • 28.Kulminski A. M. et al. , "Genetic and regulatory architecture of Alzheimer’s disease in the APOE region," Alzheimer’s & Dementia: Diagnosis, Assessment & Disease Monitoring, vol. 12, no. 1, p. e12008, 2020. doi: 10.1002/dad2.12008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.SNPedia. APOE [Online] Available: https://www.snpedia.com/index.php/APOE
  • 30.Dudbridge F., "Power and predictive accuracy of polygenic risk scores," PLoS genetics, vol. 9, no. 3, p. e1003348, 2013. doi: 10.1371/journal.pgen.1003348 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Purcell S. et al. , "PLINK: a tool set for whole-genome association and population-based linkage analyses," The American journal of human genetics, vol. 81, no. 3, pp. 559–575, 2007. doi: 10.1086/519795 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Bush K., Wilkinson T., Schnier C., Nolan J., and Sudlow C., "Definitions of Dementia and the Major Diagnostic Pathologies, UK Biobank Phase 1 Outcomes Adjudication," UK Biobank, 2018. [Google Scholar]
  • 33.Wilkinson T. et al. , "Identifying dementia outcomes in UK Biobank: a validation study of primary care, hospital admissions and mortality data," European journal of epidemiology, vol. 34, no. 6, pp. 557–565, 2019. doi: 10.1007/s10654-019-00499-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Littlejohns T. J. et al. , "The UK Biobank imaging enhancement of 100,000 participants: rationale, data collection, management and future directions," Nature Communications, vol. 11, no. 1, pp. 1–12, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Alfaro-Almagro F. et al. , "Image processing and Quality Control for the first 10,000 brain imaging datasets from UK Biobank," Neuroimage, vol. 166, pp. 400–424, 2018. doi: 10.1016/j.neuroimage.2017.10.034 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Fischl B. et al. , "Whole brain segmentation: automated labeling of neuroanatomical structures in the human brain," Neuron, vol. 33, no. 3, pp. 341–355, 2002. doi: 10.1016/s0896-6273(02)00569-x [DOI] [PubMed] [Google Scholar]
  • 37.G. P. Consortium, "A global reference for human genetic variation," Nature, vol. 526, no. 7571, pp. 68–74, 2015. doi: 10.1038/nature15393 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Conomos M. P., Miller M. B., and Thornton T. A., "Robust inference of population structure for ancestry prediction and correction of stratification in the presence of relatedness," Genetic epidemiology, vol. 39, no. 4, pp. 276–293, 2015. doi: 10.1002/gepi.21896 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Farmaki A.-E. et al. , "Type 2 diabetes risks and determinants in 2nd generation migrants and mixed ethnicity people of South Asian and African Caribbean descent in the UK," medRxiv, p. 2019.12. 13.19014704, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Tang M. X. et al. , "The APOE-î4 allele and the risk of Alzheimer disease among African Americans, whites, and Hispanics," Jama, vol. 279, no. 10, pp. 751–755, 1998. [DOI] [PubMed] [Google Scholar]
  • 41.Gureje O. et al. , "APOE ε4 is not associated with Alzheimer’s disease in elderly Nigerians," Annals of neurology, vol. 59, no. 1, pp. 182–185, 2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Osuntokun B. O. et al. , "Lack of an association between apolipoprotein E e4 and Alzheimer’s disease in elderly Nigerians," Annals of neurology, vol. 38, no. 3, pp. 463–465, 1995. [DOI] [PubMed] [Google Scholar]
  • 43.Murrell J. R. et al. , "Association of apolipoprotein E genotype and Alzheimer disease in African Americans," Archives of neurology, vol. 63, no. 3, pp. 431–434, 2006. doi: 10.1001/archneur.63.3.431 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Rajabli F. et al. , "Ancestral origin of ApoE ε4 Alzheimer disease risk in Puerto Rican and African American populations," PLoS genetics, vol. 14, no. 12, p. e1007791, 2018. doi: 10.1371/journal.pgen.1007791 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Singh P. P., Singh M., and Mastana S. S., "APOE distribution in world populations with new data from India and the UK," Annals of human biology, vol. 33, no. 3, pp. 279–308, 2006. doi: 10.1080/03014460600594513 [DOI] [PubMed] [Google Scholar]
  • 46.Mukadam N. L., G.; Mueller C.; Werbeloff N.; Stewart R.; Livingston G., "Ethnic differences in cognition and age in people diagnosed with dementia: a study of electronic health records in two large mental healthcare providers," International Journal of Geriatric Psychiatry, vol. 34, no. 3, pp. 504–510, 2019. doi: 10.1002/gps.5046 [DOI] [PubMed] [Google Scholar]
  • 47.Mukadam N. M., L.; Lewis G.; Livingston G., "Risk factors, ethnicity and dementia: a UK Biobank prospective cohort study of White, South Asian and Black participants " 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Martin A. R. et al. , "Human demographic history impacts genetic risk prediction across diverse populations," The American Journal of Human Genetics, vol. 100, no. 4, pp. 635–649, 2017. doi: 10.1016/j.ajhg.2017.03.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Lonjou C. et al. , "Linkage disequilibrium in human populations," Proceedings of the National Academy of Sciences, vol. 100, no. 10, pp. 6069–6074, 2003. doi: 10.1073/pnas.1031521100 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Olson N. L. and Albensi B. C., "Race-and Sex-Based Disparities in Alzheimer’s Disease Clinical Trial Enrollment in the United States and Canada: An Indigenous Perspective," Journal of Alzheimer’s Disease Reports, no. Preprint, pp. 1–20, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Márquez‐Luna C., Loh P. R., S. A. T. D. Consortium, S. T. D. Consortium, and Price A. L., "Multiethnic polygenic risk scores improve risk prediction in diverse populations," Genetic epidemiology, vol. 41, no. 8, pp. 811–823, 2017. doi: 10.1002/gepi.22083 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Zhao W. et al. , "Identification of new susceptibility loci for type 2 diabetes and shared etiological pathways with coronary heart disease," Nature genetics, vol. 49, no. 10, p. 1450, 2017. doi: 10.1038/ng.3943 [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

Jeong-Sun Seo

Transfer Alert

This paper was transferred from another journal. As a result, its full editorial history (including decision letters, peer reviews and author responses) may not be present.

23 Aug 2022

PONE-D-22-19235Genetic risk scores and dementia risk across different ethnic groups in UK BiobankPLOS ONE Dear Dr. Naaheed Mukadam,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

==============================

ACADEMIC EDITOR:Please address the concerns raised by Reviewer #1, especially by providing the AUROC results either in main text or in supplementary material.

Please submit your revised manuscript by Oct 7, 2022. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Jeong-Sun Seo

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at 

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf.

2. Thank you for stating the following in the Acknowledgments Section of your manuscript: 

"NM is funded by an Alzheimer’s Society grant (AS-SF-18b-001). NM, NB, KK and AM are supported by the NIHR BRC".

We note that you have provided funding information that is not currently declared in your Funding Statement. However, funding information should not appear in the Acknowledgments section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form. 

Please remove any funding-related text from the manuscript and let us know how you would like to update your Funding Statement. Currently, your Funding Statement reads as follows: 

 "NM is funded by an Alzheimer’s Society fellowship (AS-SF-18b-001). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript".

Please include your amended statements within your cover letter; we will change the online submission form on your behalf.

3. We note that you have indicated that data from this study are available upon request. PLOS only allows data to be available upon request if there are legal or ethical restrictions on sharing data publicly. For information on unacceptable data access restrictions, please see http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions. 

In your revised cover letter, please address the following prompts:

a) If there are ethical or legal restrictions on sharing a de-identified data set, please explain them in detail (e.g., data contain potentially identifying or sensitive patient information) and who has imposed them (e.g., an ethics committee). Please also provide contact information for a data access committee, ethics committee, or other institutional body to which data requests may be sent.

b) If there are no restrictions, please upload the minimal anonymized data set necessary to replicate your study findings as either Supporting Information files or to a stable, public repository and provide us with the relevant URLs, DOIs, or accession numbers. Please see http://www.bmj.com/content/340/bmj.c181.long for guidelines on how to de-identify and prepare clinical data for publication. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories.

We will update your Data Availability statement on your behalf to reflect the information you provide.

4. Please include captions for your Supporting Information files at the end of your manuscript, and update any in-text citations to match accordingly. Please see our Supporting Information guidelines for more information: http://journals.plos.org/plosone/s/supporting-information. 

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: No

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: No

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: In the present manuscript, the authors used microarray data comprising 850,000 variants and more than 90 million variants were used for downstream analysis after imputation. After QC, they extracted previously identified dementia risk associated SNPs from previous GWAS (Lambert et al., 2013, Nature Genetics) which were not included in the UK Biobank set, and calculated GRS using those SNPs and weight in addition to APOE alleles. Since OR of APOE for dementia is extremely high, they used logOR for APOE alleles. Then they prepared 1) GRS with APOE alleles followed by z-standardization across whole sample (z-GRS with APOE), and 2) GRS without APOE alleles followed by z-standardization (z-GRS without APOE). They performed age, sex and ethnic group adjusted logistic regression analysis to investigate the effect of GRS and APOE alleles for dementia risk. Authors showed that significant increase dementia risk for APOE ε4 alleles, and the effects per SD increase of z-GRS with APOE and without APOE were significantly associated with increasing dementia risk, without significant ethnic group interaction effect. Authors also demonstrated association between SD increase of z-GRS and other features of dementia including family history, reaction time, and hippocampal and amygdala volumes, and showed significant association between z-GRS and family history. Also in this case, they have shown the effects of z-GRS for secondary outcomes had little or no significant interactions with other ethnic groups.

Overall, the manuscript falls below the level of publication in PLOS one, and authors are required to improve this study by referring the designs of many other research studies with strong understanding of genetics.

Major points

1)Main purpose of this study was to assess relevance of PRS model constructed from GWAS using Europeans. The way of showing applicability of European GWAS derived GRS in other ethnic groups is difficult to be agreed. Authors have measured interaction effect of ethnicity in logistic regression using z-GRS for dementia risk. This analysis presents whether ethnic differences interacted with z-GRS for dementia risk, not telling transferability of European GRS to South Asian and African groups. So this cannot support authors’ research questions. Instead, applying GRS model to South Asian and African separately, and then measure proportion of dementia cases or odds ratio by quantile groups, or area under receiver operating characteristics (AUROC) which represent classification power of the model.

2)Also authors conducted z-standardization to have z-score of GRS across whole sample in the UK Biobank set. This approach may diminish ethnic differences due to skewed proportion of White British (approximately 96% of total). It seems not appropriate to use z-standardization across whole sample for assessing transferability of European GRS to other ethnic groups.

3)To precisely assess applicability of European PRS models, various PRS calculation methods can be used. Authors can clump using linkage disequilibrium then adjusted p-value thresholds of European GWAS to construct various models (Pruning and thresholding) or leveraging genome-wide SNPs and adjust and shrink effect sizes using Bayesian models (PRScs or EB-PRS).

4)Authors are strongly required to graphically illustrate their result regarding their results.

Minor points

1)Authors need to change the terms for APOE alleles; 0, 1 and 2 into non-carrier, heterozygous ε2 allele carrier, and homozygous ε2 allele carrier, respectively.

2)In the Table 2, APOE allele non-carriers (0 in this paper) were labelled as “Reference” but this is not necessarily presented in the table, and exact p-values are missing for interaction effects.

3)Authors have used threshold for Info score of 0.9, and this might be too high. Authors may present references for justifying this threshold.

4)Authors are required to proofread the manuscript to avoid awkward descriptions.

5)Result section should be divided into subtitles for better readability.

Leonenko et al., 2021, Nature communications

https://www.nature.com/articles/s41467-021-24082-z

Najar et al., 2021, Alzheimers Dment (Amst)

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7821873/

Baker and Escott-Price, 2020, Front. Digit. Health

https://www.frontiersin.org/articles/10.3389/fdgth.2020.00014/full

Fritsche et al., 2021, PLOS Genetics

https://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1009670

Ge et al., 2022, Genome Medicine

https://genomemedicine.biomedcentral.com/articles/10.1186/s13073-022-01074-2#Sec14

Reviewer #2: Through Genome-Wide Association Studies (GWAS), researchers have developed Genetic Risk Scores (GRS) for numerous diseases. However, these studies are mostly biased towards European white races, so their effectiveness in other populations is not certain. Recognizing this problem, the authors verified the GRS developed for dementia in other races other than European white.

Such verification is expected to improve reliability in the use of the developed GRS as well as to enable the classification of GRS that cannot be used for various races. Therefore, I agree to accept this research for publication on Plos One.

I found a few minor things that needed to be corrected as described below.

1. The tables in the paper are analyzed for self-defined ethnicity, and the tables in the supplement material are analyzed for genetically-defined ethnicity. Although the results are similar, is there a reason to set the analysis result for self-defined ethnicity as the main table?

2. East Asian (n=2464) was also defined in the supplement material, but why was it excluded from the analysis?

3. A total of 21 SNPs including APOE were selected, and the GRS excluding APOE also had an odds ratio >1. It seems that the GRS for other SNPs is also meaningful. Are there any related analysis results?

4. Typo(5-6 page) - The numbers in the quotation marks in the manuscript are different from Table 1- South Asian participants had the lowest frequency of ε2 alleles (“7.7%” with one and 0.2% with two alleles) and Black participants had the highest frequency (“14.8%” with one and 1.2% with two alleles) compared to the White participants (12.3% one and 0.6% two alleles). The ε4 allele was also most common in Black participants (“35.3%” one allele; 5.3% two alleles vs “17.3%” and 1.0% in South Asians and 26.3% and 2.4% in White participants).

5. “%” is not attached to (1.3) in the White British column of the Dementia cases N(%) row of Table

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2022 Dec 7;17(12):e0277378. doi: 10.1371/journal.pone.0277378.r002

Author response to Decision Letter 0


27 Sep 2022

Reviewer #1: In the present manuscript, the authors used microarray data comprising 850,000 variants and more than 90 million variants were used for downstream analysis after imputation. After QC, they extracted previously identified dementia risk associated SNPs from previous GWAS (Lambert et al., 2013, Nature Genetics) which were not included in the UK Biobank set, and calculated GRS using those SNPs and weight in addition to APOE alleles. Since OR of APOE for dementia is extremely high, they used logOR for APOE alleles. Then they prepared 1) GRS with APOE alleles followed by z-standardization across whole sample (z-GRS with APOE), and 2) GRS without APOE alleles followed by z-standardization (z-GRS without APOE). They performed age, sex and ethnic group adjusted logistic regression analysis to investigate the effect of GRS and APOE alleles for dementia risk. Authors showed that significant increase dementia risk for APOE ε4 alleles, and the effects per SD increase of z-GRS with APOE and without APOE were significantly associated with increasing dementia risk, without significant ethnic group interaction effect. Authors also demonstrated association between SD increase of z-GRS and other features of dementia including family history, reaction time, and hippocampal and amygdala volumes, and showed significant association between z-GRS and family history. Also in this case, they have shown the effects of z-GRS for secondary outcomes had little or no significant interactions with other ethnic groups.

Overall, the manuscript falls below the level of publication in PLOS one, and authors are required to improve this study by referring the designs of many other research studies with strong understanding of genetics.

- Thank you for reviewing our manuscript. While some of the summary of our manuscript is correct, it is not correct that we used SNPs for the GRS that were not available within the UKBiobank samples. We used logOR as a weight for all SNPs and not just APOE in line with the accepted formula.

Major points

1)Main purpose of this study was to assess relevance of PRS model constructed from GWAS using Europeans. The way of showing applicability of European GWAS derived GRS in other ethnic groups is difficult to be agreed. Authors have measured interaction effect of ethnicity in logistic regression using z-GRS for dementia risk. This analysis presents whether ethnic differences interacted with z-GRS for dementia risk, not telling transferability of European GRS to South Asian and African groups. So this cannot support authors’ research questions. Instead, applying GRS model to South Asian and African separately, and then measure proportion of dementia cases or odds ratio by quantile groups, or area under receiver operating characteristics (AUROC) which represent classification power of the model.

- Testing interaction effects is a common and statistically appropriate way of measuring whether the impact of a given risk factor (in this case GRS) has a different impact on outcome (dementia) based on the value of a third variable (ethnicity in this case). This has been used often in interrogating whether the impact of risk factors varies by ethnic group (see for example, Eastwood SV, Tillin T, Chaturvedi N, Hughes AD. Ethnic Differences in Associations Between Blood Pressure and Stroke in South Asian and European Men : Novelty and Significance. Hypertension 2015; 66(3): 481-8). We have also now, as requested, calculated and plotted the AUROC curves for each ethnic group and provided these in the supplementary materials. The AUROC for the groups was 0.8236 for White, 0.8268 for South Asian and 0.8766 for Black participants. This shows the GRS using these SNPs performed very well in predicting dementia and was equally good in all ethnic groups, mirroring our interaction analyses.

2)Also authors conducted z-standardization to have z-score of GRS across whole sample in the UK Biobank set. This approach may diminish ethnic differences due to skewed proportion of White British (approximately 96% of total). It seems not appropriate to use z-standardization across whole sample for assessing transferability of European GRS to other ethnic groups.

- Our aim was to test whether GRS using GWAS derived from European samples would predict dementia in people from other ethnic groups. In order to explore the effect of ethnicity on the performance of the GRS, we needed to include all ethnic groups in the initial z-standardisation. If we only calculated z-score for each ethnic group separately this would have meant being unable to include ethnic group as a variable in our regression models and would have precluded us from examining ethnicity as an effect modifier.

3)To precisely assess applicability of European PRS models, various PRS calculation methods can be used. Authors can clump using linkage disequilibrium then adjusted p-value thresholds of European GWAS to construct various models (Pruning and thresholding) or leveraging genome-wide SNPs and adjust and shrink effect sizes using Bayesian models (PRScs or EB-PRS).

- Thank you for your comment. We agree that the methods could be used to generate PRS scores. However, we aimed to generate a GRS based solely on genome-wide significant hits from Lambert et al 2013. Therefore, we only included these 21 robustly associated SNPs. The SNPs were unlinked so did not require further processing. We opted for a limited GRS as we did not think that calculating a PRS would offer any advantage in this context. This approach has been used in several other studies, for example: doi.org/10.1212/WNL.0000000000200544

4)Authors are strongly required to graphically illustrate their result regarding their results.

- As requested we have now provided AUROC curves for all groups in the supplementary material

Minor points

1)Authors need to change the terms for APOE alleles; 0, 1 and 2 into non-carrier, heterozygous ε2 allele carrier, and homozygous ε2 allele carrier, respectively.

- We have now changes this in the text as suggested

2)In the Table 2, APOE allele non-carriers (0 in this paper) were labelled as “Reference” but this is not necessarily presented in the table, and exact p-values are missing for interaction effects.

- We have labelled non-carriers as reference category in the table and provided confidence intervals and p values in brackets in the table. We have copied the table below

Main effect

Exposure OR* LCI UCI p value Interaction effect, South Asian participants, OR (95%CI, p-value) Interaction effect, Black participants, OR (95%CI, p-value)

ε2 alleles 0 Reference OR 1.46 (0.58-3.66, 0.336) OR 0.95 (0.47-1.95, 0.896)

1 0.54 0.48 0.60 <0.0001

2 0.56 0.35 0.89 0.014

ε4 alleles 0 Reference OR 0.92 (0.60-1.39, 0.676) OR 0.99 (0.72-1.37, 0.974)

1 2.59 2.44 2.75 <0.0001

2 8.47 7.68 9.34 <0.0001

z-GRS with APOE - effect per SD increase 1.73 1.69 1.77 <0.0001 OR 0.97 (0.76-1.20, 0.687) OR 0.99 (0.83-1.17, 0.864)

z-GRS without APOE – effect per SD increase 1.21 1.18 1.24 <0.0001 OR 0.82 (0.65-1.05, 0.113) OR 0.94 (0.72-1.23, 0.620)

3)Authors have used threshold for Info score of 0.9, and this might be too high. Authors may present references for justifying this threshold.

- The Info score is a measure of imputation reliability and we wished to only include SNPs which we could be relatively certain were imputed accurately. We therefore chose the threshold of 0.9 to avoid including poor quality data. This threshold is at the discretion of researchers but we preferred to err on the side of caution. See for example https://doi.org/10.1371/journal.pone.0009697

4)Authors are required to proofread the manuscript to avoid awkward descriptions.

- We have now done this but are happy to edit specific sections if you felt these were unclear.

5)Result section should be divided into subtitles for better readability.

- We have now added subheadings as suggested.

Reviewer #2: Through Genome-Wide Association Studies (GWAS), researchers have developed Genetic Risk Scores (GRS) for numerous diseases. However, these studies are mostly biased towards European white races, so their effectiveness in other populations is not certain. Recognizing this problem, the authors verified the GRS developed for dementia in other races other than European white.

Such verification is expected to improve reliability in the use of the developed GRS as well as to enable the classification of GRS that cannot be used for various races. Therefore, I agree to accept this research for publication on Plos One.

- Thank you for reviewing our manuscript.

I found a few minor things that needed to be corrected as described below.

1. The tables in the paper are analyzed for self-defined ethnicity, and the tables in the supplement material are analyzed for genetically-defined ethnicity. Although the results are similar, is there a reason to set the analysis result for self-defined ethnicity as the main table?

- We chose self-defined ethnicity for our primary analysis as this is more easily available for most studies and is a practical choice for grouping. We have stated this in our methods section.

2. East Asian (n=2464) was also defined in the supplement material, but why was it excluded from the analysis?

- The QC procedures for non-European participants identified an East Asian group and is shown in the supplementary material for completeness. However, we pre-specified we would use self-defined ethnic group and compare between the three largest ethnic groups in the sample which were White, South Asian and Black only.

3. A total of 21 SNPs including APOE were selected, and the GRS excluding APOE also had an odds ratio >1. It seems that the GRS for other SNPs is also meaningful. Are there any related analysis results?

- We are not sure what related analyses you mean. We have included all results including regression analyses for primary and secondary outcomes with and without APOE and using self-defined ethnicity and genetic ancestry grouping.

4. Typo(5-6 page) - The numbers in the quotation marks in the manuscript are different from Table 1- South Asian participants had the lowest frequency of ε2 alleles (“7.7%” with one and 0.2% with two alleles) and Black participants had the highest frequency (“14.8%” with one and 1.2% with two alleles) compared to the White participants (12.3% one and 0.6% two alleles). The ε4 allele was also most common in Black participants (“35.3%” one allele; 5.3% two alleles vs “17.3%” and 1.0% in South Asians and 26.3% and 2.4% in White participants).

- thank you for noticing this. We have now corrected the numbers in the text.

5. “%” is not attached to (1.3) in the White British column of the Dementia cases N(%) row of Table

- We have now removed the % sign from the other numbers as we have indicated in the first column that all numbers in brackets are percentages

Attachment

Submitted filename: Response to reviewers.docx

Decision Letter 1

Jeong-Sun Seo

26 Oct 2022

Genetic risk scores and dementia risk across different ethnic groups in UK Biobank

PONE-D-22-19235R1

Dear Dr. Naaheed Mukadam,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Jeong-Sun Seo

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #2: The authors have addressed all my comments satisfactorily. I would recommend this manuscript to move towards publication.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #2: No

**********

Acceptance letter

Jeong-Sun Seo

11 Nov 2022

PONE-D-22-19235R1

Genetic risk scores and dementia risk across different ethnic groups in UK Biobank

Dear Dr. Mukadam:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Jeong-Sun Seo

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig

    (TIF)

    S2 Fig

    (TIF)

    S3 Fig

    (TIF)

    S1 File

    (DOCX)

    S2 File

    (PDF)

    Attachment

    Submitted filename: Response to reviewers.docx

    Data Availability Statement

    The data that support the findings of this study (UKB project ref 40055) contain sensitive and potentially identifying information and therefore not suitable for sharing. This is due to the terms of the UK Biobank data use. Further details on the study are available from the corresponding author, (NM) upon reasonable request. For access to the data please contact access@ukbiobank.ac.uk.


    Articles from PLOS ONE are provided here courtesy of PLOS

    RESOURCES