Abstract
Objective:
The purpose of this study was to infer causal relationships between 22 previously reported risk factors for Alzheimer’s disease (AD) and the “AD phenome”: AD, AD age of onset (AAOS), hippocampal volume, cortical surface area and thickness, cerebrospinal fluid (CSF) levels of amyloid-β (Aβ42), tau, and ptau181, and the neuropathological burden of neuritic plaques, neurofibrillary tangles (NFTs), and vascular brain injury (VBI).
Methods:
Polygenic risk scores (PRS) for the 22 risk factors were computed in 26,431 AD cases/controls and the association with AD was evaluated using logistic regression. Two-sample Mendelian randomization (MR) was used to infer the causal effect of risk factors on the AD phenome.
Results:
PRS for increased education and diastolic blood pressure were associated with reduced risk for AD. MR indicated that only education was causally associated with reduced risk of AD, delayed AAOS, and increased cortical surface area and thickness. Total- and LDL-cholesterol levels were causally associated with increased neuritic plaque burden, although the effects were driven by single nucleotide polymorphisms (SNPs) within the APOE locus. Diastolic blood pressure and pulse pressure are causally associated with increased risk of VBI. Furthermore, total cholesterol was associated with decreased hippocampal volume; smoking initiation with decreased cortical thickness; type 2 diabetes with an earlier AAOS; and sleep duration with increased cortical thickness.
Interpretation:
Our comprehensive examination of the genetic evidence for the causal relationships between previously reported risk factors in AD using PRS and MR supports a causal role for education, blood pressure, cholesterol levels, smoking, and diabetes with the AD phenome.
Late-onset Alzheimer’s disease (AD) is a debilitating neurological condition characterized by progressive neurodegeneration and deterioration of cognitive function leading to dementia.1 The primary neuropathological hallmarks of AD are the aggregation of extracellular amyloid-β (Aβ) peptides into amyloid plaques and of intracellular hyperphosphorylated tau into neurofibrillary tau tangles (NFTs) often accompanied by cerebrovascular injury or other pathologies.1
In the absence of any disease-modifying therapies, the number of people living with dementia in the United States is expected to exceed 13.8 million by 2050.1 Observational studies have identified potentially modifiable risk factors that could be targeted in intervention studies to reduce the risk of dementia or delay its onset, thereby significantly reducing the population prevalence of AD and related dementias.2 From these studies, it has been estimated that 40% of AD cases may be attributable to preventable causes, such as low educational attainment, hearing loss, traumatic brain injury, hypertension, alcohol consumption, obesity, smoking, depression, physical inactivity, social isolation, air pollution, and diabetes.3,4 However, the quality of evidence for interventions targeting these risk factors to reduce the risk of dementia or cognitive decline is mixed.2,5 Lifestyle interventions that target modifiable risk factors are entirely dependent on accurate causal relationships being established between modifiable risk factors and AD. In observational studies, a correlation between a risk factor and AD cannot be reliably interpreted as evidence of a causal relationship due to potential confounding or reverse causation. Therefore, unless those modifiable factors specifically exacerbate disease progression, disease reduction strategies targeting them will not be successful.
Methods of causal inference that exploit genetic information, such as polygenic risk scores (PRS) and Mendelian randomization (MR), can overcome some of the limitations of observational studies. PRS are a measure of an individual’s genetic propensity to a trait and can be used in cross-trait analyses to test whether genetic liability for one trait is associated with disease risk for a second.6 Although this does not imply that the trait causally modifies disease risk, because there are several alternative explanations, such a PRS-disease association would be expected if the trait were causal of disease, and thus PRS can be used to prioritize putative causal risk factors.6 MR uses genetic variants as proxies for environmental exposures to infer a causal relationship between an intermediate exposure and a disease outcome. MR is akin to conducting a “genetic randomized control trial,” with the risk factors (genotypes) randomly allocated (from parents to offspring), independent of confounding factors that influence the risk factors and disease and unaffected by reverse causation.7 Although MR can be used to infer causal relationships between traits, it typically has lower statistical power than tests of PRS-disease associations.6 The results from PRS and MR analyses can be integrated and compared with those from traditional epidemiological studies and randomized control trials (RCTs). By integrating the results from multiple approaches that have different sources of bias, it is possible to rigorously assess the evidence regarding causality of interventions targeting modifiable risk factors to reduce the risk of dementia.8
In this study, we used PRS and MR to infer causal relationships among potentially modifiable risk factors for dementia (alcohol consumption, smoking, hearing loss, diabetes mellitus, obesity, dyslipidemia, blood pressure depression, sleep, social isolation, physical activity, and diet) reported in the Dementia prevention, intervention, and care: 2020 report of the Lancet Commission4 and the World Health Organization’s Guidelines for Risk Reduction of Cognitive Decline and Dementia5 and the AD phenome (AD status, AD age of onset survival [AAOS], cerebrospinal fluid [CSF] levels of Aβ42, tau and hyperphosphorylated tau [ptau181], hippocampal volume, cortical surface area and thickness, and the neuropathological burden of neuritic plaques, NFTs, and vascular brain injury [VBI]). Based on these analyses, we identified a subset of modifiable risk factors that represent the most promising targets for public health initiatives to reduce AD burden in the population.
Methods
Genomewide Association Summary Statistics
We obtained genomewide association study (GWAS) summary statistics (GWAS-SS) for each exposure and outcome of interest (Table 1). Modifiable risk factors included alcohol consumption,9 the alcohol use disorder identification test (AUDIT),10 moderate-vigorous physical activity (MVPA),11 lipid traits,12 systolic blood pressure (SBP), diastolic blood pressure (DBP), pulse pressure (PP),13 type 2 diabetes (T2D),14 body mass index (BMI),15 meat-related diet and a fish- and plant-related diet,16 depression,17 insomnia symptoms,18 sleep duration,19 social isolation,20 smoking initiation,9 cigarettes per day,9 educational attainment,21 and hearing difficulty.22 These summary statistics were used to generate PRS and as the exposures in the MR analyses.
TABLE 1.
Study | Trait | Cohort/consortium | N | Age | Females, % |
---|---|---|---|---|---|
Exposures | |||||
Liu et al 2019 | Alcohol consumption | GSCAN; 23andMe | 941,280 | — | — |
Smoking initiation | GSCAN; 23andMe | 1,232,091 | — | — | |
Cigarettes per day | GSCAN; 23andMe | 337,334 | — | — | |
Sanchez-Roige et al 2019 | Alcohol use disorder test | UKBB; 23andMe | 141,932 | — | — |
Wells et al 2019 | Hearing difficulty | UKBB | 250,389 | — | — |
Xue et al 2018 | Type 2 diabetes | DIAGRAM; UKBB; GERA | 659,316 | — | — |
Yengo et al 2018 | BMI | UKBB; GIANT | 690,495 | — | — |
Willer et al 2013 | Total cholesterol | GLC | 188,577 | 54.94 | 56.58 |
LDL cholesterol | |||||
HDL cholesterol | |||||
Triglycerides | |||||
Evangelou et al 2018 | DBP | UKBB; ICBP | 757,601 | — | — |
SBP | |||||
PP | |||||
Howard et al 2019 | Depression | UKBB; PGC; deCODE; iPSYCH; GeneScotland; GERA; 23andMe | 807,553 | — | — |
Jansen et al 2018 | Insomnia symptoms | UKBB; 23andMe | 1,331,010 | — | — |
Dashti et al 2019 | Sleep duration | UKBB | 446,118 | 57.3 | 54.1 |
Day et al 2018 | Social isolation | UKBB | 452,302 | — | — |
Lee et al 2018 | Educational attainment | UKBB; SSGAC; 23andMe | 1,131,881 | 63.8 | 54.7 |
Klimentidis et al 2018 | Moderate-vigorous physical activity | UKBB | 377,234 | — | — |
Niarchou et al 2020 | Meat-related diet | UKBB | 335,576 | — | 54% |
Fish and plant-related diet | UKBB | 335,576 | — | 54% | |
Outcomes | |||||
Lambert et al 2013 | Late onset AD | IGAP | 54,162 | 71 | 58.4 |
Kunkle et al 2019 | Late onset AD | IGAP | 63,926 | 72.6 | 58.5 |
Huang et al 2017 | AAOS | IGAP | 40,255 | 77.5 | 60.35 |
Deming et al 2017 | CSF Aβ42 | Knight-ADRC | 3,146 | 71.8 | 49.57 |
CSF Ptau181 | |||||
CSF Tau | |||||
Hibar et al 2015 | Hippocampal volume | ENIGMA | 13,688 | 39.9 | 51.8 |
Hibar et al 2017 | Hippocampal volume | ENIGMA; CHARGE | 26,814 | 54.3 | 55.3 |
Grasby et al 2020 | Cortical surface area | ENIGMA | 33,709 | 45.9 | 51.9 |
Cortical thickness | |||||
Beecham et al 2014 | NPs | ADGC | 4,914 | 74.7 | 65.4 |
NFTs | |||||
VBI |
AAOS = Alzheimer’s disease age of onset; AB = amyloid-β; AD = Alzheimer’s disease; BMI = body mass index; CSF = cerebrospinal fluid; DBP = diastolic blood pressure; HDL = high-density lipoprotein; LDL = low-density lipoprotein; NFTs = neurofibrillary tangles; NPs = neuritic plaques; PP = pulse pressure; Ptau181 = hyperphosphorylated tau; SBP = systolic blood pressure; VBI = vascular brain injury.
GWAS-SS for the AD phenome consisted of late-onset AD status,23 AAOS,24 CSF levels of Aβ42, ptau181 and total tau (Tau),25 hippocampal volume,26 cortical surface area and thickness,27 neuropathological burden of neuritic plaques, NFT burden and VBI.28 Although VBI is not a classical neuropathological hallmark of AD, concurrent cerebrovascular injury is a common neuropathological finding in AD.29 As such, VBI was included in the AD phenome to disentangle whether a particular risk factor may influence the clinical symptomatology of AD via vascular or amyloid/tau pathways. Due to data use restrictions associated with evaluating alcohol intake and education phenotypes in the most recent GWAS of AD and hippocampal volume, we used an earlier GWAS for AD30 and hippocampal volume31 for estimating the causal effect of alcohol intake and educational attainment on these phenotypes. These summary statistics were used as outcomes in the MR analyses.
GWAS-SS that were mapped to earlier human genome builds were lifted over to Human Genome Build 19.32 GWAS-SS were standardized using a pipeline,33 that (1) aligns effect alleles to the alternate allele on the forward strand of the human genome reference build and normalizes indels, (2) annotates variants with marker names using chromosome:position:ref:alt, 1000 Genomes rsIDs (phase III), and database-single-nucleotide polymorphism (dbSNP) rsIDs (b151) (3) where allele frequencies are missing, annotates allele frequencies using non-Finnish Europeans from gnomAD (version 2.1), and (4) convert summary statistics to VCF and TSV files.
Alzheimer’s Disease Genetics Consortium
Individual-level genetic and phenotypic data used to compute and test the association of PRS were obtained from the Alzheimer’s Disease Genetics Consortium (ADGC), a large multicenter project composed of 34 separate cohorts with the goal of performing genomewide analyses of AD. The recruitment and genotyping of ADGC samples has been described in detail elsewhere.23,34 Briefly, genotype data in each cohort underwent stringent quality control (QC) checks, with variants excluded if the call rate < 0.95, not in Hardy–Weinberg equilibrium (p < 1 × 10–6), and samples excluded if the call rate was < 0.95, discordant sex was reported based on X chromosome heterozygosity, cryptic relatedness, and non-European ancestry. Related individuals were determined within and across cohorts by identity-by-descent (IBD) using KING, 35 with individuals excluded based on a proportion of IBD < 0.1875, corresponding to less than halfway between second-and third-degree relatives. Ancestry was determined empirically by projecting samples onto principal components from known ancestral populations in the 1000 Genomes Project, with samples determined to be European population outliers if they were ± 6 SD away from the European population mean on the first 10 principal components using PC-Air36 and PLINK.37 Single nucleotide polymorphisms (SNPs) that were not directly assayed were imputed on the Michigan Imputation Server individually for each of the cohorts or subcohorts using all ethnicities of the Haplotype Reference Consortium (HRC) 1.1 reference panel.38 Eagle was used for phasing and Minimac3 was used for imputation. Following imputation, poorly imputed (r2 < 0.8) or rare (minor allele frequency [MAF] < 0.01) variants were removed and the cohorts merged for joint analysis. Following this merger, variants with low call rate due to differential imputation (< 95%) were removed, and then samples with low call rates (< 95%) were removed. Within-ancestry principal components were created using PLINK to correct for residual population stratification within the European population subset. After sample QC, 26,431 participants were available (Table 2). Written informed consent was obtained from study participants or, for those with substantial cognitive impairment, from a caregiver, legal guardian, or other proxy, and the study protocols for all populations were reviewed and approved by the appropriate institutional review boards (IRBs).
TABLE 2.
Variable | Cases (n = 13,312) | Controls (n = 13,119) |
---|---|---|
Female | 7,699 (57.8%) | 7,785 (59.3%) |
APOE e4+ | 7,690 (57.8%) | 3,085 (23.5%) |
Age, yr | 73.4 (8.3) | 76.6 (8.3) |
ADGC = Alzheimer’s Disease Genetics Consortium.
Polygenic Risk Scores
PRSice-2 was used to construct the PRS for each of the exposures of interest in ADGC.39 PRSice generates PRS as the sum of all alleles associated with the exposure of interest exceeding a given p value threshold (pt), weighted by their effect size estimated in an independent GWAS on the trait. SNPs were clumped to obtain variants in linkage equilibrium with an r2 > 0.001 within a 10 MB window and PRS were constructed across a range of pt (pt = 5e-8, 1e-6, 1e-5, 1e-4, 1e-3, 0.01, 0.05, 0.1, 0.2, 0.3, 0.4, and 0.5). We then performed a principal component analysis (PCA) on the resulting PRS and used the first PRS-PCA in subsequent association tests.40 The PCA reweights the variants included in the PRS to achieve maximum variation across all the pt.40 This PRS-PCA approach avoids the optimization step in standard pruning and thresholding models to determine the optimal pt, which can inflate type 1 error and result in overfitting.40 As a sensitivity analysis, we further excluded variants located ±250 kb of the APOE e4 defining SNP, rs429358. The association between each exposure PRS and AD was evaluated using logistic regression adjusting for age, sex, APOE ε4 dose, and 10 principal components. The Benjamini & Hochberg false discovery rate was used to account for the multiple testing across the 22 different exposures.
Mendelian Randomization Analysis
We inferred causal relationships between each modifiable risk factor (exposures) and the AD phenome (outcomes) using 2-sample MR, in which the associations for the exposure- and outcome-genetic instrumental variables are obtained from GWAS-SS generated from different, non-overlapping samples.
Genetic Instruments
For each exposure, we constructed 2 different sets of instrumental variables (IVs), corresponding to independent 1) genomewide significant SNPs (p < 5 × 10−8) and 2) SNPs of at least borderline significance (p < 5 × 10−6). Increasing the number of SNPs used as IVs increases the phenotypic variance explained and, thus, has the potential to increase statistical power. However, if the additional variants included violate the core MR assumptions then they may instead reduce power, biasing the results toward the null by introducing weak instrument bias. To obtain independent SNPs, linkage disequilibrium (LD) clumping was performed by excluding SNPs that have an r2 > 0.001 with another variant with a smaller p value association within a 10 MB window using PLINK.37 For genetic variants that were not present in the outcome GWAS, PLINK was used to identify proxy SNPs that were in LD (r2 > 0.8; European reference population). Finally, the exposure and outcome GWAS datasets were harmonized so that the effect size for the exposure and outcome corresponded to the same effect alleles. Genetic variants that were palindromic with ambiguous allele frequencies (AF > 0.42), or that had incompatible alleles, were removed. Instruments that were genomewide significant for the outcome were removed. As a sensitivity analysis, we further excluded variants located ± 250 kb from the APOE ε4 defining SNP, rs429358. The proportion of variance in the phenotype explained by each instrument and F-statistic were calculated, as previously described.41,42
Statistical Analysis
For each genetic variant, we calculated an instrumental variable ratio estimate by dividing the SNP-exposure by SNP-outcome and the resulting coefficients were combined in a fixed-effects meta-analysis using an inverse-variance weighted (IVW) approach to give an overall estimate of causal effect.7 The IVW method assumes that all SNPs included in the causal estimate are valid instruments - that is, that they do not violate any of the underlying MR assumptions, in particular horizontal pleiotropy, whereby genetic variants have direct effects on multiple phenotypes, could lead to false inference of causal associations.7 In order to account for potential violations of the assumptions underlying the IVW analysis, we conducted sensitivity analyses using alternative MR methods known to be more robust to horizontal pleiotropy in particular, but at the cost of reduced statistical power. The alternative approaches included (1) Weighted Median Estimator (WME), which tests the median effect of all of the IV variants, allowing 50% of variants to exhibit horizontal pleiotropy7; (2) Weighted Mode Based Estimator (WMBE), which clusters variants into groups based on the similarity of causal effects and reports the final causal effect based on the cluster with the largest number of variants7; and (3) MR-Egger regression, which allows all variants to be subject to direct effects that bias the estimate in the same direction.7
The MR-Egger regression intercept was used to verify the absence of pleiotropic effects of the SNPs on the outcome.7 To further confirm the absence of distortions in the causal effects due to heterogeneity or horizontal pleiotropy, we used the MR Pleiotropy Residual Sum and Outlier (MR-PRESSO) test to detect and correct for horizontal pleiotropic outliers.43 Where heterogeneity was detected (the MR-PRESSO Global Test) and significant outliers were detected (MR-PRESSO Outlier Test), the outliers were removed.
We report the IVW results for the set of IV variants (at p < 1 × 10−8 or 5 × 10−6) with the smallest p value, outliers were removed if detected. Where there was evidence of horizontal pleiotropy or heterogeneity (MR-PRESSO Global Test p < 0.05 or an MR-Egger Intercept p < 0.05), we report the IVW results for which the sensitivity analyses were also significant and the effect direction was concordant with the IVW results. To account for multiple testing, we report q-values, a false discovery rate-based measure of significance.44 Power analyses were conducted using the non-centrality parameter-based approach using the observed IVW coefficient.45
As a further sensitivity analysis to evaluate potential reverse causation, we also conducted MR in the reverse direction with the AD phenome as the exposure and the modifiable risk factors as the outcome using the same analysis pipeline as detailed above. Variants located ± 250 kb from the APOE ε4 defining SNP, rs429358, were excluded. For the neuropathology and CSF endophenotypes, analyses were limited to selecting nominally significant SNPs (p < 5 × 10−6) due to lack of genome-wide significant SNPs available to include as instruments, and additionally, are likely to be underpowered.
All statistical analyses were conducted using R version 3.6.3. MR analysis was performed using the “TwoSampleMR” version 0.4.22 package.7 A Snakemake workflow was constructed that automates the PRS and MR analysis pipelines and allows for multiple exposure – outcomes datasets to be run in parallel.46
The SNPs used as IVs, their harmonized effects, and outliers are presented in Supplementary Tables S1 and S2. The causal estimates for each p value threshold, MR method, and pre- and post-outlier removal and bidirectional results are presented in Supplementary Tables S3 to S5.
Results
Polygenic Risk Score Analysis
We evaluated the association of 22 PRS for potentially modifiable risk factors with AD in ADGC (Table 3). After correction for multiple testing, a 1 SD higher PRS for educational attainment was associated with decreased risk of AD (odds ratio [OR] = 0.93, confidence interval [CI] = 0.91, 0.96). Additionally, a 1 SD increased DBP was nominally (false discovery rate [FDR] < 0.1) associated with reduced risk of AD (OR = 0.96, CI = 0.94, 0.99). These results remained unchanged after excluding variants located in the APOE locus.
TABLE 3.
Including APOE | Excluding APOE | |||||
---|---|---|---|---|---|---|
Exposure | b (se) | p | FDR | b (se) | p | FDR |
Educational attainment | −0.07 (0.014) | 8.90E-07 | 2E-05 | −0.07 (0.014) | 8.90E-07 | 2E-05 |
DBP | −0.038 (0.014) | 0.007 | 0.076 | −0.038 (0.014) | 0.007 | 0.076 |
Total cholesterol | 0.034 (0.014) | 0.017 | 0.127 | 0.014 (0.014) | 0.336 | 0.825 |
Meat diet | −0.028 (0.014) | 0.043 | 0.239 | −0.026 (0.014) | 0.063 | 0.459 |
LDLs | 0.024 (0.015) | 0.093 | 0.409 | −0.001 (0.014) | 0.965 | 0.968 |
Social isolation | 0.022 (0.014) | 0.115 | 0.423 | 0.022 (0.014) | 0.115 | 0.635 |
SBP | −0.019 (0.014) | 0.184 | 0.579 | −0.019 (0.014) | 0.184 | 0.81 |
Moderate-to-vigorous PA | 0.017 (0.014) | 0.233 | 0.64 | 0.017 (0.014) | 0.233 | 0.825 |
Cigarettes per day | −0.014 (0.014) | 0.305 | 0.746 | −0.014 (0.014) | 0.31 | 0.825 |
Hearing difficulties | −0.013 (0.014) | 0.351 | 0.771 | −0.013 (0.014) | 0.351 | 0.825 |
AUDIT | −0.012 (0.014) | 0.393 | 0.786 | −0.012 (0.014) | 0.393 | 0.825 |
Alcohol consumption | 0.011 (0.014) | 0.451 | 0.807 | 0.011 (0.014) | 0.45 | 0.825 |
PP | −0.01 (0.014) | 0.491 | 0.807 | −0.012 (0.014) | 0.419 | 0.825 |
Smoking Initiation | −0.009 (0.014) | 0.545 | 0.807 | −0.009 (0.014) | 0.516 | 0.872 |
BMI | −0.008 (0.014) | 0.55 | 0.807 | −0.008 (0.014) | 0.573 | 0.901 |
Depressive symptoms | −0.005 (0.014) | 0.743 | 0.992 | −0.005 (0.014) | 0.743 | 0.961 |
Type 2 diabetes | 0.004 (0.014) | 0.807 | 0.992 | 0.004 (0.014) | 0.807 | 0.968 |
Sleep duration | 0.002 (0.014) | 0.871 | 0.992 | 0.002 (0.014) | 0.871 | 0.968 |
Fish and plant diet | 0.002 (0.014) | 0.913 | 0.992 | 0.001 (0.014) | 0.968 | 0.968 |
Insomnia symptoms | −0.001 (0.014) | 0.942 | 0.992 | −0.001 (0.014) | 0.939 | 0.968 |
Triglycerides | −0.001 (0.014) | 0.967 | 0.992 | −0.006 (0.014) | 0.652 | 0.956 |
HDL | 0 (0.014) | 0.992 | 0.992 | 0.005 (0.014) | 0.728 | 0.961 |
AUDIT = alcohol use disorder identification test; BMI = body mass index; DBP = diastolic blood pressure; HDL = high-density lipoprotein; LDL = low-density lipoprotein; PA = physical activity; PP = pulse pressure; Ptau181 = hyperphosphorylated tau; SBP = systolic blood pressure.
Mendelian Randomization Analysis
We used MR to infer causal relationships among 22 potentially modifiable risk factors and 11 AD outcomes, across 2 sets of IV variants corresponding to 2 different p value thresholds. We observed 12 exposure-outcome pairs that were significant at an FDR < 0.05 and that either showed no evidence of heterogeneity or horizontal pleiotropy, or in the presence of heterogeneity or horizontal pleiotropy, the additional MR sensitivity analyses were significant (Fig; Table 4). The descriptive statistics for the number of SNPs, PVE, F-statistics, and power for each exposure are presented in Supplementary Table S6, with estimates for the individual exposure-outcome pairs are presented in Supplementary Tables S3 to S5.
TABLE 4.
IVW | MR-Egger | WMBE | WME | MR-PRESSO Global | MR-Egger Intercept | ||||
---|---|---|---|---|---|---|---|---|---|
Exposure | pt | SNPs | b (se) | q-value | b (se) | b (se) | b (se) | p | p |
LOAD | |||||||||
Educational attainment | 5E-08 | 478 | −0.44 (0.071) | 7.73E-08 | −0.55 (0.27)* | −0.44 (0.11)*** | −0.43 (0.38) | 0.04156 | 0.68 |
AAOS | |||||||||
Educational attainment | 5E-06 | 716 | −0.28 (0.058) | 5.67E-05 | −0.19 (0.21) | −0.31 (0.09)*** | −0.28 (0.25) | 0.003 | 0.62 |
Type 2 diabetes | 5E-06 | 218 | 0.072 (0.017) | 9.93E-04 | 0.035 (0.044) | 0.045 (0.033) | 0.067 (0.034)* | 5E-04 | 0.34 |
NPs | |||||||||
LDLs | 5E-08 | 74 | 0.7 (0.19) | 0.006 | 0.67 (0.32)* | 0.51 (0.33) | 0.21 (0.41) | 0.2495 | 0.91 |
Total cholesterol | 5E-06 | 122 | 0.69 (0.18) | 0.004 | 0.7 (0.31)* | 0.8 (0.31)** | 0.71 (0.44) | 0.8558 | 0.95 |
NFTs | |||||||||
Total cholesterol | 5E-06 | 123 | 0.33 (0.11) | 0.042 | 0.28 (0.23) | 0.15 (0.19) | 0.32 (0.2) | 0.0051 | 0.76 |
VBI | |||||||||
DBP | 5E-06 | 608 | 0.073 (0.017) | 0.001 | 0.1 (0.041)* | 0.068 (0.028)* | 0.022 (0.056) | 0.58812 | 0.47 |
PP | 5E-08 | 384 | 0.058 (0.017) | 0.013 | 0.14 (0.045)** | 0.055 (0.028). | 0.033 (0.068) | 0.17312 | 0.057 |
Hippocampal volume | |||||||||
Total cholesterol | 5E-06 | 125 | −0.065 (0.02) | 0.019 | −0.034 (0.035) | −0.084 (0.034)* | −0.061 (0.032). | 0.0974 | 0.26 |
Cortical surface area | |||||||||
AUDIT | 5E-06 | 51 | 5,400 (1,400) | 0.003 | −4,800 (9,900) | 3,400 (2,300) | 460 (4,000) | 0.0088 | 0.3 |
Educational attainment | 5E-06 | 707 | 4,600 (440) | 1.17E-22 | 2,300 (1,800) | 3,000 (730)*** | 370 (3,300) | <4e-05 | 0.18 |
Insomnia symptoms | 5E-06 | 375 | −4,100 (1,300) | 0.028 | −4,900 (8,200) | −2,700 (2,000) | 700 (5,800) | 0.0016 | 0.92 |
Type 2 diabetes | 5E-06 | 217 | −460 (140) | 0.016 | −410 (400) | −360 (310) | −340 (310) | <1e-04 | 0.89 |
Cortical thickness | |||||||||
BMI | 5E-08 | 506 | −0.0092 (0.0024) | 0.003 | −0.0062 (0.0075) | −0.0077 (0.0041). | −0.019 (0.011). | <4e-05 | 0.67 |
Educational attainment | 5E-06 | 710 | 0.01 (0.0031) | 0.016 | 0.012 (0.012) | 0.011 (0.0049)* | 0.013 (0.014) | <4e-05 | 0.87 |
Sleep duration | 5E-06 | 191 | 0.016 (0.0045) | 0.01 | 0.012 (0.02) | 0.018 (0.007)* | 0.036 (0.016)* | 0.0034 | 0.83 |
Smoking initiation | 5E-06 | 554 | −0.022 (0.0065) | 0.012 | −0.08 (0.029)** | −0.019 (0.0099). | 0.0083 (0.028) | <4e-05 | 0.04 |
AAOS = Alzheimer’s disease age of onset; AUDIT = alcohol use disorder identification test; BMI = body mass index; DBP = diastolic blood pressure; IVW = inverse-variance weighted; LDL = low-density lipoprotein; MR = Mendelian randomization; MR-PRESSO = Mendelian randomization pleiotropy residual sum and outlier; NFTs = neurofibrillary tangles; NPs = neuritic plaques; PP = pulse pressure; SNPs = single nucleotide polymorphisms; VBI = vascular brain injury; WMBE = Weighted Mode Based Estimator; WME = Weighted Median Estimator.
= p < 0.05;
= p < 0.01;
= p < 0.001.
Genetically predicted higher educational attainment was associated with significantly (1) lower risk of Alzheimer’s disease (OR = 0.64, CI = 0.56, 0.74), (2) delayed AAOS (hazard ratio [HR] = 0.76, CI = 0.67, 0.85), (3) increased cortical surface area (β mm2 = 4,600, CI = 3,737.6, 5462.4), and (4) increased cortical thickness (β mm = 0.01, CI = 0, 0.02). Genetically predicted higher diastolic DBP (OR = 1.08, CI = 1.04, 1.11) and PP (OR = 1.06, CI = 1.02, 1.1) were associated with significantly increased risk of VBI. Genetically predicted longer sleep duration was associated with significantly increased cortical thickness after outlier removal (β mm = 0.02, CI = 0.01, 0.02). Genetically predicted smoking status was associated with significantly reduced cortical thickness (β mm = −0.02, CI = −0.03 to −0.01]). Genetically predicted type 2 diabetes was associated with significantly earlier AAOS (HR = 1.07, CI = 1.04, 1.11). Genetically predicted increased low-density lipoproteins (OR = 2.01, CI = 1.39, 2.92) and total cholesterol levels (OR = 1.99, CI = 1.4, 2.84) were associated with significantly increased risk of neuritic plaques. Additionally, increased total cholesterol levels were associated with reduced hippocampal volume (β = −0.06, CI = −0.1 to −0.03]). However, after excluding variants in the APOE locus, these associations between total cholesterol and low-density lipoproteins with neuritic plaques and total cholesterol and hippocampal volume were nonsignificant.
For the risk factors that were causally associated with the AD phenome, bidirectional MR analysis indicated that increased cortical surface area and cortical thickness were significantly associated with increased educational attainment and a higher risk of smoking, respectively.
A further 5 risk factors, including AUDIT, diabetes, BMI, total cholesterol, and insomnia, were causally associated with the AD phenome in the IVW analysis (see Table 4; Fig), however, there was evidence of heterogeneity and the sensitivity analyses were nonsignificant suggesting that the observed associations were not robust to violations of MR underlying assumptions.
Discussion
Using genetic variants as proxies for modifiable risk factors, we applied PRS and MR analyses to investigate the association of putative modifiable risk factors with the AD phenome. PRS for higher educational attainment and DBP were observed to be associated with reduced risk for AD. In the MR analysis, only higher educational attainment was causally associated with a reduced risk of AD. Additionally, education was causally associated with a delayed AAOS, increased cortical surface area, and increased cortical thickness. There was no evidence that education was causally associated with AD neuropathology or CSF biomarkers, supporting the hypothesis that education mitigates dementia risk via cognitive reserve rather than affecting AD pathogenesis.47 Bidirectional MR analysis also indicated a bidirectional effect of cortical surface area on education.
The lack of causal associations between modifiable risk factors and AD may reflect heterogeneity in the underlying pathogenesis that can lead to clinical phenotypes analogous to AD. An endophenotype is usually less genetically complex than the disorder it underlies due to the endophenotype being influenced by fewer genetic risk factors than the disease as a whole and reflecting a single pathophysiological pathway of the overall clinical disorder. As endophenotypes can be measured in both cases and controls, there is greater power to detect an association due to the effect allele influencing the endophenotype even in asymptomatic carriers. As such, we expanded our MR analysis to infer causal relationships between modifiable risk factors and AD endophenotypes to evaluate how potential risk factors may influence the underlying pathophysiological pathways of AD. We observed (1) higher total-cholesterol and LDL-cholesterol levels to be causally associated with increased risk of neuritic plaque burden, although causal effects were driven by variants located within the APOE locus; (2) higher DBP and PP causally associated with increased risk of VBI; (3) higher total cholesterol was causally associated with decreased hippocampal volume; (4) smoking status was causally associated with reduced cortical thickness; and (5) longer sleep duration was causally associated with increased cortical thickness.
Observational studies have indicated that lifestyle interventions targeting modifiable risk factors can either prevent or delay the age of onset of dementia. In particular, low educational attainment, hearing loss, traumatic brain injury, hypertension, alcohol consumption, obesity, smoking, depression, physical inactivity, social isolation, and diabetes.3,4 However, with the exception of educational attainment, our analyses did not provide strong evidence of a causal association with these risk factors and AD or AAOS. The lack of a causal association between these risk factors and AD could be due to insufficient power in our analyses, but, alternatively, may be a result of confounding or reverse causation in observational studies. For instance, increased physical activity is generally associated with a reduced risk of dementia,3 however, a recent meta-analysis found that the protective association with dementia was observed when physical activity was measured < 10 years before dementia diagnosis, but when measured > 10 years before dementia onset no association with dementia was observed – consistent with reverse causation driving the observed protective association.48 Additionally, although these risk factors may not be associated with AD pathogenesis, they may be associated with the pathogenesis of other dementia subtypes. For instance, the observed association between blood pressure and VBI suggests that while reducing blood pressure in late life may have limited utility in the prevention of AD, it may reduce the risk of vascular dementia by reducing the risk of VBI and therefore affect the risk for all-cause dementia, but not specifically affect the risk of AD.
The association of modifiable risk factor PRS with clinically diagnosed AD has not been extensively studied, although several studies have conducted phenome-wide scans to evaluate the association of AD PRS with a wide range of diseases and other traits. Using data from the UK Biobank (n = 334,398), Richardson and colleagues found that an AD PRS composed of 124 SNPs and inclusive of APOE (pt ≤ 5e-05) was associated with 72 of 551 traits (FDR < 0.05).6 In particular, a higher AD PRS was associated with lower DBP and BMI, reduced risk of self-reported diabetes, shorter sleep duration, increased risk of self-reported high cholesterol, and increased amount of moderate-physical activity.6 Similarly, a second study by Korologou-Linden and colleagues evaluated the association of an AD PRS composed of 18 SNPs, inclusive of APOE (pt ≤ 5e-08) across 15,403 traits in the UK Biobank (n = 334,968).49 A higher AD PRS was associated with 165 traits and, in particular, with lower DBP, lower BMI, increased total cholesterol, levels, reduced risk of self-reported diabetes, increased oily fish consumption, increased sleeplessness or insomnia, reduced sleep duration, increased amount of moderate-physical activity, and increased risk of self-reported depression.49 In a follow-up MR analysis of these traits, only moderate-physical activity was observed to be causally associated with an increased risk of AD.49
Two earlier studies used MR to evaluate the association of potentially modifiable risk with AD cases-control status.50,51 First, Østergaard and colleagues evaluated the association of 13 risk factors with AD and observed that higher SBP, high-density lipoprotein (HDL) cholesterol, and smoking quantity were associated with a reduced risk of AD, whereas higher total cholesterol and low-density lipoprotein (LDL) cholesterol were associated with increased risk.51 No significant associations were observed for BMI, diabetes, insulin resistance, triglycerides, smoking initiation, or education, and after variants in the APOE locus were excluded from the analysis, the cholesterol levels were no longer significantly associated with AD risk.51 Second, Larsson and colleagues evaluated the association of 22 risk factors with AD, finding that years of education, intelligence, and 25-hydroxyvitamin D were associated with a reduced risk of AD, whereas coffee consumption was associated with increased risk.50 No significant associations were observed among alcohol consumption, serum folate, serum vitamin B12, homocysteine, cardiometabolic factors, or C reactive protein with AD.50
The results of this study should be interpreted in conjunction with knowledge of its limitations and those of MR in general. First, we are unable to validate the predictive ability of the exposure PRS as individual level data on the exposure traits were unavailable in the ADGC cohorts and we thus assume that each PRS is a good measure of genetic liability for that trait. Second, although we cannot exclude that our findings may be affected by weak instrument bias, the F-statistics for all of the analyses were greater than 10, indicating that the instrument strength was sufficient for MR analysis.42 However, in 2-sample MR, weak instrument bias is in the direction of the null, thus, we cannot exclude low power as an explanation for the null results.52 In particular, the samples size for the CSF and neuropathological outcomes are small and as such the MR analyses are potentially underpowered. Third, we cannot completely rule out violations of the independence and the exclusion restriction assumption, particularly in regard to pleiotropy.53 Nevertheless, we used several methods to infer robust causal estimates, including outlier removal using MR-PRESSO and WMBE, WME, and MR-Egger sensitivity analyses. Finally, it is assumed that both samples used to generate the GWAS summary statistics used in the MR model come from comparable populations. In evaluating the demographics of the studies used in this analysis, the exposures have an average age of 56.1 to 63.8 years, whereas outcomes, with the exception of hippocampal volume and cortical sufrace area and thickness, have an average age of 71 to 74.7 years. As such, some of the results reported here may be subject to survivor bias.54 Nevertheless, the bias introduced by survival effects is large for exposures that strongly affect survival. However, when selection effects are weak or moderate, selection bias does not adversely affect causal estimates.54
Despite these limitations, this study has significant strengths. We assessed the causal effect of multiple modifiable factors strongly hypothesized as affecting AD risk. In addition, we used the largest GWAS for AD and the exposure traits available at the time of analysis, allowing us to include the largest possible number of instruments for the exposures, resulting in increased statistical power. Finally, rather than limiting our analyses to AD case/control status, we expanded our MR analysis to include AD endophenotypes.
In conclusion, this study used large exposure and outcome GWAS to conduct PRS and MR analyses to infer causal relationships between potentially modifiable risk factors and the AD phenome. The PRS analysis indicated that a higher genetic predisposition education and DBP influenced AD risk. In the follow-up MR analysis, only genetically predicted higher education was observed to have a causal association with reduced AD risk. Expanding our analysis to additional AD endophenotypes, we observed that higher genetically predicted cholesterol levels and blood pressure were associated with increased risk of neuritic plaque burden and VBI, respectively, suggesting that these risk factors influence the development of neurodegenerative disease pathology. The results from this analysis should be considered within a triangulation of evidence framework by integrating the findings from different study types, such as observational studies and RCTs, all of which have different sources of bias to identify and prioritize modifiable risk factors that are robust.
Supplementary Material
Acknowledgments
S.J.A., B.F.H., E.M., and A.M.G. were supported by the JPB Foundation (http://www.jpbfoundation.org) and by the National Institutes of Health (U01AG052411 and U01AG058635; principal investigator Alison Goate). P.F.O. was supported by funding from the UK Medical Research Council (MR/N015746/1) and the National Institute of Health (R01MH122866). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. This analysis was possible due to the generous sharing of genomewide association summary statistics. We would like to thank the research participants and employees of 23andMe for making this work possible. ADGC: The Alzheimer’s Disease Genetics Consortium supported collection and genotyping of samples used in this study through National Institute on Aging (NIA) grants U01AG032984 and RC2AG036528. NCRAD: Samples from the National Centralized Repository for Alzheimer’s Disease and Related Dementias (NCRAD), which receives government support under a cooperative agreement grant (U24 AG21886) awarded by the National Institute on Aging (NIA), were used in this study. We thank contributors who collected samples used in this study, as well as patients and their families, whose help and participation made this work possible. NIAGADS: Data for this study were prepared, archived, and distributed by the National Institute on Aging Alzheimer’s Disease Data Storage Site (NIAGADS) at the University of Pennsylvania (U24 AG041689). The NACC database is funded by NIA/NIH Grant U01 AG016976. NACC data are contributed by the NIA-funded ADCs (Supplementary Table S7).
Footnotes
Additional supporting information can be found in the online version of this article.
Potential Conflicts of Interest
S.J.A., B.F.H., E.M., P.F.O., and A.M.G. have no conflicts of interest to declare.
Data Availability
This study used published summary results from published research papers, with the references for those studies provided in the main paper. Supplementary Tables S1 and S2 provide the harmonized SNP effects needed to reproduce the results of this analysis. The Code used to conduct this analysis is available at: https://github.com/sjfandrews/MR_ADPhenome.
References
- 1.Alzheimer’s Association. 2020 Alzheimer’s disease facts and figures. Alzheimer’s Dement 2020;16:391–460. [Google Scholar]
- 2.Anstey KJ, Ee N, Eramudugolla R, et al. A systematic review of meta-analyses that evaluate risk factors for dementia to evaluate the quantity, quality, and global representativeness of evidence. J Alzheimers Dis 2019;70:S165–S186. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Livingston G, Sommerlad A, Orgeta V, et al. Dementia prevention, intervention, and care. Lancet 2017;390:2673–2734. [DOI] [PubMed] [Google Scholar]
- 4.Livingston G, Huntley J, Sommerlad A, et al. Dementia prevention, intervention, and care: 2020 report of the Lancet Commission. Lancet 2020;396:413–446. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.World Health Organization, Risk Reduction of Cognitive Decline and Dementia: WHO Guidelines 2019. [PubMed] [Google Scholar]
- 6.Richardson TG, Harrison S, Hemani G, Smith GD. An atlas of polygenic risk score associations to highlight putative causal relationships across the human phenome. Elife 2019;8:e43657. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Hemani G, Zheng J, Elsworth B, et al. The MR-Base platform supports systematic causal inference across the human phenome. Elife 2018;7:e34408. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Lawlor DA, Tilling K, Smith GD. Triangulation in aetiological epidemiology. Int J Epidemiol 2017;45:dyw314. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Liu M, Jiang Y, Wedow R, et al. Association studies of up to 1.2 million individuals yield new insights into the genetic etiology of tobacco and alcohol use. Nat Genet 2019;51:237–244. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Sanchez-Roige S, Palmer AA, Fontanillas P, et al. Genome-wide association study meta-analysis of the alcohol use disorders identification test (AUDIT) in two population-based cohorts. Am J Psychiatry 2019;176:107–118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Klimentidis YC, Raichlen DA, Bea J, et al. Genome-wide association study of habitual physical activity in over 377,000 UK Biobank participants identifies multiple variants including CADM2 and APOE. Int J Obes (Lond) 2005 2018;42:1161–1176. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Willer CJ, Schmidt EM, Sengupta S, et al. Discovery and refinement of loci associated with lipid levels. Nat Genet 2013;45:1274–1283. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Evangelou E, Warren HR, Mosen-Ansorena D, et al. Genetic analysis of over 1 million people identifies 535 new loci associated with blood pressure traits. Nat Genet 2018;50:1412–1425. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Xue A, Wu Y, Zhu Z, et al. Genome-wide association analyses identify 143 risk variants and putative regulatory mechanisms for type 2 diabetes. Nat Commun 2018;9:2941. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Yengo L, Sidorenko J, Kemper KE, et al. Meta-analysis of genome-wide association studies for height and body mass index in ~700000 individuals of European ancestry. Hum Mol Genet 2018;27: 3641–3649. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Niarchou M, Byrne EM, Trzaskowski M, et al. Genome-wide association study of dietary intake in the UK biobank study and its associations with schizophrenia and other traits. Transl Psychiatry 2020; 10:51. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Howard DM, Adams MJ, Clarke T-K, et al. Genome-wide meta-analysis of depression identifies 102 independent variants and highlights the importance of the prefrontal brain regions. Nat Neurosci 2019;22:343–352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Jansen PR, Watanabe K, Stringer S, et al. Genome-wide analysis of insomnia in 1,331,010 individuals identifies new risk loci and functional pathways. Nat Genet 2019;51:394–403. [DOI] [PubMed] [Google Scholar]
- 19.Dashti HS, Jones SE, Wood AR, et al. Genome-wide association study identifies genetic loci for self-reported habitual sleep duration supported by accelerometer-derived estimates. Nat Commun 2019; 10:1100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Day FR, Ong KK, Perry JRB. Elucidating the genetic basis of social interaction and isolation. Nat Commun 2018;9:2457. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Lee JJ, Wedow R, Okbay A, et al. Gene discovery and polygenic prediction from a genome-wide association study of educational attainment in 1.1 million individuals. Nat Genet 2018;50:1112–1121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Wells HRR, Freidin MB, Abidin FNZ, et al. GWAS identifies 44 independent associated genomic loci for self-reported adult hearing difficulty in UK Biobank. Am J Hum Genet 2019;105:788–802. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Kunkle BW, Grenier-Boley B, Sims R, et al. Genetic meta-analysis of diagnosed Alzheimer’s disease identifies new risk loci and implicates Aβ, tau, immunity and lipid processing. Nat Genet 2019;51:414–430. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Huang K-L, Marcora E, Pimenova AA, et al. A common haplotype lowers PU.1 expression in myeloid cells and delays onset of Alzheimer’s disease. Nat Neurosci 2017;20:1052–1061. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Deming Y, Li Z, Kapoor M, et al. Genome-wide association study identifies four novel loci associated with Alzheimer’s endophenotypes and disease modifiers. Acta Neuropathol 2017;133: 839–856. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Hibar DP, Adams HHH, Jahanshad N, et al. Novel genetic loci associated with hippocampal volume. Nat Commun 2017;8:13624. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Grasby KL, Jahanshad N, Painter JN, et al. The genetic architecture of the human cerebral cortex. Science 2020;367:eaay6690. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Beecham GW, Hamilton K, Naj AC, et al. Genome-wide association meta-analysis of neuropathologic features of Alzheimer’s disease and related dementias. PLoS Genet 2014;10:e1004606. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Toledo JB, Arnold SE, Raible K, et al. Contribution of cerebrovascular disease in autopsy confirmed neurodegenerative disease cases in the National Alzheimer’s Coordinating Centre. Brain 2013;136: 2697–2706. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Lambert J-C, Ibrahim-Verbaas CA, Harold D, et al. Meta-analysis of 74,046 individuals identifies 11 new susceptibility loci for Alzheimer’s disease. Nat Genet 2013;45:1452–1458. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Hibar DP, Stein JL, Renteria ME, et al. Common genetic variants influence human subcortical brain structures. Nature 2015;520: 224–229. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Kent WJ, Sugnet CW, Furey TS, et al. The human genome browser at UCSC. Genome Res 2002;12:996–1006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Lyon MS, Andrews SJ, Elsworth BL, et al. The variant call format provides efficient and robust storage of GWAS summary statistics. Biorxiv 2020;1–27. 10.1101/2020.05.29.115824. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Naj AC, Jun G, Beecham GW, et al. Common variants at MS4A4/MS4A6E, CD2AP, CD33 and EPHA1 are associated with late-onset Alzheimer’s disease. Nat Genet 2011;43:436–441. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Manichaikul A, Mychaleckyj JC, Rich SS, et al. Robust relationship inference in genome-wide association studies. Bioinformatics 2010; 26:2867–2873. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Conomos MP, Miller MB, Thornton TA. Robust inference of population structure for ancestry prediction and correction of stratification in the presence of relatedness. Genet Epidemiol 2015;39:276–293. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Purcell S, Neale B, Todd-Brown K, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 2007;81:559–575. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Consortium HR, McCarthy S, Das S, et al. A reference panel of 64,976 haplotypes for genotype imputation. Nat Genet 2016;48: 1279–1283. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Choi SW, O’Reilly PF. PRSice-2: polygenic risk score software for biobank-scale data. Gigascience 2019;8:1–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Coombes BJ, Ploner A, Bergen SE, Biernacka JM. A principal component approach to improve association testing with polygenic risk scores. Genet Epidemiol 2020;44:676–686. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Shim H, Chasman DI, Smith JD, et al. A multivariate genome-wide association analysis of 10 LDL subfractions, and their response to statin treatment, in 1868 Caucasians. PLoS One 2015;10:e0120758. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Burgess S, Thompson SG, CRP CHD Genetics Collaboration. Avoiding bias from weak instruments in Mendelian randomization studies. Int J Epidemiol 2011;40:755–764. [DOI] [PubMed] [Google Scholar]
- 43.Verbanck M, Chen C-Y, Neale B, Do R. Detection of widespread horizontal pleiotropy in causal relationships inferred from Mendelian randomization between complex traits and diseases. Nat Genet 2018;50:693–698. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Storey JD. A direct approach to false discovery rates. J R Stat Soc Ser B Stat Methodol 2002;64:479–498. [Google Scholar]
- 45.Brion M-JA, Shakhbazov K, Visscher PM. Calculating statistical power in Mendelian randomization studies. Int J Epidemiol 2013;42: 1497–1501. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Koster J, Rahmann S. Snakemake—a scalable bioinformatics workflow engine. Bioinformatics 2012;28:2520–2522. [DOI] [PubMed] [Google Scholar]
- 47.Stern Y. Cognitive reserve in ageing and Alzheimer’s disease. Lancet Neurol 2012;11:1006–1012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Kivimäki M, Singh-Manoux A, Pentti J, et al. Physical inactivity, cardiometabolic disease, and risk of dementia: an individual-participant meta-analysis. BMJ 2019;l1495:365. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Korologou-Linden R, Anderson EL, Howe LD, et al. The causes and consequences of Alzheimer’s disease: a Mendelian randomization analysis. Medrxiv 2019;1–43. 10.1101/2019.12.18.19013847. [DOI] [Google Scholar]
- 50.Larsson SC, Traylor M, Malik R, et al. Modifiable pathways in Alzheimer’s disease: Mendelian randomisation analysis. BMJ 2017; 359:j5375. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Østergaard SD, Mukherjee S, Sharp SJ, et al. Associations between potentially modifiable risk factors and Alzheimer disease: a Mendelian randomization study. PLoS Med 2015;12:e1001841. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Pierce BL, Burgess S. Efficient design for Mendelian randomization studies: subsample and 2-sample instrumental variable estimators. Am J Epidemiol 2013;178:1177–1184. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Hemani G, Bowden J, Smith GD. Evaluating the potential role of pleiotropy in Mendelian randomization studies. Hum Mol Genet 2018;27:R195–R208. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Gkatzionis A, Burgess S. Contextualizing selection bias in Mendelian randomization: how bad is it likely to be? Int J Epidemiol 2018;48: 691–701. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.