Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2023 Oct 1.
Published in final edited form as: Neurobiol Aging. 2022 May 27;118:117–123. doi: 10.1016/j.neurobiolaging.2022.05.011

An integrated genome and phenome-wide association study approach to understanding Alzheimer’s disease predisposition

Archita S Khaire 1, Courtney E Wimberly 1, Eleanor C Semmes 2,3, Jillian H Hurst 3, Kyle M Walsh 1,3,4,*
PMCID: PMC9787699  NIHMSID: NIHMS1831630  PMID: 35715361

Abstract

Genome-wide association studies (GWAS) have identified common single nucleotide polymorphisms (SNPs) that increase late-onset Alzheimer’s disease (LOAD) risk. To identify additional LOAD-associated variants and provide insight into underlying disease biology, we performed a phenome-wide association study (PheWAS) on 23 known LOAD-associated SNPs and 4:1 matched control SNPs using UK Biobank data. LOAD-associated SNPs were significantly enriched for associations with 8/778 queried traits, including three platelet traits. The strongest enrichment was for platelet distribution width (PDW) (P=1.2×10−5), but increased PDW was not associated with LOAD susceptibility in Mendelian randomization analysis. Of 384 PDW-associated SNPs identified by prior GWAS, 36 were nominally associated with LOAD risk (17,008 cases; 37,154 controls) and 5 survived false-discovery rate correction. Associations confirmed known LOAD risk loci near PICALM, CD2AP, SPI1, and NDUFAF6, and identified a novel risk locus in EGFR. Integrating GWAS and PheWAS data reveals substantial pleiotropy between genetic determinants of LOAD and of platelet morphology, and for the first time implicates EGFR – a mediator of β-amyloid toxicity – in Alzheimer’s disease susceptibility.

Keywords: Alzheimer’s disease, GWAS, PheWAS, EGFR, genetic epidemiology

INTRODUCTION

Late-onset Alzheimer’s disease (LOAD) is defined by the development of symptoms after age 65 and is the most common form of dementia. LOAD affects an estimated 5.7 million Americans, a number projected to grow rapidly with demographic trends.(Rabinovici 2019) Genome-wide association studies (GWAS) have greatly enhanced our understanding of the etiology of LOAD.(Bruni, Bernardi, and Gabelli 2020; Lewcock et al. 2020) A recent GWAS meta-analysis identified common heritable risk alleles at 24 statistically independent genetic loci, implicating genes involved in immunity, lipid metabolism, tau and amyloid-β (Aβ) in LOAD pathogenesis.(Kunkle et al. 2019) Although collaborative GWAS meta-analyses including >90k LOAD patients have enabled identification of low-frequency risk alleles, gains in statistical power are increasingly limited. Furthermore, functional genomics studies to reveal the biologic consequences of associated single nucleotide polymorphisms (SNPs) through in silico and in vivo analyses remain limited.(Andrews, Fulton-Howard, and Goate 2020) The use of polygenic score and Mendelian randomization (MR) approaches,(Shi et al. 2021; Zhang, Wang, and Liu 2021; Li et al. 2021; Handy et al. 2021) gene-pathway analyses,(Gao et al. 2018) and phenome-wide association studies (PheWAS)(Lumsden et al. 2020) can augment traditional GWAS approaches and expand our understanding of the underlying biology behind LOAD pathogenesis.

PheWAS are an approach to analyze a collection of phenotypes for their association with a particular genetic variant of interest, and they have recently been made feasible through the integration of electronic health record (EHR) data with large-scale genomic datasets.(Denny, Bastarache, and Roden 2016) A PheWAS essentially inverts the classic GWAS approach, testing a large collection of phenotypes (rather than a single disease or trait) for association with a single genetic variant (rather than millions of variants distributed across the genome). However, both GWAS and PheWAS typically test SNP-trait associations through the same multivariable regression-based frameworks, including adjustment for subject ancestry to control for population stratification.(Bush, Oetjens, and Crawford 2016)

By expanding the list of phenotypes associated with a variant of interest, performing PheWAS on a known disease-associated SNP can identify potential co-morbid conditions,(Byun et al. 2021) as well as traits that may mediate the SNP-disease association.(Pettit et al. 2021) For example, a PheWAS of the lung cancer risk-variant rs16969968 reveals numerous strong associations with various smoking-related behaviors (e.g., cigarettes per day) that mediate the variant’s association with lung cancer risk.(Millard et al. 2019) Identifying such “intermediate traits” connecting LOAD risk alleles to causal mechanisms of LOAD pathogenesis has clear relevance for disease prevention and population-based risk reduction.(Semmes, Zhang, and Walsh 2018) Furthermore, by identifying traits that have overlapping genetic architecture with a disease of interest, investigators can curate the genetic determinants of such traits to generate an empirical list of candidate-SNPs and test these for pleiotropic association with the disease under study.(Semmes et al. 2020)

The strongest genetic risk factor for sporadic LOAD is the APOE ε4 allele, and a PheWAS of APOE gene alleles (ε4, ε3 and ε2) was recently performed.(Lumsden et al. 2020) Although that study identified 14 non-neurologic diseases associated with APOE alleles, including hypercholesterolemia and ischemic heart disease, a comprehensive PheWAS analysis of all known LOAD risk loci has not been conducted. We recently published an analytic strategy that integrates GWAS and PheWAS data for a disease of interest in order to identify both epidemiologic traits and trait-associated genetic variants that influence disease risk.(Semmes et al. 2020) By applying our approach to LOAD, we attempt to leverage genetic pleiotropy between LOAD and other traits to identify new LOAD risk factors and to nominate biological pathways through which LOAD risk alleles may exert their pathogenic effects.

MATERIALS AND METHODS

Selection of LOAD risk alleles.

Hundreds of studies have investigated the genetic epidemiology of LOAD, with associated alleles replicating to highly variable degrees in subsequent research. Associated variants discovered by GWAS are historically more likely to replicate in future studies due to their use of more stringent thresholds for determining statistical significance, lack of an a priori hypothesis, and tendency to externally replicate associated variants within the primary publication.(Walsh et al. 2013) We compiled a list of common (minor allele frequency ≥0.01 in European-ancestry populations) LOAD-associated genetic variants for PheWAS analysis from the most recent and largest GWAS meta-analysis of LOAD published at the time, which included 35,274 clinical and autopsy-documented LOAD cases and 59,163 controls genotyped or imputed for 9,456,058 SNPs.(Kunkle et al. 2019) Using a list of LOAD-associated variants reaching canonical genome-wide statistical significance (i.e. P<5.0×10−8) in their study, we performed linkage disequilibrium (LD) pruning (R2≤0.15 in European-ancestry populations) using LDlink and retained the variant with the lowest P-value.(Machiela and Chanock 2015) A total of 24 independent LOAD-associated variants were carried forward for PheWAS analyses, including 19 SNPs identified by prior GWAS of LOAD and 5 novel variants that were successfully replicated within that publication.

Control SNP Set.

We compiled a comparison SNP-set to serve as controls for PheWAS analyses. A set of unlinked control SNPs (1000 Genomes Project) was generated using SNPsnap (Broad Institute).(Pers, Timshel, and Hirschhorn 2015) Four control SNPs were matched to the LOAD-associated SNPs on: minor allele frequency (±5%), surrounding gene density (±50%), distance to nearest gene (±50%) and, as a proxy for haplotype block size, the number of other SNPs in LD at R2≥0.50 (±50%). For several LOAD risk SNPs, we could not generate more than 4 control SNPs without loosening our matching parameters, but the gain in statistical power achieved beyond a case-to-control ratio of 1:4 is typically minimal.(Hennessy et al. 1999) For one LOAD-associated SNP in the MHC class II region on chromosome 6 (rs9270816), no appropriate control SNPs could be matched. Therefore, rs9270816 was excluded from downstream analyses and a total of 23 LOAD-associated SNPs and 92 control SNPs were carried forward for PheWAS analyses.

PheWAS analyses.

The UK Biobank atlas of genetic associations (http://geneatlas.roslin.ed.au.uk/) was constructed by genotyping 452,264 European-ancestry individuals for 805,426 genetic variants, performing genome-wide SNP imputation and quality-controls, and linking genetic data to EHR data. GeneATLAS contains data for 778 traits (118 quantitative, 660 binary) and associations with 9,113,133 genetic variants (genotyped or imputed). GeneATLAS is searchable and can be queried for genetic (e.g., SNPs) or phenotypic (e.g., height) data to assess genotype-phenotype associations.(Canela-Xandri, Rawlik, and Tenesa 2018)

We queried GeneATLAS for trait associations with 23 known LOAD risk SNPs and the control SNP-set (92 matched SNPsnap controls). Summary statistics for traits associated with each queried variant were downloaded from GeneATLAS for downstream analyses. Nominally significant SNP-trait associations (P<0.01) were carried forward to trait-enrichment analyses, as previously described.(Semmes et al. 2020) Although a more stringent p-value threshold for carrying SNP-trait associations forward was considered (e.g. 0.05/778), this was determined to be too conservative because many of the 778 traits in GeneATLAS have high genetic correlations with each other (e.g., weight and hip circumference, 0.909; reticulocyte percentage and reticulocyte count, 0.952). Additionally, these individual SNP-trait associations were carried forward for trait-enrichment comparisons between LOAD-associated SNPs and the control SNP-set, and as such the PheWAS significance threshold is somewhat arbitrary so long as it is uniform across all SNP-sets.

PheWAS associations of the 23 LOAD SNPs with 778 phenotypic traits were compared to PheWAS results for the control SNP-set using the Python Programming Environment (https://www.python.org, version 3.6.9). For traits associated with >1 LOAD SNP, we used Fisher’s exact tests to compare the frequency at which individual traits were associated with LOAD SNPs versus control SNPs to determine if traits were enriched for association with known LOAD risk variants. Fisher’s exact p-values for trait enrichment underwent FDR correction to adjust for multiple testing.

SNPs associated with PheWAS-identified trait (platelet distribution width).

We identified 546 unlinked (R2≤0.15 in European-ancestry populations) SNPs with minor allele frequency (MAF) >0.01 that were independently associated with platelet distribution width (PDW) at genome-wide statistical significance in a recent GWAS (N=563,085 subjects).(Vuckovic et al. 2020) Variants, effect alleles, effect sizes, and standard errors were extracted for downstream single-SNP association testing with LOAD and for two-sample Mendelian randomization analysis with LOAD.

Mendelian randomization analyses.

To assess for a causal relationship between PDW and LOAD risk, we performed formal two-sample MR analyses with the R package “MendelianRandomization”.(Yavorska and Burgess 2017; Burgess and Thompson 2013) Using summary statistics of SNP-exposure (i.e., PDW) and SNP-outcome (i.e., LOAD) associations, we used the (1) inverse-variance weighted (IVW), (2) MR-Egger, and (3) weighted median methods to test for a causal relationship between PDW and LOAD risk.(Bowden et al. 2016; Bowden, Davey Smith, and Burgess 2015)

LOAD case-control data.

To determine whether PDW-associated SNPs impact risk of LOAD at the single-variant level or under a Mendelian randomization framework, we leveraged SNP summary-level data from a recent GWAS of LOAD. This GWAS included 17,008 European-ancestry LOAD patients and 37,154 controls from the International Genomics of Alzheimer’s Project (I-GAP).(Lambert et al. 2013) Of these LOAD patients, 62% had a clinical or MRI-based diagnosis and 22% had autopsy-confirmed AD. Genotypes were imputed with the use of European population reference (EUR) haplotype data from the 1000 Genomes Project, and SNPs that did not pass study-level genotyping quality-control (QC) measures or had a MAF <0.01 were excluded, as were SNPs that did not pass study-level imputation QC measures (INFO score ≥0.30) or were not successfully genotyped/imputed in at least 40% of subjects across studies. The association of LOAD with genotype dosage was analyzed using logistic regression, adjusting for age, sex and ancestry-informative principal components (for three studies with incident LOAD data, Cox proportional hazards models were used). A total of 7,055,881 SNPs with MAF>0.01 are included in the final I-GAP meta-analysis results. Of 546 PDW-associated SNPs with MAF >0.01 from Vuckovic, et al.,(Vuckovic et al. 2020) 384 had data available from I-GAP after QC filtering.

eQTL and in silico SNP functional analyses.

We characterized SNPs in novel LOAD risk loci using HaploReg to annotate chromatin state and regulatory motifs.(Ward and Kellis 2016) We examined whether variants were protein-binding, located in DNAse hypersensitive sites, promoter or enhancer histone marks, predicted to change transcription factor binding motifs, or functioned as expression quantitative trait loci (eQTL) in diverse tissues,(Ward and Kellis 2016) including the brain.(Zou et al. 2012)

RESULTS

An overview of the published methodology adapted to our study of LOAD is displayed in Figure 1.(Semmes et al. 2020) We first identified 24 common LOAD-associated SNPs from prior GWAS of European-ancestry populations, of which 23 were successfully matched to 4 control SNPs each (Table 1 and Supplementary Table 1). PheWAS analyses were then performed with the UK Biobank GeneATLAS database to test each LOAD-associated variant and control variant for association with 778 traits. After determining which PheWAS-identified traits were enriched for association with the LOAD SNP-set (n = 23) compared to the control SNP-set (n = 92), we identified SNPs associated with the enriched PheWAS-identified trait in prior GWAS. Using MR and candidate-SNP approaches, we tested whether enriched traits or trait-associated SNPs conferred LOAD risk in I-GAP data.

Figure 1. Methodology for integrative analysis of GWAS and PheWAS data.

Figure 1.

This figure illustrates our approach for investigating phenotype associations with known disease risk-variants in order to identify novel candidate risk loci and/or intermediate phenotypes for subsequent analysis in case-control cohorts. Applying this approach to late-onset Alzheimer’s disease (LOAD) identified platelet distribution width (PDW) as a phenotype enriched for association with LOAD GWAS hits and informed downstream analysis of the role of PDW-associated variants in conferring LOAD risk in a case-control sample. Created with Biorender.com.

a Genome-wide meta-analysis of 35,274 clinical and autopsy-documented LOAD cases and 59,163 controls (Kunkle, et al.)

b PheWAS catalog (UK Biobank GeneATLAS) - http://geneatlas.roslin.ed.ac.uk/phewas/

c Four control SNPs were matched to each LOAD-associated SNP on minor allele frequency (±5%), surrounding gene density (±50%), distance to nearest gene (±50%) and the number of other SNPs in LD at R2≥0.50 (±50%) using the SNPsnap software (Broad Institute) - https://data.broadinstitute.org/mpg/snpsnap/

d Genome-wide meta-analysis of platelet traits in 563,085 subjects (Vuckovic, et al.)

e Summary-level SNP association data available from the I-GAP genome-wide meta-analysis LOAD patients and controls (Lambert, et al.)

Abbreviations: GWAS, genome-wide association study; PheWAS, phenome-wide association study; LOAD, late-onset Alzheimer’s disease; SNPs, single nucleotide polymorphisms.

Table 1.

Common genetic variants associated with late-onset Alzheimer’s disease (LOAD) in prior GWAS and included in our PheWAS-based enrichment analyses.

Position (Chr:BP)a rsID MAFb Function Nearest Gene
1:207802552 rs4844610 0.19 intronic CR1
2:127892810 rs6733839 0.40 intergenic BIN1
2:233981912 rs10933431 0.22 intergenic INPP5D
6:47431284 rs9473117 0.27 intergenic CD2AP
7:100091795 rs12539172 0.32 3’ UTR NYAP1
7:143099133 rs10808026 0.23 intronic EPHA1
8:27219987 rs73223431 0.35 intronic PTK2B
8:27467686 rs9331896 0.40 intronic CLU
10:11720308 rs7920721 0.36 intergenic ECHDC3
11:47380340 rs3740688 0.46 intronic SPI1
11:59936926 rs7933202 0.40 intergenic MS4A6A
11:85868640 rs3851179 0.38 intergenic PICALM
11:121435587 rs11218343 0.04 intronic SORL1
14:53391680 rs17125924 0.08 intronic FERMT2
14:92932828 rs12881735 0.22 intronic SLC24A4
15:59045774 rs593742 0.31 intergenic ADAM10
16:19808163 rs7185636 0.15 intronic IQCK
16:79355857 rs62039712 0.10 intergenic WWOX
17:61538148 rs138190086 0.01 intergenic ACE
19:1056492 rs3752246 0.18 missense ABCA7
19:45411941 rs429358 0.14 missense APOE
20:54997568 rs6024870 0.08 intronic CASS4
21:28156856 rs2830500 0.30 intergenic ADAMTS1
a

Position in GRCh37/hg19

b

Minor Allele Frequency in European-ancestry subjects from the 1000 Genomes Project

PheWAS analyses.

216 of the 778 traits in the GeneATLAS database were nominally associated (P<0.01) with >1 of the 23 LOAD-associated SNPs and were tested for enrichment versus the control SNP-set (Table S2). A total of eight traits were more likely to be associated with LOAD SNPs than with control SNPs at nominal statistical significance (P<0.05), including: platelet distribution width (PDW), platelet crit, platelet count, leukocyte count, immature reticulocyte fraction, cheese intake, high light scatter reticulocyte percentage, and current tobacco smoking (Table 2). PDW was the PheWAS-identified trait most enriched for association with LOAD SNPs, as 11 out of 23 (47.8%) LOAD SNPs were associated with platelet distribution width at P<0.01 compared to 6 of 92 (6.5%) control SNPs (Fisher’s exact P=1.2×10−5). Moreover, this association survived both FDR correction (PFDR=2.6×10−3) and Bonferroni correction for all 778 traits examined (P=9.3×10−3). We did not observe any consistent direction of effect between PDW and LOAD risk, as five LOAD risk SNPs were associated with increased PDW and six LOAD risk SNPs were associated with decreased PDW. Although the other seven traits appearing in Table 2 did not demonstrate significant enrichment for association with LOAD SNPs after FDR correction, it is noteworthy that they include two additional platelet-related traits (platelet crit and platelet count).

Table 2:

Traits more likely to be associated with late-onset Alzheimer’s disease (LOAD) risk SNPs than with control SNPs in PheWAS enrichment analyses. The number of LOAD SNPs and control SNPs assocated with each trait at P<0.01 among 452,264 European-ancestry individuals from UK Biobank are listed, along with median z-scores for these associations. P-values are calculated using Fisher’s exact test, comparing the frequency at which LOAD SNPs and matched control SNPs were associated with each PheWAS trait.

Trait Associated LOAD SNPs (median z-score) Associated control SNPs (median z-score) P-value P-value(FDR)
Platelet distribution width 11/23 (3.31) 6/92 (4.42) 1.2×10−5 2.6×10−3
Platelet crit 10/23 (3.27) 11/92 (3.19) 1.4×10−3 0.15
Leukocyte count 7/23 (5.29) 5/92 (3.31) 2.2×10−3 0.16
Cheese intake 4/23 (3.25) 1/92 (3.22) 5.5×10−3 0.25
Platelet count 8/23 (3.92) 9/92 (3.51) 5.9×10−3 0.25
Immature reticulocyte fraction 5/23 (5.06) 3/92 (3.21) 7.9×10−3 0.28
High light scatter reticulocyte percentage 6/23 (3.85) 6/92 (3.23) 0.014 0.39
Current tobacco smoking 4/23 (3.29) 2/92 (3.05) 0.014 0.39

Mendelian randomization (MR) analyses.

Although the eleven LOAD SNPs associated with PDW did not demonstrate associations in a consistent direction (i.e., increased versus decreased PDW), we further evaluated the relationship between PDW and LOAD risk using a larger set of variants underlying PDW biology. To test for a causal relationship between PDW and LOAD, we used several MR analytical approaches leveraging 384 known PDW-associated SNPs from prior GWAS as instrumental variables and summary statistics for these SNPs from the I-GAP GWAS of LOAD. Estimates from IVW (PIVW=0.27), MR-Egger (PMR-Egger=0.69, PMR-intercept=0.19), and median-weighted (PMR-median=0.64) methods were all non-significant, indicating that PDW does not mediate LOAD risk and that there is likely no causal relationship between the traits.

PDW-associated SNPs as candidate LOAD risk loci.

To examine whether individual PDW-associated variants might have pleiotropic effects on LOAD risk, we evaluated single-SNP associations of 384 PDW-associated SNPs with LOAD risk in 17,008 LOAD cases and 37,154 controls from I-GAP. Thirty-six SNPs were nominally associated (P<0.05) with LOAD case-control status (Table S3), of which five remained associated after FDR correction (Table 3). These associations confirmed known LOAD risk loci near PICALM, CD2AP, SPI1, and NDUFAF6, and identified a novel risk locus at rs12535226 in the epidermal growth factor receptor (EGFR) gene (OR=1.06; 95% CI: 1.02–1.09). The rs12535226 variant is located the first intron of EGFR and has a minor allele frequency of 48% in European-ancestry populations. In silico functional analysis provided strong evidence of this variant having a regulatory role, as it is located in an enhancer region enriched for H3K27Ac histone marks in both brain tissues and neurosphere cultures,(Ward and Kellis 2016) and was identified as an EGFR expression quantitative trait locus (eQTL) in both the cerebellum and temporal cortex.(Zou et al. 2012) A PheWAS of the rs12535226 variant in UK Biobank data revealed strong association with many hematologic traits (e.g., monocyte count, leukocyte count, eosinophil count) and nominal associations several vascular traits, including cardiac arrhythmias (P=6.2×10−3), cerebrovascular disease (P=0.033) and cerebral infarction (P=0.012) (Table S4).

Table 3:

Variants achieving genome-wide statistical significance in a prior GWAS of platelet distribution width (Vuckovic, et al.) and their association with late-onset Alzheimer’s disease (LOAD) risk (Lambert et al. 2013) (17,008 patients; 37,154 controls). A total of 384 sentinel variants from Vuckovic, et al. were assessed for association with LOAD risk and appear in Supplementary Table 3, of which five remained associated after FDR correction and appear here.

Variant ID Position (Chr:BP)a P-value P-value(FDR) Nearest gene(s) Previously-identified LOAD risk locus
rs3851179 11:85868640 2.8×10−15 1.1×10−12 EED, PICALM Yes
rs4711890 6:47606712 4.9×10−6 9.4×10−4 CD2AP Yes
rs7107356 11:47676170 8.4×10−6 1.1×10−3 MTCH2, AGBL2 Yes
rs12535226 7:55156419 4.3×10−4 0.034 EGFR No
rs4735333 8:95955074 4.5×10−4 0.034 TP53INP1, NDUFAF6 Yesb
a

Position in GRCh37/hg19

b

This region approached genome-wide statistical significance in a prior GWAS of LOAD(Kunkle et al. 2019) (lead SNP rs4735340, P=9.2×10−8), but was not independently replicated and thus is not yet considered a validated LOAD risk locus.

DISCUSSION

We apply an innovative and previously-validated framework for leveraging existing GWAS and PheWAS data to uncover traits associated with Alzheimer’s risk and to identify trait-associated variants as candidate LOAD risk loci. We first curated SNPs associated with LOAD from large GWAS meta-analyses and matched these LOAD risk variants to control SNPs on multiple parameters, including MAF, gene density, distance to nearest gene, and the number of other SNPs in LD. Performing PheWAS on these LOAD and control SNPs using UK Biobank data, we identified platelet distribution width (PDW) as being significantly enriched for association with LOAD risk loci. Although MR analysis did not implicate PDW as having a causal role in LOAD pathogenesis, a number of PDW-associated SNPs demonstrated pleiotropic associations with LOAD risk, including a novel association between LOAD and an EGFR eQTL variant.

Although PDW was significantly enriched for association with LOAD risk SNPs, MR analyses did not suggest the presence of vertical pleiotropy, wherein PDW directly contributes to LOAD risk. However, the large number of pleiotropic loci observed to influence both PDW and LOAD suggest that substantial horizontal pleiotropy exists between these traits, which occurs when a variant affects LOAD directly or through multiple pathways not limited to platelet morphology. This appears reasonable given the emerging role of immune function in LOAD pathogenesis,(Lewcock et al. 2020) and is supported by our single-SNP association results that identified known and novel LOAD risk loci using PDW-associated variants as empirically-derived candidate SNPs. Confirming the validity of this approach, our analyses re-identified variants in SPI1, a transcriptional activator of myeloid and B cell developmental gene expression previously linked to LOAD pathogenesis. They also identified a PDW-associated variant upstream of NDUFAF6, a region that has approached genome-wide statistical significance in a prior GWAS of LOAD (P=9.2×10−8) but has not yet emerged as a confirmed risk locus and was therefore not included among the LOAD SNPs used in our PheWAS analysis.

A novel LOAD risk locus in EGFR was implicated by our integrated analyses and the risk allele has been associated with elevated EGFR expression levels in human brain tissues.(Zou et al. 2012) EGFR, a member of the Human Epidermal Growth Factor Receptor 1 (ErbB) family, is a transmembrane tyrosine kinase receptor that initiates cell differentiation and proliferation. EGFR expression increases during neurogenesis and gliogenesis, and it activates pathways involved in oncogene expression.(Voldborg et al. 1997) EGFR plays a crucial role in the development of cancer, especially non-small lung cell carcinoma and glioblastoma, and extensive research has been completed evaluating it as a therapeutic target in the oncology setting.(da Cunha Santos, Shepherd, and Tsao 2011; Congdon et al. 2014) However, its function in the pathology of neurodegenerative diseases, such as LOAD, has received comparatively little attention.(Mansour et al. 2021)

Heritable risk alleles in EGFR have been associated with glioblastoma in prior GWAS, with lead variant rs2252586 associated with a 1.15-fold increased risk.(Walsh et al. 2013) EGFR levels are anticipated to be more modest in the brains of LOAD patients than in glioblastoma tumors, as these tumors frequently harbor high-copy EGFR gene amplifications or express constitutively active mutant EGFR isoforms. Although the LOAD-associated EGFR variant we identify is associated with a more modest 1.06-fold increase in risk, EGFR-directed therapies may yet hold promise for the treatment of LOAD due to the lack of activating somatic mutations and, consequently, a potentially lower threshold of activity required for a drug to elicit clinical benefit. However, it remains to be determined whether EGFR activity is relevant only to LOAD development, or also plays a role in disease progression/severity. Further studies in well-characterized LOAD patient populations, with imaging and endo-phenotyping data, will be required to address these important questions.

A strongly supported theory for LOAD pathogenesis is the amyloid cascade hypothesis, positing that Aβ plaque accumulation leads to “synaptic dysfunction, tau pathology, and neuronal loss”.(Saraceno et al. 2013) The Aβ42 isoform increases in quantity in forms of AD, forming the characteristic amyloid plaques associated with Alzheimer’s neuropathology.(Schmidt et al. 2009) In Drosophila model systems, research suggests that Aβ-induced memory loss results from activation of EGFR by Aβ42 oligomers, providing evidence that EGFR may function as a cell membrane receptor of Aβ peptides.(Wang et al. 2012) Following brain injury, EGFR expression increases, leading to the conversion of quiescent astrocytes to reactive astrocytes.(Liu and Neufeld 2007) While quiescent astrocytes have a constructive function in the developing CNS, reactive astrocytes are destructive and lead to neuronal loss. Recent tissue-based analyses have also identified significant differences in EGFR expression and protein levels between LOAD patients and control subjects,(He et al. 2021) and single-cell sequencing analyses implicate EGFR and MAP4K4 upregulation as a biomarker of altered ligand-receptor axis communication between neurons and microglia in AD patient brains.(Jian et al. 2021) These emerging data complement our findings and lend support to ongoing work that seeks to repurpose EGFR-targeted cancer therapies to inhibit EGFR as a potential pathway to rescuing memory loss and preventing neuronal loss in LOAD patients.(Tavassoly, Sato, and Tavassoly 2020)

There are several limitations to our study and valid concerns of our hybrid GWAS-PheWAS approach. For the PheWAS analyses, we used a p-value threshold of <0.01 to carry a SNP-trait association forward to enrichment analyses, rather than a Bonferroni-corrected threshold (i.e., 0.05/778 traits). However, we found that this threshold maximized study power in our previous application of this GWAS-PheWAS approach to leukemia etiology.(Semmes et al. 2020) Additionally, this threshold is only for carrying traits forward to subsequent enrichment analyses, and associations from these enrichment comparisons were subjected to FDR correction to account for the number of traits evaluated for potential enrichment with LOAD SNPs. Another limitation is that our analysis relied upon traits evaluable in the U.K. Biobank, and therefore does not permit us to assess enrichment for potential disease-related traits that may have relevance to LOAD pathology, such as cerebral microbleeds or history of viral infections. Our PheWAS analyses also relied upon well-validated LOAD risk loci that had reached genome-wide statistical significance and been replicated in external datasets. Inclusion of additional putative risk alleles meeting less stringent cutoffs could potentially improve study power, but may also dilute effects by including false-positive LOAD associations. Since completing our analyses, an additional GWAS meta-analysis of LOAD has been published, identifying several additional risk alleles.(Wightman et al. 2021) However, more than 50% of LOAD cases included in this study were “proxy cases” with a positive family history of LOAD, which may increase statistical power at the expense of precision. Such studies are useful adjuncts to study LOAD etiology, but do not lend themselves to MR approaches due to potential misspecification of SNP effect sizes.

Although multiple GWAS in the past decade have contributed to our understanding of inherited susceptibility to LOAD, there remains significant missing heritability.(Wightman et al. 2021; Kunkle et al. 2019; Lambert et al. 2013) A recent review on the benefits and pitfalls GWAS emphasized the need for novel analytic approaches to enhance our understanding of genotype-phenotype associations in the post-GWAS era and the utility of large biorepository databases linking EHR and genotyping data, polygenic scores, and innovative study designs.(Wang, Li, and Hakonarson 2010) While increasing GWAS sample size may reveal more associations, new methods for analyzing the wealth of existing data are essential, particularly in unraveling the underlying biological pathways involved in SNP-disease associations. In summary, by integrating GWAS and PheWAS datasets to decipher the complex etiology of LOAD, we identify horizontal pleiotropy with PDW and implicate EGFR variation in disease predisposition. These results should stimulate further mechanistic studies on the role of EGFR in AD pathogenesis to dissect their potential relationship. Such findings will have utility in informing the rational design of future prevention and treatment strategies for an aging population.

Supplementary Material

Table S2
Table S1
Table S3
Table S4

Financial support:

This study was supported by the Neuroscience Initiative of Translating Duke Health (CEW, KMW), the Infectious Diseases Society of America Foundation’s “Microbial Pathogenesis in Alzheimer’s Disease” grant (KMW, CEW, ECS), and the National Institute on Aging of the National Institutes of Health under Award Number P30AG072958 (ASK, KMW). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Footnotes

Conflict of interest disclosure: The authors declare no competing interests.

REFERENCES

  1. Andrews SJ, Fulton-Howard B, and Goate A. 2020. ‘Interpretation of risk loci from genome-wide association studies of Alzheimer’s disease’, Lancet Neurol, 19: 326–35. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Bowden J, Davey Smith G, and Burgess S. 2015. ‘Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression’, Int J Epidemiol, 44: 512–25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Bowden J, Davey Smith G, Haycock PC, and Burgess S. 2016. ‘Consistent Estimation in Mendelian Randomization with Some Invalid Instruments Using a Weighted Median Estimator’, Genet Epidemiol, 40: 304–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Bruni AC, Bernardi L, and Gabelli C. 2020. ‘From beta amyloid to altered proteostasis in Alzheimer’s disease’, Ageing Res Rev, 64: 101126. [DOI] [PubMed] [Google Scholar]
  5. Burgess S, and Thompson SG 2013. ‘Use of allele scores as instrumental variables for Mendelian randomization’, Int J Epidemiol, 42: 1134–44. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Bush WS, Oetjens MT, and Crawford DC 2016. ‘Unravelling the human genome-phenome relationship using phenome-wide association studies’, Nat Rev Genet, 17: 129–45. [DOI] [PubMed] [Google Scholar]
  7. Byun J, Han Y, Ostrom QT, Edelson J, Walsh KM, Pettit RW, Bondy ML, Hung RJ, McKay JD, Amos CI, and Integral-Ilcco for. 2021. ‘The Shared Genetic Architectures Between Lung Cancer and Multiple Polygenic Phenotypes in Genome-Wide Association Studies’, Cancer Epidemiol Biomarkers Prev, 30: 1156–64. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Canela-Xandri O, Rawlik K, and Tenesa A. 2018. ‘An atlas of genetic associations in UK Biobank’, Nat Genet, 50: 1593–99. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Congdon KL, Gedeon PC, Suryadevara CM, Caruso HG, Cooper LJ, Heimberger AB, and Sampson JH 2014. ‘Epidermal growth factor receptor and variant III targeted immunotherapy’, Neuro Oncol, 16 Suppl 8: viii20–5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. da Cunha Santos G, Shepherd FA, and Tsao MS 2011. ‘EGFR mutations and lung cancer’, Annu Rev Pathol, 6: 49–69. [DOI] [PubMed] [Google Scholar]
  11. Denny JC, Bastarache L, and Roden DM 2016. ‘Phenome-Wide Association Studies as a Tool to Advance Precision Medicine’, Annu Rev Genomics Hum Genet, 17: 353–73. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Gao S, Casey AE, Sargeant TJ, and Makinen VP 2018. ‘Genetic variation within endolysosomal system is associated with late-onset Alzheimer’s disease’, Brain, 141: 2711–20. [DOI] [PubMed] [Google Scholar]
  13. Handy A, Lord J, Green R, Xu J, Aarsland D, Velayudhan L, Hye A, Dobson R, Proitsi P, initiative Alzheimer’s Disease Neuroimaging, AddNeuroMed, and Gerad Consortium the. 2021. ‘Assessing Genetic Overlap and Causality Between Blood Plasma Proteins and Alzheimer’s Disease’, J Alzheimers Dis, 83: 1825–39. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. He Z, Le Guen Y, Liu L, Lee J, Ma S, Yang AC, Liu X, Rutledge J, Losada PM, Song B, Belloy ME, Butler RR 3rd, Longo FM, Tang H, Mormino EC, Wyss-Coray T, Greicius MD, and Ionita-Laza I. 2021. ‘Genome-wide analysis of common and rare variants via multiple knockoffs at biobank scale, with an application to Alzheimer disease genetics’, Am J Hum Genet, 108: 2336–53. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Hennessy S, Bilker WB, Berlin JA, and Strom BL 1999. ‘Factors influencing the optimal control-to-case ratio in matched case-control studies’, Am J Epidemiol, 149: 195–7. [DOI] [PubMed] [Google Scholar]
  16. Jian C, Wei L, Mo R, Li R, Liang L, Chen L, Zou C, Meng Y, Liu Y, and Zou D. 2021. ‘Microglia Mediate the Occurrence and Development of Alzheimer’s Disease Through Ligand-Receptor Axis Communication’, Front Aging Neurosci, 13: 731180. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Kunkle BW Grenier-Boley B, Sims R, Bis JC, Damotte V, Naj AC, Boland A, Vronskaya M, van der Lee SJ, Amlie-Wolf A, Bellenguez C, Frizatti A, Chouraki V, Martin ER, Sleegers K, Badarinarayan N, Jakobsdottir J, Hamilton-Nelson KL, Moreno-Grau S, Olaso R, Raybould R, Chen Y, Kuzma AB, Hiltunen M, Morgan T, Ahmad S, Vardarajan BN, Epelbaum J, Hoffmann P, Boada M, Beecham GW, Garnier JG, Harold D, Fitzpatrick AL, Valladares O, Moutet ML, Gerrish A, Smith AV, Qu L, Bacq D, Denning N, Jian X, Zhao Y, Del Zompo M, Fox NC, Choi SH, Mateo I, Hughes JT, Adams HH, Malamon J, Sanchez-Garcia F, Patel Y, Brody JA, Dombroski BA, Naranjo MCD, Daniilidou M, Eiriksdottir G, Mukherjee S, Wallon D, Uphill J, Aspelund T, Cantwell LB, Garzia F, Galimberti D, Hofer E, Butkiewicz M, Fin B, Scarpini E, Sarnowski C, Bush WS, Meslage S, Kornhuber J, White CC, Song Y, Barber RC, Engelborghs S, Sordon S, Voijnovic D, Adams PM, Vandenberghe R, Mayhaus M, Cupples LA, Albert MS, De Deyn PP, Gu W, Himali JJ, Beekly D, Squassina A, Hartmann AM, Orellana A, Blacker D, Rodriguez-Rodriguez E, Lovestone S, Garcia ME, Doody RS, Munoz-Fernadez C, Sussams R, Lin H, Fairchild TJ, Benito YA, Holmes C, Karamujic-Comic H, Frosch MP, Thonberg H, Maier W, Roschupkin G, Ghetti B, Giedraitis V, Kawalia A, Li S, Huebinger RM, Kilander L, Moebus S, Hernandez I, Kamboh MI, Brundin R, Turton J, Yang Q, Katz MJ, Concari L, Lord J, Beiser AS, Keene CD, Helisalmi S, Kloszewska I, Kukull WA, Koivisto AM, Lynch A, Tarraga L, Larson EB, Haapasalo A, Lawlor B, Mosley TH, Lipton RB, Solfrizzi V, Gill M, Longstreth WT Jr., Montine TJ, Frisardi V, Diez-Fairen M, Rivadeneira F, Petersen RC, Deramecourt V, Alvarez I, Salani F, Ciaramella A, Boerwinkle E, Reiman EM, Fievet N, Rotter JI, Reisch JS, Hanon O, Cupidi C, Andre Uitterlinden AG, Royall DR, Dufouil C, Maletta RG, de Rojas I, Sano M, Brice A, Cecchetti R, George-Hyslop PS, Ritchie K, Tsolaki M, Tsuang DW, Dubois B, Craig D, Wu CK, Soininen H, Avramidou D, Albin RL, Fratiglioni L, Germanou A, Apostolova LG, Keller L, Koutroumani M, Arnold SE, Panza F, Gkatzima O, Asthana S, Hannequin D, Whitehead P, Atwood CS, Caffarra P, Hampel H, Quintela I, Carracedo A, Lannfelt L, Rubinsztein DC, Barnes LL, Pasquier F, Frolich L, Barral S, McGuinness B, Beach TG, Johnston JA, Becker JT, Passmore P, Bigio EH, Schott JM, Bird TD, Warren JD, Boeve BF, Lupton MK, Bowen JD, Proitsi P, Boxer A, Powell JF, Burke JR, Kauwe JSK, Burns JM, Mancuso M, Buxbaum JD, Bonuccelli U, Cairns NJ, McQuillin A, Cao C, Livingston G, Carlson CS, Bass NJ, Carlsson CM, Hardy J, Carney RM, Bras J, Carrasquillo MM, Guerreiro R, Allen M, Chui HC, Fisher E, Masullo C, Crocco EA, DeCarli C, Bisceglio G, Dick M, Ma L, Duara R, Graff-Radford NR, Evans DA, Hodges A, Faber KM, Scherer M, Fallon KB, Riemenschneider M, Fardo DW, Heun R, Farlow MR, Kolsch H, Ferris S, Leber M, Foroud TM, Heuser I, Galasko DR, Giegling I, Gearing M, Hull M, Geschwind DH, Gilbert JR, Morris J, Green RC, Mayo K, Growdon JH, Feulner T, Hamilton RL, Harrell LE, Drichel D, Honig LS, Cushion TD, Huentelman MJ, Hollingworth P, Hulette CM, Hyman BT, Marshall R, Jarvik GP, Meggy A, Abner E, Menzies GE, Jin LW, Leonenko G, Real LM, Jun GR, Baldwin CT, Grozeva D, Karydas A, Russo G, Kaye JA, Kim R, Jessen F, Kowall NW, Vellas B, Kramer JH, Vardy E, LaFerla FM, Jockel KH, Lah JJ, Dichgans M, Leverenz JB, Mann D, Levey AI, Pickering-Brown S, Lieberman AP, Klopp N, Lunetta KL, Wichmann HE, Lyketsos CG, Morgan K, Marson DC, Brown K, Martiniuk F, Medway C, Mash DC, Nothen MM, Masliah E, Hooper NM, McCormick WC, Daniele A, McCurry SM, Bayer A, McDavid AN, Gallacher J, McKee AC, van den Bussche H, Mesulam M, Brayne C, Miller BL, Riedel-Heller S, Miller CA, Miller JW, Al-Chalabi A, Morris JC, Shaw CE, Myers AJ, Wiltfang J, O’Bryant S, Olichney JM, Alvarez V, Parisi JE, Singleton AB, Paulson HL, Collinge J, Perry WR, Mead S, Peskind E, Cribbs DH, Rossor M, Pierce A, Ryan NS, Poon WW, Nacmias B, Potter H, Sorbi S, Quinn JF, Sacchinelli E, Raj A, Spalletta G, Raskind M, Caltagirone C, Bossu P, Orfei MD, Reisberg B, Clarke R, Reitz C, Smith AD, Ringman JM, Warden D, Roberson ED, Wilcock G, Rogaeva E, Bruni AC, Rosen HJ, Gallo M, Rosenberg RN, Ben-Shlomo Y, Sager MA, Mecocci P, Saykin AJ, Pastor P, Cuccaro ML, Vance JM, Schneider JA, Schneider LS, Slifer S, Seeley WW, Smith AG, Sonnen JA, Spina S, Stern RA, Swerdlow RH, Tang M, Tanzi RE, Trojanowski JQ, Troncoso JC, Van Deerlin VM, Van Eldik LJ, Vinters HV, Vonsattel JP, Weintraub S, Welsh-Bohmer KA, Wilhelmsen KC, Williamson J, Wingo TS, Woltjer RL, Wright CB, Yu CE, Yu L, Saba Y, Consortium Alzheimer Disease Genetics, Initiative European Alzheimer’s Disease, Heart Cohorts for, Consortium Aging Research in Genomic Epidemiology, Genetic, Polygenic Environmental Risk in Ad/Defining Genetic, Consortium Environmental Risk for Alzheimer’s Disease, Pilotto A, Bullido MJ, Peters O, Crane PK, Bennett D, Bosco P, Coto E, Boccardi V, De Jager PL, Lleo A, Warner N, Lopez OL, Ingelsson M, Deloukas P, Cruchaga C, Graff C, Gwilliam R, Fornage M, Goate AM, Sanchez-Juan P, Kehoe PG, Amin N, Ertekin-Taner N, Berr C, Debette S, Love S, Launer LJ, Younkin SG, Dartigues JF, Corcoran C, Ikram MA, Dickson DW, Nicolas G, Campion D, Tschanz J, Schmidt H, Hakonarson H, Clarimon J, Munger R, Schmidt R, Farrer LA, Van Broeckhoven C, O’ Donovan M C, DeStefano AL, Jones L, Haines JL, Deleuze JF, Owen MJ, Gudnason V, Mayeux R, Escott-Price V, Psaty BM, Ramirez A, Wang LS, Ruiz A, van Duijn CM, Holmans PA, Seshadri S, Williams J, Amouyel P, Schellenberg GD, Lambert JC, and Pericak-Vance MA 2019. ‘Genetic meta-analysis of diagnosed Alzheimer’s disease identifies new risk loci and implicates Abeta, tau, immunity and lipid processing’, Nat Genet, 51: 414–30. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Lambert JC, Ibrahim-Verbaas CA, Harold D, Naj AC, Sims R, Bellenguez C, DeStafano AL, Bis JC, Beecham GW, Grenier-Boley B, Russo G, Thorton-Wells TA, Jones N, Smith AV, Chouraki V, Thomas C, Ikram MA, Zelenika D, Vardarajan BN, Kamatani Y, Lin CF, Gerrish A, Schmidt H, Kunkle B, Dunstan ML, Ruiz A, Bihoreau MT, Choi SH, Reitz C, Pasquier F, Cruchaga C, Craig D, Amin N, Berr C, Lopez OL, De Jager PL, Deramecourt V, Johnston JA, Evans D, Lovestone S, Letenneur L, Moron FJ, Rubinsztein DC, Eiriksdottir G, Sleegers K, Goate AM, Fievet N, Huentelman MW, Gill M, Brown K, Kamboh MI, Keller L, Barberger-Gateau P, McGuiness B, Larson EB, Green R, Myers AJ, Dufouil C, Todd S, Wallon D, Love S, Rogaeva E, Gallacher J, St George-Hyslop P, Clarimon J, Lleo A, Bayer A, Tsuang DW, Yu L, Tsolaki M, Bossu P, Spalletta G, Proitsi P, Collinge J, Sorbi S, Sanchez-Garcia F, Fox NC, Hardy J, Deniz Naranjo MC, Bosco P, Clarke R, Brayne C, Galimberti D, Mancuso M, Matthews F, Initiative European Alzheimer’s Disease, Genetic, Disease Environmental Risk in Alzheimer’s, Consortium Alzheimer’s Disease Genetic, Heart Cohorts for, Epidemiology Aging Research in Genomic, Moebus S, Mecocci P, Del Zompo M, Maier W, Hampel H, Pilotto A, Bullido M, Panza F, Caffarra P, Nacmias B, Gilbert JR, Mayhaus M, Lannefelt L, Hakonarson H, Pichler S, Carrasquillo MM, Ingelsson M, Beekly D, Alvarez V, Zou F, Valladares O, Younkin SG, Coto E, Hamilton-Nelson KL, Gu W, Razquin C, Pastor P, Mateo I, Owen MJ, Faber KM, Jonsson PV, Combarros O, O’Donovan MC, Cantwell LB, Soininen H, Blacker D, Mead S, Mosley TH Jr., Bennett DA, Harris TB, Fratiglioni L, Holmes C, de Bruijn RF, Passmore P, Montine TJ, Bettens K, Rotter JI, Brice A, Morgan K, Foroud TM, Kukull WA, Hannequin D, Powell JF, Nalls MA, Ritchie K, Lunetta KL, Kauwe JS, Boerwinkle E, Riemenschneider M, Boada M, Hiltuenen M, Martin ER, Schmidt R, Rujescu D, Wang LS, Dartigues JF, Mayeux R, Tzourio C, Hofman A, Nothen MM, Graff C, Psaty BM, Jones L, Haines JL, Holmans PA, Lathrop M, Pericak-Vance MA, Launer LJ, Farrer LA, van Duijn CM, Van Broeckhoven C, Moskvina V, Seshadri S, Williams J, Schellenberg GD, and Amouyel P. 2013. ‘Meta-analysis of 74,046 individuals identifies 11 new susceptibility loci for Alzheimer’s disease’, Nat Genet, 45: 1452–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Lewcock JW, Schlepckow K, Di Paolo G, Tahirovic S, Monroe KM, and Haass C. 2020. ‘Emerging Microglia Biology Defines Novel Therapeutic Approaches for Alzheimer’s Disease’, Neuron, 108: 801–21. [DOI] [PubMed] [Google Scholar]
  20. Li M, Lin J, Liang S, Chen Z, Bai Y, Long X, Huang S, and Mo Z. 2021. ‘The role of age at menarche and age at menopause in Alzheimer’s disease: evidence from a bidirectional mendelian randomization study’, Aging (Albany NY), 13: 19722–49. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Liu B, and Neufeld AH 2007. ‘Activation of epidermal growth factor receptors in astrocytes: from development to neural injury’, J Neurosci Res, 85: 3523–9. [DOI] [PubMed] [Google Scholar]
  22. Lumsden AL, Mulugeta A, Zhou A, and Hypponen E. 2020. ‘Apolipoprotein E (APOE) genotype-associated disease risks: a phenome-wide, registry-based, case-control study utilising the UK Biobank’, EBioMedicine, 59: 102954. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Machiela MJ, and Chanock SJ 2015. ‘LDlink: a web-based application for exploring population-specific haplotype structure and linking correlated alleles of possible functional variants’, Bioinformatics, 31: 3555–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Mansour HM, Fawzy HM, El-Khatib AS, and Khattab MM 2021. ‘Potential Repositioning of Anti-cancer EGFR Inhibitors in Alzheimer’s Disease: Current Perspectives and Challenging Prospects’, Neuroscience, 469: 191–96. [DOI] [PubMed] [Google Scholar]
  25. Millard LAC, Munafo MR, Tilling K, Wootton RE, and Davey Smith G. 2019. ‘MR-pheWAS with stratification and interaction: Searching for the causal effects of smoking heaviness identified an effect on facial aging’, PLoS Genet, 15: e1008353. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Pers TH, Timshel P, and Hirschhorn JN 2015. ‘SNPsnap: a Web-based tool for identification and annotation of matched SNPs’, Bioinformatics, 31: 418–20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Pettit RW, Byun J, Han Y, Ostrom QT, Edelson J, Walsh KM, Bondy ML, Hung RJ, McKay JD, and Amos CI 2021. ‘The shared genetic architecture between epidemiological and behavioral traits with lung cancer’, Sci Rep, 11: 17559. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Rabinovici GD 2019. ‘Late-onset Alzheimer Disease’, Continuum (Minneap Minn), 25: 14–33. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Saraceno C, Musardo S, Marcello E, Pelucchi S, and Di Luca M. 2013. ‘Modeling Alzheimer’s disease: from past to future’, Front Pharmacol, 4: 77. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Schmidt M, Sachse C, Richter W, Xu C, Fandrich M, and Grigorieff N. 2009. ‘Comparison of Alzheimer Abeta(1–40) and Abeta(1–42) amyloid fibrils reveals similar protofilament structures’, Proc Natl Acad Sci U S A, 106: 19813–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Semmes EC, Vijayakrishnan J, Zhang C, Hurst JH, Houlston RS, and Walsh KM 2020. ‘Leveraging Genome and Phenome-Wide Association Studies to Investigate Genetic Risk of Acute Lymphoblastic Leukemia’, Cancer Epidemiol Biomarkers Prev, 29: 1606–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Semmes EC, Zhang C, and Walsh KM 2018. ‘Intermediate phenotypes underlying osteosarcoma risk’, Oncotarget, 9: 37345–46. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Shi Y, Liu R, Guo Y, Li Q, Zhou H, Yu S, Liang H, and Li Z. 2021. ‘An Updated Mendelian Randomization Analysis of the Association Between Serum Calcium Levels and the Risk of Alzheimer’s Disease’, Front Genet, 12: 731391. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Tavassoly O, Sato T, and Tavassoly I. 2020. ‘Inhibition of Brain Epidermal Growth Factor Receptor Activation: A Novel Target in Neurodegenerative Diseases and Brain Injuries’, Mol Pharmacol, 98: 13–22. [DOI] [PubMed] [Google Scholar]
  35. Voldborg BR, Damstrup L, Spang-Thomsen M, and Poulsen HS 1997. ‘Epidermal growth factor receptor (EGFR) and EGFR mutations, function and possible role in clinical trials’, Ann Oncol, 8: 1197–206. [DOI] [PubMed] [Google Scholar]
  36. Vuckovic D, Bao EL, Akbari P, Lareau CA, Mousas A, Jiang T, Chen MH, Raffield LM, Tardaguila M, Huffman JE, Ritchie SC, Megy K, Ponstingl H, Penkett CJ, Albers PK, Wigdor EM, Sakaue S, Moscati A, Manansala R, Lo KS, Qian H, Akiyama M, Bartz TM, Ben-Shlomo Y, Beswick A, Bork-Jensen J, Bottinger EP, Brody JA, van Rooij FJA, Chitrala KN, Wilson PWF, Choquet H, Danesh J, Di Angelantonio E, Dimou N, Ding J, Elliott P, Esko T, Evans MK, Felix SB, Floyd JS, Broer L, Grarup N, Guo MH, Guo Q, Greinacher A, Haessler J, Hansen T, Howson JMM, Huang W, Jorgenson E, Kacprowski T, Kahonen M, Kamatani Y, Kanai M, Karthikeyan S, Koskeridis F, Lange LA, Lehtimaki T, Linneberg A, Liu Y, Lyytikainen LP, Manichaikul A, Matsuda K, Mohlke KL, Mononen N, Murakami Y, Nadkarni GN, Nikus K, Pankratz N, Pedersen O, Preuss M, Psaty BM, Raitakari OT, Rich SS, Rodriguez BAT, Rosen JD, Rotter JI, Schubert P, Spracklen CN, Surendran P, Tang H, Tardif JC, Ghanbari M, Volker U, Volzke H, Watkins NA, Weiss S, Million Veteran Program VA, Cai N, Kundu K, Watt SB, Walter K, Zonderman AB, Cho K, Li Y, Loos RJF, Knight JC, Georges M, Stegle O, Evangelou E, Okada Y, Roberts DJ, Inouye M, Johnson AD, Auer PL, Astle WJ, Reiner AP, Butterworth AS, Ouwehand WH, Lettre G, Sankaran VG, and Soranzo N. 2020. ‘The Polygenic and Monogenic Basis of Blood Traits and Diseases’, Cell, 182: 1214–31 e11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Walsh KM, Anderson E, Hansen HM, Decker PA, Kosel ML, Kollmeyer T, Rice T, Zheng S, Xiao Y, Chang JS, McCoy LS, Bracci PM, Wiemels JL, Pico AR, Smirnov I, Lachance DH, Sicotte H, Eckel-Passow JE, Wiencke JK, Jenkins RB, and Wrensch MR 2013. ‘Analysis of 60 reported glioma risk SNPs replicates published GWAS findings but fails to replicate associations from published candidate-gene studies’, Genet Epidemiol, 37: 222–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Wang K, Li M, and Hakonarson H. 2010. ‘Analysing biological pathways in genome-wide association studies’, Nat Rev Genet, 11: 843–54. [DOI] [PubMed] [Google Scholar]
  39. Wang L, Chiang HC, Wu W, Liang B, Xie Z, Yao X, Ma W, Du S, and Zhong Y. 2012. ‘Epidermal growth factor receptor is a preferred target for treating amyloid-beta-induced memory loss’, Proc Natl Acad Sci U S A, 109: 16743–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Ward LD, and Kellis M. 2016. ‘HaploReg v4: systematic mining of putative causal variants, cell types, regulators and target genes for human complex traits and disease’, Nucleic Acids Res, 44: D877–81. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Wightman DP, Jansen IE, Savage JE, Shadrin AA, Bahrami S, Holland D, Rongve A, Borte S, Winsvold BS, Drange OK, Martinsen AE, Skogholt AH, Willer C, Brathen G, Bosnes I, Nielsen JB, Fritsche LG, Thomas LF, Pedersen LM, Gabrielsen ME, Johnsen MB, Meisingset TW, Zhou W, Proitsi P, Hodges A, Dobson R, Velayudhan L, Team andMe Research JM Sealock L. K. Davis, Pedersen NL, Reynolds CA, Karlsson IK, Magnusson S, Stefansson H, Thordardottir S, Jonsson PV, Snaedal J, Zettergren A, Skoog I, Kern S, Waern M, Zetterberg H, Blennow K, Stordal E, Hveem K, Zwart JA, Athanasiu L, Selnes P, Saltvedt I, Sando SB, Ulstein I, Djurovic S, Fladby T, Aarsland D, Selbaek G, Ripke S, Stefansson K, Andreassen OA, and Posthuma D. 2021. ‘A genome-wide association study with 1,126,563 individuals identifies new risk loci for Alzheimer’s disease’, Nat Genet, 53: 1276–82. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Yavorska OO, and Burgess S. 2017. ‘MendelianRandomization: an R package for performing Mendelian randomization analyses using summarized data’, Int J Epidemiol, 46: 1734–39. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Zhang Z, Wang M, and Liu X. 2021. ‘C-reactive protein and risk of Alzheimer’s disease’, Neurobiol Aging. [DOI] [PubMed] [Google Scholar]
  44. Zou F, Chai HS, Younkin CS, Allen M, Crook J, Pankratz VS, Carrasquillo MM, Rowley CN, Nair AA, Middha S, Maharjan S, Nguyen T, Ma L, Malphrus KG, Palusak R, Lincoln S, Bisceglio G, Georgescu C, Kouri N, Kolbert CP, Jen J, Haines JL, Mayeux R, Pericak-Vance MA, Farrer LA, Schellenberg GD, Consortium Alzheimer’s Disease Genetics, Petersen RC, Graff-Radford NR, Dickson DW, Younkin SG, and Ertekin-Taner N. 2012. ‘Brain expression genome-wide association study (eGWAS) identifies human disease-associated variants’, PLoS Genet, 8: e1002707. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Table S2
Table S1
Table S3
Table S4

RESOURCES