Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2018 Feb 9.
Published in final edited form as: Int J Clin Biostat Biom. 2017 Dec 1;3(1):014. doi: 10.23937/2469-5831/1510014

Bayesian Cox Proportional Hazards Model in Survival Analysis of HACE1 Gene with Age at Onset of Alzheimer’s Disease

Ke-Sheng Wang 1,*, Ying Liu 1, Shaoqing Gong 2, Chun Xu 3, Xin Xie 4, Liang Wang 1, Xingguang Luo 5,6,*
PMCID: PMC5806706  NIHMSID: NIHMS939412  PMID: 29430571

Abstract

Alzheimer’s disease (AD), the most common form of dementia, is a chronic neurodegenerative disease. The HECT domain and ankyrin repeat containing E3 ubiquitin protein ligase 1 (HACE1) gene is expressed in human brain and may play a role in the pathogenesis of neurodegenerative disorders. Till now, no previous study has reported the association of the HACE1 gene with the risk and age at onset (AAO) of AD; while few studies have checked the proportional hazards assumption in the survival analysis of AAO of AD using Cox proportional hazards model. In this study, we examined the associations of 14 single nucleotide polymorphisms (SNPs) in the HACE1 gene with the risk and the AAO of AD using 791 AD patients and 782 controls. Multiple logistic regression model identified one SNP (rs9499937 with p = 1.8×10−3) to be associated with the risk of AD. For survival analysis of AAO, both classic Cox regression model and Bayesian survival analysis using the Cox proportional hazards model were applied to examine the association of each SNP with the AAO. The hazards ratio (HR) with its 95% confidence interval (CI) was estimated. Survival analysis using the classic Cox regression model showed that 4 SNPs were significantly associated with the AAO (top SNP rs9499937 with HR=1.33, 95%CI=1.13–1.57, p=5.0×10−4). Bayesian Cox regression model showed similar but a slightly stronger associations (top SNP rs9499937 with HR=1.34, 95%CI=1.11–1.55) compared with the classic Cox regression model. Using an independent family-based sample, one SNP rs9486018 was associated with the risk of AD (p=0.0323) and the T-T-G haplotype from rs9786015, rs9486018 and rs4079063 showed associations with both the risk and AAO of AD (p=2.27×10−3 and 0.0487, respectively). The findings of this study provide first evidence that several genetic variants in the HACE1 gene were associated with the risk and AAO of AD.

Keywords: Alzheimer’s disease, Age at onset, HACE1, Single nucleotide polymorphisms, Survival analysis, Cox model, Bayesian analysis

Introduction

Alzheimer’s disease (AD) is a chronic neurodegenerative disease [1]. It is estimated that, in 2010, about 4.7 million people in the United States (US) aged 65 years or older live with AD, and this number is projected to rise to 13.8 million, a nearly three-fold increase, by 2050 [2]. Weuve et al. [3] estimated the number of adults (aged ≥65 years) with AD in each US state and the District of Columbia (DC) and found that the number of older adults (aged ≥65 years) with AD in the US ranged from 5.1 (Alaska) to 530 million (California) in 2010; while in 2010, older adults with AD dementia comprised a median of 1.6% of a given state’s total population; whereas by 2025, throughout the US, this proportion will keep increasing by nearly one-third on average [3]. The prevalence was estimated to be about 3.12% in 2012 for older adults (≥60 years) in the US using the Nationwide Inpatient Sample (NIS) 2002–2012 [4]. A meta-analysis estimated the prevalence for the combined population of African-Americans and Caucasians aged 65–90 in 2013 was 5.7% in the US; the prevalence for African-Americans aged 65–90 years was 8.6%, compared to 5.5% for Caucasians [5]. Globally, 26.6 million people (0.40% of the world population) suffered from AD in 2006 and it was predicted to affect 1 in 85 people by 2050 [6]. A recent meta-analysis showed that the prevalence of AD in Europe was 5.05% for adult (aged ≥50 years) (3.31% for males and 7.13% females) [7]. Increasing evidence suggests that autophagy may play a central role in AD [8, 9]. The genetic heritability of AD ranges from 49% to 79% based on reviews of twin and family studies [10]. In addition to the risk, the age at onset (AAO) of AD has a genetic component with heritability about 42% [11, 12].

The HECT domain and ankyrin repeat containing E3 ubiquitin protein ligase 1 (HACE1) gene (also known as KIAA1320) is located at 6q16.3 [1315]. The HACE1 is expressed in brain, heart, lung, kidney, testis, and ovary [13,15]. Several studies have implicated that HACE1 is a candidate chromosome 6q21 tumor suppressor gene involved in multiple cancers [1618]. Recently, it has been reported that HACE1 gene may play a role in neurodegeneration [19] and autophagy pathway [20]; while HACE1 mutations are involved in an autosomal recessive neurodevelopmental disorder [21], and glutamine addiction [22]. A genome-wide association study (GWAS) identified five single-nucleotide polymorphisms (SNPs) (rs17065302, rs11759010, rs6927608, rs4946645 and rs4245525) within the HACE1 gene associated with equol-producing phenotype such as blood pressure, which may implicate HACE1 in intestinal immune responses [23]. Another GWAS identified 5 SNPs (rs4336470. rs9404576, rs4079063, rs24996663, and rs2499667) in HACE1 associated with neuroblastoma susceptibility [24]; while in a replication study, rs4336470 showed moderate association (p<0.05) with risk of neuroblastoma [25]. More recently, it has been reported that the five above SNPs in the HACE1 gene may have a weak combined effect (p=0.065) on neuroblastoma risk in Southern Chinese children [26]. Thus we hypothesized that HACE1 genetic variants may be in association with AD development.

To our best knowledge, no study has focused on the association of the HACE1 gene with the risk and AAO of AD. Even though the Cox proportional hazards model has been used to detect genetic associations with the AAO of AD [2729], the proportional hazards assumption may be violated and they might not be carefully checked. Bayesian methods have been widely used recently in genetic association studies and provide alternative ways to traditional statistical methods [3032]. In this study, we explored the association of HACE1 with the AAO of AD by using a Bayesian proportional hazards model in a population-based sample and then a family-based sample for replication.

Subjects and Methods

Study population

791 patients with AD and 782 controls with complete genotype and phenotype information in a Canadian sample were selected from the Multi-Site Collaborative Study for Genotype-Phenotype Associations in Alzheimer’s disease and the Longitudinal follow-up of Genotype-Phenotype Associations in Alzheimer’s disease and the Neuroimaging component of Genotype-Phenotype Associations in Alzheimer’s disease - Study Accession: phs000219.v1.p1. Covariates include sex and age. The details about these subjects were described in previous studies [27, 33]. Genotyping was conducted using the Affymetrix technique. The genotypes of 14 SNPs within the HACE1 gene were available in this data.

A family-based study (1266 AD cases and their relatives, 1070 individuals with the AAO values) were available from the National Institute on Aging - Late Onset Alzheimer’s Disease (NIA-LOAD) Family Study: Genome-Wide Association Study for Susceptibility Loci – Study Accession: phs000168.v1.p1. Genotyping by the Center for Inherited Disease Research (CIDR) was performed using the Illumina Infinium II assay protocol. The details about the sample of subjects were described elsewhere [34]. There are 28 SNPs within the HACE1 gene in this family-based sample.

Statistical analysis

Descriptive statistics and genotype quality control

Descriptive statistics were used to characterize participants’ sex, age and the AAO of AD stratified by AD case and control status [29, 35]. Hardy-Weinberg equilibrium (HWE) was tested for all SNPs using the controls by HAPLOVIEW software [36]. Then, minor allele frequency (MAF) was determined for each SNP. Pairwise linkage disequilibrium (LD) statistics (r2) among SNPs were assessed using the European sample from the HapMap dataset (http://hapmap.ncbi.nlm.nih.gov/) and the founders in the family study.

Multiple logistic and linear regression models in PLINK Software

Multiple logistic regression analysis of each SNP with the risk of AD as a binary outcome, adjusted for sex and age, was performed using PLINK [37]; while the asymptotic p-values were obtained and the odds ratio (OR) and 95% confident interval (CI) were estimated. The parallel procedure was performed for the multiple linear regression analysis of each SNP with the AAO of AD as a continuous outcome. Bonferroni correction (α=0.05/14=3.57×10−3) was used to deal with the multiple comparison issue [38].

Bayesian Cox proportional hazards model in PROC PHREG

The Cox proportional hazards model (1) or Cox regression model [39] is widely used in the analysis of time-to-event data [4042].

h(tx)=h0(t)exp(β1SNPk+β2Sex+β3Age) (1)

where h(t/x) is the hazard at time t for a subject (AAO for this study), h0(t) is the baseline hazard function. The hazard ratio (HR) is defined as the ratio of the predicated hazard function under two different values of a predictor variable. The PHREG procedure in SAS fits the Cox model by maximizing the partial likelihood function. Both the graphical and numerical methods [43] were used to check the proportional hazards assumption in the ASSESS option of PROC PHREG. The ASSESS option plots the cumulative score residuals against time for each independent variable; while the RESAMPLE option computes the p-value of a Kolmogorov-type supremum test based on a sample of 1,000 simulated residual patterns. A significant p-value indicates a poor fit.

The Akaike information criterion (AIC) (2) was used as a measure of better fit among candidate models [44, 45].

AIC=-2ln{p(xθ^)}+2k (2)

where x is the random variable, θ̂ is the maximum likelihood estimate, and k is the number of parameters. A smaller AIC generally indicates a better fit.

Bayesian statistics is an extension of Bayes theorem, which can be written as (3)

P(θY)=P(Yθ)P(θ)P(Y) (3)

where θ is the parameter of interest, Y is the observed evidence, P(Y) is the marginal probability, P(Y|θ) is the likelihood function, P(θ) is the prior, and P(θ|Y) is the posterior probability [32]. Bayesian Cox regression can be requested by using the BAYES statement in the PHREG procedure. Summary statistics (mean, standard deviation, the highest posterior density (HPD) and credible intervals, and correlation matrix) were computed for each of the parameters. Trace plots, posterior density plots, and autocorrelation function plots were also provided [32]. For Bayesian survival analysis of the AAO of AD, the normal prior was chosen for the coefficients and the deviance information criteria (DIC) was available instead of AIC. DIC is intended as a generalization of AIC [46]. A measure of effective numbers of parameters is defined as pD in (4), where pD is the posterior mean deviance minus the deviance measured at the posterior mean of the parameters.

pD=E[-2ln{p(xθ)}]+2ln{p(xθ^)} (4)

Then DIC is defined analagously to AIC as in (5). Models with samller DIC are better supported by the data.

DIC=-2ln{p(xθ^)}+pD (5)

The PHREG procedure in SAS was used to fit the Cox model. Multiple Cox regression model analysis, adjusted for sex and age, was conducted to examine association of each SNP with the AAO of AD. Descriptive statistics and Cox regression analysis were performed with SAS v.9.4 (SAS Institute, Cary, NC, USA).

Family-based study

A family-based association analysis for AD was performed using PBAT version 3.6.7 [47]. For the risk of AD, the family-based association test using generalized estimating equations (FBAT-GEE) was used [48]; while for the AAO, FBAT-Wilcoxon statistics were employed [49]. The AAO values for healthy siblings were censored and age at entry into the study was used. Haplotype analysis was conducted in 2 or 3-SNP sliding window.

Results

Descriptive statistics and genotype quality control

The demographic characteristics of the subjects are detailed in Table 1. The mean AAO for cases was 76.4 and 72.3 years, respectively, in the NIA and Canadian samples, respectively. All 14 SNPs had MAF>5% and were in HWE in the controls (p>0.05); while 1 of 28 SNPs with HWE<0.10−4 in the family sample was removed for further analysis.

Table 1.

Descriptive characteristics of cases and controls

Variable Family Study (NIA sample) Case-control Study (Canadian sample)

Patients Controls Patients Controls
Sample size (n) 1266 1279 791 782
Mean of age at onset (years±SD) 76.4 (±6.7) - 72.3 (±8.5) -
Median of age at onset (years) 77 - 73 -
Range of age at onset (years) 50–98 - 40–97 -
Mean age at entry (years±SD) - 75.5 (±8.1) 77.6 (±8.6) 73.4 (±7.9)
Median age at entry (years) - 75 79 79
Range of age at entry (years) - 42–103 43–100 48–94

Multiple linear and logistic regression analyses using PLINK

We found that one SNP was associated with the risk of AD (rs9499937 with p=1.8 × 10−3) and four SNPs were associated with the AAO of AD (rs7746856, rs6941988, rs9499937 and rs7770002 with p=3.09 × 10−2, 3.88 × 10−3, 7.39 × 10−4, and 3.14 × 10−2, respectively) (Table 2). Interestingly, the same SNP rs9499937 showed associations with both the risk and AAO of AD and the results remained significant after Bonferroni correction (p<3.57×10−3).

Table 2.

SNPs associated with the risk and/or age at onset of AD (p<0.05)

SNP Position Allelea MAFb HWEc OR-ADd p-ADe β-AAOf p-AAOg
rs7746856 105253053 A 0.47 0.787 0.92(0.79–1.06) 0.246 −0.367(−0.70, −0.034) 0.0309
rs6941988 105253349 C 0.47 0.84 0.92(0.79–1.06) 0.239 −0.351(−0.68, −0.019) 0.00388
rs9499937 105273953 T 0.19 0.649 0.73(0.60–0.89) 1.80×10−3 −0.79(−1.25, −0.34) 7.39×10−4
rs7770002 105280138 A 0.47 0.96 0.92(0.80–1.07) 0.268 −0.366(−0.70, −0.033) 0.0314
a

Minor allele;

b

Minor allele frequency;

c

Hardy-Weinberg equilibrium test p-value;

d

Odds ratio based on logistic regression;

e

p-value based on logistic regression;

f

Regression coefficient for age at onset (AAO) based on multiple linear regression;

g

p-value based on linear regression.

Supremum test for proportional hazards assumption

Figures 1 and 2 display the observed standardized score process with 20 simulated realizations from the null distribution for rs9499937 CC and CT genotypes, respectively. The plots showed that the observed process was atypical compared to the simulated realizations and revealed proportional hazards for the two genotypes compared with TT. The Kolmogorov-type supremum test results based on 1,000 simulations for all the covariates were not significant (p>0.05), suggesting the proportional hazards assumption was valid for all the variables in the Canadian sample.

Figure 1.

Figure 1

Explore plot for checking proportional hazards assumption for rs9499937C_C gentoype compared with rs9499937T_T genotype

Figure 2.

Figure 2

Explore plot for checking proportional hazards assumption for rs9499937C_T compared with rs9499937T_T

Classic and Bayesian Cox proportional hazards model using PROC PHREG

The classic Cox model showed that four SNPs were associated with the AAO of AD (the CT genotype of the top SNP rs9499937 with HR=1.33, 95%CI=1.13–1.57, p=5.0×10−4). The HRs based on the Bayesian survival analyses revealed similar but a slightly stronger associations compared with the non-Bayesian analyses results (Table 3). The DIC for the four SNPs were similar to those of AIC. The trace plot, posterior density plot, and autocorrelation function plot based on Bayesian analysis (Figure 3) indicated that the Markov chain had stabilized with good mixing for rs9499937. The posterior density plot, which estimates the posterior marginal distributions for the four regression coefficients, showed a smooth and unimodal shape for the posterior marginal distribution (Figure 4).

Table 3.

SNPs associated with the age at onset of AD using PROC PHREG (p<0.05)

SNP GTa HRb pc AICd HRe HPDf DICg
rs7746856 C-C 7622.19 7622.21
A-A 1.28(1.04–1.57) 0.0208 1.28 1.02–1.56
A-G 1.02(0.87–1.20) 0.82 1.02 0.86–1.19
rs6941988 T-T 7622.24 7622.32
C-C 1.27(1.03–1.56) 0.026 1.27 1.02–1.54
C-T 1.01(0.85–1.18) 0.959 1.01 0.85–1.18
rs9499937 C-C 7616.16 7616.17
C-T 1.33(1.13–1.57) 5.0×10−4 1.34 1.11–1.55
T-T 1.35(0.80–2.27) 0.266 1.35 0.68–2.06
rs7770002 A-G 7622.37 7622.34
A–C 1.27(1.03–1.57) 0.0239 1.28 1.02–1.55
C-C 1.01(0.86–1.20) 0.864 1.02 0.85–1.19
a

Tested genotype comparing with the reference;

b

Hazards ratio (HR) for the tested genotype based on classic Cox regression analysis using PROC PHREG;

c

p-value for the tested genotype based on classic Cox regression analysis;

d

Akaike information criterion (AIC) value based on classic Cox regression analysis;

e

HR for the tested genotype based on Bayesian Cox regression analysis;

f

Lower and upper 95% highest posterior density (HPD) of HR based on Bayesian Cox regression analysis;

g

Deviance information criteria (DIC) value based on Bayesian Cox regression analysis.

Figure 3.

Figure 3

Trace plot, autocorrelation function plot, and posterior density plot for rs9499937

Figure 4.

Figure 4

The posterior density plots for the 4 regression coefficients

Family-based association analyses

We observed one SNP associated with the risk of AD (rs9486018 with p=0.0323) by using FBAT-GEE analysis in the family-based study. The T-A haplotype from rs6937432 and rs6940552 revealed mostly significant associations with the risk (p=9.7×10−4). The T-T haplotype from rs9786015 and rs9486018 and the T-G haplotype from rs9486018 and rs4079063 showed significant association with the risk (p=1.56×10−3 and 4.46×10−3, respectively) (Table 4). Using the FBAT-Wilcoxon test, the C-C haplotype from rs6937026 and rs6946640 revealed mostly significant associations with the AAO (p=0.0223). The C-A-G and C-C-A haplotypes from rs6437026, rs4946640 and rs6910034 showed associations with the AAO (p=0.0274 and 0.0225, respectively). The T-T-G haplotype from rs9786015, rs9486018 and rs4079063 showed associations with both the risk (p=2.27×10−3) and the AAO (p=0.0487).

Table 4.

Haplotype analysis of the risk and age at onset of AD in the family sample

SNPs Haplotypea Frequencyb Fam# p-valued
Risk of AD

rs9499927 rs9404573 C-C 0.01 37 0.00144
rs9404573 rs9499934 C-T 0.02 41 0.00355
rs6937432 rs6940552 T-A 0.01 37 0.00097
rs1378720 rs13198196 C-T 0.02 60 0.0147
rs9486015 rs9486018 T-T 0.06 101 0.00156
rs9486018 rs4079063 T-G 0.07 120 0.00446
rs9486015 rs9486018 rs4079063 T-T-G 0.04 54 0.00227

Age at onset of AD

rs6937026 rs4946640 C-C 0.23 166 0.0223
rs6937026 rs4946640 rs6910034 C-A-G 0.51 156 0.0274
C-C-A 0.16 100 0.0225
rs9486015 rs9486018 rs4079063 T-T-A 0.01 22 0.0719
T-T-G 0.04 46 0.0487
a

Haplotype inferred from 2 or 3 SNPs;

b

Haplotype frequency;

c

FAM# refers to the number of informative families using an additive model;

d

p-value for the haplotype based on FBAT-GEE analysis for the risk or based on FBAT-Wilcoxon analysis for the age at onset.

The linkage disequilibrium structure of the HACE1 gene

Using the HapMap data, we identified one haplotype block including the four SNPs associated AD and/or AAO in the case-control study. Figure 5 shows the LD structure based on LD statistics (r2). Based on the rough rule of thumb, values of r2>1/3 might indicate sufficiently strong LD that can be used for a fine mapping [50]. The neuroblastoma associated rs4336470 [24, 25] had moderate or strong LD with three AAO associated SNPs (rs7746856, rs6941988 and rs7770002 with r2=0.6, 0.6 and 0.71, respectively) and weak LD with the risk and the AAO - associated SNP rs9499937 (r2=0.27). Furthermore, there was strong LD observed between rs2499663 and three AAO associated SNPs (rs7746856, rs6941988 and rs7770002 with r2=0.81, 0.81 and 0.93, respectively). Using the founders in the family study, LD structure based on (r2) was constructed for all 28 SNPs (Figure 6). The two neuroblastoma associated SNPs (rs4336470 and rs9404576) [24, 25] had moderate LD with SNPs (such as rs6910034, rs9404573, rs9499934, rs696937432, rs9486015, and rs4079063) which built AD and AAO associated haplotypes; while another neuroblastoma associated SNPs rs2499663 [24, 25] had moderate or strong LD with SNPs involved in AD and AAO associated haplotypes.

Figure 5.

Figure 5

Linkage disequilibrium structure (r2) within the HACE1 gene using the HapMap data (Dark area shows r2=1)

Figure 6.

Figure 6

Linkage disequilibrium structure (r2) within the HACE1 gene of 28 SNPs in the family sample (Dark area shows r2=1)

Discussion

In the present study we explored the association of 14 HACE1 SNPs with the risk and AAO of AD using a case-control study and identified one SNP associated with AD and four SNPs with the AAO of AD using PLINK software. Interestingly, the same SNP rs9499937 showed associations with both the risk and AAO of AD. Bayesian Cox regressions revealed similar but a slightly stronger associations with the AAO of AD. Using an independent family-based sample, one SNP rs9486018 was associated with the risk of AD while haplotype analyses further revealed the associations with the risk and AAO of AD. The findings of this study provide the first evidence that several genetic variants in the HACE1 gene influenced the risk and the AAO of AD.

A previous study suggested that rs9391227 in the HACE1 gene was associated with celiac disease involved in the immune system and antigen presentation [51]; while another study revealed that five SNPs (rs17065302, rs11759010, rs6927608, rs4946645 and rs4245525) within HACE1 gene associated with equol-producing phenotype such as blood pressure [23] which may implicate HACE1 in immune responses. However, rs9391227 [51] was not available in both the case-control and family-based samples; while equol-producing phenotype associated five SNPs [23] were not associated with the risk or the AAO of AD in the case-control study and rs6927608 was not associated with the risk and AAO of AD in both samples. However, rs9391227 had strong LD with three AAO associated SNPs (rs7746856, rs6941988 and rs7770002 with r2=1.0, 1.0 and 0.87, respectively) in the case-control study (Figure 5); whereas rs6927608 had weak LD with other SNPs (Figures 5 and 6). Previous studies have shown that AD is a chronic neurodegenerative disease while autophagy, immune and inflammatory processes are involved in the pathogenesis of AD [1, 8, 9, 5256].

Previous epidemiology studies have suggested comorbidity of AD with certain cancers [5759]; whereas there may be an inverse link between cancer and AD [5961]. Recently, a meta-analysis of nine studies supports an association between AD and decreased cancer risk [62]. However, the mechanism still remains unclear [60]. Several SNPs (such as rs4336470. rs9404576, rs4079063, rs24996663, and rs2499667) in the HACE1 have been found to be associated with neuroblastoma susceptibility [2426]. In the present study, the T-G haplotype inferred from the rs9486018 and rs4079063 was found to be associated with the risk of AD; while the T-T-A haplotype inferred from the rs9486015, rs9486018 and rs4079063 was found to be associated with the AAO of AD in the family-based study (Table 4). Furthermore, four neuroblastoma associated SNPs (rs4336470, rs9404576, rs4079063 and rs2499663) [2426] had moderate to strong LDs with SNPs which built AD and AAO associated haplotypes (Figure 6). In addition, recently, it has been reported that HACE1 may play a role in neurodevelopment and addiction [19, 21, 22]. Taken together, the above findings may suggest that HACE1 gene may be involved in the pathogenesis of AD, cancers and blood pressure; however, the mechanism warrants further studies.

Several strengths of this study are worthy of noting. The present study provides the first evidence of several genetic variants within the HACE1 gene associated with the risk and the AAO of AD using a case-control sample and a family-based sample for replication. Furthermore, we checked the proportional hazards assumption using both the graphical and numerical methods for the Cox proportional hazards model and found that the proportional hazards assumption was valid for the AAO of AD data in the Canadian sample. In addition, we conducted Bayesian survival analysis of genetic variants with the AAO of AD. Bayesian method may provide an alternative approach to assessing and verifying associations that alleviates the limitations of p-values at the cost of some additional modelling and it has recently made great inroads in genetic association studies [30]. Like other research studies, some limitations also exist in this study. First, due to different genotyping platforms, only two SNPs (rs6927608 and rs24999663) overlap in two samples; therefore, our replication results in the family-based sample are gene-based rather than SNP-based. Second, only one SNP rs9499937 was associated with the risk and AAO of AD in the case-control study; while one haplotype T-T-G was associated with the risk and AAO of AD in the family-based study; which revealed heterogeneity between risk and AAO of AD. In addition, our current findings might be subject to type I error and need to be replicated in future studies.

Conclusion

To the best of our knowledge, this is the first candidate gene study which investigated the associations of HACE1 SNPs with the risk of and the AAO of AD. The findings may serve as a resource for replication in other populations for future investigations on target genetic variation and AD. Future functional studies of this gene may help better characterize the genetic architecture of the risk of and AAO of AD.

Acknowledgments

We acknowledge the NIH GWAS Data Repository, the Contributing Investigator(s) who contributed the phenotype data and DNA samples from his/her original study and the primary funding organization that supported the contributing study “Multi-Site Collaborative Study for Genotype-Phenotype Associations in Alzheimer’s disease and longitudinal follow-up of Genotype-Phenotype Associations in Alzheimer’s disease and Neuroimaging component of Genotype-Phenotype Associations in Alzheimer’s disease” and “National Institute on Aging - Late Onset Alzheimer’s Disease Family Study: Genome-Wide Association Study for Susceptibility Loci”. The genotypic and associated phenotypic data used in the study, “Multi-Site Collaborative Study for Genotype-Phenotype Associations in Alzheimer’s Disease (GenADA)” were provided by the GlaxoSmithKline, R&D Limited. The datasets used for analyses described in this manuscript were obtained from dbGaP at http://www.ncbi.nlm.nih.gov/gap through dbGaP accession number phs000219.v1.p1. Funding support for the “Genetic Consortium for Late Onset Alzheimer’s Disease” was provided through the Division of Neuroscience, NIA. The Genetic Consortium for Late Onset Alzheimer’s Disease includes a genome-wide association study funded as part of the Division of Neuroscience, NIA. Assistance with phenotype harmonization and genotype cleaning, as well as with general study coordination, was provided by Genetic Consortium for Late Onset Alzheimer’s Disease. The datasets used for analyses described in this manuscript were obtained from dbGaP at http://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000168.v1.p1.

Role of the funding sources

No founding source is given for the present paper.

Footnotes

Conflict of interest

The authors declare that they have no conflict of interest.

References

  • 1.Bird TD. Genetic aspects of Alzheimer disease. Genet Med. 2008;10:231–239. doi: 10.1097/GIM.0b013e31816b64dc. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Hebert LE, Weuve J, Scherr PA, Evans DL. Alzheimer disease in the United States (2010–2050) estimated using the 2010 census. Neurology. 2013;80:1778–83. doi: 10.1212/WNL.0b013e31828726f5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Weuve J, Hebert LE, Scherr PA, Evans DA. Prevalence of Alzheimer disease in US states. Epidemiology. 2015;26(1):e4–6. doi: 10.1097/EDE.0000000000000199. [DOI] [PubMed] [Google Scholar]
  • 4.Beydoun MA, Beydoun HA, Gamaldo AA, Rostant OS, Dore GA, Zonderman AB, et al. Nationwide Inpatient Prevalence, Predictors, and Outcomes of Alzheimer’s Disease among Older Adults in the United States, 2002–2012. J Alzheimers Dis. 2015;48(2):361–75. doi: 10.3233/JAD-150228. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Steenland K, Goldstein FC, Levey A, Wharton W. A Meta-Analysis of Alzheimer’s Disease Incidence and Prevalence Comparing African-Americans and Caucasians. J Alzheimers Dis. 2016;50(1):71–6. doi: 10.3233/JAD-150778. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Brookmeyer R, Johnson E, Ziegler-Graham K, Arrighi HM. Forecasting the global burden of Alzheimer’s disease. Alzheimer’s & Dementia. 2007;3(3):186–91. doi: 10.1016/j.jalz.2007.04.381. [DOI] [PubMed] [Google Scholar]
  • 7.Niu H, Álvarez-Álvarez I, Guillén-Grima F, Aguinaga-Ontoso I. Prevalence and incidence of Alzheimer’s disease in Europe: A meta-analysis. Neurologia. 2017;32(8):523–532. doi: 10.1016/j.nrl.2016.02.016. [DOI] [PubMed] [Google Scholar]
  • 8.Chesser AS, Pritchard SM, Johnson GV. Tau clearance mechanisms and their possible role in the pathogenesis of Alzheimer disease. Front Neurol. 2013;4:122. doi: 10.3389/fneur.2013.00122. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Shin JY, Park HJ, Kim HN, Oh SH, Bae JS, Ha HJ, et al. Mesenchymal stem cells enhance autophagy and increase β-amyloid clearance in Alzheimer disease models. Autophagy. 2014;10(1):32–44. doi: 10.4161/auto.26508. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Gatz M, Reynolds CA, Fratiglioni L, Johansson B, Mortimer JA, Berg S, et al. Role of Genes and Environments for Explaining Alzheimer Disease. Arch Gen Psychiatry. 2006;63(2):168–74. doi: 10.1001/archpsyc.63.2.168. [DOI] [PubMed] [Google Scholar]
  • 11.Daw EW, Payami H, Nemens EJ, Nochlin D, Bird TD, Schellenberg GD, et al. The number of trait loci in late-onset Alzheimer disease. Am J Hum Genet. 2000;66:196–204. doi: 10.1086/302710. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Li YJ, Scott WK, Hedges DJ, Zhang F, Gaskell PC, Nance MA, et al. AAO in two common neurodegenerative diseases is genetically controlled. Am J Hum Genet. 2002;70:985–993. doi: 10.1086/339815. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Nagase T, Kikuno R, Ishikawa K, Hirosawa M, Ohara O. Prediction of the coding sequences of unidentified human genes. XVI. The complete sequences of 150 new cDNA clones from brain which code for large proteins in vitro. DNA Res. 2000;7:65–73. doi: 10.1093/dnares/7.1.65. [DOI] [PubMed] [Google Scholar]
  • 14.Fernandez CV, Lestou VS, Wildish J, Lee CLY, Sorensen PHB. Detection of a novel t(6;15)(q21;q21) in a pediatric Wilms tumor. Cancer Genet Cytogenet. 2001;129:165–167. doi: 10.1016/s0165-4608(01)00445-9. [DOI] [PubMed] [Google Scholar]
  • 15.Anglesio MS, Evdokimova V, Melnyk N, Zhang L, Fernandez CV, Grundy PE, et al. Differential expression of a novel ankyrin containing E3 ubiquitin-protein ligase, Hace1, in sporadic Wilms’ tumor versus normal kidney. Hum Molec Genet. 2004;13:2061–2074. doi: 10.1093/hmg/ddh215. [DOI] [PubMed] [Google Scholar]
  • 16.Zhang L, Anglesio MS, O’Sullivan M, Zhang F, Yang G, Sarao R, et al. The E3 ligase HACE1 is a critical chromosome 6q21 tumor suppressor involved in multiple cancers. Nature Med. 2007;13:1060–1069. doi: 10.1038/nm1621. [DOI] [PubMed] [Google Scholar]
  • 17.Küçük C, Hu X, Iqbal J, Gaulard P, Klinkebiel D, Cornish A, et al. HACE1 is a tumor suppressor gene candidate in natural killer cell neoplasms. Am J Pathol. 2013;182(1):49–55. doi: 10.1016/j.ajpath.2012.09.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Gao ZF, Wu YN, Bai ZT, Zhang L, Zhou Q, Li X. Tumor-suppressive role of HACE1 in hepatocellular carcinoma and its clinical significance. Oncol Rep. 2016;36(6):3427–3435. doi: 10.3892/or.2016.5205. [DOI] [PubMed] [Google Scholar]
  • 19.Rotblat B, Southwell AL, Ehrnhoefer DE, Skotte NH, Metzler M, Franciosi S, et al. HACE1 reduces oxidative stress and mutant Huntingtin toxicity by promoting the NRF2 response. Proc Natl Acad Sci U S A. 2014;111(8):3032–7. doi: 10.1073/pnas.1314421111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Zhang L, Chen X, Sharma P, Moon M, Sheftel AD, Dawood F, et al. HACE1-dependent protein degradation provides cardiac protection in response to haemodynamic stress. Nat Commun. 2014;5:3430. doi: 10.1038/ncomms4430. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Hollstein R, Parry DA, Nalbach L, Logan CV, Strom TM, Hartill VL, et al. HACE1 deficiency causes an autosomal recessive neurodevelopmental syndrome. J Med Genet. 2015;52(12):797–803. doi: 10.1136/jmedgenet-2015-103344. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Cetinbas N, Daugaard M, Mullen AR, Hajee S, Rotblat B, Lopez A, DeBerardinis RJ, Sorensen PH, et al. Loss of the tumor suppressor Hace1 leads to ROS-dependent glutamine addiction. Oncogene. 2015;34(30):4005–10. doi: 10.1038/onc.2014.316. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Hong KW, Ko KP, Ahn Y, Kim CS, Park SJ, Park JK, et al. Epidemiological profiles between equol producers and nonproducers: a genomewide association study of the equol-producing phenotype. Genes Nutr. 2012;7(4):567–74. doi: 10.1007/s12263-012-0292-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Diskin SJ, Capasso M, Schnepp RW, Cole KA, Attiyeh EF, Hou C, et al. Common variation at 6q16 within HACE1 and LIN28B influences susceptibility to neuroblastoma. Nat Genet. 2012;44:1126–30. doi: 10.1038/ng.2387. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Capasso M, Diskin SJ, Totaro F, Longo L, De Mariano M, Russo R, Cimmino F, et al. Replication of GWAS-identified neuroblastoma risk loci strengthens the role of BARD1 and affirms the cumulative effect of genetic variations on disease susceptibility. Carcinogenesis. 2013;34(3):605–11. doi: 10.1093/carcin/bgs380. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Zhang Z, Zhang R, Zhu J, Wang F, Yang T, Zou Y, et al. Common variations within HACE1 gene and neuroblastoma susceptibility in a Southern Chinese population. Onco Targets Ther. 2017;10:703–709. doi: 10.2147/OTT.S129042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Li H, Wetten S, Li L, St Jean PL, Upmanyu R, Surh L, et al. Candidate single-nucleotide polymorphisms from a genomewide association study of Alzheimer disease. Arch Neurol. 2008;65(1):45–53. doi: 10.1001/archneurol.2007.3. [DOI] [PubMed] [Google Scholar]
  • 28.Janicki SC, Park N, Cheng R, Clark LN, Lee JH, Schupf N. Estrogen receptor α variants affect age at onset of Alzheimer’s disease in a multiethnic female cohort. Dement Geriatr Cogn Disord. 2014;38(3–4):200–13. doi: 10.1159/000355559. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Wang KS, Liu X, Wang L, Briones D, Xu C. Genetic variants in the SORL1 gene are associated with age at onset of Alzheimer disease: A survival analysis. International Journal of Medical Genetics. 2014;2014:e689832. [Google Scholar]
  • 30.Stephens M, Balding DJ. Bayesian statistical methods for genetic association studies. Nat Rev Genet. 2009;10(10):681–90. doi: 10.1038/nrg2615. [DOI] [PubMed] [Google Scholar]
  • 31.Sullivan SG, Greenland S. Bayesian regression in SAS software. Int J Epidemiol. 2013;42(1):308–17. doi: 10.1093/ije/dys213. [DOI] [PubMed] [Google Scholar]
  • 32.Stokes M, Chen F, Gunes F. An introduction to Bayesian analysis with SAS/STAT® software. Proceedings of the SAS Global Forum 2014 Conference; Cary, NC: SAS Institute Inc; 2014. [Available at https://support.sas.com/resources/papers/proceedings14/SAS400-2014.pdf.] [Google Scholar]
  • 33.Filippini N, Rao A, Wetten S, Gibson RA, Borrie M, Guzman D, et al. Anatomically-distinct genetic associations of APOE epsilon4 allele load with regional cortical atrophy in Alzheimer’s disease. Neuroimage. 2009;44(3):724–728. doi: 10.1016/j.neuroimage.2008.10.003. [DOI] [PubMed] [Google Scholar]
  • 34.Lee JH, Cheng R, Graff-Radford N, Foroud T, Mayeux R National Institute on Aging Late-Onset Alzheimer’s Disease Family Study Group. Analyses of the National Institute on Aging Late-Onset Alzheimer’s Disease Family Study: implication of additional loci. Arch Neurol. 2008;65(11):1518–26. doi: 10.1001/archneur.65.11.1518. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Wang KS, Liu X, Xie C, Liu Y, Xu C. Non-parametric survival analysis of EPG5 gene with age at onset of Alzheimer’s disease. Journal of Molecular Neuroscience. 2016;60(4):436–444. doi: 10.1007/s12031-016-0821-9. [DOI] [PubMed] [Google Scholar]
  • 36.Barrett JC, Fry B, Maller J, Daly MJ. Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics. 2005;21(2):263–265. doi: 10.1093/bioinformatics/bth457. [DOI] [PubMed] [Google Scholar]
  • 37.Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81(3):559–75. doi: 10.1086/519795. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Dunn OJ. Multiple Comparisons Among Means. Journal of the American Statistical Association. 1961;56(293):52–64. [Google Scholar]
  • 39.Cox DR. Regression models and life-tables. J R Stat Soc Ser B (Methodol) 1972;34(2):187–220. [Google Scholar]
  • 40.Cantor AB. SAS Survival Analysis Techniques for Medical Research. 3. Cary, NC: SAS institute INC; 2007. [Google Scholar]
  • 41.George B, Seals S, Aban I. Survival analysis and regression models. J Nucl Cardiol. 2014;21(4):686–94. doi: 10.1007/s12350-014-9908-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Kasza J, Wraith D, Lamb K, Wolfe R. Survival analysis of time-to-event data in respiratory health research studies. Respirology. 2014;19(4):483–92. doi: 10.1111/resp.12281. [DOI] [PubMed] [Google Scholar]
  • 43.Lin D, Wei LJ, Ying Z. Checking the Cox Model with Cumulative Sums of Martingale-Based Residuals. Biometrika. 1993;80:557–572. [Google Scholar]
  • 44.Akaike H. A Bayesian Extension of the Minimum AIC Procedure of Autoregressive Model Fitting. Biometrika. 1979;66:237–242. [Google Scholar]
  • 45.Akaike H. Likelihood of a Model and Information Criteria. J Econ. 1981;16:3–14. [Google Scholar]
  • 46.Spiegelhalter DJ, Best NG, Carlin BP, Van der Linde A. Bayesian Measures of Model Complexity and Fit (with Discussion) Journal of the Royal Statistical Society, Series B. 2002;64(4):583–616. [Google Scholar]
  • 47.Van Steen K, Lange C. PBAT: a comprehensive software package for genome-wide association analysis of complex family-based studies. Hum Genomics. 2005;2:67–9. doi: 10.1186/1479-7364-2-1-67. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Lange C, Silverman EK, Xu X, Weiss ST, Laird NM. A multivariate family-based association test using generalized estimating equations: FBAT-GEE. Biostatistics. 2003;4(2):195–206. doi: 10.1093/biostatistics/4.2.195. [DOI] [PubMed] [Google Scholar]
  • 49.Lange C, Blacker D, Laird NM. Family-based association tests for survival and times-to-onset analysis. Stat Med. 2004;23(2):179–89. doi: 10.1002/sim.1707. [DOI] [PubMed] [Google Scholar]
  • 50.Ardlie KG, Kruglyak L, Seielstad M. Patterns of linkage disequilibrium in the human genome. Nat Rev Genet. 2002;3(4):299–309. doi: 10.1038/nrg777. [DOI] [PubMed] [Google Scholar]
  • 51.Einarsdottir E, Bevova MR, Zhernakova A, Monsuur A, Koskinen LL, van’t Slot R, et al. Multiple independent variants in 6q21–22 associated with susceptibility to celiac disease in the Dutch, Finnish and Hungarian populations. Eur J Hum Genet. 2011;19(6):682–6. doi: 10.1038/ejhg.2011.2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Holmes C, Cunningham C, Zotova E, Woolford J, Dean C, Kerr S, et al. Systemic inflammation and disease progression in Alzheimer disease. Neurology. 2009;73(10):768–74. doi: 10.1212/WNL.0b013e3181b6bb95. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Takeda S, Sato N, Morishita R. Systemic inflammation, blood-brain barrier vulnerability and cognitive/non-cognitive symptoms in Alzheimer disease: relevance to pathogenesis and therapy. Front Aging Neurosci. 2014;6:171. doi: 10.3389/fnagi.2014.00171. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Blach-Olszewska Z, Zaczynska E, Gustaw-Rothenberg K, Avila-Rodrigues M, Barreto GE, Leszek J, et al. The Innate Immunity in Alzheimer Disease- Relevance to Pathogenesis and Therapy. Curr Pharm Des. 2015;21(25):3582–8. doi: 10.2174/1381612821666150710144829. [DOI] [PubMed] [Google Scholar]
  • 55.Heppner FL, Ransohoff RM, Becher B. Immune attack: the role of inflammation in Alzheimer disease. Nat Rev Neurosci. 2015;16(6):358–72. doi: 10.1038/nrn3880. [DOI] [PubMed] [Google Scholar]
  • 56.López González I, Garcia-Esparcia P, Llorens F, Ferrer I. Genetic and Transcriptomic Profiles of Inflammation in Neurodegenerative Diseases: Alzheimer, Parkinson, Creutzfeldt-Jakob and Tauopathies. Int J Mol Sci. 2016;17(2):206. doi: 10.3390/ijms17020206. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Burke WJ, McLaughlin JR, Chung HD, Gillespie KN, Grossberg GT, Luque FA, et al. Occurrence of cancer in Alzheimer and elderly control patients: an epidemiologic necropsy study. Alzheimer Dis Assoc Disord. 1994;8(1):22–8. doi: 10.1097/00002093-199408010-00005. [DOI] [PubMed] [Google Scholar]
  • 58.Roe CM, Behrens MI, Xiong C, Miller JP, Morris JC. Alzheimer disease and cancer. Neurology. 2005;64(5):895–8. doi: 10.1212/01.WNL.0000152889.94785.51. [DOI] [PubMed] [Google Scholar]
  • 59.Roe CM, Fitzpatrick AL, Xiong C, Sieh W, Kuller L, Miller JP, et al. Cancer linked to Alzheimer disease but not vascular dementia. Neurology. 2010;74(2):106–12. doi: 10.1212/WNL.0b013e3181c91873. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Thinnes FP. Why cancer survivors have a lower risk of Alzheimer disease. Mol Genet Metab. 2012;107(3):630–1. doi: 10.1016/j.ymgme.2012.06.016. [DOI] [PubMed] [Google Scholar]
  • 61.Musicco M, Adorni F, Di Santo S, Prinelli F, Pettenati C, Caltagirone C, et al. Inverse occurrence of cancer and Alzheimer disease: a population-based incidence study. Neurology. 2013;81(4):322–8. doi: 10.1212/WNL.0b013e31829c5ec1. [DOI] [PubMed] [Google Scholar]
  • 62.Shi HB, Tang B, Liu YW, Wang XF, Chen GJ. Alzheimer disease and cancer risk: a meta-analysis. J Cancer Res Clin Oncol. 2015;141(3):485–94. doi: 10.1007/s00432-014-1773-5. [DOI] [PubMed] [Google Scholar]

RESOURCES