Abstract
Objective
The association between habitual coffee or caffeine consumption and age at onset (AAO) of Huntington’s disease (HD) is unclear. We employed Mendelian randomization to investigate the causal relationship between coffee consumption and AAO of HD.
Methods
We selected 14 independent genetic variants associated with coffee consumption from a genome-wide association study (GWAS) meta-analysis of 375,833 individuals of European ancestry. Genetic association estimates for AAO of HD were obtained from the Genetic Modifiers of Huntington’s Disease Consortium GWAS meta-analysis including 9,064 HD patients of European ancestry. The inverse variance weighted method was used to evaluate the causal estimate and a comprehensive set of analyses tested the robustness of our results.
Results
Genetically predicted higher coffee consumption was associated with an earlier AAO of HD (β = −1.84 years, 95% confidence interval (CI) = −3.47 to −0.22, P = 0.026). Results were robust to potential pleiotropy and weak instrument bias.
Conclusions
This genetic study suggests high coffee consumption is associated with an earlier AAO of HD. Coffee is widely consumed and thus our findings, if confirmed, offers a potential way to delay the onset of this debilitating autosomal dominant disease.
Keywords: Huntington’s disease, age at onset, coffee consumption, Mendelian randomization
Introduction
Huntington’s disease (HD) is an autosomal dominant neurodegenerative disease caused by a CAG repeat expansion in huntingtin gene (HTT) and is characterized by movement, cognitive and psychiatric disorders.1, 2 The annual incidence of HD is about 0.38 per 100,000,3 with death occurring within 15–20 years after onset.4 CAG repeat length inversely correlates with and accounts for 50–70% of variance in age at onset (AAO), and the remaining variation is likely explained by other genetic and environmental factors.4, 5 Given that HD is a fatal disease and no disease-modifying therapies are available, it is important to identify interventions that can delay the AAO of HD. A single retrospective observational study reported that higher caffeine intake was associated with an earlier AAO of HD.6 Additional epidemiological support is warranted but challenged by lack of large prospective studies of HD, a rare disease.
Mendelian randomization (MR) is an alternative approach for inferring causality that uses genetic variants as instruments for traits of interest (e.g., coffee consumption) and is not susceptible to confounding and reverse causation in observational studies.7–10 With available genome-wide association study (GWAS) results for consumption of coffee, the major source of caffeine for many populations, and AAO of HD,11, 12 MR offers an opportunity to efficiently investigate the casual relationship between coffee consumption and AAO of HD. Therefore, we performed this 2-sample MR study to investigate the causal effects of coffee consumption on AAO of HD.
Materials and methods
We used the summary statistics from published GWAS that previously obtained informed consent and ethical review board approvals.11, 12
Genetic instruments for coffee consumption
We selected genetic instruments for coffee consumption (per 1% increment) based on a GWAS meta-analysis of the UK Biobank and three independent US cohorts including up to 375,833 individuals of European ancestry.11 In UK Biobank (discovery stage), coffee consumption was collected from all participants at baseline using a touchscreen questionnaire.11 Total coffee consumption was based on the question ‘How many cups of coffee do you drink each day (include decaffeinated coffee)? The association analyses were adjusted for age, sex, body mass index (BMI), and top 20 principal components.11 Replication stage was carried out in three independent US corhorts: Women’s Genome Health Study (WGHS), Nurses’Health Study (NHS), and Health Professionals Follow-up Study (HPFS). For these three replication cohorts, coffee consumption data were collected by the same food frequency questionnaire (FFQ). For WGHS, the GWAS analysis was adjusted for age, total caloric intake, top 10 eigenvectors of population substructure, and BMI. For NIHS and HPFS, the GWAS analysis was adjusted for age, total caloric intake, case-control study, top 4 eigenvectors of population substructure, and BMI. Finally, this GWAS meta-analysis identified 15 single nucleotide polymorphisms (SNPs) associated with coffee consumption at genome-wide significance (P < 5 × 10−8).11 We clumped these SNPs for independence (r2 < 0.1, distance window, 10,000kb), and used the remaining 14 SNPs as the primary genetic instrument. The effect sizes for the SNP–coffee consumption associations were expressed per 1% increment in coffee consumption in the GWAS meta-analysis,11 and were rescaled to 50% increment in coffee consumption in this study.
As a secondary instrument, we used two of these 14 SNPs which present with the largest effect estimates, have been confirmed in multiple studies11, 13–15 and are biologically relevant; SNPs near CYP1A1/A2 (rs2472297) and AHR (rs4410790). CYP1A2 encodes cytochrome P450(CYP)1A2 which accounts for ~ 95% of caffeine metabolism in humans, and AHR encodes the aryl hydrocarbon receptor which can regulate the expression of CYP1A1 and CYP1A2.13, 16, 17 The two variants of these loci associated with higher coffee consumption are also associated with lower blood caffeine levels in GWAS.18
Outcome data sources
The primary outcome in this study was AAO of HD corrected for inherited CAG repeat length; herein ‘residual AAO’ as defined by the Genetic Modifiers of Huntington’s Disease (GeM-HD) Consortium.12 Specifically, residual AAO was calculated as the difference between the observed (AAO of diagnostic motor signs) and expected (based on CAG repeat size) AAO, representing years of deviation from the expectation.12 For example, a HD patient with a residual AAO of +4 indicates that he/she developed motor symptoms 4 years later than expected. Genetic association estimates for residual AAO in HD were obtained from GeM-HD Consortium GWAS meta-analysis including 9,064 HD patients of European ancestry (4,417 males; 4,647 females), which sought to discover disease-modifying loci that act prior to clinical diagnosis to delay onset.12 This GWAS revealed 8 common (minor allele frequency > 1%) genome-wide significant (P < 5×10−8) loci; thus demonstrating sufficient power for the current MR.12
Statistical analysis
The primary MR estimates were performed using fixed-effects inverse variance weighted (IVW) meta-analyses.19 IVW estimates might be biased in the presence of directional pleiotropy.19 As a measure of pleiotropy, we evaluated heterogeneity across individual SNP estimate in the IVW analyses with the Cochran Q test (P < 0.05 was deemed statistically significant). We also used alternative MR methods that are relatively robust to pleiotropic SNPs, including MR-Robust Adjusted Profile Score (MR-RAPS),20 weighted median21 and MR-Egger regression.22 The MR-RAPS method provides unbiased estimates in the presence of many weak instruments.20 The weighted median method allows an instrumental variable based on up to 50% potentially invalid SNPs.21 The MR-Egger regression can control for bias from unbalanced pleiotropy, at the cost of reduced power.22 The intercept from MR-Egger was used as an indicator of unbalanced pleiotropy (P < 0.05 was considered significance).22 We further examined the presence of pleiotropic outlier SNPs using the MR-Pleiotropy Residual Sum and Outlier (MR-PRESSO) method.23 The F-statistic was used to evaluate the strength of the instruments.24 All statistical analyses were undertaken in R (v4.0.2) using the MendelianRandomization,25 MR-PRESSO,23 and mr.raps packages.20
Results
The F statistic for each SNP was > 10, suggesting low risk of weak instrument bias (Supplementary Table 1). Genetically predicted 50% increase in coffee consumption was associated with an earlier AAO of HD (β = −1.84 years, 95% confidence interval (CI) = −3.47 to −0.22, P = 0.026; Figure 1). The MR-RAPS, weighted median, and MR-Egger analyses yield similar pattern of effect estimates, although with broader CIs due to lower statistical power (Figure 1). There was no evidence of heterogeneity (P for Cochran’s Q = 0.42) and unbalanced pleiotropy (P for MR-Egger intercept = 0.73). For the 14 SNPs, MR-PRESSO did not identify any significant potential outlier.
Figure 1.

Mendelian randomization estimates of coffee consumption and age at onset of Huntington’s disease.
An MR analyses using a genetic instrument restricted to two biologically relevant variants (rs2472297 and rs4410790) yielded similar results; genetically predicted 50% increase in coffee consumption was associated with an earlier AAO of HD (β = −2.09 years, 95% CI = −4.17 to −0.02, P = 0.048; Figure 1).
Discussion
By leveraging large-scale GWAS data, we employed the first MR to investigate the causal effects of coffee consumption on the AAO of HD. We found that genetically predicted 50% increase in coffee consumption was associated with about 2 years earlier AAO of HD.
Our genetic findings are consistent with the only observational study that has investigated the association between caffeine and AAO of HD.6 This study retrospectively evaluated caffeine consumption in 80 HD patients using a dietary survey, and suggested that pre-diagnosis caffeine consumption > 190 mg/day (~2 small cups of regular coffee) was significantly associated with 3.6 years earlier AAO of HD after adjusting for gender, smoking status and CAG repeat length.6 Our MR study protects against reverse causation and recall bias and thus strengthens these initial retrospective findings. Nevertheless, careful interpretations of our findings are warranted. Most of the coffee-SNPs, particularly those with large effect estimates are associated with increased coffee drinking behavior but lower biological exposure to caffeine according to GWAS for each trait. 18, 26 That is, higher coffee (caffeine) drinking is balanced by higher caffeine metabolism. Thus, our significant findings point to at least two causal hypotheses. First, higher biological exposure to caffeine may delay the onset of HD. Neurological research suggests caffeine exerts antioxidant effects and reduces oxidative stress and inflammation;27, 28 properties which may impact the onset and progression of HD in a beneficial way.29–31 Second, exposure to one or more non-caffeine constituents of coffee may accelerate the onset of HD. We are unaware of any compound in coffee that is potentially neurotoxic. Both hypotheses warrant further investigation.
Our MR-designed investigation presents with a number of strengths to complement and extent traditional epidemiological studies but is not without limitations. First, while we employed rigorous methods to address it, we cannot completely discount potential bias due to pleiotropy. Second, our study was based on data from populations of European ancestry and thus findings may not generalize to other populations. Our focus on European ancestry was due to the limited availability of summary-level data but also to protect against bias resulting from population stratification. Third, our genetic instrument for coffee was dominated by caffeine-metabolism related SNPs which partly complicates interpretation of the results. Coffee is a major source of caffeine but is also an important source of other bioactive constituents and an instrument reflecting other aspects of coffee drinking behavior would have been ideal.
In conclusion, our genetic findings add to preliminary research suggesting caffeine-containing beverages may impact the AAO of HD. However, interpretation of the specific results warrants caution and further investigation. Given widespread consumption of caffeine-containing beverages the potential for caffeine or coffee to impact HD merits further investigation.
Supplementary Material
Acknowledgement
This study was supported by the National Institute on Aging (K01AG053477 to M.C.C). We thank the GeM Euro 9K and GeM-HD Consortium for use of the GWAS summary statistics of age at onset in HD.
Footnotes
Potential Conflicts of Interest
None
References
- 1.The Huntington’s Disease Collaborative Research Group. A novel gene containing a trinucleotide repeat that is expanded and unstable on Huntington’s disease chromosomes. Cell. 1993;72(6):971–83. [DOI] [PubMed] [Google Scholar]
- 2.Walker FO. Huntington’s disease. Lancet. 2007;369(9557):218–28. [DOI] [PubMed] [Google Scholar]
- 3.Pringsheim T, Wiltshire K, Day L, Dykeman J, Steeves T, Jette N. The incidence and prevalence of Huntington’s disease: a systematic review and meta-analysis. Mov Disord. 2012;27(9):1083–91. [DOI] [PubMed] [Google Scholar]
- 4.Ross CA, Tabrizi SJ. Huntington’s disease: from molecular pathogenesis to clinical treatment. Lancet Neurol. 2011;10(1):83–98. [DOI] [PubMed] [Google Scholar]
- 5.Wexler NS, Lorimer J, Porter J, et al. Venezuelan kindreds reveal that genetic and environmental factors modulate Huntington’s disease age of onset. Proc Natl Acad Sci U S A. 2004;101(10):3498–503. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Simonin C, Duru C, Salleron J, et al. Association between caffeine intake and age at onset in Huntington’s disease. Neurobiol Dis. 2013;58:179–82. [DOI] [PubMed] [Google Scholar]
- 7.O’Donnell CJ, Sabatine MS. Opportunities and Challenges in Mendelian Randomization Studies to Guide Trial Design. JAMA Cardiol. 2018;3(10):967. [DOI] [PubMed] [Google Scholar]
- 8.Smith GD, Ebrahim S. ‘Mendelian randomization’: can genetic epidemiology contribute to understanding environmental determinants of disease? Int J Epidemiol. 2003;32(1). [DOI] [PubMed] [Google Scholar]
- 9.Burgess S, Thompson SG. Mendelian Randomization: Methods for Using Genetic Variants in Causal Estimation. London, UK: Chapman & Hall/CRC Press. 2015. [Google Scholar]
- 10.Davey Smith G, Holmes MV, Davies NM, Ebrahim S. Mendel’s laws, Mendelian randomization and causal inference in observational data: substantive and nomenclatural issues. Eur J Epidemiol. 2020;35(2). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Zhong VW, Kuang A, Danning RD, et al. A genome-wide association study of bitter and sweet beverage consumption. Hum Mol Genet. 2019;28(14):2449–57. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Genetic Modifiers of Huntington’s Disease (GeM-HD) Consortium. CAG Repeat Not Polyglutamine Length Determines Timing of Huntington’s Disease Onset. Cell. 2019;178(4). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Cornelis MC, Byrne EM, Esko T, et al. Genome-wide meta-analysis identifies six novel loci associated with habitual coffee consumption. Mol Psychiatry. 2015;20(5):647–56. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Nordestgaard AT, Nordestgaard BG. Coffee intake, cardiovascular disease and all-cause mortality: observational and Mendelian randomization analyses in 95 000–223 000 individuals. Int J Epidemiol. 2016;45(6):1938–52. [DOI] [PubMed] [Google Scholar]
- 15.Sulem P, Gudbjartsson DF, Geller F, et al. Sequence variants at CYP1A1-CYP1A2 and AHR associate with coffee consumption. Hum Mol Genet. 2011;20(10):2071–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Kot M, Daniel WA. The relative contribution of human cytochrome P450 isoforms to the four caffeine oxidation pathways: an in vitro comparative study with cDNA-expressed P450s including CYP2C isoforms. Biochem Pharmacol. 2008;76(4):543–51. [DOI] [PubMed] [Google Scholar]
- 17.Le Vee M, Jouan E, Fardel O. Involvement of aryl hydrocarbon receptor in basal and 2,3,7,8-tetrachlorodibenzo-p-dioxin-induced expression of target genes in primary human hepatocytes. Toxicol In Vitro. 2010;24(6):1775–81. [DOI] [PubMed] [Google Scholar]
- 18.Cornelis MC, Kacprowski T, Menni C, et al. Genome-wide association study of caffeine metabolites provides new insights to caffeine metabolism and dietary caffeine-consumption behavior. Hum Mol Genet. 2016;25(24):5472–82. [DOI] [PubMed] [Google Scholar]
- 19.Burgess S, Butterworth A, Thompson SG. Mendelian randomization analysis with multiple genetic variants using summarized data. Genet Epidemiol. 2013;37(7):658–65. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Zhao Q, Wang J, Hemani G, Bowden J, Small DS. Statistical inference in two-sample summary-data Mendelian randomization using robust adjusted profile score. The Annals of Statistics. 2020;48(3):1742–69, 28. [Google Scholar]
- 21.Bowden J, Davey Smith G, Haycock PC, Burgess S. Consistent Estimation in Mendelian Randomization with Some Invalid Instruments Using a Weighted Median Estimator. Genet Epidemiol. 2016;40(4):304–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Bowden J, Davey Smith G, Burgess S. Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression. Int J Epidemiol. 2015;44(2):512–25. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Verbanck M, Chen C-Y, Neale B, Do R. Detection of widespread horizontal pleiotropy in causal relationships inferred from Mendelian randomization between complex traits and diseases. Nat Genet. 2018;50(5):693–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Pierce BL, Ahsan H, Vanderweele TJ. Power and instrument strength requirements for Mendelian randomization studies using multiple genetic variants. Int J Epidemiol. 2011;40(3):740–52. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Yavorska OO, Burgess S. MendelianRandomization: an R package for performing Mendelian randomization analyses using summarized data. Int J Epidemiol. 2017;46(6):1734–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Cornelis MC, Munafo MR. Mendelian Randomization Studies of Coffee and Caffeine Consumption. Nutrients. 2018;10(10). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Kolahdouzan M, Hamadeh MJ. The neuroprotective effects of caffeine in neurodegenerative diseases. CNS Neurosci Ther. 2017;23(4):272–90. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Badshah H, Ikram M, Ali W, Ahmad S, Hahm JR, Kim MO. Caffeine May Abrogate LPS-Induced Oxidative Stress and Neuroinflammation by Regulating Nrf2/TLR4 in Adult Mouse Brains. Biomolecules. 2019;9(11). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Browne SE, Ferrante RJ, Beal MF. Oxidative stress in Huntington’s disease. Brain Pathol. 1999;9(1):147–63. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Stack EC, Matson WR, Ferrante RJ. Evidence of oxidant damage in Huntington’s disease: translational strategies using antioxidants. Ann N Y Acad Sci. 2008;1147:79–92. [DOI] [PubMed] [Google Scholar]
- 31.Kumar A, Ratan RR. Oxidative Stress and Huntington’s Disease: The Good, The Bad, and The Ugly. J Huntingtons Dis. 2016;5(3):217–37. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
