Abstract
While polygenic risk scores (PRS) have been shown to predict many diseases and risk factors, the potential of genomic prediction in harm caused by alcohol use has not yet been extensively studied. Here, we built a novel polygenic risk score of 1.1 million variants for alcohol consumption and studied its predictive capacity in 96,499 participants from the FinnGen study and 39,695 participants from prospective cohorts with detailed baseline data and up to 25 years of follow-up time. A 1 SD increase in the PRS was associated with 11.2 g (=0.93 drinks) higher weekly alcohol consumption (CI = 9.85–12.58 g, p = 2.3 × 10–58). The PRS was associated with alcohol-related morbidity (4785 incident events) and the risk estimate between the highest and lowest quintiles of the PRS was 1.83 (95% CI = 1.66–2.01, p = 1.6 × 10–36). When adjusted for self-reported alcohol consumption, education, marital status, and gamma-glutamyl transferase blood levels in 28,639 participants with comprehensive baseline data from prospective cohorts, the risk estimate between the highest and lowest quintiles of the PRS was 1.58 (CI = 1.26–1.99, p = 8.2 × 10–5). The PRS was also associated with all-cause mortality with a risk estimate of 1.33 between the highest and lowest quintiles (CI = 1.20–1.47, p = 4.5 × 10–8) in the adjusted model. In conclusion, the PRS for alcohol consumption independently associates for both alcohol-related morbidity and all-cause mortality. Together, these findings underline the importance of heritable factors in alcohol-related health burden while highlighting how measured genetic risk for an important behavioral risk factor can be used to predict related health outcomes.
Subject terms: Personalized medicine, Addiction
Introduction
Alcohol drinking is a major dose-dependent contributor to morbidity and mortality. Globally, 3 million annual deaths (5% of all deaths) result from alcohol consumption, and is also linked to more than 200 disease and injury outcomes1. As ethanol is a psychoactive substance with addictive properties2, alcohol consumption can lead to the development of alcohol use disorders (AUDs), globally prevalent mental disorders of pathological addictive or abusive drinking patterns, which are linked to worse health outcomes, negative socioeconomic effects, and increased mortality3. There is a strong connection between the health burden and the level of alcohol consumed4, and in total, alcohol has been estimated to be the most damaging of all substances of abuse, in terms of harm caused to self and others5.
Alcohol-related behaviors are also affected by genetic factors and the estimated heritability of alcohol consumption in twin studies has ranged between 35% and 65% (weighted average 37%)6 and its single nucleotide polymorphism-based heritability has been estimated to be 10%7. Recent large-scale genome-wide association studies (GWAS) have identified multiple loci associated with alcohol consumption, underlining the importance of large study populations for unraveling the genetic architecture underlying alcohol-related traits7,8. Similarly, GWAS of alcohol dependence, AUD, and the Alcohol Use Disorders Identification Test (AUDIT) scores have shown the traits to be genetically distinct but positively correlated9–11.
Polygenic risk scores (PRSs) derived from GWAS summary statistics have showcased improved performance in disease prediction12. PRSs for known risk factors have also been shown to associate with the related disease13, and recently associations between multiple risk factor PRSs and related traits were confirmed and reported14,15. However, the link between PRSs for behavioral traits and associated health outcomes remains poorly understood.
The assessment of potential health risks related to alcohol has so far relied on traditional risk factors, including family history, without explicit measurement of genetic risk. Here we developed a highly polygenic risk score for alcohol consumption and studied whether alcohol-related polygenic burden predicts alcohol-use disorders and other alcohol-related morbidity and mortality in Finnish biobank cohorts (n = 96,499) linked to electronic health records. Furthermore, we studied whether the PRS for alcohol consumption predicts alcohol-related outcomes beyond self-reported alcohol consumption and other related risk factors, thus providing more objective information independent of individual reporting bias or temporal fluctuations.
Methods
Study sample and definition of alcohol-related morbidity
The data are comprised of 96,499 Finnish individuals from FinnGen Data Freeze 2 (https://www.finngen.fi/), which includes prospective epidemiological and disease-based cohorts as well as hospital biobank samples (Contributors S1, Table S1). The data were linked by the unique national personal identification numbers to national hospital discharge, death, and medication reimbursement registries. Additional details and information on the genotyping and imputation are provided in the online-only Supplementary Information.
Alcohol-related baseline measures were available for a subset of the FinnGen dataset consisting of national population survey cohorts: FINRISK, collected in 1992, 1997, 2002, 2007, and 2012 and Health 2000, collected in 2000. The baseline data included self-reported information assessed by questionnaires, anthropometric measures, and blood samples. More detailed descriptions of the FINRISK and Health 2000 studies have been published previously16,17.
Additionally, three Finnish twin cohorts, FinnTwin12, NAG-FIN, and Old Twin, were pooled and analyzed as one dataset. For these datasets, cohort baseline data were available, but the cohorts were not linked to electronical health records. For details regarding the twin datasets, see the online descriptions (https://wiki.helsinki.fi/display/twineng/Twinstudy)18,19.
Using nationwide registries for deaths (1969–2016), hospital discharges (1969–2016), outpatient specialist appointments (1998–2016), and drug purchases (1995–2016), we combined 21 somatic and psychiatric alcohol-related diagnoses into a composite disease endpoint, harmonizing the International Classification of Diseases (ICD) revisions 8, 9, and 10, and ATC-codes (Table S2). These registries spanning decades were electronically linked to the cohort baseline data using the unique national personal identification numbers assigned to all Finnish citizens and residents. The final alcohol-related morbidity endpoint was defined as the first single event coded as any of the conditions in the composite endpoint.
Genotyping and imputation
FinnGen, FINRISK, Health 2000, and Finnish Twin Cohort samples were genotyped with Illumina and Affymetrix genomewide SNP arrays. Individuals with non-European ancestry or uncertain sex were excluded. Within each cohort, every genotyping batch was first imputed separately and then merged together for association analyses. The details about the genotype calling, quality controls, and imputation are provided in the Supplementary material (Methods S1).
Polygenic risk scores
Summary statistics from the largest existing GWAS meta-analysis on alcohol consumption (8) were used for constructing the PRS. To avoid overfitting, a separate ad hoc meta-analysis was performed by GSCAN (Contributors S2), excluding all Finnish and 23andMe samples (n = 527,282 after exclusions). LDpred-method20 was used to account for linkage disequilibrium (LD) among loci with whole-genome sequencing data on 2690 Finns serving as the external LD reference panel. We compared the PRSs generated with LDpred-parameters and their predictive ability in FINRISK (Fig S1). Any threshold above 0.003 worked practically similarly, and for simplicity we chose to use the LDpred-inf PRS in all the analyses. The final scores were generated with PLINK2 (ref. 21) by calculating the weighted sum of risk allele dosages for each variant. The number of variants in the final scores was 1.1 million (1,134,960 in FinnGen, 1,143,220 in FINRISK and Health 2000, and 1,143,138 in the Twin Cohort).
Statistical analysis
The Cox proportional hazard model was used to estimate survival curves, hazard ratios (HRs), and 95% confidence interval (95% CI) in the survival analyses where age was used as the time scale. R’s cox.zph function was used to test whether the proportional assumption criteria applied in our models. Linear regression in FINRISK and Health 2000 and linear mixed model in the Twin Cohort was used for estimating the relationship between the PRS and alcohol consumption. Logistic regression in the FINRISK and Health 2000 cohorts and linear mixed model in the Twin Cohort was used to estimate the relationship between alcohol abstinence and the PRS.
All the cohorts (FinnGen, FINRISK, Health 2000, and the Twin Cohort) were analyzed independently as single datasets where age, sex, genotyping array, and the first ten principal components of ancestry were used as core covariates. Additionally, body mass was used as a covariate in the model estimating the PRS–alcohol consumption relationship. Self-reported weekly average alcohol consumption from the past year (when unavailable, the past week’s consumption) was used as the estimate for alcohol consumption. In the fully adjusted survival model analyses, log(x + 1) -transformed alcohol consumption-estimate, current smoking status, binary higher education status, binary marital/cohabitation status, and gamma-glutamyl transferase (GGT) blood levels at baseline served as covariates. The GGT levels were measured following uniform recommendations of the European Committee for Clinical Laboratory Standards (ECCLS)22 enabling comparability between the cohorts.
In the survival analyses, all prevalent cases (in FINRISK and Health 2000) and individuals with covariate missingness were excluded. The PRS was normalized and included as a continuous variable in the models. In the survival analysis, the highest and lowest genetic risk for alcohol consumption were compared using PRS quintiles.
In analyses using baseline consumption data, the analyses were performed separately in the Health 2000, FINRISK Study, and Twin Cohorts and then meta-analyzed using fixed effects model.
In risk prediction, FINRISK cohorts with at least 10 years of follow-up (from 1992 to 2002) were used to train the model, and the predictive performance was tested in the Health 2000 cohort. The maximal follow-up window was restricted to 10 years. The change in the predictive performance was assessed by comparing models with and without the PRS using the correlated C-index approach23 along with calculating the continuous reclassification improvement (NRI)24 and integrated discrimination improvement (IDI)25. The Hosmer–Lemeshow goodness-of-fit test was used to test model calibration.
Ethical approval
The study was conducted in accordance with the principles of the Helsinki declaration. Written informed consent was obtained from all the study participants. For the Finnish Institute of Health and Welfare (THL)-driven FinnGen preparatory project and FinnGen project, all patients and control subjects had provided informed consent for biobank research, based on the Finnish Biobank Act. Alternatively, FINRISK and Health 2000 cohorts were based on study specific consents and later transferred to the THL Biobank after approval by Valvira, the National Supervisory Authority for Welfare and Health. Recruitment protocols followed the biobank protocols approved by Valvira. The Biobank Access Decisions for FinnGen samples and data utilized in FinnGen Data Freeze 2 include: Auria Biobank AB17-5154, THL Biobank BB2017_55, BB2017_111, BB2018_19, BB_2018_34, Finnish Red Cross Blood Service Biobank 7.12.2017, Helsinki Biobank HUS/359/2017 and Northern Finland Biobank Borealis BB_2017_1013. The Ethical Review Board of the Hospital District of Helsinki and Uusimaa approved the FinnGen study protocol Nr HUS/990/2017. The FinnGen preparatory project as well as the FinnGen project was approved by THL, approval numbers THL/2031/6.02.00/2017, amendments THL/341/6.02.00/2018, THL/2222/6.02.00/2018, THL/1101/5.05.00/2017, VRK43431/2017-3, KELA 131/522/2018, and Statistics Finland TK-53-1041-17. The Twin Cohort studies were approved by the Coordinating Ethical Committee of the Helsinki and Uusimaa Hospital District, reference numbers 246/13/03/00/15, 113/E3/2001, and HUS/1169/2016. The transfer of the FINRISK and Health 2000 sample collections to the THL biobank has been approved by the Coordinating Ethics Committee of Helsinki University Hospital on 10 October 2014 and by the Ministry of Social Affairs and Health on 9 March 2015. This study was conducted under the THL biobank permission BB2017_64 (FINRISK and Health 2000). No additional ethical approval was needed for meta-analyzing the results. All DNA samples and data in this study were pseudonymized.
Results
Cohorts
Our primary dataset (FinnGen) is comprised of 96,499 unrelated individuals (54,262 women) with a total of 55,484,114 person-years of registry-based follow-up and 4785 first-observed alcohol-related major health events. Alcohol consumption estimates were available for a total of 39,695 individuals from the prospective cohorts (FINRISK, Health 2000, and Twin Cohort, Fig. S2). Two cohorts, FINRISK and Health 2000, have full registry data and information on self-reported alcohol consumption and related baseline data, and consist of 28,639 individuals (94.5% of the participants after excluding 964 prevalent alcohol-related morbidity cases), with 424,053 person-years of registry-based follow-up and 988 first ever alcohol-related events (Table 1). The interview-based DSM-IV AUD-status was available in a subset of the Twin cohort for 713 cases and 1460 controls.
Table 1.
FinnGen | FINRISK | Health 2000 | Twin Cohort | |
---|---|---|---|---|
N | 96,499 (54,262 women) | 23,824 (12,513) | 5945 (3260) | 9926 (5036) |
N (incident events) | 4785 | 817 | 171 | NA |
Age (years) | 57.5 (end of follow-up) | 48.6 (baseline) | 54.0 (baseline) | 49.3 (baseline) |
Alcohol drinking (g/week) | NA | 76.5 | 73.9 | 84.8 |
Non-drinkers | NA | 2874 (12%) | 1298 (22%) | 30 (0.3%) |
Current smokers | NA | 5929 (25%) | 1538 (26%) | 3478 (35%) |
Higher education | NA | 8612 (36%) | 1721 (28%) | 1192 (12%) |
Marriage or co-habitation | NA | 17,468 (73%) | 4151 (68%) | 6661 (67%) |
GGT (U/I) | NA | 33.8 | 36.6 | NA |
Alcohol consumption
In a meta-analysis of the three cohorts with alcohol consumption estimates available (n = 39,695), the PRS for alcohol consumption was strongly associated with self-reported alcohol consumption. A 1 SD increase in the PRS was associated with an 11.2 g (=0.93 drinks á 12 g) increase in weekly pure alcohol intake (beta = 11.2 [9.85–12.6 g], p = 2.3 × 10–58) (Fig. 1, cohort-specific figures: Fig. S3). Adding the PRS to the model improved r2 by ~0.6 percentage points (from 9.17% to 9.80%). In addition, the PRS was negatively associated with alcohol abstinence (reported alcohol consumption 0). In FINRISK and Health2000, a 1 SD increase in the PRS for alcohol consumption was associated with a 13.7% reduced likelihood of being a nondrinker (OR = 0.863 [0.833–0.895], p = 6.1 × 10–16) while this was not the case in the Twin Cohort where there were only 30 nondrinkers (OR = 0.999 [0.998–1.00] p = 0.31).
Alcohol-related morbidity
The PRS for alcohol consumption was strongly associated with increased risk for lifelong major alcohol-related events derived from electronic health-records in the FinnGen dataset (n = 96,499, cases = 4785) (Fig. 2). The difference in the risk for alcohol-related morbidity events between the lowest and highest risk quintiles in the PRS was 83% (HR = 1.83 [1.66–2.01], p = 1.6 × 10–36) and a 1 SD increase in the PRS was associated with a 26% increase in risk (HR = 1.26 [1.23–1.30], p = 5.7 × 10–56). The association was similar in both males (HR = 1.26 [1.22–1.31], p = 1.3 × 10–38) and females (HR = 1.27 [1.20–1.33], p = 4.8 × 10–19).
In the cohorts where alcohol consumption estimates and other related baseline data were available at the cohort entry time, the PRS was associated with an increased risk of incident major alcohol-related events and the association was maintained also in the fully adjusted model (n = 28,639, cases = 911). In a meta-analysis of the two cohorts, 1 PRS SD was associated with a 26% increased risk of incident alcohol-related events when the consumption-estimate was not in the model (HR = 1.26 [1.18–1.34], p = 1.1 × 0–12) and with a 15% increase when alcohol consumption was in the model (HR = 1.15 [1.08–1.22], p = 2.1 × 10–5). In a fully adjusted model, including marital status, education, smoking status, and GGT, the estimate was unchanged (HR = 1.15 [1.08–1.22], p = 2.0 × 10–5) (Table 2). The risk estimate between the highest and lowest quintiles of the PRS in the fully adjusted model was 1.58 (HR = 1.58 [1.26–1.99], p = 8.2 × 10–5).
Table 2.
FINRISK | Health 2000 | Meta-analysis | |
---|---|---|---|
(a) Alcohol-related morbidity | Cases = 817 | Cases = 171 | Cases = 988 |
Basic model with age and sex | HR = 1.25 [1.16–1.34], p = 5.9 × 10–10 | HR = 1.32 [1.13–1.53], p = 0.00036 | HR = 1.26 [1.18–1.34], p = 1.1 × 10–12 |
Model with alcohol consumption | HR = 1.13 [1.06–1.21], p = 0.00053 | HR = 1.23 [1.06–1.43], p = 0.0081 | HR = 1.15 [1.08–1.22], p = 2.1 × 10–5 |
Fully adjusted model | HR = 1.14 [1.06–1.22], p = 0.00027 | HR = 1.20 [1.03–1.4], p = 0.022 | HR = 1.15 [1.08–1.22], p = 2.0 × 10–5 |
(b) Alcohol-related mortality | Deaths = 264 | Deaths = 71 | Deaths = 335 |
Basic model with age and sex | HR = 1.21 [1.07–1.37], p = 0.0022 | HR = 1.41 [1.11-1.8], p = 0.0045 | HR = 1.25 [1.12-1.4], p = 5.9 × 10–5 |
Model with alcohol consumption | HR = 1.08 [0.952–1.22], p = 0.24 | HR = 1.34 [1.05–1.71], p = 0.017 | HR = 1.13 [1.01–1.26], p = 0.033 |
Fully adjusted model | HR = 1.08 [0.957–1.23], p = 0.21 | HR = 1.22 [0.965–1.55], p = 0.096 | HR = 1.11 [0.996–1.24], p = 0.058 |
In the fully adjusted model, age, sex, alcohol consumption, smoking, education, marital status, and GGT (U/I) were used as non-genetic covariates
Mortality
We observed a similar increase in the risk of alcohol-related and all-cause mortality. In FinnGen with 7249 deaths, 1 SD increase in the PRS for alcohol consumption was associated with 8% increase in the risk of death (HR = 1.08 [1.06–1.11], p = 2.0 × 10–11). The risk estimate between the highest and lowest 20% in the PRS was 1.27 (HR = 1.27 [1.18–1.37], p = 2.1 × 10–10). In our prospective cohorts, with cause-of-death information available, 4125 deaths were recorded. For all-cause mortality, there was 11% increase in the risk of death per 1 PRS SD in the basic model (HR = 1.11 [1.07–1.14], p = 3.2 × 10–10) and 9% in the fully adjusted model (HR = 1.09 [1.06–1.12], p = 1.1 × 10–7). The risk difference between the highest and lowest quintiles of the PRS was 33% (HR = 1.33 [1.2–1.47], p = 4.5e−08) in the fully adjusted model.
Of the 4125 deaths, 335 were known to be alcohol-related. Without alcohol consumption in the model, the increase in alcohol-related mortality was 26% per 1 PRS SD (HR = 1.26 [1.13–1.4], p = 3.7 × 10–5). When alcohol consumption was included in the model, the increase was 13% (HR = 1.13 [1.01–1.26], p = 0.027) and in a model with all co-variates, 11% (HR = 1.11 [0.996–1.24], p = 0.058) (Table 2). Similarly, the PRS was associated with a higher risk of death from other than alcohol-related causes (n = 3790) when fully adjusted for all covariates (HR = 1.08 [1.05–1.12], p = 1.4 × 10–6).
DSM-IV alcohol-use disorder
The PRS was also associated with an interview-based DSM-IV AUD diagnosis in the Nicotine Addiction Genetics Family cohort (440 cases, 1140 controls) and a subset of FinnTwin16 cohort (273 cases, 320 controls). A meta-analysis of the two cohorts (713 cases) resulted in a combined 20% increase in the prevalence of AUD per 1 PRS SD (OR = 1.20 [1.11–1.31], p = 2.29 × 10–5) in the unadjusted model. Adjusting for marital status, education, and smoking explained part of the effect (OR = 1.14 [1.02–1.28], p = 0.023) and further adjusting with maximal amount of drinks taken explained most of the effect (OR = 1.06 [0.94–1.19], p = 0.35).
Prediction
The predictive performance of the PRS was evaluated in the Health 2000 cohort (5732 complete cases, 110 events) with a follow-up-window of 10 years based on the Cox model trained in the FINRISK cohort (18,427 complete cases with ≥ 10 years of follow-up, 628 events). In a model not including the alcohol consumption estimate, adding the PRS to the model increased the C-index by 0.020, from 0.69 to 0.71 (p = 0.017). Both IDI (0.00242 [0.00102–0.00383], p = 7.3 × 10–4) and NRI (0.335 [0.146–0.523], p = 5.1 × 10–3) shifts were positive and statistically significant. When the log-transformed alcohol consumption estimate was included, a minimal improvement of prediction was observed (C-index = 0.0022 from 0.812 to 0.814, p-value = 0.30; NRI = 0.308 [0.119–0.497], p = 0.0014 and IDI = 0.00173 [0.000726–0.00305], p = 0.017). Similarly, a minimal gain was observed when adding PRS to a model with all available covariates including also marital status, education status, smoking status, and GGT (C-index = 0.00183 from 0.847 to 0.849, p = 0.44; NRI = 0.235 [0.0461–0.423], p = 0.015; IDI = 0.00331 [0.0000254–0.00659], p = 0.048).
Discussion
We developed a highly polygenic risk score for alcohol consumption by obtaining weights from a recently published large-scale discovery sample and showed that the PRS was strongly associated with alcohol consumption in independent biobank cohort samples. An increased polygenic burden for alcohol consumption was associated with higher incidence of major alcohol-induced health events. The associations remained significant when we accounted for self-reported alcohol consumption and other relevant covariates; in a fully adjusted model, the relative risk-estimate between the highest and lowest quintiles of the PRS was 1.6. Furthermore, the PRS was also associated with both alcohol-related, non-alcohol related, and all-cause mortality.
Our PRS shows the utility of genetic information for prediction of alcohol-related harm. The PRS, developed from a genetic analysis of cross-sectional self-reported alcohol consumption, was associated with future risk of major alcohol-related health events. While a large number of PRSs have already been established for various traits and diseases12, the development of PRSs for behavioral traits, such as substance use, has until now been limited26–29 and the studies have not assessed their impact on future major health events.
Our results show that using a large sample size with long follow-up, we were able to build a PRS of alcohol consumption that is associated not only with alcohol consumption in independent samples, but also with future incident alcohol-related health events. In line with the knowledge that alcohol consumption is a major contributor to the worldwide burden of death, especially among working-age adults1, we found the PRS to be associated also with all-cause mortality, further highlighting the importance of alcohol drinking as a cause of premature death.
Our score provides a genetic basis for potentially identifying a subset of high-risk individuals even early on in life, with potential for more targeted prevention of AUDs and other alcohol-related morbidity. Prevention is a cost-effective and efficient strategy to reduce alcohol-related harms30 and it is labeled one of the United Nations main health-related worldwide strategies of sustainable development (https://sustainabledevelopment.un.org/sdg3). A higher genetic predisposition for alcohol-related harms was detected both in the presence and absence of alcohol consumption data, as our PRS predicted alcohol-related harms beyond self-reported alcohol consumption. Health services are encouraged to support initiatives for screening and brief interventions for harmful drinking31 as an effective strategy for tackling alcohol-related harm32. Thus, genetic information could potentially be used to improve the arsenal of possible strategies to detect high-risk individuals for targets of brief interventions. The fact that individuals in the highest PRS quintile showed an elevated risk for alcohol-related health events even in fully adjusted models could justify the use of genetic information even in clinical settings where a detailed history of alcohol consumption estimates, AUDIT-scores, or similar information are attainable. Communicating the information of higher risk for alcohol-related harm to patients could serve as a motivator for reducing drinking or committing to abstinence. However, the true effects of informing patients about their alcohol-related genomic risk warrants further research.
Self-reported alcohol consumption is known to be biased and problematic in terms of reliability and validity for predicting alcohol-related risks33,34. Also, GGT is known to be less-than-ideal biochemical measure of drinking35. Some inaccuracy derives from true measurement error, but another source is the lifelong temporal fluctuation of alcohol-drinking patterns not captured by a measure at one single timepoint. Our PRS was associated with alcohol-related harms even when adjusting for self-reported alcohol consumption estimate. One potential reason for this is that the PRS contains information from the latent genetic predisposition for alcohol consumption, thus overriding both the true measurement error and temporal fluctuations in alcohol drinking volume.
Furthermore, it has been hypothesized that alcohol consumption-based genetic discovery might inform more about low-level drinking than about problematic drinking and AUDs36. However, we built a PRS for alcohol consumption and successfully used it to predict alcohol-related harms. Due to the robustness of a self-reported single timepoint alcohol consumption estimate and the fact that different alcohol-related traits are to some degree genetically distinct9–11, it is expected that a PRS developed directly for alcohol-related morbidity will outperform our PRS in predicting alcohol-related health burden. Supporting this assumption, the general pattern is that PRSs are more strongly associated with their respective diseases than with related phenotypes.14,15. Unfortunately, no high-quality summary statistics for alcohol-related harms including both somatic and psychiatric outcomes yet exist; the performed GWAS have only covered AUD and alcohol dependence10,11 and been smaller in size than our discovery sample of choice, thus making future efforts for large-scale GWAS discovery based on alcohol-related harms more than necessary.
Our PRS was derived using European ancestry discovery samples and tested in the Finnish population. Its applicability in other populations therefore needs further evaluation as the alcohol-related genetic mechanisms may vary between populations. However, it has to be noted that the PRS derived from a non-Finnish sample performed well in the Finnish dataset, even though Finns are somewhat genetically different from the rest of the Europeans37.
Our design allowed us to study outcomes prospectively. Our registry-based follow-up captures alcohol-related outpatient and inpatient visits, withdrawal treatment prescription for alcoholism, and deaths, thus covering major alcohol-related health events over several decades. Nonetheless, some of the milder cases of alcohol-related health problems could have gone undetected.
In conclusion, a PRS for alcohol consumption was associated with elevated risk for incident alcohol-related health events and all-cause mortality. These findings underline the importance of heritable factors driving alcohol-related behavior. A successful attempt to predict alcohol-related health outcomes with a PRS shows promise in possible future utilization of genetic information in risk estimation and prediction of alcohol-related harms.
Supplementary information
Acknowledgements
We would like to thank Lea Urpa for proofreading, and Sari Kivikko, Huei-Yi Shen, and Ulla Tuomainen for management assistance. We would like to thank all participants of the study cohorts for their generous participation. SRi was supported by the Academy of Finland Center of Excellence in Complex Disease Genetics (Grant No. 312062), Academy of Finland (Grant No. 285380), the Finnish Foundation for Cardiovascular Research, the Sigrid Juselius Foundation, and University of Helsinki HiLIFE Fellow grant. The funding agencies had no role in the design and conduct of the study; collection, analysis, and interpretation of data; or the writing of the manuscript or the decision to submit it for publication. The FinnGen project is funded by two grants from Business Finland (HUS 4685/31/2016 and UH 4386/31/2016) and nine industry partners (AbbVie, AstraZeneca, Biogen, Celgene, Genentech, GSK, MSD, Pfizer, and Sanofi). Following biobanks are acknowledged for collecting the FinnGen project samples: Auria Biobank (https://www.auria.fi/biopankki/en), THL Biobank (https://thl.fi/fi/web/thl-biopankki), Helsinki Biobank (https://www.terveyskyla.fi/helsinginbiopankki/en), Northern Finland Biobank Borealis (https://www.ppshp.fi/Tutkimus-ja-opetus/Biopankki), Finnish Clinical Biobank Tampere (https://www.tays.fi/en-US/Research_and_development/Finnish_Clinical_Biobank_Tampere), Biobank of Eastern Finland (https://ita-suomenbiopankki.fi/), Central Finland Biobank (https://www.ksshp.fi/fi-FI/Potilaalle/Biopankki), Finnish Red Cross Blood Service Biobank (https://www.bloodservice.fi/Research%20Projects/biobanking). The Finnish Twin Cohort Nicotine Addictions Genetics family study has been supported by NIH DA12854 to P.A.F.M., genotyping in the twin cohort by Global Research Awards for Nicotine Dependence (GRAND) funded by Pfizer Inc. to J.Ka. and the Welcome Trust Sanger Institute, and the Finntwin16 study by NIH AA-12502, AA-00145, and AA-09203 to R.J.R. J.Ka. has been supported by the Academy of Finland (grants 265240, 263278, 308248, 312073). A.S.H. was supported by the Academy of Finland (Grant no. 321356). T.K., J.T.R., S.Ru., and P.R. were supported by the Doctoral Programme in Population Health, University of Helsinki. J.T.R. was supported by the MD/PhD Program of the Faculty of Medicine, University of Helsinki.
Data availability
The FinnGen data may be accessed through Finnish Biobanks’ FinnBB portal (www.finbb.fi) and THL Biobank data through THL Biobank (https://thl.fi/en/web/thl-biobank). Summary statistics from Liu et al. (2019) “Association Studies of up to 1.2 million individuals yield new insights into the genetic etiology of tobacco and alcohol use” published in Nature Genetics are archived at the University of Minnesota library (https://conservancy.umn.edu/handle/11299/20156).
Code availability
The full genotyping and imputation protocol for FinnGen is described at https://doi.org//10.17504/protocols.io.nmndc5e.
Conflict of interest
A.P. reports that part of his salary is received from the FinnGen project that is partially funded by nine industry partners AbbVie, AstraZeneca, Biogen, Celgene, Genentech, GSK, Merck/MSD, Pfizer, and Sanofi. J.Ka. reports grants from Academy of Finland, US-PHS NIH/NIDA, US-PHS NIH/NIAAA, Pfizer Inc./GRAND program grant, and EU FP7 during the conduct of the study. V.S. reports a conference trip and an honorarium for participating in an advisory board meeting from Novo Nordisk (unrelated to the present study) and a grant to his institute for research collaboration from Bayer (unrelated to the present study). No other relationships or activities that could appear to have influenced the submitted work.
Footnotes
Members of the FinnGen project and the GSCAN Consortium and the affiliations are provided in the Supplementary File.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Supplementary Information accompanies this paper at (10.1038/s41398-019-0676-2).
References
- 1.World Health Organization. Global Status Report on Alcohol And Health 2018. (WHO, 2018).
- 2.Vengeliene V, Bilbao A, Molander A, Spanagel R. Neuropharmacology of alcohol addiction. Br. J. Pharmacol. 2008;154:299–315. doi: 10.1038/bjp.2008.30. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Grant BF, et al. Epidemiology of DSM-5 alcohol use disorder: results from the National Epidemiologic Survey on Alcohol and Related Conditions III. JAMA Psychiatry. 2015;72:757–766. doi: 10.1001/jamapsychiatry.2015.0584. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Wood AM, et al. Risk thresholds for alcohol consumption: combined analysis of individual-participant data for 599 912 current drinkers in 83 prospective studies. Lancet. 2018;391:1513–1523. doi: 10.1016/S0140-6736(18)30134-X. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Nutt DJ, King LA, Phillips LD. Drug harms in the UK: a multicriteria decision analysis. Lancet. 2010;376:1558–1565. doi: 10.1016/S0140-6736(10)61462-6. [DOI] [PubMed] [Google Scholar]
- 6.Dick, D. M. et al. The genetics of substance use and substance use disorders. in Handbook of Behavior Genetics (eds Kim, Y.) pp. 436 (Springer, 2009).
- 7.Clarke T, et al. Genome-wide association study of alcohol consumption and genetic overlap with other health-related traits in UK Biobank (N = 112 117) Mol. Psychiatry. 2017;22:1376. doi: 10.1038/mp.2017.153. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Liu, M. et al. Association studies of up to 1.2 million individuals yield new insights into the genetic etiology of tobacco and alcohol use. Nat. Genet. 51, 237–244 (2019). [DOI] [PMC free article] [PubMed]
- 9.Sanchez-Roige S, et al. Genome-wide association study meta-analysis of the Alcohol Use Disorders Identification Test (AUDIT) in two population-based cohorts. Am. J. Psychiatry. 2018;2018:18040369. doi: 10.1176/appi.ajp.2018.18040369. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Kranzler HR, et al. Genome-wide association study of alcohol consumption and use disorder in 274,424 individuals from multiple populations. Nat. Commun. 2019;10:1499. doi: 10.1038/s41467-019-09480-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Walters RK, et al. Transancestral GWAS of alcohol dependence reveals common genetic underpinnings with psychiatric disorders. Nat. Neurosci. 2018;21:1656. doi: 10.1038/s41593-018-0275-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Khera AV, et al. Genome-wide polygenic scores for common diseases identify individuals with risk equivalent to monogenic mutations. Nat. Genet. 2018;50:1219. doi: 10.1038/s41588-018-0183-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Holmes MV, et al. Mendelian randomization of blood lipids for coronary heart disease. Eur. Heart J. 2014;36:539–550. doi: 10.1093/eurheartj/eht571. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Richardson TG, Harrison S, Hemani G, Smith GD. An atlas of polygenic risk score associations to highlight putative causal relationships across the human phenome. eLife. 2019;8:e43657. doi: 10.7554/eLife.43657. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Khera AV, et al. Polygenic prediction of weight and obesity trajectories from birth to adulthood. Cell. 2019;177:587–596. doi: 10.1016/j.cell.2019.03.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Borodulin K, et al. Forty-year trends in cardiovascular risk factors in Finland. Eur. J. Public Health. 2014;25:539–546. doi: 10.1093/eurpub/cku174. [DOI] [PubMed] [Google Scholar]
- 17.Heistaro, S. Methodology Report: Health 2000 Survey (Publications of the National Health Institute, Helsinki, 2008).
- 18.Kaprio J. The Finnish twin cohort study: an update. Twin Res. Hum. Genet. 2013;16:157–162. doi: 10.1017/thg.2012.142. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Kaprio J. Twin studies in Finland 2006. Twin Res. Hum. Genet. 2006;9:772–777. doi: 10.1375/twin.9.6.772. [DOI] [PubMed] [Google Scholar]
- 20.Vilhjálmsson BJ, et al. Modeling linkage disequilibrium increases accuracy of polygenic risk scores. Am. J. Hum. Genet. 2015;97:576–592. doi: 10.1016/j.ajhg.2015.09.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Chang CC, et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience. 2015;4:7. doi: 10.1186/s13742-015-0047-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Leino A, et al. Health-based reference intervals for ALAT, ASAT and GT in serum, measured according to the recommendations of the European Committee for Clinical Laboratory Standards (ECCLS) Scand. J. Clin. Lab. Investig. 1995;55:243–250. doi: 10.3109/00365519509089619. [DOI] [PubMed] [Google Scholar]
- 23.Antolini L, Nam B, D'Agostino RB. Inference on correlated discrimination measures in survival analysis: a nonparametric approach. Commun. Stat.-Theory Methods. 2004;33:2117–2135. doi: 10.1081/STA-200026579. [DOI] [Google Scholar]
- 24.Pencina MJ, D'Agostino RB, Sr, D'Agostino RB, Jr, Vasan RS. Evaluating the added predictive ability of a new marker: from area under the ROC curve to reclassification and beyond. Stat. Med. 2008;27:157–172. doi: 10.1002/sim.2929. [DOI] [PubMed] [Google Scholar]
- 25.Pencina MJ, D'Agostino RB, Sr, Steyerberg EW. Extensions of net reclassification improvement calculations to measure usefulness of new biomarkers. Stat. Med. 2011;30:11–21. doi: 10.1002/sim.4085. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Clarke T, et al. Polygenic risk for alcohol dependence associates with alcohol consumption, cognitive function and social deprivation in a population-based cohort. Addict. Biol. 2016;21:469–480. doi: 10.1111/adb.12245. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Belsky DW, et al. Polygenic risk and the developmental progression to heavy, persistent smoking and nicotine dependence: evidence from a 4-decade longitudinal study. JAMA Psychiatry. 2013;70:534–542. doi: 10.1001/jamapsychiatry.2013.736. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Vink JM, et al. Polygenic risk scores for smoking: predictors for alcohol and cannabis use? Addiction. 2014;109:1141–1151. doi: 10.1111/add.12491. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Marioni RE, et al. Genetic variants linked to education predict longevity. Proc. Natl Acad. Sci. USA. 2016;113:13366–13371. doi: 10.1073/pnas.1605334113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Spoth RL, Guyll M, Day SX. Universal family-focused interventions in alcohol-use disorder prevention: cost-effectiveness and cost-benefit analyses of two interventions. J. Stud. Alcohol. 2002;63:219–228. doi: 10.15288/jsa.2002.63.219. [DOI] [PubMed] [Google Scholar]
- 31.World Health Organization. Global Strategy To Reduce The Harmful Use Of Alcohol. (WHO, 2010). [DOI] [PMC free article] [PubMed]
- 32.Vandenberg, B. Tackling Harmful Alcohol Use: Economics and Public Health Policy (ed Sassi, F.) 240 pp (OECD Publishing, Paris, France, 2015).
- 33.Gmel G, Rehm J. Measuring alcohol consumption. Contemp. Drug Probl. 2004;31:467–540. doi: 10.1177/009145090403100304. [DOI] [Google Scholar]
- 34.Litten, R. Z. & Allen, J. P. Measuring Alcohol Consumption: Psychosocial And Biochemical Methods (Springer Science & Business Media, 2012).
- 35.Conigrave KM, et al. CDT, GGT, and AST as markers of alcohol use: the WHO/ISBRA collaborative project. Alcohol Clin. Exp. Res. 2002;26:332–339. doi: 10.1111/j.1530-0277.2002.tb02542.x. [DOI] [PubMed] [Google Scholar]
- 36.Edenberg HJ, Gelernter J, Agrawal A. Genetics of alcoholism. Curr. Psychiatry Rep. 2019;21:26. doi: 10.1007/s11920-019-1008-1. [DOI] [PubMed] [Google Scholar]
- 37.Lek M, et al. Analysis of protein-coding genetic variation in 60,706 humans. Nature. 2016;536:285. doi: 10.1038/nature19057. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The FinnGen data may be accessed through Finnish Biobanks’ FinnBB portal (www.finbb.fi) and THL Biobank data through THL Biobank (https://thl.fi/en/web/thl-biobank). Summary statistics from Liu et al. (2019) “Association Studies of up to 1.2 million individuals yield new insights into the genetic etiology of tobacco and alcohol use” published in Nature Genetics are archived at the University of Minnesota library (https://conservancy.umn.edu/handle/11299/20156).
The full genotyping and imputation protocol for FinnGen is described at https://doi.org//10.17504/protocols.io.nmndc5e.