Abstract
The alcohol harm paradox, whereby low socioeconomic position (SEP) groups experience greater alcohol-related harms at a given level of alcohol consumption, is not yet fully understood. In observational studies, key drivers are correlated and share similar confounding structures. We used multivariable Mendelian randomization (MVMR) to estimate the direct causal effect of alcohol (drinks per week) and education (years of schooling) on multiple health outcomes, accounting for the effect of the other. Previously published genome-wide association summary (GWAS) statistics for drinks per week and years of schooling were used, and outcome summary statistics were generated from individual-level data from UK Biobank (N = 462,818). Inverse variance weighted analyses demonstrated evidence for direct effects of alcohol and education on liver diseases (alcoholic liver disease: alcohol OR = 50.19, 95% CI 19.35 to 130.21 and education OR = 0.27, 95% CI 0.14 to 0.53; other liver diseases: alcohol OR = 1.82, 95% CI 1.12 to 2.94 and education OR = 0.42, 95% CI 0.30 to 0.58), mental and behavioural disorders due to alcohol (alcohol OR = 12.89, 95% CI 7.46 to 22.27 and education OR = 0.51, 95% CI 0.35 to 0.75), and stroke (alcohol OR = 1.94, 95% CI 1.30 to 2.89 and education OR = 0.73, 95% CI 0.55 to 0.97). There was evidence for direct effects of education on depression, anxiety, influenza/pneumonia, and heart disease. In contrast, there was evidence of total (without considering the effect of education), but not direct, effects of alcohol on depression, influenza/pneumonia, epilepsy, and injuries. Although caution is required when interpreting these results, given weak instruments for alcohol, these results provide some evidence that the alcohol harm paradox is partially due to the protective effect of additional years of education. Replication with strong genetic instruments for drinks per week would be necessary to draw causal inferences.
Author summary
Individuals from lower socioeconomic position backgrounds tend to disproportionately experience alcohol-related physical and mental ill-health, despite reporting lower overall alcohol consumption than those from higher socioeconomic backgrounds. This is known as the alcohol harm paradox. One key difficulty in understanding this paradox is the methodological difficulty of establishing the relative contribution of multiple complex social behaviours. In this study, we used genetic variants associated with alcohol consumption and years of education to explore their direct effects on multiple health outcomes. The findings indicate that greater alcohol consumption and fewer years of education may each, independently increase the likelihood of developing various health conditions, including liver diseases, mental and behavioural disorders due to alcohol, and stroke. This may suggest that the alcohol harm paradox is due to the protective effect of additional years of education amongst those from higher socioeconomic position backgrounds, reducing their likelihood of developing the health conditions. However, these findings are preliminary and limited by various methodological issues, suggesting these findings should be interpreted with caution. Replication and further studies are needed.
Introduction
Alcohol-related harms are common and represent a large burden of morbidity and mortality worldwide, contributing to 5.3% of all deaths and 5.1% of global burden of disease [1]. Moreover, such alcohol-related harms tend to be disproportionately experienced by individuals from low socioeconomic position (SEP) backgrounds, despite reporting lower alcohol consumption compared to people from high SEP backgrounds, a phenomenon known as the alcohol harm paradox [2–5]. Despite evidence for this paradox, there is limited understanding of its origins.
There have been multiple explanations proposed for the alcohol harm paradox. One is that, whilst low SEP individuals report consuming less alcohol on average, other patterns of drinking behaviour are more common. For example, low SEP individuals are more likely to be in the extreme ends of drinking consumption, with a greater proportion abstaining or drinking very lightly compared to high SEP groups but also being more likely to drink excessively or binge drink [6]. The possibility that drinking patterns may account for the alcohol harm paradox has been supported by research demonstrating a greater socioeconomic gradient for acute health outcomes that are wholly attributable to alcohol, compared to chronic conditions that are influenced by a wide range of behaviours [7].
An alternative explanation is that individuals of low SEP may be more likely to engage in other unhealthy behaviours, such as tobacco use, which are associated with multiple adverse health outcomes [8]. Observational studies have supported this explanation, providing evidence that individuals of lower SEP are increasingly likely to smoke [3]. Furthermore, adjusting for tobacco smoking reduces the association between low SEP and hospital admissions [9], while causal mediation analysis demonstrated that 18% of the total effect of income on alcohol-related mortality was due to indirect effects via smoking and BMI [10]. Whilst these proposed explanations have received some support from observational studies, they are limited in their ability to draw causal inferences and are yet to fully account for the alcohol harm paradox.
The primary complication with disentangling the alcohol harm paradox is that a combination of highly correlated causes, including alcohol consumption, SEP, and smoking, could all be important independent drivers. Therefore, it is necessary to establish the relative causal contribution of alcohol consumption alongside these other factors on the likelihood of developing multiple adverse health outcomes with novel causal inference methods, such as Mendelian randomization (MR). MR relies on Mendel’s law of random assortment, using genetic variants associated with the exposure as instrumental variables to assess the total causal effect of the exposure on the outcome [11].
Multivariable MR (MVMR) builds upon this method, using genetic variants associated with multiple exposures to estimate the direct causal effect of each exposure on the outcome, accounting for the effect of the other exposures [12]. The validity of causal inferences drawn using MVMR relies on three key assumptions [13]: (1) each exposure is robustly predicted by the genetic instruments; (2) there are no confounders of the outcome and any of the genetic instruments; and (3) the genetic instruments only impact the outcome through at least one of the exposures.
The current study applies this MVMR framework to the alcohol harm paradox to examine the direct causal effect of alcohol consumption on adverse health outcomes whilst accounting for education (as a proxy for SEP), and vice versa. We first conducted MR to examine the total causal effect of alcohol and education separately on adverse health, followed by MVMR to examine their direct (i.e., unique) causal effects. Comparison of the MR and MVMR results will provide insights into the relative contribution of alcohol and education. For example, if there appears to be a causal effect of alcohol in both the MR and MVMR, this suggests that alcohol consumption impacts the health outcome even when the impact of education is accounted for. However, if a causal effect of alcohol in the MR is attenuated in the MVMR, this may indicate that the impact of education was driving the evidence of a causal effect of alcohol consumption in the MR and imply that an alternative explanation is needed to account for the associations reported between alcohol consumption and the health outcome.
Materials and methods
The analysis plan for this study was pre-registered on the Open Science Framework (https://doi.org/10.17605/OSF.IO/5G8W4), and the methods presented here are consistent with this unless otherwise specified. We have followed the STROBE-MR guidelines in the reporting of this study (Supplementary Note 1) [14,15].
Exposures
Summary-level data reported by the two Genome-Wide Association Studies (GWAS) were utilised to identify suitable genetic instruments associated with the relevant exposure phenotypes: drinks per week and years of schooling. In contrast with our pre-registration, the main analysis reported here focuses solely on these two exposures and the planned analysis including cigarettes per day as a third exposure is presented as an additional exploratory analysis (details can be found in Supplementary Note 2). We focus on the two exposures of drinks per week and years of schooling because alcohol and smoking are highly correlated and both suffer from weak instrument bias in the three-exposure MVMR, making it difficult to draw conclusions from results using this approach. To aid interpretation, the analysis comprised of a two-exposure model to explore whether SEP plays a causal role in the alcohol harm paradox. Consequently, the results of this study should be considered preliminary.
GWAS & Sequencing Consortium of Alcohol and Nicotine Use (GSCAN).
Liu and colleagues [16] report summary-level statistics from a GWAS meta-analysis of alcohol use based on data from participants of European ancestry from 29 cohorts (N = 941,280). Alcohol use was defined as the reported number of drinks (combined across all alcohol types) consumed per week amongst active drinkers. Where individual studies recorded categorical bins of drinks per week (e.g., 1–4, 5–9), the midpoint value was taken. To minimise the impact of outliers, this was subsequently left-anchored and log transformed and, as a result, we can only infer direction of effect and not size of the effect. Analyses adjusted for age, sex, age and sex interaction, and 10 genetic principal components. Independent variants were reported to be associated with drinks per week if they met the standard significance threshold (p < 5x108). The associated SNPs explained 2.5% of the variance in drinks per week. We utilised the publicly available summary statistics from this GWAS, excluding UK Biobank, to minimise sample overlap, and 23andMe, due to data sharing restrictions (N = 226,223). Smoking heaviness (cigarettes per day) was also measured in this GWAS, as described in the Supplementary Material (page 1–2).
Social Science Genetic Association Consortium (SSGAC).
Okbay and colleagues [17] report summary-level statistics from a GWAS meta-analysis of education, defined as years of schooling, based on data from participants of European ancestry from 65 cohorts (N = 293,723). Education is considered a good proxy for SEP as it is correlated with other SEP aspects, such as income and occupation [18]. Reported number of years of schooling was utilised where this was available. Where individual studies reported qualifications, the International Standard Classification of Education (ISCED; 1997) [19] was used to map each major qualification that could be attained in each country into one of seven categories: pre-primary education, primary education, lower secondary education, upper secondary education, post-secondary non-tertiary education, first-stage tertiary education, and second-stage tertiary education. This was subsequently utilised to impute a years of schooling value for each category. Analyses adjusted for age, sex, age and sex interaction, ten genetic principal components, and cohort-specific controls. Variants were reported to be associated with years of schooling if they met the genome-wide significance threshold (p < 5x108) and were independent.
Health outcomes
Individual-level data, including genetic and outcome data for all individual participants, was obtained from UKB to generate summary statistics for multiple health outcomes.
UK Biobank.
UK Biobank is a population-based cohort, which recruited 500,000 people aged between 40–69 years in 2006–2010. With their consent, they provided detailed information about their lifestyle, physical measures and had blood, urine and saliva samples collected and stored for future analysis [20] (detailed information available at www.ukbiobank.ac.uk). Using this individual-level data, we generated summary statistics by regressing each available SNP on each health outcome through the MRC-IEU UK Biobank GWAS pipeline using BOLT-LMM (a linear mixed model restricted to genetically inferred European ancestry) to account for relatedness and population stratification [21]. Furthermore, we additionally adjusted for age, sex, and genotyping chip array. For the main two-exposure MVMR analysis, this was conducted for the overall UKB sample. For comparison with the exploratory three-exposure MVMR, analyses were also conducted in four smaller samples stratified by smoking status (UK Biobank field ID 20116). These included ‘ever smokers’ (defined as those reporting smoking more than once or twice in their lifetime), further separated into ‘current smokers’ and ‘former smokers’, as well as ‘never smokers’ (defined as those reporting never smoking or only smoking once or twice in their lifetime) (Fig 1). This was necessary in the three-exposure MVMR because the exposure GWAS (cigarettes per day) was only conducted using data from ever smokers and we explored the effect in never smokers to assess the likelihood of pleiotropic effects.
Fig 1. Flowchart demonstrating the source of exposure and outcome data and the inclusion and exclusion of genetic variants (single nucleotide polymorphisms; SNPs) in the MR (Mendelian Randomization) and MVMR (multivariable Mendelian Randomization) analyses.
Details of the three-exposure MVMR (grey dashed lines) are in the Supplementary Materials.
Our pre-registered analysis specified 15 outcomes based upon ICD-10 summary diagnoses classification codes (UKB field ID 41270; S1 Table), highlighted by the World Health Organization and were either wholly or partially attributable to alcohol [7,22,23]. This is based upon England-specific alcohol attributable fractions which reflect “the proportion of disease risk that would not have occurred if the exposure to a risk factor had not occurred” [24]. Wholly attributable conditions are therefore those with an AAF of one and partially attributable conditions are those with an AAF greater than zero but less than one. However, once all the necessary exclusions had been made, four outcomes (alcoholic cardiomyopathy, alcoholic myopathy, alcoholic polyneuropathy, and alcoholic poisoning) had a case proportion less than 0.1% and were therefore removed from the analysis. The remaining outcomes included alcoholic liver disease, other liver diseases, stroke, ischaemic heart disease (IHD), influenza/pneumonia, viral hepatitis, epilepsy, injuries, mental and behavioural disorders due to alcohol, depression, and anxiety.
Analysis
Analyses were conducted in R version 4.1.2 [25], using the Mendelian Randomization [26], TwoSampleMR [27], MVMR [28] and MR-3 [29] R packages.
Selection of genetic variants.
Single nucleotide polymorphisms (SNPs) were selected for inclusion in the analysis if they were reported to be associated with the exposure phenotype of interest (alcohol use or education) and (conditionally) independent of any other SNP associations at the genome-wide significance level (p < 5 × 10-8) in the relevant GWAS. In GSCAN, this involved sequential forward selection whereby SNPs within each locus were iteratively included in the set of independent SNPs until a SNP was no longer significant using a partial correlation-based score statistic whereas SSGAC applied conditional joint analysis in the Genome-Wide Complex Trait Analysis software (GCTA-COJO) which selected SNPs through a stepwise conditional regression model. The SNPs were discovered in the full GSCAN sample and subsequently weighted using summary statistics from the sample excluding UKB and 23andMe to ensure unbiased SNP effect estimates for MR, as recommended in previous studies [30]. These SNPs also needed to be available in UK Biobank, resulting in 96 drinks per week and 74 years of schooling SNPs in the univariable MR. For the MVMR analyses, where a SNP from GSCAN (drinks per week) was not available in SSGAC (years of schooling), we selected proxy SNPs with a minimum linkage disequilibrium (LD) R2 of 0.8. Moreover, an additional clumping stage was conducted in the MVMR to ensure all exposure SNPs were independent (LD R2 < 0.1, clumping window > 500 kb), resulting in 71 drinks per week and 62 years of schooling SNPs in the MVMR (Fig 1; see S2 Table for details).
We assessed instrument strength using a conditional F-statistic and instrument validity using Cochran’s Q-statistic. In line with conventional guidelines, the F-statistic should be greater than 10 and the Q-statistic should be less than the number of SNPs [31].
Univariable MR.
We estimated the total causal effects of alcohol use and education on all 11 health outcomes using two-sample, summary level MR. The main analysis was conducted using inverse variance weighted (IVW) regression [32]. We also conducted sensitivity analyses using four complementary MR methods MR-Egger [33], weighted median estimation [34], weighted mode estimation [35], and MR-PRESSO [29]. These methods hold different assumptions regarding pleiotropy and therefore consistent results across the methods provide stronger evidence that the observed effects are causal. A non-zero MR-Egger intercept indicates the presence of directional pleiotropy.
Multivariable MR.
In MVMR analyses, we estimated the direct causal effect of alcohol use, education, and tobacco use on all health outcomes using two sample, summary level MVMR with three complimentary methods – MVMR-IVW, MVMR-Egger, and MVMR-PRESSO.
Results
The MR-IVW, MVMR-IVW, MR-Egger and MVMR-Egger results exploring the total and direct effects of alcohol and education are displayed in Fig 2. The complete results of the MVMR analyses exploring the direct effects of (1) alcohol and education (n = 133 SNPs) and (2) alcohol, education, and smoking (n = 171 SNPs) on health outcomes are displayed in S3-S13 Tables. The results are presented in odds ratios (ORs) per standard deviation (SD) increase in either drinks per week or years of schooling.
Fig 2. Forest plots displaying the effect of alcohol (drinks per week) and education (years of schooling) on eight health outcomes using Mendelian randomization (MR) and multivariable MR (MVMR) analysis: inverse variance weighted (IVW) and Egger regression.
An odds ratio (OR) greater than 1 implies an increase in the likelihood of developing the health outcome with more drinks per week or years of schooling and an OR below 1 implies a decrease in the likelihood of developing the health outcome.
Instrument strength and heterogeneity
Instrument strength was calculated using the conditional F-statistic. In the MR, the F-statistics suggest that the SNPs included in the analysis are strong instruments for alcohol (F = 14.87) and education (F = 35.32). The conditional F-statistics in the two exposure MVMR indicate that the included SNPs are strongly associated with education whilst accounting for the effects of alcohol (F = 17.42), but not strongly associated with alcohol while accounting for education (F = 6.32).
The Cochran’s Q statistics were greater than the number of SNPs included in most of the models and the MVMR-Egger intercepts are non-zero, indicating heterogeneity and directional pleiotropy. Therefore, the results also include MVMR-Egger, which provides estimates that are robust to directional pleiotropy under the assumption that pleiotropy is uncorrelated with the strength of the association between the SNP and exposure. However, MVMR-Egger analyses have reduced statistical power compared with IVW analyses and therefore we must consider the results from both analysis when drawing conclusions. Where Egger results indicate a similar effect size but with larger confidence intervals which cross the null, we interpret this as evidence in support of the IVW analysis. The MR- and MVMR-PRESSO results generally found no evidence of horizontal pleiotropy (global test p value > 0.05) or could not identify outliers. In the few instances where horizontal pleiotropy and outliers were detected, the results did not find statistical evidence that the estimates differed before and after outliers were removed (distortion test p value > 0.05).
Alcoholic liver disease
There was evidence for a total (OR = 36.57, 95% CI 16.46 to 81.24) and direct (OR = 50.19, 95% CI 19.35 to 130.21) effect of alcohol on alcoholic liver disease when accounting for education. There was also evidence that education decreased the odds of alcoholic liver diseases in the univariable (OR = 0.44, 95% CI 0.23 to 0.86) and multivariable (OR = 0.27, 95% CI 0.14 to 0.53) analysis when accounting for alcohol (Fig 2; S3 Table). Whilst the MR-Egger and MVMR-Egger results were consistent with the IVW results for alcohol, for education, the effect estimates were similar but the wider confidence intervals overlapped the null.
Mental and behavioural disorders due to alcohol
There was evidence for a total (OR = 15.88, 95% CI 10.08 to 25.01) and direct (OR = 12.89, 95% CI 7.46 to 22.27) effect of alcohol on mental and behavioural disorders due to alcohol, as well as a total (OR = 0.60, 95% CI 0.44 to 0.83) and direct (OR = 0.51, 95% CI 0.35 to 0.75) effect of education on mental and behavioural disorders due to alcohol (Fig 2; S4 Table). The MR- and MVMR-Egger findings were consistent with the IVW findings for alcohol and showed similar effect estimates for education, although the confidence intervals crossed the null.
Other liver diseases
There was evidence for a total (OR = 1.80, 95% CI 1.21 to 2.70) and direct (OR = 1.82, 95% CI 1.12 to 2.94) effect of alcohol on other liver diseases, as well as evidence for a total (OR = 0.47, 95% CI 0.35 to 0.62) and direct (OR = 0.42, 95% CI 0.30 to 0.58) effect of education on other liver diseases (Fig 2; S5 Table). The MR- and MVMR-Egger results calculated similar effect estimates, although the wider confidence intervals crossed the null for alcohol and the total effect of education.
Stroke
There was evidence for a total (OR = 1.94, 95% CI 1.46 to 2.58) and direct (OR = 1.94, 95% CI 1.30 to 2.89) effect of alcohol on stroke, when accounting for education. Also, whilst there was no clear evidence for a total effect of education on stroke, there was some evidence of a direct effect in the MVMR when accounting for alcohol (OR = 0.73, 95% CI 0.55 to 0.97) (Fig 2; S6 Table). The MR- and MVMR-Egger findings were similar to the IVW findings in terms of the effect estimates. However, the confidence intervals were wider and therefore crossed the null for education and the direct effect of alcohol.
Injuries
There was evidence for a total (OR = 1.47, 95% CI 1.24 to 1.73) and direct (1.29, 95% CI 1.03 to 1.60) effect of alcohol on injuries. However, there was no clear evidence for a total (OR = 0.98, 95% CI 0.85 to 1.14) or direct (OR = 0.87, 95% CI 0.75 to 1.02) effect of education (Fig 2; S7 Table). The MR- and MVMR-Egger results supported the IVW results for education and the total effect of alcohol; however, the confidence intervals crossed the null for the direct effect of alcohol.
Depression
In the univariable MR, there was evidence for a total effect of alcohol on depression (OR = 1.33, 95% CI 1.09 to 1.64); however, there was no clear evidence for a direct effect when accounting for education (OR = 1.11, 95% CI 0.84 to 1.48). There was evidence that education decreased the odds of depression in the univariable (OR = 0.59, 95% CI 0.48 to 0.73) and multivariable analyses when accounting for alcohol (OR = 0.56, 95% CI 0.45 to 0.68) (Fig 2; S8 Table). The MR- and MVMR-Egger results supported the IVW findings of a total, but not direct, effect of alcohol (although the wider confidence intervals crossed the null), as well as a direct effect of education. However, the MR-Egger did not support IVW evidence for a total effect of education.
Epilepsy
Whilst there was evidence for a total effect of alcohol on epilepsy (OR = 1.91, 95% CI 1.33 to 2.75), there was no clear evidence for a direct effect when accounting for education (OR = 1.17, 95% CI 0.72 to 1.89). Moreover, there was no clear evidence for a total (OR = 0.83, 95% CI 0.60 to 1.15) or direct (OR = 0.82, 95% CI 0.58 to 1.14) effect of education on epilepsy (Fig 2; S9 Table). The MR-Egger results supported a total effect of alcohol and also found some evidence for a direct effect, which was not indicated in the IVW findings. The MR- and MVMR-Egger effect estimates for education were in the opposite direction to the IVW results, although all confidence intervals crossed the null.
Influenza and pneumonia
There was evidence for a total (OR = 1.32, 95% CI 1.03 to 1.68) but not a direct effect of alcohol on influenza/pneumonia when accounting for education (OR = 1.28, 95% CI 0.94 to 1.73). There was evidence that education decreased the odds of influenza/pneumonia in the univariable (OR = 0.63, 95% CI 0.53 to 0.76) and multivariable (OR = 0.60, 95% CI 0.48 to 0.74) analysis (Fig 2; S10 Table). The MR- and MVMR-Egger results were primarily consistent with the IVW results, supporting a total and direct effect of education and no direct effect of alcohol. Whilst the effect estimate for the total effect of alcohol was similar, the MR-Egger confidence interval crossed the null.
Other outcomes
There was no clear evidence for a total or direct effect of alcohol on IHD (S11 Table), viral hepatitis (S12 Table), or anxiety (S13 Table). There was evidence for a total (OR = 0.61, 95% CI 0.53 to 0.70) and direct (OR = 0.62, 95% CI 0.52 to 0.74) effect of education on IHD when accounting for alcohol. There was also evidence for a total (OR = 0.67, 95% CI 0.52 to 0.86) and direct (OR = 0.62, 95% CI 0.48 to 0.80) effect of education on anxiety. There was no clear evidence for a total or direct effect of education on viral hepatitis.
Overall, the MR- and MVMR-Egger results were mostly similar to the IVW results for IHD, anxiety, and viral hepatitis. The only differences were that the MR-Egger did not support a total effect of education on IHD or anxiety.
Discussion
We find evidence for direct effects of alcohol and education on alcoholic and other liver diseases, mental and behavioural disorders due to alcohol, and stroke, indicating that alcohol use increases the likelihood of such outcomes, whereas years of education decreases their likelihood. There was also evidence of a direct effect of education on depression, anxiety, influenza/pneumonia, and IHD. In contrast, there was evidence of a total effect, but not a direct effect (i.e., when accounting for education), of alcohol on depression, influenza/pneumonia, epilepsy, and injuries.
The direct effects of alcohol on liver diseases and mental and behavioural disorders due to alcohol are consistent with the aetiology of such conditions, as they are known to be ‘wholly attributable’ to alcohol [7,22]. For such ‘wholly attributable’ outcomes, these results are consistent with previous findings from observational studies suggesting the paradox may be due to the multiplicative contribution of drinks per week and years of schooling, which both may impact the likelihood of developing such health conditions [3,4]. Moreover, the direct effect of alcohol on stroke is supported by a previous MR meta-analysis demonstrating robust causal relationships between alcohol consumption and cardiovascular diseases, including ischaemic stroke [36].
However, the results for other outcomes (‘partially attributable’) imply that years of schooling, but not drinks per week, contribute to their development, including total but not direct effects of alcohol on some conditions such as depression, epilepsy, and injuries when accounting for education. This finding resonates with a recent genome wide meta-analysis which found novel genetic loci associated with problematic alcohol use and its comorbidities, such as depression and other mental health disorders [37], suggesting that the contribution of alcohol to these may be, at least partially, mediated by other correlated factors.
The evidence for a causal contribution of years of schooling is consistent with previous natural and genetic studies suggesting that raising the school leaving age and additional years of education reduce the likelihood of developing adverse health outcomes [38,39]. The protective effects of education observed in conditions such as depression, anxiety, and ischemic heart disease (IHD) are consistent with previous population-based observational studies highlighting the protective effect of education on anxiety and depression [40] and large-scale MR studies demonstrating that education positively impacts coronary heart disease [41].
The finding that drinks per week is not causally related to many of the ‘partially attributable’ conditions may be because drinks per week is a simplified a measure of alcohol consumption. Other, more complex drinking patterns (such as alcohol frequency, drinks consumed per drinking occasion, and alcohol type) may be more important in the development of these conditions. Previous observational research has reported that low SEP groups are more likely to binge drink or exceed high weekly drinking thresholds [6]. This has also been supported by genetic research which have reported opposite genetic correlations between SEP and drinking frequency compared with drinking consumption [42], and have demonstrated relationships between more years of education and reduced binge drinking, number of drinks per drinking occasion, and spirit intake using MR [18]. Therefore, the results for these ‘partially attributable’ conditions may be because their relationship with alcohol consumption is more nuanced than is being captured by the genetic instruments associated with drinks per week.
However, it is important to be cautious and consider the strength of instruments when interpreting these results. The conditional F-statistics indicate that the instruments were not strongly associated with alcohol in the MVMR, despite strong associations in the MR. Therefore, we cannot be as confident or draw causal inferences from our estimates of direct effects; instead, our findings may be indicative of possible causal effects which would need to be replicated with strong instruments to be confident. In addition, we were not able to explore the relative contribution of tobacco use alongside alcohol consumption and SEP as initially intended due to the weak genetic instruments for CPD and, therefore, we can only consider these results as exploratory as they were not all specified in our pre-registered analysis plan.
Strengths and Limitations
This is the first study, to our knowledge, to apply MVMR, a novel causal inference method, to understand the direct causal effects of alcohol and education on multiple alcohol-related conditions to enhance understanding of the alcohol harm paradox. Also, the large sample size of the GWAS for education will have enhanced statistical power and the conditional F-statistics suggest good instrument strength for years of schooling. However, this study also has multiple limitations.
As previously mentioned, the conditional F-statistics indicate that our proxy for alcohol consumption is a weak instrument in the MVMR, which may possibly bias our results either towards or away from the null [31]. Weak instruments, however, tend to increase the size of the confidence intervals, possibly increasing the likelihood of Type 2 error. Heterogeneity is indicated by the Cochran’s Q statistics and, even though we applied MR-Egger to address this, heterogeneity may remain, indicating the analyses may also be biased by invalid instruments. Directional pleiotropy is similarly likely to persist in our analyses despite applying pleiotropy-robust methods as indicated by the MR-Egger intercepts, which are not null in most of the models. This suggests that the genetic instruments may influence the adverse health outcomes within this study via pathways that do not include alcohol or education and therefore may not tell us about the causal effect of alcohol or education on such outcomes.
The drinks per week GWAS has been updated since we published our protocol and conducted the analysis and it is possible the results may differ using this GWAS [43]. However, the marked increase in drinks per week SNPs (over eight times) may lead to a problematic imbalance between alcohol and education, perhaps leading to weak instrument bias, and the increase in sample size (when the sample was already sufficiently large) appears to have introduced noise as evidenced by larger standard errors [31,44]. Also, the F-statistics have been demonstrated to be greater (indicating a stronger instrument) when using SNPs and betas identified in the current GWAS compared with the updated GWAS [45]. Therefore, whilst it may be interesting to examine how results compare with the updated GWAS, we do not believe it would strengthen the analysis. Moreover, drinks per week is a somewhat limited phenotype to reflect alcohol as many factors influence the effects of alcohol beyond number of drinks, including binge patterns and alcohol types. Similarly, education is one factor that contributes to an individual’s SEP; factors such as income, neighbourhood, and occupation are also important and may show different relationships. Therefore, focusing purely on drinks per week and years of schooling may be restricting our understanding. Combined with the exploratory nature of these analyses, these limitations suggest that caution is highly necessary in interpreting the current findings.
Future Research and Implications
Due to limitations within the study regarding weak instruments and possible pleiotropy, the current study should be replicated with newly available GWAS data [43] allowing comparison with data from larger samples, as well as applying newly developed methods robust to weak instruments [46]. Replication will help confirm whether the results presented in this study reflect the true causal effects of alcohol and education or are due to methodological limitations. If the current findings were supported, this would suggest that the alcohol harm paradox may be due to the protective effect of years of schooling on many alcohol-related conditions resulting in a reduced likelihood of higher SEP groups developing such outcomes. Future research may therefore benefit from exploring the mechanisms through which education impacts such health outcomes to inform preventative policy. Beyond this, applying this MVMR framework to explore the contribution of alternative exposures (such as different drinking patterns or problematic drinking behaviours [47], income, and impulsivity/risk-taking behaviour) may provide a useful next step for future research aiming to disentangle the alcohol harm paradox.
Conclusion
This research demonstrates the utility of applying the MVMR framework to enhance understanding of complex behavioural phenomena. We found evidence for a direct effect of alcohol on alcoholic and other liver diseases, mental and behavioural disorders due to alcohol, and stroke. There was also evidence for a total effect of alcohol on depression, epilepsy, influenza/pneumonia, and injuries, which were attenuated when education was accounted for. Despite weak instrument bias, such results cautiously indicate that many partially alcohol-attributable conditions are causally impacted by years of education, which we use here as a proxy for SEP. Replication with strong instruments is necessary to confirm these findings. Future research may endeavour to explore the causal effect of education on other partially alcohol-attributable conditions and identify the mechanisms through which further years of education protects against these conditions.
Supporting information
(DOCX)
(XLSX)
(XLSX)
(XLSX)
(XLSX)
(XLSX)
(XLSX)
(XLSX)
(XLSX)
(XLSX)
(XLSX)
(XLSX)
(XLSX)
(XLSX)
(XLSX)
Acknowledgments
This research has been conducted using the UK Biobank Resource, a major biomedical database, under application number 9142. This work uses data provided by patients and collected by the NHS as part of their care and support.
Data Availability
This research uses publicly available summary statistics from previously published GWAS, including GSCAN (available at: https://doi.org/10.13020/3b1n-ff32) and SSGAC (available at: doi:10.1038/nature17671 - full article requires institutional access or purchase/subscription, but data is available to download from Supplementary information). We also use data from UK Biobank (application 9142) and this data is available to approved researchers via an application (https://www.ukbiobank.ac.uk/enable-your-research/apply-for-access).
Funding Statement
GS, HS, MM, LM, and JK are/were all members of the Medical Research Council (MRC) Integrative Epidemiology Unit at the University of Bristol when this work began. LM is now in Public Health Wales. This publication is the work of the authors, and they will serve as guarantors for the contents of this paper. GS is supported by a Wellcome Trust PhD studentship in Molecular, Genetic and Lifecourse Epidemiology (ref: 218495/Z/19/Z). For the purpose of Open Access, the author has applied a CC BY public copyright licence to any Author Accepted Manuscript version arising from this submission. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.World Health Organization. Alcohol. 2018 [cited 9 Mar 2022]. https://www.who.int/news-room/fact-sheets/detail/alcohol
- 2.Smith K, Foster Alcohol J. Health Inequalities and the Harm Paradox: Why some groups face greater problems despite consuming less alcohol. 2014. https://www.alcoholpolicy.net/2014/11/alcohol-health-inequalities-and-the-harm-paradox-ias-report-reviews-higher-harms-in-deprived-groups.html
- 3.Bellis MA, Hughes K, Nicholls J, Sheron N, Gilmore I, Jones L. The alcohol harm paradox: using a national survey to explore how alcohol may disproportionately impact health in deprived individuals. BMC Public Health. 2016;16:111. doi: 10.1186/s12889-016-2766-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Beard E, Brown J, West R, Angus C, Brennan A, Holmes J, et al. Deconstructing the Alcohol Harm Paradox: A Population Based Survey of Adults in England. PLoS One. 2016;11(9):e0160666. doi: 10.1371/journal.pone.0160666 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Møller SP, Pisinger VSC, Christensen AI, Tolstrup JS. Socioeconomic position and alcohol-related harm in Danish adolescents. J Epidemiol Community Health. 2019;73(9):839–45. doi: 10.1136/jech-2018-211634 [DOI] [PubMed] [Google Scholar]
- 6.Lewer D, Meier P, Beard E, Boniface S, Kaner E. Unravelling the alcohol harm paradox: a population-based study of social gradients across very heavy drinking thresholds. BMC Public Health. 2016;16:599. doi: 10.1186/s12889-016-3265-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Sadler S, Angus C, Gavens L, Gillespie D, Holmes J, Hamilton J, et al. Understanding the alcohol harm paradox: an analysis of sex- and condition-specific hospital admissions by socio-economic group for alcohol-associated conditions in England. Addiction. 2017;112(5):808–17. doi: 10.1111/add.13726 [DOI] [PubMed] [Google Scholar]
- 8.GBD 2015 Tobacco Collaborators. Smoking prevalence and attributable disease burden in 195 countries and territories, 1990-2015: a systematic analysis from the Global Burden of Disease Study 2015. Lancet. 2017;389(10082):1885–906. doi: 10.1016/S0140-6736(17)30819-X [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Gartner A, Trefan L, Moore S, Akbari A, Paranjothy S, Farewell D. Drinking beer, wine or spirits - does it matter for inequalities in alcohol-related hospital admission? A record-linked longitudinal study in Wales. BMC Public Health. 2019;19(1):1651. doi: 10.1186/s12889-019-8015-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Peña S, Mäkelä P, Laatikainen T, Härkänen T, Männistö S, Heliövaara M, et al. Joint effects of alcohol use, smoking and body mass index as an explanation for the alcohol harm paradox: causal mediation analysis of eight cohort studies. Addiction. 2021;116(8):2220–30. doi: 10.1111/add.15395 [DOI] [PubMed] [Google Scholar]
- 11.Smith GD, Ebrahim S. “Mendelian randomization”: can genetic epidemiology contribute to understanding environmental determinants of disease? Int J Epidemiol. 2003;32(1):1–22. doi: 10.1093/ije/dyg070 [DOI] [PubMed] [Google Scholar]
- 12.Sanderson E, Davey Smith G, Windmeijer F, Bowden J. An examination of multivariable Mendelian randomization in the single-sample and two-sample summary data settings. Int J Epidemiol. 2019;48(3):713–27. doi: 10.1093/ije/dyy262 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Sanderson E, Glymour MM, Holmes MV, Kang H, Morrison J, Munafò MR, et al. Mendelian randomization. Nat Rev Methods Primers. 2022;2:6. doi: 10.1038/s43586-021-00092-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Skrivankova VW, Richmond RC, Woolf BAR, Yarmolinsky J, Davies NM, Swanson SA, et al. Strengthening the Reporting of Observational Studies in Epidemiology Using Mendelian Randomization: The STROBE-MR Statement. JAMA. 2021;326(16):1614–21. doi: 10.1001/jama.2021.18236 [DOI] [PubMed] [Google Scholar]
- 15.Von Elm E, Altman DG, Egger M, Pocock SJ, Gøtzsche PC, Vandenbroucke JP. The Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) statement: guidelines for reporting observational studies. 2007. www.thelancet.comwww.plosmedicine.org [DOI] [PMC free article] [PubMed]
- 16.Liu M, Jiang Y, Wedow R, Li Y, Brazel DM, Chen F, et al. Association studies of up to 1.2 million individuals yield new insights into the genetic etiology of tobacco and alcohol use. Nat Genet. 2019;51(2):237–44. doi: 10.1038/s41588-018-0307-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Okbay A, Beauchamp JP, Fontana MA, Lee JJ, Pers TH, Rietveld CA, et al. Genome-wide association study identifies 74 loci associated with educational attainment. Nature. 2016;533(7604):539–42. doi: 10.1038/nature17671 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Rosoff DB, Clarke T-K, Adams MJ, McIntosh AM, Davey Smith G, Jung J, et al. Educational attainment impacts drinking behaviors and risk for alcohol dependence: results from a two-sample Mendelian randomization study with ~780,000 participants. Mol Psychiatry. 2021;26(4):1119–32. doi: 10.1038/s41380-019-0535-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.UNESCO. International Standard Classification of Education ISCED 1997. 2006. https://isced.uis.unesco.org/wp-content/uploads/sites/15/2022/12/international-standard-classification-of-education-1997-en_0.pdf
- 20.Sudlow C, Gallacher J, Allen N, Beral V, Burton P, Danesh J, et al. UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 2015;12(3):e1001779. doi: 10.1371/journal.pmed.1001779 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Mitchell R, Elsworth B, Raistrick C, Paternoster L, Hemani G, Gaunt T. MRC IEU UK Biobank GWAS pipeline version 2. 2019. 10.5523/bris.pnoat8cxo0u52p6ynfaekeigi [DOI]
- 22.Public Health England and UK Health Forum. Alcohol-attributable fractions for England: An update. In: GOV.UK [Internet]. 12 Jun 2020 [cited 8 Nov 2022]. https://www.gov.uk/government/publications/alcohol-attributable-fractions-for-england-an-update
- 23.World Health Organization. Global status report on alcohol and health 2018. 2018. https://www.who.int/publications/i/item/9789241565639
- 24.Jones L, Dedman D, Tocque K. Alcohol-Attributable Fractions for England: Alcohol-Attributable Mortality and Hospital Admissions Systematic review of interventions to enhance health service contacts with parents and infants View project. 2008. https://www.researchgate.net/publication/235428834
- 25.R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing. 2021 [cited 18 Mar 2022]. https://www.R-project.org/
- 26.Yavorska OO, Burgess S. MendelianRandomization: an R package for performing Mendelian randomization analyses using summarized data. Int J Epidemiol. 2017;46(6):1734–9. doi: 10.1093/ije/dyx034 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Hemani G, Zheng J, Elsworth B, Wade KH, Haberland V, Baird D, et al. The MR-Base platform supports systematic causal inference across the human phenome. Elife. 2018;7:e34408. doi: 10.7554/eLife.34408 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Spiller W, Bowden J, Sanderson E. MVMR: MVMR. R package version 0.3. 2022. https://github.com/WSpiller/MVMR
- 29.Verbanck M, Chen C-Y, Neale B, Do R. Detection of widespread horizontal pleiotropy in causal relationships inferred from Mendelian randomization between complex traits and diseases. Nat Genet. 2018;50(5):693–8. doi: 10.1038/s41588-018-0099-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Jeong S, Shivakumar M, Jung S-H, Won H-H, Nho K, Huang H, et al. Addressing overfitting bias due to sample overlap in polygenic risk scoring. Alzheimers Dement. 2025;21(4):e70109. doi: 10.1002/alz.70109 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Sanderson E, Spiller W, Bowden J. Testing and correcting for weak and pleiotropic instruments in two-sample multivariable Mendelian randomization. Stat Med. 2021;40(25):5434–52. doi: 10.1002/sim.9133 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Burgess S, Butterworth A, Thompson SG. Mendelian randomization analysis with multiple genetic variants using summarized data. Genet Epidemiol. 2013;37(7):658–65. doi: 10.1002/gepi.21758 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Bowden J, Davey Smith G, Burgess S. Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression. Int J Epidemiol. 2015;44(2):512–25. doi: 10.1093/ije/dyv080 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Bowden J, Davey Smith G, Haycock PC, Burgess S. Consistent Estimation in Mendelian Randomization with Some Invalid Instruments Using a Weighted Median Estimator. Genet Epidemiol. 2016;40(4):304–14. doi: 10.1002/gepi.21965 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Hartwig FP, Davey Smith G, Bowden J. Robust inference in summary data Mendelian randomization via the zero modal pleiotropy assumption. Int J Epidemiol. 2017;46(6):1985–98. doi: 10.1093/ije/dyx102 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Holmes MV, Dale CE, Zuccolo L, Silverwood RJ, Guo Y, Ye Z, et al. Association between alcohol and cardiovascular disease: Mendelian randomisation analysis based on individual participant data. BMJ. 2014;349:g4164. doi: 10.1136/bmj.g4164 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Zhou H, Sealock JM, Sanchez-Roige S, Clarke T-K, Levey DF, Cheng Z, et al. Genome-wide meta-analysis of problematic alcohol use in 435,563 individuals yields insights into biology and relationships with other traits. Nat Neurosci. 2020;23(7):809–18. doi: 10.1038/s41593-020-0643-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Davies NM, Dickson M, Davey Smith G, Windmeijer F, van den Berg GJ. The causal effects of education on adult health, mortality and income: evidence from Mendelian randomization and the raising of the school leaving age. Int J Epidemiol. 2023;52(6):1878–86. doi: 10.1093/ije/dyad104 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Davies NM, Dickson M, Davey Smith G, van den Berg GJ, Windmeijer F. The Causal Effects of Education on Health Outcomes in the UK Biobank. Nat Hum Behav. 2018;2(2):117–25. doi: 10.1038/s41562-017-0279-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Bjelland I, Krokstad S, Mykletun A, Dahl AA, Tell GS, Tambs K. Does a higher educational level protect against anxiety and depression? The HUNT study. Soc Sci Med. 2008;66(6):1334–45. doi: 10.1016/j.socscimed.2007.12.019 [DOI] [PubMed] [Google Scholar]
- 41.Tillmann T, Vaucher J, Okbay A, Pikhart H, Peasey A, Kubinova R, et al. Education and coronary heart disease: mendelian randomisation study. BMJ. 2017;358:j3542. doi: 10.1136/bmj.j3542 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Marees AT, Smit DJA, Ong J-S, MacGregor S, An J, Denys D, et al. Potential influence of socioeconomic status on genetic correlations between alcohol consumption measures and mental health. Psychol Med. 2020;50(3):484–98. doi: 10.1017/S0033291719000357 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Saunders GRB, Wang X, Chen F, Jang S-K, Liu M, Wang C, et al. Genetic diversity fuels gene discovery for tobacco and alcohol use. Nature. 2022;612(7941):720–4. doi: 10.1038/s41586-022-05477-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Escott-Price V, Hardy J. Genome-wide association studies for Alzheimer’s disease: bigger is not always better. Brain Commun. 2022;4(3):fcac125. doi: 10.1093/braincomms/fcac125 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Jiesisibieke ZL, Schooling CM. Impact of Alcohol Consumption on Lifespan: a Mendelian randomization study in Europeans. Sci Rep. 2024;14(1):25321. doi: 10.1038/s41598-024-73333-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Patel A, Lane J, Burgess S. Weak instruments in multivariable Mendelian randomization: methods and practice. 2024. [Google Scholar]
- 47.Zhou H, Kember RL, Deak JD, Xu H, Toikumo S, Yuan K, et al. Multi-ancestry study of the genetics of problematic alcohol use in over 1 million individuals. Nat Med. 2023;29(12):3184–92. doi: 10.1038/s41591-023-02653-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
(DOCX)
(XLSX)
(XLSX)
(XLSX)
(XLSX)
(XLSX)
(XLSX)
(XLSX)
(XLSX)
(XLSX)
(XLSX)
(XLSX)
(XLSX)
(XLSX)
(XLSX)
Data Availability Statement
This research uses publicly available summary statistics from previously published GWAS, including GSCAN (available at: https://doi.org/10.13020/3b1n-ff32) and SSGAC (available at: doi:10.1038/nature17671 - full article requires institutional access or purchase/subscription, but data is available to download from Supplementary information). We also use data from UK Biobank (application 9142) and this data is available to approved researchers via an application (https://www.ukbiobank.ac.uk/enable-your-research/apply-for-access).


