Abstract
Vitamin D has been associated with a variety of human complex traits and diseases in observational studies, but a causal relationship remains unclear. To examine a putative causal effect of vitamin D across phenotypic domains and disease categories, we conducted Mendelian randomization (MR) analyses using genetic instruments associated with circulating 25-hydroxyvitamin D [25(OH)D] concentrations. We leveraged genome-wide significant 25(OH)D-associated SNPs (N = 138) from a meta-analysis combining a vitamin D GWAS conducted in 401,460 white British UK Biobank (UKBB) participants and an independent vitamin D GWAS including 42,274 samples of European ancestry, and examined 190 large-scale health-related GWAS spanning a broad spectrum of complex traits, diseases and biomarkers. We applied multiple MR methods to estimate the causal effect of vitamin D while testing and controlling for potential biases from horizontal pleiotropy. Consistent with previous findings, genetically predicted increased 25(OH)D levels significantly decreased the risk of multiple sclerosis (OR = 0.824; 95% CI 0.689–0.986). The protective effect estimate was consistent across different MR methods and four different multiple sclerosis GWAS with varying sample sizes and genotyping platforms. On the contrary, we found limited evidence in support of a causal effect of 25(OH)D on anthropometric traits, obesity, cognitive function, sleep behavior, breast and prostate cancer, and autoimmune, cardiovascular, metabolic, neurological and psychiatric traits and diseases, and blood biomarkers. Our results may inform ongoing and future randomized clinical trials of vitamin D supplementation.
Subject terms: Risk factors, Genome-wide association studies, Nutrition, Epidemiology
Introduction
Vitamin D (or its hydroxylated form produced by liver, the 25-hydroxyvitamin D [25(OH)D]) is a group of fat-soluble secosteroids that pose a crucial impact on human health1. Long-term poor dietary intake of vitamin D or inadequate exposure to sunlight (ultraviolet B) can cause rickets and osteomalacia. Several large-scale epidemiological investigations have linked vitamin D deficiency to a range of common diseases such as cancer, autoimmune diseases, and cardiovascular conditions2–6. Yet the observational nature of epidemiological designs hinders causal inference, and the validity of these results can be plagued by measurement error, confounding and reverse causality. Although randomized controlled trials (RCTs) are considered as the gold standard to establish causal relationships, large-scale RCTs of vitamin D supplementation are not readily available due to their high cost, long duration and ethical concerns7,8. A recently completed RCT, VITAL (ClinicalTrials.gov: NCT01169259), which involved a total of 25,871 participants with a median follow-up period of 5.3 years, did not find lowered incidence of invasive cancer or cardiovascular events for those who took vitamin D3 at a dose of 2000 IU per day compared with the placebo group7,9. The ancillary study to VITAL, VITAL-DEP (ClinicalTrials.gov: NCT02521012), recruited and followed up with 18,353 participants, and found the risk of depression or clinically relevant depressive symptoms was not significantly different between the vitamin D3 group and the placebo group; no significant benefit of vitamin D was identified for depression incidence or recurrence either10. RCTs targeting other diseases and conditions such as cognitive change (ClinicalTrials.gov: NCT03613116) are ongoing and have yet to reach conclusions. To better inform the field on the chance of success for such vitamin D trials, an improved understanding of the causal role of vitamin D in human complex traits and diseases, preferably through less costly observational studies, is needed.
Serum levels of 25(OH)D has a polygenic predisposition. Several early genome-wide association studies (GWAS) have demonstrated the associations of GC, NADSYN1/DHCR7, CYP2R1, CYP24A1, SEC23A, and AMDHD1 with circulating vitamin D11–14. One of the latest GWAS of circulating 25(OH)D, conducted by meta-analyzing a GWAS of 401,460 white British UK Biobank (UKBB) participants and a GWAS of 42,274 samples of European ancestry, has identified 138 conditionally independent SNPs in 69 vitamin D associated loci13. These genetic discoveries have enabled causal inference between vitamin D and complex traits through Mendelian randomization (MR), an approach that correlates genetic variants randomly distributed at conception (instrumental variables, IVs) with health outcomes later on15. An unbiased estimate of the causal effect can be obtained using the observed IV-exposure and IV-outcome associations when all model assumptions of MR are satisfied16.
To date, the causal role of vitamin D beyond its importance in bone health remains ambiguous and is under much debate. The causal effects of vitamin D were only replicated in MR studies for adult- and pediatric-onset multiple sclerosis and Alzheimer’s disease17–21; whereas findings for other complex human traits and diseases were inconsistent22–25. Overall, results from MR studies of vitamin D are largely null and do not support conventional observational studies, which have associated vitamin D with a variety of human traits and diseases. That said, it is worth noting that the majority of hitherto MR studies for vitamin D used a small number of IVs (N = 4 or 6 independent loci) identified by the SUNLIGHT consortium meta-GWAS back in 201010. Recent GWAS incorporating UKBB samples have identified a much larger number of 25(OH)D-associated genomic loci with more accurate effect size estimates, providing an opportunity to re-evaluate the causal role of 25(OH)D with much improved power13,14. Moreover, the rapid accumulation of GWAS summary statistics for human complex traits and diseases has allowed us to assess the causal effect of vitamin D across a broad spectrum of health-related outcomes.
Here, we conducted two-sample MR analyses (i.e., IV-exposure and IV-outcome associations are estimated from two separate samples) to examine the causal role of 25(OH)D across phenotypic domains and disease categories. Summary statistics for 138 independent genetic instruments were extracted from one of the latest and largest vitamin D meta-GWAS incorporating UKBB samples. Summary statistics for the IV-outcome associations were obtained from 190 large-scale GWAS spanning complex traits, diseases and biomarkers.
Methods
Data for IV-exposure
We extracted summary statistics for SNPs significantly associated with circulating 25(OH)D concentration from a meta-analysis of two vitamin D GWAS conducted by Manousaki et al.: a GWAS in UK Biobank including 401,460 white British participants, and an independent GWAS in 42,274 individuals of European ancestry13. The method for estimating SNP effects on vitamin D was described in detail in Manousaki et al.13. Briefly, a linear mixed effects model (implemented in the BOLT-LMM software)26 was used to estimate the effect of each SNP in the UK Biobank, which was then meta-analyzed with a previously published vitamin D GWAS using inverse variance weighted fixed effect meta-analysis. In both GWAS, 25(OH)D levels were first log-transformed and then standardized. Therefore, all MR effect estimates in this study correspond to 1 standard deviation increase in log-transformed vitamin D level. A total of 138 conditionally independent vitamin D associated SNPs in 69 independent loci were identified by GCTA-COJO v.1.91.1 using the meta-GWAS27, which collectively explained 4.9% of variation in vitamin D and 30% of the total SNP heritability (16.1%)13. We used these 138 SNPs in our primary analysis as IVs (Supplementary Table 1) without further LD clumping to achieve greater statistical power. We additionally performed a sensitivity analysis using the top SNP from each of the 69 independent loci as our secondary analysis to ensure strict independence between IVs.
Data for IV-outcome
We retrieved three collections of GWAS summary statistics from the IEU OpenGWAS Project (https://gwas.mrcieu.ac.uk/): ieu-a and ieu-b, which include GWAS summary statistics generated by various consortia, and ebi-a, which includes GWAS summary statistics imported from the NHGRI-EBI GWAS catalog (https://www.ebi.ac.uk/gwas/). A total of 765 GWAS were included, which represented a comprehensive coverage of human complex traits, diseases and biomarkers. We then filtered these outcome GWAS by retaining those with over 10,000 samples and 1,000,000 SNPs across the genome to ensure statistical power and SNP coverage. Furthermore, we only included the GWAS with the largest sample size if multiple GWAS are available for the same outcome, with the exception of multiple sclerosis (MS); we included four MS GWAS with varying sample sizes and genotyping platforms (genome-wide coverage vs. customized immunochip) to compare the MR causal effect estimates of vitamin D on MS. Finally, we manually curated summary statistics for 26 GWAS that were either not included in the IEU GWAS repository or had a larger sample size compared with the GWAS in the IEU repository for the same outcome. These data filtering and curation strategies yielded a total of 190 eligible outcome GWAS in our MR analysis. All GWAS were conducted in samples of European ancestry, with the exception of two cardiovascular GWAS, which were conducted in mixed populations including both European and Asian individuals. The descriptions, references and sources of these 190 GWAS are listed in Supplementary Table 2. These outcome GWAS span a broad phenotypic spectrum, including blood biomarkers; brain imaging measurements; lipid; ophthalmology; cancer; metabolic, cardiovascular, orthopedic, autoimmune, neurological and psychiatric traits and diseases; anthropometric traits; behavioral traits; sleep; cognition; personality; and reproductive traits. For each outcome, we retrieved the variant annotations (SNP rsID, chromosome, position, reference and alternate allele) and summary statistics (effect size, standard error, P-value, allele frequency [if available]) for each SNP instrument using the R package TwoSampleMR (https://mrcieu.github.io/TwoSampleMR/), and harmonized the effects and reference alleles between the vitamin D GWAS and outcome GWAS. For strand ambiguous (palindromic) SNPs, we matched the alleles based on minor allele frequencies (MAF) and discarded those with MAF > 0.42 following the default setting in the TwoSampleMR package. The sample sizes of these outcome GWAS ranged from 9,954 (for the manually curated, largest-to-date PTSD GWAS) to 1,030,836 (median 112,561; mean 148,179).
Statistical analysis
MR uses SNPs as IVs to estimate the causal effect of an exposure on an outcome while controlling for potential confounding, by leveraging the random allocation of alleles at conception. Three assumptions need to be satisfied to ensure a valid IV. The first is the relevance assumption, that is, IVs must be strongly associated with the exposure; the second assumption requires no association between IVs and confounders of the exposure-outcome relationship; and the third is the exclusion restriction assumption, which requires that IVs affect the outcome only through the exposure28,29. If all MR assumptions are satisfied, a causal effect can be estimated based on the observed IV-exposure and IV-outcome associations.
We conducted two-sample MR analyses to test for the causal relationship between circulating 25(OH)D and 190 outcomes. We applied three different MR methods, namely MR-PRESSO29, MR-Egger28, and a weighted median estimator approach30, to estimate causal effects. All three methods are robust to horizontal pleiotropy, i.e., an IV affects the outcome via a separate biological pathway from the exposure under investigation (violation of the exclusion restriction assumption), under the InSIDE (instrument strength independent of direct effect) assumption, and the assumption of at least 50% of the genetic variants being valid instruments (no horizontal pleiotropy). For the robustness of our findings, we considered an estimated causal effect to be significant if at least two out of the three methods showed significant false discovery rate (FDR) corrected P-values. In addition, we performed MR-PRESSO, MR-Egger regression, and modified Q and Q’ tests, to detect bias due to horizontal pleiotropy. MR-PRESSO implements a global test to evaluate the presence of horizontal pleiotropy, and an outlier test to detect specific SNP outliers29. In MR-Egger regression, a significant difference of the intercept from zero suggests the existence of average directional horizontal pleiotropy28. Modified Q and Q’ tests are traditionally used to identify over-dispersion and have been applied in the context of MR to detect outliers caused by horizontal pleiotropy31.
Ethics
Our study is a secondary analysis of existing, de-identified, summary-level GWAS data. Specific ethics for each GWAS examined in this study can be found in the original GWAS publications.
Results
As shown in Figs. 1, 2, 3 and Supplementary Table 3 and 5, in our primary analysis using 138 conditionally independent 25(OH)D-associated SNPs as IVs, we did not find a causal effect of vitamin D on the outcomes examined here, except for multiple sclerosis (MS) and white blood cell (basophil) count (accounting for the majority vote of three MR methods).
Using summary statistics from a study based on Immunochip conducted in 2013, we showed that increased 25(OH)D level decreased the risk of MS (MR-PRESSO odds ratio [OR] = 0.433; 95% confidence interval [CI] 0.313–0.599; P = 4.19 × 10–7; PFDR = 7.96 × 10–5). This result was consistent across different MR methods (MR-Egger: OR = 0.390 [0.258–0.588]; weighted median: OR = 0.397 [0.301–0.523]) and was in line with previous findings (Table 1a). Using an earlier MS GWAS with a smaller sample size (conducted in 2011, 2/3 of the sample size of the 2013 Immunochip study), we observed a similar protective effect, which survived multiple testing correction (MR-PRESSO OR = 0.719 [0.606–0.852]; P = 1.48 × 10–4; PFDR = 0.014). We had consistent findings using two other MS GWAS with greater statistical uncertainties (Patsopoulos et al. 2011 GWAS, MR-PRESSO OR = 0.705 [0.551–0.900], P = 0.005, PFDR = 0.149; Patsopoulos et al. 2019 GWAS, MR-PRESSO OR = 0.824 [0.689–0.986], P = 0.035, PFDR = 0.367) (Table 1a). Note that the pleiotropic bias tests suggest that there are various levels of pleiotropic bias for the vitamin D causal effect estimates on MS (Table 1b); however, the causal effect estimates reported here are all adjusted for potential pleiotropic bias. The secondary analysis using 69 top SNP instruments also showed consistent results to the primary analysis (Supplementary Table 4).
Table 1.
(a) Causal effect estimates of 25(OH)D on multiple sclerosis | |||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Outcome | Outcome GWAS sample size | Outcome GWAS N of SNP | N of SNP used in MR | MR-PRESSO | Median estimate | MR-Egger | |||||||||
Beta | 95% CI | P-value | FDR adjusted P-value | Beta | 95% CI | P-value | FDR adjusted P-value | Beta | 95% CI | P-value | FDR adjusted P-value | ||||
Multiple sclerosis (Patsopoulos N et al. 2011) | 17,698 | 7,395,163 | 66 | − 0.350 | (− 0.596 to − 0.105) | 0.005 | 0.149 | − 0.010 | (− 0.327 to 0.307) | 0.949 | 0.996 | − 0.412 | (− 0.742 to − 0.083) | 0.014 | 0.226 |
Multiple sclerosis (Sawcer et al. 2011) | 27,148 | 6,991,436 | 62 | − 0.330 | (− 0.501 to − 0.160) | 1.48E−04 | 0.014 | − 0.233 | (− 0.429 to − 0.036) | 0.021 | 0.436 | − 0.422 | (− 0.656 to − 0.189) | 3.89E− 04 | 0.031 |
Multiple sclerosis (Beecham et al. 2013 immunochip) | 38,589 | 1,082,242 | 24 | − 0.837 | (− 1.162 to − 0.513) | 4.19E−07 | 7.96E−05 | − 0.925 | (− 1.202 to − 0.648) | 6.33E−11 | 1.20E−08 | − 0.942 | (− 1.353 to − 0.531) | 6.90E− 06 | 0.001 |
Multiple sclerosis (Patsopoulos N et al. 2019) | 115,803 | 6,304,359 | 50 | − 0.193 | (− 0.372 to − 0.014) | 0.035 | 0.367 | − 0.192 | (− 0.365 to − 0.019) | 0.030 | 0.436 | − 0.234 | (− 0.495 to 0.028) | 0.080 | 0.559 |
(b) MR pleiotropic bias tests for 25(OH)D on multiple sclerosis | |||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Outcome | Pleiotropic bias tests | ||||||||||||||
MR-PRESSO global test P-value | MR-PRESSO N of outlier | Modified Q test P-value | Modified Q' test P-value | MR-Egger intercept test P-value | |||||||||||
Multiple sclerosis (Beecham et al. 2013 immunochip) | 0.029 | 1 | 0.031 | 0.033 | 0.339 | ||||||||||
Multiple sclerosis (Patsopoulos N et al. 2011) | 0.005 | 1 | 0.006 | 0.005 | 0.575 | ||||||||||
Multiple sclerosis (Sawcer et al. 2011) | 0.003 | 1 | 0.002 | 0.003 | 0.159 | ||||||||||
Multiple sclerosis (Patsopoulos N et al. 2019) | 0.003 | 1 | 8.15E−04 | 6.07E−04 | 0.811 |
In addition to MS, we also found white blood cell (basophil) count to be causally affected by vitamin D (MR-PRESSO =0.039, 95% CI 0.017–0.061, PFDR = 0.037) (Fig. 3). However, this result was not significant in our secondary (sensitivity) analysis using 69 top SNPs from independent genomic loci in the vitamin D GWAS (Supplementary Table 4). Also note that the pleiotropic bias tests were significant for the vitamin D causal effect estimates on basophil count (Supplementary Table 4). Further investigations are thus warranted to elucidate this suggestive finding.
Notably, we identified horizontal pleiotropy (as reflected by the MR-PRESSO global test, the MR-Egger intercept, and modified Q and Q’ tests) for 68% (130 out of 190) of the outcomes analyzed here (123 from MR-PRESSO, 25 from MR-Egger regression, 123 from the modified Q test, and 118 from the modified Q’ test) (Supplementary Table 3).
Lastly, we performed a secondary (sensitivity) analysis using a smaller set of IVs (the top SNP in each of the 69 distinct loci) to ensure the independence between the SNP instruments. As shown in Supplementary Table 4, results did not alter substantially by incorporating fewer yet strictly independent IVs. Most of the outcomes were not influenced by genetically predicted 25(OH)D except for mean corpuscular hemoglobin concentration, which, however, did not survive multiple testing correction in the primary analysis (Supplementary Table 3 and Fig. 3).
Discussion
In this study, we performed a comprehensive two-sample MR analysis using 25(OH)D-associated SNPs as genetic instruments and 190 outcome GWAS spanning a wide spectrum of human complex traits and diseases to determine the causal role of 25(OH)D. In line with prior research, we have successfully replicated a causal relationship between 25(OH)D and multiple sclerosis (MS). However, in general, our results, which were based on a markedly increased number of IVs (138 IVs vs. 4 or 6 IVs used by most existing MR studies of 25(OH)D) and largely augmented samples of outcome GWAS, showed limited evidence for a causal effect of 25(OH)D on the majority of the traits and diseases investigated here.
The causal role of 25(OH)D on MS has been shown in three independent MR studies, which consistently identified an association between genetically predicted 25(OH)D and a decreased risk of MS17–19. We successfully replicated this causal relationship using a set of updated and more powerful SNP instruments for vitamin D. As a larger number of vitamin D IVs increased the power of the MR analysis, we obtained a more accurate effect size estimate with narrower confidence intervals for the protective effect of 25(OH)D on MS. Although there seemed to be concrete evidence that genetically instrumented 25(OH)D is associated with MS, the causal effects appeared to be decreasing in magnitude as the sample size and SNP coverage increase in the outcome GWAS (50% decreased risk of MS using the 2013 Immunochip study with the smallest sample size; 30% decreased risk using the 2011 GWAS, and 18% decreased risk using the 2019 GWAS with the largest sample size). Noteworthily, our MR finding is based on adult MS GWAS and adult vitamin D GWAS. As a result, the protective effect of vitamin D on MS may only apply to adulthood intervention, although a previous study also suggested a beneficial role of vitamin D for childhood MS19. It is also likely that the protective effect of extra vitamin D on MS is most prominent during critical periods of development (e.g., childhood/adolescence), which cannot be tested by MR analysis based on GWAS using cross-sectional measures of vitamin D in adulthood. Therefore, the causal role of vitamin D in the prevention and intervention of MS needs to be further explored and dissected by future studies.
In addition to MS, a suggestive association was observed for genetically instrumented 25(OH)D and white blood cell (basophil) counts. However, this result needs to be interpreted with caution due to two reasons. First, this effect did not survive multiple testing correction in our secondary analysis using 69 top SNPs from independent loci identified in the vitamin D GWAS. Second, none of the other blood cell counts, including eosinophil, neutrophil, platelet and lymphocyte, appeared to be affected by genetically instrumented vitamin D (all P > 0.10). The inverse correlation of vitamin D with the circulating level of basophils has only been reported by one longitudinal cohort study involving 989 young adults32. Despite statistical significance, the correlation coefficient was close to zero indicating a weak effect. Further investigations are thus warranted to confirm this association.
Our null findings are in line with previous MR studies. An earlier MR analysis, which aggregated information from 21 adult cohorts with up to 42,024 participants to explore the causal relationship between vitamin D status and obesity, did not identify any effect of genetically instrumented 25(OH)D on BMI (P = 0.57)33. In our study, we expanded this outcome category by incorporating several adipose-related phenotypes in addition to BMI, such as body fat percentage, hip circumference, waist circumference, and waist-to-hip ratio (both BMI adjusted and unadjusted). Despite that we used the latest GWAS summary statistics of these outcomes with substantially augmented sample sizes, ranging from 10,255 in extreme waist-to-hip ratio to 315,347 in BMI, we did not find any evidence in support for a causal role of 25(OH)D in obesity-related traits. Likewise, earlier MR studies have reported null findings for most of the autoimmune diseases including inflammatory bowel disease (Crohn’s disease (P = 0.67), ulcerative colitis (P = 0.42))34, eczema (P = 0.27)35, lupus (P = 0.79) and rheumatoid arthritis (P = 0.66)36. Our results using summary statistics from greatly enlarged GWAS (for both exposure and outcomes) confirmed a null relationship between 25(OH)D and autoimmune diseases. As for cardiovascular and metabolic traits, previous MR studies did not identify any causal role of 25(OH)D in coronary artery disease37, myocardial infarction38, cholesterol levels including triglycerides and low-density lipoprotein (LDL) cholesterol39, type 2 diabetes40,41, fasting insulin and adiponectin42, all of which, were confirmed by our analyses. There are, however, some discrepancies. For example, an earlier study that incorporated 31,435 individuals and used 4 index SNPs from two 25(OH)D-associated genes found no association between 25(OH)D-lowering alleles and non-fasting remnant cholesterol or LDL cholesterol, but identified a 50% decrease in 25(OH)D levels to be genetically associated with 6.0% (P = 0.001) lower high-density lipoprotein (HDL) cholesterol levels39. The effect of 25(OH)D on HDL cholesterol was no longer significant in our analysis when using the summary statistics from a more recent HDL GWAS involving 94,595 individuals (a threefold increase in sample size) and a greatly increased number of IVs. Similar null associations were observed in cancers where our MR analysis did not convincingly support a causal effect of circulating 25(OH)D concentrations on the risk of breast or prostate cancer (P = 0.58 and 0.46), consistent with previous findings (P = 0.47 and 0.99)43. Finally, investigations on the causal relationships between 25(OH)D and neurological or psychiatric traits have been limited. So far, only a few MR studies explored the role of 25(OH)D in cognitive function44, major depression45, and Alzheimer’s disease20,21; and nominally significant findings have only been reported in Alzheimer’s disease. It might be worth noting that previous MR studies for Alzheimer’s disease used either IV-exposure association estimates from a smaller 25(OH)D GWAS (SUNLIGHT consortium 2010)11 or correlated IVs, which affected the validity of the results. In our updated analysis, the causal effect of 25(OH)D on Alzheimer’s disease attenuated to null (P = 0.08 using 2019 Alzheimer’s disease GWAS). Expanding the outcome list by incorporating additional neurological or psychiatric traits also did not identify any significant causal effects. These results are important in informing the ongoing RCT of vitamin D on cognitive change (ClinicalTrials.gov: NCT03613116).
To ensure the validity of MR results, several important assumptions need to be satisfied. First, the relevance assumption, that IVs must be strongly associated with the exposure, is guaranteed by selecting independent 25(OH)D-associated SNPs with genome-wide significance as IVs. The second assumption, that IVs are not associated with confounders of the exposure-outcome relationship, is reassured by the negligible associations between 25(OH)D and various traits and diseases from the current and past MR studies, many of which act as potential confounders for other outcomes (e.g., obesity-related traits). Third, the exclusion restriction assumption requires IVs to affect the outcome only through the exposure (no horizontal pleiotropy). We employed several methods to test and control for horizontal pleiotropy. In particular, MR-Egger regression is designed to detect average pleiotropic bias, whereas the MR-PRESSO global test and modified Q and Q’ tests can detect bias caused by individual outlier SNPs with horizontal pleiotropy. While 68% of the outcomes analyzed here showed significant pleiotropy, we presented MR causal estimates adjusted for pleiotropic bias using three different methods. Notably, the causal effect of vitamin D on MS was also subject to potential pleiotropic bias; however, the effect estimates were significant using multiple different bias-adjusted MR methods.
Our study has several limitations. Sample overlap is a potential concern for our MR analyses, which used genetic instruments derived from the UK Biobank data, given that many recent large-scale outcome GWAS also included UK Biobank samples. Two-sample MR analysis assumes no sample overlap between exposure and outcome GWAS, and the expected covariance of sampling error between the genetic effects on the exposure and the outcome is assumed to be zero. Violating the assumption of no sample overlap in two-sample MR analyses will lead to inflated type I error (false positive) rate for hypothesis testing on the causal effect due to ignoring non-zero covariance of sampling errors46. However, since the majority of our MR results were null, the potentially inflated type I error rate does not change the interpretation of our results. Another limitation of this study is the potential bias in causal effect size estimates due to residual population stratification in the exposure and/or outcome GWAS. However, the vitamin D GWAS we used to extract SNP instruments showed minimal inflation due to population stratification (LD score regression intercept = 1.06; also see Figure S3 in Manousaki et al.)13. In addition, we leveraged the IEU OpenGWAS Project to extract SNP association statistics from outcome GWAS, which has extensively curated the GWAS summary statistics, with quality control (QC) reports publicly available (as an example, see: https://gwas.mrcieu.ac.uk/files/ieu-a-8/ieu-a-8_report.html). With that being said, we still rely on the quality of the original GWAS for our MR analyses and cannot completely exclude the possibility of biased GWAS association estimates due to insufficient control of population stratification or other suboptimal study designs or analysis strategies, which may bias our MR analysis. Finally, although we leveraged three different MR methods that are robust to pleiotropic bias, all these methods rely on the Instrument Strength Independent of Direct Effect (InSIDE) condition/assumption. Given this limitation, we were not able to completely rule out the possibility that InSIDE condition was violated in some of our analyses, which may result in biased results. However, the three MR methods we used in this study adjust for pleiotropic biases through three different routes, namely pleiotropic bias outliers removal (MR-PRESSO), controlling for average pleiotropic bias (MR-Egger), and robust estimation in the presence of pleiotropic bias outliers (median estimate), and are thus complementary and can provide strong evidence of causal effects when their inferences converge, despite the limitation of the InSIDE condition. Several new methods, such as MR-RAPS and CAUSE47,48, have been developed, which do not rely on the InSIDE condition. However, these methods require genome-wide GWAS summary statistics, which are not available for some of the exposure and outcome GWAS we examined here, and therefore cannot be applied in a uniform way to investigate the causal effects of vitamin D on the outcomes included in this study.
In summary, our large-scale MR analysis showed limited evidence of the causal effects of vitamin D on a wide range of human complex traits and diseases, except for multiple sclerosis. Our results may inform future RCTs and help to prioritize the most promising target diseases for vitamin D intervention.
Supplementary Information
Acknowledgements
We thank the various genomic consortia and the IEU OpenGWAS Project for generously sharing the genome-wide association summary statistics. This work was supported by the Swedish Research Council [Vetenskapsrådet International Postdoc grant 2015-06522; starting grant 2018-02247], Swedish Research Council for Health, Working life and Welfare [FORTE 2020-00884] (X.J.) and the National Institute on Aging at the National Institutes of Health [Grant number K99/R00AG054573] (T.G.). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Author contributions
X.J. and C.-Y.C. contributed to study conception, data analysis, interpretation of the results and drafting of the manuscript. T.G. contributed to interpretation of the results and critical revision of the manuscript.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Xia Jiang, Email: xiajiang@hsph.harvard.edu.
Chia-Yen Chen, Email: chiayenc@gmail.com.
Supplementary Information
The online version contains supplementary material available at 10.1038/s41598-020-80655-w.
References
- 1.Doppelt SH. Vitamin D, rickets, and osteomalacia. Orthop. Clin. North Am. 1984;15:671–686. [PubMed] [Google Scholar]
- 2.Scaranti, M., Júnior, G. de C. & Hoff, A. O. Vitamin D and cancer: does it really matter? Curr. Opin. Oncol.28, 205 (2016). [DOI] [PubMed]
- 3.Altieri B, et al. Does vitamin D play a role in autoimmune endocrine disorders? A proof of concept. Rev. Endocr. Metab. Disord. 2017;18:335–346. doi: 10.1007/s11154-016-9405-9. [DOI] [PubMed] [Google Scholar]
- 4.Gois PHF, Ferreira D, Olenski S, Seguro AC. Vitamin D and infectious diseases: Simple bystander or contributing factor? Nutrients. 2017;9:2. doi: 10.3390/nu9070651. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Al Mheid I, Quyyumi AA. Vitamin D and cardiovascular disease: Controversy unresolved. J. Am. Coll. Cardiol. 2017;70:89–100. doi: 10.1016/j.jacc.2017.05.031. [DOI] [PubMed] [Google Scholar]
- 6.Issa CM. Vitamin D and type 2 diabetes mellitus. Adv. Exp. Med. Biol. 2017;996:193–205. doi: 10.1007/978-3-319-56017-5_16. [DOI] [PubMed] [Google Scholar]
- 7.Manson JE, et al. The VITamin D and OmegA-3 TriaL (VITAL): Rationale and design of a large randomized controlled trial of vitamin D and marine omega-3 fatty acid supplements for the primary prevention of cancer and cardiovascular disease. Contemp. Clin. Trials. 2012;33:159–171. doi: 10.1016/j.cct.2011.09.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Neale RE, et al. The D-health trial: A randomized trial of vitamin D for prevention of mortality and cancer. Contemp. Clin. Trials. 2016;48:83–90. doi: 10.1016/j.cct.2016.04.005. [DOI] [PubMed] [Google Scholar]
- 9.Manson JE, et al. Vitamin D supplements and prevention of cancer and cardiovascular disease. N. Engl. J. Med. 2019;380:33–44. doi: 10.1056/NEJMoa1809944. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Okereke OI, et al. Effect of long-term vitamin D3 supplementation vs placebo on risk of depression or clinically relevant depressive symptoms and on change in mood scores: A randomized clinical trial. JAMA. 2020;324:471–480. doi: 10.1001/jama.2020.10224. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Wang TJ, et al. Common genetic determinants of vitamin D insufficiency: A genome-wide association study. Lancet. 2010;376:180–188. doi: 10.1016/S0140-6736(10)60588-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Jiang X, et al. Genome-wide association study in 79,366 European-ancestry individuals informs the genetic architecture of 25-hydroxyvitamin D levels. Nat. Commun. 2018;9:260. doi: 10.1038/s41467-017-02662-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Manousaki D, et al. Genome-wide association study for vitamin D levels reveals 69 independent loci. Am. J. Hum. Genet. 2020;106:327–337. doi: 10.1016/j.ajhg.2020.01.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Revez JA, et al. Genome-wide association study identifies 143 loci associated with 25 hydroxyvitamin D concentration. Nat. Commun. 2020;11:1647. doi: 10.1038/s41467-020-15421-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Davey Smith G, Ebrahim S. ‘Mendelian randomization’: Can genetic epidemiology contribute to understanding environmental determinants of disease? Int. J. Epidemiol. 2003;32:1–22. doi: 10.1093/ije/dyg070. [DOI] [PubMed] [Google Scholar]
- 16.VanderWeele TJ, Tchetgen Tchetgen EJ, Cornelis M, Kraft P. Methodological challenges in mendelian randomization. Epidemiology. 2014;25:427–435. doi: 10.1097/EDE.0000000000000081. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Mokry LE, et al. Vitamin D and risk of multiple sclerosis: A mendelian randomization study. PLOS Med. 2015;12:e1001866. doi: 10.1371/journal.pmed.1001866. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Rhead B, et al. Mendelian randomization shows a causal effect of low vitamin D on multiple sclerosis risk. Neurol. Genet. 2016;2:e97. doi: 10.1212/NXG.0000000000000097. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Gianfrancesco MA, et al. Evidence for a causal relationship between low vitamin D, high BMI, and pediatric-onset MS. Neurology. 2017;88:1623–1629. doi: 10.1212/WNL.0000000000003849. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Mokry LE, et al. Genetically decreased vitamin D and risk of Alzheimer disease. Neurology. 2016;87:2567–2574. doi: 10.1212/WNL.0000000000003430. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Larsson SC, Traylor M, Markus HS, Michaëlsson K. Serum parathyroid hormone, 25-hydroxyvitamin D, and risk of Alzheimer’s disease: A mendelian randomization study. Nutrients. 2018;10:2. doi: 10.3390/nu10091243. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Vimaleswaran KS, et al. Association of vitamin D status with arterial blood pressure and hypertension risk: A mendelian randomisation study. Lancet Diabetes Endocrinol. 2014;2:719–729. doi: 10.1016/S2213-8587(14)70113-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Afzal S, Brøndum-Jacobsen P, Bojesen SE, Nordestgaard BG. Genetically low vitamin D concentrations and increased mortality: Mendelian randomisation analysis in three large cohorts. BMJ. 2014;349:g6330. doi: 10.1136/bmj.g6330. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Ong J-S, et al. Association of vitamin D levels and risk of ovarian cancer: A Mendelian randomization study. Int. J. Epidemiol. 2016;45:1619–1630. doi: 10.1093/ije/dyw207. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Dimitrakopoulou VI, et al. Circulating vitamin D concentration and risk of seven cancers: Mendelian randomisation study. BMJ. 2017;359:j4761. doi: 10.1136/bmj.j4761. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Loh P-R, Kichaev G, Gazal S, Schoech AP, Price AL. Mixed-model association for biobank-scale datasets. Nat. Genet. 2018;50:906–908. doi: 10.1038/s41588-018-0144-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Yang J, Hong Lee S, Goddard ME, Visscher PM. GCTA: A tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 2011;88:76–82. doi: 10.1016/j.ajhg.2010.11.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Bowden J, Davey Smith G, Burgess S. Mendelian randomization with invalid instruments: Effect estimation and bias detection through Egger regression. Int. J. Epidemiol. 2015;44:512–525. doi: 10.1093/ije/dyv080. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Verbanck M, Chen C-Y, Neale B, Do R. Detection of widespread horizontal pleiotropy in causal relationships inferred from Mendelian randomization between complex traits and diseases. Nat. Genet. 2018;50:693–698. doi: 10.1038/s41588-018-0099-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Bowden J, Smith GD, Haycock PC, Burgess S. Consistent estimation in Mendelian randomization with some invalid instruments using a weighted median estimator. Genet. Epidemiol. 2016;40:304–314. doi: 10.1002/gepi.21965. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Bowden J, et al. Improving the accuracy of two-sample summary-data Mendelian randomization: Moving beyond the NOME assumption. Int. J. Epidemiol. 2019;48:728–742. doi: 10.1093/ije/dyy258. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Hollams EM, et al. Vitamin D and atopy and asthma phenotypes in children: a longitudinal cohort study. Eur. Respir. J. 2011;38:1320–1327. doi: 10.1183/09031936.00029011. [DOI] [PubMed] [Google Scholar]
- 33.Vimaleswaran KS, et al. Causal relationship between obesity and vitamin D status: bi-directional Mendelian randomization analysis of multiple cohorts. PLoS Med. 2013;10:e1001383. doi: 10.1371/journal.pmed.1001383. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Lund-Nielsen J, et al. Vitamin D and inflammatory bowel disease: Mendelian randomization analyses in the copenhagen studies and UK biobank. J. Clin. Endocrinol. Metab. 2018;103:3267–3277. doi: 10.1210/jc.2018-00250. [DOI] [PubMed] [Google Scholar]
- 35.Manousaki D, et al. Vitamin D levels and susceptibility to asthma, elevated immunoglobulin E levels, and atopic dermatitis: A Mendelian randomization study. PLOS Med. 2017;14:e1002294. doi: 10.1371/journal.pmed.1002294. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Bae S-C, Lee YH. Vitamin D level and risk of systemic lupus erythematosus and rheumatoid arthritis: A Mendelian randomization. Clin. Rheumatol. 2018;37:2415–2421. doi: 10.1007/s10067-018-4152-9. [DOI] [PubMed] [Google Scholar]
- 37.Manousaki D, Mokry LE, Ross S, Goltzman D, Brent RJ. Mendelian randomization studies do not support a role for vitamin D in coronary artery disease. Circ. Cardiovasc. Genet. 2016;9:349–356. doi: 10.1161/CIRCGENETICS.116.001396. [DOI] [PubMed] [Google Scholar]
- 38.Brondum-Jacobsen P, Benn M, Afzal S, Nordestgaard BG. No evidence that genetically reduced 25-hydroxyvitamin D is associated with increased risk of ischaemic heart disease or myocardial infarction: A Mendelian randomization study. Int. J. Epidemiol. 2015;44:651–661. doi: 10.1093/ije/dyv078. [DOI] [PubMed] [Google Scholar]
- 39.Ooi EM, Afzal S, Nordestgaard BG. Elevated remnant cholesterol in 25-hydroxyvitamin D deficiency in the general population. Circ. Cardiovasc. Genet. 2014;7:650–658. doi: 10.1161/CIRCGENETICS.113.000416. [DOI] [PubMed] [Google Scholar]
- 40.Afzal S, Brøndum-Jacobsen P, Bojesen SE, Nordestgaard BG. Vitamin D concentration, obesity, and risk of diabetes: A mendelian randomisation study. Lancet Diabet. Endocrinol. 2014;2:298–306. doi: 10.1016/S2213-8587(13)70200-6. [DOI] [PubMed] [Google Scholar]
- 41.Lu L, et al. Association of vitamin D with risk of type 2 diabetes: A Mendelian randomisation study in European and Chinese adults. PLOS Med. 2018;15:e1002566. doi: 10.1371/journal.pmed.1002566. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Husemoen LLN, et al. Investigating the causal effect of vitamin D on serum adiponectin using a mendelian randomization approach. Eur. J. Clin. Nutr. 2014;68:189–195. doi: 10.1038/ejcn.2013.233. [DOI] [PubMed] [Google Scholar]
- 43.Jiang X, et al. Circulating vitamin D concentrations and risk of breast and prostate cancer: A Mendelian randomization study. Int. J. Epidemiol. 2019;48:1416–1424. doi: 10.1093/ije/dyy284. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Maddock J, et al. Vitamin D and cognitive function: A Mendelian randomisation study. Sci. Rep. 2017;7:13230. doi: 10.1038/s41598-017-13189-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Michaëlsson K, Melhus H, Larsson S. Serum 25-hydroxyvitamin D concentrations and major depression: A Mendelian randomization study. Nutrients. 2018;10:1987. doi: 10.3390/nu10121987. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Burgess S, Davies NM, Thompson SG. Bias due to participant overlap in two-sample Mendelian randomization. Genet. Epidemiol. 2016;40:597–608. doi: 10.1002/gepi.21998. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Zhao Q, Wang J, Hemani G, Bowden J, Small DS. Statistical inference in two-sample summary-data Mendelian randomization using robust adjusted profile score. Ann. Stat. 2020;48:1742–1769. doi: 10.1214/19-AOS1866. [DOI] [Google Scholar]
- 48.Morrison J, Knoblauch N, Marcus JH, Stephens M, He X. Mendelian randomization accounting for correlated and uncorrelated pleiotropic effects using genome-wide summary statistics. Nat. Genet. 2020;52:740–747. doi: 10.1038/s41588-020-0631-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.