Abstract
We aimed to increase our understanding of the genetic determinants of vitamin D levels by undertaking a large-scale genome-wide association study (GWAS) of serum 25 hydroxyvitamin D (25OHD). To do so, we used imputed genotypes from 401,460 white British UK Biobank participants with available 25OHD levels, retaining single-nucleotide polymorphisms (SNPs) with minor allele frequency (MAF) > 0.1% and imputation quality score > 0.3. We performed a linear mixed model GWAS on standardized log-transformed 25OHD, adjusting for age, sex, season of measurement, and vitamin D supplementation. These results were combined with those from a previous GWAS including 42,274 Europeans. In silico functional follow-up of the GWAS results was undertaken to identify enrichment in gene sets, pathways, and expression in tissues, and to investigate the partitioned heritability of 25OHD and its shared heritability with other traits. Using this approach, the SNP heritability of 25OHD was estimated to 16.1%. 138 conditionally independent SNPs were detected (p value < 6.6 × 10−9) among which 53 had MAF < 5%. Single variant association signals mapped to 69 distinct loci, among which 63 were previously unreported. We identified enrichment in hepatic and lipid metabolism gene pathways and enriched expression of the 25OHD genes in liver, skin, and gastrointestinal tissues. We observed partially shared heritability between 25OHD and socio-economic traits, a feature which may be mediated through time spent outdoors. Therefore, through a large 25OHD GWAS, we identified 63 loci that underline the contribution of genes outside the vitamin D canonical metabolic pathway to the genetic architecture of 25OHD.
Keywords: vitamin D, genome-wide association studies, rare variants, pathway analysis, genetic correlation, partitioned heritability, gene-environment interaction
Introduction
Vitamin D status, as ascertained by 25-hydroxy-vitamin D level (25OHD), is associated with numerous health outcomes.1 However, it is unclear whether lowered 25OHD level plays a causal role in these outcomes and its exact biological mechanisms of action remains unknown.2,3 25OHD is a steroid pro-hormone and a fat-soluble metabolite of cholecalciferol, which is predominately synthesized by exposure to ultraviolet light or obtained from dietary sources including fortified foods, supplements, and oily fish. It plays an important role in regulating calcium and phosphorus concentrations, influences cell proliferation, differentiation, and apoptosis, and has immune-modulating effects.4 Understanding the etiology of low vitamin D levels could have important public health implications by prioritizing individuals who would benefit from supplementation. The body’s vitamin D stores are best reflected by serum 25OHD which is influenced not only by diet and exposure to ultraviolet light, but also by age, body mass index, skin color, and numerous factors regulating exposure to ultraviolet B radiation (including season, geographical latitude, skin coverage).5,6 In addition to these environmental factors, classical twin studies show that 50%–80% of the variability in the concentration of 25OHD is explained by genetic factors7,8 indicating that this is a highly heritable trait.
In recent years, several genome-wide association studies (GWASs) of serum 25OHD have been conducted on participants of Europeans ancestry, with the largest including 79,366 individuals.9 These studies have identified six common genetic variants (minor allele frequency [MAF] > 5%) that are associated with 25OHD level.9, 10, 11, 12 These variants are in loci near genes having an established role in vitamin D synthesis (DHCR7/NADSYN1 [MIM: 602858] [rs12785878] and CYP2R1 [MIM: 608713] [rs10741657]), transportation (GC [MIM: 139200] [rs2282679]), and degradation (CYP24A1 [MIM: 126065] [rs17216707]), as well as outside of known vitamin D metabolism pathways, such as SEC23A (Sec23 homolog A, coat protein complex II component [MIM: 610511] [rs8018720]), involved in endoplasmic reticulum (ER)-Golgi protein trafficking, and AMDHD1 (amidohydrolase domain containing 1 [rs10745742]) an enzyme involved in the histidine, lysine, phenylalanine, tyrosine, proline, and tryptophan catabolic pathway.9 Additionally, a low-frequency genetic variant (MAF < 5%) at CYP2R1 (rs117913124), with a 4-fold larger effect than common variants at that locus, was identified through whole-genome sequencing and deep imputation for low-frequency and rare variants.12
An improved understanding of the genetic determinants of 25OHD has helped re-assess the role of vitamin D in the etiology of complex diseases, such as musculoskeletal disorders,1 autoimmune disease such as multiple sclerosis,13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23 and cancer,24 through methods for causal inference, such as Mendelian randomization (MR).25,26 For example, four separate MR studies have supported a protective effect of vitamin D against multiple sclerosis,12, 13, 14,27 and these results have clinical implications, reflected in recent clinical care guidelines for the use of vitamin D in preventing multiple sclerosis in those at risk, published by the MS Society of Canada.28 More than 60 MR studies have been published to date utilizing genetic variants associated with 25OHD to aid causal effect estimation.29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46 A deeper understanding of the genetic determinants contributing to variation in circulating vitamin D levels could enable an improved instrumentation of vitamin D in MR studies, allow better genomic prediction of vitamin D levels and provide insights into biological mechanisms.
Although the most recent 25OHD GWAS study on 79,366 Europeans9 had doubled the sample size of the previous GWASs, it yielded only two new 25OHD loci (SEC23A and AMDHD1), indicating that 25OHD may be a metabolite with a moderately polygenic architecture. In the same study, little of the 25OHD heritability estimated using all common SNPs was explained (SNP heritability of 7.5%), suggesting that much of its heritability remains to be identified. Against this backdrop, we sought to further understand the phenotypic variance explained by genetic variants and to investigate the genetic architecture of 25OHD by substantially increasing the GWAS sample size.
We hypothesized that we could identify new genes encoding enzymes, or carrier proteins affecting the levels of this metabolite, unveiling a more polygenic architecture. We therefore undertook a GWAS of serum 25OHD levels in 401,460 white British individuals from UK Biobank and combined results of this GWAS in a meta-analysis with results from a previous GWAS study including up to 42,274 Europeans. Using this approach, we validated previously described 25OHD loci and identified genetic determinants of vitamin D. To gain further insight into the genetic control of the vitamin D metabolic pathway, we looked for overlap of our findings with those of an unpublished GWAS on 1,25-dihydroxyvitamin D, the active form of vitamin D, which is downstream of 25OHD in the vitamin D metabolic pathway (Figure 1). We assessed the identified lead 25OHD variants for interaction with season of 25OHD measurement. Finally, we undertook an in silico functional follow-up of our GWAS findings to identify enrichments in gene sets, pathways, and expression in tissues and explore the partitioned heritability of 25OHD and its shared genetic architecture with other GWAS traits.
Figure 1.
The Vitamin D Metabolic Pathway
Material and Methods
Phenotypes
Between 2006 and 2010 approximately half a million British adults were recruited by UK Biobank.47 Participants provided biological samples, physical measurements, and answered questionnaires relating to general health and lifestyle. Ethical approval was granted by the Northwest Multi-Centre Research Ethics Committee, and informed consent was obtained from all participants prior to participation.
Data on 25OHD level (in nmol/L) measured using the Diasorin assay were available from 465,415 samples, representing 449,978 UK Biobank participants. Measurements were performed at baseline (2006–2010) and/or the first follow-up visit (2012–2013). In the present study, we used baseline 25OHD measurements from 401,460 individuals from the white British subset of UK Biobank, as defined below. To account for vitamin D supplement use, we adjusted 25OHD levels by subtracting 21.2 nmol/L from the 25OHD measurement in 24,874 vitamin D supplement users, representing 6% of our study cohort (see Supplemental Material and Methods for definition of vitamin D supplementation). We used 21.2 nmol/L because it is the mean increase in 25OHD levels conferred by taking daily 400 IU of cholecalciferol, the amount of vitamin D most often found in vitamin D supplements.48 In 3,057 participants treated with vitamin D supplements, 25OHD levels were lower than 10 nmol/L (the detection threshold for Diasorin assay) after subtraction, and thus they were set to 10 nmol/L. 25OHD levels were then log transformed and standardized to a mean of 0 and standard deviation of 1 (because of skewness in the distribution of 25OHD levels and to allow comparison with previous 25OHD GWAS). Distribution of the 25OHD levels appears in Figure S1.
GWAS
After stringent quality control, the UK Biobank genotypes, imputed to the combined Haplotype Reference Consortium (HRC)49 and UK10K haplotype resource panel, provided 20,370,874 genetic variants from the autosomes and the X chromosome to test for their association with 25OHD levels. This quality control removed low-quality genetic variants by retaining only SNPs with a minor allele frequency (MAF) > 0.1%, imputation quality score of > 0.3, and Hardy-Weinberg p > 1 × 10−6. For details on genotyping and imputation in UK Biobank, see the Supplemental Material and Methods.
To minimize bias from population stratification, an issue which is particularly relevant in the search for rare genetic variants associated with traits and disease,50 analysis was restricted to individuals of white British ancestry, which comprises the largest single ancestral group represented in the UK Biobank. It is important to distinguish between the self-identified “white British” in UK Biobank and the white British subset used in our analysis, where the latter was defined using a principal component analysis. Specifically, we previously defined this white British subset using high-quality genotypes, employing FlashPCA51 and linkage-disequilibrium-pruned HapMap3 SNPs (MAF > 1%, minor allele count > 5, Hardy-Weinberg equilibrium p > 1 × 10−6), which were projected onto previously computed principal components using the same SNPs set from 1000 Genomes Phase 3 dataset (n = 2,504).52 Henceforth, whenever the term “white British” appears in this paper, it refers to the white British subset defined as above. Details on this analysis are provided in the Supplemental Material and Methods. Descriptive statistics of this white British subset of UK Biobank are detailed in Table S1.
We then tested the additive allelic effects of SNPs on 25OHD levels, using a linear mixed-model in the BOLT-LMM software.53 The model-fitting was performed on hard-called genotypes from 488,377 participants consisting of 803,113 SNPs. Age, sex, season of 25OHD measurement (as a categorical variable; 1 for winter [January to March], 2 for spring [April to June], 3 for summer [July to September], and 4 for fall [October to December]), genotype batch, genotype array, and assessment center (as a proxy for latitude) were included as covariates in the BOLT-LMM. We have previously estimated that 6.6 × 10−9 is an appropriate p value threshold for genome-wide significance for analyzing data from the UK Biobank using the above criteria, accounting for multiple testing.52
Meta-analysis
We compared the results of the GWAS on UK Biobank to those of a previous 25OHD GWAS published by our group (n = 42,274 samples of European ancestry),12 by performing Pearson correlation of the betas of all variants with p values < 1 × 10−6 in both GWASs using the “cor.test” function in R. We then combined the summary-level results of the two GWASs in an inverse variance weighted fixed effects meta-analysis, using the GWAMA54 software. Of note, in both GWASs, 25OHD levels were first log-transformed and then standardized to a mean of 0 and a standard deviation of 1. This approach allowed the inverse variance weighted meta-analysis of the results. 25OHD levels in both GWASs were adjusted for age, sex, genotyping center, and season of measurement. In the earlier GWASs,12 25OHD levels were adjusted for BMI. Since BMI is a heritable trait, we elected not to adjust for it in the UK Biobank GWAS, to avoid introducing collider bias. Also, in the present GWAS on UK Biobank, 25OHD measures were adjusted for vitamin D supplementation, since this information was available for all participants, contrarily to the earlier 25OHD GWAS.
Approximate Conditional Association Analysis
To identify conditionally independent SNPs from this meta-analysis, we used GCTA-COJO v.1.91.1,55,56 which conditions upon the lead SNP per locus by approximating the genotype-phenotype covariance with correlation matrices and summary statistics (Supplemental Material and Methods). Variants with high collinearity (multiple regression R2 > 0.9) were excluded, and those situated more than 20,000 pairs away were assumed to be independent. A reference sample of 50,000 unrelated white British individuals randomly selected from the UK Biobank was created for a previous GWAS52 and was used to model patterns of linkage disequilibrium (LD) between variants. We retained as conditionally independent variants those reaching a genome-wide significant p value pre- and post-conditioning and with at least one genome-wide significant satellite SNP within 250,000 pairs. These variants were then positionally and functionally annotated to the physically closest gene using the hg19 gene range list and the Variant Effect Predictor57 as implemented in PhenoScanner v2.58
Estimation of Variance Explained by Significant Variants and SNP Heritability
We estimated the proportion of 25OHD phenotypic variance tagged by all SNPs on the genotyping array (that is, the SNP heritability) using BOLT-REML function53 in the UK Biobank GWAS. To estimate the variance explained by independent genome-wide significant SNPs (that is, all the genome-wide significant conditionally independent lead SNPs), we summed the variance explained per independent SNP using the formula: variance explained ≈2β 2 ƒ(1– ƒ), where β and ƒ denote the effect estimate and the effect allele frequency of the allele on a standardized phenotype, respectively.59
Interaction Analysis with Season
25OHD levels are affected by the season of their measurement, which is a proxy for exposure to UVB. To assess whether there is an effect modification of the 25OHD SNPs by season, we undertook an interaction analysis of our conditionally independent lead SNPs with season of 25OHD assessment in UK Biobank. First, we visually inspected the mean 25OHD concentrations per season (Figure S2), and we selected two discrete seasons in order to optimize the comparisons between seasons with higher and lower mean 25OHD levels: “winter” individuals assessed January-March (n = 98,674) and “summer” individuals assessed July-September (n = 95,135). Individuals with vitamin D levels assessed in spring (April-June) and fall (October-December) were not included in these analyses. Linear regression was conducted under an additive genetic model. The following variables and co-variables were included in the model: standardized log-transformed serum 25OHD adjusted for vitamin D supplementation as the dependent variable; SNP genotype (coded as 0, 1, or 2) as an independent variable; SNP (genotype)∗ season of 25OHD measurement (coded as a binary variable: 0 for winter and 1 for summer) as an interaction term; and age, sex, and season of 25OHD measurement as covariates. p values below a Bonferroni-corrected threshold (0.05/number of COJO-independent SNPs tested for interaction) for the interaction term implied a significant interaction between season and the tested SNP.
Assessment of Inflationary Bias in GWAS Results
By estimating the lambda GC and the LD score regression (LDSR) intercept, BOLT-LMM software estimated the amount of genomic inflation present in the data that was due to residual population stratification, cryptic relatedness, and other latent sources of bias in the UK Biobank GWAS. We used the lambda GC from GWAMA to estimate the genomic inflation in the meta-analysis of the UK Biobank GWAS and compared this with the previous GWAS meta-analysis.12
In Silico Functional Follow-up
Functional follow-up of the meta-analysis summary statistics was performed using Complex Trait Genomic-Virtual Lab60 web application, which implements a variety of follow-up methods for GWAS summary statistics output from the COJO analysis (Supplemental Material and Methods). In brief, association between predicted gene transcription and 25OHD was estimated using S-MultiXcan61 in the MetaXcan package with the default options implemented. Association statistics for the 48 tissues were combined accounting for correlation between tissues to give transcript-level results, and a Bonferroni correction was applied to account for the number of gene transcripts tested. Gene prioritization and gene set and tissue enrichment analysis were performed using DEPICT (Data-driven Expression-Prioritized Integration for Complex Traits)62,63 to identify likely causal genes at associated loci, highlight gene pathways which are over-represented by associated loci in the single variant results, and test whether expression of these genes is enriched in specific tissue types. Genetic correlation between 25OHD and a range of other traits available as publicly available GWAS summary statistics was examined using bivariate LDSR64 implemented in the LD Hub platform.65 Finally, partitioned heritability by functional annotation with 53 overlapping categories was performed using stratified LDSR using the baseline model from 1000 Genomes phase 3 data (baselineLD_v2.2, February 2019).64,66 Cell-specific heritability was examined using the --h2-cts flag in LDSR and the multi-tissue gene expression file (“Multi_tissue_gene_expr” containing both GTEx data and Franke lab dataset of microarray gene expression).65 These final two analyses were restricted to common variants present in HapMap3 (approximately 1,500,000 SNPs), excluding those within the HLA region defined as Chr6: 25,000,000 to 34,000,000 bases inclusive.
GWAS on 1,25-dihydroxyvitamin D
Study Participants, Genotyping, and Imputation
The Ely Study, established in 1990, is a prospective study of the etiology of type 2 diabetes and has been described in detail elsewhere.67,68 We studied Ely participants with measures of 1,25-dihydroxyvitamin D to estimate genetic effects the active form of vitamin D. Briefly, Ely comprises individuals of European ancestry aged 40–69 years, registered at a single medical practice in Ely, Cambridgeshire, UK and evaluated in three phases. All participants of the Ely Study gave their written informed consent and the study was approved by the local ethics committee. Participants at phase 3 were genotyped using the HumanCoreExome-24 and InfiniumCoreExome arrays. Details of the genotype quality control appear in Supplemental Material and Methods. A total of 1,591 samples and 546,486 variants met the quality-control criteria. Imputation was performed using the Sanger Imputation Server (pre-phase with EAGLE2 and impute with PBWT pipeline) and the HRC 1.1 reference panel.49 Additional variants not captured by the HRC reference panel were imputed using a combined UK10K and 1000 Genomes Phase 3 reference panel resulting in data available for >14 million variants.
1,25-dihydroxyvitamin D Phenotype and Look-up for the 25OHD Conditionally Independent SNPs
Phase 1 1,25-dihydroxyvitamin D levels and genetic data were available for 748 Ely participants. Levels of 1,25-dihydroxyvitamin D were natural log transformed before regressing with the inclusion of age, sex, body mass index, and season as covariates. Residuals from the regression were standardized and used as the final 1,25-dihydroxyvitamin D phenotype. Genetic association analysis was performed for the conditionally independent variants from the 25OHD GWAS meta-analysis using SNPTEST v2.5.4-beta3.69 Bonferroni adjustment was applied to association test p values such that variants with GWAS p values < 4.10 × 10−4 (0.05/122) were considered to meet the corrected significance threshold.
Results
GWAS for 25OHD Levels
The GWAS in UK Biobank included 401,460 participants and 20,370,874 variants. The genomic control lambda in BOLT-LMM was 1.23, and the LDSR intercept was 1.06 (Figure S3). We found a strong correlation between the effect sizes of the UK Biobank GWAS with our previous GWAS meta-analysis.12 Specifically, we compared the betas of 20,787 SNPs achieving p values < 1 × 10−6 in both GWASs (minimum MAF 0.3%) and found a coefficient of correlation (r) of 0.88 (Figure S4). We then performed a meta-analysis of the two GWASs on a total of 16,668,957 SNPs (Figure 2). The lambda GC of the meta-analysis was 1.23. Using approximate conditional analysis as implemented by GCTA-COJO, we observed 138 conditionally independent signals (pre- and post-conditioning p value < 6.6 × 10−9), mapping to 69 loci (a locus was defined as 1 Mb region around the SNP reaching the lowest p value), 63 of which were not reported in previous 25OHD GWASs (Table S2). Of these conditionally independent SNPs, 53 (38%) had MAF < 5% and 85 (62%) were common (MAF ≥ 5%). The 53 SNPs with MAF < 5% conferred an average absolute effect of 0.23 standard deviations on standardized log transformed 25OHD levels per effect allele, compared to 0.03 standard deviations of the 85 SNPs with MAF ≥ 5% (Figure S5).
Figure 2.
Genome-wide Association of 25OHD Graphed by Chromosome Positions and −log10 P Value (Manhattan Plot), and Quantile-Quantile Plot of the GWAS Meta-analysis (QQ-plot) on 443,374 European Individuals
(A) Manhattan plot: The p values were obtained from the fixed-effects inverse variance weighted meta-analysis. Horizontal red dash line represents the thresholds of p = 6.6 × 10−9 for genome-wide significance. Known loci were colored coded as blue diamonds, novel rare loci were color coded as red diamonds, and novel common loci were color coded as white diamonds.
(B) QQ-plot: The y axis (observed −log10 p values) is truncated at 310; the x axis shows the expected −log10 p values. Each SNP is plotted as a blue dot, and the dash red line indicates null hypothesis of no true association. Deviation from the expected p value distribution is evident only in the tail area, with a lambda of 1.23. Note: Known is defined as having been identified in previous 25OHD GWAS; novel is defined as not having been identified in previous 25OHD GWASs.
The total variance explained by the 138 conditionally independent genome-wide significant vitamin D SNPs was 4.9%. When partitioning the variance explained by these lead SNPs into two MAF categories, we found that low-frequency and rare variants explained 1.8% of the variance in 25OHD levels, whereas common variants explained 3.1% of the variance, respectively. The SNP heritability from all SNPs, independent of GWAS p value, as estimated by BOLT-LMM on 805,426 hard called variants in UK Biobank was 16.1%, indicating that genome-wide significant independent variants capture less than a third of the variance explained in 25OHD levels by all directly genotyped markers.
Look-up of the 25OHD GWAS Variants in the 1,25-dihydroxyvitamin D GWAS
We tested 122 out of the 138 conditionally independent variants from the 25OHD GWAS for genetic association with 1,25-dihydroxyvitamin D. The 16 variants that were not tested were not available in the Ely dataset, either because they were not reliably captured through imputation or had low MAF (<0.001), and no suitable proxy variant could be identified. Among the 122 conditionally independent variants tested in Ely for association with 1,25-dihydroxyvitamin D, only one rs6127099 in the CYP24A1 locus on chromosome 20 reached the multiple testing corrected threshold for significance (20:52731402:T_A; β = 0.231; p = 2.5 × 10−4) (Table S2). Finally, among the 122 SNPs, 74 SNPs had a consistent direction of effect on 25OHD and on 1,25-dihydroxyvitamin D levels.
Interaction Analysis with Season
To investigate the hypothesis that the effect of some of the 25OHD variants is modified by season of measurement, we tested the presence of interaction of the 138 conditionally independent variants with season in 193,809 white British participants, whose 25OHD levels were assessed in summer or in winter. We found significant interaction with season in 11 independent SNPs in the CYP2R1 locus on chromosome 11 and in a single variant in the SEC23A locus on chromosome 14 (all p values below the Bonferroni-corrected threshold of 3.6 × 10−4) (Table S2). The strongest interaction was found for rs117913124 (p value for interaction 1.5 × 10−55), a previously described low-frequency variant in CYP2R1 with large effect on 25OHD levels (absolute GWAS beta per allele of 0.35 units in standardized log-transformed 25OHD). For all 12 SNPs achieving significant interaction p values, the direction of the beta for the interaction term genotype∗season summer was in the same direction as the direction of the beta on 25OHD levels, meaning that the vitamin D lowering effect of these SNPs “blunts” the expected increase in 25OHD in summer.
In Silico Functional Follow-up
Gene Prioritization and Enrichment Analyses
Gene prioritization analysis suggested 70 genes with false discovery rate (FDR) < 5% which might plausibly underlie the distribution of association statistics seen in the single variant results. At many loci, genes within the vitamin D metabolism pathway were suggested as plausible candidates. For example, DEPICT prioritized DHCR7 at the lead associated chr11:70,313,961–71,239,227 locus and GC at chr4:72,607,410–72,669,758 locus. Interestingly, ADH6 (MIM: 103735) was a plausible candidate at locus chr4:99,916,771–100,274,184, suggesting this locus may have pleiotropic effects on vitamin D and alcohol metabolism (Table S3).
Gene set enrichment analysis identified enrichment in 418 pre-defined gene sets with FDR < 5%. The strongest statistical evidence for enrichment was in the following gene sets: the alpha-2-HS Glycoprotein (AHSG), a negatively charged serum glycoprotein that is synthesized by hepatocytes involved in several processes, including endocytosis, brain development, and the formation of bone tissue (p = 4.18 × 10−7); the reactome gene set for “metabolism of lipids and lipoprotein” (p = 7.91 × 10−7); several genes involved in immune pathways and therefore expressed in the blood such as “Elastase, Neutrophil Expressed (ELANE)” (p = 8.43 × 10−7); the “Serum albumin (ALB)” (p = 1.19 × 10−6), “Acidic form of complement factor 4 (C4A)” (p = 1.51 × 10−6), and “ENSG00000211949” gene sets, belonging to the immunoglobulin (Ig) heavy chain locus (p = 1.51 × 10−6); biosynethic pathways such as “GO:0044283, small molecule biosynthetic process” (p = 1.89 × 10−6), “GO:0016053, organic acid biosynthetic process” (p = 2.29 × 10−6); “GO:0046394” and “carboxylic acid biosynthetic process” (p = 2.29 × 10−6); and finally liver-associated pathways including “MP:0000599, enlarged liver” (p = 1.33 × 10−6), “GO:0001889, liver development” (p = 3.35 × 10−6), and “GO:0061008, hepaticobiliary system development” (p = 4.15 × 10−6) (Table S4). Finally, expression of 25OHD genes was enriched in 17 cell types with an FDR < 5%, including cell lines representing the liver (hepatocytes, p = 1.63 × 10−6) and skin (keratinocytes, p = 7.73 × 10−3). The tissue-specific analysis found greatest evidence for enrichment in the liver (p = 1.34 × 10−6) and the gastrointestinal tract (p = 2.22 × 10−3) (Table S5), which is in accordance with the fact that 25OHD is hydroxylated in the liver70 but also conjugates with glucuronide71 and sulfate72 to get excreted in the bile and then gets reabsorbed by the enterohepatic circulation. Collectively, these findings suggest that detectable serum 25OHD levels are influenced by a range of metabolic processes within known physiological pathways, but also extending beyond the canonical vitamin D metabolic pathway.
Predicted Gene Transcription Levels
After applying a Bonferroni-corrected multiple testing threshold (p < 1.94 × 10−6), varying expression levels at 377 gene transcripts were predicted to influence 25OHD, out of a total of 25,816 that were tested. Results for all gene transcripts are shown in Figure 3. This indicates that although there are 69 loci associated with vitamin D phenotype, there are potentially 377 gene transcripts across multiple tissues whose expression may influence vitamin D. The lead associated genetic transcripts using S-MulTiXcan61 were consistent with the lead association signals in the single variant results, for example identifying association at NADSYN1 (MIM: 608285) (Z-test p < 1.81 × 10−309), DHCR7 (Z-test p < 1.15 × 10−245), GC (Z-test p < 1.81 × 10−309), CYP2R1 (Z-test p = 2.85 × 10−277), UGT1A4 (MIM: 606429) (Z-test p = 3.25 × 10−34), PADI1 (MIM: 607934) (Z-test p = 3.64 × 10−23). The S-MulTiXcan61 method integrates information from multiple tissue-specific predictions improving the statistical power over the single variant method and highlights additional transcripts associated with 25OHD, with the strongest evidence in various forms of Keratin Associated Protein 5 (KRTAP5 [MIM: 608822]) (Z-test p < 1.81 × 10−309), a protein coding gene involved in keratinization and has been identified as a potential read through for NADSYN1. This adds further evidence that 25OHD is affected through processes beyond the established vitamin D metabolic pathway. Results are shown in Table S6.
Figure 3.
Effect of Predicted Increased Transcription of All Genes on Circulating Vitamin D
Each dot represents the effect of increased transcription (averaged across all tissue-specific predictions using S-MultiXcan) on 25OHD.
Genetic Correlation
Genetic correlation results for 25OHD were available for 774 traits from the LD hub catalog,65 including 517 raw traits from UK Biobank and 257 from other GWASs and consortia (Figure 4). A total of 101 traits passed a multiple testing corrected Bonferroni p value threshold of p < 6.46 × 10−5. The strongest evidence of negative genetic correlation with 25OHD were “Time spent using a computer” (rg = −0.22), “Qualifications: College or University degree” (rg = −0.17), and “Intelligence” (rg = −0.24). Traits pertaining to exercise (“Duration of vigorous activity” [rg = 0.22] and “Number of days/week walked 10+ minutes” [rg = 0.18]) had positive genetic correlations with vitamin D. Traits related to body mass index (BMI) including lipids and diabetes had a negative correlation: “BMI” (rg = −0.14), “Triglycerides” (rg = −0.25), and “Type 2 Diabetes” (rg = −0.19). A full list of results can be found in Table S7.
Figure 4.
Genetic Correlation between 25OHD Levels and GWAS Traits Available within LD Hub
Each dot represented the Rg between 25OHD and an individual trait. The red dashed line represents the Bonferroni-corrected multiple testing threshold at the 5% level.
Tests for Enrichment in Functional Annotations
Using information from all the SNPs in the 25OHD GWAS summary statistics and modeling LD with the 53 functional categories not specific to any cell type in the baseline model, there was evidence for enrichment in 3 out of the 95 functional annotations tested. These were annotations providing evidence for evolutionary conservation with 2% of variants annotated as highly conserved accounting for 20% of the heritability of vitamin D (9-fold enrichment over baseline, p = 1.48 × 10−5) (Table S8). There was little evidence from stratified LDSR66 that vitamin D heritability is enriched in gene sets expressed specifically in given cells or tissue types. However, it is worth noting that the highest LDSR coefficients were seen for genomic regions specifically expressed in hepatocytes (coefficient = 1.17 × 10−8), liver (coefficient = 1.73 × 10−8), and whole blood (coefficient = 1.16 × 10−8), corroborating the cell- and tissue-predicted gene enrichment (Table S9).
Discussion
This large-scale GWAS meta-analysis identified 63 genetic loci which were associated with 25OHD levels in people of European ancestry and at least doubled the estimate of SNP heritability of 25OHD levels. Our study also replicated the 6 known vitamin D loci (in or near CYP2R1, DHCR7, GC, CYP24A1, AMDHD1, SEC23A). In silico follow-up identified enrichment in gene sets and pathways mostly independent from canonical vitamin D synthesis and metabolism pathways. Taken together, these results identify new biological pathways that influence 25OHD levels and demonstrate that this metabolite is moderately polygenic.
The large number of low-frequency and rare variants of large effect among the 138 conditionally independent variants of our GWAS is remarkable and suggests that 25OHD levels have a somewhat distinct genetic architecture when compared to other common traits. Specifically, the average absolute effect on 25OHD of the 53 low-frequency and rare variants was at least 7 times larger than the average effect of the 85 common SNPs, but their contribution to the explained variance of 25OHD was smaller than that of the common SNPs (1.8% versus 3.1%). This is not surprising, given the limited frequency of these variants in the general European population. GWASs with larger sample sizes are needed to further dissect the contribution of rare variants with large effects versus common variants with small effects to the variance of 25OHD levels.
The hypothesis-free approach of GWASs has served to highlight the role of lipid biology in 25OHD levels—a fat-soluble hormone. Specifically, among the 69 identified 25OHD loci, 22 loci are related to serum lipid phenotypes. Examples of these loci are the lipase C (LIPC [MIM: 151670]) on chromosome 15, the low-density lipoprotein receptor (LDLR [MIM: 606945]) and the apolipoprotein C1 (APOC1 [MIM: 107710]) on chromosome 19, and the cholesteryl ester transfer protein (CETP [MIM: 118470]) on chromosome 16. Additionally, our gene enrichment analysis prioritized the metabolism of lipids and lipoprotein gene set, and lipid traits were strongly genetically correlated with 25OHD using LDSR. These findings suggest that 25OHD levels share several of the same biological pathways influencing circulating lipids.
We also found enrichment in loci harboring genes associated with skin keratinization. Among these, an interesting finding was FLG (MIM: 135940) on the chromosome 1, which encodes fillagrin, a protein that plays an important role in the skin barrier’s function, and deregulation of this function might affect vitamin D in the skin, which is also synthetized in the skin. Another locus related to skin keratinization was the KRTAP5, which was prioritized by our in silico analyses. However, functional follow-up of these loci is required, to characterize the causal genes and/or mechanisms underlying the associations with 25OHD levels. Also, we observed enrichment in loci associated with traits outside the vitamin D pathway, which are not directly linked to 25OHD synthesis and metabolism. We can speculate on the exact mechanism of action of these genes on 25OHD—for instance through their effect on time spend outdoors and consequently exposure to sunlight—but follow-up experiments are necessary to validate these hypotheses.
The results of the interaction analysis with season merit some discussion too. We found evidence for significant interaction with multiple independent common, low-frequency, and rare SNPs in the CYP2R1 locus. CYP2R1 encodes the enzyme responsible for 25-hydroxylation of vitamin D in the liver,70 a necessary step in the conversion of vitamin D synthetized in the skin after exposure to UVB to 25OHD. Therefore, it is not surprising that individuals heterozygous or homozygous for variants in or near CYP2R1 show a smaller change in their 25OHD levels as a response to season compared to non-carriers. In other words, we observed that carriers of the effect alleles in this locus have steadily lower 25OHD levels, independently of the season of their measurement. We also observed significant interaction with a common SNP in SEC23A, which is involved in endoplasmic reticulum (ER)-Golgi protein trafficking. Although the exact mechanism with which SEC23A interacts with season to regulate 25OHD levels remains unknown, it might act as a regulator of the enzymatic activity of CYP2R1, which is located in the endoplasmic reticulum. Functional follow-up experiments are warranted to investigate this hypothesis.
The findings of the look-up of the significant 25OHD SNPs in the 1,25-dihyxroxyvitamin D GWAS provide evidence that the two biomarkers of vitamin D in humans have, to a certain extent, a shared genetic component. This may be expected as both biomarkers share at least the same vitamin D catabolic pathway. However, the small sample size of the 1,25-dihydroxyvitamin D GWAS, the only available GWAS on this trait to date, limits the power for characterization of 1,25-dyhydroxyvitamin D loci. We can therefore speculate that there might be a larger overlap of the genetic architecture of the two biomarkers. 1,25-dihydroxyvitamin D is the active metabolite of vitamin D, and although its levels directly regulate the effects of vitamin D on a cellular level, it remains understudied because of its short half-life, low concentration in blood,73 and the body’s ability to buffer 1,25-dihydroxyvitamin D in deficient individuals by increasing parathyroid hormone. In that aspect, any additional evidence, from larger 1,25-dihydroxyvitamin D GWASs, linking 25OHD levels to those of 1,25-dihydroxyvitamin D in the genetic level will be important, as it will add to our understanding of the vitamin D physiology.
Collectively, the results of our analyses suggest that serum levels of 25OHD are in crosstalk with a range of metabolic processes extending within the canonical vitamin D metabolic pathway (skin synthesis, hepatic hydroxylation, sulfonylation, glucuronylation) and beyond (time of computer use, intelligence, educational achievement). Although not specifically tested in the present study, one implication of these findings is that the potential genetic instruments for vitamin D are instrumenting more than the vitamin D pathway, and specifically they also capture variance in traits that relate to environmental confounders that could influence 25OHD levels. Taken together, our findings present a cautionary tale for future MR studies using 25OHD as an exposure, based on this GWAS, since there is a risk of pleiotropic effects for a substantial number of 25OHD-related SNPs mapping to genes not directly involved in 25OHD biology.
In summary, we described 63 loci which are associated with 25OHD levels in Europeans. Further research is warranted to better characterize the identified genetic variants, validate these findings, and identify ancestry-specific variants in other ethnic groups and to better understand the biological pathways influencing 25OHD levels. The genetic instruments for 25OHD identified here should be used with caution in future MR analyses assessing the association between vitamin D and other complex traits and diseases.
Declaration of Interests
The authors declare no competing interests.
Acknowledgments
This research has been conducted using the UK Biobank Resource under application number 16009. J.B.R. is supported by the Canadian Institutes of Health Research (CIHR) and the Canadian Foundation for Innovation and the Fonds de Recherche du Québec – Santé (FRQS). J.B.R. is funded by a FRQS Clinical Research Scholarship and received research support from the National MS Society and the MS Society of Canada. D.M. is funded by the JDRF. R.M. works in program one within the Medical Research Council Integrative Epidemiology Unit at the University of Bristol (grant number MC_UU_00011/1). S.H. is funded by a National Institute for Health Research (NIHR) Academic Clinical Fellowship. The Ely study was supported by the Medical Research Council (MC_UU_12015/1) and NHS Research and Development. The funding agencies had no role in the study design, analysis, or interpretation of data; the writing of the manuscript; or in the decision to submit the article for publication. All data sources used in this study (UK Biobank, Ely Study) received approval from respective national ethical committees for medical research and obtained informed consent from all participants. Additional ethical approval was not required for this study.
Published: February 13, 2020
Footnotes
Supplemental Data can be found online at https://doi.org/10.1016/j.ajhg.2020.01.017.
Web Resources
Genomic-Virtual Lab, https://genoma.io
GRASP, http://grasp.nhlbi.nih.gov/
OMIM, https://www.omim.org/
Supplemental Information
References
- 1.Bouillon R., Marcocci C., Carmeliet G., Bikle D., White J.H., Dawson-Hughes B., Lips P., Munns C.F., Lazaretti-Castro M., Giustina A., Bilezikian J. Skeletal and extra-skeletal actions of vitamin D: Current evidence and outstanding questions. Endocr. Rev. 2019;40:1109–1151. doi: 10.1210/er.2018-00126. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Theodoratou E., Tzoulaki I., Zgaga L., Ioannidis J.P. Vitamin D and multiple health outcomes: umbrella review of systematic reviews and meta-analyses of observational studies and randomised trials. BMJ. 2014;348:g2035. doi: 10.1136/bmj.g2035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Autier P., Mullie P., Macacu A., Dragomir M., Boniol M., Coppens K., Pizot C., Boniol M. Effect of vitamin D supplementation on non-skeletal disorders: a systematic review of meta-analyses and randomised trials. Lancet Diabetes Endocrinol. 2017;5:986–1004. doi: 10.1016/S2213-8587(17)30357-1. [DOI] [PubMed] [Google Scholar]
- 4.Haroon M., Fitzgerald O. Vitamin D and its emerging role in immunopathology. Clin. Rheumatol. 2012;31:199–202. doi: 10.1007/s10067-011-1880-5. [DOI] [PubMed] [Google Scholar]
- 5.Lagunova Z., Porojnicu A.C., Lindberg F., Hexeberg S., Moan J. The dependency of vitamin D status on body mass index, gender, age and season. Anticancer Res. 2009;29:3713–3720. [PubMed] [Google Scholar]
- 6.Shea M.K., Benjamin E.J., Dupuis J., Massaro J.M., Jacques P.F., D’Agostino R.B., Sr., Ordovas J.M., O’Donnell C.J., Dawson-Hughes B., Vasan R.S., Booth S.L. Genetic and non-genetic correlates of vitamins K and D. Eur. J. Clin. Nutr. 2009;63:458–464. doi: 10.1038/sj.ejcn.1602959. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Karohl C., Su S., Kumari M., Tangpricha V., Veledar E., Vaccarino V., Raggi P. Heritability and seasonal variability of vitamin D concentrations in male twins. Am. J. Clin. Nutr. 2010;92:1393–1398. doi: 10.3945/ajcn.2010.30176. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Hunter D., De Lange M., Snieder H., MacGregor A.J., Swaminathan R., Thakker R.V., Spector T.D. Genetic contribution to bone metabolism, calcium excretion, and vitamin D and parathyroid hormone regulation. J. Bone Miner. Res. 2001;16:371–378. doi: 10.1359/jbmr.2001.16.2.371. [DOI] [PubMed] [Google Scholar]
- 9.Jiang X., O’Reilly P.F., Aschard H., Hsu Y.H., Richards J.B., Dupuis J., Ingelsson E., Karasik D., Pilz S., Berry D. Genome-wide association study in 79,366 European-ancestry individuals informs the genetic architecture of 25-hydroxyvitamin D levels. Nat. Commun. 2018;9:260. doi: 10.1038/s41467-017-02662-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Wang T.J., Zhang F., Richards J.B., Kestenbaum B., van Meurs J.B., Berry D., Kiel D.P., Streeten E.A., Ohlsson C., Koller D.L. Common genetic determinants of vitamin D insufficiency: a genome-wide association study. Lancet. 2010;376:180–188. doi: 10.1016/S0140-6736(10)60588-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Ahn J., Yu K., Stolzenberg-Solomon R., Simon K.C., McCullough M.L., Gallicchio L., Jacobs E.J., Ascherio A., Helzlsouer K., Jacobs K.B. Genome-wide association study of circulating vitamin D levels. Hum. Mol. Genet. 2010;19:2739–2745. doi: 10.1093/hmg/ddq155. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Manousaki D., Dudding T., Haworth S., Hsu Y.H., Liu C.T., Medina-Gómez C., Voortman T., van der Velde N., Melhus H., Robinson-Cohen C. Low-Frequency Synonymous Coding Variation in CYP2R1 Has Large Effects on Vitamin D Levels and Risk of Multiple Sclerosis. Am. J. Hum. Genet. 2017;101:227–238. doi: 10.1016/j.ajhg.2017.06.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Mokry L.E., Ross S., Ahmad O.S., Forgetta V., Smith G.D., Goltzman D., Leong A., Greenwood C.M., Thanassoulis G., Richards J.B. Vitamin D and Risk of Multiple Sclerosis: A Mendelian Randomization Study. PLoS Med. 2015;12:e1001866. doi: 10.1371/journal.pmed.1001866. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Rhead B., Bäärnhielm M., Gianfrancesco M., Mok A., Shao X., Quach H., Shen L., Schaefer C., Link J., Gyllenberg A. Mendelian randomization shows a causal effect of low vitamin D on multiple sclerosis risk. Neurol. Genet. 2016;2:e97. doi: 10.1212/NXG.0000000000000097. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Giulietti A., Gysemans C., Stoffels K., van Etten E., Decallonne B., Overbergh L., Bouillon R., Mathieu C. Vitamin D deficiency in early life accelerates Type 1 diabetes in non-obese diabetic mice. Diabetologia. 2004;47:451–462. doi: 10.1007/s00125-004-1329-3. [DOI] [PubMed] [Google Scholar]
- 16.Riachy R., Vandewalle B., Moerman E., Belaich S., Lukowiak B., Gmyr V., Muharram G., Kerr Conte J., Pattou F. 1,25-Dihydroxyvitamin D3 protects human pancreatic islets against cytokine-induced apoptosis via down-regulation of the Fas receptor. Apoptosis. 2006;11:151–159. doi: 10.1007/s10495-006-3558-z. [DOI] [PubMed] [Google Scholar]
- 17.Vitamin D supplement in early childhood and risk for Type I (insulin-dependent) diabetes mellitus. The EURODIAB Substudy 2 Study Group. Diabetologia. 1999;42:51–54. doi: 10.1007/s001250051112. [DOI] [PubMed] [Google Scholar]
- 18.Hyppönen E., Läärä E., Reunanen A., Järvelin M.R., Virtanen S.M. Intake of vitamin D and risk of type 1 diabetes: a birth-cohort study. Lancet. 2001;358:1500–1503. doi: 10.1016/S0140-6736(01)06580-1. [DOI] [PubMed] [Google Scholar]
- 19.Mayer-Davis E.J., Dabelea D., Crandell J.L., Crume T., D’Agostino R.B., Jr., Dolan L., King I.B., Lawrence J.M., Norris J.M., Pihoker C., The N. Nutritional factors and preservation of C-peptide in youth with recently diagnosed type 1 diabetes: SEARCH Nutrition Ancillary Study. Diabetes Care. 2013;36:1842–1850. doi: 10.2337/dc12-2084. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Littorin B., Blom P., Schölin A., Arnqvist H.J., Blohmé G., Bolinder J., Ekbom-Schnell A., Eriksson J.W., Gudbjörnsdottir S., Nyström L. Lower levels of plasma 25-hydroxyvitamin D among young adults at diagnosis of autoimmune type 1 diabetes compared with control subjects: results from the nationwide Diabetes Incidence Study in Sweden (DISS) Diabetologia. 2006;49:2847–2852. doi: 10.1007/s00125-006-0426-x. [DOI] [PubMed] [Google Scholar]
- 21.Baumgartl H.J., Standl E., Schmidt-Gayk H., Kolb H.J., Janka H.U., Ziegler A.G. Changes of vitamin D3 serum concentrations at the onset of immune-mediated type 1 (insulin-dependent) diabetes mellitus. Diabetes Res. 1991;16:145–148. [PubMed] [Google Scholar]
- 22.Bierschenk L., Alexander J., Wasserfall C., Haller M., Schatz D., Atkinson M. Vitamin D levels in subjects with and without type 1 diabetes residing in a solar rich environment. Diabetes Care. 2009;32:1977–1979. doi: 10.2337/dc09-1089. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Pozzilli P., Manfrini S., Crinò A., Picardi A., Leomanni C., Cherubini V., Valente L., Khazrai M., Visalli N., IMDIAB group Low levels of 25-hydroxyvitamin D3 and 1,25-dihydroxyvitamin D3 in patients with newly diagnosed type 1 diabetes. Horm. Metab. Res. 2005;37:680–683. doi: 10.1055/s-2005-870578. [DOI] [PubMed] [Google Scholar]
- 24.Feldman D., Krishnan A.V., Swami S., Giovannucci E., Feldman B.J. The role of vitamin D in reducing cancer risk and progression. Nat. Rev. Cancer. 2014;14:342–357. doi: 10.1038/nrc3691. [DOI] [PubMed] [Google Scholar]
- 25.Smith G.D., Ebrahim S. ‘Mendelian randomization’: can genetic epidemiology contribute to understanding environmental determinants of disease? Int. J. Epidemiol. 2003;32:1–22. doi: 10.1093/ije/dyg070. [DOI] [PubMed] [Google Scholar]
- 26.Burgess S., Butterworth A., Thompson S.G. Mendelian randomization analysis with multiple genetic variants using summarized data. Genet. Epidemiol. 2013;37:658–665. doi: 10.1002/gepi.21758. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Gianfrancesco M.A., Stridh P., Rhead B., Shao X., Xu E., Graves J.S., Chitnis T., Waldman A., Lotze T., Schreiner T., Network of Pediatric Multiple Sclerosis Centers Evidence for a causal relationship between low vitamin D, high BMI, and pediatric-onset MS. Neurology. 2017;88:1623–1629. doi: 10.1212/WNL.0000000000003849. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Multiple Sclerosis Society of Canada Vitamin D and Multiple Sclerosis Recommendations 2019. 2019. https://mssociety.ca/library/document/Vka6RXcnOizNm9sIwuWvroxejlhLqTJ8/original.pdf
- 29.Larsson S.C., Traylor M., Markus H.S., Michaëlsson K. Serum Parathyroid Hormone, 25-Hydroxyvitamin D, and Risk of Alzheimer’s Disease: A Mendelian Randomization Study. Nutrients. 2018;10:10. doi: 10.3390/nu10091243. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.He Y., Timofeeva M., Farrington S.M., Vaughan-Shaw P., Svinti V., Walker M., Zgaga L., Meng X., Li X., Spiliopoulou A., SUNLIGHT consortium Exploring causality in the association between circulating 25-hydroxyvitamin D and colorectal cancer risk: a large Mendelian randomisation study. BMC Med. 2018;16:142. doi: 10.1186/s12916-018-1119-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Aspelund T., Grübler M.R., Smith A.V., Gudmundsson E.F., Keppel M., Cotch M.F., Harris T.B., Jorde R., Grimnes G., Joakimsen R. Effect of Genetically Low 25-Hydroxyvitamin D on Mortality Risk: Mendelian Randomization Analysis in 3 Large European Cohorts. Nutrients. 2019;11:11. doi: 10.3390/nu11010074. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Michaëlsson K., Melhus H., Larsson S.C. Serum 25-Hydroxyvitamin D Concentrations and Major Depression: A Mendelian Randomization Study. Nutrients. 2018;10:10. doi: 10.3390/nu10121987. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Bowman K., Jones L., Pilling L.C., Delgado J., Kuchel G.A., Ferrucci L., Fortinsky R.H., Melzer D. Vitamin D levels and risk of delirium: A mendelian randomization study in the UK Biobank. Neurology. 2019;92:e1387–e1394. doi: 10.1212/WNL.0000000000007136. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Lund-Nielsen J., Vedel-Krogh S., Kobylecki C.J., Brynskov J., Afzal S., Nordestgaard B.G. Vitamin D and Inflammatory Bowel Disease: Mendelian Randomization Analyses in the Copenhagen Studies and UK Biobank. J. Clin. Endocrinol. Metab. 2018;103:3267–3277. doi: 10.1210/jc.2018-00250. [DOI] [PubMed] [Google Scholar]
- 35.Sun J.Y., Zhao M., Hou Y., Zhang C., Oh J., Sun Z., Sun B.L. Circulating serum vitamin D levels and total body bone mineral density: A Mendelian randomization study. J. Cell. Mol. Med. 2019;23:2268–2271. doi: 10.1111/jcmm.14153. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Jiang X., Dimou N.L., Al-Dabhani K., Lewis S.J., Martin R.M., Haycock P.C., Gunter M.J., Key T.J., Eeles R.A., Muir K., PRACTICAL, CRUK, BPC3, CAPS and PEGASUS consortia Circulating vitamin D concentrations and risk of breast and prostate cancer: a Mendelian randomization study. Int. J. Epidemiol. 2019;48:1416–1424. doi: 10.1093/ije/dyy284. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Larsson S.C., Traylor M., Mishra A., Howson J.M.M., Michaëlsson K., Markus H.S., MEGASTROKE Project of the International Stroke Genetics Consortium Serum 25-Hydroxyvitamin D Concentrations and Ischemic Stroke and Its Subtypes. Stroke. 2018;49:2508–2511. doi: 10.1161/STROKEAHA.118.022242. [DOI] [PubMed] [Google Scholar]
- 38.Yarmolinsky J.R.C., Lophatananon A. 472696.). Evaluating causal associations between previously reported risk factors and epithelial ovarian cancer: a Mendelian randomization analysis. bioRxiv. 2018 [Google Scholar]
- 39.Mai X.M., Videm V., Sheehan N.A., Chen Y., Langhammer A., Sun Y.Q. Potential causal associations of serum 25-hydroxyvitamin D with lipids: a Mendelian randomization approach of the HUNT study. Eur. J. Epidemiol. 2019;34:57–66. doi: 10.1007/s10654-018-0465-x. [DOI] [PubMed] [Google Scholar]
- 40.Dong J., Gharahkhani P., Chow W.H., Gammon M.D., Liu G., Caldas C., Wu A.H., Ye W., Onstad L., Anderson L.A., Stomach and Esophageal Cancer Study Consortium No Association Between Vitamin D Status and Risk of Barrett’s Esophagus or Esophageal Adenocarcinoma: A Mendelian Randomization Study. Clin. Gastroenterol. Hepatol. 2019;17:2227–2235.e1. doi: 10.1016/j.cgh.2019.01.041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Tan V.Y., Biernacka K.M., Dudding T., Bonilla C., Gilbert R., Kaplan R.C., Qibin Q., Teumer A., Martin R.M., Perks C.M., PRACTICAL consortium Reassessing the Association between Circulating Vitamin D and IGFBP-3: Observational and Mendelian Randomization Estimates from Independent Sources. Cancer Epidemiol. Biomarkers Prev. 2018;27:1462–1471. doi: 10.1158/1055-9965.EPI-18-0113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Havdahl A., Mitchell R., Paternoster L., Davey Smith G. Investigating causality in the association between vitamin D status and self-reported tiredness. Sci. Rep. 2019;9:2880. doi: 10.1038/s41598-019-39359-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Milaneschi Y., Peyrot W.J., Nivard M.G., Mbarek H., Boomsma D.I., W J H Penninx B. A role for vitamin D and omega-3 fatty acids in major depression? An exploration using genomics. Transl. Psychiatry. 2019;9:219. doi: 10.1038/s41398-019-0554-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Libuda L., Laabs B.H., Ludwig C., Bühlmeier J., Antel J., Hinney A., Naaresh R., Föcker M., Hebebrand J., König I.R., Peters T. Vitamin D and the Risk of Depression: A Causal Relationship? Findings from a Mendelian Randomization Study. Nutrients. 2019;11:11. doi: 10.3390/nu11051085. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Liyanage U.E., Law M.H., Barrett J.H., Iles M.M., MacGregor S., Melanoma Meta-analysis Consortium Is there a causal relationship between vitamin D and melanoma risk? A Mendelian randomization study. Br. J. Dermatol. 2020;182:97–103. doi: 10.1111/bjd.18238. [DOI] [PubMed] [Google Scholar]
- 46.Meng X., Li X., Timofeeva M.N., He Y., Spiliopoulou A., Wei W.Q., Gifford A., Wu H., Varley T., Joshi P. Phenome-wide Mendelian-randomization study of genetically determined vitamin D on multiple health outcomes using the UK Biobank study. Int. J. Epidemiol. 2019;48:1425–1434. doi: 10.1093/ije/dyz182. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Bycroft C., Freeman C., Petkova D., Band G., Elliott L.T., Sharp K., Motyer A., Vukcevic D., Delaneau O., O’Connell J. The UK Biobank resource with deep phenotyping and genomic data. Nature. 2018;562:203–209. doi: 10.1038/s41586-018-0579-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.McKenna M.J., Murray B.F. Vitamin D dose response is underestimated by Endocrine Society’s Clinical Practice Guideline. Endocr. Connect. 2013;2:87–95. doi: 10.1530/EC-13-0008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Loh P.R., Danecek P., Palamara P.F., Fuchsberger C., A Reshef Y., K Finucane H., Schoenherr S., Forer L., McCarthy S., Abecasis G.R. Reference-based phasing using the Haplotype Reference Consortium panel. Nat. Genet. 2016;48:1443–1448. doi: 10.1038/ng.3679. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Persyn E., Redon R., Bellanger L., Dina C. The impact of a fine-scale population stratification on rare variant association test results. PLoS ONE. 2018;13:e0207677. doi: 10.1371/journal.pone.0207677. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Galinsky K.J., Bhatia G., Loh P.R., Georgiev S., Mukherjee S., Patterson N.J., Price A.L. Fast Principal-Component Analysis Reveals Convergent Evolution of ADH1B in Europe and East Asia. Am. J. Hum. Genet. 2016;98:456–472. doi: 10.1016/j.ajhg.2015.12.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Morris J.A., Kemp J.P., Youlten S.E., Laurent L., Logan J.G., Chai R.C., Vulpescu N.A., Forgetta V., Kleinman A., Mohanty S.T., 23andMe Research Team An atlas of genetic influences on osteoporosis in humans and mice. Nat. Genet. 2019;51:258–266. doi: 10.1038/s41588-018-0302-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Loh P.R., Bhatia G., Gusev A., Finucane H.K., Bulik-Sullivan B.K., Pollack S.J., de Candia T.R., Lee S.H., Wray N.R., Kendler K.S., Schizophrenia Working Group of Psychiatric Genomics Consortium Contrasting genetic architectures of schizophrenia and other complex diseases using fast variance-components analysis. Nat. Genet. 2015;47:1385–1392. doi: 10.1038/ng.3431. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Mägi R., Morris A.P. GWAMA: software for genome-wide association meta-analysis. BMC Bioinformatics. 2010;11:288. doi: 10.1186/1471-2105-11-288. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Willer C.J., Li Y., Abecasis G.R. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics. 2010;26:2190–2191. doi: 10.1093/bioinformatics/btq340. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Yang J., Lee S.H., Goddard M.E., Visscher P.M. GCTA: a tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 2011;88:76–82. doi: 10.1016/j.ajhg.2010.11.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.McLaren W., Gil L., Hunt S.E., Riat H.S., Ritchie G.R., Thormann A., Flicek P., Cunningham F. The Ensembl Variant Effect Predictor. Genome Biol. 2016;17:122. doi: 10.1186/s13059-016-0974-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Kamat M.A., Blackshaw J.A., Young R., Surendran P., Burgess S., Danesh J., Butterworth A.S., Staley J.R. PhenoScanner V2: an expanded tool for searching human genotype-phenotype associations. Bioinformatics. 2019;35:4851–4853. doi: 10.1093/bioinformatics/btz469. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Park J.H., Wacholder S., Gail M.H., Peters U., Jacobs K.B., Chanock S.J., Chatterjee N. Estimation of effect size distribution from genome-wide association studies and implications for future discoveries. Nat. Genet. 2010;42:570–575. doi: 10.1038/ng.610. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Cuellar-Partida G., Lundberg M., Kho P.F., D’Urso S., Gutierrez-Mondragon L.F., Hwang L.-D. Complex-Traits Genetics Virtual Lab: A community-driven web platform for post-GWAS analyses. bioRxiv. 2019 [Google Scholar]
- 61.Barbeira A.N., Pividori M., Zheng J., Wheeler H.E., Nicolae D.L., Im H.K. Integrating predicted transcriptome from multiple tissues improves association detection. PLoS Genet. 2019;15:e1007889. doi: 10.1371/journal.pgen.1007889. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Pers T.H., Karjalainen J.M., Chan Y., Westra H.J., Wood A.R., Yang J., Lui J.C., Vedantam S., Gustafsson S., Esko T., Genetic Investigation of ANthropometric Traits (GIANT) Consortium Biological interpretation of genome-wide association studies using predicted gene functions. Nat. Commun. 2015;6:5890. doi: 10.1038/ncomms6890. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Ellinghaus D., Jostins L., Spain S.L., Cortes A., Bethune J., Han B., Park Y.R., Raychaudhuri S., Pouget J.G., Hübenthal M., International IBD Genetics Consortium (IIBDGC) International Genetics of Ankylosing Spondylitis Consortium (IGAS) International PSC Study Group (IPSCSG) Genetic Analysis of Psoriasis Consortium (GAPC) Psoriasis Association Genetics Extension (PAGE) Analysis of five chronic inflammatory diseases identifies 27 new associations and highlights disease-specific patterns at shared loci. Nat. Genet. 2016;48:510–518. doi: 10.1038/ng.3528. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Bulik-Sullivan B., Finucane H.K., Anttila V., Gusev A., Day F.R., Loh P.R., Duncan L., Perry J.R., Patterson N., Robinson E.B., ReproGen Consortium. Psychiatric Genomics Consortium. Genetic Consortium for Anorexia Nervosa of the Wellcome Trust Case Control Consortium 3 An atlas of genetic correlations across human diseases and traits. Nat. Genet. 2015;47:1236–1241. doi: 10.1038/ng.3406. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Finucane H.K., Reshef Y.A., Anttila V., Slowikowski K., Gusev A., Byrnes A., Gazal S., Loh P.R., Lareau C., Shoresh N., Brainstorm Consortium Heritability enrichment of specifically expressed genes identifies disease-relevant tissues and cell types. Nat. Genet. 2018;50:621–629. doi: 10.1038/s41588-018-0081-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Finucane H.K., Bulik-Sullivan B., Gusev A., Trynka G., Reshef Y., Loh P.R., Anttila V., Xu H., Zang C., Farh K., ReproGen Consortium. Schizophrenia Working Group of the Psychiatric Genomics Consortium. RACI Consortium Partitioning heritability by functional annotation using genome-wide association summary statistics. Nat. Genet. 2015;47:1228–1235. doi: 10.1038/ng.3404. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Forouhi N.G., Luan J., Hennings S., Wareham N.J. Incidence of Type 2 diabetes in England and its association with baseline impaired fasting glucose: the Ely study 1990-2000. Diabet. Med. 2007;24:200–207. doi: 10.1111/j.1464-5491.2007.02068.x. [DOI] [PubMed] [Google Scholar]
- 68.Williams D.R., Wareham N.J., Brown D.C., Byrne C.D., Clark P.M., Cox B.D., Cox L.J., Day N.E., Hales C.N., Palmer C.R. Undiagnosed glucose intolerance in the community: the Isle of Ely Diabetes Project. Diabet. Med. 1995;12:30–35. doi: 10.1111/j.1464-5491.1995.tb02058.x. [DOI] [PubMed] [Google Scholar]
- 69.Marchini J., Howie B. Genotype imputation for genome-wide association studies. Nat. Rev. Genet. 2010;11:499–511. doi: 10.1038/nrg2796. [DOI] [PubMed] [Google Scholar]
- 70.Cheng J.B., Levine M.A., Bell N.H., Mangelsdorf D.J., Russell D.W. Genetic evidence that the human CYP2R1 enzyme is a key vitamin D 25-hydroxylase. Proc. Natl. Acad. Sci. USA. 2004;101:7711–7715. doi: 10.1073/pnas.0402490101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Wang X., Wang H., Shen B., Overholser B.R., Cooper B.R., Lu Y., Tang H., Zhou C., Sun X., Zhong L. 1-Alpha, 25-dihydroxyvitamin D3 alters the pharmacokinetics of mycophenolic acid in renal transplant recipients by regulating two extrahepatic UDP-glucuronosyltransferases 1A8 and 1A10. Transl. Res. 2016;178:54–62.e6. doi: 10.1016/j.trsl.2016.07.006. [DOI] [PubMed] [Google Scholar]
- 72.Wong T., Wang Z., Chapron B.D., Suzuki M., Claw K.G., Gao C., Foti R.S., Prasad B., Chapron A., Calamia J. Polymorphic Human Sulfotransferase 2A1 Mediates the Formation of 25-Hydroxyvitamin D3-3-O-Sulfate, a Major Circulating Vitamin D Metabolite in Humans. Drug Metab. Dispos. 2018;46:367–379. doi: 10.1124/dmd.117.078428. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Zittermann A., Schleithoff S.S., Frisch S., Götting C., Kuhn J., Koertke H., Kleesiek K., Tenderich G., Koerfer R. Circulating calcitriol concentrations and total mortality. Clin. Chem. 2009;55:1163–1170. doi: 10.1373/clinchem.2008.120006. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.




