Abstract
Aims/hypothesis
Coronary artery disease (CAD) is a common complication among individuals with diabetes. A better understanding of the genetic background of CAD in this population has the potential to suggest novel molecular targets for screening, risk assessment and drug development.
Methods
We performed a genome-wide association study of CAD in 15,666 unrelated individuals (3,968 CAD cases and 11,698 controls) of white British ancestry with diabetes at inclusion in the UK Biobank study. Our results were compared with results from participants without diabetes.
Results
We found genome-wide significant evidence for association with CAD at the previously well-established LPA locus (lead variant: rs74617384; OR 1.38 [95% CI 1.26, 1.51], p = 3.2 × 10−12) and at 9p21 (lead variant: rs10811652; OR 1.19 [95% CI 1.13, 1.26], p = 6.0 × 10−11). Moreover, other variants previously associated with CAD showed similar effects in the participants with and without diabetes, indicating that the genetic architecture of CAD is largely the same.
Conclusions/interpretation
Our results indicate large similarities between the genetic architecture of CAD in participants with and without diabetes. Larger studies are needed to establish whether there are important diabetes-specific CAD loci.
Electronic supplementary material
The online version of this article (10.1007/s00125-018-4686-z) contains peer-reviewed but unedited supplementary material, which is available to authorised users.
Keywords: Diabetes, Genetic epidemiology, Ischaemic heart disease, UK Biobank
Introduction
Diabetes mellitus confers about two-fold increased risk of coronary artery disease (CAD) [1]. Better understanding of the genetic background of CAD among individuals with diabetes can suggest novel molecular targets for screening, risk assessment, and drug development. The aim of our study was to identify genetic variation associated with CAD among individuals with diabetes and to assess previously established CAD loci.
Methods
Study sample
We used an algorithm [2] to define diabetes status among UK Biobank participants at baseline (2006-2010) based on self-reported disease, medication and age at diabetes onset. Participants were genotyped on the Affymetrix UK Biobank Lung Exome Evaluation (UK BiLEVE) Axiom array or the Affymetrix UK Biobank Axiom array. Quality control and imputation using the Haplotype Reference Consortium (HRC) was conducted centrally at the UK Biobank, yielding a total of about 39 million single-nucleotide polymorphisms (SNPs) [3]. Out of 24,680 individuals with available genetic data and possible/probable type 1 or type 2 diabetes, we proceeded with 20,644 individuals deemed unrelated with genetic data that passed quality control [3]. Of these, 4,978 individuals of non-white non-British ancestry were excluded from main analyses but were eligible for secondary analyses where we defined (1) ‘European, non-British white ancestry’, (2) ‘Asian or Asian British ancestry’ and (3) ‘Black or Black British ancestry’ subsets based on self-reported ancestry and being within ±2 SD of the two first principal components of that population (ESM Fig. 1). Hence, the main data set contained 15,666 individuals of white British ancestry. The outcome of interest was CAD based on UK Biobank’s baseline assessment verbal health interview, combined with linked data from hospital admissions and death registries (accessed 28 November 2017; last event recorded 15 January 2016; ESM Methods). We considered CAD events before and after diabetes onset. UK Biobank received ethics approval from the National Health Service Research Ethics Service (reference 11/NW/0382) and participants gave informed consent. The current analyses were conducted under approved UK Biobank data applications 11140 and 13721.
Genome-wide association study
Using the Genetic Association Study Power Calculator (http://csg.sph.umich.edu/abecasis/cats/gas_power_calculator/reference.html, accessed 14 February 2018), we found that for a significance level of 5 × 10−8 we had >80% power to detect variants with an allele frequency of 25% and a per-allele OR ≥1.17 and to detect variants with an allele frequency of 1% and an OR ≥1.80 (ESM Fig. 2). We tested 9,087,334 autosomal variants with a quality score (IMPUTE2 information metric) >0.8 and a minor allele count ≥30 in cases and controls in a series of logistic regression models adjusting for age, sex, the first 20 principal components and genotyping batch (3 levels; the UK BiLEVE, UK Biobank release 1 and 2). The association tests were performed in PLINK v2.00 [4] (www.cog-genomics.org/plink/2.0/)) using dosages (additive coding). We applied distance-based pruning followed by conditional analysis as described in the ESM Methods. Gene-based and gene-set pathway analyses with MAGMA v1.6 [5] implemented in FUMA [6] (http://fuma.ctglab.nl/, accessed 5 March 2018) were performed based on SNP association results, as described in the ESM Methods. Regional association plots were created with the Locuszoom tool [7].
Secondary analyses
In secondary analyses, we assessed (1) associations of identified SNPs with incident CAD (comparing 2,316 individuals with CAD diagnosis after diabetes diagnosis with controls); (2) associations of identified loci in other ancestry groups; (3) 63 variants previously reported as being associated with CAD in the Coronary ARtery DIsease Genome wide Replication and Meta-analysis plus The Coronary Artery Disease Genetics (CARDIoGRAMplusC4D) consortium [8] using a Bonferroni-corrected p value (0.05/61 available SNPs = 0.0008) in the 15,666 individuals with diabetes, as well as in 321,281 non-related individuals of British ancestry without diabetes from the UK Biobank; (4) two variants previously reported as being associated with CAD among individuals with diabetes: GLUL rs10911021 [9] and PPARG rs1801282 (Pro12Ala) [10]; (5) associations of identified SNPs among those with probable type 1 and type 2 diabetes only and among those with probable type 2 diabetes only; and (6) associations of identified SNPs excluding individuals with angina only.
Results
Genome-wide association study
In our analyses of 15,666 individuals with diabetes (3,968 CAD cases and 11,698 controls; ESM Table 1, ESM Table 2), we found genome-wide significant evidence for association with CAD at two loci (Fig. 1a–c). The strongest association was observed in the LPA locus at 6q25 (lead variant: rs74617384) with the T allele (frequency 8.2%) having an OR of 1.38 (95% CI 1.26, 1.51, p = 3.2 × 10−12). The second association was found in 9p21 (lead variant: rs10811652), with the C allele (frequency 49.4%) having an OR of 1.19 (95% CI 1.13, 1.26, p = 6.0 × 10−11). No additional independent variants were identified in these loci. No indication of genomic inflation was observed (ESM Fig. 3). Results from gene-based analyses suggested variation in PSRC1 and CYGB-PRCD to be associated with CAD (ESM Fig. 4). The CYGB-PRCD locus has not been reported associated with CAD previously, but we noted one variant, rs72860151, showing suggestive association (p = 3.5 × 10−5) in summary statistics from CARDIoGRAMplusC4D [8]. No gene sets were significantly enriched in the pathway analysis.
Secondary analyses
In our secondary analyses including only 2,316 CAD cases occurring after diabetes onset, the two top variants had similar effect estimates: rs74617384, OR 1.35 (95% CI 1.20, 1.50, p = 1.6 × 10−7) and rs10811652, OR 1.17 (95% CI 1.10, 1.25, p = 1.5 × 10−6). We found evidence of an association (OR 1.35; 95% CI 1.01, 1.79) of rs74617384 with CAD in the European, non-British population, but not in the other ethnic groups (Fig. 2a). Five of 61 previously reported CAD variants were associated with CAD in the population with diabetes at a p value of <0.0008 (ESM Table 3). In addition to variants in the LPA and 9p21 loci, variants in the SORT1 and KCNE2 loci were also associated. Out of the remaining 56 variants, 13 were associated at a p value of <0.05 and 36 additional variants showed the same effect direction and similar effect sizes as in the non-diabetic population (Fig. 2b; p value from binomial test <10−6). The previously reported association of two variants with CAD in individuals with diabetes could not be replicated for either variant in the present study. The effect estimates were OR 1.01 (0.95, 1.07) for each T allele of rs10911021; and 1.05 (0.97, 1.14) for each G allele of Pro12Ala (rs1801282), respectively. Association results were similar using different definitions of diabetes and different definitions of CAD (ESM Table 4).
Discussion
We report the largest genome-wide association study of CAD among individuals with diabetes to date. There are three main findings. First, we report two genome-wide significant loci (LPA and 9p21) previously linked to CAD in the general population with similar effect sizes. Second, among 61 previously identified CAD variants in the general population, two—SORT1 and KCNE2—were significantly associated with CAD among individuals with diabetes. Moreover, in the diabetic population, 49 of the remaining 56 non-significant variants showed effect directions and sizes consistent with those observed in the non-diabetic population. Third, we could not replicate findings from previous studies on diabetes-specific CAD loci, highlighting the limitations of small study samples, lack of replication and/or candidate gene-based approaches. Overall, our study provides evidence that the genetic mechanisms underlying CAD in participants with diabetes are similar to those in individuals without diabetes.
Our gene-based analyses provided some evidence for association of variation in or near three genes with CAD in the population with diabetes: PSRC1, CYGB and PRCD. PSRC1 lays within the CELSR2–PSRC1–SORT1 locus, which has been associated with LDL-cholesterol and CAD, where there is convincing evidence for SORT1 being the causal gene [11]. CYGB and PRCD are located in the same locus on chromosome 17, where we also found some orthogonal evidence of a signal in the CARDIoGRAMplusC4D data. The gene CYGB encodes the protein cytoglobin, which is a recently discovered globin expressed in fibroblasts and smooth muscle cells, and is reported to be an important regulator of cardiovascular tone in mice [12]. The results of these gene-based analyses need to be replicated in other data sets, but tentatively suggest CYGB as an interesting candidate gene, especially given increasing recognition of the important role of smooth muscle cells in atherosclerosis [13].
In this study, we did not detect genetic variants specific to CAD in the population with diabetes. In our study, power was limited for lower allele frequencies and smaller effect sizes, and so we cannot exclude the possibility that such variants will be detected in larger future studies. However, the general interpretation of our findings is that the genetic architecture of CAD in the population without diabetes and in those with diabetes is largely similar.
Limitations of this study include a limited power in other ethnicities than white British, although it is reasonable to assume that the underlying biology, but not necessarily linkage disequilibrium structure, is similar across ethnic backgrounds. Another limitation is that individuals in the control group are likely to have subclinical or clinical CAD not captured by our definition. Such misclassification likely attenuated our estimates and thereby increases the chance of false negative findings and underestimation of effects. Finally, our study included only participants with previously diagnosed diabetes and a mean age of 63 years, which is rather young for a diabetes cohort. In the future, when the biomarker data (including glucose and HbA1c) become available in the UK Biobank, it will be possible to extend the current study to those with undiagnosed diabetes.
Our results indicate large similarities between the genetic architecture of CAD in the individuals with and without diabetes. Larger studies are needed to establish whether there are diabetes-specific CAD loci with smaller effect sizes.
Electronic supplementary material
Acknowledgements
We would like to thank S. Eastwood at the University College London, UK, for her valuable assistance with the diabetes classification algorithm. The computations were in part performed on resources provided by the Swedish National Infrastructure for Computing (SNIC) at UPPMAX. This research has been conducted using the UK Biobank Resource under Application Number 11140 and 13721.
Abbreviations
- BiLEVE
Biobank Lung Exome Evaluation
- CAD
Coronary artery disease
- CARDIoGRAMplusC4D
Coronary ARtery DIsease Genome wide Replication and Meta-analysis plus The Coronary Artery Disease Genetics
- SNP
Single-nucleotide polymorphism
Contribution statement
TF is responsible for the integrity of the work as a whole. TF and SG performed statistical analysis. TF, SG, MO-M and EI provided substantial contributions to conception and design of the study and interpretation of data. TF drafted the article and SG, MO-M and EI critically revised it for important intellectual content. All authors approved the final version to be published.
Funding
The study was conducted with support from the Swedish Research Council (2015-03477), Swedish Heart-Lung Foundation (20150429), Borgströms foundation, Göran Gustafssons Stiftelse (1637), National Institutes of Health (R01DK106236) and Axel and Signe Lagerman’s Foundation.
Data availability
This research has been conducted using the UK Biobank resource. All bona fide researchers can apply to use the UK Biobank resource for health-related research that is in the public interest (www.ukbiobank.ac.uk/register-apply/). CARDIoGRAMplusC4D data was accessed from www.cardiogramplusc4d.org/.
Duality of interest
EI is a scientific advisor for Precision Wellness, and has received consulting fees from Olink Proteomics for work unrelated to the present project. The other authors have no duality of interest.
References
- 1.Emerging Risk Factors C. Sarwar N, Gao P, et al. Diabetes mellitus, fasting blood glucose concentration, and risk of vascular disease: a collaborative meta-analysis of 102 prospective studies. Lancet. 2010;375:2215–2222. doi: 10.1016/S0140-6736(10)60484-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Eastwood SV, Mathur R, Atkinson M, et al. Algorithms for the Capture and Adjudication of Prevalent and Incident Diabetes in UK Biobank. PLoS One. 2016;11:e0162388. doi: 10.1371/journal.pone.0162388. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Bycroft C, Freeman C, Petkova D, et al. (2017) Genome-wide genetic data on ~500,000 UK Biobank participants. bioRxiv. 10.1101/166298
- 4.Chang CC, Chow CC, Tellier LC, Vattikuti S, Purcell SM, Lee JJ. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience. 2015;4:7. doi: 10.1186/s13742-015-0047-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.de Leeuw CA, Mooij JM, Heskes T, Posthuma D. MAGMA: generalized gene-set analysis of GWAS data. PLoS Comput Biol. 2015;11:e1004219. doi: 10.1371/journal.pcbi.1004219. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Watanabe K, Taskesen E, van Bochoven A, Posthuma D. Functional mapping and annotation of genetic associations with FUMA. Nat Commun. 2017;8:1826. doi: 10.1038/s41467-017-01261-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Pruim RJ, Welch RP, Sanna S, et al. LocusZoom: regional visualization of genome-wide association scan results. Bioinformatics. 2010;26:2336–2337. doi: 10.1093/bioinformatics/btq419. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Nikpay M, Goel A, Won HH, et al. A comprehensive 1,000 Genomes-based genome-wide association meta-analysis of coronary artery disease. Nat Genet. 2015;47:1121–1130. doi: 10.1038/ng.3396. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Qi L, Qi Q, Prudente S, et al. Association between a genetic variant related to glutamic acid metabolism and coronary heart disease in individuals with type 2 diabetes. JAMA. 2013;310:821–828. doi: 10.1001/jama.2013.276305. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Doney AS, Fischer B, Leese G, Morris AD, Palmer CN. Cardiovascular risk in type 2 diabetes is associated with variation at the PPARG locus: a Go-DARTS study. Arterioscler Thromb Vasc Biol. 2004;24:2403–2407. doi: 10.1161/01.ATV.0000147897.57527.e4. [DOI] [PubMed] [Google Scholar]
- 11.Musunuru K, Strong A, Frank-Kamenetsky M, et al. From noncoding variant to phenotype via SORT1 at the 1p13 cholesterol locus. Nature. 2010;466:714–719. doi: 10.1038/nature09266. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Liu X, El-Mahdy MA, Boslett J, et al. Cytoglobin regulates blood pressure and vascular tone through nitric oxide metabolism in the vascular wall. Nat Commun. 2017;8:14807. doi: 10.1038/ncomms14807. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Howson JMM, Zhao W, Barnes DR, et al. Fifteen new risk loci for coronary artery disease highlight arterial-wall-specific mechanisms. Nat Genet. 2017;49:1113–1119. doi: 10.1038/ng.3874. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
This research has been conducted using the UK Biobank resource. All bona fide researchers can apply to use the UK Biobank resource for health-related research that is in the public interest (www.ukbiobank.ac.uk/register-apply/). CARDIoGRAMplusC4D data was accessed from www.cardiogramplusc4d.org/.