Genome-wide association analysis of type 2 diabetes in the EPIC-InterAct study

Lina Cai; Eleanor Wheeler; Nicola D Kerrison; Jian’an Luan; Panos Deloukas; Paul W Franks; Pilar Amiano; Eva Ardanaz; Catalina Bonet; Guy Fagherazzi; Leif C Groop; Rudolf Kaaks; José María Huerta; Giovanna Masala; Peter M Nilsson; Kim Overvad; Valeria Pala; Salvatore Panico; Miguel Rodriguez-Barranco; Olov Rolandsson; Carlotta Sacerdote; Matthias B Schulze; Annemieke M W Spijkerman; Anne Tjonneland; Rosario Tumino; Yvonne T van der Schouw; Stephen J Sharp; Nita G Forouhi; Elio Riboli; Mark I McCarthy; Inês Barroso; Claudia Langenberg; Nicholas J Wareham

doi:10.1038/s41597-020-00716-7

. 2020 Nov 13;7:393. doi: 10.1038/s41597-020-00716-7

Genome-wide association analysis of type 2 diabetes in the EPIC-InterAct study

Lina Cai ¹, Eleanor Wheeler ¹, Nicola D Kerrison ¹, Jian’an Luan ¹, Panos Deloukas ², Paul W Franks ^3,⁴, Pilar Amiano ⁵, Eva Ardanaz ^6,^7,⁸, Catalina Bonet ⁹, Guy Fagherazzi ^10,¹¹, Leif C Groop ³, Rudolf Kaaks ¹², José María Huerta ^8,¹³, Giovanna Masala ¹⁴, Peter M Nilsson ³, Kim Overvad ^15,¹⁶, Valeria Pala ¹⁷, Salvatore Panico ¹⁸, Miguel Rodriguez-Barranco ^8,^19,²⁰, Olov Rolandsson ⁴, Carlotta Sacerdote ²¹, Matthias B Schulze ^22,^23,²⁴, Annemieke M W Spijkerman ²⁵, Anne Tjonneland ²⁶, Rosario Tumino ^27,²⁸, Yvonne T van der Schouw ²⁹, Stephen J Sharp ¹, Nita G Forouhi ¹, Elio Riboli ³⁰, Mark I McCarthy ^31,^32,^33,³⁵, Inês Barroso ³⁴, Claudia Langenberg ¹, Nicholas J Wareham ^1,^✉

¹MRC Epidemiology Unit, University of Cambridge, Cambridge, United Kingdom

²Clinical Pharmacology Centre, Queen Mary University of London, London, United Kingdom

³Department of Clinical Sciences, Clinical Research Center, Skåne University Hospital, Lund University, 20502 Malmö, Sweden

⁴Department of Public Health and Clinical Medicine, Umeå University, 90187 Umeå, Sweden

⁵Ministry of Health of the Basque Government, Public Health Division of Gipuzkoa, Biodonostia Health Research Institute, Donostia-San Sebastian, Spain

⁶Navarra Public Health Institute, Pamplona, Spain

⁷IdiSNA, Navarra Institute for Health Research, Pamplona, Spain

⁸Centro de Investigación Biomédica en Red de Epidemiología y Salud Pública (CIBERESP), Madrid, Spain

⁹Unit of Nutrition and Cancer, Catalan Institute of Oncology - ICO, Nutrition and Cancer Group, Bellvitge Biomedical Research Institute - IDIBELL, L’Hospitalet de Llobregat, Barcelona, 08908 Spain

¹⁰Digital Epidemiology and e-Health Research Hub, Department of Population Health, Luxembourg Institute of Health, 1A-B, rue Thomas Edison, L-1445 Strassen, Luxembourg

¹¹Center of Epidemiology and Population Health UMR 1018, Inserm, Paris South - Paris Saclay University, Gustave Roussy Institute, Villejuif, France

¹²Division of Cancer Epidemiology, German Cancer Research Center (DKFZ), Im Neuenheimer Feld 581, 69120 Heidelberg, Germany

¹³Department of Epidemiology, Murcia Regional Health Council, IMIB-Arrixaca, Murcia, Spain

¹⁴Cancer Risk Factors and Life-Style Epidemiology Unit, Institute for Cancer Research, Prevention and Clinical Network - ISPRO, Florence, Italy

¹⁵Department of Public Health, Aarhus University, Bartholins Allé 2, DK-8000 Aarhus C, Denmark

¹⁶Department of Cardiology, Aalborg University Hospital, Sdr. Skovvej 15, DK-9000 Aalborg, Denmark

¹⁷Epidemiology and Prevention Unit, Fondazione IRCCS Istituto Nazionale dei Tumori di Milano, Milan, Italy

¹⁸Dipartimento di Medicina Clinica e Chirurgia, Federico II University, via Pansini 5, 80131 Naples, Italy

¹⁹Escuela Andaluza de Salud Pública (EASP), Granada, Spain

²⁰Instituto de Investigación Biosanitaria ibs.Granada, Granada, Spain

²¹Unit of Cancer Epidemiology, Città della Salute e della Scienza University-Hospital and Center for Cancer Prevention (CPO), Turin, Italy

²²Department of Molecular Epidemiology, German Institute of Human Nutrition Potsdam-Rehbruecke, Nuthetal, Germany

²³German Center for Diabetes Research (DZD), Neuherberg, Germany

²⁴Institute of Nutrition Science, University of Potsdam, Nuthetal, Germany

²⁵National Institute for Public Health and the Environment (RIVM), PO Box 1, 3720 BA Bilthoven, The Netherlands

²⁶Danish Cancer Society Research Center, Strandboulevarden 49, 2100 Copenhagen, Denmark

²⁷Cancer Registry and Histopathology Department, Azienda Sanitaria Provinciale No 7, Piazza Igea Nr 1, 97100 Ragusa, Italy

²⁸Associazone Iblea per la Ricerca Epidemiologica -Organizazione Non Lucrativa di Utilità Sociale, Piazza Amcione No 2, 97100 Ragusa, Italy

²⁹Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, 3584 CG Utrecht, the Netherlands

³⁰School of Public Health, Imperial College London, London, United Kingdom

³¹Wellcome Centre for Human Genetics, University of Oxford, Oxford, United Kingdom

³²Oxford Centre for Diabetes, University of Oxford, Oxford, United Kingdom

³³Oxford NIHR Biomedical Research Centre, Oxford University Hospitals NHS Foundation Trust, John Radcliffe Hospital, Oxford, United Kingdom

³⁴University of Exeter Medical School, Exeter, United Kingdom

³⁵Present Address: 1 DNA Way, South San Francisco, CA 94080 USA

^✉

Corresponding author.

PMCID: PMC7666191 PMID: 33188205

Abstract

Type 2 diabetes (T2D) is a global public health challenge. Whilst the advent of genome-wide association studies has identified >400 genetic variants associated with T2D, our understanding of its biological mechanisms and translational insights is still limited. The EPIC-InterAct project, centred in 8 countries in the European Prospective Investigations into Cancer and Nutrition study, is one of the largest prospective studies of T2D. Established as a nested case-cohort study to investigate the interplay between genetic and lifestyle behavioural factors on the risk of T2D, a total of 12,403 individuals were identified as incident T2D cases, and a representative sub-cohort of 16,154 individuals was selected from a larger cohort of 340,234 participants with a follow-up time of 3.99 million person-years. We describe the results from a genome-wide association analysis between more than 8.9 million SNPs and T2D risk among 22,326 individuals (9,978 cases and 12,348 non-cases) from the EPIC-InterAct study. The summary statistics to be shared provide a valuable resource to facilitate further investigations into the genetics of T2D.

Subject terms: Epidemiology, Genetics research, Type 2 diabetes

Measurement(s)	type 2 diabetes mellitus
Technology Type(s)	case-cohort study • genome wide association study
Factor Type(s)	genotype dosage • genetic principal components • study centre • Age • Sex
Sample Characteristic - Organism	Homo sapiens
Sample Characteristic - Location	Europe

Open in a new tab

Machine-accessible metadata file describing the reported data: 10.6084/m9.figshare.12981821

Background & Summary

Diabetes is one of the fastest-growing health challenges of the 21^st century. The most common form of diabetes, type 2 diabetes (T2D), is a complex multifactorial disease which can lead to further severe health consequences such as cardiovascular diseases and premature death. In 2019, 463 million people worldwide were living with diabetes according to the International Diabetes Federation, and this number is expected to rise to 700 million by 2045¹. Genome-wide association studies (GWAS) have made considerable progress in identifying genetic risk factors and in providing evidence for more in-depth understanding of the biological and pathological pathways underlying T2D. A recent study performed a meta-analysis of T2D across 32 GWAS of European ancestry participants and identified 243 genome-wide significant loci (403 distinct genetic variants) associated with T2D risk². The summary statistics from this meta-analysis are publicly available; however, the GWAS results for each participating study, including EPIC-InterAct, cannot be acquired easily.

To date, a growing body of comprehensive methods has been developed for downstream analyses of GWAS. Sharing of summary statistics can help enable these analyses, for example, by providing researchers with a more convenient way to look-up genetic association effect estimates to conduct causal inference analyses using methods such as two-sample Mendelian Randomization which assumes samples are non-overlapping^3,4. In addition, sharing GWAS results can help researchers to further their understanding of the shared genetic basis of T2D with other traits of interest, to perform fine-mapping to pinpoint the causal genetic variants or identify genetic loci shared with other risk factors and disease outcomes. Therefore, the aim of this current work was to provide a reference dataset for researchers to utilize in order to conduct further genetic analyses, generate hypotheses and improve understanding of the aetiology, the biological pathways and mechanisms of T2D and related metabolic and cardiovascular diseases.

Methods

Study design and participants

The EPIC-InterAct study is a large-scale prospective study nested in the European Prospective Investigation into Cancer (EPIC) study, facilitating the investigation of genetic and lifestyle factors on the risk of T2D among European populations. A total of 26 research centres located in eight different European countries (France, Italy, Spain, UK, the Netherlands, Germany, Sweden, and Denmark) were included. The study design, sample collection and genotyping have been described in detail previously^5,6.

In brief, the EPIC-InterAct study adopted a nested case-cohort design. A total of 340,234 participants with stored blood and information reported on diabetes status from the wider EPIC study were followed up for 3.99 million person-years. During the follow-up, researchers from participating study centres ascertained and verified 12,403 incident cases of T2D through self-reported history of T2D, doctor diagnosed T2D and diabetes medication use, linkage to primary care registers, secondary care registers, medication use (pharmacy/ drug registers), hospital admissions and mortality data or local and national diabetes and pharmaceutical registers⁵. To select a representative sub-cohort, a total of 16,835 participants were randomly selected at baseline with numbers proportional to the number of participants in each participating centre. Participants with prevalent (n = 548), unknown (n = 129) and post-censoring diabetes status (n = 4) were excluded, with a total of 16,154 diabetes-free individuals remaining in the EPIC-InterAct sub-cohort (Fig. 1).

Fig. 1 — Overview of the EPIC-InterAct study, genotyping and genome-wide association meta-analysis for T2D in 22,326 participants.

DNA samples and genotyping platforms

Blood samples were collected at recruitment and stored in liquid nitrogen at the International Agency for Research into Cancer (IARC) in Lyon, France, or in local biorepositories except for Umeå where −80 °C freezers were used. DNA was extracted and quantified, with details of sample handling described elsewhere^5,7.

Available EPIC-InterAct DNA samples were genotyped using two genotyping platforms. A total of 10,023 EPIC-InterAct participants were randomly selected for genome-wide genotyping using the Illumina 660W-Quad BeadChip (Illumina, Inc., San Diego, California) at the Wellcome Trust Sanger Institute with the number of individuals selected per centre being proportional to the percentage of total cases in that centre, except the Danish participants who did not have available DNA samples at the time⁷. Samples were excluded if they had a low call rate (<95.4%), a lack of concordance with previous genotyping results, a mismatch between self-reported sex and the sex inferred from genetic data (X chromosome heterozygosity) or missing data, or they were autosomal heterozygosity outliers, overall array intensity outliers, ethnic outliers (non-European ancestry) or duplicate samples. Related individuals in the Illumina 660 W genotyping array group were identified based on an identity by descent (IBD) pi-hat threshold of 0.1875 (mid-point between second-degree (0.25) and third-degree (0.125) relatives), and those with the largest number of relatives or the lowest call rate were removed preferentially. A total of 9,290 samples genotyped on the Illumina 600 W array passed initial sample quality control (QC).

A total 13,474 individuals from the remaining of EPIC-InterAct samples (including the Danish samples) were genotyped using the Illumina core-exome 12v1 and 24v1 arrays at Cambridge Genomic Services in the Department of Pathology at the University of Cambridge. The two core-exome arrays are very similar; hence the genotype data were merged for further analyses. Following comparable QC procedures as above, a total of 13,202 samples genotyped using the core-exome arrays passed initial sample QC.

Following initial sample QC, an additional 166 participants who had relatives (IBD pi-hat threshold of 0.1875) across the different genotyping arrays (Illumina 660 W vs Illumina core-exome) were excluded, and a total of 22,326 individuals were included in the downstream genetic analyses (Fig. 1; Table 1).

Table 1.

Sample size of the EPIC-InterAct T2D GWAS analysis by diabetes outcome status and genotyping array.

Diabetes Outcome	Genotyping Array		Total
Diabetes Outcome	Ill660W	Core-exome	Total
non-case	4,625	7,723	12,348
case	4,553	5,425	9,978
Total	9,178	13,148	22,326

Open in a new tab

Genotype imputation

Prior to imputation, single nucleotide polymorphisms (SNPs) were removed if they had Hardy Weinberg p-value < 10⁻⁶ or were not found in the Haplotype Reference Consortium (HRC) reference panel version 1.0⁸, were A/T or G/C with minor allele frequency (MAF) >0.4, had an allele frequency difference >0.2 with the reference panel, or were short insertion-deletion mutations (indels). A total of 553,115 and 366,044 SNPs passed pre-imputation SNP QC in the Illumina 660W-Quad BeadChip and combined Illumina core-exome arrays, respectively. Imputation was performed using the HRC reference panel and IMPUTE v2.3.2 software⁹. Monomorphic and singleton SNPs and those with imputation quality (info) <0.3 were excluded prior to genetic analyses.

Genome-wide association meta-analysis

For genome-wide association analysis of T2D, all 22,326 included individuals in the EPIC-InterAct study were of European ancestry, including 9,978 type 2 diabetes cases (including 616 cases from the sub-cohort) and 12,348 non-cases from the sub-cohort, among whom 9,178 participants were genotyped on the Illumina 660 W array and 13,148 using the core-exome array (Fig. 1; Table 1). The mean follow-up time for the EPIC-InterAct cases included in the analyses was 6.8 years (standard deviation (s.d.) =3.3 years), and 12.2 years (s.d. =2.0 years) for the sub-cohort.

We used logistic regression to test genome-wide associations with T2D, rather than Prentice- weighted Cox regression that takes into account the case-cohort design of EPIC-InterAct. Logistic regression was chosen both for computational efficiency and because it has been shown to have greater power than Prentice-weighted Cox regression to detect SNP-disease associations¹⁰. All T2D incident cases including those from the sub-cohort were coded as ‘1’, and non-cases from the sub-cohort were coded as ‘0’. To estimate the association between T2D and each genetic variant, we performed logistic regression under an additive genetic model, adjusting for age, sex, study centre and the first four genetic principal components to account for population structure using QUICKTEST Version 0.98¹¹. Dummy variables for each study centre (combining the six centres in France due to the small sample size in each French centre) were included in the model to account for the differences between participants from each country and the potential confounding by larger scale relatedness between participants from each study centre. Genome-wide analyses were performed separately for each genotyping array and combined using an inverse variance weighted fixed-effect meta-analysis in METAL¹². The final meta-analysis had an effective sample size¹² of up to 21,924.

Ethics statement

The EPIC-InterAct study was approved by the local ethics committee in the participating countries and the Internal Review Board of the International Agency for Research on Cancer. All participants gave written informed consent. The study was coordinated by the Medical Research Council Epidemiology Unit at the University of Cambridge.

Data Records

Genome-wide association summary statistics from the meta-analysis of T2D in the EPIC-InterAct study and Cox-regression analysis results for the 370 top T2D SNPs from the recently published DIAMANTE study² are available to download from the Dryad Digital Repository (10.5061/dryad.qnk98sfcg)¹³.

The genome-wide summary statistics are in tab-delimited TXT format, including rsID (based on the HRC reference panel), chromosome, position (using the reference genome GRCh37 (hg19)), effect allele, other allele, frequency of effect allele, effect estimate, standard error of the effect estimate, p-value, assessment of heterogeneity across the two genotyping arrays, total sample size and effective sample size for the SNP.

The Cox-regression analysis results are in tab-delimited TXT format, including MarkerName (hg19), rsID (based on the HRC reference panel), chromosome, position (using the reference genome GRCh37 (hg19)), effect allele, other allele, frequency of effect allele, beta, standard error of beta, hazard ratio (HR), lower-bound of 95% confidence interval (CI) of HR, upper-bound of 95% confidence interval (CI) of HR, p-value, imputation quality, total sample size.

Alternatively, the genome-wide summary statistics data is also available in NHGRI-EBI’s GWAS Catalog with accession ID GCST90006934¹⁴. It can be downloaded via the following ftp link: ftp://ftp.ebi.ac.uk/pub/databases/gwas/summary_statistics/GCST90006934.

In addition, access to individual-level EPIC data is available through the International Agency for Research on Cancer (IARC): https://epic.iarc.fr/access/, where there is a controlled-access repository. A clear and open access request mechanism and data use agreement is in place.

Technical Validation

For the meta-analysis, only SNPs with minor allele frequency (MAF) > 0.5%, imputation information score > 0.4, Hardy-Weinberg Equilibrium p-value > 1 × 10⁻⁶ and association effect standard error < 10 from each genotyping platform were included. After the meta-analysis, 31 SNPs with heterogeneity p-value < 1 × 10⁻⁵ were excluded. A total of 8,924,492 SNPs remained in the shared meta-analysis results. The numbers of genetic variants in each MAF bin are shown in Table 2.

Table 2.

Number of SNPs in each minor allele frequency (MAF) bin in the final EPIC-InterAct T2D GWAS meta-analysis result after quality control.

MAF bin	(0.005,0.01]	(0.01, 0.05]	(0.05, 0.1]	(0.1,0.2]	(0.2, 0.3]	(0.3,0.4]	(0.4,0.5]
N of SNP^a	1,201,837	2,287,670	1,117,493	1,429,429	1,074,208	937,594	876,261

Open in a new tab

^aSNPs with meta-analysis heterogeneity p value < 10⁻⁵, imputation info <0.4 and minor allele count < = 10 were excluded.

The Manhattan plot is shown in Fig. 2. The quantile-quantile plot (Fig. 3) showed no evidence of inflation from confounding or other biases, supported by the LD score regression¹⁵ intercept, which was very close to 1 (1.0054); therefore, no genomic control correction was performed. As a positive control, the top independent genome-wide significant signal from the meta-analysis was the well-established TCF7L2 variant rs7903146¹⁶ (p = 1.30 × 10⁻³⁸).

Because logistic regression may potentially yield inflated effect estimates when applied in a case-cohort study¹⁰, we compared the strength of associations from the GWAS meta-analysis (logistic regression) and Prentice-weighted Cox-regression analyses adjusting for sex, study centre and first four principal components with age as the underlying time-scale variable for established T2D genetic variants. A total of 370 SNPs from the recently published DIAMANTE study² are available in our HRC imputed EPIC-InterAct genotype data. Among these, 175 SNPs with p-value < 0.05 in the EPIC-InterAct meta-analysis results were included in the comparison. The Pearson correlation coefficient between the log of hazard ratios from the Cox-regression model and the log of odds ratios from logistic regression models was 0.98 (p = 3.1 × 10⁻¹²⁶) (Fig. 4), showing the effects are highly comparable.

Fig. 4 — Log(hazard ratios) from the Prentice-weighted Cox regression model and log(odds ratios) from the logistic model for established genetic variants from a previous meta-analysis² with p < 0.05 in the EPIC-InterAct T2D GWAS summary statistics (n = 175).

Acknowledgements

We thank all EPIC participants and staff for their contribution to the study. We thank Nicola Kerrison (MRC Epidemiology Unit, Cambridge) for managing the data for the InterAct Project and staff from the Laboratory Team, Field Epidemiology Team, and Data Functional Group of the MRC Epidemiology Unit in Cambridge, UK, for carrying out sample preparation, DNA provision and quality control, genotyping, and data-handling work. The funding of the EPIC-InterAct study was provided by the EU FP6 Programme [grant number Integrated Project LSHM_CT_2006_037197]. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. The authors acknowledge support from the Medical Research Council Epidemiology Unit (grants MC_UU_12015/1 and MC_UU_12015/5) and Wellcome WT206194.

Author contributions

Conceived and designed the experiments: L.C., E.W., N.K., J.L., I.B., P.D., C.L., N.J.W.; analysed the data: L.C., N.K., J.L.; wrote and contributed to the writing of the first manuscript draft: L.C., E.W., N.K., J.L., C.L., N.J.W.; all authors contributed to the interpretation of the data, revised the article critically for important intellectual content, and approved the final version of the paper to be published; NJW is the guarantor of this work and, as such, has full access to all the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis.

Code availability

IMPUTE v2.3.2: https://mathgen.stats.ox.ac.uk/impute/impute_v2.html QUICKTEST Version 0.98: http://toby.freeshell.org/software/quicktest.shtml METAL: https://genome.sph.umich.edu/wiki/METAL All other analyses, including the Prentice-weighted Cox-regression analyses, were performed using R 3.4.2¹⁷.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

1.International Diabetes Federation. IDF Diabetes Atlas, 9th edn. https://www.diabetesatlas.org (2019).
2.Mahajan A, et al. Fine-mapping of an expanded set of type 2 diabetes loci to single-variant resolution using high-density imputation and islet-specific epigenome maps Individual study design and principal investigators Europe PMC Funders Group. Nat. Genet. 2018;50:1505–1513. doi: 10.1038/s41588-018-0241-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Pierce BL, Burgess S. Efficient design for mendelian randomization studies: Subsample and 2-sample instrumental variable estimators. Am. J. Epidemiol. 2013;178:1177–1184. doi: 10.1093/aje/kwt084. [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Bowden J, Smith GD, Haycock PC, Burgess S. Consistent estimation in Mendelian randomization with some invalid instruments using a weighted median estimator. Genet. Epidemiol. 2016;40:304–314. doi: 10.1002/gepi.21965. [DOI] [PMC free article] [PubMed] [Google Scholar]
5.The InterAct Consortium. et al. The InterAct Project: an examination of the interaction of genetic and lifestyle factors on the incidence of type 2 diabetes in the EPIC Study. Diabetologia. 2011;54:2272–2282. doi: 10.1007/s00125-011-2182-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Forouhi NG, Wareham NJ. The EPIC-InterAct Study: A study of the interplay between genetic and lifestyle behavioral factors on the risk of type 2 diabetes in European populations. Curr. Nutr. Rep. 2014;3:355–363. doi: 10.1007/s13668-014-0098-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Langenberg, C. et al. Gene-lifestyle interaction and type 2 diabetes: the EPIC InterAct Case-Cohort Study. PLoS Med. 11 (2014). [DOI] [PMC free article] [PubMed]
8.McCarthy S, et al. A reference panel of 64,976 haplotypes for genotype imputation. Nat. Genet. 2016;48:1279–1283. doi: 10.1038/ng.3643. [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Howie BN, Donnelly P, Marchini J. A flexible and accurate genotype imputation method for the next generation of genome-wide association studies. PLoS Genet. 2009;5:e1000529. doi: 10.1371/journal.pgen.1000529. [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Staley JR, et al. A comparison of Cox and logistic regression for use in genome-wide association studies of cohort and case-cohort design. Eur. J. Hum. Genet. 2017;25:854–862. doi: 10.1038/ejhg.2017.78. [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Kutalik Z, et al. Methods for testing association between uncertain genotypes and quantitative traits. Biostatistics. 2011;12:1–17. doi: 10.1093/biostatistics/kxq039. [DOI] [PubMed] [Google Scholar]
12.Willer CJ, Li Y, Abecasis GR. METAL: Fast and efficient meta-analysis of genomewide association scans. Bioinformatics. 2010;26:2190–2191. doi: 10.1093/bioinformatics/btq340. [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Cai L, 2020. EPIC-Data from: Genome-wide association analysis of type 2 diabetes in the EPIC-InterAct study. Dryad Digital Repository. [DOI] [PMC free article] [PubMed]
14.2020. GWAS Catalog. GCST90006934
15.Bulik-Sullivan B, et al. LD score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet. 2015;47:291–295. doi: 10.1038/ng.3211. [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Sladek R, et al. A genome-wide association study identifies novel risk loci for type 2 diabetes. Nature. 2007;445:881–885. doi: 10.1038/nature05616. [DOI] [PubMed] [Google Scholar]
17.R Core Team. R: A language and environment for statistical computing. (2014).

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Citations

Cai L, 2020. EPIC-Data from: Genome-wide association analysis of type 2 diabetes in the EPIC-InterAct study. Dryad Digital Repository. [DOI] [PMC free article] [PubMed]
2020. GWAS Catalog. GCST90006934

Data Availability Statement

[CR1] 1.International Diabetes Federation. IDF Diabetes Atlas, 9th edn. https://www.diabetesatlas.org (2019).

[CR2] 2.Mahajan A, et al. Fine-mapping of an expanded set of type 2 diabetes loci to single-variant resolution using high-density imputation and islet-specific epigenome maps Individual study design and principal investigators Europe PMC Funders Group. Nat. Genet. 2018;50:1505–1513. doi: 10.1038/s41588-018-0241-6. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR3] 3.Pierce BL, Burgess S. Efficient design for mendelian randomization studies: Subsample and 2-sample instrumental variable estimators. Am. J. Epidemiol. 2013;178:1177–1184. doi: 10.1093/aje/kwt084. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR4] 4.Bowden J, Smith GD, Haycock PC, Burgess S. Consistent estimation in Mendelian randomization with some invalid instruments using a weighted median estimator. Genet. Epidemiol. 2016;40:304–314. doi: 10.1002/gepi.21965. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR5] 5.The InterAct Consortium. et al. The InterAct Project: an examination of the interaction of genetic and lifestyle factors on the incidence of type 2 diabetes in the EPIC Study. Diabetologia. 2011;54:2272–2282. doi: 10.1007/s00125-011-2182-9. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR6] 6.Forouhi NG, Wareham NJ. The EPIC-InterAct Study: A study of the interplay between genetic and lifestyle behavioral factors on the risk of type 2 diabetes in European populations. Curr. Nutr. Rep. 2014;3:355–363. doi: 10.1007/s13668-014-0098-y. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR7] 7.Langenberg, C. et al. Gene-lifestyle interaction and type 2 diabetes: the EPIC InterAct Case-Cohort Study. PLoS Med. 11 (2014). [DOI] [PMC free article] [PubMed]

[CR8] 8.McCarthy S, et al. A reference panel of 64,976 haplotypes for genotype imputation. Nat. Genet. 2016;48:1279–1283. doi: 10.1038/ng.3643. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR9] 9.Howie BN, Donnelly P, Marchini J. A flexible and accurate genotype imputation method for the next generation of genome-wide association studies. PLoS Genet. 2009;5:e1000529. doi: 10.1371/journal.pgen.1000529. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR10] 10.Staley JR, et al. A comparison of Cox and logistic regression for use in genome-wide association studies of cohort and case-cohort design. Eur. J. Hum. Genet. 2017;25:854–862. doi: 10.1038/ejhg.2017.78. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR11] 11.Kutalik Z, et al. Methods for testing association between uncertain genotypes and quantitative traits. Biostatistics. 2011;12:1–17. doi: 10.1093/biostatistics/kxq039. [DOI] [PubMed] [Google Scholar]

[CR12] 12.Willer CJ, Li Y, Abecasis GR. METAL: Fast and efficient meta-analysis of genomewide association scans. Bioinformatics. 2010;26:2190–2191. doi: 10.1093/bioinformatics/btq340. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR13] 13.Cai L, 2020. EPIC-Data from: Genome-wide association analysis of type 2 diabetes in the EPIC-InterAct study. Dryad Digital Repository. [DOI] [PMC free article] [PubMed]

[CR14] 14.2020. GWAS Catalog. GCST90006934

[CR15] 15.Bulik-Sullivan B, et al. LD score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet. 2015;47:291–295. doi: 10.1038/ng.3211. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR16] 16.Sladek R, et al. A genome-wide association study identifies novel risk loci for type 2 diabetes. Nature. 2007;445:881–885. doi: 10.1038/nature05616. [DOI] [PubMed] [Google Scholar]

[CR17] 17.R Core Team. R: A language and environment for statistical computing. (2014).

PERMALINK

Genome-wide association analysis of type 2 diabetes in the EPIC-InterAct study

Lina Cai

Eleanor Wheeler

Nicola D Kerrison

Jian’an Luan

Panos Deloukas

Paul W Franks

Pilar Amiano

Eva Ardanaz

Catalina Bonet

Guy Fagherazzi

Leif C Groop

Rudolf Kaaks

José María Huerta

Giovanna Masala

Peter M Nilsson

Kim Overvad

Valeria Pala

Salvatore Panico

Miguel Rodriguez-Barranco

Olov Rolandsson

Carlotta Sacerdote

Matthias B Schulze

Annemieke M W Spijkerman

Anne Tjonneland

Rosario Tumino

Yvonne T van der Schouw

Stephen J Sharp

Nita G Forouhi

Elio Riboli

Mark I McCarthy

Inês Barroso

Claudia Langenberg

Nicholas J Wareham

Abstract

Background & Summary

Methods

Study design and participants

Fig. 1.

DNA samples and genotyping platforms

Table 1.

Genotype imputation

Genome-wide association meta-analysis

Ethics statement

Data Records

Technical Validation

Table 2.

Fig. 2.

Fig. 3.

Fig. 4.

Acknowledgements

Author contributions

Code availability

Competing interests

Footnotes

References

Associated Data

Data Citations

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases