Abstract
Type 2 diabetes (T2D) is the result of metabolic defects in insulin secretion and insulin sensitivity, yet most T2D loci identified to date influence insulin secretion. We hypothesized that T2D loci, particularly those affecting insulin sensitivity, can be identified through interaction with insulin secretion loci. To test this hypothesis, single nucleotide polymorphisms (SNPs) associated with acute insulin response to glucose (AIRg), a dynamic measure of first-phase insulin secretion, were identified in African Americans from the Insulin Resistance Atherosclerosis Family Study (IRASFS; n = 492 subjects). These SNPs were tested for interaction, individually and jointly as a genetic risk score (GRS), using genome-wide association study (GWAS) data from five cohorts (ARIC, CARDIA, JHS, MESA, WFSM; n = 2,725 cases, 4,167 controls) with T2D as the outcome. In single variant analyses, suggestively significant (Pinteraction<5×10−6) interactions were observed at several loci including LYPLAL1 (rs10746381), CHN2 (rs7796525), and EXOC1 (rs4289500). Notable AIRg GRS interactions were observed with SAMD4A (rs11627203) and UTRN (rs17074194). These data support the hypothesis that additional genetic factors contributing to T2D risk can be identified by interactions with insulin secretion loci.
Introduction
Although common variants examined in genome-wide association studies (GWAS) have identified ~80 loci associated with T2D risk, these variants explain only about 15% of T2D heritability [1,2]. A portion of the missing heritability may be explained by epistasis, which occurs when a genetic risk factor is modified by other factors in an individual’s genetic background [3]. Epistasis, or gene-gene interaction, analyses may facilitate the detection of novel loci when non-additive effects exist, but may also provide novel insights illuminating biological mechanisms underlying complex diseases such as T2D [4].
T2D is characterized by impaired insulin secretion arising from pancreatic beta-cell dysfunction and insulin resistance in hepatic, skeletal muscle, and other peripheral tissues, leading to decreased plasma glucose uptake. However, documented T2D loci primarily map to genes influencing insulin secretion or other aspects of beta-cell biology [1]. Given the underlying bimodal pathophysiology, T2D may be a particularly well-suited disease model for hypothesis-driven investigation of epistatic interactions. Genetic insults to both insulin secretion and insulin sensitivity may jointly increase an individual’s T2D risk in a non-additive manner. Considering the higher prevalence rate of T2D, insulin resistance, and obesity, African Americans are optimal for the study of genetic interactions that contribute to T2D risk.
In an effort to identify interactions contributing to T2D and to discover novel insulin sensitivity loci, we hypothesized that T2D risk loci, particularly those affecting insulin sensitivity, could be identified by interaction analyses with insulin secretion loci. In cross-sectional meta-analyses of five T2D studies (ARIC, CARDIA, JHS, MESA, and WFSM), we tested whether 5 insulin secretion SNPs, or a genetic risk score summarizing these SNPs, modified genome-wide SNP associations with T2D risk.
Research Design and Methods
Subjects
Two sources of data were analyzed in this study. Primary inferences of association with insulin secretion were derived from African American participants (n = 492 individuals from 42 families) in the Insulin Resistance Atherosclerosis Family Study (IRASFS), a metabolically well-characterized cohort [5]. Glucose homeostasis traits were measured by the frequently sampled intravenous glucose tolerance test (FSIGT) [5]. Briefly, a 50% glucose solution (0.3g/kg) and regular human insulin (0.03units/kg) were injected intravenously at 0 and 20 minutes, respectively. Blood was collected at −5, 2, 4, 8, 19, 22, 30, 40, 50, 70, 100, and 180 minutes for measurement of plasma glucose and insulin. AIRg was calculated as the increase in insulin at 2–8 minutes above the basal (fasting) insulin level after the bolus glucose injection at 0–1 minute. Insulin sensitivity (SI) was calculated by mathematical modeling using the MINMOD program (version 3.0 [1994]) [6]. Disposition index (DI) was calculated as the product of SI and AIRg.
Inferences of genome-wide epistatic interaction with insulin secretion loci for T2D susceptibility were derived from African American participants from the Atherosclerosis Risk in Communities Study (ARIC; n = 955 T2D cases, 414 controls), Coronary Artery Risk Development in Young Adults (CARDIA; n = 94 T2D cases, 654 controls), Jackson Heart Study (JHS; n = 333 T2D cases, 1,450 controls), Multi-Ethnic Study of Atherosclerosis (MESA; n = 411 T2D cases, 793 controls), and the Wake Forest School of Medicine (WFSM; n = 932 T2D cases, 856 controls) cohorts for a total of 2,725 T2D cases and 4,167 controls [7–12]. T2D was diagnosed according to the American Diabetes Association criteria with at least one of the following: fasting glucose ≥126 mg/dL, 2-h oral glucose tolerance test glucose ≥200 mg/dL, random glucose ≥200 mg/dL, use of oral hypoglycemic agents and/or insulin, or physician diagnosed diabetes. Subjects diagnosed before 25 years of age were excluded. Normal glucose tolerance was defined as fasting glucose <100 mg/dL and 2-h oral glucose tolerance test glucose <140 mg/dL (if available) without reported use of diabetes medications. Control subjects <25 years of age were excluded.
IRB approval was obtained at all sites and all participants provided written informed consent. Descriptions of the T2D study cohorts and funding sources for each individual study are summarized in S1 Appendix.
Genotyping, imputation, and quality control
For the IRASFS samples, genotyping and quality control were performed at the Wake Forest Center for Genomics and Personalized Medicine Research using the Illumina Infinium HumanExome BeadChip v1.0 as previously described [13]. Briefly, the exome chip contained 247,870 variants (92% protein coding), In addition, the chip included 64 SNPs associated with T2D from previous GWAS in Europeans, many of which have been implicated in insulin secretion (exome chip design: http://genome.sph.umich.edu/wiki/Exome_Chip_Design). Sample and autosomal SNP call rates were ≥99%, and SNPs with poor cluster separation (<0.35) were excluded. Mendelian errors were identified using PedCheck [14] and resolved by removing conflicting genotypes. Hardy–Weinberg Equilibrium (HWE) was assessed in unrelated samples (n = 39) using PLINK (http://pngu.mgh.harvard.edu/purcell/plink) [15] to reduce biases introduced by familial allele frequencies. All variants were in accordance with HWE (P>1x10-5).
As described in detail previously[16], the T2D study samples were genotyped using the Affymetrix Genome-Wide Human SNP Array 6.0. For the ARIC, CARDIA, JHS, and MESA cohorts, genotyping and quality control were completed by the National Heart, Lung, and Blood Institute’s (NHLBI’s) Candidate Gene Association Resource (CARe) at the Broad Institute [17]. Genotyping for the WFSM study was performed at the Center for Inherited Disease Research (CIDR). For all T2D studies, imputation was performed using MACH with the function–mle (version 1.0.16, http://www.sph.umich.edu/csg/abecasis/MaCH/) to obtain missing genotypes and replace genotypes inconsistent with reference haplotypes as previously described [18]. SNPs with call rate ≥95% and minor allele frequency (MAF) ≥1% that passed study-specific quality control were used for imputation [17,19]. A 1:1 HapMap II (NCBI Build 36) CEU:YRI (European:African) consensus haplotype was used as reference. A total of 2,713,329 to 2,907,086 autosomal SNPs from each GWAS with call rate ≥95%, MAF ≥1%, and Hardy-Weinberg P-value ≥0.0001 for genotyped SNPs and MAF ≥1% and RSQ ≥0.5 for imputed SNPs were included in subsequent data analyses.
Principal component analysis
For IRASFS, admixture was estimated using principal components (PCs) from 39 ancestry informative markers (AIMs) and including HapMap CEU and YRI samples for comparison [20]. Only PC1 correlated with HapMap populations, and was thus used as a covariate in all analyses.
For the T2D studies, PCs were computed for each study using high-quality SNPs as previously described [13,17–19,21]. The first PC was highly correlated (r2 >0.87) with global African-European ancestry, as measured by ANCESTRYMAP [22], STRUCTURE [23], or FRAPPE [24]. The African American T2D study samples had an average of 80% African ancestry. By analyzing unrelated samples from all studies using SMARTPCA [21], only the first PC appeared to account for substantial genetic variation (data not shown), whereas the subsequent PCs may reflect sampling noise and/or relatedness in samples [22]. The first PC (PC1) was used as a covariate in all analyses to adjust for population substructure.
Analysis of association with measures of glucose homeostasis in IRASFS
To approximate a normal distribution, trait values were transformed by square root (AIRg, DI) or natural logarithm plus a constant (SI). Measured genotype association analyses of exome chip variants with AIRg, SI, and DI were performed under an additive model using the variance components method implemented in Sequential Oligogenic Linkage Analysis Routines (SOLAR) [25] with adjustment for age, gender, body mass index (BMI), and PC1.
Genetic risk score construction
We further explored our interaction approach by constructing genetic risk scores (GRS), both weighted and unweighted, summarizing the effects of selected AIRg SNPs. The AIRg GRS was created using the AIRg-lowering effect alleles for AIRg SNPs (i.e. reflecting poorer insulin secretion) from IRASFS (Table 1). The unweighted risk scores were calculated by summation of the number of risk alleles for each individual across all selected SNPs. The weighted AIRg GRS was calculated as the sum of risk alleles at each locus multiplied by their AIRg effect size from IRASFS. Missing genotypes for a given SNP were imputed as the average number of risk alleles across all samples. The association of each GRS with both AIRg and DI, a combinatorial measure of first-phase insulin secretion and insulin sensitivity, were evaluated in IRASFS using the variance components method implemented in SOLAR [25] adjusted for age, gender, and ancestry proportions.
Table 1. Characteristics and single-SNP association results for AIRg SNPs in IRASFS.
AIRg SNP | Chr | Positiona | Gene | Effect Alleleb | Other Allele | EAFc | Beta | SEd | P |
---|---|---|---|---|---|---|---|---|---|
rs1582418 | 5 | 159882797 | PTTG1 | C | A | 0.66 | -3.12 | 0.86 | 4.00E-04 |
rs10792837 | 11 | 85905715 | EED | A | G | 0.34 | -2.92 | 0.84 | 4.20E-04 |
rs10830963 | 11 | 92708710 | MTNR1B | G | C | 0.10 | -5.80 | 1.30 | 1.20E-05 |
rs10403702 | 19 | 40148584 | LGALS16 | T | C | 0.80 | -3.54 | 0.95 | 2.07E-04 |
rs3827147 | 20 | 256727 | C20orf96 | T | A | 0.30 | -3.09 | 0.88 | 3.19E-04 |
aNCBI build 37.
bAIRg-lowering allele.
cEffect allele frequency.
dStandard error.
Analysis of interaction for T2D risk in the T2D studies
Logistic regression with T2D as the outcome was modeled including genetic main and interaction effects as well as AIRg SNP genotype or GRS, age, sex, and principal component covariates for all samples. Additional models included adjustment for BMI, and individuals with missing values were excluded (n = 110).
(1) |
Y is the log odds of T2D, S is the genotype for the AIRg SNP or the GRS, G is the genotype of the genomic variant, and C is the vector of all remaining covariates. The AIRg SNP genotype, both in single-variant analysis and GRS construction, were additively coded with values of 0, 1, and 2 corresponding to dosage of the AIRg-lowering allele. Genomic variant genotypes were additively coded with values of 0, 1, and 2 corresponding to an arbitrarily selected allele dosage of measured genotype or best-guess imputed genotype. Hypothesis testing included a Wald test statistic following a chi-squared distribution with 1 degree-of-freedom under the null H0:β3 = 0. In each study, PLINK was used to calculate the interaction effect β3. Interaction results with extreme values (absolute β or SE>10), primarily due to low cell counts, were excluded. A large number of SNPs with β and SE outliers were excluded in interaction analyses with the AIRg SNP rs1552224 (2,466,070 to 2,479,156 SNPs excluded) due to its low frequency in the T2D study samples (combined MAF = 0.03). For interaction analyses with all other SNPs and risk scores, the number of SNPs excluded as outliers ranged from 0 to 17,000. Interaction results were combined by fixed-effect inverse variance weighting for each candidate SNP or GRS in METAL (http://www.sph.umich.edu/csg/abecasis/metal/). Each meta-analysis contained results for 486,148 to 2,965,304 SNPs.
Results
Candidate insulin secretion SNP selection
The characteristics of IRASFS subjects are shown in S1 Table. Samples included 492 African Americans with mean age 41.2 years and mean BMI 29.1 kg/m2. Average African ancestry proportion was 0.75. FSIGT was performed for all subjects without T2D (n = 492) to assess measures including insulin secretion (AIRg), insulin sensitivity index (SI), and disposition index (DI).
Directly genotyped SNPs from the exome chip were tested for association with insulin secretion (AIRg) in IRASFS. Initially, five independent SNPs, which showed evidence of association with AIRg, were selected (effect sizes = -2.92 to -5.80). The MTNR1B SNP, rs10830963 is a well-documented locus associated with fasting glucose and T2D risk in Europeans. It was most powerfully associated with AIRg in IRASFS (P = 1.20x10-5, Table 1). An additional four variants with the strongest AIRg associations in IRASFS (P<5x10-4) and MAF≥0.2 (rs1582418, rs10792837, rs10403702, and rs3827147) were also selected (Table 1).
Interaction analysis
The selected SNPs were examined for genome-wide first order multiplicative interactions with 1) individual insulin secretion SNPs and 2) risk scores summarizing these insulin secretion SNPs. To maximize power, these analyses were performed in five studies (ARIC, CARDIA, JHS, MESA, and WFSM) including 2,725 T2D cases and 4,167 non-diabetic controls and results were meta-analyzed. Representative meta-analysis q-q plots are provided in S1 and S2 Figs. The magnitude of interaction effects cannot be interpreted in terms of disease risk because one must consider the context of marginal effects from each locus when an interaction exists. For this reason, interaction effects have been presented as beta values, not odds ratios, in Tables 2 and 3.
Table 2. Top meta-analyzed interactions with AIRg SNPs regressed on T2D risk in ARIC, CARDIA, JHS, MESA, and WFSM.
AIRg SNP (Nearest Gene) | Intxn SNPa (Nearest Gene) | Chr | Positionb | MAFc | βintxnd | Pintxnd | Phete | βintxn_adj_bmif | Pintxn_adj_bmif |
---|---|---|---|---|---|---|---|---|---|
rs10403702 (LGALS16) | rs12026223 (DPYD) | 1 | 97866857 | 0.16 | 0.53 | 1.11E-06 | 0.85 | 0.53 | 1.98E-04 |
rs10403702 (LGALS16) | rs10746381 (LYPLAL1) | 1 | 219106550 | 0.46 | -0.33 | 4.43E-06 | 0.38 | -0.32 | 1.75E-03 |
rs10403702 (LGALS16) | rs17044602 (DPP10) | 2 | 116273665 | 0.10 | 0.54 | 3.73E-06 | 0.51 | 0.54 | 2.43E-04 |
rs10403702 (LGALS16) | rs3822387 (ARHGAP26) | 5 | 142488305 | 0.45 | 0.35 | 5.05E-07 | 0.97 | 0.33 | 2.16E-04 |
rs10403702 (LGALS16) | rs4975846 (MRPL36) | 5 | 1794921 | 0.32 | -0.35 | 1.63E-06 | 0.21 | -0.36 | 4.94E-05 |
rs10403702 (LGALS16) | rs2201886 (NOX3) | 6 | 155999177 | 0.16 | -0.47 | 1.40E-06 | 0.95 | -0.49 | 1.66E-04 |
rs10403702 (LGALS16) | rs1655028 (SNTB1) | 8 | 122048009 | 0.44 | 0.34 | 2.21E-06 | 0.38 | 0.36 | 7.34E-06 |
rs10403702 (LGALS16) | rs244783 (WFDC1) | 16 | 84360055 | 0.47 | -0.32 | 3.07E-06 | 0.96 | -0.33 | 1.11E-04 |
rs10792837 (EED) | rs7796525 (CHN2) | 7 | 29410867 | 0.10 | -0.71 | 1.53E-06 | 0.27 | -0.70 | 5.39E-04 |
rs10792837 (EED) | rs1342119 (Intergenic) | 9 | 104799619 | 0.26 | 0.35 | 2.91E-06 | 0.49 | 0.35 | 2.12E-03 |
rs10792837 (EED) | rs4556497 (Intergenic) | 11 | 25663415 | 0.48 | -0.56 | 4.19E-07 | 0.80 | -0.58 | 9.73E-06 |
rs10792837 (EED) | rs1408201 (GJA3) | 13 | 20741094 | 0.34 | 0.46 | 4.14E-06 | 0.83 | 0.47 | 1.53E-04 |
rs10792837 (EED) | rs12978873 (ZNF761) | 19 | 53928911 | 0.24 | -0.40 | 3.36E-06 | 0.46 | -0.39 | 3.84E-04 |
rs10830963 (MTNR1B) | rs4289500 (EXOC1) | 4 | 56638538 | 0.44 | -0.56 | 4.60E-06 | 0.58 | -0.38 | 5.19E-02 |
rs10830963 (MTNR1B) | rs2640666 (MTRR) | 5 | 7926642 | 0.38 | 0.58 | 3.20E-06 | 0.66 | 0.42 | 2.42E-02 |
rs10830963 (MTNR1B) | rs7277627 (LCA5L) | 21 | 40813558 | 0.34 | -0.63 | 1.65E-06 | 0.98 | -0.44 | 2.25E-02 |
rs1582418 (PTTG1) | rs2781575 (ARHGAP29) | 1 | 94619077 | 0.07 | 0.63 | 4.02E-06 | 0.83 | 0.60 | 5.67E-03 |
rs1582418 (PTTG1) | rs7587317 (CYS1) | 2 | 10234179 | 0.29 | -0.31 | 4.77E-06 | 0.28 | -0.28 | 2.88E-02 |
rs1582418 (PTTG1) | rs16924460 (KIAA1217) | 10 | 24561095 | 0.08 | -0.60 | 1.70E-07 | 0.55 | -0.65 | 6.84E-04 |
rs1582418 (PTTG1) | rs6575130 (CALM1) | 14 | 90849847 | 0.45 | 0.31 | 1.88E-06 | 0.23 | 0.31 | 8.63E-03 |
rs1582418 (PTTG1) | rs7150527 (TCL1B) | 14 | 96125876 | 0.20 | -0.39 | 3.71E-06 | 0.33 | -0.39 | 6.71E-03 |
rs1582418 (PTTG1) | rs12597244 (BC108660) | 16 | 5776269 | 0.37 | -0.35 | 8.37E-07 | 0.65 | -0.36 | 2.47E-03 |
rs3827147 (C20orf96) | rs10975898 (KDM4C) | 9 | 6919984 | 0.22 | -0.37 | 3.72E-06 | 0.70 | -0.35 | 6.36E-04 |
rs3827147 (C20orf96) | rs10483995 (KCNK10) | 14 | 88716607 | 0.07 | 0.81 | 8.75E-07 | 0.83 | 0.81 | 8.47E-05 |
aSNP interacting with selected AIRg SNP with nearest gene within 500 kb in parentheses.
bNCBI build 37.
cMinor allele frequency.
dMeta-analyzed effect size and p-value from interaction models adjusted for age, gender, and PC1.
eHeterogeneity p-values across studies from interaction models adjusted for age, gender, and PC1.
fMeta-analyzed effect size and p-value from interaction models adjusted for age, gender, PC1, and BMI.
Table 3. Top meta-analyzed interactions with weighted AIRg GRS regressed on T2D risk in ARIC, CARDIA, JHS, MESA, and WFSM.
Intxn SNPa (Nearest Gene) | Chr | Positionb | MAFc | βintxnd | Pintxnd | Phete | βintxn_adj_bmif | Pintxn_adj_bmif |
---|---|---|---|---|---|---|---|---|
rs4975846 (MRPL36) | 5 | 1794921 | 0.32 | 0.05 | 2.03E-07 | 0.86 | 0.05 | 1.28E-06 |
rs17074194 (UTRN) | 6 | 145107349 | 0.09 | -0.08 | 1.87E-06 | 0.71 | -0.07 | 4.02E-05 |
rs10274367 (C7orf50) | 7 | 1117436 | 0.40 | -0.05 | 1.41E-06 | 0.47 | -0.04 | 9.63E-06 |
rs4921630 (Intergenic) | 8 | 16445870 | 0.06 | -0.11 | 3.98E-06 | 0.61 | -0.12 | 4.90E-06 |
rs11625046 (SRP54) | 14 | 35497011 | 0.10 | -0.07 | 4.86E-06 | 0.68 | -0.07 | 4.32E-05 |
rs799466 (SRP54) | 14 | 35503326 | 0.10 | -0.07 | 4.53E-06 | 0.53 | -0.07 | 3.80E-05 |
rs11627203 (SAMD4A) | 14 | 55221543 | 0.16 | -0.07 | 6.57E-07 | 0.07 | -0.06 | 1.81E-05 |
aSNP interacting with the weighted AIRg GRS with nearest gene within 500 kb in parentheses.
bNCBI build 37.
cMinor allele frequency.
dMeta-analyzed effect size and p-value from interaction models adjusted for age, gender, and PC1.
eHeterogeneity p-values across studies from interaction models adjusted for age, gender, and PC1.
fMeta-analyzed effect size and p-value from interaction models adjusted for age, gender, PC1, and BMI.
The characteristics of T2D case (n = 2,725) and control subjects (n = 4,167) for each study cohort are shown in S2 Table. Mean age at examination ranged from 38.2 (CARDIA) to 67.6 (MESA) years. Mean age at diagnosis for T2D cases ranged from 35.0 (CARDIA) to 54.6 (MESA) years. In all cohorts except WFSM, BMI was >3 kg/m2 higher in cases compared to controls.
AIRg SNP interactions
Five AIRg SNPs were tested for genome-wide interactions for T2D risk in the ARIC, CARDIA, JHS, MESA, and WFSM cohorts. Individual AIRg SNP results were meta-analyzed across cohorts. No interactions reached conventional GWAS thresholds for significance (Pinteraction<5×10−8). However, a total of 24 SNP-pairs were observed with suggestive interaction signals (Pinteraction<5×10−6; Table 2). The most significant AIRg SNP interaction observed was between rs1582418 at the PTTG1 locus (AIRg SNP) and rs16924460 (interacting SNP; P = 1.70x10-7). This interacting SNP is an intronic variant in the gene KIAA1217, which encodes the human homolog of murine Skt (Sickle tail). BMI adjustment of the top AIRg SNP interactions resulted in attenuation of the interaction p-value by three to four orders of magnitude for most SNPs (Table 2). Other notable interacting SNPs included rs10746381 (LYPLAL1), rs7796525 (CHN2), rs4289500 (EXOC1), rs10483995 (KCNK10), rs6575130 (CALM1), rs4975846 (MRPL36), rs2640666 (MTRR), rs10975898 (KDM4C), and rs1655028 (SNTB1).
GRS validation and interaction analysis
Each GRS was tested for association with AIRg and DI under an additive model using the variance components method with adjustment for age, gender, and PC1 in IRASFS (S3 Table). The weighted AIRg GRS was associated with AIRg in IRASFS with or without BMI adjustment (P = 4.96x10-2 and P = 3.34x10-2, respectively). Since the weighted risk score was associated with measures of glucose homeostasis, analysis of this risk score was emphasized in the tests for genome-wide interaction in the ARIC, CARDIA, JHS, MESA, and WFSM cohorts.
Meta-analyzed estimates of genome-wide interactions with the weighted AIRg GRS are presented in Table 3. No interactions met conventional GWAS thresholds for significance. However, seven interactions with the weighted AIRg GRS reached a suggestive level of significance (Pinteraction <5x10-6; Table 3). In the AIRg GRS interaction analysis the most significant association was observed with interacting SNP rs4975846 (Table 3; Pinteraction = 2.03x10-7). This is an intergenic SNP upstream of the gene MRPL36, which encodes mitochondrial ribosomal protein L36. This same interacting SNP was implicated in single variant analyses with AIRg SNP rs10403702 (LGALS16). Other notable interacting SNPs included rs11627203 (SAMD4A, Pinteraction = 6.57x10-7) and rs17074194 (UTRN, Pinteraction = 1.87x-10−6). Top interactions with the AIRg GRS remained robust after BMI adjustment.
For comparison purposes, meta-analysis of single-SNP association performed under an additive logistic regression model with adjustment for age, sex, and principal component covariates was conducted for all interaction SNPs presented in Tables 2 and 3 (S4 Table). None of these SNPs had an association P<0.05.
Discussion
Meta-analyses of five African American T2D studies did not reveal statistically significant first-order interactions with insulin secretion SNPs or composite risk scores. However, the observed interactions (Pinteraction<5×10−6) suggest that a candidate insulin secretion SNP/GRS interaction approach is a valid method for identifying insulin sensitivity and T2D risk loci. For example, analyses with the AIRg SNP rs10403702 (LGALS16) revealed an interaction with rs10746381, an intergenic SNP upstream of the LYPLAL1 gene encoding lysophospholipase-like 1. Variants at this locus have previously been associated with T2D, fasting insulin, HDL-cholesterol, triglycerides, and measures of body fat distribution, a signature common to insulin sensitivity loci [26–29]. The AIRg SNP rs10792837 (EED) showed interaction with rs7796525, an intronic SNP in CHN2. Interaction of CHN2 with the insulin receptor gene INSR results in severe insulin resistance [30]. Additionally the AIRg SNP rs10830963 (MTNR1B) displayed interaction with rs4289500, an intergenic SNP upstream of EXOC1, which encodes a component of the exocyst complex, required for translocation of glucose transporter type 4 (GLUT4) vesicles to the plasma membrane in insulin-stimulated glucose uptake [31].
Several genes related to pancreatic beta-cell function were also identified; suggesting interactions are not limited to insulin resistance as in our initial hypothesis. Analyses with AIRg SNP rs3827147 (C20orf96) revealed interactions with rs10483995, an intronic SNP in KCNK10. A study of pancreatic beta-cell line MIN6 cells has shown that Kcnk10, a member of the two-pore domain potassium channels, may regulate depolarization-induced secretion of insulin in these cells [32]. The AIRg SNP rs1582418 (PTTG1) interaction analyses identified an interaction with an intergenic SNP upstream of CALM1 (rs6575130). Inhibition of the product of this gene, calmodulin, with phenothiazines inhibits glucose-stimulated insulin secretion in isolated rat islet cells [33–37]. Several other variants detected in our analyses show interactions with similar biological relationships to insulin secretion and T2D.
Interestingly, we observed interactions discrete for individual loci. For example, analyses with rs10830963 (MTNR1B) revealed an interaction with rs2640666, an intronic variant downstream of MTRR. MTRR encodes 5-methyltetrahydrofolate-homocysteine methyltransferase reductase, implicated in metabolic syndrome [38] and epigenetic instability [39]. MTRR is highly expressed in the pineal gland in a circadian pattern [40], an intriguing result considering that the ligand for melatonin receptor 1B (encoded by MTNR1B), melatonin, is secreted from the pineal gland in a circadian pattern. Also revealed with rs10830963 was an association with rs4289500, which is ~200 kb upstream of CLOCK, a classic circadian clock gene. Further, analyses with other insulin secretion SNPs revealed genes involved in epigenetic modification. Analyses with rs3827147 (C20orf96) revealed an interaction with rs10975898, an intronic SNP in KDM4C which encodes a lysine-specific demethylase. We also identified 2 dystrophin-related genes. Analyses with rs10403702 (LGALS16) revealed an interaction with rs1655028, an intronic SNP upstream of SNTB1. This gene encodes beta 1 syntrophin, a peripheral membrane protein that associates with dystrophin and dystrophin-related proteins. Evaluations of the AIRg GRS revealed an interaction with rs17074194, an intronic SNP in UTRN. This gene encodes utrophin, a cytoskeletal protein which has homology with dystrophin and is found at the neuromuscular synapse and myotendinous junctions in muscle cells. These observations may reflect different, input-dependent physiological characteristics of interaction results, and may lead to mechanistic insights about the underlying causes of T2D and defects in glucose homeostasis in expanded analyses.
Although results varied widely between interaction analyses, interaction with one locus, MRPL36, was replicated in multiple analyses. Functional characteristics of MRPL36 related to T2D and glucose homeostasis pathophysiology are not evident in the current literature.
Previous GWAS have largely ignored epistatic contributions to T2D risk due to the heavy multiple testing burden and computational challenges of exhaustive analytical approaches, and when they have considered this contribution, results have not been striking. For example, a recent genome-wide scan for two-locus interactions in the Wellcome Trust Case Control Consortium T2D GWAS data did not reveal any significant epistatic signals at a Bonferroni-corrected p-value threshold of 2.14x10-11 after adjusting for the main effects of the most strongly associated T2D locus, TCF7L2 [41]. Further, Herold et al. estimated that analysis of all pairwise interactions among 550,000 SNPs in 1,200 samples on a 3 GHz computer would require a running time of 120 days [42]. The interaction analysis presented here overcomes the issue of a heavy multiple testing burden by using a candidate SNP approach. A recent study by Becker et al. demonstrated that a multiple test correction of 0.4m, where m is the number of SNP pairs tested, is sufficiently conservative for large-scale allelic interaction tests [43]. Further, Babron et al. show that a correction for the effective number of SNP pairs is equally sufficient [44]. Li et al. previously demonstrated that the effective number of SNPs for an imputed dataset is ~106. These findings suggest that a significance threshold of 1x10-8 is appropriate for this study.
We did not detect interactions even at the conventional GWAS threshold of 5x10-8 in the current study. In part, this likely reflects the challenge of inherently reduced power of interaction models due to the low frequency of compound genotypes [45]. Computational resources required for this study were equivalent to the requirements for running 12 GWAS (5 candidate insulin secretion SNPs plus a GRS, with and without BMI adjustment). This is a significant reduction compared to exhaustive approaches examining genome-wide interactions with all available SNP pairs. Although the results of a meta-analysis of five cohorts are presented here, this study lacks replication of the observed interaction associations in additional studies. Future work will require examination of associated interactions in African American and other populations.
In summary, our findings demonstrate that genome-wide interaction studies with selected insulin secretion variants is a powerful approach for the detection of T2D risk, insulin secretion, and insulin sensitivity loci. The use of a high-quality measure of first-phase insulin secretion, AIRg, to identify candidate interaction SNPs yielded compelling associations. These results justify an expansion of the current study and further investigation of putative insulin sensitivity loci, namely LYPLAL1, CHN2, and EXOC1.
Supporting Information
Acknowledgments
The authors would like to acknowledge the contributions of the involved research institutions, study investigators, field staff, and study participants of ARIC, CARDIA, JHS, MESA, and WFSM.
Data Availability
Genotype and phenotype data for ARIC, CARDIA, JHS, MESA, and WFSM have been deposited into dbGAP under accession numbers phs000557.v2.p1, phs000613.v1.p2, phs000499.v2.p1, phs000283.v7.p3, and phs000140.v1.p1, respectively.
Funding Statement
Genotyping services for the WFSM study were provided by CIDR. CIDR is fully funded through a federal contract from the National Institutes of Health (NIH) to The Johns Hopkins University (contract HHSC268200782096C). The work at Wake Forest was supported by NIH grants K99-DK-081350 (N.D.P.), R01-DK-066358 (D.W.B.), R01-DK-053591 (D.W.B.), R01-HL-56266 (B.I.F.), and R01-DK-070941 (B.I.F.), and in part by the General Clinical Research Center of the WFSM grant M01-RR-07122. This work was also supported by the NHLBI. The following four parent studies have contributed parent study data, ancillary study data, and DNA samples through the Massachusetts Institute of Technology-Broad Institute (N01-HC-65226) to create this genotype/phenotype database for wide dissemination to the biomedical research community: ARIC, CARDIA, JHS, and MESA. The Atherosclerosis Risk in Communities (ARIC) Study is carried out as a collaborative study supported by National Heart, Lung, and Blood Institute contracts (HHSN268201100005C, HHSN268201100006C, HHSN268201100007C, HHSN268201100008C, HHSN268201100009C, HHSN268201100010C, HHSN268201100011C, and HHSN268201100012C), R01HL087641, R01HL59367 and R01HL086694; National Human Genome Research Institute contract U01HG004402; and National Institutes of Health contract HHSN268200625226C. The authors thank the staff and participants of the ARIC study for their important contributions. Infrastructure was partly supported by grant number UL1RR025005, a component of the National Institutes of Health and NIH Roadmap for Medical Research. The authors thank the staff and participants of the ARIC study for their important contributions. The Coronary Artery Risk Development in Young Adults (CARDIA) Study is conducted and supported by the National Heart, Lung, and Blood Institute in collaboration with the University of Alabama at Birmingham (HHSN268201300025C & HHSN268201300026C), Northwestern University (HHSN268201300027C), University of Minnesota (HHSN268201300028C), Kaiser Foundation Research Institute (HHSN268201300029C), and Johns Hopkins University School of Medicine (HHSN268200900041C). CARDIA is also partially supported by the Intramural Research Program of the National Institute on Aging. Genotyping was funded as part of the NHLBI Candidate-gene Association Resource (N01-HC-65226) and the NHGRI Gene Environment Association Studies (GENEVA) (U01-HG004729, U01-HG04424, and U01-HG004446). This manuscript has been reviewed and approved by CARDIA for scientific content. The Jackson Heart Study (JHS) is supported by contracts HHSN268201300046C, HHSN268201300047C, HHSN268201300048C, HHSN268201300049C, HHSN268201300050C from the National Heart, Lung, and Blood Institute and the National Institute on Minority Health and Health Disparities. The authors thank the participants and data collection staff of the Jackson Heart Study. Multi-Ethnic Study of Atherosclerosis (MESA), and the MESA SHARe project are conducted and supported by the National Heart, Lung, and Blood Institute (NHLBI) in collaboration with MESA investigators. Support for MESA is provided by contracts N01-HC-95159, N01-HC-95160, N01-HC-95161, N01-HC-95162, N01-HC-95163, N01-HC-95164, N01-HC-95165, N01-HC-95166, N01-HC-95167, N01-HC-95168, N01-HC-95169, UL1-TR-001079, UL1-TR-000040, and DK063491. The MESA CARe data used for the analyses described in this manuscript were obtained through Genetics (CMP00068). Funding for CARe genotyping was provided by NHLBI contract N01-HC-65226. The views expressed in this manuscript are those of the authors and do not necessarily represent the views of the National Heart, Lung, and Blood Institute; the National Institutes of Health; or the U.S. Department of Health and Human Services.
References
- 1.Prasad RB, Groop L. Genetics of type 2 diabetes-pitfalls and possibilities. Genes. 2015;6: 87–123. 10.3390/genes6010087 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Morris AP, Voight BF, Teslovich TM, Ferreira T, Segrè AV, Steinthorsdottir V, et al. Large-scale association analysis provides insights into the genetic architecture and pathophysiology of type 2 diabetes. Nat Genet. 2012;44: 981–990. 10.1038/ng.2383 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Cordell HJ. Detecting gene-gene interactions that underlie human diseases. Nat Rev Genet. 2009;10: 392–404. 10.1038/nrg2579 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Moore JH. The ubiquitous nature of epistasis in determining susceptibility to common human diseases. Hum Hered. 2003;56: 73–82. [DOI] [PubMed] [Google Scholar]
- 5.Henkin L, Bergman RN, Bowden DW, Ellsworth DL, Haffner SM, Langefeld CD, et al. Genetic epidemiology of insulin resistance and visceral adiposity. The IRAS Family Study design and methods. Ann Epidemiol. 2003;13: 211–217. [DOI] [PubMed] [Google Scholar]
- 6.Pacini G, Bergman RN. MINMOD: a computer program to calculate insulin sensitivity and pancreatic responsivity from the frequently sampled intravenous glucose tolerance test. Comput Methods Programs Biomed. 1986;23: 113–122. [DOI] [PubMed] [Google Scholar]
- 7.The ARIC Investigators. The Atherosclerosis Risk in Communities (ARIC) Study: design and objectives. The ARIC investigators. Am J Epidemiol. 1989;129: 687–702. [PubMed] [Google Scholar]
- 8.Friedman GD, Cutter GR, Donahue RP, Hughes GH, Hulley SB, Jacobs DR, et al. CARDIA: study design, recruitment, and some characteristics of the examined subjects. J Clin Epidemiol. 1988;41: 1105–1116. [DOI] [PubMed] [Google Scholar]
- 9.Taylor HA, Wilson JG, Jones DW, Sarpong DF, Srinivasan A, Garrison RJ, et al. Toward resolution of cardiovascular health disparities in African Americans: design and methods of the Jackson Heart Study. Ethn Dis. 2005;15: S6-4–17. [PubMed] [Google Scholar]
- 10.Bild DE, Bluemke DA, Burke GL, Detrano R, Diez Roux AV, Folsom AR, et al. Multi-ethnic study of atherosclerosis: objectives and design. Am J Epidemiol. 2002;156: 871–881. [DOI] [PubMed] [Google Scholar]
- 11.McDonough CW, Palmer ND, Hicks PJ, Roh BH, An SS, Cooke JN, et al. A genome-wide association study for diabetic nephropathy genes in African Americans. Kidney Int. 2011;79: 563–572. 10.1038/ki.2010.467 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Palmer ND, McDonough CW, Hicks PJ, Roh BH, Wing MR, An SS, et al. A genome-wide association search for type 2 diabetes genes in African Americans. PloS One. 2012;7: e29202 10.1371/journal.pone.0029202 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Hellwege JN, Palmer ND, Raffield LM, Ng MCY, Hawkins GA, Long J, et al. Genome-wide family-based linkage analysis of exome chip variants and cardiometabolic risk. Genet Epidemiol. 2014;38: 345–352. 10.1002/gepi.21801 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.O’Connell JR, Weeks DE. PedCheck: a program for identification of genotype incompatibilities in linkage analysis. Am J Hum Genet. 1998;63: 259–266. 10.1086/301904 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81: 559–575. 10.1086/519795 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Ng MCY, Saxena R, Li J, Palmer ND, Dimitrov L, Xu J, et al. Transferability and fine mapping of type 2 diabetes loci in African Americans: the Candidate Gene Association Resource Plus Study. Diabetes. 2013;62: 965–976. 10.2337/db12-0266 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Lettre G, Palmer CD, Young T, Ejebe KG, Allayee H, Benjamin EJ, et al. Genome-wide association study of coronary heart disease and its risk factors in 8,090 African Americans: the NHLBI CARe Project. PLoS Genet. 2011;7: e1001300 10.1371/journal.pgen.1001300 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Ng MCY, Saxena R, Li J, Palmer ND, Dimitrov L, Xu J, et al. Transferability and fine mapping of type 2 diabetes loci in African Americans: the Candidate Gene Association Resource Plus Study. Diabetes. 2013;62: 965–976. 10.2337/db12-0266 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Hester JM, Wing MR, Li J, Palmer ND, Xu J, Hicks PJ, et al. Implication of European-derived adiposity loci in African Americans. Int J Obes 2005. 2012;36: 465–473. 10.1038/ijo.2011.131 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Palmer ND, Mychaleckyj JC, Langefeld CD, Ziegler JT, Williams AH, Bryer-Ash M, et al. Evaluation of DLG2 as a positional candidate for disposition index in African-Americans from the IRAS Family Study. Diabetes Res Clin Pract. 2010;87: 69–76. 10.1016/j.diabres.2009.10.015 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Patterson N, Price AL, Reich D. Population structure and eigenanalysis. PLoS Genet. 2006;2: e190 10.1371/journal.pgen.0020190 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Patterson N, Hattangadi N, Lane B, Lohmueller KE, Hafler DA, Oksenberg JR, et al. Methods for high-density admixture mapping of disease genes. Am J Hum Genet. 2004;74: 979–1000. 10.1086/420871 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Pritchard JK, Stephens M, Donnelly P. Inference of population structure using multilocus genotype data. Genetics. 2000;155: 945–959. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Keene KL, Mychaleckyj JC, Leak TS, Smith SG, Perlegas PS, Divers J, et al. Exploration of the utility of ancestry informative markers for genetic association studies of African Americans with type 2 diabetes and end stage renal disease. Hum Genet. 2008;124: 147–154. 10.1007/s00439-008-0532-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Almasy L, Blangero J. Multipoint quantitative-trait linkage analysis in general pedigrees. Am J Hum Genet. 1998;62: 1198–1211. 10.1086/301844 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Berndt SI, Gustafsson S, Mägi R, Ganna A, Wheeler E, Feitosa MF, et al. Genome-wide meta-analysis identifies 11 new loci for anthropometric traits and provides insights into genetic architecture. Nat Genet. 2013;45: 501–512. 10.1038/ng.2606 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.DIAbetes Genetics Replication And Meta-analysis (DIAGRAM) Consortium, Asian Genetic Epidemiology Network Type 2 Diabetes (AGEN-T2D) Consortium, South Asian Type 2 Diabetes (SAT2D) Consortium, Mexican American Type 2 Diabetes (MAT2D) Consortium, Type 2 Diabetes Genetic Exploration by Nex-generation sequencing in muylti-Ethnic Samples (T2D-GENES) Consortium, Mahajan A, et al. Genome-wide trans-ancestry meta-analysis provides insight into the genetic architecture of type 2 diabetes susceptibility. Nat Genet. 2014;46: 234–244. 10.1038/ng.2897 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Scott RA, Fall T, Pasko D, Barker A, Sharp SJ, Arriola L, et al. Common genetic variants highlight the role of insulin resistance and body fat distribution in type 2 diabetes, independent of obesity. Diabetes. 2014;63: 4378–4387. 10.2337/db14-0319 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Scott RA, Lagou V, Welch RP, Wheeler E, Montasser ME, Luan J ‘an, et al. Large-scale association analyses identify new loci influencing glycemic traits and provide insight into the underlying biological pathways. Nat Genet. 2012;44: 991–1005. 10.1038/ng.2385 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Suliman SGI, Stanik J, McCulloch LJ, Wilson N, Edghill EL, Misovicova N, et al. Severe insulin resistance and intrauterine growth deficiency associated with haploinsufficiency for INSR and CHN2: new insights into synergistic pathways involved in growth and metabolism. Diabetes. 2009;58: 2954–2961. 10.2337/db09-0787 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Inoue M, Chang L, Hwang J, Chiang S-H, Saltiel AR. The exocyst complex is required for targeting of Glut4 to the plasma membrane by insulin. Nature. 2003;422: 629–633. 10.1038/nature01533 [DOI] [PubMed] [Google Scholar]
- 32.Kang D, Choe C, Kim D. Functional expression of TREK-2 in insulin-secreting MIN6 cells. Biochem Biophys Res Commun. 2004;323: 323–331. 10.1016/j.bbrc.2004.08.089 [DOI] [PubMed] [Google Scholar]
- 33.Schubart UK, Erlichman J, Fleischer N. The role of calmodulin in the regulation of protein phosphorylation and insulin release in hamster insulinoma cells. J Biol Chem. 1980;255: 4120–4124. [PubMed] [Google Scholar]
- 34.Sugden MC, Christie MR, Ashcroft SJ. Presence and possible role of calcium-dependent regulator (calmodulin) in rat islets of Langerhans. FEBS Lett. 1979;105: 95–100. [DOI] [PubMed] [Google Scholar]
- 35.Valverde I, Sener A, Lebrun P, Herchuelz A, Malaisse WJ. The stimulus-secretion coupling of glucose-induced insulin release. XLVII. The possible role of calmodulin. Endocrinology. 1981;108: 1305–1312. 10.1210/endo-108-4-1305 [DOI] [PubMed] [Google Scholar]
- 36.Henquin JC. Effects of trifluoperazine and pimozide on stimulus-secretion coupling in pancreatic B-cells. Suggestion for a role of calmodulin? Biochem J. 1981;196: 771–780. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Krausz Y, Wollheim CB, Siegel E, Sharp GW. Possible role for calmodulin in insulin release. Studies with trifluoperazine in rat pancreatic islets. J Clin Invest. 1980;66: 603–607. 10.1172/JCI109893 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Yang B, Fan S, Zhi X, Wang D, Li Y, Wang Y, et al. Associations of MTHFR C677T and MTRR A66G gene polymorphisms with metabolic syndrome: a case-control study in Northern China. Int J Mol Sci. 2014;15: 21687–21702. 10.3390/ijms151221687 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Padmanabhan N, Jia D, Geary-Joo C, Wu X, Ferguson-Smith AC, Fung E, et al. Mutation in Folate Metabolism Causes Epigenetic Instability and Transgenerational Effects On Development. Cell. 2013;155 10.1016/j.cell.2013.09.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.MTRR (5-methyltetrahydrofolate-homocysteine methyltransferase reductase) | Gene Report | BioGPS [Internet]. [cited 25 Aug 2015]. Available: http://ds.biogps.org/?dataset=GSE1133&gene=4552
- 41.Bell JT, Timpson NJ, Rayner NW, Zeggini E, Frayling TM, Hattersley AT, et al. Genome-wide association scan allowing for epistasis in type 2 diabetes. Ann Hum Genet. 2011;75: 10–19. 10.1111/j.1469-1809.2010.00629.x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Herold C, Steffens M, Brockschmidt FF, Baur MP, Becker T. INTERSNP: genome-wide interaction analysis guided by a priori information. Bioinforma Oxf Engl. 2009;25: 3275–3281. 10.1093/bioinformatics/btp596 [DOI] [PubMed] [Google Scholar]
- 43.Becker T, Herold C, Meesters C, Mattheisen M, Baur MP. Significance levels in genome-wide interaction analysis (GWIA). Ann Hum Genet. 2011;75: 29–35. 10.1111/j.1469-1809.2010.00610.x [DOI] [PubMed] [Google Scholar]
- 44.Babron M-C, Etcheto A, Dizier M-H. A New Correction for Multiple Testing in Gene-Gene Interaction Studies. Ann Hum Genet. 2015; 10.1111/ahg.12113 [DOI] [PubMed] [Google Scholar]
- 45.Lucas G, Lluís-Ganella C, Subirana I, Musameh MD, Gonzalez JR, Nelson CP, et al. Hypothesis-Based Analysis of Gene-Gene Interactions and Risk of Myocardial Infarction. PLoS ONE. 2012;7 10.1371/journal.pone.0041730 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Genotype and phenotype data for ARIC, CARDIA, JHS, MESA, and WFSM have been deposited into dbGAP under accession numbers phs000557.v2.p1, phs000613.v1.p2, phs000499.v2.p1, phs000283.v7.p3, and phs000140.v1.p1, respectively.