Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2014 Sep 8;111(38):13924–13929. doi: 10.1073/pnas.1402665111

Global genomic and transcriptomic analysis of human pancreatic islets reveals novel genes influencing glucose metabolism

João Fadista a,1, Petter Vikman a, Emilia Ottosson Laakso a, Inês Guerra Mollet a, Jonathan Lou Esguerra a, Jalal Taneera a, Petter Storm a, Peter Osmark a, Claes Ladenvall a, Rashmi B Prasad a, Karin B Hansson a, Francesca Finotello b, Kristina Uvebrant a, Jones K Ofori a, Barbara Di Camillo b, Ulrika Krus a, Corrado M Cilio a, Ola Hansson a, Lena Eliasson a, Anders H Rosengren a, Erik Renström a, Claes B Wollheim a,c, Leif Groop a,1
PMCID: PMC4183326  PMID: 25201977

Significance

We provide a comprehensive catalog of novel genetic variants influencing gene expression and metabolic phenotypes in human pancreatic islets. The data also show that the path from genetic variation (SNP) to gene expression is more complex than hitherto often assumed, and that we need to consider that genetic variation can also influence function of a gene by influencing exon usage or splice isoforms (sQTL), allelic imbalance, RNA editing, and expression of noncoding RNAs, which in turn can influence expression of target genes.

Abstract

Genetic variation can modulate gene expression, and thereby phenotypic variation and susceptibility to complex diseases such as type 2 diabetes (T2D). Here we harnessed the potential of DNA and RNA sequencing in human pancreatic islets from 89 deceased donors to identify genes of potential importance in the pathogenesis of T2D. We present a catalog of genetic variants regulating gene expression (eQTL) and exon use (sQTL), including many long noncoding RNAs, which are enriched in known T2D-associated loci. Of 35 eQTL genes, whose expression differed between normoglycemic and hyperglycemic individuals, siRNA of tetraspanin 33 (TSPAN33), 5′-nucleotidase, ecto (NT5E), transmembrane emp24 protein transport domain containing 6 (TMED6), and p21 protein activated kinase 7 (PAK7) in INS1 cells resulted in reduced glucose-stimulated insulin secretion. In addition, we provide a genome-wide catalog of allelic expression imbalance, which is also enriched in known T2D-associated loci. Notably, allelic imbalance in paternally expressed gene 3 (PEG3) was associated with its promoter methylation and T2D status. Finally, RNA editing events were less common in islets than previously suggested in other tissues. Taken together, this study provides new insights into the complexity of gene regulation in human pancreatic islets and better understanding of how genetic variation can influence glucose metabolism.


Type 2 diabetes (T2D) is an increasing global health problem (1). Although genome-wide association studies (GWAS) have yielded more than 70 loci associated with T2D or related traits (2, 3), they have not provided the expected breakthrough in our understanding of the pathogenesis of the disease. They have nonetheless pointed at a central role of the pancreatic islets and β-cell dysfunction in the development of the disease (4, 5). It therefore seems pertinent to focus on human pancreatic islets to obtain insights into the molecular mechanisms causing the disease (6, 7). Given that most SNPs associated with T2D lie in noncoding regions, the majority of causal variants are likely to regulate gene expression rather than protein function per se. Therefore, combination of DNA and RNA sequencing in the same individuals may help to disentangle the role these SNPs play in the pathogenesis of the disease (8). Although the human pancreatic islet transcriptome has been previously described (6, 918), using microarrays or RNA sequencing of a limited number of nondiabetic individuals, this has not allowed a more global analysis of the complexity of the islet transcriptome in T2D. Here we combined genotypic imputation, expression microarrays, and exome and RNA sequencing (Exome-Seq and RNA-Seq) in a large number of human pancreatic islets from deceased donors with and without T2D. This study identified a number of novel genes, including long intergenic noncoding RNAs (lincRNAs), whose expression and/or splicing influences insulin secretion and is associated with glycemia. In addition, we provide a catalog of RNA editing and allele-specific expression events in human pancreatic islets (SI Appendix, Fig. S1).

Results

Genes Showing Differential Expression Between Islets from Normoglycemic and Hyperglycemic Donors.

To obtain a profile of gene expression variation in human islets, we sequenced the polyadenylated fraction of RNA from 89 individuals with different degrees of glucose tolerance, using 101 base pairs paired-end on an Illumina HiSeq sequencer (Dataset S1 and SI Appendix, Fig. S1). Each individual transcriptome yielded, on average, 38.2 ± 4.4 (mean ± SD) million paired-end reads mapped to the human genome, with ∼88% mapping to known exons from the RefSeq Gene database. Because any expression cutoff is arbitrary, we considered a gene to be expressed if it was observed in at least 5% of the samples. Applying this definition, we detected 91% of RefSeq genes. However, most of these genes are expressed at low levels, supporting the view of pervasive transcription and leakage in the human transcriptome (SI Appendix, Fig. S2) (19). Moreover, in support of previous results (20), we observed a good correlation between gene expression based on RNA-seq and microarrays in the 89 samples (r = 0.83; P < 0.001) (SI Appendix, Fig. S3). To evaluate how well the RefSeq genes in our RNA-seq dataset are covered, we sequenced one sample at high depth of coverage (∼150 million paired-end reads). As seen in SI Appendix, Fig. S4, our average sample coverage of 38.2 million paired-end reads is deep enough to detect the majority of known genes, transcripts, exons, and junctions. As expected, glucagon, insulin, and other known pancreatic genes showed the highest expression (Fig. 1A and Dataset S1). To identify genes whose expression is influenced by glycemia (cause or consequence), we related gene expression to HbA1c, a measure of long-term glycemia, and compared expression in islets from donors with normal glucose tolerance (HbA1c < 6%), impaired glucose tolerance (IGT; 6% ≤ HbA1c < 6.5%), and T2D (HbA1c ≥ 6.5%). By using a linear model adjusting for age and sex, we detected 1619 genes associated with HbA1c levels in both RNA-seq and microarrays (Database S1 and SI Appendix). Briefly, genes were kept if both microarray and RNA-seq gene expression were nominally associated with HbA1c levels, with both nominal and permutation P values < 0.05. In addition, 271 genes showed specific exon associations, with HbA1c levels not detected at the gene level (Database S1). Of the genes associated with HbA1c levels, 70 were also associated with in vitro insulin secretion in human islets, further highlighting their role in glucose metabolism (Database S1). Of particular interest are the genes whose expression is associated with lower HbA1c levels and higher insulin secretion, such as RAS guanyl releasing protein 1 (RASGRP1) (6), transcription factor RFX3 (21), and nicotinamide nucleotide transhydrogenase (NNT) (22), all of which have been suggested to regulate insulin secretion (SI Appendix, Figs. S5–S7). RFX3 has also been suggested to regulate the glucokinase promoter, and thereby its expression in a mouse insulinoma cell line MIN6 (21). In line with these findings, we observed a clear coexpression between the RFX3 and GCK genes in human islets (SI Appendix, Fig. S19). Of the established T2D and glycemic associated loci (2, 3, 2328) whose gene expression proxies were associated with HbA1c levels, solute carrier family 30 (zinc transporter), member 8 (SLC30A8), glucose-6-phosphatase, catalytic, 2 (G6PC2), and proprotein convertase subtilisin/kexin type 1 (PCSK1) showed the highest expression using RNA-seq (Fig. 1B and Database S1). This is in line with our previous findings using microarrays (6). SLC30A8, G6PC2, and PCSK1 also showed a strong positive correlation with glucagon expression (SI Appendix, Fig. S8). By using the RABT Cufflinks transcript assembly method (29), we also detected 445 potential novel genes with exon–exon junctions in addition to the existing GENCODE (30), UCSC, and Ensembl gene structure annotations (Database S1). Of these potential novel gene loci, 28 (6%) show coding potential, as assessed by the CPAT tool (31), and 391 (88%) are within 5 kb of known human islet active chromatin DNase, FAIRE, or H3K4me3 peaks (3234), pointing to candidate nearby promoters for those genes (Database S1). One of these potential novel genes, although not showing any coding potential nor close to any known islet open chromatin mark, was also associated with HbA1c, and this new gene locus is in a ∼10-kb region nominally significant in the MAGIC database for fasting glucose (23) (SI Appendix, Fig. S9).

Fig. 1.

Fig. 1.

Genes expressed in 89 human pancreatic islets stratified by glucose tolerance status. (A) RNA-seq normalized median expression of the top 25 nonribosomal genes expressed in islets. (B) RNA-seq normalized median expression of the 17 genes that show significant islet expression association with glucose tolerance status and are putatively associated with established T2D and glycemic associated loci (2, 3, 2328). Genes are ordered by decreasing median expression in all 89 islet donors. Normal corresponds with normoglycemic donors (HbA1c < 6%; n = 51), IGT corresponds with impaired glucose-tolerant donors (6% ≤ HbA1c < 6.5%; n = 15), and T2D corresponds with diabetic donors (HbA1c ≥ 6.5%; n = 12). Error bars represent SEM values. *Genes that show significant expression association with glucose tolerance status detected both by expression arrays and RNA-seq with both nominal and permutation P values < 0.05 (after performing 10,000 permutations). §Additional genes that show significant expression association with glucose tolerance status detected only with RNA-seq (at permuted P value < 0.05).

Effect of SNPs on Gene Expression (eQTLs) and Splicing (sQTLs) in Human Pancreatic Islets.

Because many SNPs are located in noncoding regions, suggesting they may influence gene expression, we analyzed whether any SNP genotyped in our islet samples and further imputed to the 1000 Genomes reference panel (35) would influence RNA-seq gene expression (eQTL) or exon use (sQTL) in cis (within 250 kb of the SNP). For analysis purposes, we identified a single best “sentinel” SNP for each gene or exon, defined as the SNP with the lowest P value per eQTL or sQTL gene. Applying these criteria and thresholds, we identified 616 cis eQTLs for known genes (Fig. 2A and Database S1), whereas 24 eQTLs were detected in previously unknown transcribed loci (Database S1; Materials and Methods). Notably, 54% of these eQTLs would have been missed in a microarray because the gene is not probed on the array or shows low expression (SI Appendix, Fig. S10).

Fig. 2.

Fig. 2.

Landscape of cis-eQTLs (≤250 kb) in 89 human pancreatic islets and eQTL genes validated to interfere with glucose metabolism and insulin secretion. (A) Manhattan plot of the best P value per SNP, showing the top 10 eQTL genes (FDR < 1% line drawn in black at P value = 2.267e−05). (B) eQTL SNP genotype versus eQTL gene expression (n = 89). All eQTLs pass FDR < 1% and 10,000 permutations. (C) eQTL gene expression stratified by glucose tolerance status [normal, HbA1c < 6% (n = 51); IGT, 6% ≤ HbA1c < 6.5% (n = 15); T2D, HbA1c ≥ 6.5% (n = 12)]. All genes have a nominal and permutation P value < 0.05 in both RNA-seq and microarrays (after 10,000 permutations). (D) Insulin secretion in response to 2.8 and 16.7 mM glucose 72 h after siRNA transfection, as measured during 1 h static incubation. Data are shown from three independent experiments for each siRNA. Data are normalized for protein content. Bars represent mean ± SD. **P < 0.01 and *P < 0.05 versus control siRNA. The knockdown efficiency of NT5E, PAK7, and TMED6 were above 80%, whereas for TSPAN33 it was 50%.

Our sample size permitted us to detect significant eQTLs at >90% power with an effect size of 0.5 (beta) or more (SI Appendix, Fig. S11). Notably, only in about half of the cases did the eQTL SNPs influence expression on the nearest gene. The strongest eQTLs were detected in the lactate dehydrogenase C (LDHC), tRNA-yW synthesizing protein 1 homolog B (Saccharomyces cerevisiae) (TYW1B), and endoplasmic reticulum aminopeptidase 2 (ERAP2) genes (Fig. 2A). LDHC encodes the enzyme lactate dehydrogenase C, which catalyzes the glycolytic conversion of lactate to pyruvate. Although expression of another lactate dehydrogenase, LDHA, is suggested to be repressed in pancreatic β cells because of a minor role of anaerobic glycolysis in the adult β cell (36), LDHC is expressed at similar levels in α and β cells (17). Knock-down of LDHC and TYW1B in INS-1 cells using siRNA did not affect insulin secretion. ERAP2 has been ascribed a role in autoimmunity and type 1 diabetes (37), and its eQTL sentinel SNP is in high linkage disequilibrium (r2 > 0.8) with the genome-wide significant GWAS SNP rs1019503 for glucose levels 2 h after an oral glucose challenge (3). Furthermore, we observed 371 splicing QTLs (sQTLs) not reflected by changes in expression at the gene level (Database S1 and SI Appendix, Fig. S13). There was eQTL and sQTL enrichment in regions of islet active chromatin, such as those characterized by DNase I hypersensitivity [Fisher exact test, P value < 2.2e−16 (odds ratio = 2.1) for eQTLs; and P value = 3.8e−11 (odds ratio = 1.9 for sQTLs)], H3K4m3 [P value < 2.2e−16(odds ratio = 3.1), for eQTLs and P value < 2.2e−16 (odds ratio = 2.3) for sQTLs], and FAIRE [P value = 2.1e−07 (odds ratio = 2.7) for eQTLs and P value = 0.02 (odds ratio = 1.9) for sQTLs] (3234). There was no indication that these eQTLs and sQTLs were enriched in evolutionarily conserved sites.

Because GWAS for T2D only enabled identification of loci, rather than genes, we examined whether SNPs known to associate with T2D or related traits (glucose, insulin) would have a cis effect on gene expression or exon use. We found enrichment for GWAS T2D/glycemic trait loci in eQTLs (Fisher exact test, P value 4.1e−03; odds ratio = 5.0), with five GWAS SNPs showing a cis eQTL effect, and in the case of rs1535500, the effect was not on the nearest gene (Table 1 and SI Appendix, Fig. S14). Notably, of the 1,619 genes whose expression correlated with HbA1c, 35 (2%) had an eQTL (Database S1). We examined whether the eQTL SNPs in these genes were associated with insulin and glucose concentrations in the DIAGRAM and MAGIC databases (2, 3, 2328). The sentinel eQTL SNP for sorting nexin 19 (SNX19), rs3751034, was nominally associated with HbA1c in MAGIC (P value < 0.01) (28). SNX19 has also been shown to regulate insulin secretion in a mouse pancreatic β-cell line (38). Finally, we tested whether the three eQTL genes [tetraspanin 33 (TSPAN33), 5′-nucleotidase, ecto (NT5E), and transmembrane emp24 protein transport domain containing 6 (TMED6)] showing the strongest effect on HbA1c levels would also influence insulin secretion by disrupting their expression in INS-1 cells. We also tested p21 protein activated kinase 7 (PAK7), a gene associated with HbA1c levels in both RNA-seq and microarray but only detected as an eQTL gene by RNA-seq. Down-regulation of the expression of these genes was associated with significantly reduced glucose-stimulated insulin secretion (Fig. 2 BD). Taken together, we present a list of SNPs influencing gene expression in human pancreatic islets with a likely role in regulating glucose homeostasis.

Table 1.

Genome-wide significant GWAS T2D/glycemic hits as eQTLs in human pancreatic islets

SNP Nearest gene eQTL gene eQTL P value Allele change eQTL direction GWAS trait GWAS effect allele
rs1019503 ERAP2 ERAP2 7.1e−24 G > A + 2-h glucose (3) A
rs2028299 AP3S2 AP3S2 1.1e−14 C > A T2D (24) A
rs505922 ABO ABO 5.3e−08 T > C + Disposition index (25) C
rs10830963 MTNR1B MTNR1B 8.6e−08 C > G + T2D, fasting glucose (2, 26) G
rs1535500 KCNK16 KCNK17 1.2e−06 G > T + T2D (27) T

eQTLs and HbA1c Influence Expression of lincRNAs.

lincRNAs have recently been ascribed a role in the regulation of gene transcription, including pancreatic islets (16). We identified 493 RefSeq lincRNAs expressed in the pancreatic islets, with 54 of those being influenced by eQTLs or sQTLs and/or related to HbA1c levels (Database S1). Of these 54 lincRNAs, seven (13%) have also been reported in a study of lincRNAs in human pancreatic β cells (15). Of the 616 eQTLs we identified (Fig. 2A), 33 (5%) influenced the expression of lincRNAs, eight of which have also been reported in other tissues. Moreover, six (2%) of 371 sQTLs were seen within lincRNAs. Notably, 17 lincRNAs were significantly associated with HbA1c levels, two of which also had an eQTL (LOC283177 and SNHG5) (Database S1). To obtain insight into putative target genes of these two lincRNAs, we performed a coexpression analysis linking their expression with all other genes in pancreatic islets. This analysis showed a strong coexpression of the MAP-kinase activating death domain (MADD), synaptotagmin 11 (SYT11), and paired box 6 (PAX6) genes with LOC283177 (Fig. 3). All these genes have been ascribed a key role in islet function. Synaptotagmin 11 (SYT11) is known to regulate exocytosis of insulin (39) and MADD proinsulin synthesis (25), and PAX6 is involved in development of pancreatic islets (40). In support of this, LOC283177 expression was directly associated with insulin exocytosis in the islets (Fig. 3F). The lincRNA ANRIL (also known as CDKN2B-AS1), located in a locus on chromosome 9p, has been associated with both T2D (2) and cardiovascular disease. Although eQTLs for ANRIL have been reported in human blood (41), we could not detect any eQTL for ANRIL in human islets or any coexpressed genes.

Fig. 3.

Fig. 3.

Expression analysis of the lincRNA LOC283177 in 89 human pancreatic islets. (A) The lincRNA LOC283177 has an eQTL (n = 89, significant at FDR < 1% and 10,000 permutations), (B) which is associated with HbA1c in the islet donors (nominal and permutation P value < 0.05 after 10,000 permutations). Normal corresponds to normoglycemic donors (HbA1c < 6%; n = 51), IGT corresponds to impaired glucose tolerant donors (6% ≤ HbA1c < 6.5%; n = 15), and T2D corresponds to diabetic donors (HbA1c ≥ 6.5%; n = 12). LOC283177 is coexpressed with the diabetic genes (C) MADD, (D) PAX6, (E) SYT11, and (F) associates with depolarization-evoked insulin exocytosis (Spearman correlation test significance at FDR < 1%).

Allelic Expression Imbalance.

Both allelic expression imbalance (AEI) and cis-QTL analysis detect genetic effects on gene transcription, although they frequently do not capture the same loci. Whereas eQTL and sQTL refer to the effect of a SNP on expression of the gene or specific exons/isoforms, respectively, AEI refers to imbalance between expression of maternal and paternal alleles and, consequently, can only be detected in the case of heterozygosity. To further elucidate the cis-regulatory potential in islets, we searched for genes showing allelic expression imbalance. We compared transcriptome and exome sequencing from the same individuals, using Fisher test to define significant deviation from the expected 50/50 allelic distribution for the SNPs. Thereby, we could detect 1,528 SNPs showing potential allelic imbalance in at least two samples at false discovery rate (FDR) <1% (Database S1). These encompass 1,102 genes, 14% of which have been previously suggested to be imprinted and/or showing imbalance of expression in other human tissues (Database S1). Only 3% of eQTLs and 0.5% of sQTLs were in strong linkage disequilibrium with an AEI SNP. We validated, with Sanger sequencing, an AEI in the MMP7 gene and showed that the missense variant rs10502001 is nominally associated with exocytosis of insulin (SI Appendix, Fig. S15). To detect allelic imbalance sites relevant to T2D, we filtered AEI sites ascertained in at least 50% of the samples and found PEG3 (paternally expressed gene 3). PEG3 is a gene known to be imprinted in other tissues and to change its methylation levels in murine oocytes of diabetic females (42), suggesting a link between allelic imbalance and imprinting/methylation. Notably, we show clear differences in the degree of methylation in a region of the PEG3 promoter, being hypomethylated in T2D islets, which do not have allelic imbalance (Fig. 4). These data indicate that differential methylation could be the cause of allelic imbalance, which in turn could influence susceptibility to T2D. Of SNPs associated with T2D or related glycemic traits, we found enrichment for allelic imbalance (Fisher exact test P value = 2.3e−06; odds ratio = 10.1), with eight showing evidence of allelic imbalance in solute carrier family 2, member 2 (SLC2A2), adaptor-related protein complex 3, sigma 2 subunit (AP3S2), thyroid adenoma associated (THADA), MADD, ERAP2, aminomethyltransferase (AMT), forkhead box A2 (FOXA2), and La ribonucleoprotein domain family, member 6 (LARP6) loci (Table 2).

Fig. 4.

Fig. 4.

Allelic imbalance in the PEG3 gene is associated with diabetic status. (A) Methylation fraction of different parts of PEG3 was assessed by Illumina’s 450k chip in a subset of islets (adjusted t-test P value depicted). Cases were three patients with T2D, and controls were 21 normal patients or patients with IGT, sampled at random from the 89 total samples. Methylation of a CpG site in the promoter of PEG3 was significantly higher in the nondiabetic islets. (B) The CpG site is in the islet active promoter of PEG3 (43). The black bar corresponds to PEG3 promoter, and the red bar corresponds to an islet active enhancer region. (C) Probed with the SNP rs3143, the RNA allelic imbalance is more pronounced in nondiabetic (normal and IGT, n = 37) than diabetic (T2D, n = 9) islets (P value = 0.03, Wilcoxon rank sum test). The y axis being 0.5 means no allelic imbalance, whereas y axis = 1 or 0 means expression of only one allele; that is, imprinting. Not all normal, IGT, or T2D were tested because of the harsh filtering criteria before allelic imbalance testing (SI Appendix). TSS1500, 1,500 base pairs from transcription start site. TSS200, 200 base pairs from transcription start site. UTR3′, 3′ untranslated region. UTR5′, 5′ untranslated region.

Table 2.

Allelic imbalance loci in high linkage disequilibrium with genome-wide significant GWAS T2D/glycemic trait hits

AEI gene AEI SNPs Linkage disequilibrium r2 > 0.8 with GWAS top SNP GWAS trait GWAS effect allele
SLC2A2 rs55679742, rs55989805 rs11920090 Fasting glucose (26) T
AP3S2 rs2028299 rs2028299 T2D (24) A
LARP6 rs3825970 rs1549318 Fasting proinsulin (25) T
THADA rs7578597 rs10203174 T2D (2) C
MADD rs35233100 rs35233100 Fasting proinsulin (25) C
ERAP2 rs2287988, rs2548538 rs1019503 2-h glucose (3) A
AMT rs6997 rs11715915 Fasting glucose (3) C
FOXA2 rs6048192 rs6113722 Fasting glucose (3) G

RNA Editing.

Finally, we assessed the frequency of RNA editing events in the pancreatic islet transcriptome, using a stringent pipeline to identify differences between DNA and RNA sequences, by comparing exome and RNA sequencing data (SI Appendix, Fig. S16). We found 65 loci showing potential RNA editing in at least two individuals overlapping 61 genes, two of which were in loci associated with T1D and T2D, GLIS3 (44) and ZFAND3 (2) (Database S1). Seven of the RNA editing events have also been reported before. As previously observed (45), the majority of RNA editing events were localized in the 3′UTR region or downstream of genes (67%), suggesting that RNA editing might play a role in miRNA-mediated regulation of gene expression by altering miRNA target sites or by affecting degradation of RNA. As described for other tissues, the vast majority of RNA editing events were canonical A-to-G events (83% if T-to-C events indicative of A-to-I editing on the opposite strand also are included) (SI Appendix, Fig. S17) (45). By randomly choosing nine editing events, we could validate three of the six A-to-G events by Sanger sequencing, but none of the three non A-to-G events (SI Appendix, Fig. S18). This low validation rate brings into question some previous reports of thousands of RNA editing events based simply on RNA sequencing (46).

Discussion

By combining RNA and exome sequencing of human pancreatic islets with in vitro and in vivo functional studies, we present novel insights into the molecular mechanisms by which impaired islet function can contribute to deregulated glucose metabolism. Coexpression analysis showed that expression of many genes correlated strongly with glucagon, not least SLC30A8 encoding the zinc transport ZnT8 (SI Appendix, Fig. S8A). Rare loss-of-function variants in the SLC30A8 gene have recently been associated with lowering of blood glucose and protection from T2D (47), but the mechanism for this glucose-lowering effect has been unclear, especially as disruption of the SLC30A8 gene in mice has yielded the opposite phenotype: glucose intolerance (48).

The current data might thus shed some light on this paradox: the lower the expression in human pancreatic islets of SLC30A8, the lower the expression of glucagon. It remains to be shown whether carriers of these loss-of-function mutation carriers also show inappropriately low glucagon concentrations. Because most SNPs associated with T2D are intronic or intergenic, it has been assumed that most of them would influence expression, rather than function, of a gene. Although the nearest genes often have been suggested as targets, this has not previously been formally tested in human islets, which represent the culprit in the pathogenesis of T2D. Our current study in a large number of human islets allowed this analysis and showed enrichment of GWAS SNPs associated with T2D or glycemic traits in eQTLs (Table 1) and in genetic variants showing allelic imbalance (Table 2). Although we often assume that both parental alleles are expressed to the same degree, this was not the case for SNPs in 1,102 genes, including eight T2D-associated genes (Table 2). This could easily mask an association if the effect of the two parental alleles is bidirectional. We also found allelic imbalance to be often associated with DNA methylation. Genes found to be differentially methylated in human pancreatic islets of non-T2D versus T2D donors (49) were enriched to have allelic imbalance of expression in our dataset (Fisher exact test P value = 6.8e−4; odds ratio = 1.5). Moreover, PEG3 was here detected to have its allelic imbalance associated with diabetic status (Fig. 4). PEG3 encodes for a zinc finger protein that may play a role in cell proliferation and p53-mediated apoptosis (50), mechanisms that could be involved in the regulation of functional β-cell mass (51).

We also found several eQTLs and sQTLs associated with measures of β-cell function and glucose metabolism, most notably variation in the TMED6, NT5E, PAK7, and TSPAN33 genes, whose disruption in INS-1 cells resulted in impaired insulin secretion (Fig. 2 BD). These and the other identified genes with an eQTL associated with in vitro and in vivo effects on glucose metabolism could be further explored as potential novel drug targets (Database S1). In addition, many eQTLs and sQTLs influenced expression of noncoding RNAs, many of which seem to target genes of importance for β-cell function. Among them, the lincRNA LOC283177 was found to be coexpressed with key genes implicated in islet function (PAX6, SYT11, and MADD) (25, 39, 40), and its expression correlated with HbA1c levels and insulin exocytosis (Fig. 3). Finally, we also provide, to the best of our knowledge, the first genome-wide catalog of RNA editing events in human islets mostly related to A-to-G events, but our data also emphasize the need for validation rather than simply relying on RNA sequencing.

There are some limitations with the study we need to take into account. One caveat could be purity of human cadaver islets and differences in contribution of exocrine and endocrine tissue or different contribution of α and β cells between normoglycemic and hyperglycemic donors. We focused on whole islets, as sorting of islet cells would have limited the amount of tissue available for the different analyses. Furthermore, there is important additional information to gain from studying the microorgan islet as an entity, as shown by the expression of other pancreatic hormones and their coexpression. However, some information on cell-specific expression is available from three recent papers (1618) on a small number of sorted β cells. As described in the Materials and Methods, Database S1, and SI Appendix, Fig. S20, there was no difference in purity between individuals with NGT, IGT, and T2D (Kruskal-Wallis rank sum test P value = 0.83). In addition, the contribution of exocrine and endocrine tissue did not significantly differ between diabetic and nondiabetic islets, as indicated by expression of pancreatic-specific exocrine (alpha 2 amylase) and endocrine (glucagon in alpha cells, MAFA in beta cells, and somatostatin in delta cells) genes (SI Appendix, Fig. S21). Moreover, beta cell content, as measured by FACS β/α cells ratio, was also not significantly different among NGT, IGT, and T2D (Kruskal-Wallis rank sum test P value = 0.14) (SI Appendix, Fig. S22). Acknowledging these limitations, only large enough numbers can outweigh the problems of heterogeneity and purity. To this end, the current study, to our knowledge, represents the largest collection of human islets published thus far. In conclusion, we provide a comprehensive catalog of novel genetic variants influencing gene expression in human pancreatic islets and metabolic phenotypes to facilitate diabetes research.

Materials and Methods

Detailed materials and methods, including all statistical analysis, are available in SI Appendix. Islets from 89 cadaver donors of European ancestry were provided by the Nordic Islet Transplantation Program and processed as previously described (6). Microarray analysis was performed using oligo (52) and sva (53) Bioconductor packages and processed with the standard Affymetrix protocol. Sample preparation for RNA-seq was performed using Illumina’s TruSeq RNA Sample Preparation Kit. Output reads were aligned to the human reference genome (hg19) with TopHat v.2.0.2 (54), using Bowtie v.0.12.8 (55). The dexseq_count python script was used by counting uniquely mapped reads in each exon (56). Gene and exon expression normalizations were then performed using the TMM method (57), and further normalization was applied by adjusting the expression to gene or exon length, respectively. A linear model adjusting for age and sex as implemented in the R Matrix eQTL package (58) was used to determine the expression of genes and exons associated with HbA1c class. Exome sequencing was performed using the Illumina exome sequencing protocols. Reads were aligned to the human genome (hg19) with BWA v.0.6.2 (59). Postalignment processing and SNP calling was done with GATK v.1.6.2 (60). Allelic imbalance of expression was analyzed by Fisher exact test to calculate the proportion of reference/alternative alleles in the exome sequencing versus RNA-seq for each sample. RNA editing sites were called on autosomes in positions that were homozygous in the exome sequencing but heterozygous in the RNA-seq data. Genotyping was performed on the Illumina HumanOmniExpress 12v1 C chips, and all of the samples passed standard genotype QC metrics. Genotypes were imputed to 1000 Genomes data, using IMPUTE2 (61) and SHAPEIT (62). cis-eQTL and cis-sQTL associations were computed between gene expression levels (eQTL) or exon expression levels (sQTL) and all SNPs within 250 kb up- or downstream of each of these genes. We used a linear model adjusting for age and sex, as implemented in the R Matrix eQTL package (58).

Supplementary Material

Supplementary File
Supplementary File

Footnotes

The authors declare no conflict of interest.

This article is a PNAS Direct Submission.

Database deposition: The data reported in this paper have been deposited in the Gene Expression Omnibus (GEO) database, www.ncbi.nlm.nih.gov/geo (accession no. GSE50398).

This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1402665111/-/DCSupplemental.

References

  • 1.Scully T. Diabetes in numbers. Nature. 2012;485(7398):S2–S3. doi: 10.1038/485s2a. [DOI] [PubMed] [Google Scholar]
  • 2.Morris AP, et al. Wellcome Trust Case Control Consortium Meta-Analyses of Glucose and Insulin-related traits Consortium (MAGIC) Investigators Genetic Investigation of ANthropometric Traits (GIANT) Consortium Asian Genetic Epidemiology Network–Type 2 Diabetes (AGEN-T2D) Consortium South Asian Type 2 Diabetes (SAT2D) Consortium DIAbetes Genetics Replication And Meta-analysis (DIAGRAM) Consortium Large-scale association analysis provides insights into the genetic architecture and pathophysiology of type 2 diabetes. Nat Genet. 2012;44(9):981–990. doi: 10.1038/ng.2383. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Scott RA, et al. DIAbetes Genetics Replication and Meta-analysis (DIAGRAM) Consortium Large-scale association analyses identify new loci influencing glycemic traits and provide insight into the underlying biological pathways. Nat Genet. 2012;44(9):991–1005. doi: 10.1038/ng.2385. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Lyssenko V, et al. Clinical risk factors, DNA variants, and the development of type 2 diabetes. N Engl J Med. 2008;359(21):2220–2232. doi: 10.1056/NEJMoa0801869. [DOI] [PubMed] [Google Scholar]
  • 5.Dimas AS, et al. MAGIC Investigators Impact of type 2 diabetes susceptibility variants on quantitative glycemic traits reveals mechanistic heterogeneity. Diabetes. 2014;63(6):2158–2171. doi: 10.2337/db13-0949. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Taneera J, et al. A systems genetics approach identifies genes and pathways for type 2 diabetes in human islets. Cell Metab. 2012;16(1):122–134. doi: 10.1016/j.cmet.2012.06.006. [DOI] [PubMed] [Google Scholar]
  • 7.Mahdi T, et al. Secreted frizzled-related protein 4 reduces insulin secretion and is overexpressed in type 2 diabetes. Cell Metab. 2012;16(5):625–633. doi: 10.1016/j.cmet.2012.10.009. [DOI] [PubMed] [Google Scholar]
  • 8.Dermitzakis ET. Cellular genomics for complex traits. Nat Rev Genet. 2012;13(3):215–220. doi: 10.1038/nrg3115. [DOI] [PubMed] [Google Scholar]
  • 9.Dorrell C, et al. Transcriptomes of the major human pancreatic cell types. Diabetologia. 2011;54(11):2832–2844. doi: 10.1007/s00125-011-2283-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Eizirik DL, et al. The human pancreatic islet transcriptome: Expression of candidate genes for type 1 diabetes and the impact of pro-inflammatory cytokines. PLoS Genet. 2012;8(3):e1002552. doi: 10.1371/journal.pgen.1002552. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Maffei A, et al. Identification of tissue-restricted transcripts in human islets. Endocrinology. 2004;145(10):4513–4521. doi: 10.1210/en.2004-0691. [DOI] [PubMed] [Google Scholar]
  • 12.Gunton JE, et al. Loss of ARNT/HIF1β mediates altered gene expression and pancreatic-islet dysfunction in human type 2 diabetes. Cell. 2005;122(3):337–349. doi: 10.1016/j.cell.2005.05.027. [DOI] [PubMed] [Google Scholar]
  • 13.Kutlu B, et al. Detailed transcriptome atlas of the pancreatic beta cell. BMC Med Genomics. 2009;2:3. doi: 10.1186/1755-8794-2-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Lyttle BM, et al. Transcription factor expression in the developing human fetal endocrine pancreas. Diabetologia. 2008;51(7):1169–1180. doi: 10.1007/s00125-008-1006-z. [DOI] [PubMed] [Google Scholar]
  • 15.Marselli L, et al. Gene expression profiles of Beta-cell enriched tissue obtained by laser capture microdissection from subjects with type 2 diabetes. PLoS ONE. 2010;5(7):e11499. doi: 10.1371/journal.pone.0011499. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Morán I, et al. Human β cell transcriptome analysis uncovers lncRNAs that are tissue-specific, dynamically regulated, and abnormally expressed in type 2 diabetes. Cell Metab. 2012;16(4):435–448. doi: 10.1016/j.cmet.2012.08.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Bramswig NC, et al. Epigenomic plasticity enables human pancreatic α to β cell reprogramming. J Clin Invest. 2013;123(3):1275–1284. doi: 10.1172/JCI66514. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Nica AC, et al. Cell-type, allelic, and genetic signatures in the human pancreatic beta cell transcriptome. Genome Res. 2013;23(9):1554–1562. doi: 10.1101/gr.150706.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Djebali S, et al. Landscape of transcription in human cells. Nature. 2012;489(7414):101–108. doi: 10.1038/nature11233. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Marioni JC, Mason CE, Mane SM, Stephens M, Gilad Y. RNA-seq: An assessment of technical reproducibility and comparison with gene expression arrays. Genome Res. 2008;18(9):1509–1517. doi: 10.1101/gr.079558.108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Ait-Lounis A, et al. The transcription factor Rfx3 regulates beta-cell differentiation, function, and glucokinase expression. Diabetes. 2010;59(7):1674–1685. doi: 10.2337/db09-0986. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Aston-Mourney K, et al. Increased nicotinamide nucleotide transhydrogenase levels predispose to insulin hypersecretion in a mouse strain susceptible to diabetes. Diabetologia. 2007;50(12):2476–2485. doi: 10.1007/s00125-007-0814-x. [DOI] [PubMed] [Google Scholar]
  • 23.Manning AK, et al. DIAbetes Genetics Replication And Meta-analysis (DIAGRAM) Consortium Multiple Tissue Human Expression Resource (MUTHER) Consortium A genome-wide approach accounting for body mass index identifies genetic variants influencing fasting glycemic traits and insulin resistance. Nat Genet. 2012;44(6):659–669. doi: 10.1038/ng.2274. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Kooner JS, et al. DIAGRAM MuTHER Genome-wide association study in individuals of South Asian ancestry identifies six new type 2 diabetes susceptibility loci. Nat Genet. 2011;43(10):984–989. doi: 10.1038/ng.921. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Huyghe JR, et al. Exome array analysis identifies new loci and low-frequency variants influencing insulin processing and secretion. Nat Genet. 2013;45(2):197–201. doi: 10.1038/ng.2507. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Dupuis J, et al. DIAGRAM Consortium GIANT Consortium Global BPgen Consortium Anders Hamsten on behalf of Procardis Consortium MAGIC investigators New genetic loci implicated in fasting glucose homeostasis and their impact on type 2 diabetes risk. Nat Genet. 2010;42(2):105–116. doi: 10.1038/ng.520. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Cho YS, et al. DIAGRAM Consortium MuTHER Consortium Meta-analysis of genome-wide association studies identifies eight new loci for type 2 diabetes in east Asians. Nat Genet. 2012;44(1):67–72. doi: 10.1038/ng.1019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Soranzo N, et al. WTCCC Common variants at 10 genomic loci influence hemoglobin A₁(C) levels via glycemic and nonglycemic pathways. Diabetes. 2010;59(12):3229–3239. doi: 10.2337/db10-0502. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Roberts A, Pimentel H, Trapnell C, Pachter L. Identification of novel transcripts in annotated genomes using RNA-Seq. Bioinformatics. 2011;27(17):2325–2329. doi: 10.1093/bioinformatics/btr355. [DOI] [PubMed] [Google Scholar]
  • 30.Harrow J, et al. GENCODE: The reference human genome annotation for The ENCODE Project. Genome Res. 2012;22(9):1760–1774. doi: 10.1101/gr.135350.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Wang L, et al. CPAT: Coding-Potential Assessment Tool using an alignment-free logistic regression model. Nucleic Acids Res. 2013;41(6):e74. doi: 10.1093/nar/gkt006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.ENCODE Project Consortium An integrated encyclopedia of DNA elements in the human genome. Nature. 2012;489(7414):57–74. doi: 10.1038/nature11247. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Stitzel ML, et al. NISC Comparative Sequencing Program Global epigenomic analysis of primary human pancreatic islets provides insights into type 2 diabetes susceptibility loci. Cell Metab. 2010;12(5):443–455. doi: 10.1016/j.cmet.2010.09.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Gaulton KJ, et al. A map of open chromatin in human pancreatic islets. Nat Genet. 2010;42(3):255–259. doi: 10.1038/ng.530. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.1000 Genomes Project Consortium A map of human genome variation from population-scale sequencing. Nature. 2010;467:1061–1073. doi: 10.1038/nature09534. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Sekine N, et al. Low lactate dehydrogenase and high mitochondrial glycerol phosphate dehydrogenase in pancreatic beta-cells. Potential role in nutrient sensing. J Biol Chem. 1994;269(7):4895–4902. [PubMed] [Google Scholar]
  • 37.Fierabracci A, Milillo A, Locatelli F, Fruci D. The putative role of endoplasmic reticulum aminopeptidases in autoimmunity: Insights from genomic-wide association studies. Autoimmun Rev. 2012;12(2):281–288. doi: 10.1016/j.autrev.2012.04.007. [DOI] [PubMed] [Google Scholar]
  • 38.Harashima S, et al. Sorting nexin 19 regulates the number of dense core vesicles in pancreatic β-cells. J Diabetes Investig. 2012;3(1):52–61. doi: 10.1111/j.2040-1124.2011.00138.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Andersson SA, et al. Reduced insulin secretion correlates with decreased expression of exocytotic genes in pancreatic islets from patients with type 2 diabetes. Mol Cell Endocrinol. 2012;364(1-2):36–45. doi: 10.1016/j.mce.2012.08.009. [DOI] [PubMed] [Google Scholar]
  • 40.Gosmain Y, et al. Pax6 is crucial for β-cell function, insulin biosynthesis, and glucose-induced insulin secretion. Mol Endocrinol. 2012;26(4):696–709. doi: 10.1210/me.2011-1256. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Cunnington MS, Santibanez Koref M, Mayosi BM, Burn J, Keavney B. Chromosome 9p21 SNPs Associated with Multiple Disease Phenotypes Correlate with ANRIL Expression. PLoS Genet. 2010;6(4):e1000899. doi: 10.1371/journal.pgen.1000899. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Ge ZJ, et al. Maternal diabetes causes alterations of DNA methylation statuses of some imprinted genes in murine oocytes. Biol Reprod. 2013;88(5):117. doi: 10.1095/biolreprod.112.105981. [DOI] [PubMed] [Google Scholar]
  • 43.Pasquali L, et al. Pancreatic islet enhancer clusters enriched in type 2 diabetes risk-associated variants. Nat Genet. 2014;46(2):136–143. doi: 10.1038/ng.2870. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Nogueira TC, et al. GLIS3, a susceptibility gene for type 1 and type 2 diabetes, modulates pancreatic beta cell apoptosis via regulation of a splice variant of the BH3-only protein Bim. PLoS Genet. 2013;9(5):e1003532. doi: 10.1371/journal.pgen.1003532. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Chen L. Characterization and comparison of human nuclear and cytosolic editomes. Proc Natl Acad Sci USA. 2013;110(29):E2741–E2747. doi: 10.1073/pnas.1218884110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Pickrell JK, Gilad Y, Pritchard JK. Comment on “Widespread RNA and DNA sequence differences in the human transcriptome”. Science. 2012;335(6074):1302. doi: 10.1126/science.1210484. author reply 1302. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Flannick J, et al. Go-T2D Consortium T2D-GENES Consortium Loss-of-function mutations in SLC30A8 protect against type 2 diabetes. Nat Genet. 2014;46(4):357–363. doi: 10.1038/ng.2915. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Hardy AB, et al. Effects of high-fat diet feeding on Znt8-null mice: Differences between β-cell and global knockout of Znt8. Am J Physiol Endocrinol Metab. 2012;302(9):E1084–E1096. doi: 10.1152/ajpendo.00448.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Dayeh T, et al. Genome-wide DNA methylation analysis of human pancreatic islets from type 2 diabetic and non-diabetic donors identifies candidate genes that influence insulin secretion. PLoS Genet. 2014;10(3):e1004160. doi: 10.1371/journal.pgen.1004160. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Jiang X, et al. The imprinted gene PEG3 inhibits Wnt signaling and regulates glioma growth. J Biol Chem. 2010;285(11):8472–8480. doi: 10.1074/jbc.M109.069450. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Zhou Y, et al. Survival of pancreatic beta cells is partly controlled by a TCF7L2-p53-p53INP1-dependent pathway. Hum Mol Genet. 2012;21(1):196–207. doi: 10.1093/hmg/ddr454. [DOI] [PubMed] [Google Scholar]
  • 52.Carvalho BS, Irizarry RA. A framework for oligonucleotide microarray preprocessing. Bioinformatics. 2010;26(19):2363–2367. doi: 10.1093/bioinformatics/btq431. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Leek JT, Johnson WE, Parker HS, Jaffe AE, Storey JD. The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics. 2012;28(6):882–883. doi: 10.1093/bioinformatics/bts034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Trapnell C, Pachter L, Salzberg SL. TopHat: Discovering splice junctions with RNA-Seq. Bioinformatics. 2009;25(9):1105–1111. doi: 10.1093/bioinformatics/btp120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Langmead B, Trapnell C, Pop M, Salzberg SL. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 2009;10(3):R25. doi: 10.1186/gb-2009-10-3-r25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Anders S, Reyes A, Huber W. Detecting differential usage of exons from RNA-seq data. Genome Res. 2012;22(10):2008–2017. doi: 10.1101/gr.133744.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Robinson MD, Oshlack A. A scaling normalization method for differential expression analysis of RNA-seq data. Genome Biol. 2010;11(3):R25. doi: 10.1186/gb-2010-11-3-r25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Shabalin AA. Matrix eQTL: Ultra fast eQTL analysis via large matrix operations. Bioinformatics. 2012;28(10):1353–1358. doi: 10.1093/bioinformatics/bts163. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25(14):1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.McKenna A, et al. The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20(9):1297–1303. doi: 10.1101/gr.107524.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Howie BN, Donnelly P, Marchini J. A flexible and accurate genotype imputation method for the next generation of genome-wide association studies. PLoS Genet. 2009;5(6):e1000529. doi: 10.1371/journal.pgen.1000529. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Delaneau O, Marchini J, Zagury JF. A linear complexity phasing method for thousands of genomes. Nat Methods. 2012;9(2):179–181. doi: 10.1038/nmeth.1785. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary File
Supplementary File

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES