Introductory paragraph
Gene expression is tightly regulated with many genes exhibiting cell-specific silencing when their protein product would disrupt normal cellular function1. This silencing is largely controlled by non-coding elements and their disruption might cause human disease2. We performed gene-agnostic screening of the non-coding regions to discover new molecular causes of congenital hyperinsulinism. This identified 14 non-coding de novo variants affecting a 42bp conserved region encompassed by a regulatory element in intron 2 of Hexokinase 1 (HK1). HK1 is widely expressed across all tissues except for liver and pancreatic beta-cells and is thus termed a “disallowed gene” in these specific tissues. We demonstrated that the variants result in a loss of repression of HK1 in pancreatic beta-cells, thereby causing insulin secretion and congenital hyperinsulinism. Using epigenomic data accessed from public repositories, we demonstrated that these variants reside within a regulatory region that we determine to be critical for cell-specific silencing. Importantly, this has revealed a disease mechanism for non-coding variants that cause inappropriate expression of a disallowed gene.
Genetic discovery in Mendelian disease has focused on identifying highly penetrant variants affecting the function of genes expressed in clinically affected tissue(s). Whilst this approach has proven successful the underlying aetiology of over 3000 presumed monogenic diseases remains undefined2–5, of which many have significant clinical and genetic heterogeneity6,7. Congenital Hyperinsulinism (CHI), is characterised by inappropriate insulin secretion during hypoglycaemia. It is a clinically and genetically heterogeneous disease where, despite extensive sequencing efforts, the underlying aetiology is not known in up to 50% of individuals8,9.
To identify new aetiologies, we performed whole-genome sequencing (WGS) on 135 individuals with biochemically confirmed, persistent CHI without an identified disease-causing variant in a known gene (Supplementary Table 1). We initially searched for genes containing new coding de novo or biallelic variants in two or more individuals. When no such genes were found, we turned to the non-coding genome and searched for de novo copy number variants. This identified a single locus within intron 2 of Hexokinase 1 (HK1) containing two heterozygous ~4.5Kb deletions with a 2,787bp overlap in two unrelated individuals (Extended Data Figure 1). Digital droplet PCR confirmed the deletions in both probands and an affected twin.
We next searched for de novo single nucleotide variants and indels within the minimal deleted region within our whole genome sequencing discovery cohort and in a replication cohort of 27 individuals with CHI who had undergone pancreatectomy as part of routine clinical care (Supplementary Table 1). This identified 7 different de novo variants in 12 probands. In two cases the variants had been inherited by similarly affected offspring (Figure 1). All variants were new10,11 and within a 42bp conserved region constrained against variation in gnomAD.12
Figure 1. Schematic representation of the HK1 gene and the identified variants.
a) Schematic representation of the HK1 gene (GRCh37/hg19 chr10:71,029,756-71,161,637). The full-length testis-specific isoform (ENST00000448642) and the shorter ubiquitously expressed isoform (ENST00000359426) are depicted. The positions of the two ~4.5Kb deletions within intron 2 of the shorter isoform and the 7 different heterozygous single nucleotide variants and indels within the minimal deleted region identified in 14 probands are shown. The number of probands with each variant is provided. An asterisk (*) denotes a single proband with two de novo variants (see Extended Data Figure 8). The variant positions are given according to the GRCh37/hg19 genomic coordinates. b) Partial pedigrees showing inherited HK1 variants in 3 families. Filled symbols represent individuals with congenital hyperinsulinism. (‡) Haplotype analysis confirmed that the HK1 variant had arisen de novo in patient 3.2. M, HK1 variant; N, no variant; NA, DNA not available. Pedigrees for the 11 probands with de novo HK1 variants are not shown.
Overall, the finding of 9 different de novo variants in 14 probands, co-segregating with disease in three additional family members provides overwhelming evidence of disease-causality.
All 17 individuals with a HK1 non-coding variant had severe early-onset CHI (median age at diagnosis: Birth [IQR: 0-14 d]). These variants were not associated with macrosomia at birth (median birthweight z score: 0.61 [IQR: 0.10-1.84]). This suggests that insulin secretion was not markedly increased in utero given its role as a potent growth factor during fetal development. In all cases the hyperinsulinism persisted, the eldest individual still known to be affected was 18 years of age. These findings suggest that the variants act to disrupt glucose-induced insulin secretion throughout post-natal life and into adulthood but have less impact during fetal development.
The HK1 variants caused a beta-cell specific defect with no common additional features between patients. In five individuals, medical management was ineffective, leading to pancreatic resection. Of these, the only case that had resolution of the CHI developed insulin-dependent diabetes following near-total pancreatectomy. Histopathological analysis of resected pancreatic tissue (n=2) demonstrated a discrete pathology, distinguishable from diffuse disease resulting from ABCC8 variants, the commonest cause of CHI (Extended Data Figure 2). An overview of the clinical features is provided in Supplementary Tables 2-3.
HK1 encodes a glycolytic enzyme that is silenced in the pancreas and liver but expressed in all other mature tissues (Extended Data Figure 3a-c) where it supports glucose metabolism, ensuring cell survival13. Within the pancreatic beta-cell, glucose is phosphorylated to glucose-6-phosphate by Glucokinase (GCK; Hexokinase 4) which, due to a low-binding affinity (Km ~8mM) acts as the pancreatic glucose-sensor coupling insulin release to the prevailing glucose concentration14. Hexokinase 1 has a markedly higher affinity for glucose (Km <50µM)14. Silencing of HK1 in favour of GCK in the beta-cell therefore ensures appropriate glucose-sensing, minimising insulin release at low glucose levels. Whilst bi-allelic loss-of-function HK1 variants have been reported to cause non-spherocytic haemolytic anaemia15, dominant or recessive coding variants have not been described in individuals with defects in glucose homeostasis; in keeping with the absence of HK1 protein in beta-cells.
In resected pancreatic tissue of individuals with non-coding variants, we found that HK1 was expressed and co-localised with insulin in the islets of affected tissue but not in controls (Figure 2). HK1 did not co-localise with glucagon (secreted by pancreatic alpha-cells) suggesting that the impact of the variant was beta-cell specific (Extended Data Figure 4). These results confirmed that the variants cause HK1 to be inappropriately expressed in the pancreatic beta-cell and explain why there is increased insulin secretion in these individuals during hypoglycaemia.
Figure 2. HK1 expression is present in the beta-cells of donors with HK1-variant CHI but not in non-hyperinsulinism (Non-HI) controls.
a) Staining of HK1 (red); insulin (cyan) and DAPI (dark blue) in pancreatic tissue resected from patient 6 (Supplementary Table 2) with a HK1 variant. Confocal imaging of sections demonstrates that HK1 co-localises with insulin whereas, no HK1 expression is observed in the non-HI control donor. Scale bar – 100µm. b) HK1 expression (Median fluorescence intensity (MFI)) is significantly increased in the HK1 HI donors (Patient 1 and 6; n=2 donors; 17015 beta-cells), when compared to Non-HI control donors (n=2 donors; 21408 beta-cells). Two-tailed Mann-Whitney (U=590830; ****p<0.0001). All cell data within a violin plot with the median (line) and IQR (dashed lines) presented. Source data provided in Supplementary Information.
We identified a putative islet cis-regulatory domain encompassing our critical region16 bound by a broad set of islet transcription factors, with NKX2-2 and FOXA2 most prominent (Figure 3). Single-nuclei ATAC-seq (snATAC-seq) data in human islets17 revealed a peak of open chromatin in beta-cells and showed that the relevant HK1 promoter remains accessible (Figure 3b). Human islet Hi-C data18 confirmed that both our region and the relevant HK1 promoter are contained within a well-insulated domain (Extended Data Figure 5), without evidence of contact to distal loci (Figure 3d, Extended Data Figure 5b) further supporting direct regulation by the critical region on the promoter.
Figure 3. HK1 variants disrupt transcription factor binding sites within a cis-regulatory region.
a) Expression of HK1 and GCK for comparison over the course of pancreatic differentiation from embryonic stem cells to maturing beta-cells, expression data over a beta-cell differentiation pseudotime19, Gaussian process regression median (line) and 95% confidence interval (shaded regions)I shown. DE – definitive endoderm, GT – gut tube, FG – foregut, PE – pancreatic endoderm, EP – endocrine precursors, IB – immature beta-cells. See Extended Data Figure 3 for further HK1 expression. b) Transcription factor binding in human islets16 and chromatin accessibility from snATAC-seq in the cluster of cells assigned to beta-cells17 over HK1 locus, the region containing the variants is marked as grey box. Chromatin accessibility in other endocrine cell types is provided in Extended Data Figure 6c) Chromatin accessibility during pancreatic differentiation over critical region, stages: ES – embryonic stem cell, DE – definitive endoderm, GT – gut tube, Early/Late PP – pancreatic progenitor, describe ATAC-seq from in vitro differentiation21 Beta cells give accessibility in snATAC-seq17 HiC data in human islets d)18 is contained within a well-insulated domain, heatmap gives log scale contact frequencies, white triangle and circles mark chromatin loops called in same study that span HK1, additional loops marked in Extended Data Figure 5 mark chromatin loops called in same study that span HK1, additional loops marked in . Region ± 850kb of HK1. Region ± 850kb of HK1. e) Transcription factor motif families disrupted by the variants, shown are those motif families maximally disrupted by each variant (red font). Notable secondary motif families given in Extended Data Figure 8 and Supplementary Table 6. Left, scatterplot of normalised motif scores for the reference versus mutated score (motif score normalised by maximal motif score), black line gives 1-1 line of equal motif. Number gives final two digits of chromatin position, e.g. 45 – 71108645. Right gives sequence logo for the three maximally disrupted families. Members of each family are expressed in beta-cells Extended Data Figure 7.
During development, HK1 is highly expressed in embryonic stem cells but is progressively downregulated during pancreatic cell-type differentiation becoming extinguished during beta-cell maturation19,20 (Figure 3a, Extended Data Figure 3d). GCK shows increased expression during differentiation as HK1 is downregulated (Figure 3a). Despite being absent from beta-cells, HK1 is expressed in mature stellate- and duct-cells suggesting differential regulation across pancreatic cell-types (Extended Data Figure 3e).
Chromatin accessibility data shows our critical region remains closed over pancreatic-cell differentiation until pancreatic progenitor stages, when it becomes accessible and bound by a complement of factors in the pancreatic progenitor regulatory network21,22 (Figure 3c, Extended Data Figure 6b-c). This suggests the region is necessary for HK1 repression in late pancreatic development and mediating beta-cell specific control in adult cells but that it does not have a role in the reduction of HK1 expression between embryonic stem cells and early pancreatic progenitors (Figure 3a).
Analysis of histone modifications during pancreatic differentiation, in human islets21,23, cell sorted human pancreatic alpha, beta and exocrine cells24 and the EndoC-βH1 beta-cell line25, revealed that repression is actively maintained. The critical region is marked by a bivalent state encompassed by focal peaks of active enhancer marks H3K4me1 and H3K27ac and a broader and dominant polycomb H3K27me3 repressive domain (Extended Data Figure 6d). This domain is diminished in exocrine cells supporting that this mode of repression is specific to endocrine cell types (Extended Data Figure 5d).
Analysis of transcription factor motif families revealed that the variants disrupt transcription factor binding sites for FOX, NKX2 and NFAT families (Supplementary Table 5, Figure 3e), which each have family members expressed in beta-cells (Extended Data Figure 7). This is supported by ChIP-seq data revealing that NKX2-2 and FOXA2 were prominently bound at this region in islets (Figure 3b). FOXA2 loss-of-function causes CHI26, and NKX2-2 is able to recruit a large repressive complex that regulates beta-cell specification through DNA methylation27 and may therefore have a role in maintaining the repressive epigenetic state at the HK1 locus21,23–25,28. Multiple members of the NFAT family are expressed in beta-cells (Extended Data Figure 7), with NFATC2 inducing beta-cell proliferation in human islets29. Analysis of snATAC-data from human islets revealed endocrine cells present in hormone-high and hormone-low states17, with the former associated with increased promoter accessibility over secreted endocrine genes and the latter over cell-cycle genes. Our critical region is accessible in the hormone-high state of alpha, beta, and delta-cells, and when compared between hormone-low state cells, only remains accessible in beta cells (Extended Data Figure 6a). This suggests that HK1 repression is actively maintained during beta-cell proliferation and not during alpha or delta-cell proliferation, which given NFATC2’s capacity to induce beta-cell proliferation, supports this factor in maintaining HK1 repression.
Discovering non-coding regulatory variants controlling HK1 and access to a broad range of publicly available epigenomic data16–25,30,31 enabled us to identify a regulatory element critical for the selective silencing of an otherwise ubiquitously expressed gene within a single cell-type. It is interesting to note that interruption of transcription factor binding is implicated in each of the 9 distinct variants and the locus is decorated by both active and repressive epigenetic marks suggesting that continual transcription factor binding is necessary for maintenance of repression in beta-cells. Future work should prioritise regulatory regions surrounding disallowed genes with similar epigenetic marks and to explore whether different regulatory mechanisms exist to silence disallowed genes within the beta-cell.
Over 60 beta-cell disallowed genes have been described, although the mechanism(s) controlling cell-specific silencing in humans have not been fully determined1,32. Linkage analysis of HK1 to CHI33 and HK1 expression in beta-cells34 have been reported in several patients, but the genetic aetiology was not established. Promoter variants in the beta-cell disallowed gene, SLC16A1 have also been reported in two families with exercise–induced hyperinsulinism. However, these variants increased transcription across cell types leading to the hypothesis that differences in the post-transcriptional regulation of mRNA across tissues could explain the beta-cell-specific phenotype35.
The identification of non-coding HK1 variants in individuals with CHI represents a rare example of regulatory non-coding variants affecting a gene in which coding variation does not cause the same phenotype36. These findings highlight a role for undiscovered regulatory variants causing disease through inappropriate expression of a normally functioning protein in a specific cell-type.
Non-coding variants affecting the regulation of HK1 cause CHI through its aberrant expression in beta-cells. These findings are important for future efforts to discover non-coding regulatory variants as they establish a critical role for disallowed genes in Mendelian disease.
Methods
Subjects
Congenital hyperinsulinism (CHI) was defined as an inappropriately high level of plasma insulin at the time of hypoglycaemia associated with inappropriately supressed ketones and free fatty acids presenting within the first 12 months of life. The definition of hypoglycaemia was based on the recommendations of the Pediatric Endocrine Society (blood glucose <2.8mmol/L in the presence of detectable insulin)37. Clinical details of the subjects are provided in Supplementary Table 1. Subjects with CHI were recruited by their clinicians for molecular genetic analysis to The Exeter Genomics Laboratory. Disease-causing variants in the known CHI genes had been excluded by targeted-next generation sequencing in all cases38.
Ethical considerations
This study was approved by the North Wales Research Ethics Committee (17/WA/0327, IRAS project ID: 231760) and was conducted in accordance with the Declaration of Helsinki, with all subjects or their parents providing informed consent for genetic testing. All tissue samples were studied with full ethics approval (West of Scotland Research Ethics Committee, reference: 20/WS/0074 IRAS project ID: 283620 or nPOD)39. No participant received compensation for entering this study.
Sample collection and DNA Extraction
Peripheral whole blood samples (1-5mls) were taken in EDTA blood collection tubes. Automated DNA extracted was undertaken on a Chemagic STAR (Hamilton Bonaduz AG, Switzerland) using the Chemagic STAR DNA blood extraction kit (CMG-1756, [Perkin Elmer, Waltham,USA]).
Whole-genome sequencing
Whole-genome sequencing of DNA extracted peripheral blood leukocytes was completed on 135 probands with CHI (n=3 with Illumina HiSeq 2500, n=69 with Illumina HiSeq X10, n=63 with BGISeq-500). The mean read depth across the whole genome was 36.9 (stddev 4.9). An additional 191 family members were also sequenced. The sequence data was aligned using BWA MEM version 0.7.15, and processed using a pipeline based on the GATK best practices (Picard version 2.7.1, GATK version 3.7). Variants were annotated using Alamut batch standalone version 1.11 (Rouen, France).
CNVs were called across the genome by SavvyCNV40 using a bin size of 2Kbp. We also developed a new tool (FindLargeInsertSizes, available at https://github.com/rdemolgen/SavvySuite) to detect small (≥1Kbp) deletions, insertions, inversions, and translocations using read pair information. We used this tool to screen all whole genome sequenced samples for structural variants within HK1 intron 2.
When a variant was identified, Sanger sequencing (for single nucleotide variants/indels) or digital droplet PCR (for deletions) was performed on samples from each available family member to confirm genome sequencing results.
Digital droplet PCR (ddPCR): was performed on leukocyte DNA from patients 1 and 2.1 and their unaffected parents to confirm the results of the whole-genome sequencing deletion analysis. ddPCR was also performed on the affected twin brother of patient 2.1 and the 27 individuals with CHI of unknown cause who had undergone pancreatectomy.
Relevant primers (Supplementary Table 4) were used in reactions for droplet generation, PCR and detection using the Bio-Rad QX200 ddPCR EvaGreen system (Hercules, CA, USA) to search for copy number changes. Reactions were performed as per the manufacturer’s recommendations with an annealing/extension temperature during PCR of 59°C. All data were analysed using Bio-Rad QuantaSoft software version v1.6.6.0320.
Sanger sequencing of the HK1 regulatory element
Sanger sequencing was performed on DNA extracted from peripheral blood leukocytes from 27 individuals with CHI of unknown cause who had undergone pancreatectomy. Briefly, a 397bp genomic region (GRCh37/hg19, Chr10:71,108,536-71,108,932) within intron 2 of HK1 was amplified by PCR (primer sequences are listed in Supplementary Table 4). PCR products were sequenced on an ABI3730 capillary machine (Applied Biosystems, Warrington, UK) and analysed using Mutation Surveyor v3.24 software (SoftGenetics, State College, PA, USA). When a variant was identified, samples from family members were tested to investigate co-segregation and microsatellite analysis using the PowerPlex kit (Promega, Southampton, UK) was performed to confirm family relationships.
Histopathology
Formalin-fixed paraffin-embedded pancreatic tissue was available for immediate analysis from two individuals with HK1 variants (patients 1 and 6, Supplementary Table 2) age-matched non-HI controls (n=2) and age-matched tissues from individuals with HI due to an ABCC8 variant (HI controls n=2). Immunohistochemistry was performed on 5-µm-thick sections of tissue41. For high-content quantification of tissue, sections were first digitized using the 3DHistech Pannoramic 250 Flash II slide scanner (3DHISTECH, Budapest, Hungary) and quantified using QuPath (available at https://qupath.github.io)42.
Immunofluorescence
After dewaxing and rehydration, pancreatic samples from HK1 patients 1 and 6 (Supplementary Table 2) and 4 age-matched control tissues (HI controls (individuals with ABCC8 variants (n=2); non-hyperinsulinism donors (n=2)) were subjected to heat–induced epitope retrieval (HIER) in 10mM citrate pH6 buffer for 20 minutes. After blocking, the sections were probed in a sequential manner with rabbit monoclonal anti-hexokinase 1 (Abcam ab150423; 1/100 overnight), followed by detection with goat-anti-rabbit Alexa Fluor™ 488 (Invitrogen, Paisley, U.K). Following washes, sections were then probed with mouse monoclonal anti-glucagon (Abcam ab10988; 1/2000 for 1h) followed by detection with goat-anti-mouse Alexa Fluor™ 555. Finally, sections were probed with guinea-pig anti-insulin (Agilent; C#IR002; 1/5 for 1h), followed by detection with goat-anti-guinea-pig Alexa Fluor™ 647 in the presence of DAPI (1µg/ml) to identify cell nuclei. Sections were not stripped between successive antibody incubations. After mounting, the sections were imaged via a Leica DMi8 confocal microscope (Leica Microsystems UK, Milton Keynes, UK) (using a focal plane of 0.4µm) and the distribution of HK1, insulin and glucagon examined in multiple islets. In addition, sections were scanned at 40x magnification using an Akoya Biosciences Vectra® Polaris™ Automated Quantitative Pathology Imaging System for quantification analyses. The quantification of the HK1 expression in islets was determined using the Random Forest Classifier Module (Version 3.2.1851.354), DenseNet AI V2 and HighPlex FL v4.04 modules included in Indica Labs HALO Image analysis platform (Version 3.2.1851.354). The Random Forest Classifier was used to identify the islets, then the DenseNet AI V2 module was used to identify beta and alpha cells within the islets. This enabled exclusion of HK1 positive non-islet cells such as red blood cells, stellate cells and endothelial cells which are frequently found in proximity to the islet. The median fluorescence intensity (MFI) of HK1 expression in beta and alpha cells were then calculated using the HighPlex FL v4.04 module. This pipeline was applied to all corresponding tissue sections. GraphPad Prism V9.2.0 (332) was used to demonstrate the median and IQR of beta and alpha cells MFI in each of the donors assessed.
Epigenomic analysis
All published ChIP-seq, ATAC-seq, RNA-seq and scRNA-seq datasets used were obtained from accessions provided in Supplementary Table 5. All code and source data required for all analysis, including to determine disrupted motifs and generate figures is provided in https://github.com/owensnick/HK1FigureNotebook.jl and DOI:10.5281/zenodo.6815326.
Public Bulk ChIP and ATAC-seq
ChIP-seq and ATAC-seq reads aligned with Bowtie243 to the GRCh37/hg19 genome with Bowtie2 v2.3.5.1 with default parameters for single end reads and with additional options “-I 0 -X 1000 --no-discordant --no-mixed” for paired-end reads. Alignments were filtered for those with mapping quality > 30 and then reads with identical aligning coordinates were treated as duplicates and collapsed to a single alignment. For islet H3K27me3 data from roadmap epigenomics project, aligned reads were downloaded in BED/TagAlign format23. All data is visualised as reads per million using https://github.com/owensnick/GenomeFragments.jl. For human islet single-nuclei ATAC-seq data17 (GSE160472), reads from the authors’ combinatorial barcode approach were aligned as above and separated into cell types using author provided cluster labels: https://github.com/kjgaulton/pipelines/tree/master/islet_snATAC_pipeline.
Human islet Hi-C data was obtained from experiment accession TSTSR043623 and file accession DFF064KIG (.hic file) and TSTFF938730 (bedpe file)18. The .hic file from experiment was downloaded and converted to .cool format using hic2cool https://github.com/4dn-dcic/hic2cool with default parameters. Contact matrices were obtained using Cooler44 at 5kb resolution using KR balance and log-transformed for visualisation. To quantify histogram of contact strengths for all islet loops, loop bases were extended by two 5kb HiC bins up- and downstream and the mean of the KR balance normalised matrix of this region quantified per loop.
scRNA-seq Expression Datasets
scRNA-seq data collected over a time course of pancreatic differentiation19 projected onto a differentiation pseudotime was obtained from the reference’s supplementary table. We identity consistent temporal trends using Gaussian Process (GP) regression, following the approach we have previously applied45. Briefly, to stabilise variance we transform data log(αy+ β) where y is the gene expression for a given gene and α = 100, β = 1, then perform GP regression using Matern52 kernel and invert the transformation to report GP median and 95% confidence intervals. All GP regression was performed with GaussianProcesses.jl (https://github.com/STOR-i/GaussianProcesses.jl; https://arxiv.org/abs/1812.09064).
For scRNA-seq data in human islets30 accession GSE101207 we used gene counts per cell for the size healthy donors and normalised by depth per cell. In Extended Data Figures 3, 7 and 8 we show boxplots of normalised counts for each gene over healthy donors.
Finally, we obtained data on gene expression during beta-cell maturation from the Broad Single Cell portal20. In Extended Data Figures 3, 7 and 8 we show mean normalised counts for each gene over annotated cell types.
Motif analysis
To assess motifs maximally disrupted by the variants, we took all motifs in the non-redundant JASPAR database46 and found the maximal scoring match that spanned the position of each variant for both reference and mutated sequences using https://github.com/exeter-tfs/MotifScanner.jl. Motif scores are likelihood-ratio scores of motif PWM over a background of the A,C,G,T frequencies in the hg19 genome. We normalised all motif scores, by the maximal score for each PWM and to both primary and secondary candidate motifs that may be disrupted by the variants. We considered two tiers; Tier 1 having normalised motif score ≥ 0.6, and Tier 2 having 0.45 ≤ normalised motif score < 0.45. For each tier we calculated a disruption score by subtracting the mutated sequence score from the reference sequence score and ranked these across all motifs. To report a single disrupted motif family, we grouped multiple motifs with overlapping alignments by family by removing any trailing numbers from the gene symbol (e.g. NKX2-2 → NKX2, HIC2 → HIC) and all FOX factors we grouped in the FOX family. Tier 1 disrupted motif families include NFAT, NKX2 and FOX (Figure 3e). Patient 11 has two de novo variants, the leftmost is shared with other patients and is a candidate for NFAT disruption, and the rightmost interrupts a HIC family motif (Extended Data Figure 8a). HIC family members include HIC1 and HIC2, with HIC2 the most greatly expressed in beta-cells (Extended Data Figure 8b). Interestingly, HIC2 is a transactivator of SIRT147 and the loss of SIRT1 impairs glucose sensing in beta-cells in mice48. Tier 2 disrupted motif families include TEAD and SMAD (Extended Data Figure 8c). The TEAD family motif shares TTCA consensus with the NFAT family and is an alternative candidate to NFAT, TEAD1 is expressed in beta-cells (Extended Data Figure 8d) and plays a critical role in pancreatic progenitors22, however it should be noted that TEAD1 does not bind the critical region in pancreatic progenitors when the region is bound by FOXA2 (Figure 3b). Multiple members of the SMAD family are expressed in beta-cells (Extended Data Figure 8e), SMAD factors are signal transducers of TGF-beta signalling and play an important role in beta-cell development, function, and proliferation49.
Statistics and Reproducibility
No statistical method was used to predetermine sample size as this study involved genetic testing in individuals with a rare monogenic disease.
Immunofluorescence studies were performed on pancreatic samples from 6 donors. This included two individuals with HK1-hyperinsulinism (patients 1 and 6, Supplementary Table 2), two ABCC8-HI controls and two non-HI controls. Each experiment was performed once due to limited sample availability. Representative data from these imaging studies are shown for one of each donor type in Figure 2a and Extended Data Figure 4a-b.
No data were excluded from any of the experiments described.
Extended Data
Extended Data Figure 1. Integrative Genomics Viewer (IGV) screenshot of the ~4.5Kb deleted regions encompassing the regulatory element in intron 2 of HK1 in patients 1 and 2.1.
Both deletions are heterozygous, so reads are present in the deleted regions from the other haplotype, although the read depth is reduced. Reads highlighted in red have a long insert size. Both features are consistent with a deletion. The deleted region in patient 2.1 has detectable break points, as shown by the discontinuity in read depth and the reads clipped at the deletion boundary. In patient 1, the breakpoints do not result in read depth discontinuity or clipped reads as the deletion occurs between two matching repeating sequences.
Extended Data Figure 2. Histological appearance of islets from HK1-CHI and ABCC8-HI tissue.
Panels A and B illustrate islets from two HK1 CHI patients (patients 1 and 6 Supplemental Table 2) in comparison to age-matched tissue from ABCC8 CHI patients with diffuse pancreatic disease (C and D). The yellow dashed line delineates a single islet in each sample. Nuclear enlargement (B, arrowhead) and nucleomegaly (C, D arrows) was virtually undetected in control islets (n=12/72,145 islet cells) but found in 0.8% of HK1 islets (n=374/46,885 cells) and 4.9% of ABCC8 islet cells (n=70,929). Inserts illustrate the corresponding haematoxylin and eosin images. Scale bar 50 µm
Extended Data Figure 3. HK1 expression in GTEx.
a) Boxplots describing gene level HK1 expression in Genotype-Tissue Expression GTEx project ordered by expression with least expression in liver and pancreas. Data shows sum of GTEx v8 isoform level data calculated with RSEM. The GTEx Project was supported by the Common Fund of the Office of the Director of the National Institutes of Health, and by NCI, NHGRI, NHLBI, NIDA, NIMH, and NINDS. The data used for the analyses described in this manuscript were obtained from the GTEx Portal on 08/26/21. b) Isoform level HK1 expression in GTEX tissues as a), top 3 most expressed isoforms are given. Long HK1 isoform only expressed in testis, short isoform ubiquitously expressed across all tissues. c) Transcript models of top three HK1 isoforms given in b). d) Expression of HK1 and GCK for comparison in scRNA-seq across alpha and beta-cell maturation, all data quantified by (GSE167880)20, which quantified their own data over maturation and adult endocrine cells (GSE114297)31. e) HK1 expression in human islet cell types by scRNA-seq (GSE101207)30. Data points from independent human donors, the central lines in the boxplots correspond to the median, boxes span from the first to the third quartiles, and whiskers extend to the furthest data point within 1.5xIQR from the boxes. HK1 expression only in duct and pancreatic stellate cells, expression absent in endocrine cell types.
Extended Data Figure 4. HK1 expression is present in the beta-cells of donors with HK1-CHI, but not ABCC8-Hyperinsulinism or Non-HI controls.
a) Confocal imaging of HK1 (red); glucagon (yellow); insulin (cyan) and DAPI (dark blue) in pancreatic tissue resected from patient 6 (Supplementary Table 2) with a HK1 variant. HK1 co-localises with insulin, but not with glucagon. Staining in tissue resected from individuals with ABCC8-Hyperinsulinism (HI-control) and non-HI control, showed no expression of HK1 in either alpha (glucagon-positive) or beta (insulin-positive cells). Scale bar – 100 µm. b) Enlarged image of a portion of the islet shown in (a) from an individual with HK1-HI (Patient 6). This image confirms the expression of HK1 specifically in beta cells (white arrows) and absence of HK1 in alpha cells (orange arrows). Scale bar – 20 µm c) HK1 expression (Median fluorescence intensity (MFI)) is higher in the HK1-HI donor beta-cells within HK1 positive islets, when compared to alpha cells (Patient 1 - beta-cells (n=1580); alpha-cells (n=161); and Patient 6 - beta-cells (n=2284); alpha-cells (n=9321)). Two-tailed Mann-Whitney (****p<0.0001). This suggests the impact of the HK1 variant is beta-cell specific. Violin plots shown with the median (dashed line) and IQR (dotted lines) presented. Source data provided in Supplementary Information.
Extended Data Figure 5. Human Islet chromatin conformation at loci spanning HK1 and secreted endocrine genes.
a) Histogram of contact strengths of loops in human islet HiC data18, data shown for all loops called in that study. Vertical line marks the strength of strongest and smallest loop spanning HK1 gene (marked with white arrow in c)). b) Cumulative frequency plot of contacts shown in a), vertical line marks HK1 loop (white arrow in c)), the HK1 loop is in the 86th percentile of all islet HiC loop strengths. c) Heatmap of HiC data shown in a) for region ± 850 kb of HK1 as Figure 3d, now with additional loops within region marked. White arrow gives principal loop spanning HK1 gene marked in a, b). d,e,f) HiC data and loops for endocrine cell secreted genes d) INS, e) SST, and f) GCG, shown at same resolution and scale as HK1 locus c) for context.
Extended Data Figure 6. Chromatin accessibility, Transcription Factor binding and Epigenetic modifications at the HK1 locus over pancreatic differentiation.
a) snATAC-seq data in human islets showing mean chromatin accessibility over nuclei assigned to alpha, beta and delta cell clusters in hormone high and hormone low conditions (GSE160472)17. Critical region containing variants encompassed within grey box. Alpha, beta and delta cells each show accessibility in hormone high state, whereas only beta-cells retain accessibility in hormone low cells. Hormone high and low states characterised by level of accessibility over endocrine cell secreted hormone promoter, alpha – GCG, beta – INS, delta – SST. Data show in reads per million (RPM). b) Chromatin accessibility of pancreatic differentiation assessed by ATAC-seq (GSE149148)21, wider locus view of same data shown in Figure 3c. Critical region containing variants encompassed within grey box. Stages: ES – embryonic stem cell, DE – definitive endoderm, Early/Late PP – pancreatic progenitor. c) Transcription factor binding in pancreatic progenitors (GSE149148)21 and in Carnegie Stage 16-18 liver bud for FOXA2 (E-MTAB-3061)22, data shown as a) and b). Liver bud data reveals that whilst FOXA2 binds critical region in pancreatic progenitors and human islets (Figure 3b), it does not bind in liver bud, suggesting that critical region encodes pancreas specific regulation of HK1. d) Epigenetic modifications over HK1 locus, shown are active marks H3K27ac and H3K4me1 and repressive mark H3K27me3 over in vitro pancreatic cell differentiation (GSE149148)21 stages as in b); in vitro beta-cell line EndoC-BH1 (GSE118588)25 and human islets H3K27ac and H3K4me1 (E-MTAB-1919)16 and H3K27me3 (E087 – roadmap epienomics23); cell-sorted pancreatic alpha, beta and exocrine cells (GSE50386)24. Grey box encompasses critical region containing variants. Data reveals that H3K27ac broadly marks short isoform promoter (Supplemental Figure 3) at ES cells stage is reduced by DE stage, whilst focal mark over regulatory region is present in EndoC-BH1 and human islets. Similarly, H3K4me1 shows string focal enrichment over regulatory region in EndoC-BH1 and islets. Finally, repressive mark H3K27me3 is absent over the locus over the course of pancreatic cell differentiation and is present in a broad domain in adult islets, EndoC-BH1 cells, beta, cells and to reduced extent in exocrine cells.
Extended Data Figure 7. Expression of family members of disrupted transcription factor binding motifs in critical region.
NFAT family, NKX2-2 and FOXA2: a) In single-cell RNA-seq in human islets (GSE101207)30, boxplots of mean normalised counts from independent islet donors in cells assigned to islet cell type clusters (data shown as in Supplemental Figure 3e). Data points from independent human donors, the central lines correspond to the median, boxes span from the first to the third quartiles, and whiskers extend to the furthest data point within 1.5xIQR from the boxes. b) Over the course of pancreatic differentiation from embryonic stem cells to maturing beta-cells, expression data over a beta-cell differentiation psuedotime19, Gaussian process regression median and 95% CI shown. ES – embryonic stem cell, DE – definitive endoderm, GT – gut tube, FG – foregut, PE – pancreatic endoderm, EP – endocrine precursors, IB – immature beta-cells, MB – maturing beta-cells. c) scRNA-seq across alpha and beta-cell maturation, all data quantified by (GSE167880)20, which quantified their own data over maturation and adult endocrine cells (GSE114297)31.
Extended Data Figure 8. Additional transcription factor motifs disrupted by variants.
a-b) Sanger sequencing identified 2 in cis de novo variants 12 base pairs apart (g.71,108,648C>G and g.71,108,660G>A) in patient 11. g.71,108,648 is affected by 3 different de novo substitutions in 6 individuals and resides within a predicted NFATC binding motif. Two further de novo variants identified in 3 patients are also predicted to impact on NFATC binding (Figure 3). In contrast, further variants affecting g.71,108,660 have not been identified, this suggests that the g.71,108,648C>G is causative of the disease in this patient. Nevertheless, it is of note that g.71,108,660G>A is predicted to affect a HIC family motif a) and human islet scRNA-seq (GSE101207)30 determines HIC2 is expressed in beta-cells b) Interestingly, HIC2 is a transactivator of SIRT147 and the loss of SIRT1 impairs glucose sensing in beta-cells in mice48. c-e) Tier 2, secondary candidate motifs for disruption have lower normalised motif scores than Tier 1 matches. c) Two additional motif families are implicated TEAD and SMAD. TEAD shares partial consensus with NFAT (TTCCA) and is alternative candidate to NFAT. TEAD1 but not TEAD2 is expressed in beta-cells e) and TEAD1 plays a critical role in pancreatic progenitors22, however it should be noted that TEAD1 does not bind the critical region in pancreatic progenitors when the region is bound by FOXA2 (Figure 3b). The SMAD family are signal transducers of TGF-beta signalling and play an important role in beta-cell development, function, and proliferation49. For all boxplots, in panels (b, d, e) central lines correspond to the median, boxes span from the first to the third quartiles, and whiskers extend to the furthest data point within 1.5xIQR from the boxes.
Supplementary Material
Acknowledgements
We wish to thank Dr Pia Leete (IBEx, University of Exeter) for technical assistance with confocal microscopy. SEF has a Sir Henry Dale Fellowship jointly funded by the Wellcome Trust and the Royal Society (105636/Z/14/Z). MaNW and MBJ are the recipients of an Independent Fellowship and TWL and NO a Lectureship from the Exeter Diabetes Centre of Excellence funded by Research England’s Expanding Excellence in England (E3) fund. EDF is the recipient of a Diabetes UK RD Lawrence fellowship (19/0005971). KAP has a Career Development fellowship funded by the Wellcome Trust (219606/Z/19/Z). IB and MJD were supported by the Northern Congenital Hyperinsulinism (NORCHI) charitable fund, by the Manchester Academic Health Sciences Centre and by The University of Manchester MRC Confidence in Concept (CiC) Award (MC_PC_18056). SJR and NGM are grateful to JDRF for funding (2-SRA-2018-474-S-B). Ivo Barić is member of the European Reference Network for Rare Hereditary Metabolic Disorders (MetabERN), Project ID No 739543.This research was performed with the support of the Network for Pancreatic Organ donors with Diabetes (nPOD; RRID:SCR_014641), a collaborative type 1 diabetes research project supported by JDRF (nPOD: 5-SRA-2018-557-Q-R) and The Leona M. & Harry B. Helmsley Charitable Trust (Grant#2018PG-T1D053, G-2108-04793). The content and views expressed are the responsibility of the authors and do not necessarily reflect the official view of nPOD. Organ Procurement Organizations (OPO) partnering with nPOD to provide research resources are listed at http://www.jdrfnpod.org/for-partners/npod-partners/. This research has been conducted using the UK Biobank Resource. This work was carried out under UK Biobank project number 9055 and 9072. Targeted next-generation sequencing to exclude known disease-causing variants was funded by Congenital Hyperinsulinism International (a 501(c)3 organisation) for 20 individuals within the discovery cohort. The funders had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript.
This manuscript is dedicated to the memory of our colleague and friend, Professor Mark Dunne, who will be greatly missed. Mark was an inspirational scientist whose pioneering research served to improve the lives of children and families affected by Congenital Hyperinsulinism.
This research was funded in whole, or in part, by Wellcome [105636/Z/14/Z]. For the purpose of open access, the author has applied a CC BY public copyright licence to any Author accepted Manuscript version arising from this submission
Footnotes
Author Contributions Statement
S.E.F. designed the study. A.D, S.E., I.B., S.E.F. and the International Congenital Hyperinsulinism Consortium recruited patients to the study. Ma.N.W*, M.B.J., J.A.L.H., R.V-H., S.E. and S.E.F. performed the molecular genetic analysis and the interpretation of resulting data. Ma.N.W. and Mi.N.W. performed the bioinformatic analysis, T.I.H., J.H., I.B., A.T.H. and S.E.F. analysed the clinical data. E.C. and M.J.D. performed the pathological analysis of the pancreatic tissue, J.R.H, C.S.F., R.W., N.G.M. and S.J.R. designed and performed the immunohistochemistry studies, N.D.L.O. designed and performed the epigenomic data analysis, Ma.N.W., N.D.L.O., J.R.H, M.B.J., T.W.L., E.D-F., K.A.P., A.T.H., M.J.D., S.J.R and S.E.F. prepared the draft manuscript. All authors contributed to the discussion of the results and to manuscript preparation.
Competing Interests Statement
The authors declare no competing interests.
Data Availability
All non-clinical data analysed during this study are included in this published article (and its supplementary information files). Clinical and genotype data can be used to identify individuals and is therefore available only through collaboration to experienced teams working on approved studies examining the mechanisms, cause, diagnosis and treatment of diabetes and other beta-cell disorders. Requests for collaboration will be considered by a steering committee following an application to the Genetic Beta Cell Research Bank (https://www.diabetesgenes.org/current-research/genetic-beta-cell-research-bank/). Contact by email should be directed to Prof Sarah Flanagan (S.Flanagan@exeter.ac.uk). All requests for access to data will be responded to within 14 days.
Accession codes and DOI numbers for all ChIP-seq, ATAC-seq, RNA-seq and scRNA-seq datasets are provided in Supplementary Table 5. We used the Genome Reference Consortium Human Build 37 (GRCh37) to annotate genetic data. Accession number: GCF_000001405.13. Details of this assembly are provided at: https://www.ncbi.nlm.nih.gov/assembly/GCF_000001405.13/.
Source data
Source data for Figure 2b and Extended Figure 4c are provided in the Additional Supplementary Files. For Figure 3 and Extended Data Figures 3, 5, 6, 7, 8 all source data is contained in https://github.com/owensnick/HK1FigureNotebook.jl and DOI:10.5281/zenodo.6815326.
Code Availability
The code used in this study can be freely downloaded from:
https://github.com/rdemolgen/SavvySuite
https://github.com/owensnick/HK1FigureNotebook.jl
https://github.com/owensnick/GenomeFragments.jl
https://github.com/kjgaulton/pipelines/tree/master/islet_snATAC_pipeline
https://github.com/4dn-dcic/hic2cool
https://github.com/STOR-i/GaussianProcesses.jl
References
- 1.Pullen TJ, et al. Identification of genes selectively disallowed in the pancreatic islet. Islets. 2010;2:89–95. doi: 10.4161/isl.2.2.11025. [DOI] [PubMed] [Google Scholar]
- 2.Spielmann M, Mundlos S. Looking beyond the genes: the role of non-coding variants in human disease. Hum Mol Genet. 2016;25:R157–R165. doi: 10.1093/hmg/ddw205. [DOI] [PubMed] [Google Scholar]
- 3.Ferreira CR. The burden of rare diseases. Am J Med Genet A. 2019;179:885–892. doi: 10.1002/ajmg.a.61124. [DOI] [PubMed] [Google Scholar]
- 4.Online Mendelian Inheritance in Man. OMIM. McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University; Baltimore, MD: [Accessed 18/11/2021]. World Wide Web URL: https://omim.org/ [Google Scholar]
- 5.Turro E, et al. Whole-genome sequencing of patients with rare diseases in a national health system. Nature. 2020;583:96–102. doi: 10.1038/s41586-020-2434-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Shashi V, et al. The utility of the traditional medical genetics diagnostic evaluation in the context of next-generation sequencing for undiagnosed genetic disorders. Genet Med. 2014;16:176–82. doi: 10.1038/gim.2013.99. [DOI] [PubMed] [Google Scholar]
- 7.Sawyer SL, et al. Utility of whole-exome sequencing for those near the end of the diagnostic odyssey: time to address gaps in care. Clin Genet. 2016;89:275–84. doi: 10.1111/cge.12654. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Kapoor RR, et al. Clinical and molecular characterisation of 300 patients with congenital hyperinsulinism. Eur J Endocrinol. 2013;168:557–64. doi: 10.1530/EJE-12-0673. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Snider KE, et al. Genotype and phenotype correlations in 417 children with congenital hyperinsulinism. J Clin Endocrinol Metab. 2013;98:E355–63. doi: 10.1210/jc.2012-2169. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Halldorsson BV, et al. The sequences of 150,119 genomes in the UK biobank. bioRxiv. 2021:2021.11.16.468246 [Google Scholar]
- 11.Lek M, et al. Analysis of protein-coding genetic variation in 60,706 humans. Nature. 2016;536:285–91. doi: 10.1038/nature19057. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Karczewski KJ, et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature. 2020;581:434–443. doi: 10.1038/s41586-020-2308-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Robey RB, Hay N. Mitochondrial hexokinases, novel mediators of the antiapoptotic effects of growth factors and Akt. Oncogene. 2006;25:4683–96. doi: 10.1038/sj.onc.1209595. [DOI] [PubMed] [Google Scholar]
- 14.Becker TC, BeltrandelRio H, Noel RJ, Johnson JH, Newgard CB. Overexpression of hexokinase I in isolated islets of Langerhans via recombinant adenovirus. Enhancement of glucose metabolism and insulin secretion at basal but not stimulatory glucose levels. J Biol Chem. 1994;269:21234–8. [PubMed] [Google Scholar]
- 15.Bianchi M, Magnani M. Hexokinase mutations that produce nonspherocytic hemolytic anemia. Blood Cells Mol Dis. 1995;21:2–8. doi: 10.1006/bcmd.1995.0002. [DOI] [PubMed] [Google Scholar]
- 16.Pasquali L, et al. Pancreatic islet enhancer clusters enriched in type 2 diabetes risk-associated variants. Nat Genet. 2014;46:136–143. doi: 10.1038/ng.2870. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Chiou J, et al. Single-cell chromatin accessibility identifies pancreatic islet cell type- and state-specific regulatory programs of diabetes risk. Nat Genet. 2021;53:455–466. doi: 10.1038/s41588-021-00823-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Greenwald WW, et al. Pancreatic islet chromatin accessibility and conformation reveals distal enhancer networks of type 2 diabetes risk. Nat Commun. 2019;10:2078. doi: 10.1038/s41467-019-09975-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Weng C, et al. Single-cell lineage analysis reveals extensive multimodal transcriptional control during directed beta-cell differentiation. Nat Metab. 2020;2:1443–1458. doi: 10.1038/s42255-020-00314-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Balboa D, et al. Functional, metabolic and transcriptional maturation of human pancreatic islets derived from stem cells. Nat Biotechnol. 2022 doi: 10.1038/s41587-022-01219-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Geusz RJ, et al. Pancreatic progenitor epigenome maps prioritize type 2 diabetes risk genes with roles in development. Elife. 2021;10 doi: 10.7554/eLife.59067. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Cebola I, et al. TEAD and YAP regulate the enhancer network of human embryonic pancreatic progenitors. Nat Cell Biol. 2015;17:615–626. doi: 10.1038/ncb3160. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Roadmap Epigenomics C, et al. Integrative analysis of 111 reference human epigenomes. Nature. 2015;518:317–30. doi: 10.1038/nature14248. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Bramswig NC, et al. Epigenomic plasticity enables human pancreatic alpha to beta cell reprogramming. J Clin Invest. 2013;123:1275–84. doi: 10.1172/JCI66514. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Lawlor N, et al. Multiomic Profiling Identifies cis-Regulatory Networks Underlying Human Pancreatic beta Cell Identity and Function. Cell Rep. 2019;26:788–801.:e6. doi: 10.1016/j.celrep.2018.12.083. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Giri D, et al. Novel FOXA2 mutation causes Hyperinsulinism, Hypopituitarism with Craniofacial and Endoderm-derived organ abnormalities. Hum Mol Genet. 2017;26:4315–4326. doi: 10.1093/hmg/ddx318. [DOI] [PubMed] [Google Scholar]
- 27.Papizan JB, et al. Nkx2.2 repressor complex regulates islet beta-cell specification and prevents beta-to-alpha-cell reprogramming. Genes Dev. 2011;25:2291–305. doi: 10.1101/gad.173039.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Dhawan S, et al. DNA methylation directs functional maturation of pancreatic beta cells. J Clin Invest. 2015;125:2851–60. doi: 10.1172/JCI79956. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Keller MP, et al. The Transcription Factor Nfatc2 Regulates beta-Cell Proliferation and Genes Associated with Type 2 Diabetes in Mouse and Human Islets. PLoS Genet. 2016;12:e1006466. doi: 10.1371/journal.pgen.1006466. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Fang Z, et al. Single-Cell Heterogeneity Analysis and CRISPR Screen Identify Key beta-Cell-Specific Disease Genes. Cell Rep. 2019;26:3132–3144.:e7. doi: 10.1016/j.celrep.2019.02.043. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Xin Y, et al. Pseudotime Ordering of Single Human beta-Cells Reveals States of Insulin Production and Unfolded Protein Response. Diabetes. 2018;67:1783–1794. doi: 10.2337/db18-0365. [DOI] [PubMed] [Google Scholar]
- 32.Lemaire K, Thorrez L, Schuit F. Disallowed and Allowed Gene Expression: Two Faces of Mature Islet Beta Cells. Annu Rev Nutr. 2016;36:45–71. doi: 10.1146/annurev-nutr-071715-050808. [DOI] [PubMed] [Google Scholar]
- 33.Pinney SE, et al. Dominant form of congenital hyperinsulinism maps to HK1 region on 10q. Horm Res Paediatr. 2013;80:18–27. doi: 10.1159/000351943. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Henquin JC, et al. Congenital hyperinsulinism caused by hexokinase I expression or glucokinase-activating mutation in a subset of beta-cells. Diabetes. 2013;62:1689–96. doi: 10.2337/db12-1414. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Otonkoski T, et al. Physical exercise-induced hypoglycemia caused by failed silencing of monocarboxylate transporter 1 in pancreatic beta cells. Am J Hum Genet. 2007;81:467–74. doi: 10.1086/520960. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Almeida AM, et al. Hypomorphic promoter mutation in PIGM causes inherited glycosylphosphatidylinositol deficiency. Nat Med. 2006;12:846–51. doi: 10.1038/nm1410. [DOI] [PubMed] [Google Scholar]
- 37.Thornton PS, et al. Recommendations from the Pediatric Endocrine Society for Evaluation and Management of Persistent Hypoglycemia in Neonates, Infants, and Children. J Pediatr. 2015;167:238–45. doi: 10.1016/j.jpeds.2015.03.057. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Ellard S, et al. Improved genetic testing for monogenic diabetes using targeted next-generation sequencing. Diabetologia. 2013;56:1958–63. doi: 10.1007/s00125-013-2962-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Campbell-Thompson M, et al. Network for Pancreatic Organ Donors with Diabetes (nPOD): developing a tissue biobank for type 1 diabetes. Diabetes Metab Res Rev. 2012;28:608–17. doi: 10.1002/dmrr.2316. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Laver TW, et al. SavvyCNV: Genome-wide CNV calling from off-target reads. PLoS Comput Biol. 2022;18:e1009940. doi: 10.1371/journal.pcbi.1009940. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Han B, et al. Enhanced Islet Cell Nucleomegaly Defines Diffuse Congenital Hyperinsulinism in Infancy but Not Other Forms of the Disease. Am J Clin Pathol. 2016;145:757–68. doi: 10.1093/ajcp/aqw075. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Bankhead P, et al. QuPath: Open source software for digital pathology image analysis. Sci Rep. 2017;7:16878. doi: 10.1038/s41598-017-17204-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9:357–9. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Abdennur N, Mirny LA. Cooler: scalable storage for Hi-C data and other genomically labeled arrays. Bioinformatics. 2020;36:311–316. doi: 10.1093/bioinformatics/btz540. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Owens NDL, et al. Measuring Absolute RNA Copy Numbers at High Temporal Resolution Reveals Transcriptome Kinetics in Development. Cell Rep. 2016;14:632–647. doi: 10.1016/j.celrep.2015.12.050. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Fornes O, et al. JASPAR 2020: update of the open-access database of transcription factor binding profiles. Nucleic Acids Res. 2020;48:D87–D92. doi: 10.1093/nar/gkz1001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Song JY, et al. HIC2, a new transcription activator of SIRT1. FEBS Lett. 2019;593:1763–1776. doi: 10.1002/1873-3468.13456. [DOI] [PubMed] [Google Scholar]
- 48.Luu L, et al. The loss of Sirt1 in mouse pancreatic beta cells impairs insulin secretion by disrupting glucose sensing. Diabetologia. 2013;56:2010–20. doi: 10.1007/s00125-013-2946-5. [DOI] [PubMed] [Google Scholar]
- 49.Lee JH, Lee JH, Rane SG. TGF-beta Signaling in Pancreatic Islet beta Cell Development and Function. Endocrinology. 2021;162 doi: 10.1210/endocr/bqaa233. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All non-clinical data analysed during this study are included in this published article (and its supplementary information files). Clinical and genotype data can be used to identify individuals and is therefore available only through collaboration to experienced teams working on approved studies examining the mechanisms, cause, diagnosis and treatment of diabetes and other beta-cell disorders. Requests for collaboration will be considered by a steering committee following an application to the Genetic Beta Cell Research Bank (https://www.diabetesgenes.org/current-research/genetic-beta-cell-research-bank/). Contact by email should be directed to Prof Sarah Flanagan (S.Flanagan@exeter.ac.uk). All requests for access to data will be responded to within 14 days.
Accession codes and DOI numbers for all ChIP-seq, ATAC-seq, RNA-seq and scRNA-seq datasets are provided in Supplementary Table 5. We used the Genome Reference Consortium Human Build 37 (GRCh37) to annotate genetic data. Accession number: GCF_000001405.13. Details of this assembly are provided at: https://www.ncbi.nlm.nih.gov/assembly/GCF_000001405.13/.
Source data
Source data for Figure 2b and Extended Figure 4c are provided in the Additional Supplementary Files. For Figure 3 and Extended Data Figures 3, 5, 6, 7, 8 all source data is contained in https://github.com/owensnick/HK1FigureNotebook.jl and DOI:10.5281/zenodo.6815326.
The code used in this study can be freely downloaded from:
https://github.com/rdemolgen/SavvySuite
https://github.com/owensnick/HK1FigureNotebook.jl
https://github.com/owensnick/GenomeFragments.jl
https://github.com/kjgaulton/pipelines/tree/master/islet_snATAC_pipeline
https://github.com/4dn-dcic/hic2cool
https://github.com/STOR-i/GaussianProcesses.jl











