Abstract
Understanding pancreas development can provide clues for better treatments of pancreatic diseases. However, the molecular heterogeneity and developmental trajectory of the early human pancreas are poorly explored. Here, we performed large-scale single-cell RNA sequencing and single-cell assay for transposase accessible chromatin sequencing of human embryonic pancreas tissue obtained from first-trimester embryos. We unraveled the molecular heterogeneity, developmental trajectories and regulatory networks of the major cell types. The results reveal that dorsal pancreatic multipotent cells in humans exhibit different gene expression patterns than ventral multipotent cells. Pancreato-biliary progenitors that generate ventral multipotent cells in humans were identified. Notch and MAPK signals from mesenchymal cells regulate the differentiation of multipotent cells into trunk and duct cells. Notably, we identified endocrine progenitor subclusters with different differentiation potentials. Although the developmental trajectories are largely conserved between humans and mice, some distinct gene expression patterns have also been identified. Overall, we provide a comprehensive landscape of early human pancreas development to understand its lineage transitions and molecular complexity.
Subject terms: Organogenesis, Data publication and archiving, Embryogenesis
Here, the authors revealed molecular heterogeneity, developmental trajectory and regulatory network of early human pancreas development, and depict the whole progression of pancreatic organogenesis during the first trimester at the single-cell level.
Introduction
The pancreas is an essential digestive and endocrine organ responsible for nutrient metabolism in the body1,2. Ninety-nine percent of the pancreatic epithelium consists of exocrine tissue, including the acinar cells that secrete digestive enzymes and the ductal cells that transport these enzymes to the intestine. The remaining 1% consists of endocrine tissue, known as the islets of Langerhans, which comprise five distinct endocrine cell types, including α/β/δ/PP/ε cells, mostly responsible for regulating glucose homeostasis1. Dysfunctions of these cells cause a variety of disorders, such as pancreatitis, pancreatic cancer and diabetes. Understanding the embryonic development of the pancreas, especially cell fate decisions and endocrine cell differentiation, may help us to improve the differentiation protocols for pancreatic cells from human pluripotent stem cells (hPSCs) in vitro3–9.
Benefitting from powerful genetic tools and animal models, previous studies have revealed the important molecular events for pancreas organogenesis in rodents1,10,11. The pancreas originates from both the dorsal and ventral endoderm domains, which receive different signals from adjacent tissues. This process begins on embryonic day (E) 8.5 in mice and at 27-29 days post conception in humans12,13. The ventral pancreas emerges later than the dorsal pancreas, and it shares common progenitors with the liver and extrahepatic bile ducts (EHBD)14,15. The two pancreatic buds contain multipotent progenitor (MP) cells that can differentiate into all lineages of the pancreatic epithelium and eventually fuse to form a single organ due to gut tube rotation. PDX1, FOXA2 and PTF1A are the key regulators of MP cell specification16,17. Cell proliferation enlarges the pancreatic epithelium and reshapes it to form branched tubular structures. Simultaneously, MP cells develop into tip cells that have the potential to differentiate into acinar cells and trunk cells that have the bipotential to differentiate into endocrine and ductal cells18,19. Two transcription factors (TFs), PTF1A which promotes tip fate and NKX6-1 which induces trunk fate are the master regulators in this process19. Endocrine progenitor (EP) cells are regulated by NEUROG3 and delaminate from the trunk domain to differentiate into endocrine cells20,21. Beta cell maturation, which is regulated by certain factors such as MAFA, is a long process22,23. While many conclusions about pancreas development and in vitro differentiation protocols have been made based on the studies in mice, species differences between humans and mice have been noticed24,25; for example, there are two peaks of endocrine differentiation in mice and only a single phase in humans13,26.
Single-cell RNA sequencing (scRNA-seq) and single-cell assay for transposase accessible chromatin sequencing (scATAC-seq) are powerful tools that have already been applied in developmental biology. Recent studies in mice and humans have revealed the cellular composition, molecular heterogeneity and developmental trajectory of pancreatic cells in fetuses at the single-cell resolution27–36. However, due to the scarcity of human embryo samples and the difficulty of pancreas isolation from early embryos, little is known about the molecular features and regulatory network of early pancreatic development in humans, especially before post-conception week (PCW) 8.
Here, we performed scRNA-seq of human embryonic pancreas tissues collected from PCW 4 to 11 and scATAC-seq of tissues from PCW 8 to 11. We profiled the major cell types of pancreatic epithelial cells, including dorsal and ventral MP cells, and revealed their molecular heterogeneity and developmental trajectories. We analyzed the TFs, regulatory networks and signaling pathways of acinar and ductal lineage cells and revealed the regulatory network of endocrinogenesis and the transcription dynamics of EP and endocrine cells. We further compared the developmental trajectories between humans and mice and identified several distinct features in each species, such as differences in gene expression patterns. Taken together, our data depict the whole trajectory of pancreatic organogenesis during the first trimester (PCW 4-11) at the single-cell level.
Results
Cell diversity of the human pancreas in early development
We collected human embryonic pancreas samples at 8 time points from PCW 4 to 11 from 17 donors, including 6 males and 11 females (Supplementary Data 1). After the digestion of the isolated pancreas, we performed scRNA-seq of all 17 processed samples using the 10x Genomics platform (Fig. 1a, Supplementary Data 1). In total, 68,714 cells passed the quality control procedures, with an average of 3,000 expressed genes per cell (Supplementary Fig. 1a). Our data showed high similarity among the samples from the same time point (Supplementary Fig. 1b, c). Because PCW 4-6 samples also contained non-pancreatic cells, we analyzed and presented our dataset in two groups for batch correction, dimension reduction and clustering. A total of six major cell-type classes were identified, including epithelial (EPCAM + ), mesenchymal (COL3A1 + ), endothelial (PECAM1 + ), neural (ASCL1 + ), immune (PTPRC + ), and erythroid (HBA1 + ) cells (Supplementary Fig. 1d–h). Our data showed the continuity of cells in the same cell class across different time points (Supplementary Fig. 1i). Mesenchymal cells constituted the majority of both two datasets, and their proportion differed more obviously between PCW 4-6 and PCW 7-11 owing to the different sample isolation methods (Supplementary Fig. 1h, i). From PCW 7 to 11, the proportion of mesenchymal cells gradually decreased, while epithelial and other classes of cells increased (Supplementary Fig. 1i).
Molecular heterogeneity in the early human embryonic pancreatic epithelium
Epithelial cells are the main components of the pancreas and perform its basic functions of the pancreas. To investigate the transcriptional profile of the development of pancreatic epithelial cells, we used EPCAM and PDX1 as pancreatic epithelial markers. Based on this approach, we obtained a total of 17,135 pancreatic epithelial cells from PCW 4 to 11 and included all of them into a common cluster analysis. We identified 13 single-cell-type clusters, which represented all epithelial lineages in the early development of the pancreas (Fig. 1b, c, Supplementary Fig. 2a, b). MP cells only existed in PCW 4 to 5, during which their numbers gradually decreased (Fig. 1c). The numbers of early tip and early trunk cells gradually increased from PCW 4 to 7. Duct and acinar cells emerged after PCW 10 (Fig. 1c). A large number of EPs and endocrine cells were generated after PCW 8 (Fig. 1c). The expression patterns of the top 100 differentially expressed genes (DEGs) in each cluster were visualized in a heatmap, with TFs highlighted (Fig. 1d, Supplementary Fig. 2d and Supplementary Data 2). GATA4- and FOXA2-positive MP cells could be divided into two groups; the dorsal MP cells expressed NR2F1, while the ventral MP cells expressed TBX3 (Fig. 1d, e, Supplementary Fig. 2d, e). We present the analyses of the dorsal and ventral MP cells in later sections. Early tip, tip, and acinar cells expressed increasing levels of CPA2, RBPJL and CTRB2 (Fig. 1e, Supplementary Fig. 2d, e). The acinar cells expressing CLPS and CTRB1, which mainly appeared in PCW 11, did not show the expression of any amylase-associated genes (Fig. 1c, Supplementary Fig. 2d, e). These results indicated that the acinar cells identified in our dataset during embryonic development were still immature and lacked sufficient digestive enzymes. Early trunk, trunk, and duct cells expressed increasing levels of HES4, DCDC2, and CFTR (Fig. 1d, e, Supplementary Fig. 2d, e). Several endocrine cells expressed high levels of endocrine hormones, and EP cells expressed high levels of NEUROG3 (Fig. 1d, Supplementary Fig. 2d, e). A negligible number of endocrine cells were found in PCW 4 samples, and these cells disappeared in PCW 5-7 samples (Fig. 1c, Supplementary Fig. 2b, c). Then, the population of endocrine cells gradually increased in pancreatic epithelial cells after PCW 8 and peaked at PCW 10 and 11 (Fig. 1c, Supplementary Fig. 2b, c), suggesting that large numbers of endocrine cells differentiated after PCW 8. The developmental trajectory constructed by Monocle337 showed the branches of acinar, duct, and endocrine lineage cells (Fig. 1f). Considering these results combination with RNA velocity analysis38,39, we inferred that dorsal and ventral MP cells differentiated into early tip and trunk cells. Then, the early tip cells developed into tip and acinar cells, while the early trunk cells developed into trunk and duct cells. Trunk cells are bipotential cells that differentiate into EP and duct cells (Supplementary Fig. 2f). Collectively, these results showed the developmental trajectory of pancreatic epithelial cells in PCW 4-11. Next, we focused on the development of major lineages in pancreatic epithelial cells.
Dorsal and ventral multipotent progenitor cells have different expression patterns
The pancreas arises from two buds located on the dorsal and ventral sides of the distal foregut endoderm. Beginning in PCW 6 in humans, the ventral pancreas contacts the dorsal pancreas as a result of gut rotation and finally fuses in PCW 840–42. To characterize the differences between dorsal and ventral MP cells, a differential gene expression analysis was conducted (Fig. 2a). Gene Ontology (GO) analysis of the identified DEGs showed that dorsal MP cells were related to Wnt signaling, cell junction assembly and synapse organization, while ventral MP cells were associated with ribosome assembly, muscle tissue development and myoblast differentiation (Fig. 2b). Both dorsal and ventral MP cells expressed common pancreatic markers such as PDX1, PTF1A, and NKX6-1 (Fig. 2c). However, ventral MP cells specifically expressed the TFs TBX3 and SOX6, while dorsal MP cells expressed NR2F1 and SIM1 (Fig. 2a, c). Wnt signaling-related genes (GPC3, FRZB, FZD5 and LYPD6) were highly expressed in dorsal MP cells, while the BMP signaling target TFs (ID1, ID2 and ID3) were highly expressed in ventral MP cells (Fig. 2c). These results implied that dorsal and ventral MP cells received different signals during their development.
Ventral multipotent progenitor cells originate from pancreato-biliary progenitors
Other endoderm-derived organ primordia, such as the liver, EHBD (including gallbladder) and duodenum, are adjacent to the dorsal and ventral pancreas14,15. The hepato-pancreato-biliary organ system originates from a common ventral endoderm progenitor compartment in mice14.To further characterize the relationships between these organs, we analyzed the epithelial cells from these organs in PCW 4 to 5. Six cell clusters were identified based on tissue-specific genes, including dorsal MP (PDX1 + /NR2F1 + ), ventral MP (PDX1 + /TBX3 + ), pancreato-biliary (PB) progenitors (ISL1 + /SULT1E1 + ), EHBD (SPP1 + /SULT1E1 + ), enterocyte (CDX2 + ), and hepatoblast (ALB + ) (Fig. 2d, e, Supplementary Fig. 3a–c and Supplementary Data 3). We identified a small population of cells with low expression of PDX1 and high expression of HHEX, ISL1, SULT1E1, and NKX6-2 and defined them as PB progenitors (Fig. 2d–f). GO analysis of DEGs in PB progenitors showed that these cells were related to pattern specification process, Wnt signaling pathway, neuron projection development, growth factor stimulus and gland development (Fig. 2g). RNA velocity analysis demonstrated that ventral MP and EHBD cells originated from PB progenitors (Fig. 2h). We also depicted the developmental trajectory of PB, EHBD, and ventral MP cells and calculated the pseudotime for each cell type with Monocle3 (Supplementary Fig. 3d)37. The results revealed that PB progenitors have the potential to differentiate into ventral MP and EHBD cells. No connectivity between PB progenitors and hepatoblasts was observed. We performed differential gene expression analysis between ventral MP and EHBD cells. Some TFs, such as PDX1, NKX6-1, RBPJ, and PTF1A, were expressed at high levels in ventral MP cells, while NR2F2, SOX4, HHEX, DACH1, ONECUT2, and NR2F1 were expressed at moderate levels in EHBD cells (Supplementary Fig. 3e). FGF and Slit-Robo signaling were increased in EHBD cells, while Notch signaling was upregulated in ventral MP cells (Supplementary Fig. 3e). Collectively, PB progenitors could differentiate into ventral MP and EHBD cells (Fig. 2i).
scATAC-seq profiling of the developing human pancreas
To investigate the underlying gene regulatory programs driving the bifurcation of cell fate and the continuous differentiation of each pancreatic lineage, we performed scATAC-seq profiling of human embryonic pancreas tissues from PCW 8-11 (Fig. 3a). We obtained 12,288 cells after quality control (Supplementary Fig. 4a). The major cell classes identified by scATAC-seq were consistent with the scRNA-seq results (Supplementary Fig. 4b, c), among which epithelial cells were inspected more closely to focus on pancreatic epithelium differentiation. Cell types in acinar, ductal and endocrine lineages were identified in the scATAC-seq data, consistent with the scRNA-seq results (Fig. 3b). The chromatin accessibility patterns of marker genes for each cell type were consistent with their cellular expression (Supplementary Fig. 4d). Intriguingly, specific cell populations, including acinar, duct EP and endocrine cells, were identifiable earlier in the scATAC-seq data than in the scRNA-seq; that is, the corresponding marker regions appeared to be accessible earlier than the mRNAs were detected (Fig. 3c, Supplementary Fig. 4e), which may reflect the occurrence of gene regulation prior to gene expression. Pseudotime analysis revealed similar differential trajectories to those observed in scRNA-seq, reflecting high consistency between the scRNA-seq and scATAC-seq profiles (Fig. 3d). The expression patterns of cell-type-specific peaks and their corresponding genes were compared for each cell type (Fig. 3e, Supplementary Data 4). Notably, there were far more distinct peaks in mature acinar cells and endocrine lineage cells, possibly due to their relative maturity and gradual acquisition of more complex functions (Fig. 3e).
Cell fate determination in developing acinar and ductal lineage cells
The main populations in the pancreatic epithelium are composed of acinar (tip) and ductal (trunk) lineage cells. To investigate the cell fate determination of these two lineages, we compared their gene expression patterns. HES4 and RBPJL were differentially expressed between early tip and early trunk cells, and the differences were more pronounced between mature tip and trunk cells (Supplementary Fig. 5a, Fig. 4a). We performed branched gene expression analysis on the differential directions of tip and trunk cells and highlighted the TFs (Fig. 4b). GO analysis of these genes revealed that Notch, Wnt, MAPK pathway, actin, cell junctions and nervous system development were activated in trunk and duct cells, whereas the metal ion homeostasis, peptidase activity, digestion and intrinsic apoptotic signaling pathway were activated in tip and acinar cells (Fig. 4c). The active Notch signaling pathway and its downstream TF Hes1 make MP cells acquire a trunk and ductal fate in mice19,43–45. We found that HES1 and another Notch signaling-related TF, HEY1, were highly expressed in ductal lineage cells (Fig. 4d). Moreover, ductal lineage cells showed higher expression of HES4, which is a homeolog of HES1 and shows no orthologous genes in mice (Fig. 4d, e). This finding indicated that the Notch signaling pathway is important for ductal lineage cell specification in the human pancreas. To characterize the signaling pathway between tip, trunk cells and supporting cells, we divided supporting cells into six clusters, including fibroblasts, mesothelial cells, pericytes, immune cells, neural cells and endothelial cells (Supplementary Fig. 5b, c). Cell‒cell communication analysis identified interactions via the Notch signaling pathway mainly involving DLK1 and JAG1 in fibroblasts and pericytes and NOTCH1, NOTCH2 and NOTCH3 in trunk and duct cells (Fig. 4f, Supplementary Fig. 5e, f). The FGF signaling pathway is important for pancreatic development46–48. We found that FGF signaling was much stronger in trunk and duct cells than in tip and acinar cells (Supplementary Fig. 5d). FGFR2 was highly expressed in trunk and ductal cells, and its ligands FGF7 and FGF9 were expressed in fibroblasts and mesothelial cells (Fig. 4g, Supplementary Fig. 5d, f, g). In addition, an NTF4-NTRK2 interaction was identified between mesothelial cells and ductal lineage cells, and a BDNF-NTRK2 interaction was identified between pericytes and ductal lineage cells (Fig. 4g, Supplementary Fig. 5f, g). FGFR2 and NTRK2 are both tyrosine kinase receptors that take part in the MAPK cascade. These results showed that Notch and MAPK signaling promoted trunk and duct cell differentiation. In addition, we found that the HGF signaling pathway was important for acinar lineage cells (Supplementary Fig. 5e). MET was specifically expressed in tip and acinar cells, and its ligand HGF was expressed in pericytes and mesothelial cells (Fig. 4g, Supplementary Fig. 5f, g).
To construct gene regulatory networks (GRNs) that regulate cell differentiation, we integrated time-matched scATAC-seq and scRNA-seq based on canonical correlation analysis (CCA) and generated GRNs according to IReNA2 with slight modifications49. We combined the cell types in each lineage together in scATAC-seq analysis and focused on the inter-relationships between cell lineages. Key regulators and the corresponding regulatory networks driving acinar and ductal lineage specification were identified. We found sets of TFs that were specifically enriched in cell-type-specific peaks in either of these two lineages, as well as a set of TFs that were commonly enriched in both lineages (Fig. 4h). The motif enrichment of lineage-specific TFs was consistent with their expression levels (Fig. 4i). The average expression levels of the three sets of TFs in scRNA-seq were consistent with their footprint enrichment in scATAC-seq (Fig. 4j, Supplementary Fig. 5h). Among the lineage-specific TFs, the Notch signaling-related TFs HES1, HES4 and HEY1 were enriched in the ductal lineage, consistent with their high expression levels in ductal lineage cells (Fig. 4d). Known key regulators identified in mouse acinar cells, such as XBP1 and ONECUT1, were also enriched in human acinar lineages, as well as a few potential new regulators, such as CEBPD and SOX6. EPAS1 may also play an important role in acinar lineage development, as it targets other key acinar TFs, such as XBP1 and JUNB (Fig. 4i, Supplementary Fig. 5i). Epas1 was transiently expressed in mouse multipotent pancreatic progenitors and not in differentiated endocrine or exocrine cells50, while in the developing human pancreas, EPAS1 was uniquely expressed in acinar cells. Both deficiency and overexpression of Epas1 have been reported to severely impair acinar cell development in mice50,51, although its role in human acinar development remains to be investigated. The lineage-specific TFs had lineage-specific targets, including markers of either acinar or duct cells, such as REG4 and LGR4 for acinar cells or KRT17, KRT19 and HES4 for duct cells (Supplementary Fig. 5i). For the lineage-common TFs, GRN analysis revealed that despite enriched footprints being observed in both lineages, these TFs had different targets in the two lineages. In acinar lineages, their targets included various kinds of enzymes and protease inhibitors, such as CELA2A and SERPINA4, and in ductal lineages, their targets were COL4A1, COL4A2 and KRT17, which are marker genes of duct cells, and other ductal lineage-specific TFs, such as HES4 and HES1 (Fig. 4k, Supplementary Fig. 5i).
Transcriptional dynamics in bipotent trunk cell development
Trunk cells are bipotent progenitors with the potential to generate both duct cells and endocrine cells, for which the driving force has not been fully deciphered. To address this issue, we performed a branched gene expression analysis on the developmental directions of EP and duct cells with TFs highlighted (Fig. 5a). The GO analysis of cluster-1 genes, including ASCL2, indicated that these genes were associated with stem cell differentiation, tight junction assembly and BMP signaling (Fig. 5b). ASCL2 was highly expressed in trunk and EP cells but was expressed at low levels in duct cells (Fig. 5c, d). These findings indicated that ASCL2 might be related to endocrinogenesis. Cluster-2 genes, such as NEUROG3, FEV and INSM1, were highly expressed in EP cells and enriched in GO terms for endocrine pancreas development, neurogenesis, peptide secretion and cytoskeleton (Fig. 5a, b). Cluster-3 genes, such as HES4 and NFIB, were upregulated in duct cells and were related to the Notch, Wnt, TGFβ, and ERK pathways and epithelial cell proliferation (Fig. 5a, b). We also found that the histone deacetylase HDAC2, which can recruit KDM1A and RCOR2 to form a complex, was upregulated in EP cells (Fig. 5e, Supplementary Fig. 6a), indicating a potential role of epigenetic regulation in cell fate determination. This inference was corroborated in another published human embryonic pancreas dataset (Supplementary Fig. 6b)28.
To explore the candidate regulators favoring differentiation toward a specific cell lineage, we focused on TFs in trunk cells whose targets were other TFs that were specifically enriched in either duct or endocrine progenitors (details in Methods) (Fig. 5f). In the trunk endocrine transition, YBX1 upregulated INSM1, which is a key regulator in the endocrine lineage and has been reported to participate in mouse islet development and β-cell identity maintenance52,53. INSM1, in turn, upregulated a series of genes confined to endocrine lineages, including FEV and PAX6, which are reported to be key regulators in the development of EP cells and to trigger a complex network that sustains EP identity (Fig. 5g and Supplementary Fig. 6c)27,28,31,34,36. Coexpression patterns and chromatin accessibility patterns corroborated the GRN prediction of the YBX1-INSM1-FEV relationship (Fig. 5h, Supplementary Fig. 6d). Likewise, as targets of ASCL2, a decreased level of ELF3 suppressed duct commitment, and an increased level of MAFA favored the development of EP cells (Fig. 5f). On the other hand, ID4 upregulated HES4, and HES4 downregulated FOXA2, which in turn upregulated the known key EP regulator NEUROD154,55 to suppress endocrine commitment (Fig. 5f, Supplementary Fig. 5g). HES4 also upregulated GLIS3 and other TFs in duct cells to favor duct commitment (Fig. 5g).
Molecular heterogeneity of developing human pancreatic endocrine cells
To further investigate the molecular mechanisms of endocrine development, we performed a clustering analysis of EPs and endocrine cells. A total of five clusters of EP cells expressing NEUROG3 and four clusters of endocrine cells expressing endocrine hormone genes were identified (Fig. 6a and c, Supplementary Fig. 7b and Supplementary Data 5). Five subclusters of EP cells expressed NEUROG3 at different levels (Fig. 6c, Supplementary Fig. 7b). Some trunk cell markers, including SOX9 and ID3, were expressed in EP early cells (Fig. 6c, Supplementary Fig. 7b), suggesting the role of EP early cells in the transitional state of trunk and EP cells. EP mid cells expressed both high levels of NEUROG3 and HES6 (Fig. 6c, Supplementary Fig. 7b). Some TFs, including FEV, HES6, PAX4, and NKX6-1, were highly expressed in EP late cells (Fig. 6c, Supplementary Fig. 7b). EP alpha and EP beta cells still expressed low levels of NEUROG3 and exhibited their own differentiation potential. EP alpha cells expressed ARX, FEV and ISL1 but not GCG. EP beta cells expressed high levels of NKX6-1, FEV, and PAX4 and low levels of INS, with no expression of mature beta cell marker genes, such as MAFA (Fig. 6c, Supplementary Fig. 7b). A recent study identified four subtypes of EP cells in PCW 9 to 1928. We compared our dataset with their dataset and found that most cell types could be matched (Supplementary Fig. 7c, d). Our newly defined EP alpha and EP beta cells mostly corresponded to their alpha/PP-Pro cells and EP4 cells (Supplementary Fig. 7c, d). Next, we focused on the TFs in endocrine cells and evaluated their activity by pySCENIC. Similar to the case for DEGs, the regulon (a given TF and its direct gene targets) activities were much different in each cluster (Fig. 6d, Supplementary Data 6). We highlighted the top 5 scoring regulons in each cluster and found that the ARX and ETV1 regulons scored highly in EP alpha cells and that the NKX6-1 and LMX1B regulons scored highly in EP beta cells (Fig. 6d).
Almost all EPs and endocrine cells were generated after PCW 8 (Fig. 6a, Supplementary Fig. 7a). The percentage of EP cells gradually decreased, and that of endocrine cells increased between PCW 8 and 11 (Fig. 6b, Supplementary Fig. 7a). We performed RNA velocity and developmental trajectory analysis of pancreatic endocrine cells (Fig. 6e, f). EP early cells were considered the starting point of all endocrine cells. The epsilon cells were one branch from EP mid cells. Then, EP late cells differentiated first into EP alpha cells and then into delta and EP beta cells. Finally, EP alpha and EP beta cells only generated alpha or beta cells (Fig. 6e, f). Next, we analyzed transcriptional dynamics according to their pseudotime. We divided the DEGs of EP early, EP mid, and EP late cells into four clusters along the pseudotime axis (Fig. 6g). Cluster-1 genes, including TEAD2, SOX4, and NEUROG3, were upregulated in EP early and EP mid cells (Fig. 6g). A GO analysis indicated that these genes participated in epithelial tube morphogenesis and neuron differentiation (Fig. 6h). Cluster-2 genes, such as SOX9, were highly expressed in EP early cells, and the GO terms enriched among these genes indicated that Notch, Wnt, BMP, TGF-β and extracellular signal-regulated kinase (ERK) pathways were downregulated during EP development. The epithelial-mesenchymal transition (EMT) factor SNAI1 and other related genes were also upregulated in EP early cells (Fig. 6g, h). The EMT process is believed to help EP cells delaminate into the mesenchyme from the trunk region in mice56. Cluster-3 genes, such as PAX4, FEV, RFX3, NEUROD1, NKX6-1 and INSM1, were upregulated in EP late cells and associated with cytoskeletal organization for vesicle transport and protein folding (Fig. 6g, h). Cluster-4 genes associated with neurogenesis, including NKX2-2, RFX6, NEUROD2 and CDX2, were upregulated in EP mid cells (Fig. 6g, h).
Alpha and beta cells constitute ~90% of the pancreatic endocrine cells in adults57,58. In the developing mouse pancreas, alpha cells emerge as early as E9.5 and precede beta cells. In the human pancreas, we found that INS expression starting from PCW 8 occurred earlier than GCG expression from PCW 9 (Supplementary Fig. 7f). This result indicated that the generation of insulin-expressing cells occurred prior to that of glucagon-expressing cells in human pancreas development and is consistent with previous findings40,59. To further evaluate alpha and beta cell differentiation, we focused on the branching of EP alpha and EP beta cells (Supplementary Fig. 7g). Insulin secretion, peptide transport, glucose homeostasis, and negative regulation of Notch signaling pathway-related genes were gradually upregulated in the beta branch (Supplementary Fig. 7g, h). Oxidative phosphorylation-, glucose homeostasis-, and peptide secretion-related genes were expressed at increasing levels in the alpha cell branch (Supplementary Fig. 7g, h). Taken together, our data depicted the differentiation trajectories of EPs and endocrine cells during the embryonic development of the human pancreas (Fig. 6i).
Comparison of the embryonic pancreatic epithelium between humans and mice
To identify the molecular features of major pancreatic epithelial cell types between humans and mice during embryo development, we compared our pancreatic dataset with two published datasets for mouse embryonic pancreas from E9.5 to E17.5, the stage corresponding to human PCW 4 to 1127,28. We separately integrated the nonendocrine and endocrine datasets to make the results clearer (Fig. 7a, b). Our newly defined human dorsal MP cells were close to MP-early cells in mice (Fig. 7a, c). Some TFs, including GATA5, RFX6, NKX6-2 and NEUROG3, were only expressed in mouse MP cells, while other TFs, including NR2F1, TBX3 and ID4, were only expressed in human dorsal or ventral MP cells (Fig. 7d). The human early tip and early trunk cells that we identified were closer to a transition state between MP-late and tip or trunk cells in mice (Fig. 7a, c). This finding indicates that early tip-trunk differentiation in humans is milder than that in mice. We also identified some TFs that were differentially expressed between humans and mice (Fig. 7d); for example, NR4A1, MAFF and EPAS1 were only expressed in human tip lineage cells, and HES4, ASCL2 and ID4 were only expressed in human trunk lineage cells (Fig. 7d). Regarding endocrine cells, human EP cells showed similar heterogeneity to mouse EP cells (Fig. 7b, c). Some TFs, such as ZBTB18 and ZNF503, were only expressed in human EP cells (Fig. 7e). Epsilon cells had the potential to generate alpha and PP cells in mice, which was not observed in the human dataset, and consistent with previous results28. Alpha/PP cells still shared similar expression patterns with epsilon cells, for some TFs, such as ARX, ETV1 and ISL1 (Fig. 6c). SOX4, SOX6, LRRFIP1 and ZFHX3 were highly expressed in human epsilon and alpha/PP cells (Fig. 7e). While MAFA was highly expressed in human beta cells, other functional maturation genes, such as SLC2A2, IAPP and G6PC2, were highly expressed in mouse beta cells (Fig. 7e). Additionally, TFs such as PLAGL1, ASCL2, MNX1 and SAMD11 were upregulated in human beta cells (Fig. 7e). Taken together, the results showed that although the developmental trajectories are conserved between humans and mice, a few distinct features, such as different gene expression patterns, were revealed between the two species.
Discussion
In this study, we present an extensive analysis of the single-cell transcriptomic and chromatin accessibility profiles of human embryonic pancreas samples from the first trimester. The pancreas arises from both the dorsal and ventral endoderm domains, but it is not clear how the two parts contribute to the pancreas development. We first identified dorsal and ventral MP cells in humans and identified two new marker genes of these cells, NR2F1 (dorsal) and TBX3 (ventral). These results are similar to those of previous studies involving laser capture and deep sequencing in the human early dorsal pancreas60. In contrast to mouse pancreas development, there is only a single phase of NEUROG3 expression and endocrine differentiation after PCW 8 in the human pancreas13,26. It might be that TFs related to endocrinogenesis, such as RFX6 and NEUROG3, were expressed in mouse MP-early cells27 but were not detected in our MP cells (Fig. 7d). The early dorsal pancreas and ventral pancreas receive different signals from adjacent tissues and may have distinct differentiation abilities14,42. Our data showed that the Wnt signaling pathway was more involved in dorsal MP cell development. A study in transgene-labeled mice showed that more endocrine cells were generated in the dorsal pancreas61. No significant differences in differentiation potentials between dorsal and ventral MP cells were observed in our data. In addition, these published scRNA-seq datasets also identified intermediate progenitors in the mouse ventral domain, which could generate hepatoblasts, EHBD cells, and pancreatic progenitors61,62. We only identified PB progenitors that could differentiate into ventral MP and EHBD cells in our dataset. Our earliest samples were from the CS13 stage, and intermediate progenitors that could also generate hepatoblasts might exist before CS13 in humans. ISL1 was only expressed in the ventral foregut domain and not in the dorsal foregut domain in human CS10 and CS11 embryos63. Our identified PB progenitors showed high expression of ISL1, suggesting that PB progenitors exist only in the ventral foregut domain. In contrast, dorsal cell heterogeneity is less distinct than ventral cell heterogeneity, and no similar progenitor cell type has been identified for dorsal MP cells, suggesting that the dorsal foregut domain only generates the dorsal pancreas.
Tip-trunk differentiation is an important event in the development of the pancreas, and corresponding regulatory mechanism needs to be further investigated. The early tip and early trunk cells included in our dataset were not identified in a previous mouse dataset. They showed similar gene expression patterns and low expression of tip and trunk marker genes. We assume that early tip-trunk differentiation is less obvious in the early stage in humans than in mice. While PTF1A and NKX6-1 are considered the key TFs involved in tip and trunk differentiation, respectively, in mice19. Human tip cells still expressed NKX6-1 (Supplementary Fig. 2d). We also found that Notch signaling and MAPK signaling from mesenchymal cells promoted trunk identity and HGF signaling for tip identity in humans. Benefiting from scATAC-seq data, we constructed GRNs of tip and trunk cells to identify important lineage-specific TFs. TFs downstream of Notch signaling, such as HES1, HES4 and HEY1, drive MP cells to become ductal lineage cells, while TFs such as PTF1A, XBP1 and EPAS1 drive MP cells to become acinar lineage cells. The newly identified roles of TFs such as GLIS3, NFIB and EHF in ductal lineage cell differentiation and EPAS1 in acinar lineage cell differentiation require further investigation. Notably, Glis3 has been reported to regulate beta cell and duct cell development in mice64, while our data imply that it is involved mainly in duct development in humans. Likewise, Epas1 has been reported to regulate beta cell and acinar cell development in mice, while its expression is confined to developing acinar cells in humans. Hence, their roles in human pancreas development need to be further studied in experiments with human developmental models such as organoids, rather than mouse embryos.
How EP cells differentiate from trunk cells is also poorly understood. Our data are consistent with previous studies showing that inhibiting the Notch, TGFβ, Wnt, BMP and ERK signaling pathways can promote EP cell differentiation from trunk cells in mice27,43,65–68. These conclusions have already been applied in the generation of endocrine cells from hPSCs in vitro3,4,8,66,69,70. Additionally, we found that epigenetic regulators, such as the histone deacetylase HDAC2 and its components, were highly expressed in EP cells and might take part in EP cell differentiation. However, some previous studies showed that HDAC inhibitor treatment of the embryonic rodent pancreas increased the number of EP cells71,72. The function of HDAC in EP cell differentiation in the human pancreas still needs to be validated. Moreover, in a combined analysis with scATAC-seq data, we identified TFs in trunk cells governing cell fate choice in endocrinogenesis; these TFs included YBX1, which upregulated INSM1, a reported important regulator in mouse islet development52,53, to favor endocrine lineage differentiation. Our data suggested that ID4, which upregulated HES4 in trunk cells, could suppress endocrine differentiation and favor ductal lineage development. Lineage tracing studies in mouse embryonic pancreas or human stem cell-derived organoids may further corroborate these discoveries.
Endocrineogenesis is the most attractive part of pancreas development. EP cell differentiation is a complex process and involves multiple intermediate cell states. We identified five EP subclusters with distinct differentiation potentials. Our data are largely consistent with a previous human model in which EP cells were clustered into four subgroups28. We detected the same number of genes included in their mSTRT-seq dataset, while more EP cells were detected in our dataset (Supplementary Fig. 7e). In our model, epsilon cells were differentiated from EP mid cells corresponding to the EP2 and partial EP3 cells in the previous study28. EP late cells, which mainly related to most EP3 cells and some EP4 cells, showed multipotential to generate alpha, delta and beta cells. Moreover, we defined EP alpha cells as those corresponding to alpha/PP-Pro cells and EP beta cells corresponding to most EP4 cells. Compared with mouse endocrinogenesis, alpha and beta cell differentiation in humans takes more time. Epsilon cells generating alpha and PP cells and beta cell maturation may occur in the second and third trimesters. In addition, our data showed the MAFA expression in human fetal beta cells, consistent with previous studies showing that MAFA expression is low in the human developing pancreatic epithelium from PCW 9 onward73,74. MAFA is important for beta cell maturation, and its expression is increased in adult beta cells. However, nuclear MAFA protein did not appear until PCW 2159,74, indicating that MAFA was located in the cytoplasm of early fetal beta cells and may not play a regulatory role as a transcription factor at this time point.
Taken together, our work provides a valuable resource for transcriptional dynamics in early human pancreas development as well as a blueprint for generating pancreatic cells in vitro.
Methods
Sample collection
The research complies with all relevant ethical regulations and guidelines. With approval from the Ethics Committee of The First Affiliated Hospital of Hainan Medical University (certificate #201901) and informed consent from the patients taking voluntary abortions, we acquired pancreas from human embryos in PCW 4-11.
The PCW of the embryos was determined by combining gestational age information, ultrasound assessments and anatomical features of the embryos according to the guidelines (Supplementary Data 1)75–77. In PCW 4-6, the GI tract part beneath the stomach and above the duodenum was dissected to include both the dorsal and ventral pancreas. In PCW 7-11, when the dorsal and ventral pancreas merged and the pancreas became distinct, the developing pancreas was separated, and as much of the surrounding mesenchyme was removed as much as possible. The dissected tissues were processed as appropriate for the subsequent experiments.
scRNA-seq library construction and sequencing
For scRNA-seq library construction, the samples were washed with DPBS (Gibco), cut into millimeter pieces if necessary and digested in 0.25% trypsin (Gibco) at 37 °C for 5-10 min with regular inspection of the digestion status. Ten percent FBS (Gibco) was used to terminate the digestion once large tissue pieces were no longer visible, and the suspension was pipetted several times to further dissociate cell clumps. After washing with DPBS, the cell suspensions were filtered through 40-μm strainers (Bel-Art) to remove remaining cell clumps, followed by trypan blue (Sigma‒Aldrich) staining for live cell counting. Cell counting was performed manually, and samples with a minimal cell viability of 90% were used for scRNA-seq with a target recovery of 5000-8000 cells. Library construction was performed using Chromium Next GEM Single Cell 3’ Reagent Kits v3.1 (10x Genomics) according to the manufacturer’s instructions. The library was processed on the Illumina NovaSeq 6000 platform for sequencing with 150 bp paired-end reads.
scATAC-seq library construction and sequencing
For scATAC-seq library construction, the samples were cryo-preserved with CryostorCS10 (STEMCELL). The nuclei were extracted according to the 10x Genomics user guide. Cryopreserved samples were recovered in prewarmed media (RPMI 1640 + 10% FBS) and then subjected to centrifugation at 300×g for 5 min at 4 °C. After DPBS washing, the cells were lysed in chilled lysis buffer (10 mM pH 7.4 Tris-HCl, 10 mM NaCl, 3 mM MgCl2, 0.1% Tween-20, 0.1% Nonidet P40 Substitute, 0.01% digitonin, 1% BSA) with gentle pipette mixing and incubation on ice for 3 min. After washing with chilled wash buffer (10 mM pH 7.4 Tris-HCl, 10 mM NaCl, 3 mM MgCl2, 0.1% Tween-20, and 1% BSA), the nuclei were resuspended in chilled diluted nuclei buffer (10x Genomics) and filtered through 40-μm strainers. The quality and quantity of nuclei were manually examined by trypan blue staining. Nuclei suspensions with a target recovery of 3000-6000 nuclei were subjected to Chromium Next GEM Single Cell ATAC Reagent Kits v1.1 (10x Genomics) according to the manufacturer’s instructions. The library was processed on the Illumina NovaSeq6000 platform for sequencing with 50 bp paired-end reads.
Cryosectioning and immunohistochemistry
For immunohistochemistry analysis, the samples were fixed in 4% paraformaldehyde (PFA, Sigma‒Aldrich) for 24 h, embedded in optimum cutting temperature compound (OCT compound, Leica), quickly frozen in liquid nitrogen and stored at -80 °C. Cryosections were sectioned every 10 μm at −20 °C. Tissue sections were sequentially treated with 0.5% Triton X-100 for 10 min at room temperature, 5% BSA for 1 h at room temperature, diluted primary antibody solution at 4 °C overnight and diluted secondary antibody solution for 1 h at room temperature, with PBS wash between each step. Sections were incubated with primary or secondary antibodies at the following dilutions: Mouse anti-DCDC2 (C-4) (1:50, Santa Cruz, sc-166051), Mouse anti-ASCL2 (7E2) (1:100, Millipore, MAB4418), Rabbit anti-PDX1 (EPR22002) (1:100, Abcam, ab219207), Rabbit anti-HES4 (1:100, Invitrogen, PA5-84551), Goat anti-Mouse IgG (H + L) Cross-Adsorbed Secondary Antibody, Alexa Fluor 488 (1:400, Invitrogen, A-11001), Donkey anti-Rabbit IgG (H + L) Highly Cross-Adsorbed Secondary Antibody, Alexa Fluor 568 (1: 400, Invitrogen, A-10042). Immunofluorescence images were taken with an Olympus FV3000 confocal microscope.
scRNA-seq processing
The sequencing output files were processed with Cell Ranger 4.0.0 with default parameters, aligning reads to the GRCh38 (hg38) reference genome. SoupX78 was deployed to remove ambient RNA contamination from each count matrix, and the data were then inputted to the Seurat workspace79. Cells with UMI counts between 4000 and 50,000, gene numbers between 500-8000 and mitochondrial count percentages <15% were used for further analysis.
The Seurat objects from the PCW 4-6 and PCW 7-11 samples were simply merged together, respectively. The two datasets were normalized by the total counts per cell. The 3000 most variable genes were selected. Cell cycle scores were calculated by the CellCycleScoring function and regressed out together with sequencing depth and batch. Principal component analysis (PCA) was performed with a z score matrix, followed by UMAP and Louvain clustering applied with the top 50 PCs and a resolution of 1. Cells coexpressing markers of more than one cell class were treated as doublets and removed from the datasets. The markers used for cell type annotation are listed in Supplementary Fig. 1e, g.
Then, pancreatic epithelium from different datasets (PCW 4-6 or PCW 7-11) were extracted and merged together, followed by repeating the analysis process described above with parameters modified. Briefly, UMAP and Louvain clustering were applied with the top 40 PCs and a resolution of 2.5. The markers used for cell type annotation are listed in Supplementary Fig. 2d.
For the PCW 4-5 epithelial cells shown in Fig. 2, epithelial cells in PCW 4-5 samples were extracted, followed by repeating the analysis process demonstrated above with some procedures modified. Briefly, 2000 highly variable genes were selected, and cell cycle scores and sequencing depth were regressed out. After PCA, we used the package Harmony to perform batch effect correction. Next, UMAP and Louvain clustering were applied with the top 30 Harmony reduction and a resolution of 0.7. The markers used for cell type annotation are listed in Fig. 2e.
For the pancreatic endocrine cell dataset, EP, alpha, beta, delta and epsilon cells were extracted from the pancreatic epithelial cell dataset, followed by repeating the analysis process described above with some parameters modified. Briefly, 2000 highly variable genes were selected, and cell cycle scores and sequencing depth were regressed out. After PCA, we used the package Harmony80 to perform batch effect correction. Next, UMAP and Louvain clustering were applied with the top 35 Harmony reduction and a resolution of 2. The markers used for cell type annotation are listed in Fig. 6c.
DEGs for each cluster in all datasets were listed by the FindAllMarkers function with the default parameters, and only genes for p-value adjusted <0.05 were selected.
Pseudotime and trajectory analysis
Pseudotime analysis was completed with the Monocle3 package37. The gene expression matrix and UMAP layout were imported to the CellDataSet object. The learn_graph functions were used to build trajectories with the parameters minimal_branch_len = 4. The order_cells function was employed to measure the pseudotime of each cell, and cells were manually selected as root nodes of the trajectory graph. For gene expression dynamics visualization, we extracted a DEG-related expression matrix for cell types of interest and arranged the cells by pseudotime. Then, smooth.spline was applied to fit the gene expression. The package ComplexHeatmap81 was applied to visualize the gene expression and cluster genes by the k-means method.
RNA velocity analysis
Loom files containing unspliced and spliced reads were generated from velocyto.py for downstream analysis with default parameters38. Then, the python package scVelo39 was applied to estimate RNA velocity with a dynamic model by default parameters. Later, RNA velocities were projected into UMAP embedding and visualized by cell types.
GO analysis
GO analysis was performed with the package clusterProfiler82. Only biological process GO terms with a p-value < 0.05 were selected. The results were further visualized with the ggplot2 package.
Cell‒cell interaction analysis
Cell‒cell interaction analysis was performed with the package CellChat83. The CellChatDB.human database used in our analysis contains secreted signaling, ECM-receptor and cell‒cell contact ligand‒receptor (LR) interactions. The Seurat object was imported into CellChat object and processed according to the guidelines. The chord diagram of the signaling pathway and LR interactions of interest was fulfilled by the netVisual_chord_gene function. Only LR interactions with a p-value < 0.05 were selected.
Identification of regulons of endocrine cells
The count matrix of endocrine cells was input into the pySCENIC workflow with default parameters84,85. First, the gene coexpression network was constructed by the grn step. Second, each TF-target module was pruned with a previously known regulatory motif in the cisTarget database by the ctx step. Then, the AUC score for each regulon was calculated at the single-cell level by aucell step. Finally, differentially activated regulons in each cluster were identified by the Wilcoxon test, and only p-values adjusted <0.05 were selected.
scATAC-seq processing
The sequence output files were processed with Cell Ranger ATAC 2.0.0 with default parameters, aligning reads to the GRCh38 (hg38) reference genome and identifying transposase cut sites. The Signac package86 was used to preprocess scATAC-seq data. Cells with a TSS enrichment score between 4 to 20, nucleosome signal score <4 and blacklist ratio <0.02 and total <50,000 were selected for further analysis.
Genomes were segmented into 2.5 kb windows, and windows containing blacklist regions were removed. Count matrices of each sample were constructed by the FeatureMatrix function, binarized and merged together prior to term frequency inverse document frequency (TF-IDF) normalization (method 3 of function RunTFIDF). The 25000 most accessible windows were subjected to singular value decomposition (SVD), and the RunHarmony function was used to remove batch effects with 2-30 LSI components. The 2-30 Harmony components were then used for UMAP (n.neighbors = 50 L, min.dist = 0.5) and Louvain clustering (resolution = 0.8). The gene activity matrix was generated using the GeneActivity function. Cells coexpressing markers of more than one cell classes or expressing none of the markers of interested cell classes were removed from the datasets. The markers used for cell class annotation are listed in Supplementary Fig. 4c.
Epithelial cells were extracted from the whole dataset and then subjected to peak calling. The method described in a previous study was used to generate 500 bp peaks87. Briefly, the CallPeaks function in Signac was used to call peaks for each Seurat cluster with additional parameters “--call-summits --nolambda --qval 5e-2 --keep-dup all” passed to MACS288. After extending peak summits by 250 bp on both sides, any peaks aligning to blacklist regions were removed. The 200000 most significant peaks were selected after the iterative selection described in a previous study87. The peak matrices were generated by applying the FeatureMatrix function to the selected peaks, binarizing the peaks and subjecting them to the clustering process described above. Due to the inherent sparsity of scATAC-seq data, we applied a broader strategy in annotating epithelial cells in scATAC-seq by ignoring differences between different stages of progenitors (e.g., early tip and tip cells were both annotated as tip cells) and focusing on differences between lineages. The markers used for cell type annotation are listed in Supplementary Fig. 4d. Differentially accessible peaks for each cell type were calculated by the FindAllMarkers function with the parameter min.pct set to 0.01 and logfc.threshold set to 0.15. Motif information was added by the AddMotifs function with the filtered cisBP database (human_pwms_v2 from the chromVARmotifs package)89. Motif enrichment for each cell type was calculated by performing FindMotifs on differentially accessible peaks for each cell type.
Joint analysis of scRNA-seq and scATAC-seq
The ArchR package90 was employed in integrated analysis, and the established LSI coordinates, Harmony coordinates, UMAP coordinates, cell type annotation and gene activity matrix described above were inputted to ArchR. To integrate time-matched scRNA-seq and scATAC-seq data, data for PCW 8-11 epithelial cells were extracted from scRNA-seq datasets. scRNA-seq and scATAC-seq were integrated by unconstrained integration using the addGeneIntegrationMatrix function on the gene activity matrix imported from Signac. Peak-to-gene links were then added using the addPeak2GeneLinks function.
GRN analysis
GRNs were constructed according to IReNA2 with slight modifications49. Briefly, TF footprints were calculated by TOBIAS91. Motif enrichment and peak-to-gene links were calculated as described above. TF footprint and motif enrichment were used to assess TF activity, while peak-to-gene links and gene coexpression relationships were exploited to predict targets. Expression correlations were used to screen predicted TF-target pairs. Default parameters described in the original paper were used49.
For acinar lineage GRN analysis, cell types involved in the acinar lineage, namely, tip and acinar cells for scATAC-seq and early tip, tip and acinar cells for scRNA-seq, were treated all as acinar lineage cells, and cell type distinctions within this group were ignored. For ductal lineage GRN analysis, a similar strategy was employed, ignoring differences between trunk and duct cells for scATAC-seq and early trunk, trunk and duct cells for scRNA-seq.
For the analysis of GRNs regulating the differentiation of trunk cells to duct or endocrine progenitors, GRNs of trunk, duct and endocrine progenitors were constructed separately. Then, TF-target pairs regulating bidirectional differentiation were selected based on two criteria (exemplified by trunk-endocrine transition): (a) the targets in trunk cells were TFs in endocrine cells but not in duct cells; (b) the TF-target pairs were shared between trunk cells and endocrine cells but not duct cells, and for trunk-duct transitions, the opposite criteria were applied.
Comparison with published another dataset for endocrine cells
We used the mSTRT-seq dataset from OMIX236 for human embryonic pancreas in PCW 9 to 19 to compare with our endocrine cell dataset28. Only endocrine cells were extracted. Then, the standard workflow of Seurat was applied to integrate the two datasets by canonical correlation analysis92. The two datasets were separately visualized in the same UMAP space. Cell type identity transfer was performed using the FindTransferAnchors and TransferData functions with default parameters and visualized by an alluvial plot in the ggplot2 package.
Comparison across species between humans and mice
Cross-species comparison was based on genes with a 1:1 ortholog between humans and mice by the BioMart from Ensembl genome annotation system (http://www.ensembl.org/index.html), and the mouse gene names were converted to human gene names. To overcome biases due to unbalanced cell numbers, we sampled 200 cells per cell type for our human nonendocrine cells to form a balanced dataset. This dataset was integrated with the Smart-seq2 dataset from GSE11593127. Our endocrine cell dataset was integrated with the Smart-seq2 dataset from GSE13962728. Only cells from E9.5 to E17.5 were applied to integrate with our data. Then, the standard workflow of Seurat was applied to integrate two datasets by canonical correlation analysis. The two datasets were separately visualized in the same UMAP space. DEGs between humans and mice across the same cell types were calculated by the FindMarkers function.
Statistics and reproducibility
Statistical parameters are reported in the respective figures and figure legends.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Supplementary information
Source data
Acknowledgements
This work was supported by grants from the Ministry of Science and Technology of the People’s Republic of China (2021YFA1300301, 2018YFA0507101), the Beijing Natural Science Foundation (5212016), the National Natural Research Foundation of China and the Swedish Foundation for International Cooperation in Research and Higher Education (NSFC-STINT) Cooperation Project (32011530118), the Academic Promotion Program of Shandong First Medical University (922-001003130RC), the SciLifeLab & Wallenberg Data Driven Life Science Program (KAW 2020.0239), the Major Science and Technology Program of Hainan Province (ZDKJ2017007), the National Natural Science Foundation of China (81960283), Hainan Provincial Science and Technology Program for Clinical Medical Research Center (LCYX202102), the Hainan Province Clinical Medical Center, the Innovation Platform for Academicians of Hainan Province. We are grateful to Dr. Zhen Fan from Center for High Throughout Sequencing for the technical support with the preparation of single-cell samples and the construction of the scRNA-seq and scATAC-seq library. We thank Dr. Yan Teng and her group from Center for Biological imaging for helping with the bioimaging. We would also like to thank the Guangdong-Hong Kong-Macao Greater Bay Area Center of Bioinformation (GCBI) for the support of computing resources.
Author contributions
E.S. and T.X. conceived the project and designed the experiments. Z.M., X.Z., E.S., H.Y. and Y.M. collected the human embryo samples. Z.M. and X.Z. performed tissue processing, single-cell RNA-seq and single-cell ATAC-seq experiments. Z.M., W.Z. and X.C. analyzed the scRNA-seq data. X.Z. and W.Z. analyzed the scATAC-seq data. X.C. contributed to data visualization. Z.M., X.Z., and Y.Z. performed immunostaining, imaging, and data analysis. Z.M. and X.Z., E.S., W.Z. and T.X. wrote and revised the manuscript. All authors contributed to the discussion and interpretation of the results.
Peer review
Peer review information
Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Data availability
The raw sequencing data generated in this study have been deposited in the Genome Sequence Archive for Human (GSA-Human) under accession code HRA002757. The data in GSA are available under restricted access for privacy protection, access can be obtained by contacting Tao Xu (xutao@ibp.ac.cn). China’s Ministry of Science and Technology has approved the export of raw sequencing data (approval # 2023BAT1021). The processed gene expression matrix for scRNA-seq and Tn5 fragment files and filtered peak-barcode matrix for scATAC-seq data are available at OMIX database under accession code OMIX001616. Source data are provided with this paper. Other published datasets we used in this study could be obtained from GSE115931, GSE139627 and OMIX236. Source data are provided with this paper.
Code availability
The code for scRNA-seq and scATAC-seq analysis during this study is available at Github: https://github.com/zhuoma888/fetal_pancreas.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Zhuo Ma, Xiaofei Zhang, and Wen Zhong.
These authors jointly supervised this work: Yanlin Ma, Eli Song, and Tao Xu.
Contributor Information
Yanlin Ma, Email: mayanlinma@hotmail.com.
Eli Song, Email: songali@ibp.ac.cn.
Tao Xu, Email: xutao@ibp.ac.cn.
Supplementary information
The online version contains supplementary material available at 10.1038/s41467-023-40893-8.
References
- 1.Fusco, J., El-Gohary, Y. & Gittes, G. K. in Shackelford’s Surgery of the Alimentary Tract Vol. 2, 1062–1075 (Elsevier Health, US, 2019).
- 2.Zhou Q, Melton DA. Pancreas regeneration. Nature. 2018;557:351–358. doi: 10.1038/s41586-018-0088-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Rezania A, et al. Reversal of diabetes with insulin-producing cells derived in vitro from human pluripotent stem cells. Nat. Biotechnol. 2014;32:1121–1133. doi: 10.1038/nbt.3033. [DOI] [PubMed] [Google Scholar]
- 4.Pagliuca FeliciaW, et al. Generation of functional human pancreatic β. Cells Vitr. Cell. 2014;159:428–439. doi: 10.1016/j.cell.2014.09.040. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Hogrebe NJ, Augsornworawat P, Maxwell KG, Velazco-Cruz L, Millman JR. Targeting the cytoskeleton to direct pancreatic differentiation of human pluripotent stem cells. Nat. Biotechnol. 2020;38:460–470. doi: 10.1038/s41587-020-0430-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Velazco-Cruz L, et al. Acquisition of dynamic function in human stem cell-derived beta cells. Stem Cell Rep. 2019;12:351–365. doi: 10.1016/j.stemcr.2018.12.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Yoshihara E, et al. Immune-evasive human islet-like organoids ameliorate diabetes. Nature. 2020;586:606–611. doi: 10.1038/s41586-020-2631-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Hogrebe NJ, Maxwell KG, Augsornworawat P, Millman JR. Generation of insulin-producing pancreatic beta cells from multiple human stem cell lines. Nat. Protoc. 2021;16:4109–4143. doi: 10.1038/s41596-021-00560-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Balboa D, et al. Functional, metabolic and transcriptional maturation of human pancreatic islets derived from stem cells. Nat. Biotechnol. 2022;40:1042–1055. doi: 10.1038/s41587-022-01219-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Shih HP, Wang A, Sander M. Pancreas organogenesis: from lineage determination to morphogenesis. Annu Rev. Cell Dev. Biol. 2013;29:81–105. doi: 10.1146/annurev-cellbio-101512-122405. [DOI] [PubMed] [Google Scholar]
- 11.Bastidas-Ponce A, Scheibner K, Lickert H, Bakhti M. Cellular and molecular mechanisms coordinating pancreas development. Development. 2017;144:2873–2888. doi: 10.1242/dev.140756. [DOI] [PubMed] [Google Scholar]
- 12.Slack JM. Developmental biology of the pancreas. Development. 1995;121:1569–1580. doi: 10.1242/dev.121.6.1569. [DOI] [PubMed] [Google Scholar]
- 13.Jennings RE, et al. Development of the human pancreas from foregut to endocrine commitment. Diabetes. 2013;62:3514–3522. doi: 10.2337/db12-1479. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Zorn AM, Wells JM. Vertebrate endoderm development and organ formation. Annu Rev. Cell Dev. Biol. 2009;25:221–251. doi: 10.1146/annurev.cellbio.042308.113344. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Zaret KS, Grompe M. Generation and regeneration of cells of the liver and pancreas. Science. 2008;322:1490–1494. doi: 10.1126/science.1161431. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Burlison JS, Long Q, Fujitani Y, Wright CV, Magnuson MA. Pdx-1 and Ptf1a concurrently determine fate specification of pancreatic multipotent progenitor cells. Dev. Biol. 2008;316:74–86. doi: 10.1016/j.ydbio.2008.01.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Gao N, et al. Dynamic regulation of Pdx1 enhancers by Foxa1 and Foxa2 is essential for pancreas development. Genes Dev. 2008;22:3435–3448. doi: 10.1101/gad.1752608. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Zhou Q, et al. A multipotent progenitor domain guides pancreatic organogenesis. Dev. Cell. 2007;13:103–114. doi: 10.1016/j.devcel.2007.06.001. [DOI] [PubMed] [Google Scholar]
- 19.Schaffer AE, Freude KK, Nelson SB, Sander M. Nkx6 transcription factors and Ptf1a function as antagonistic lineage determinants in multipotent pancreatic progenitors. Dev. Cell. 2010;18:1022–1029. doi: 10.1016/j.devcel.2010.05.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Schwitzgebel VM, et al. Expression of neurogenin3 reveals an islet cell precursor population in the pancreas. Development. 2000;127:3533–3542. doi: 10.1242/dev.127.16.3533. [DOI] [PubMed] [Google Scholar]
- 21.Gradwohl G, Dierich A, LeMeur M, Guillemot F. neurogenin3 is required for the development of the four endocrine cell lineages of the pancreas. Proc. Natl Acad. Sci. USA. 2000;97:1607–1611. doi: 10.1073/pnas.97.4.1607. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Hang Y, Stein R. MafA and MafB activity in pancreatic beta cells. Trends Endocrinol. Metab. 2011;22:364–373. doi: 10.1016/j.tem.2011.05.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Nishimura W, Takahashi S, Yasuda K. MafA is critical for maintenance of the mature beta cell phenotype in mice. Diabetologia. 2015;58:566–574. doi: 10.1007/s00125-014-3464-9. [DOI] [PubMed] [Google Scholar]
- 24.Nair G, Hebrok M. Islet formation in mice and men: lessons for the generation of functional insulin-producing beta-cells from human pluripotent stem cells. Curr. Opin. Genet Dev. 2015;32:171–180. doi: 10.1016/j.gde.2015.03.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Jennings RE, Berry AA, Strutt JP, Gerrard DT, Hanley NA. Human pancreas development. Development. 2015;142:3126–3137. doi: 10.1242/dev.120063. [DOI] [PubMed] [Google Scholar]
- 26.Salisbury RJ, et al. The window period of NEUROGENIN3 during human gestation. Islets. 2014;6:e954436. doi: 10.4161/19382014.2014.954436. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Yu, X. X. et al. Defining multistep cell fate decision pathways during pancreatic development at single-cell resolution. EMBO J. 38, e100164 (2019). [DOI] [PMC free article] [PubMed]
- 28.Yu XX, et al. Sequential progenitor states mark the generation of pancreatic endocrine lineages in mice and humans. Cell Res. 2021;31:886–903. doi: 10.1038/s41422-021-00486-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.van Gurp, L. et al. A transcriptomic roadmap to alpha- and beta-cell differentiation in the embryonic pancreas. Development146, dev173716 (2019). [DOI] [PubMed]
- 30.Sharon N, et al. A peninsular structure coordinates asynchronous differentiation with morphogenesis to generate pancreatic Islets. Cell. 2019;176:790–804.e713. doi: 10.1016/j.cell.2018.12.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Scavuzzo MA, et al. Endocrine lineage biases arise in temporally distinct endocrine progenitors during pancreatic morphogenesis. Nat. Commun. 2018;9:3356. doi: 10.1038/s41467-018-05740-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Ramond, C. et al. Understanding human fetal pancreas development using subpopulation sorting, RNA sequencing and single-cell profiling. Development145, dev165480 (2018). [DOI] [PMC free article] [PubMed]
- 33.Goncalves CA, et al. A 3D system to model human pancreas development and its reference single-cell transcriptome atlas identify signaling pathways required for progenitor expansion. Nat. Commun. 2021;12:3144. doi: 10.1038/s41467-021-23295-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Byrnes LE, et al. Lineage dynamics of murine pancreatic development at single-cell resolution. Nat. Commun. 2018;9:3922. doi: 10.1038/s41467-018-06176-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Bastidas-Ponce, A. et al. Comprehensive single cell mRNA profiling reveals a detailed roadmap for pancreatic endocrinogenesis. Development146, dev173849 (2019). [DOI] [PubMed]
- 36.Duvall E, et al. Single-cell transcriptome and accessible chromatin dynamics during endocrine pancreas development. Proc. Natl Acad. Sci. USA. 2022;119:e2201267119. doi: 10.1073/pnas.2201267119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Cao J, et al. The single-cell transcriptional landscape of mammalian organogenesis. Nature. 2019;566:496–502. doi: 10.1038/s41586-019-0969-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.La Manno G, et al. RNA velocity of single cells. Nature. 2018;560:494–498. doi: 10.1038/s41586-018-0414-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Bergen V, Lange M, Peidli S, Wolf FA, Theis FJ. Generalizing RNA velocity to transient cell states through dynamical modeling. Nat. Biotechnol. 2020;38:1408–1414. doi: 10.1038/s41587-020-0591-3. [DOI] [PubMed] [Google Scholar]
- 40.Piper K, et al. Beta cell differentiation during early human pancreas development. J. Endocrinol. 2004;181:11–23. doi: 10.1677/joe.0.1810011. [DOI] [PubMed] [Google Scholar]
- 41.Baeyens L, et al. (Re)generating human beta cells: status, pitfalls, and perspectives. Physiol. Rev. 2018;98:1143–1167. doi: 10.1152/physrev.00034.2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Qadir, M. M. F., Lanzoni, G., Ricordi, C. & Domínguez-Bendala, J. in Transplantation, Bioengineering, and Regeneration of the Endocrine Pancreas 183–200 (Academic Press Inc, 2020).
- 43.Murtaugh LC, Stanger BZ, Kwan KM, Melton DA. Notch signaling controls multiple steps of pancreatic differentiation. Proc. Natl Acad. Sci. USA. 2003;100:14920–14925. doi: 10.1073/pnas.2436557100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Esni F, et al. Notch inhibits Ptf1 function and acinar cell differentiation in developing mouse and zebrafish pancreas. Development. 2004;131:4213–4224. doi: 10.1242/dev.01280. [DOI] [PubMed] [Google Scholar]
- 45.Shih HP, et al. A Notch-dependent molecular circuitry initiates pancreatic endocrine and ductal cell differentiation. Development. 2012;139:2488–2499. doi: 10.1242/dev.078634. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Pulkkinen M-A, Spencer-Dene B, Dickson C, Otonkoski T. The IIIb isoform of fibroblast growth factor receptor 2 is required for proper growth and branching of pancreatic ductal epithelium but not for differentiation of exocrine or endocrine cells. Mech. Dev. 2003;120:167–175. doi: 10.1016/s0925-4773(02)00440-9. [DOI] [PubMed] [Google Scholar]
- 47.Bhushan A, et al. Fgf10 is essential for maintaining the proliferative capacity of epithelial progenitor cells during early pancreatic organogenesis. Development. 2001;128:5109–5117. doi: 10.1242/dev.128.24.5109. [DOI] [PubMed] [Google Scholar]
- 48.Elghazi L, Cras-Meneur C, Czernichow P, Scharfmann R. Role for FGFR2IIIb-mediated signals in controlling pancreatic endocrine progenitor cell proliferation. Proc. Natl Acad. Sci. USA. 2002;99:3884–3889. doi: 10.1073/pnas.062321799. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Lyu P, et al. Gene regulatory networks controlling temporal patterning, neurogenesis, and cell-fate specification in mammalian retina. Cell Rep. 2021;37:109994. doi: 10.1016/j.celrep.2021.109994. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Chen H, et al. Impaired pancreatic development in Hif2-alpha deficient mice. Biochem Biophys. Res Commun. 2010;399:440–445. doi: 10.1016/j.bbrc.2010.07.111. [DOI] [PubMed] [Google Scholar]
- 51.Flores-Martinez A, Garcia-Nunez A, Rojas A, Cano DA. Stabilization of HIF-2alpha impacts pancreas growth. Sci. Rep. 2018;8:13713. doi: 10.1038/s41598-018-32054-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Osipovich AB, et al. Insm1 promotes endocrine cell differentiation by modulating the expression of a network of genes that includes Neurog3 and Ripply3. Development. 2014;141:2939–2949. doi: 10.1242/dev.104810. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Liang X, et al. The SNAG domain of Insm1 regulates pancreatic endocrine cell differentiation and represses beta- to delta-cell transdifferentiation. Diabetes. 2021;70:1084–1097. doi: 10.2337/db20-0883. [DOI] [PubMed] [Google Scholar]
- 54.Bohuslavova, R. et al. NEUROD1 is required for the early alpha and beta endocrine differentiation in the pancreas. Int. J. Mol. Sci.22, 6713v (2021). [DOI] [PMC free article] [PubMed]
- 55.Itkin-Ansari P, et al. NeuroD1 in the endocrine pancreas: localization and dual function as an activator and repressor. Dev. Dyn. 2005;233:946–953. doi: 10.1002/dvdy.20443. [DOI] [PubMed] [Google Scholar]
- 56.Rukstalis JM, Habener JF. Snail2, a mediator of epithelial-mesenchymal transitions, expressed in progenitor cells of the developing endocrine pancreas. Gene Expr. Patterns. 2007;7:471–479. doi: 10.1016/j.modgep.2006.11.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Cabrera O, et al. The unique cytoarchitecture of human pancreatic islets has implications for islet cell function. Proc. Natl Acad. Sci. USA. 2006;103:2334–2339. doi: 10.1073/pnas.0510790103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Brissova M, et al. Assessment of human pancreatic islet architecture and composition by laser scanning confocal microscopy. J. Histochem Cytochem. 2005;53:1087–1097. doi: 10.1369/jhc.5C6684.2005. [DOI] [PubMed] [Google Scholar]
- 59.Jeon J, Correa-Medina M, Ricordi C, Edlund H, Diez JA. Endocrine cell clustering during human pancreas development. J. Histochem Cytochem. 2009;57:811–824. doi: 10.1369/jhc.2009.953307. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Jennings RE, et al. Laser capture and deep sequencing reveals the transcriptomic programmes regulating the onset of pancreas and liver differentiation in human embryos. Stem Cell Rep. 2017;9:1387–1394. doi: 10.1016/j.stemcr.2017.09.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Li, L. C. et al. Single-cell transcriptomic analyses reveal distinct dorsal/ventral pancreatic programs. EMBO Rep.19, e46148 (2018). [DOI] [PMC free article] [PubMed]
- 62.Willnow D, et al. Quantitative lineage analysis identifies a hepato-pancreato-biliary progenitor niche. Nature. 2021;597:87–91. doi: 10.1038/s41586-021-03844-1. [DOI] [PubMed] [Google Scholar]
- 63.Li LC, et al. Single-cell patterning and axis characterization in the murine and human definitive endoderm. Cell Res. 2021;31:326–344. doi: 10.1038/s41422-020-00426-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Kang HS, et al. Transcription factor Glis3, a novel critical player in the regulation of pancreatic beta-cell development and insulin gene expression. Mol. Cell Biol. 2009;29:6366–6379. doi: 10.1128/MCB.01259-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Apelqvist A, et al. Notch signalling controls pancreatic cell differentiation. Nature. 1999;400:877–881. doi: 10.1038/23716. [DOI] [PubMed] [Google Scholar]
- 66.Sharon N, et al. Wnt signaling separates the progenitor and endocrine compartments during pancreas development. Cell Rep. 2019;27:2281–2291.e2285. doi: 10.1016/j.celrep.2019.04.083. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Murtaugh LC, Law AC, Dor Y, Melton DA. Beta-catenin is essential for pancreatic acinar but not islet development. Development. 2005;132:4663–4674. doi: 10.1242/dev.02063. [DOI] [PubMed] [Google Scholar]
- 68.Tulachan SS, et al. TGF-beta isoform signaling regulates secondary transition and mesenchymal-induced endocrine development in the embryonic mouse pancreas. Dev. Biol. 2007;305:508–521. doi: 10.1016/j.ydbio.2007.02.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Nostro MC, et al. Stage-specific signaling through TGFbeta family members and WNT regulates patterning and pancreatic specification of human pluripotent stem cells. Development. 2011;138:861–871. doi: 10.1242/dev.055236. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Russ HA, et al. Controlled induction of human pancreatic progenitors produces functional beta-like cells in vitro. EMBO J. 2015;34:1759–1772. doi: 10.15252/embj.201591058. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Haumaitre C, Lenoir O, Scharfmann R. Histone deacetylase inhibitors modify pancreatic cell fate determination and amplify endocrine progenitors. Mol. Cell Biol. 2008;28:6373–6383. doi: 10.1128/MCB.00413-08. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Haumaitre C, Lenoir O, Scharfmann R. Directing cell differentiation with small-molecule histone deacetylase inhibitors: the example of promoting pancreatic endocrine cells. Cell Cycle. 2009;8:536–544. doi: 10.4161/cc.8.4.7610. [DOI] [PubMed] [Google Scholar]
- 73.Sarkar SA, et al. Global gene expression profiling and histochemical analysis of the developing human fetal pancreas. Diabetologia. 2008;51:285–297. doi: 10.1007/s00125-007-0880-0. [DOI] [PubMed] [Google Scholar]
- 74.Riedel MJ, et al. Immunohistochemical characterisation of cells co-producing insulin and glucagon in the developing human pancreas. Diabetologia. 2012;55:372–381. doi: 10.1007/s00125-011-2344-9. [DOI] [PubMed] [Google Scholar]
- 75.O’rahilly R, Müller F. Developmental stages in human embryos: revised and new measurements. Cells Tissues Organs. 2010;192:73–84. doi: 10.1159/000289817. [DOI] [PubMed] [Google Scholar]
- 76.Bullen, P. & Wilson, D. in Molecular Genetics of Early Human Development 27–35 (BIOS Scientific, 1997).
- 77.Warren M, HERN M. Correlation of fetal age and measurements between 10 and 26 weeks of gestation. Obstet. Cynecol. 1984;63:26–32. [PubMed] [Google Scholar]
- 78.Young, M. D. & Behjati, S. SoupX removes ambient RNA contamination from droplet-based single-cell RNA sequencing data. Gigascience9, giaa151 (2020). [DOI] [PMC free article] [PubMed]
- 79.Hao Y, et al. Integrated analysis of multimodal single-cell data. Cell. 2021;184:3573–3587.e3529. doi: 10.1016/j.cell.2021.04.048. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Korsunsky I, et al. Fast, sensitive and accurate integration of single-cell data with Harmony. Nat. Methods. 2019;16:1289–1296. doi: 10.1038/s41592-019-0619-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Gu Z, Eils R, Schlesner M. Complex heatmaps reveal patterns and correlations in multidimensional genomic data. Bioinformatics. 2016;32:2847–2849. doi: 10.1093/bioinformatics/btw313. [DOI] [PubMed] [Google Scholar]
- 82.Yu G, Wang LG, Han Y, He QY. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS. 2012;16:284–287. doi: 10.1089/omi.2011.0118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Jin S, et al. Inference and analysis of cell-cell communication using CellChat. Nat. Commun. 2021;12:1088. doi: 10.1038/s41467-021-21246-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Aibar S, et al. SCENIC: single-cell regulatory network inference and clustering. Nat. Methods. 2017;14:1083–1086. doi: 10.1038/nmeth.4463. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Van de Sande B, et al. A scalable SCENIC workflow for single-cell gene regulatory network analysis. Nat. Protoc. 2020;15:2247–2276. doi: 10.1038/s41596-020-0336-2. [DOI] [PubMed] [Google Scholar]
- 86.Stuart T, Srivastava A, Madad S, Lareau CA, Satija R. Single-cell chromatin state analysis with Signac. Nat. Methods. 2021;18:1333–1341. doi: 10.1038/s41592-021-01282-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Granja JM, et al. Single-cell multiomic analysis identifies regulatory programs in mixed-phenotype acute leukemia. Nat. Biotechnol. 2019;37:1458–1465. doi: 10.1038/s41587-019-0332-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Zhang Y, et al. Model-based analysis of ChIP-Seq (MACS) Genome Biol. 2008;9:R137. doi: 10.1186/gb-2008-9-9-r137. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Schep AN, Wu B, Buenrostro JD, Greenleaf WJ. chromVAR: inferring transcription-factor-associated accessibility from single-cell epigenomic data. Nat. Methods. 2017;14:975–978. doi: 10.1038/nmeth.4401. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Granja JM, et al. ArchR is a scalable software package for integrative single-cell chromatin accessibility analysis. Nat. Genet. 2021;53:403–411. doi: 10.1038/s41588-021-00790-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Bentsen M, et al. ATAC-seq footprinting unravels kinetics of transcription factor binding during zygotic genome activation. Nat. Commun. 2020;11:4267. doi: 10.1038/s41467-020-18035-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Stuart T, et al. Comprehensive Integration of Single- Cell Data. Cell. 2019;177:1888–1902.e1821. doi: 10.1016/j.cell.2019.05.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The raw sequencing data generated in this study have been deposited in the Genome Sequence Archive for Human (GSA-Human) under accession code HRA002757. The data in GSA are available under restricted access for privacy protection, access can be obtained by contacting Tao Xu (xutao@ibp.ac.cn). China’s Ministry of Science and Technology has approved the export of raw sequencing data (approval # 2023BAT1021). The processed gene expression matrix for scRNA-seq and Tn5 fragment files and filtered peak-barcode matrix for scATAC-seq data are available at OMIX database under accession code OMIX001616. Source data are provided with this paper. Other published datasets we used in this study could be obtained from GSE115931, GSE139627 and OMIX236. Source data are provided with this paper.
The code for scRNA-seq and scATAC-seq analysis during this study is available at Github: https://github.com/zhuoma888/fetal_pancreas.