Single-cell co-mapping reveals relationship between chromatin state and gene expression in early zebrafish development

Vivek Bhardwaj; Alberto Griffa; Helena Viñas Gaza; Peter Zeller; Alexander van Oudenaarden

doi:10.7554/eLife.110400

. 2026 Apr 21;15:RP110400. doi: 10.7554/eLife.110400

Single-cell co-mapping reveals relationship between chromatin state and gene expression in early zebrafish development

Vivek Bhardwaj ^1,^2,^†,^‡,^✉, Alberto Griffa ^1,^2,^†, Helena Viñas Gaza ^1,², Peter Zeller ^1,², Alexander van Oudenaarden ^1,^2,^✉

Editors: H Efsun Arda³, Didier YR Stainier⁴

PMCID: PMC13099137 PMID: 42011025

Abstract

Establishing a cell type-specific chromatin landscape is crucial for the maintenance of cell identity during embryonic development. However, our knowledge of how this landscape is set during vertebrate embryogenesis has been limited, due to the lack of methods to jointly detect chromatin modifications and gene expression in the same cell. Here we present a multimodal measurement of full-length transcriptome and histone modifications in individual cells during early embryonic development in zebrafish. We show that before the formation of germ layers, the chromatin and transcription states of cells are uncoupled and become progressively connected during gastrulation and somitogenesis. Silencing of developmental genes is achieved by local spreading of repressive chromatin together with cell type-specific demethylation. Combining transcription factor (TF) expression and chromatin states within an interpretable machine learning model, we classify TFs as lineage-specific activators and repressors and identify a subset of TFs that are epigenetically regulated. Altogether, our data resolves the dynamic relationship between chromatin and transcription during early vertebrate development and clarifies how these two layers interact to establish cell identity.

Research organism: Zebrafish

Introduction

Early embryonic development in animals is characterized by the controlled movement and positioning of cells, establishment of a body plan, and specification of tissue-specific cell states. While the spatial gradients of morphogens dominate the former two events (Xu et al., 2014), the maintenance of cell identity is believed to be mainly regulated by chromatin state (Bogdanović et al., 2012). Similar to the morphogens that regulate the patterning of an embryo, the chromatin state can also be transgenerationally inherited (Fitz-James and Cavalli, 2022). This might play an important role in predefining the spatiotemporal expression of genes during early development and regulate cell fates. The relationship between chromatin state and gene expression has been studied using whole-genome assays applied to cultured cell populations, whole tissues, or enriched cell populations sorted using cell surface or transgenic markers (Abascal et al., 2020; Kundaje et al., 2015). Recently, chromatin and DNA methylome mapping techniques have been developed to resolve cellular heterogeneity of epigenetic states at the level of individual cells. Major progress has been made using DNA methylome profiling of single cells, which have mostly been applied to study adult tissues (Bai et al., 2025; Liu et al., 2021; Nichols et al., 2022). We and others have applied single-cell methods to study chromatin states in adult tissues (Bartosovic et al., 2021; Cheung et al., 2018; Wu et al., 2021; Zeller et al., 2023). However, genome-wide studies mapping temporal chromatin changes of single cells during early embryogenesis are rare (Argelaguet et al., 2019; Clark et al., 2022; Fu et al., 2025; Guo et al., 2017; Liu et al., 2025; Zhao et al., 2022). This leaves a gap in our understanding of the process of establishment and propagation of cell type-specific chromatin states during early embryonic development.

In this study, we asked how the active and silenced chromatin states of cells are shaped during early vertebrate embryogenesis, using zebrafish as a model system. As the chromatin and gene expression are highly dynamic in embryos across cells, bulk assays would average out these biologically important differences. Similarly, a single-cell assay profiling either chromatin or transcriptome alone cannot measure how these two layers interact with each other during development in a single cell. Therefore, we applied our single-cell co-mapping assay T-ChIC (Zeller et al., 2024) to jointly profile the genome-wide active and silencing histone modifications together with full-length transcriptome from the same single cells during early zebrafish development (4–24 hpf). Using this data, we infer continuous developmental trajectories and ask how the chromatin state correlates with the expression of transcription factors and other developmentally important genes during cell fate commitment.

Results

Paired profiling of histone modifications and transcriptome of single cells during zebrafish embryogenesis

We recently developed a single-cell multi-omics method, termed T-ChIC (transcriptome and chromatin immuno-cleavage), which extends the previously described sortChIC (Zeller et al., 2023) and VASA-seq (Salmen et al., 2022) protocols, by integrating them in a single workflow (Zeller et al., 2024). This allows us to quantify the pattern of histone modifications at kilobase resolution, while simultaneously providing full-length transcriptome coverage in single cells. To apply T-ChIC to study multiple time points across early zebrafish development, we extended this protocol with an optimized cell dissociation and sample multiplexing strategy that allows collection of embryos from different time points of an experiment while reducing batch effects (Figure 1a, ‘Materials and methods’). We applied this modified workflow, termed ‘whole-organism T-ChIC’ (woT-ChIC), to quantify the polycomb complex-mediated histone mark, H3K27me3, in zebrafish embryos collected at six selected time points post-fertilization, obtaining a total of 18,432 cells. This dataset provides us with complete coverage of gastrulation (4, 6, 8, 10 hpf), along with the beginning and the end of somitogenesis (12 and 24 hpf, respectively, Figure 1b). We produced the woT-ChIC dataset in four independent biological replicates, along with two additional replicates that contained cells without a functional antibody, to validate data quality (Supplementary file 1). This subset (labeled ‘T-noChIC’) showed a similar number of detected transcripts and was co-clustered with woT-ChIC cells, confirming that the transcriptome quality of woT-ChIC is independent of the ChIC fraction (Figure 1—figure supplement 1a and b). After removing cells with low numbers of MNase cuts and potentially over-fragmented cells, we observed a strong enrichment of chromatin signal over specific genomic regions (Figure 1—figure supplement 1d and e).

Figure 1. — (a) woT-ChIC experimental workflow: (1) single cells from different time points are labeled with a combination of CellTracer dyes and permeabilized either to retain cytoplasmic RNA (whole-cell), or exclude it (nuclei) before antibody + pA-MNase incubation. (2) Cells are pooled and sorted in 384-well plates with position-indexed RNA and ChIC barcodes. Addition of Ca⁺⁺ activates the MNase to cut on the target regions. (3) DNA and RNA fragments are ligated and repaired before re-pooling them for IVT and PCR amplification to produce sequencing libraries. (b) UMAP projections of single cells using signals from H3K27me3 (left) and transcriptome (right) and colored by timepoints. The six sampled time points (right) are pooled into three groups (early/middle/late) based on the complexity of H3K27me3 signal. (c) Single-cell track plot showing signal on the 450 kb region around the *hoxc* gene cluster. The heatmaps show signals (read counts) in single cells (capped from -1 to +1, where -ve signal shows RNA counts and +ve signal shows ChIC counts). The coverage tracks on top show the pseudo-bulk signal (blue: RNA, pink: H3K27me3). Publicly available bulk H3K27me3 datasets are shown for comparison (gray). (d) UMAP projections (based on transcriptome signal) showing gene-level normalized signal for RNA (blue) or H3K27me3 (pink) on two selected genes.

Figure 1—figure supplement 1. — (a) woT-ChIC experimental workflow: (1) single cells from different time points are labeled with a combination of CellTracer dyes and permeabilized either to retain cytoplasmic RNA (whole-cell), or exclude it (nuclei) before antibody + pA-MNase incubation. (2) Cells are pooled and sorted in 384-well plates with position-indexed RNA and ChIC barcodes. Addition of Ca⁺⁺ activates the MNase to cut on the target regions. (3) DNA and RNA fragments are ligated and repaired before re-pooling them for IVT and PCR amplification to produce sequencing libraries. (b) UMAP projections of single cells using signals from H3K27me3 (left) and transcriptome (right) and colored by timepoints. The six sampled time points (right) are pooled into three groups (early/middle/late) based on the complexity of H3K27me3 signal. (c) Single-cell track plot showing signal on the 450 kb region around the *hoxc* gene cluster. The heatmaps show signals (read counts) in single cells (capped from -1 to +1, where -ve signal shows RNA counts and +ve signal shows ChIC counts). The coverage tracks on top show the pseudo-bulk signal (blue: RNA, pink: H3K27me3). Publicly available bulk H3K27me3 datasets are shown for comparison (gray). (d) UMAP projections (based on transcriptome signal) showing gene-level normalized signal for RNA (blue) or H3K27me3 (pink) on two selected genes.

Early zebrafish embryos contain a high load of maternal transcripts required for early embryonic development, which are temporally replaced with newly transcribed, zygotic RNA (Fishman et al., 2023). Consistent with this transition, we observed a substantial decrease in unique fragment counts from spliced reads compared to unspliced reads with developmental time (Figure 1—figure supplement 1f). Despite these dynamics, our overall number of detected genes with both spliced and unspliced counts is higher than previously reported in scRNA-seq studies (Farrell et al., 2018; Wagner et al., 2018) due to the increased sensitivity and full-length RNA recovery (Supplementary file 2). Moreover, with our total RNA profiling approach, we were also able to detect developmentally important non-coding RNAs such as miR-430 (in pluripotent cells) known to be critical for clearance of maternal RNA in zebrafish (Giraldez et al., 2006; Liu et al., 2020), and miR-124 (in neural ectoderm), a known regulator of neuronal differentiation which is conserved across species (Gourishetti et al., 2023; Figure 1—figure supplement 1c). At the chromatin level, we observed increased MNase cuts (representing H3K27me3 signal) per cell as development progresses (Figure 1—figure supplement 1g). This corroborates previous observations of increasing H3K27me3 abundance during development based on bulk chromatin assays (de la Calle Mustienes et al., 2015; Vastenhouw et al., 2010). Overall, we retained 9275 cells with both transcriptome and H3K27me3 signal for further analysis.

We divided our data into early (4–6 hpf), intermediate (8, 10, 12 hpf), and late (24 hpf) time points, and compared our H3K27me3 signal with publicly available datasets at the corresponding time points (Figure 1c, ‘Materials and methods’). While our pseudo-bulk H3K27me3 profiles showed a high genome-wide correlation with publicly available bulk ChIP data from matched time points (Figure 1—figure supplement 2a), the analysis of genomic bins ranked by H3K27me3 signal shows improved signal enrichment of our data relative to the publicly available bulk sequencing data at a comparable sequencing depth (Figure 1—figure supplement 2b). Moreover, H3K27me3 levels show a clear relationship with the silencing of associated genes in single cells at all time points (Figure 1d, Figure 1—figure supplement 2c). We observed high H3K27me3 levels associated with the silencing of gene expression as early as 4 hpf for hoxc3a, involved in anterior-posterior patterning, as well as silencing of genes such as pcdh1b late during development (Figure 1d, Figure 1—figure supplement 2c). Furthermore, we also observed a transient association of H3K27me3 on genes. For example, rfx4, expressed in the central nervous system and neural rod, was silenced in non-neural ectoderm cells by H3K27me3 during gastrulation (Figure 1—figure supplement 2c and d). These results suggest that our data allows us to gain quantitative insight into the relationship between H3K27me3 and gene expression during development.

Spatiotemporal spreading of H3K27me3 associates with the silencing of gene expression during development

To annotate cell types in our data, we performed Leiden clustering of cells using their gene expression signal, followed by canonical correlation analysis of gene expression with that of a previously published time-course scRNA-seq data set (Wagner et al., 2018; Figure 2—figure supplement 1a, ‘Materials and methods’). Virtually all our cells matched one of the annotated cells from Wagner et al. with high confidence, allowing successful label transfer into our data (Figure 2—figure supplement 1b and c). We further refined these labels based on cell ontologies from the Zebrafish Information Network (Bradford et al., 2022), to categorize our cells into 34 cell types (Figure 2—figure supplement 1e). Cell type proportions were consistent between the biological replicates of woT-ChIC (Figure 2—figure supplement 1f). To get a fine-grained view of cellular heterogeneity while reducing signal dropouts, we aggregated cells that are transcriptionally similar to each other into ‘metacells’ (Persad et al., 2023; Figure 2—figure supplement 1d, ‘Materials and methods’). Interestingly, most of the cell types annotated based on their gene expression profiles also show a clear separation based on their H3K27me3 enrichment as early as 8–12 hpf, suggesting that distinct, cell-state-specific H3K27me3 patterns already start to appear during gastrulation (Figure 2a).

Figure 2. — (a) UMAP projection of cells based on H3K27me3 signal (same as Figure 1b, left) indicating the different cell types annotated using the transcriptome signal. (b) An example genomic locus that demonstrates the cis-spreading of H3K27me3 signal (pink) around the *zic3* gene with time during development. Note that apart from the *zic3* gene, the spreading also correlates with a downregulation of the expression (blue) for the nearby gene (*rbmx*) with time. (c) Single-cell heatmap. Each row is a single cell selected from ectoderm lineage (34.5% of all cells) showing the average H3K27me3 signal for the top 100 genes detected by the linear model, showing increase in spreading of signal at the 100 kb region surrounding the center bin with pseudotime. The center bin was identified as the bin with non-zero signal in pluripotent cells. PT = pseudotime, RT = real time. (d) Line plots comparing H3K27me3 spreading and gene expression with pseudotime. The bottom panel shows the average fraction of bins that show H3K27me3 signal on two sets of genes (spreading, non-spreading) with time, while the top panel shows the average gene expression of these gene sets along pseudotime. (e) Heatmap of H3K27me3 signals across cells (grouped by cell type, top) for genes with cell type-specific demethylation at 24hpf. Genes (rows) are grouped by celltype in which they are demethelyated. CT = cell type; PT = pseudotime. The analyzed cell types are indicated in the right legend, while top legend shows all cell types (colors same as a). (f) Correlation of H3K27me3 fold-change with change in gene expression, for the selected cell types. Top 5 genes with H3K27me3 loss are labeled per cell type.

Figure 2—figure supplement 1. — (a) UMAP projection of cells based on H3K27me3 signal (same as Figure 1b, left) indicating the different cell types annotated using the transcriptome signal. (b) An example genomic locus that demonstrates the cis-spreading of H3K27me3 signal (pink) around the *zic3* gene with time during development. Note that apart from the *zic3* gene, the spreading also correlates with a downregulation of the expression (blue) for the nearby gene (*rbmx*) with time. (c) Single-cell heatmap. Each row is a single cell selected from ectoderm lineage (34.5% of all cells) showing the average H3K27me3 signal for the top 100 genes detected by the linear model, showing increase in spreading of signal at the 100 kb region surrounding the center bin with pseudotime. The center bin was identified as the bin with non-zero signal in pluripotent cells. PT = pseudotime, RT = real time. (d) Line plots comparing H3K27me3 spreading and gene expression with pseudotime. The bottom panel shows the average fraction of bins that show H3K27me3 signal on two sets of genes (spreading, non-spreading) with time, while the top panel shows the average gene expression of these gene sets along pseudotime. (e) Heatmap of H3K27me3 signals across cells (grouped by cell type, top) for genes with cell type-specific demethylation at 24hpf. Genes (rows) are grouped by celltype in which they are demethelyated. CT = cell type; PT = pseudotime. The analyzed cell types are indicated in the right legend, while top legend shows all cell types (colors same as a). (f) Correlation of H3K27me3 fold-change with change in gene expression, for the selected cell types. Top 5 genes with H3K27me3 loss are labeled per cell type.

Next, we asked how the global H3K27me3 landscape is established in the cells during lineage commitment. We observed that with time, the number of genomic bins with H3K27me3 signal increased. In contrast, the average signal in detected bins plateaued at 24 hpf, indicating new regions acquiring H3K27me3 signal instead of enrichment of signal on pre-marked regions (Figure 2—figure supplement 2a). Therefore, we asked whether this increase in signal comes from a de novo gain in H3K27me3 or as a result of spill-over (indicating ‘cis-spreading’) of increasing H3K27me3 density from previously enriched regions. At least a subset of enriched regions in pluripotent cells displays a cis-spreading of signal with differentiation, covering developmentally important genes such as the zic locus (Figure 2b). To quantify cis-spreading genome-wide, we first subsetted the genomic bins, which had signal in at least 5% of filtered Pluripotent cells (at 4 hpf). Apart from tightly repressed genes such as gata3, nr2f1a, six3b, pax9a, foxc1b, zic1/4, hox clusters, and the pcdh1/2 cluster with a broadly distributed signal, all other H3K27me3 signal was localized within 5 kb bins and the majority (65%) of these bins overlapped with a promoter region. We then calculated the signal on these bins compared to the background signal (averaged over 100 kb region) surrounding these bins in single cells (‘Materials and methods’). These two signals correlated positively for about 30% of the 5 kb bins, suggesting a spillover from the main signal peak to the surrounding background. In contrast, the remaining 70% displays a low correlation with background indicating a localized enrichment without spill-over to the surrounding background (Figure 2c, Figure 2—figure supplement 2b).

We confirmed this enrichment with an alternative approach based on domain calling on the pooled, pseudo-bulk dataset (‘Materials and methods’). While wider H3K27me3 domains detected on the pooled data correspond to the signal at 24 hpf, the sharper subpeaks within those domains were observed at early (4–6 hpf) time points (Figure 2—figure supplement 2d). Relatively mature cell types, particularly from the neural ectoderm (such as differentiating neurons), show a higher correlation of subpeaks to the background, suggesting a wider spread in signal (Figure 2—figure supplement 2e). We further stratified this signal in search of bins with a significant difference between lineages (‘Materials and methods’). We only detected a handful of bins with statistically significant differences between lineages, and the mean H3K27me3 signal indicated that these results are not robust (Figure 2—figure supplement 2c). Therefore, the spread of H3K27me3 signal does not appear to be lineage-specific. To test whether this characteristic might be conserved across species, we re-analyzed a previously published ChIP-seq dataset of mouse embryos (Xiang et al., 2020; ‘Materials and methods’). Comparing H3K27me3 signals at and around gene promoters between E5.5 epiblast and post-gastrulation ectoderm lineage, we see the evidence of signal spreading from a subset of these sites (Figure 2—figure supplement 2f), suggesting that the spread of H3K27me3 signal from promoters might be a conserved phenomenon.

To understand how this spread of H3K27me3 relates to gene expression in time, we plotted the expression of the ‘host’ gene (genes with promoter enrichment of H3K27me3 in pluripotent cells), and the ‘nearby’ genes (with promoter within 100 kb region) over single cells arranged in pseudotime (Figure 2d). Interestingly, the ‘host’ genes displayed an increased expression before the spread of H3K27me3 signal, followed by silencing post-spreading (Figure 2d). In contrast, the ‘nearby’ genes displayed relatively smaller changes in transcription during this process but were also downregulated at a later stage (Figure 2—figure supplement 2g). To identify genes whose expression is silenced as a result of spreading of H3K27me3, we applied linear regression to predict gene expression as a function of H3K27me3 density (defined as the number of reads per kb) on their nearest, or overlapping domains in metacells (‘Materials and methods’). Silenced genes showed a strong negative correlation of H3K27me3 density with their expression, with the strongest targets being hox and pcdh1 gene clusters (Figure 2—figure supplement 2h).

Considering most genes seem to be in the process of gaining H3K27me3 until 24 hpf, we asked whether there is a subset of genes that lose H3K27me3. For this, we performed a differential H3K27me3 signal analysis for each cell type at 24 hpf stage compared to cell types from earlier stages and selected genes with a significant loss of H3K27me3 in specific cell types (log2-FC < –1, FDR < 0.05). We identified 265 genes across 10 cell types, with most genes being detected in periderm and differentiating neurons (Figure 2e, Supplementary file 3). For almost all of these genes, we observed that the H3K27me3 was specifically lost in their cell type of origin, while being gained in almost all other cell types compared to pluripotent cells, suggesting H3K27me3 loss is a cell type-specific process active during development. This loss was also proportional to a change in gene expression signal in those cell types and affected key developmental genes associated with these cells, such as POU family of transcription factors (mid/hindbrain), tet3 and tet2 (neurons/optic cup), myod1 (muscle), and tal1 (endothelium) (Figure 2f). This list also included many of the significant genes from our regression analysis which showed cell type-specific expression at 24 hpf, such as gata2a, dlx3b, shha, and tet2 (Figure 2—figure supplement 2i).

Overall, our analysis shows that for a specific set of genes, silencing of gene expression is achieved once sufficient gene-body H3K27me3 coverage is achieved via cis-spreading, a process seemingly uncoupled with the prior transcription state of these genes. Further, we see that H3K27me3 demethylation can occur later in development as key developmental genes are re-activated in their corresponding cell types in a cell type-specific manner.

Global chromatin state of cells is decoupled from gene expression during early development

Considering the heterogeneity in the repressive chromatin landscape of cells observed as early as gastrulation, we asked how the interplay between active and silenced chromatin is established at this stage. To map the active chromatin, we focused on H3K4me1, a histone modification associated with active and poised enhancers and promoters, which, unlike H3K27me3, has been observed before zygotic genome activation (ZGA) in zebrafish (Murphy et al., 2018). To mitigate the interference of maternally contributed RNA, we implemented a new cell preparation protocol within woT-ChIC (‘Materials and methods’), which leads to the expulsion of cytoplasmic RNA from the cells (hereafter referred to as ‘nuclei’ batch). We generated woT-ChIC data for H3K4me1 at 4, 6, 8, 10, and 12 hpf. As expected, our nuclei dataset shows a fourfold higher ratio of unspliced RNA compared to spliced RNA, and an overall lower number of detected genes compared to the whole-cell data, in line with the expected lack of spliced maternal RNA in the nuclei (Figure 3—figure supplement 1a and b, Supplementary file 2). The chromatin quality was unaffected, exemplified by the similar number and pattern of H3K27me3 MNase cuts with time from the ‘nuclei’ and ‘whole cell’ batch (Figure 3—figure supplement 1c). Finally, we integrated our nuclei dataset with the 4–12 hpf subset of the whole-cell H3K27me3 woT-ChIC dataset, creating a high-quality multi-omic dataset of 15,961 cells (H3K27me3: 9197, H3K4me1: 6764) covering zebrafish gastrulation (Figure 3a, Figure 3—figure supplement 1d and e, ‘Materials and methods’).

Figure 3. — (a) UMAP projection of the single cells based on the three modalities H3K27me3 (left), RNA (center), and H3K4me1 (right) after integration of the two batches and annotation of cells. (b) Total UMI counts per cell on H3K4me1 enriched regions from the integrated dataset, divided into H3K4me1-unique regions and regions co-enriched for H3K27me3. H3K4me1 signal (top panels) on these regions remains unchanged with time, while H3K27me3 signal (bottom panels) increases. (c) Correlation between histone modification signal over gene bodies and gene expression, per metacell. Metacells are ordered by latent time (X-axis) and the Pearson correlation coefficient (Y-axis) between H3K4me1 and RNA (top) and H3K27me3 and RNA (bottom). (d) Scatterplot showing H3K4me1 signal and unspliced RNA counts for all genes of the two selected metacells indicated in (c), early (1) and late (2) in latent time. Colors corresponds to the germ layer annotation of the metacell, as indicated in (c).

Figure 3—figure supplement 1. — (a) UMAP projection of the single cells based on the three modalities H3K27me3 (left), RNA (center), and H3K4me1 (right) after integration of the two batches and annotation of cells. (b) Total UMI counts per cell on H3K4me1 enriched regions from the integrated dataset, divided into H3K4me1-unique regions and regions co-enriched for H3K27me3. H3K4me1 signal (top panels) on these regions remains unchanged with time, while H3K27me3 signal (bottom panels) increases. (c) Correlation between histone modification signal over gene bodies and gene expression, per metacell. Metacells are ordered by latent time (X-axis) and the Pearson correlation coefficient (Y-axis) between H3K4me1 and RNA (top) and H3K27me3 and RNA (bottom). (d) Scatterplot showing H3K4me1 signal and unspliced RNA counts for all genes of the two selected metacells indicated in (c), early (1) and late (2) in latent time. Colors corresponds to the germ layer annotation of the metacell, as indicated in (c).

With our integrated dataset, we first asked how the global chromatin state of the cells changes with time. Comparing total MNase cuts for H3K4me1 and H3K27me3 with time, we observed that while the H3K27me3 signal globally increases in cells with time, the H3K4me1 signal decreases (Figure 3—figure supplement 1c). To understand whether this global change stems from a change in the activity of cis-regulatory elements (CREs), we separated the data into H3K4me1 enriched regions (representing active or poised promoters and enhancers), H3K27me3-enriched regions (mostly observed near genes/promoters in earlier analysis), and other (mostly intergenic) regions. While the majority (84%) of H3K27me3-enriched regions were found to overlap with an H3K4me1 domain and show increasing H3K27me3 signal with time, this increase is not accompanied by a decrease in H3K4me1 on these regions (Figure 3b, Figure 3—figure supplement 2a). Instead, the decrease in signal was observed in a minor fraction of H3K27me3-unique sites, and random genomic regions away from enriched sites (Figure 3—figure supplement 2b), suggesting that this global change in signal does not represent a change in CRE activity. Further, the ratio of H3K4me1 to H3K27me3 suggests that most promoters remain in a ‘bivalent’ chromatin state during 4–12 hpf in all germ layers, with a small fraction showing increased H3K4me1 activity in any specific germ layer (Figure 3—figure supplement 2c).

To obtain a more fine-grained view of cellular differentiation time and lineages on our integrated data, we took advantage of the high unspliced counts from our protocol. We applied the RNA velocity model (La Manno et al., 2018), which uses the ratio between spliced and unspliced reads of genes to obtain the cell’s differentiation path and assigns a ‘latent time’ to the cell, indicating their differentiation stage (Figure 3—figure supplement 3a–e). We then asked how the change in the cell’s chromatin state relates to the transcription of genes during their differentiation. For this, we aggregated transcriptionally similar cells into metacells and correlated the H3K4me1 and H3K27me3 signals with unspliced (i.e., newly transcribed) RNA signals for all genes in each metacell. Interestingly, we observed that the correlation between the gene-body H3K4me1 and transcription increases with the average latent time of a metacell (Figure 3c and d). Promoter regions, however, did not show this trend (Figure 3—figure supplement 2d). For H3K27me3, the global signal was mostly uncorrelated with transcription on both promoters and gene bodies (Figure 3c, Figure 3—figure supplement 2d). This suggests that despite increasing heterogeneity of chromatin signal, the overall chromatin state of a cell is decoupled from its transcriptional state during early development, and this coupling increases as the cells mature.

The chromatin state of binding sites predicts the function of transcription factors during gastrulation

We next asked whether our integrated dataset could inform us about the regulation of transcription factor (TF) networks and their role in lineage specification. While many lineage-defining TFs are biochemically predicted to have both activation and silencing functions, we hypothesized that the level of H3K4me1 on transcription factor binding sites (TFBS) might indicate which function the TFs play in a cell. For example, if a TF expression is correlated to a gain in H3K4me1 on its binding sites, it could indicate its role as a transcriptional activator, while a loss in H3K4me1 on TFBS might indicate a silencing function in that cell. Further, if a TF function is epigenetically regulated, then the chromatin state of the TF itself would also be predictive of its function. Based on this idea, we built a prediction model that combines the chromatin state and expression of TFs with their H3K4me1 activity on TFBS within cells (‘Materials and methods’). With a combined model, we aim to classify TFs based on both their own regulation via chromatin state (regulated/independent), as well as their action on their targets (activator/repressor) (Figure 4a).

Figure 4. — (a) Schematic and outputs of the prediction model. (1) (top) Normalized H3K4me1 and H3K27me3 signal on the TF locus, TF (spliced) RNA signal, and indicators of cell state (pseudotime and lineage) are used to predict TF activity (bottom). (2) The lasso regression model is used to select the most useful predictors for each TF. The bottom right plot shows the TFs sorted by the R² values on the independent test dataset. (3) Coefficients from the final models are ranked and compared to categorize the TFs. The number of TFs classified as activator/repressor, or regulated/independent is shown in the right plot. (b) UMAPs showing the expression and motif activities of TFs classified as ‘activators’ or ‘repressors’ using the above model. TF expression (left) is based on (normalized) spliced RNA and TF activity (right) is based on H3K4me1 signal on TFBS. For activators, the motif activities on TFBS are gained in cells where the TFs are expressed, while for the repressors, the motif activities are lost in the cells expressing the TF. (c) UMAPs show (normalized) H3K4me1 signal and H3K27me3 signal on TF gene body, for TFs classified as ‘regulated’ or ‘independent’. The histone modifications on regulated TFs are correlated with their activities.

Figure 4—figure supplement 1. — (a) Schematic and outputs of the prediction model. (1) (top) Normalized H3K4me1 and H3K27me3 signal on the TF locus, TF (spliced) RNA signal, and indicators of cell state (pseudotime and lineage) are used to predict TF activity (bottom). (2) The lasso regression model is used to select the most useful predictors for each TF. The bottom right plot shows the TFs sorted by the R² values on the independent test dataset. (3) Coefficients from the final models are ranked and compared to categorize the TFs. The number of TFs classified as activator/repressor, or regulated/independent is shown in the right plot. (b) UMAPs showing the expression and motif activities of TFs classified as ‘activators’ or ‘repressors’ using the above model. TF expression (left) is based on (normalized) spliced RNA and TF activity (right) is based on H3K4me1 signal on TFBS. For activators, the motif activities on TFBS are gained in cells where the TFs are expressed, while for the repressors, the motif activities are lost in the cells expressing the TF. (c) UMAPs show (normalized) H3K4me1 signal and H3K27me3 signal on TF gene body, for TFs classified as ‘regulated’ or ‘independent’. The histone modifications on regulated TFs are correlated with their activities.

Our model predicted the H3K4me1 activity at TFBS with high accuracy (R² > 0.6) for 45 TFs. Our classification captured the well-established developmental function of TFs, such as the activating function of tbx16 in regulating paraxial mesoderm formation (Payumo et al., 2016), and that of tfap2a, a transcriptional activator shown to be important for neural crest induction (Dooley et al., 2019; Figure 4b). Further, it helped resolve the cell type-specific functions of TFs predicted to be activators or repressors based on their protein domains (Figure 4—figure supplement 1a, Supplementary file 4). For example, zbtb16a, predicted to have a DNA-binding transcriptional repressor activity, and zeb1a, speculated as a context-dependent activator/repressor (Gheldof et al., 2012), were both revealed as a repressor during neural ectoderm (hindbrain) specification. Next, we asked whether the gain/loss of H3K4me1 activity is reflected in the gene transcription, measured as a change in nascent (unspliced) transcripts on the genes nearest to the TFBS. For our top predicted activators and repressors, we observed the expected up and downregulation of average nascent (unspliced) RNA signal of the target genes, corresponding to the change in TF expression and activity (Figure 4—figure supplement 1b). This indicated that our model can capture new cell type-specific activation/repression functions of TFs during gastrulation. Additionally, our model also detects TFs that are epigenetically regulated (Figure 4c, Figure 4—figure supplement 1c, Supplementary file 4). For TFs such as sox13, tbx16, lhx1a, tfap2a, a gain of H3K4me1, or a loss of H3K27me3, or a combination of both was associated with their respective TFBS activity. This indicates that while the majority of TFs expressed early in development appear to be regulated by alternative mechanisms, the chromatin state could play an important role in establishing the gene expression memory for a subset of developmentally important TFs.

Discussion

In this study, we adopted our recently developed T-ChIC method (Zeller et al., 2024), to study the dynamics of active (H3K4me1) and silencing (H3K27me3) histone modifications during early zebrafish development. We observe a dynamic spatiotemporal localization of these histone modifications, previously unresolved by bulk chromatin profiling assays. This data allows for a direct comparison of the chromatin state and the expression of genes in individual cells and helps to understand the role of this interaction in regulating cell fates during early development.

We observe that H3K27me3 shows a promoter-anchored spread during development. At the start of differentiation, selected genomic loci with multiple promoters are pre-marked with a broadly distributed H3K27me3 (such as the hox and pcdh1 loci), while loci with single promoter (such as pax7b) appear as focused H3K27me3 domains. A recent study has shown that such pre-marking is established by a non-canonical interaction between the two polycomb (PcG) complexes, PRC1 and PRC2 (Hickey et al., 2022). Here, we find that a selected set of these loci shows the spreading of H3K27me3 with differentiation, which eventually confers the silencing of host genes. A recent study using mouse embryonic stem cells proposes nucleation and spreading as a way to maintain PcG silencing (Veronezi and Ramachandran, 2024). Based on our observations, we propose that this mechanism could also help to propagate the spread of silencing during development. While this spread appears to be mostly not lineage-specific, we do observe a cell type-specific demethylation of many genes which are developmentally important for cell type specification, suggesting that the regulation of developmental genes via H3K27me3 could be established through a lineage-agnostic spread, followed by cell type-specific demethylation. This could likely be a fundamental mechanism to establish a cell type-specific gene expression memory via the polycomb complex considering the silencing of developmental genes in ‘alternate’ lineages is a conserved function of H3K27me3 (Guo et al., 2021). We see that in the absence of H3K27 demethylation, important developmental genes such as hox, pax, and shh genes are silenced in a spatiotemporal manner. A mislocalized expression of these known PcG targets has been observed after a deletion of the core PRC2 enzyme, ezh2 (San et al., 2016; Yette et al., 2021). Apart from confirming these known targets, we additionally identify developmental genes such as rfx4, important for neural tube formation (Sedykh et al., 2018), dlx3b, important for placode development (Esterberg and Fritz, 2009), among others, as novel PcG targets.

Although a rather large number of gene promoters are marked by H3K27me3 in pluripotent cells, only a minority of these show a genomic spread and silencing of the genes. By comparing the H3K4me3 to H3K27me3 signal on promoters, we find that most of these promoters are co-marked at 4–12 hpf, suggesting that they might serve as a ‘placeholder’ for activation or silencing later in development. This provides an explanation to why the cis-spreading, but not the promoter enrichment of H3K27me3 is linked to gene silencing during development.

In line with previous studies (Kaaij et al., 2016; Murphy et al., 2018), we find that H3K4me1 is widespread in the genome of pluripotent cells, marking a large number of TF motifs and other genomic regions. While this chromatin mark systematically disappears in regions outside of cis-regulatory elements (CREs) during development, its activity on the CREs does not show a monotonous change with time. In fact, a systematic increase in H3K27me3 without a loss of H3K4me1 leads to a bivalent chromatin state on most CREs, together with a lineage-specific gain or loss of H3K4me1 on selected CREs. We show that these changes in H3K4me1 levels can be leveraged to predict the lineage-specific activator or repressor functions of TFs, by correlating this activity with the TF’s own expression and chromatin states. Using this approach, we find novel functions of TFs in lineage specification, such as the role of zbtb16a/b, zeb1a/b as negative regulators during ectoderm specification, and the tfap2a/b as a positive driver of non-neural ectoderm during gastrulation. We also find selected lineage-specifying TFs such as zfhx3, foxc1a, and irx3a whose activity seems to be regulated by their own chromatin state during gastrulation. These results might point to a new pathway through which the chromatin states of the cells play a role in specifying cell fates, that is, by establishing a transcriptional memory on key lineage regulators.

Overall, comparing the active and silenced chromatin states of cells, we observe that the active state is a better predictor of a cell’s functional (transcriptomic) state in early development. A caveat is that we have not mapped other important silencing chromatin states, such as H3K9me3 or DNA methylation, which may show complementary dynamics in early development. We also see that both active and silencing states are rather uncoupled from transcription in pluripotent cells and get correlated as the cells mature in development. Note that this maturation time is not necessarily the same as the developmental time (hpf) of the embryo, as the transcriptionally mature cells collected from early time points also show a high correlation of active chromatin state and transcription. Therefore, we propose that a correlation of chromatin and transcriptional state of cells could be a hallmark of cell identity formation during development. Future studies to systematically map the overall chromatin state of single cells and gene expression would further explain how cell fates are established during embryogenesis.

Materials and methods

Whole-organism T-ChIC of zebrafish embryos

The detailed, step-by-step woT-ChIC protocol of zebrafish embryos (from embryo collection to the preparation of sequencing libraries) is available at: https://dx.doi.org/10.17504/protocols.io.q26g7pbe8gwz/v2. Below, we briefly describe the cell collection and staining steps used to produce this dataset.

Wild-type TL embryos were collected 20 minutes after fertilization in a Petri dish with E3 medium and kept at 28.5°C in an incubator. During the first hour, the unfertilized embryos were discarded. At the desired stage, embryos were dechorionated by incubation in 1 mg/mL of pronase, and 30–50 embryos were deyolked in Ca-free Ringer’s solution, pelleted, and washed with 500 µL of PBS + 10% FBS. For early time points (4, 6, and 8 hpf), cells were dissociated with the addition of 200 µL of pre-warmed FACSmax cell dissociation solution (Genlantis T200100) for 5 minutes resuspending gently up and down at room temperature (RT). For later time points (10, 12, and 24 hpf), cells were dissociated with the addition of 200 µL of pre-warmed Protease solution for 6 minutes on a shaker at 28°C and 400 rpm, resuspending gently every 2 minutes. After dissociation, cells were filtered with a 35 µL sieve (Corning, 352235) and washed with 500 µL of PBS + 10% FBS and resuspended in Wash Buffer 1 (WB1, described in the online protocol) and kept at +4°C before starting the CellTracer staining. For the ‘nuclei’ batch, WB1 was modified with 0.05% Saponin (Sigma, 47036-250G-F) instead of 0.05% Tween20. Cells were vortexed well and kept in the dark at +4°C for 20 minutes to stain with a combination of CellTrace dyes (Thermo Fisher C34570, Thermo Fisher C34573, Thermo Fisher C3457, and a combination of two of these). The staining was stopped with the addition of 70 µL of rat serum (Sigma-Aldrich, R9759-5ML) and a 5-minute incubation at RT. Lastly, cells were washed and resuspended in WB1 with spermidine solution (0.072 µL/mL) and 4 µL/mL 0.5 M EDTA. Once all time points had been stained with their appropriate dye/dyes combinations, they were pooled in a 0.5 mL protein-low binding tube with approximately 1 million cells in total. Cells were incubated overnight at 4°C with primary antibodies (1:200 H3K27me3 rabbit mAB, Cell Signalling #9733; 1:100 H3K4me1 polyclonal Ab, Thermo Fisher #710795). The next day, the cells are washed and incubated with pA-MNase in WB1 for 1 hour, washed, and sorted into indexed 384-well plates containing CelSeq2 adapters. Cells were incubated for 30 minutes at 4°C for MNase digestion and stopped with the stop solution before proceeding with the rest of the library preparation steps. The pA-MNase fusion protein was produced as described earlier (Schmid et al., 2004). Following T-ChIC library preparation and QC, the final DNA libraries are sequenced paired-end 100 bp, on either a NovaSeq or NextSeq2000, at a sequencing depth between 15 and 25 million reads per sample (384-well plate).

Processing and quality control of T-ChIC data

The first-in-pair reads from the T-ChIC protocol contain an RNA or ChIC barcode in the following format “RNA: 6N7X, ChIC: 3N8X”; where N=UMI nucleotide and X=Cell barcode nucleotide. We used a custom Python script to split the raw .fastq files into the ChIC and RNA fractions based on which one of the two barcode patterns is observed at the start. The two fractions are then independently mapped to the GRCz11/danRer11 genome. A complete processing workflow (from .fastq to count tables) with all parameters is available at https://github.com/bhardwaj-lab/scChICflow, copy archived at Bhardwaj and Sancho Gómez, 2025 (v 0.4) and is briefly described below.

The RNA fraction was trimmed using cutadapt (v2.1) (Martin, 2011) with parameters `-e 0.1 -q 16 -O 3 --trim-n --minimum-length 10 --nextseq-trim=16 A W{'10'}`, along with Illumina truseq barcodes provided as `-a and -b ` options. The trimmed reads are mapped to the genome using STAR (v 2.7.11) (Dobin et al., 2013), using the “StarSolo” mode, with these important parameters `--sjdbGTFfile <dr11_ens104.gtf> --outFilterIntronMotifs RemoveNoncanonical --soloCBmatchWLtype Exact --soloType CB_UMI_Simple`, where “dr11_ens104.gtf” refers to the ENSEMBL annotation version 104 (GRCz11) (Cunningham et al., 2022). Secondary and supplementary alignments and low-quality mappings (<MAPQ 255) were removed using samtools (v1.21) (Li et al., 2009) and reads were de-duplicated with UMI-tools (v.1.0.0) (Smith et al., 2017) using cell barcode and UMI position, along with options `--method unique --spliced-is-unique`. Coverage files were created using deepTools bamCoverage with CPM normalization (Ramírez et al., 2016).

For the ChIC fraction, barcodes were moved into the read header using UMI-tools extract (v.1.0.0). Reads were trimmed using cutadapt (v2.1) with parameters `-e 0.1 -O 5 -u 1 -u –2 -U –2 -a W{10} -A W{10} -q 30 --trim-n --minimum-length 20 --nextseq-trim=30`, along with illumina truseq barcodes provided as `-a and -b ` options. The trimmed reads are mapped to the genome using hisat2 (v2.2.1) (Kim et al., 2017), with parameters `--sensitive --no-spliced-alignment --no-mixed --no-discordant --no-softclip -X 1000`. Reads were de-duplicated with UMI-tools (v.1.0.0) using cell barcode and UMI position, along with options `--method unique --spliced-is-unique --soft-clip-threshold 2`. Quality control was performed using deepTools. Reads were counted on 50 kb windows in the genome using sincei (Bhardwaj and Mourragui, 2024).

Analysis of publicly available data

We downloaded the raw .fastq files for 6 hpf bulk CUTnRUN data of H3K27me3 from Akdogan-Ozdilek et al. (GSE178343) (Akdogan-Ozdilek et al., 2022), and raw .fastq files of 12 hpf and 24 hpf ChIP-seq data from the danio-code portal (accessions - 12 hpf H3K27me3: DCD003854SQ, 12 hpf H3K4me1: DCD003854SQ, 24 hpf H3K27me3: DCD003200SQ). All .fastq files were mapped to the GRCz11 genome using snakePipes’ DNA-mapping workflow, with parameters `--trim --fastqc --mapq 5 --dedup --bwBinSize 1000` (Bhardwaj et al., 2019). The de-duplicated BAM files were subsampled to match the sequencing depth of the corresponding pooled time points (early vs 6 hpf, middle vs 12 hpf, late vs 24 hpf), and the read coverage was compared using deepTools (Ramírez et al., 2016) multiBAMSummary (with parameter `-bs 50000`) and plotFingerPrint (with parameters `--skipZeros -bs 10,000-n 50000`). For the analysis of H3K27me3 signal in mouse embryos, we downloaded the H3K27me3 bedgraph files corresponding to stages: E5.5, Endoderm, Ectoderm, and Mesoderm by Xiang et al., 2020 (GSE125318), and plotted the signal over the +50 kb bins surrounding the mouse gene transcription start sites (mm9 genome) using deeptools ComputeMatrix (with additional parameters `-bs 500 --missingDataAsZero --skipZeros --maxThreshold 1000`) and plotHeatmap.

Cell clustering and annotation using RNA signal

For clustering of single cells based on RNA signal, we used the ‘spliced’ count matrices for ‘whole-cell’ T-ChIC data, and ‘total’ (spliced + unspliced + ambiguous) counts for ‘nuclei’ T-ChIC data. Filtering and clustering of cells were performed in scanpy (v1.9.1) (Wolf et al., 2018). We removed cells with `total_counts <1000, or n_genes_by_counts >10000, or pct_counts_in_top_100_genes >0.6`. We also removed cells with <70% of counts on protein-coding genes. We selected genes present in at least 1% of cells (or at least 50 cells, whichever is smaller) and selected the top 4000 variable genes based on their analytical Pearson residuals (Lause et al., 2021). We used the Pearson residuals to calculate principal components (PCs) and built a neighbor graph using 50 PCs and 30 neighbors (20 for nuclei data). We then used it to build a paga graph (Wolf et al., 2019) based on Leiden clusters (paga threshold = 0.1, leiden resolution = 1.5). For 2D representation, UMAPs were initiated with the paga graph, along with additional parameters `min_dist = 1, spread = 5` (spread = 1 for nuclei).

For the annotation of cell types and all other analyses, we calculated the normalized ChIC and RNA signal using the ‘shifted log transform’ method (`1/sqrt(alpha) log(4 * alpha * x+1)`) (Ahlmann-Eltze and Huber, 2023), with a fixed overdispersion (alpha) of 0.05 and total counts (“normed_sum”) as library size factors. For annotation of cells, we obtained the raw count matrices from Wagner et al., 2018 and subsetted for the 4, 6, 8, 10, 14, and 24 hpf timepoints (for the ‘nuclei’' batch, we also excluded 24 hpf). We normalized the counts in the same manner as our counts and selected the top 4000 variable genes (using `FindVariableFeatures(selection.method = “vst”)` in seurat). We then performed CCA-MNN analysis in Seurat using `FindTransferAnchors(method=”cca”)` and used the transfer score to predict labels for single cells. For top predicted labels for each cluster, we then manually confirmed the marker gene expression from ZFIN in the respective cluster, followed by renaming the cluster to suitable ZFIN cell ontology (Bradford et al., 2022).

Integrated analysis of nuclei and whole-cell data

To integrate the ‘nuclei’' and ‘whole-cell’ subsets of data, we merged the cells from the ‘nuclei’' batch with that of 4–12 hpf subset of the ‘whole-cell’ batch, resulting in 15,961 cells. We then removed genes with total spliced or unspliced counts <100, or genes detected in <100 cells, from the merged data, and calculated the top 4000 variable genes (HVGs) based on their analytical Pearson residuals (Lause et al., 2021) and used the intersection of the HVGs from the two batches (3091 genes) to perform PCA based on the Pearson residuals of the ‘unspliced’ counts from the two batches. Top 50 PCs were then used to align the two batches using harmony (Korsunsky et al., 2019). The harmony-corrected PCs were used for further analysis of the joint dataset (clustering, annotation, metacells, and RNA velocity).

For the calculation of latent time and lineages of cells on the integrated 4–12 hpf data, we combined the RNA velocity and diffusion pseudotime approach, using cellrank (v1.5.1) (Lange et al., 2022). We calculated RNA-velocity using the ‘dynamical’ model as described in the scVelo package (v0.2.4) (Bergen et al., 2020). The cell-specific moments Ms and Mu were calculated using the HVGs and PCs from the above analysis, and top 1000 genes were used for the dynamical model to calculate the gene-shared latent time and cell-specific velocities (scv.tl.recover_dynamics with parameters: fit_connected_states = False, max_iter = 50, t_max = 12, fit_basal_transcription = True, scv.tl.velocity with parameters: min_r2=0.2, groups_for_fit = <8–12 hpf>). The latent time was combined with connectivities using cellrank (cr.tl.transition_matrix parameter: weight_connectivities = 0.2), and one initial and six terminal states were calculated using cellrank’s GPCCA estimator.

Metacell analysis

To obtain a detailed view of cellular heterogeneity while reducing dropouts, we aggregated transcriptionally similar cells into the so-called ‘metacells’, based on archetype analysis implemented in the SEACells python package (Persad et al., 2023). We used SEAcells with parameters `n_SEACells = nc, n_waypoint_eigs = 15, convergence_epsilon = 1e-5`, where nc = 180 for whole-cell data and nc = 160 for the integrated 4–12 hpf data. Each metacell was then annotated with the mean latent time or pseudotime of underlying single-cells, and the max number of cells belonging to an annotated cell type or collection time (hpf). For analysis involving a comparison of H3K4me1 and H3K27me3, the underlying number of single cells was downsampled for each metacell, such that equal number of cells from both the histone modifications were assigned to each metacell, and only metacells with a minimum of 20 cells from both histone modifications were kept, to assure robust results.

Cell clustering using the ChIC signal

For the clustering of single cells based on ChIC signal, we performed latent semantic analysis (LSA) using the gensim package in python (Řehůřek and Sojka, 2010). The Cells*Regions sparse matrix was treated as a vector of documents (Cells), where region counts represent word frequency. The documents were then transformed with log term-frequency (TF), inverse document frequency (IDF) as follows:

t f_{t d} = 1 + {l o g}_{2} f_{i_{k}}

i d f (t, D) = {l o g}_{2} (\frac{N}{n_{k}})

T F - I D F (t, d, D) = t f_{t}, d * i d f_{t}, D

Where f_ik refers to the count frequency of a (50 kb) genomic bin T_k in a cell, D_i and nk refer to the number of cells containing non-zero counts for the bin. T_k The output is subjected to a pivoted unique normalization (Singhal et al., 2017) to take into account the difference in total number of detected regions per cell.

p i v o t e d n o r m = (1.0 - s l o p e) * p i v o t + s l o p e * T F - I D F (t, d, D)

In our case, we calculated pivot as the average number of non-zero bins across all cells, and fixed the slope to 0.25 (recommended by Singhal et al., 2017). The resulting matrix is subjected to a truncated SVD (singular value decomposition) (Halko et al., 2009), yielding Cell*Topic and Region*Topic matrices. Similar to scRNA-seq, we calculated 30 nearest neighbors using the 50 topics from the LSA output (dropping Topic-1, which strongly correlates with read depth), and used it to build the paga graph (with threshold = 0.1). UMAPs were initialized using the PAGA graph, with parameters `min_dist = 0.1, spread = 5`. Leiden clusters were calculated on the neighborhood graph with `resolution = 1.5`.

Peak calling and annotation

For the detection of regions with both narrow and broad enrichment in the genome, we used a two-step peak calling approach. We first pooled all our filtered cells from 4 to 24-hour time points into a BAM file and used histoneHMM (v1.7) function `call_regions` with parameters `-bs 750 P 0.1` (Heinig et al., 2015), and further removed the detected regions with average posterior probability <0.4, and referred to them as ‘domains’. Next, we performed peak-calling using MACS2 (v2.2.4) (Zhang et al., 2008) on the same file, with parameters `--mfold 0 50 --extsize 200 --broad --keep-dup all`, and overlapped these peaks with the domains detected from histoneHMM. Peaks overlapping with the histoneHMM domains, and having a local enrichment score ≥ 50, were referred to as ‘subpeaks’. For further analysis, we replaced the domains containing subpeaks with their respective subpeaks, resulting in 11221 enriched domains for H3K27me3 and 74,004 domains for H3K4me1.

For peak annotation, we used the genomicRanges R package (Lawrence et al., 2013) to classify these domains into ‘promoter’ (within +300 or –200 bases of a transcription start site), genic (overlapping a gene body, but not promoter), and ‘intergenic’ (outside promoters or gene body). The ‘genic’ domains were reclassified into ‘gene covering’ if they covered ≥ 80% of a gene. All domains were annotated with the gene(s) that overlapped with or (in the case of intergenic domains) were nearest to them. To detect which cis-regulatory elements are present inside these domains, we overlapped them with the location of ‘consensus PADREs’ annotated by the danio-code project (Baranasic et al., 2022). We used the gimmemotifs (van Heeringen and Veenstra, 2011) (v0.18.0), with parameters `scan -N 30` to annotate these peaks for the associated transcription factor binding sites (TFBS), using the `vertebrate.v5.0` motif database. The detected motifs were then filtered for zebrafish TF motifs that are also present in the SwissRegulon (dr11) database (Pachkov et al., 2007), resulting in 590 motifs belonging to 912 TFs.

H3K27me3 spreading and demethylation analysis

To detect the regions in the genome showing H3K27me3 cis-spreading, we used the table of 5 kb bin counts in single cells. We defined the ‘center’ bin as the bins showing non-zero counts in ≥ 5% of ‘pluripotent’ cells (784 bins), and ‘neighbor’ bins as the 10 up and downstream bins to the center bins. We then applied linear regression to predict the counts in ‘neighbor’ bins ( $\hat{Y}$ ) as a function of the sum of counts in the “center” bins ( $\hat{X}$ ) across metacells.

{\hat{Y}}_{i} = {\hat{β}}_{0} + {\hat{β}}_{1} X_{i} + {\hat{ϵ}}_{i}

To test if the spread is germ layer-specific, we compared this model to a second model including germ layer covariate ( ${\hat{X}}_{2}$ ) via a likelihood ratio test.

{\hat{Y}}_{i} = {\hat{β}}_{0} + {\hat{β}}_{1} X_{i} + {\hat{β}}_{2} X_{2} + {\hat{ϵ}}_{i}

Similarly, we used a linear regression model formulation above for the prediction of H3K27me3 silenced genes, where

\hat{Y} = l o g 2 ((s u m (c o u n t s_{u n}) * s u m (i n t r o n s_{u n}) / 1000))

and

\hat{X} = l o g 2 (c o u n t s_{k 27} * l e n g t h_{k 27} / 1000)

counts_un represent unspliced counts of genes overlapping a H3K27me3 peak, and introns_un is the length of their introns. counts_k₂₇ are ChIC counts on the H3K27me3 domain and length_k₂₇ is the length of the domain.

For the analysis of loss of H3K27me3 upon differentiation. We assigned cell types to metacells based on the highest proportion of cell labels in that group, then summed up the gene-level H3K27me3 counts. We then filtered the cell types with <3 metacells and used edgeR (Robinson et al., 2010) to perform a differential signal analysis between cell types using metacells as biological replicates. We used the following workflow: `filterByExpr(dge, min.count=5, min.total.count=20, min.prop=0.3); calcNormFactors(), estimateDisp(design), glmQLFit(dge, design)` with default parameters. Next, we filtered genes with `FDR <0.05, logFC < –1` to only retain genes with a loss of H3K27me3. We perform the same analysis for spliced and unspliced RNA counts for these genes (except filtering by logFC) to compare their results with that of H3K27me3 in the same cell types.

TF activity prediction and classification

To calculate the TF activity using H3K4me1 signal, we filtered our annotated H3K4me1 peaks (with assigned cPADREs) for peaks uniquely enriched for H3K4me1. Zebrafish TF motifs from the SwissRegulon (dr11) database (Pachkov et al., 2007) (590 motifs belonging to 912 TFs) were assigned to the peaks, based on motif match score inside the cPADREs within those peaks. Next, we obtained the H3K4me1 counts per peak per cell and assigned these counts to each TF motifs annotated with these peaks. Finally, we converted these raw counts into bias-corrected TF motif deviance scores per cell using chromVar (Schep et al., 2017). We also calculated deviance scores for metacells, using the aggregated counts per metacell, instead of single cells.

For TF activity prediction, we calculated the normalized H3K4me1 and H3K27me3 and (spliced) RNA counts per metacell, along with the metacell annotations (germ layer, latent time) and used them to predict the TF activity using lasso-penalized regression (Tibshirani, 1996). We first divided the data into a 70–30 (training/test) set and used the training set to tune the penalty parameter (λ) using grid search on 10-fold resamples. The top model was then run on each TF separately on the test set and evaluated based on R² estimates. The R² estimates were then compared to the permutation-based estimates to obtain a significance score (p-value) and adjusted for multiple testing via Benjamini–Hochberg (BH) correction to select top TFs for classification (Padj < 0.01). For the classification of TFs, we extracted the weights for the final model for each TF at the highest, λ , and interpreted them based on previous knowledge about these histone modifications. For example, since H3K4me1 activity represents active or poised enhancer, a positive correlation (weight >0) between a TF expression and H3K4me1 activity suggests its action as an activator, while a negative correlation (weight <0) suggests a repressor. Similarly, a non-zero weight for a TF’s H3K4me1 or H3K27me3 level would indicate that a TF activity is regulated by either, or both of the marks.

Code availability

The code for processing of T-ChIC data from raw (fastq) files up to count tables is available open source with GPLv3.0 license at https://github.com/bhardwaj-lab/scChICflow (Bhardwaj and Sancho Gómez, 2025). The config files containing all preprocessing parameters for scChICflow, together with scripts to reproduce our figures, are available open source with CC 4.0 license on Zenodo: https://doi.org/10.5281/zenodo.16813408.

Materials availability

Reagents, antibodies, and oligonucleotides used for our experiments are available from commercial providers (see our online protocol for a full list: https://dx.doi.org/10.17504/protocols.io.q26g7pbe8gwz/v2).

Acknowledgements

We acknowledge the Utrecht Sequencing Facility (USEQ) for providing sequencing service and data. USEQ is subsidized by the UMC Utrecht and The Netherlands X-omics Initiative (NWO project 184.034.019). We thank Reinier van der Linden (Hubrecht FACS facility) for performing single-cell sorting. This work was supported by European Research Council Advanced under grant ERC-AdG 101053581-scTranslatomics, and the NWO consortium grant OCENW.GROOT.2019.017 to AvO. The SNF (P2BSP3-174991), HFSP (LT000209/2018-L), and Marie Skłodowska-Curie Actions (798573) supported PZ. EMBO LTF (ALTF 1197–2019) supported VB.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Vivek Bhardwaj, Email: v.bhardwaj@uu.nl.

Alexander van Oudenaarden, Email: a.vanoudenaarden@hubrecht.eu.

H Efsun Arda, National Cancer Institute, United States.

Didier YR Stainier, Max Planck Institute for Heart and Lung Research, Germany.

Funding Information

This paper was supported by the following grants:

Nederlandse Organisatie voor Wetenschappelijk Onderzoek OCENW.GROOT.2019.017 to Alexander van Oudenaarden.
European Molecular Biology Organization ALTF 1197-2019 to Vivek Bhardwaj.
Human Frontier Science Program LT000209/2018-L to Peter Zeller.
H2020 Marie Skłodowska-Curie Actions 798573 to Peter Zeller.
European Research Council ERC-AdG 101053581 to Alexander van Oudenaarden.
Swiss National Science Foundation P2BSP3-174991 to Peter Zeller.

Additional information

Competing interests

No competing interests declared.

Author contributions

Conceptualization, Resources, Data curation, Software, Formal analysis, Supervision, Investigation, Visualization, Methodology, Writing – original draft, Project administration, Writing – review and editing.

Data curation, Validation, Investigation, Writing – original draft, Writing – review and editing.

Data curation, Investigation, Visualization, Writing – original draft.

Conceptualization, Resources, Methodology, Writing – original draft.

Resources, Supervision, Funding acquisition, Writing – original draft, Project administration, Writing – review and editing.

Additional files

Supplementary file 1. Number of cells acquired per timepoint, mark and batch.

elife-110400-supp1.xlsx^{(9.2KB, xlsx)}

Supplementary file 2. Comparison of median counts and detected genes per cell.

elife-110400-supp2.xlsx^{(9KB, xlsx)}

Supplementary file 3. Genes with differential H3K27me3 signal between selected 24hpf cell types and others.

elife-110400-supp3.xlsx^{(76.9KB, xlsx)}

Supplementary file 4. Classification of TFs based on the results of the penalized regression model.

elife-110400-supp4.xlsx^{(20.1KB, xlsx)}

MDAR checklist

elife-110400-mdarchecklist1.docx^{(87.8KB, docx)}

Data availability

Raw sequencing data (.fastq), count tables (.h5ad/anndata format) with gene and cell-level metadata (including annotations) from this study are publicly available at GEO (GSE265874). Additionally, the GEO repository also contains signal tracks (.bigwigs) and peaks (.bed) files. Other source data behind our figures are available on Zenodo: https://doi.org/10.5281/zenodo.16813408.

The following datasets were generated:

Bhardwaj V, Viñas Gaza H, Griffa A, Zeller P, van Oudenaarden A. 2025. Single-cell multi-omic dataset mapping chromatin modifications and transcriptome during zebrafish development. NCBI Gene Expression Omnibus. GSE265874

Bhardwaj V. 2025. Replication package for: Single-cell multi-omic analysis reveals principles of transcription-chromatin interaction during embryogenesis. Zenodo.

References

Abascal F, Acosta R, Addleman NJ, Adrian J, Afzal V, Ai R, Aken B, Akiyama JA, Jammal OA, Amrhein H, Anderson SM, Andrews GR, Antoshechkin I, Ardlie KG, Armstrong J, Astley M, Banerjee B, Barkal AA, Barnes IHA, Barozzi I, Barrell D, Barson G, Bates D, Baymuradov UK, Bazile C, Beer MA, Beik S, Bender MA, Bennett R, Bouvrette LPB, Bernstein BE, Berry A, Bhaskar A, Bignell A, Blue SM, Bodine DM, Boix C, Boley N, Borrman T, Borsari B, Boyle AP, Brandsmeier LA, Breschi A, Bresnick EH, Brooks JA, Buckley M, Burge CB, Byron R, Cahill E, Cai L, Cao L, Carty M, Castanon RG, Castillo A, Chaib H, Chan ET, Chee DR, Chee S, Chen H, Chen H, Chen J-Y, Chen S, Cherry JM, Chhetri SB, Choudhary JS, Chrast J, Chung D, Clarke D, Cody NAL, Coppola CJ, Coursen J, D’Ippolito AM, Dalton S, Danyko C, Davidson C, Davila-Velderrain J, Davis CA, Dekker J, Deran A, DeSalvo G, Despacio-Reyes G, Dewey CN, Dickel DE, Diegel M, Diekhans M, Dileep V, Ding B, Djebali S, Dobin A, Dominguez D, Donaldson S, Drenkow J, Dreszer TR, Drier Y, Duff MO, Dunn D, Eastman C, Ecker JR, Edwards MD, El-Ali N, Elhajjajy SI, Elkins K, Emili A, Epstein CB, Evans RC, Ezkurdia I, Fan K, Farnham PJ, Farrell NP, Feingold EA, Ferreira A-M, Fisher-Aylor K, Fitzgerald S, Flicek P, Foo CS, Fortier K, Frankish A, Freese P, Fu S, Fu X-D, Fu Y, Fukuda-Yuzawa Y, Fulciniti M, Funnell APW, Gabdank I, Galeev T, Gao M, Giron CG, Garvin TH, Gelboin-Burkhart CA, Georgolopoulos G, Gerstein MB, Giardine BM, Gifford DK, Gilbert DM, Gilchrist DA, Gillespie S, Gingeras TR, Gong P, Gonzalez A, Gonzalez JM, Good P, Goren A, Gorkin DU, Graveley BR, Gray M, Greenblatt JF, Griffiths E, Groudine MT, Grubert F, Gu M, Guigó R, Guo H, Guo Y, Guo Y, Gursoy G, Gutierrez-Arcelus M, Halow J, Hardison RC, Hardy M, Hariharan M, Harmanci A, Harrington A, Harrow JL, Hashimoto TB, Hasz RD, Hatan M, Haugen E, Hayes JE, He P, He Y, Heidari N, Hendrickson D, Heuston EF, Hilton JA, Hitz BC, Hochman A, Holgren C, Hou L, Hou S, Hsiao Y-HE, Hsu S, Huang H, Hubbard TJ, Huey J, Hughes TR, Hunt T, Ibarrientos S, Issner R, Iwata M, Izuogu O, Jaakkola T, Jameel N, Jansen C, Jiang L, Jiang P, Johnson A, Johnson R, Jungreis I, Kadaba M, Kasowski M, Kasparian M, Kato M, Kaul R, Kawli T, Kay M, Keen JC, Keles S, Keller CA, Kelley D, Kellis M, Kheradpour P, Kim DS, Kirilusha A, Klein RJ, Knoechel B, Kuan S, Kulik MJ, Kumar S, Kundaje A, Kutyavin T, Lagarde J, Lajoie BR, Lambert NJ, Lazar J, Lee AY, Lee D, Lee E, Lee JW, Lee K, Leslie CS, Levy S, Li B, Li H, Li N, Li S, Li X, Li YI, Li Y, Li Y, Li Y, Lian J, Libbrecht MW, Lin S, Lin Y, Liu D, Liu J, Liu P, Liu T, Liu XS, Liu Y, Liu Y, Long M, Lou S, Loveland J, Lu A, Lu Y, Lécuyer E, Ma L, Mackiewicz M, Mannion BJ, Mannstadt M, Manthravadi D, Marinov GK, Martin FJ, Mattei E, McCue K, McEown M, McVicker G, Meadows SK, Meissner A, Mendenhall EM, Messer CL, Meuleman W, Meyer C, Miller S, Milton MG, Mishra T, Moore DE, Moore HM, Moore JE, Moore SH, Moran J, Mortazavi A, Mudge JM, Munshi N, Murad R, Myers RM, Nandakumar V, Nandi P, Narasimha AM, Narayanan AK, Naughton H, Navarro FCP, Navas P, Nazarovs J, Nelson J, Neph S, Neri FJ, Nery JR, Nesmith AR, Newberry JS, Newberry KM, Ngo V, Nguyen R, Nguyen TB, Nguyen T, Nishida A, Noble WS, Novak CS, Novoa EM, Nuñez B, O’Donnell CW, Olson S, Onate KC, Otterman E, Ozadam H, Pagan M, Palden T, Pan X, Park Y, Partridge EC, Paten B, Pauli-Behn F, Pazin MJ, Pei B, Pennacchio LA, Perez AR, Perry EH, Pervouchine DD, Phalke NN, Pham Q, Phanstiel DH, Plajzer-Frick I, Pratt GA, Pratt HE, Preissl S, Pritchard JK, Pritykin Y, Purcaro MJ, Qin Q, Quinones-Valdez G, Rabano I, Radovani E, Raj A, Rajagopal N, Ram O, Ramirez L, Ramirez RN, Rausch D, Raychaudhuri S, Raymond J, Razavi R, Reddy TE, Reimonn TM, Ren B, Reymond A, Reynolds A, Rhie SK, Rinn J, Rivera M, Rivera-Mulia JC, Roberts BS, Rodriguez JM, Rozowsky J, Ryan R, Rynes E, Salins DN, Sandstrom R, Sasaki T, Sathe S, Savic D, Scavelli A, Scheiman J, Schlaffner C, Schloss JA, Schmitges FW, See LH, Sethi A, Setty M, Shafer A, Shan S, Sharon E, Shen Q, Shen Y, Sherwood RI, Shi M, Shin S, Shoresh N, Siebenthall K, Sisu C, Slifer T, Sloan CA, Smith A, Snetkova V, Snyder MP, Spacek DV, Srinivasan S, Srivas R, Stamatoyannopoulos G, Stamatoyannopoulos JA, Stanton R, Steffan D, Stehling-Sun S, Strattan JS, Su A, Sundararaman B, Suner M-M, Syed T, Szynkarek M, Tanaka FY, Tenen D, Teng M, Thomas JA, Toffey D, Tress ML, Trout DE, Trynka G, Tsuji J, Upchurch SA, Ursu O, Uszczynska-Ratajczak B, Uziel MC, Valencia A, Biber BV, van der Velde AG, Van Nostrand EL, Vaydylevich Y, Vazquez J, Victorsen A, Vielmetter J, Vierstra J, Visel A, Vlasova A, Vockley CM, Volpi S, Vong S, Wang H, Wang M, Wang Q, Wang R, Wang T, Wang W, Wang X, Wang Y, Watson NK, Wei X, Wei Z, Weisser H, Weissman SM, Welch R, Welikson RE, Weng Z, Westra H-J, Whitaker JW, White C, White KP, Wildberg A, Williams BA, Wine D, Witt HN, Wold B, Wolf M, Wright J, Xiao R, Xiao X, Xu J, Xu J, Yan K-K, Yan Y, Yang H, Yang X, Yang Y-W, Yardımcı GG, Yee BA, Yeo GW, Young T, Yu T, Yue F, Zaleski C, Zang C, Zeng H, Zeng W, Zerbino DR, Zhai J, Zhan L, Zhan Y, Zhang B, Zhang J, Zhang J, Zhang K, Zhang L, Zhang P, Zhang Q, Zhang X-O, Zhang Y, Zhang Z, Zhao Y, Zheng Y, Zhong G, Zhou X-Q, Zhu Y, Zimmerman J, Moore JE, Purcaro MJ, Pratt HE, Epstein CB, Shoresh N, Adrian J, Kawli T, Davis CA, Dobin A, Kaul R, Halow J, Van Nostrand EL, Freese P, Gorkin DU, Shen Y, He Y, Mackiewicz M, Pauli-Behn F, Williams BA, Mortazavi A, Keller CA, Zhang X-O, Elhajjajy SI, Huey J, Dickel DE, Snetkova V, Wei X, Wang X, Rivera-Mulia JC, Rozowsky J, Zhang J, Chhetri SB, Zhang J, Victorsen A, White KP, Visel A, Yeo GW, Burge CB, Lécuyer E, Gilbert DM, Dekker J, Rinn J, Mendenhall EM, Ecker JR, Kellis M, Klein RJ, Noble WS, Kundaje A, Guigó R, Farnham PJ, Cherry JM, Myers RM, Ren B, Graveley BR, Gerstein MB, Pennacchio LA, Snyder MP, Bernstein BE, Wold B, Hardison RC, Gingeras TR, Stamatoyannopoulos JA, Weng Z, The ENCODE Project Consortium Expanded encyclopaedias of DNA elements in the human and mouse genomes. Nature. 2020;583:699–710. doi: 10.1038/s41586-020-2493-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ahlmann-Eltze C, Huber W. Comparison of transformations for single-cell RNA-seq data. Nature Methods. 2023;20:665–672. doi: 10.1038/s41592-023-01814-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
Akdogan-Ozdilek B, Duval KL, Meng FW, Murphy PJ, Goll MG. Identification of chromatin states during zebrafish gastrulation using CUT&RUN and CUT&Tag. Developmental Dynamics. 2022;251:729–742. doi: 10.1002/dvdy.430. [DOI] [PMC free article] [PubMed] [Google Scholar]
Argelaguet R, Clark SJ, Mohammed H, Stapel LC, Krueger C, Kapourani CA, Imaz-Rosshandler I, Lohoff T, Xiang Y, Hanna CW, Smallwood S, Ibarra-Soria X, Buettner F, Sanguinetti G, Xie W, Krueger F, Göttgens B, Rugg-Gunn PJ, Kelsey G, Dean W, Nichols J, Stegle O, Marioni JC, Reik W. Multi-omics profiling of mouse gastrulation at single-cell resolution. Nature. 2019;576:487–491. doi: 10.1038/s41586-019-1825-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
Bai D, Zhang X, Xiang H, Guo Z, Zhu C, Yi C. Simultaneous single-cell analysis of 5mC and 5hmC with SIMPLE-seq. Nature Biotechnology. 2025;43:85–96. doi: 10.1038/s41587-024-02148-9. [DOI] [PubMed] [Google Scholar]
Baranasic D, Hörtenhuber M, Balwierz PJ, Zehnder T, Mukarram AK, Nepal C, Várnai C, Hadzhiev Y, Jimenez-Gonzalez A, Li N, Wragg J, D’Orazio FM, Relic D, Pachkov M, Díaz N, Hernández-Rodríguez B, Chen Z, Stoiber M, Dong M, Stevens I, Ross SE, Eagle A, Martin R, Obasaju O, Rastegar S, McGarvey AC, Kopp W, Chambers E, Wang D, Kim HR, Acemel RD, Naranjo S, Łapiński M, Chong V, Mathavan S, Peers B, Sauka-Spengler T, Vingron M, Carninci P, Ohler U, Lacadie SA, Burgess SM, Winata C, van Eeden F, Vaquerizas JM, Gómez-Skarmeta JL, Onichtchouk D, Brown BJ, Bogdanovic O, van Nimwegen E, Westerfield M, Wardle FC, Daub CO, Lenhard B, Müller F. Multiomic atlas with functional stratification and developmental dynamics of zebrafish cis-regulatory elements. Nature Genetics. 2022;54:1037–1050. doi: 10.1038/s41588-022-01089-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
Bartosovic M, Kabbe M, Castelo-Branco G. Single-cell CUT&Tag profiles histone modifications and transcription factors in complex tissues. Nature Biotechnology. 2021;39:825–835. doi: 10.1038/s41587-021-00869-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
Bergen V, Lange M, Peidli S, Wolf FA, Theis FJ. Generalizing RNA velocity to transient cell states through dynamical modeling. Nature Biotechnology. 2020;38:1408–1414. doi: 10.1038/s41587-020-0591-3. [DOI] [PubMed] [Google Scholar]
Bhardwaj V, Heyne S, Sikora K, Rabbani L, Rauer M, Kilpert F, Richter AS, Ryan DP, Manke T. snakePipes: facilitating flexible, scalable and integrative epigenomic analysis. Bioinformatics. 2019;35:4757–4759. doi: 10.1093/bioinformatics/btz436. [DOI] [PMC free article] [PubMed] [Google Scholar]
Bhardwaj V, Mourragui S. User-friendly exploration of epigenomic data in single cells using sincei. bioRxiv. 2024 doi: 10.1101/2024.07.27.605424. [DOI]
Bhardwaj V, Sancho Gómez F. Software Heritage; 2025. https://archive.softwareheritage.org/swh:1:dir:13956aa2f66cb18a5f271d4bbb43d513b1f67003;origin=https://github.com/bhardwaj-lab/scChICflow;visit=swh:1:snp:3ec092121cb0f6fdf279408aff4ce8cf1f92312e;anchor=swh:1:rev:b61bcd4783530f0fb6d27f99bc7eb18e2d78e4ee [Google Scholar]
Bogdanović O, van Heeringen SJ, Veenstra GJC. The epigenome in early vertebrate development. Genesis. 2012;50:192–206. doi: 10.1002/dvg.20831. [DOI] [PMC free article] [PubMed] [Google Scholar]
Bradford YM, Van Slyke CE, Ruzicka L, Singer A, Eagle A, Fashena D, Howe DG, Frazer K, Martin R, Paddock H, Pich C, Ramachandran S, Westerfield M. Zebrafish information network, the knowledgebase for Danio rerio research. Genetics. 2022;220:iyac016. doi: 10.1093/genetics/iyac016. [DOI] [PMC free article] [PubMed] [Google Scholar]
Cheung P, Vallania F, Warsinske HC, Donato M, Schaffert S, Chang SE, Dvorak M, Dekker CL, Davis MM, Utz PJ, Khatri P, Kuo AJ. Single-cell chromatin modification profiling reveals increased epigenetic variations with aging. Cell. 2018;173:1385–1397. doi: 10.1016/j.cell.2018.03.079. [DOI] [PMC free article] [PubMed] [Google Scholar]
Clark SJ, Argelaguet R, Lohoff T, Krueger F, Drage D, Göttgens B, Marioni JC, Nichols J, Reik W. Single-cell multi-omics profiling links dynamic DNA methylation to cell fate decisions during mouse early organogenesis. Genome Biology. 2022;23:202. doi: 10.1186/s13059-022-02762-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
Cunningham F, Allen JE, Allen J, Alvarez-Jarreta J, Amode MR, Armean IM, Austine-Orimoloye O, Azov AG, Barnes I, Bennett R, Berry A, Bhai J, Bignell A, Billis K, Boddu S, Brooks L, Charkhchi M, Cummins C, Rin Fioretto L, Davidson C, Dodiya K, Donaldson S, El Houdaigui B, El Naboulsi T, Fatima R, Giron CG, Genez T, Martinez JG, Guijarro-Clarke C, Gymer A, Hardy M, Hollis Z, Hourlier T, Hunt T, Juettemann T, Kaikala V, Kay M, Lavidas I, Le T, Lemos D, Marugán JC, Mohanan S, Mushtaq A, Naven M, Ogeh DN, Parker A, Parton A, Perry M, Piližota I, Prosovetskaia I, Sakthivel MP, Salam AIA, Schmitt BM, Schuilenburg H, Sheppard D, Pérez-Silva JG, Stark W, Steed E, Sutinen K, Sukumaran R, Sumathipala D, Suner MM, Szpak M, Thormann A, Tricomi FF, Urbina-Gómez D, Veidenberg A, Walsh TA, Walts B, Willhoft N, Winterbottom A, Wass E, Chakiachvili M, Flint B, Frankish A, Giorgetti S, Haggerty L, Hunt SE, IIsley GR, Loveland JE, Martin FJ, Moore B, Mudge JM, Muffato M, Perry E, Ruffier M, Tate J, Thybert D, Trevanion SJ, Dyer S, Harrison PW, Howe KL, Yates AD, Zerbino DR, Flicek P. Ensembl 2022. Nucleic Acids Res. 2022;50:D988–D995. doi: 10.1093/nar/gkab1049. [DOI] [PMC free article] [PubMed] [Google Scholar]
de la Calle Mustienes E, Gómez-Skarmeta JL, Bogdanović O. Genome-wide epigenetic cross-talk between DNA methylation and H3K27me3 in zebrafish embryos. Genomics Data. 2015;6:7–9. doi: 10.1016/j.gdata.2015.07.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, Batut P, Chaisson M, Gingeras TR. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29:15–21. doi: 10.1093/bioinformatics/bts635. [DOI] [PMC free article] [PubMed] [Google Scholar]
Dooley CM, Wali N, Sealy IM, White RJ, Stemple DL, Collins JE, Busch-Nentwich EM. The gene regulatory basis of genetic compensation during neural crest induction. PLOS Genetics. 2019;15:e1008213. doi: 10.1371/journal.pgen.1008213. [DOI] [PMC free article] [PubMed] [Google Scholar]
Esterberg R, Fritz A. dlx3b/4b are required for the formation of the preplacodal region and otic placode through local modulation of BMP activity. Developmental Biology. 2009;325:189–199. doi: 10.1016/j.ydbio.2008.10.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
Farrell JA, Wang Y, Riesenfeld SJ, Shekhar K, Regev A, Schier AF. Single-cell reconstruction of developmental trajectories during zebrafish embryogenesis. Science. 2018;360:eaar3131. doi: 10.1126/science.aar3131. [DOI] [PMC free article] [PubMed] [Google Scholar]
Fishman L, Nechooshtan G, Erhard F, Regev A, Farrell JA, Rabani M. Single-cell temporal dynamics reveals the relative contributions of transcription and degradation to cell-type specific gene expression in zebrafish embryos. bioRxiv. 2023 doi: 10.1101/2023.04.20.537620. [DOI]
Fitz-James MH, Cavalli G. Molecular mechanisms of transgenerational epigenetic inheritance. Nature Reviews. Genetics. 2022;23:325–341. doi: 10.1038/s41576-021-00438-5. [DOI] [PubMed] [Google Scholar]
Fu M, Pang L, Wu Z, Wang M, Jin J, Ai S, Li X. Single-cell multi-omics delineates the dynamics of distinct epigenetic codes coordinating mouse gastrulation. BMC Genomics. 2025;26:454. doi: 10.1186/s12864-025-11619-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
Gheldof A, Hulpiau P, van Roy F, De Craene B, Berx G. Evolutionary functional analysis and molecular regulation of the ZEB transcription factors. Cellular and Molecular Life Sciences. 2012;69:2527–2541. doi: 10.1007/s00018-012-0935-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
Giraldez AJ, Mishima Y, Rihel J, Grocock RJ, Van Dongen S, Inoue K, Enright AJ, Schier AF. Zebrafish MiR-430 promotes deadenylation and clearance of maternal mRNAs. Science. 2006;312:75–79. doi: 10.1126/science.1122689. [DOI] [PubMed] [Google Scholar]
Gourishetti K, Balaji Easwaran V, Mostakim Y, Ranganath Pai KS, Bhere D. MicroRNA (miR)-124: a promising therapeutic gateway for oncology. Biology. 2023;12:922. doi: 10.3390/biology12070922. [DOI] [PMC free article] [PubMed] [Google Scholar]
Guo F, Li L, Li J, Wu X, Hu B, Zhu P, Wen L, Tang F. Single-cell multi-omics sequencing of mouse early embryos and embryonic stem cells. Cell Research. 2017;27:967–988. doi: 10.1038/cr.2017.82. [DOI] [PMC free article] [PubMed] [Google Scholar]
Guo Y, Zhao S, Wang GG. Polycomb gene silencing mechanisms: PRC2 chromatin targeting, H3K27me3 “readout”, and phase separation-based compaction. Trends in Genetics. 2021;37:547–565. doi: 10.1016/j.tig.2020.12.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
Halko N, Martinsson PG, Tropp JA. Finding structure with randomness: Probabilistic algorithms for constructing approximate matrix decompositions. arXiv. 2009 https://arxiv.org/abs/0909.4061
Heinig M, Colomé-Tatché M, Taudt A, Rintisch C, Schafer S, Pravenec M, Hubner N, Vingron M, Johannes F. histoneHMM: Differential analysis of histone modifications with broad genomic footprints. BMC Bioinformatics. 2015;16:60. doi: 10.1186/s12859-015-0491-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
Hickey GJ, Wike CL, Nie X, Guo Y, Tan M, Murphy PJ, Cairns BR. Establishment of developmental gene silencing by ordered polycomb complex recruitment in early zebrafish embryos. eLife. 2022;11:e67738. doi: 10.7554/eLife.67738. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kaaij LJT, Mokry M, Zhou M, Musheev M, Geeven G, Melquiond ASJ, de Jesus Domingues AM, de Laat W, Niehrs C, Smith AD, Ketting RF. Enhancers reside in a unique epigenetic environment during early zebrafish development. Genome Biology. 2016;17:146. doi: 10.1186/s13059-016-1013-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kim D, Langmead B, Salzberg S. HISAT2: graph-based alignment of next-generation sequencing reads to a population of genomes. 2.1.0Github. 2017 https://daehwankimlab.github.io/hisat2/
Korsunsky I, Millard N, Fan J, Slowikowski K, Zhang F, Wei K, Baglaenko Y, Brenner M, Loh PR, Raychaudhuri S. Fast, sensitive and accurate integration of single-cell data with Harmony. Nature Methods. 2019;16:1289–1296. doi: 10.1038/s41592-019-0619-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kundaje A, Meuleman W, Ernst J, Bilenky M, Yen A, Heravi-Moussavi A, Kheradpour P, Zhang Z, Wang J, Ziller MJ, Amin V, Whitaker JW, Schultz MD, Ward LD, Sarkar A, Quon G, Sandstrom RS, Eaton ML, Wu Y-C, Pfenning AR, Wang X, Claussnitzer M, Liu Y, Coarfa C, Harris RA, Shoresh N, Epstein CB, Gjoneska E, Leung D, Xie W, Hawkins RD, Lister R, Hong C, Gascard P, Mungall AJ, Moore R, Chuah E, Tam A, Canfield TK, Hansen RS, Kaul R, Sabo PJ, Bansal MS, Carles A, Dixon JR, Farh K-H, Feizi S, Karlic R, Kim A-R, Kulkarni A, Li D, Lowdon R, Elliott G, Mercer TR, Neph SJ, Onuchic V, Polak P, Rajagopal N, Ray P, Sallari RC, Siebenthall KT, Sinnott-Armstrong NA, Stevens M, Thurman RE, Wu J, Zhang B, Zhou X, Beaudet AE, Boyer LA, De Jager PL, Farnham PJ, Fisher SJ, Haussler D, Jones SJM, Li W, Marra MA, McManus MT, Sunyaev S, Thomson JA, Tlsty TD, Tsai L-H, Wang W, Waterland RA, Zhang MQ, Chadwick LH, Bernstein BE, Costello JF, Ecker JR, Hirst M, Meissner A, Milosavljevic A, Ren B, Stamatoyannopoulos JA, Wang T, Kellis M, Roadmap Epigenomics Consortium Integrative analysis of 111 reference human epigenomes. Nature. 2015;518:317–330. doi: 10.1038/nature14248. [DOI] [PMC free article] [PubMed] [Google Scholar]
La Manno G, Soldatov R, Zeisel A, Braun E, Hochgerner H, Petukhov V, Lidschreiber K, Kastriti ME, Lönnerberg P, Furlan A, Fan J, Borm LE, Liu Z, van Bruggen D, Guo J, He X, Barker R, Sundström E, Castelo-Branco G, Cramer P, Adameyko I, Linnarsson S, Kharchenko PV. RNA velocity of single cells. Nature. 2018;560:494–498. doi: 10.1038/s41586-018-0414-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
Lange M, Bergen V, Klein M, Setty M, Reuter B, Bakhti M, Lickert H, Ansari M, Schniering J, Schiller HB, Pe’er D, Theis FJ. CellRank for directed single-cell fate mapping. Nature Methods. 2022;19:159–170. doi: 10.1038/s41592-021-01346-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
Lause J, Berens P, Kobak D. Analytic Pearson residuals for normalization of single-cell RNA-seq UMI data. Genome Biology. 2021;22:258. doi: 10.1186/s13059-021-02451-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
Lawrence M, Huber W, Pagès H, Aboyoun P, Carlson M, Gentleman R, Morgan MT, Carey VJ. Software for computing and annotating genomic ranges. PLOS Computational Biology. 2013;9:e1003118. doi: 10.1371/journal.pcbi.1003118. [DOI] [PMC free article] [PubMed] [Google Scholar]
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, 1000 Genome Project Data Processing Subgroup The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
Liu Y, Zhu Z, Ho IHT, Shi Y, Li J, Wang X, Chan MTV, Cheng CHK. Genetic deletion of miR-430 disrupts maternal-zygotic transition and embryonic body plan. Frontiers in Genetics. 2020;11:853. doi: 10.3389/fgene.2020.00853. [DOI] [PMC free article] [PubMed] [Google Scholar]
Liu H, Zhou J, Tian W, Luo C, Bartlett A, Aldridge A, Lucero J, Osteen JK, Nery JR, Chen H, Rivkin A, Castanon RG, Clock B, Li YE, Hou X, Poirion OB, Preissl S, Pinto-Duarte A, O’Connor C, Boggeman L, Fitzpatrick C, Nunn M, Mukamel EA, Zhang Z, Callaway EM, Ren B, Dixon JR, Behrens MM, Ecker JR. DNA methylation atlas of the mouse brain at single-cell resolution. Nature. 2021;598:120–128. doi: 10.1038/s41586-020-03182-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
Liu M, Yue Y, Chen X, Xian K, Dong C, Shi M, Xiong H, Tian K, Li Y, Zhang QC, He A. Genome-coverage single-cell histone modifications for embryo lineage tracing. Nature. 2025;640:828–839. doi: 10.1038/s41586-025-08656-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet.Journal. 2011;17:10. doi: 10.14806/ej.17.1.200. [DOI] [Google Scholar]
Murphy PJ, Wu SF, James CR, Wike CL, Cairns BR. Placeholder nucleosomes underlie germline-to-embryo DNA methylation reprogramming. Cell. 2018;172:993–1006. doi: 10.1016/j.cell.2018.01.022. [DOI] [PubMed] [Google Scholar]
Nichols RV, O’Connell BL, Mulqueen RM, Thomas J, Woodfin AR, Acharya S, Mandel G, Pokholok D, Steemers FJ, Adey AC. High-throughput robust single-cell DNA methylation profiling with sciMETv2. Nature Communications. 2022;13:7627. doi: 10.1038/s41467-022-35374-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
Pachkov M, Erb I, Molina N, van Nimwegen E. SwissRegulon: a database of genome-wide annotations of regulatory sites. Nucleic Acids Research. 2007;35:D127–D131. doi: 10.1093/nar/gkl857. [DOI] [PMC free article] [PubMed] [Google Scholar]
Payumo AY, McQuade LE, Walker WJ, Yamazoe S, Chen JK. Tbx16 regulates hox gene activation in mesodermal progenitor cells. Nature Chemical Biology. 2016;12:694–701. doi: 10.1038/nchembio.2124. [DOI] [PMC free article] [PubMed] [Google Scholar]
Persad S, Choo Z-N, Dien C, Sohail N, Masilionis I, Chaligné R, Nawy T, Brown CC, Sharma R, Pe’er I, Setty M, Pe’er D. SEACells infers transcriptional and epigenomic cellular states from single-cell genomics data. Nature Biotechnology. 2023;41:1746–1757. doi: 10.1038/s41587-023-01716-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ramírez F, Ryan DP, Grüning B, Bhardwaj V, Kilpert F, Richter AS, Heyne S, Dündar F, Manke T. deepTools2: a next generation web server for deep-sequencing data analysis. Nucleic Acids Research. 2016;44:W160–W165. doi: 10.1093/nar/gkw257. [DOI] [PMC free article] [PubMed] [Google Scholar]
Řehůřek R, Sojka P. Software framework for topic modelling with large corpora. Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks. ELRA; 2010. pp. 45–50. [DOI] [Google Scholar]
Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26:139–140. doi: 10.1093/bioinformatics/btp616. [DOI] [PMC free article] [PubMed] [Google Scholar]
Salmen F, De Jonghe J, Kaminski TS, Alemany A, Parada GE, Verity-Legg J, Yanagida A, Kohler TN, Battich N, van den Brekel F, Ellermann AL, Arias AM, Nichols J, Hemberg M, Hollfelder F, van Oudenaarden A. High-throughput total RNA sequencing in single cells using VASA-seq. Nature Biotechnology. 2022;40:1780–1793. doi: 10.1038/s41587-022-01361-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
San B, Chrispijn ND, Wittkopp N, van Heeringen SJ, Lagendijk AK, Aben M, Bakkers J, Ketting RF, Kamminga LM. Normal formation of a vertebrate body plan and loss of tissue maintenance in the absence of ezh2. Scientific Reports. 2016;6:24658. doi: 10.1038/srep24658. [DOI] [PMC free article] [PubMed] [Google Scholar]
Schep AN, Wu B, Buenrostro JD, Greenleaf WJ. chromVAR: inferring transcription-factor-associated accessibility from single-cell epigenomic data. Nature Methods. 2017;14:975–978. doi: 10.1038/nmeth.4401. [DOI] [PMC free article] [PubMed] [Google Scholar]
Schmid M, Durussel T, Laemmli UK. ChIC and ChEC; genomic mapping of chromatin proteins. Molecular Cell. 2004;16:147–157. doi: 10.1016/j.molcel.2004.09.007. [DOI] [PubMed] [Google Scholar]
Sedykh I, Keller AN, Yoon B, Roberson L, Moskvin OV, Grinblat Y. Zebrafish Rfx4 controls dorsal and ventral midline formation in the neural tube. Developmental Dynamics. 2018;247:650–659. doi: 10.1002/dvdy.24613. [DOI] [PMC free article] [PubMed] [Google Scholar]
Singhal A, Buckley C, Mitra M. Pivoted Document Length Normalization. ACM SIGIR Forum. 2017;51:176–184. doi: 10.1145/3130348.3130365. [DOI] [Google Scholar]
Smith T, Heger A, Sudbery I. UMI-tools: modeling sequencing errors in unique molecular identifiers to improve quantification accuracy. Genome Research. 2017;27:491–499. doi: 10.1101/gr.209601.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
Tibshirani R. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society Series B. 1996;58:267–288. doi: 10.1111/j.2517-6161.1996.tb02080.x. [DOI] [Google Scholar]
van Heeringen SJ, Veenstra GJC. GimmeMotifs: a de novo motif prediction pipeline for ChIP-sequencing experiments. Bioinformatics. 2011;27:270–271. doi: 10.1093/bioinformatics/btq636. [DOI] [PMC free article] [PubMed] [Google Scholar]
Vastenhouw NL, Zhang Y, Woods IG, Imam F, Regev A, Liu XS, Rinn J, Schier AF. Chromatin signature of embryonic pluripotency is established during genome activation. Nature. 2010;464:922–926. doi: 10.1038/nature08866. [DOI] [PMC free article] [PubMed] [Google Scholar]
Veronezi GMB, Ramachandran S. Nucleation and spreading maintain Polycomb domains every cell cycle. Cell Reports. 2024;43:114090. doi: 10.1016/j.celrep.2024.114090. [DOI] [PMC free article] [PubMed] [Google Scholar]
Wagner DE, Weinreb C, Collins ZM, Briggs JA, Megason SG, Klein AM. Single-cell mapping of gene expression landscapes and lineage in the zebrafish embryo. Science. 2018;360:981–987. doi: 10.1126/science.aar4362. [DOI] [PMC free article] [PubMed] [Google Scholar]
Wolf FA, Angerer P, Theis FJ. SCANPY: large-scale single-cell gene expression data analysis. Genome Biology. 2018;19:15. doi: 10.1186/s13059-017-1382-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
Wolf FA, Hamey FK, Plass M, Solana J, Dahlin JS, Göttgens B, Rajewsky N, Simon L, Theis FJ. PAGA: graph abstraction reconciles clustering with trajectory inference through a topology preserving map of single cells. Genome Biology. 2019;20:59. doi: 10.1186/s13059-019-1663-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
Wu SJ, Furlan SN, Mihalas AB, Kaya-Okur HS, Feroze AH, Emerson SN, Zheng Y, Carson K, Cimino PJ, Keene CD, Sarthy JF, Gottardo R, Ahmad K, Henikoff S, Patel AP. Single-cell CUT&Tag analysis of chromatin modifications in differentiation and tumor progression. Nature Biotechnology. 2021;39:819–824. doi: 10.1038/s41587-021-00865-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
Xiang Y, Zhang Y, Xu Q, Zhou C, Liu B, Du Z, Zhang K, Zhang B, Wang X, Gayen S, Liu L, Wang Y, Li Y, Wang Q, Kalantry S, Li L, Xie W. Epigenomic analysis of gastrulation identifies a unique chromatin state for primed pluripotency. Nature Genetics. 2020;52:95–105. doi: 10.1038/s41588-019-0545-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
Xu PF, Houssin N, Ferri-Lagneau KF, Thisse B, Thisse C. Construction of a vertebrate embryo from two opposing morphogen gradients. Science. 2014;344:87–89. doi: 10.1126/science.1248252. [DOI] [PubMed] [Google Scholar]
Yette GA, Stewart S, Stankunas K. Zebrafish polycomb repressive complex-2 critical roles are largely Ezh2- over Ezh1-driven and concentrate during early embryogenesis. bioRxiv. 2021 doi: 10.1101/2020.12.31.424918. [DOI]
Zeller P, Yeung J, Viñas Gaza H, de Barbanson BA, Bhardwaj V, Florescu M, van der Linden R, van Oudenaarden A. Single-cell sortChIC identifies hierarchical chromatin dynamics during hematopoiesis. Nature Genetics. 2023;55:333–345. doi: 10.1038/s41588-022-01260-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
Zeller P, Blotenburg M, Bhardwaj V, Barbanson BA, Salmén F, Oudenaarden A. T-ChIC: multi-omic detection of histone modifications and full-length transcriptomes in the same single cell. bioRxiv. 2024 doi: 10.1101/2024.05.09.593364. [DOI]
Zhang Y, Liu T, Meyer CA, Eeckhoute J, Johnson DS, Bernstein BE, Nusbaum C, Myers RM, Brown M, Li W, Liu XS. Model-based analysis of chIP-seq (MACS) Genome Biology. 2008;9:R137. doi: 10.1186/gb-2008-9-9-r137. [DOI] [PMC free article] [PubMed] [Google Scholar]
Zhao C, Biondic S, Vandal K, Björklund ÅK, Hagemann-Jensen M, Sommer TM, Canizo J, Clark S, Raymond P, Zenklusen DR, Rivron N, Reik W, Petropoulos S. Single-cell multi-omics of human preimplantation embryos shows susceptibility to glucocorticoids. Genome Research. 2022;32:1627–1641. doi: 10.1101/gr.276665.122. [DOI] [PMC free article] [PubMed] [Google Scholar]

eLife. doi: 10.7554/eLife.110400.2.sa0

eLife Assessment

H Efsun Arda ¹

In this valuable study, the authors examine transcription and chromatin dynamics during early zebrafish development by simultaneously profiling histone modifications and full-length transcriptomes in thousands of single cells, providing solid analysis that chromatin and transcriptional states are initially weakly correlated in early embryonic cells and become progressively more aligned as differentiation proceeds. The work also supports a model in which promoter-anchored cis-spreading of H3K27me3 contributes to stable gene silencing during development. Future functional perturbations and orthogonal validations will be needed to determine the causal contribution of Polycomb spreading to fate commitment. Overall, the dataset and accompanying analyses provide a robust resource and a quantitative framework for studying chromatin-transcription relationships during vertebrate embryogenesis.

[Editors note: this paper was reviewed by Review Commons.]

eLife. doi: 10.7554/eLife.110400.2.sa1

Reviewer #1 (Public review):

Anonymous

This manuscript presents a comprehensive and technically impressive study investigating the interplay between active (H3K4me1) and silencing (H3K27me3) chromatin states and gene expression during early zebrafish development. By applying an optimized single-cell multi-omics method (whole-organism T-ChIC) to profile histone modifications and transcriptomes simultaneously in thousands of cells from 4 to 24 hours post-fertilization, the work addresses a significant gap in understanding how epigenetic states are established and propagated during vertebrate embryogenesis.

There are several obvious strengths:

(1) Innovative Methodology: The adaptation and application of the T-ChIC protocol to a whole-organism, multiplexed time-course design is a major technical achievement. The generation of a high-quality, paired chromatin (H3K27me3 and H3K4me1) and full-length transcriptome dataset from the same single cells is a powerful resource for the field.

(2) Novel Biological Insights:

(2.1) It provides single-cell evidence for the promoter-anchored cis-spreading of H3K27me3 as a mechanism for gene silencing during differentiation, a process that appears largely lineage-agnostic.

(2.2) It demonstrates that global chromatin states (both active and repressive) are initially decoupled from transcriptional output in pluripotent cells and become correlated as cells mature, suggesting this coupling is a hallmark of identity formation.

(2.3) It develops a predictive model using TF expression and the H3K4me1 state at TF binding sites to infer lineage-specific activator/repressor functions and epigenetic regulation of TFs themselves, revealing novel roles for factors like zbtb16a and zeb1a.

There are also several weaknesses for further clarification:

(1) The study focuses on H3K27me3 and H3K4me1. Why these two specific histone modifications were chosen as the primary focus for this study on early fate commitment?

(2) There are some similar single-cell techniques available (histone modifications and transcription from the same single cell), what is the performance of T-ChIC when comparing to other methods？

Comments on revised version:

Other histone modifications and TFs, or even DNA methylation could be tested to see the robustness of T-ChIC.

eLife. doi: 10.7554/eLife.110400.2.sa2

Reviewer #2 (Public review):

Anonymous

Summary:

Joint analysis of multiple modalities in single cells will provide a comprehensive view of cell fate states. In this manuscript, Bhardwaj et al developed a single-cell multi-omics assay, T-ChIC, to simultaneously capture histone modifications and the full-length transcriptome and applied the method to early embryos of zebrafish. The authors observed a decoupled relationship between the chromatin modifications and gene expression at early developmental stages. The correlation becomes stronger as development proceeds, as genes are silenced by the cis-spreading of the repressive marker H3k27me3. Overall, the work is well performed, and the results are meaningful and interesting to readers in the epigenomic and embryonic development fields.

Strengths:

This work utilized a new single-cell multi-omics method and generated abundant epigenomics and transcriptomics datasets for cells covering multiple key developmental stages of zebrafish.

Weaknesses:

The data analysis was superficial and mainly focused on the correspondence between the two modalities. The discussion of developmental biology was limited.

Overall, the T-ChIC method is efficient and user-friendly, and the single-cell datasets for zebrafish early development are also valuable. Audiences in the field of epigenomic and embryonic development will benefit from this work.

Comments on revised version:

The authors have answered my previous concerns.

eLife. 2026 Apr 21;15:RP110400. doi: 10.7554/eLife.110400.2.sa3

Author response

Vivek Bhardwaj, Alberto Griffa, Helena Viñas Gaza, Peter Zeller, Alexander van Oudenaarden

General Statements

We thank all three reviewers for their time taken to provide valuable feedback on our manuscript, and for appreciating the quality and usefulness of our data and results presented in our study. We have improved the manuscript based on their suggestions and provide a detailed, point-by-point response below.

Point-by-point description of the revisions

Reviewer #1 (Evidence, reproducibility and clarity):

The authors have a longstanding focus and reputation on single cell sequencing technology development and application. In this current study, the authors developed a novel single-cell multi-omic assay termed "T-ChIC" so that to jointly profile the histone modifications along with the full-length transcriptome from the same single cells, analyzed the dynamic relationship between chromatin state and gene expression during zebrafish development and cell fate determination. In general, the assay works well, the data look convincing and conclusions are beneficial to the community.

Thank you for your positive feedback.

There are several single-cell methodologies all claim to co-profile chromatin modifications and gene expression from the same individual cell, such as CoTECH, Paired-tag and others. Although T-ChIC employs pA-Mnase and IVT to obtain these modalities from single cells which are different, could the author provide some direct comparisons among all these technologies to see whether T-ChIC outperforms?

In a separate technical manuscript describing the application of T-ChIC in mouse cells (Zeller, Blotenburg et al 2024, (Zeller et al., 2024)), we have provided a direct comparison of data quality between T-ChIC and other single-cell methods for chromatin-RNA co-profiling (Please refer to Fig. 1C,D and Fig. S1D, E, of the preprint). We show that compared to other methods, T-ChIC is able to better preserve the expected biological relationship between the histone modifications and gene expression in single cells.

In current study, T-ChIC profiled H3K27me3 and H3K4me1 modifications, these data look great. How about other histone modifications (eg H3K9me3 and H3K36me3) and transcription factors?

While we haven’t profiled these other modifications using T-ChIC in Zebrafish, we have previously published high quality data on these histone modifications using the sortChIC method, on which T-ChIC is based (Zeller, Yeung et al 2023)(Zeller et al., 2022). In our comparison, we find that histone modification profiles between T-ChIC and sortChIC are very similar (Fig. S1C in Zeller, Blotenburg et al 2024). Therefore the method is expected to work as well for the other histone marks.

T-ChIC can detect full length transcription from the same single cells, but in FigS3, the authors still used other published single cell transcriptomics to annotate the cell types, this seems unnecessary?

We used the published scRNA-seq dataset with a larger number of cells to homogenize our cell type labels with these datasets, but we also cross-referenced our cluster-specific marker genes with ZFIN and homogenized the cell type labels with ZFIN ontology. This way our annotation is in line with previous datasets but not biased by it. Due the relatively smaller size of our data, we didn’t expect to identify unique, rare cell types, but our full-length total RNA assay helps us identify non-coding RNAs such as miRNA previously undetected in scRNA assays, which we have now highlighted in new figure S1c .

Throughout the manuscript, the authors found some interesting dynamics between chromatin state and gene expression during embryogenesis, independent approaches should be used to validate these findings, such as IHC staining or RNA ISH?

We appreciate that the ISH staining could be useful to validate the expression pattern of genes identified in this study. But to validate the relationships between the histone marks and gene expression, we need to combine these stainings with functional genomics experiments, such as PRC2-related knockouts. Due to their complexity, such experiments are beyond the scope of this manuscript (see also reply to reviewer #3, comment #4 for details).

In Fig2 and FigS4, the authors showed H3K27me3 cis spreading during development, this looks really interesting. Is this zebrafish specific? H3K27me3 ChIP-seq or CutTag data from mouse and/or human embryos should be reanalyzed and used to compare. The authors could speculate some possible mechanisms to explain this spreading pattern?

Thanks for the suggestion. In this revision, we have reanalysed a dataset of mouse ChIP-seq of H3K27me3 during mouse embryonic development by Xiang et al (Nature Genetics 2019) and find similar evidence of spreading of H3K27me3 signal from their pre-marked promoter regions at E5.5 epiblast upon differentiation (new Figure S4i). This observation, combined with the fact that the mechanism of pre-marking of promoters by PRC1-PRC2 interaction seems to be conserved between the two species (see (Hickey et al., 2022), (Mei et al., 2021) & (Chen et al., 2021)), suggests that the dynamics of H3K27me3 pattern establishment is conserved across vertebrates. But we think a high-resolution profiling via a method like T-ChIC would be more useful to demonstrate the dynamics of signal spreading during mouse embryonic development in the future. We have discussed this further in our revised manuscript.

Reviewer #1 (Significance):

The authors have a longstanding focus and reputation on single cell sequencing technology development and application. In this current study, the authors developed a novel single-cell multi-omic assay termed "T-ChIC" so that to jointly profile the histone modifications along with the full-length transcriptome from the same single cells, analyzed the dynamic relationship between chromatin state and gene expression during zebrafish development and cell fate determination. In general, the assay works well, the data look convincing and conclusions are beneficial to the community.

Thank you very much for your supportive remarks.

Reviewer #2 (Evidence, reproducibility and clarity):

Joint analysis of multiple modalities in single cells will provide a comprehensive view of cell fate states. In this manuscript, Bhardwaj et al developed a single-cell multi-omics assay, T-ChIC, to simultaneously capture histone modifications and full-length transcriptome and applied the method on early embryos of zebrafish. The authors observed a decoupled relationship between the chromatin modifications and gene expression at early developmental stages. The correlation becomes stronger as development proceeds, as genes are silenced by the cis-spreading of the repressive marker H3k27me3. Overall, the work is well performed, and the results are meaningful and interesting to readers in the epigenomic and embryonic development fields. There are some concerns before the manuscript is considered for publication.

We thank the reviewer for appreciating the quality of our study.

Major concerns:

(1) A major point of this study is to understand embryo development, especially gastrulation, with the power of scMulti-Omics assay. However, the current analysis didn't focus on deciphering the biology of gastrulation, i.e., lineage-specific pioneer factors that help to reform the chromatin landscape. The majority of the data analysis is based on the temporal dimension, but not the cell-type-specific dimension, which reduces the value of the single-cell assay.

We focussed on the lineage-specific transcription factor activity during gastrulation in Figure 4 and S8 of the manuscript and discovered several interesting regulators active at this stage. During our analysis of the temporal dimension for the rest of the manuscript, we also classified the cells by their germ layer and “latent” developmental time by taking the full advantage of the single-cell nature of our data. Additionally, we have now added the cell-type-specific H3K27me3 demethylation results for 24hpf in response to your comment below. We hope that these results, together with our openly available dataset would demonstrate the advantage of the single-cell aspect of our dataset.

(2) The cis-spreading of H3K27me3 with developmental time is interesting. Considering H3k27me3 could mark bivalent regions, especially in pluripotent cells, there must be some regions that have lost H3k27me3 signals during development. Therefore, it's confusing that the authors didn't find these regions (30% spreading, 70% stable). The authors should explain and discuss this issue.

Indeed we see that ~30% of the bins enriched in the pluripotent stage spread, while 70% do not seem to spread. In line with earlier observations(Hickey et al., 2022; Vastenhouw et al., 2010), we find that H3K27me3 is almost absent in the zygote and is still being accumulated until 24hpf and beyond. Therefore the majority of the sites in the genome still seem to be in the process of gaining H3K27me3 until 24hpf, explaining why we see mostly “spreading” and “stable” states. Considering most of these sites are at promoters and show signs of bivalency, we think that these sites are marked for activation or silencing at later stages. We have discussed this in the manuscript (“discussion”). However, in response to this and earlier comment, we went back and searched for genes that show H3K27me3 demethylation in the most mature cell types (at 24 hpf) in our data, and found a subset of genes that show K27 demethylation after acquiring them earlier. Interestingly, most of the top genes in this list are well-known as developmentally important for their corresponding cell types. We have added this new result and discussed it further in the manuscript (Fig. 2d,e, , Supplementary table 3).

Minors:

(1) The authors cited two scMulti-omics studies in the introduction, but there have been lots of single-cell multi-omics studies published recently. The authors should cite and consider them.

We have cited more single-cell chromatin and multiome studies focussed on early embryogenesis in the introduction now.

(2) bT-ChIC seems to have been presented in a previous paper (ref 15). Therefore, Fig. 1a is unnecessary to show.

Figure 1a. shows a summary of our Zebrafish TChIC workflow, which contains the unique sample multiplexing and sorting strategy to reduce batch effects, which was not applied in the original TChIC workflow. We have now clarified this in “Results”.

(3) It's better to show the percentage of cell numbers (30% vs 70%) for each heatmap in Figure 2C.

We have added the numbers to the corresponding legends.

(4) Please double-check the citation of Fig. S4C, which may not relate to the conclusion of signal differences between lineages.

The citation seems to be correct (Fig. S4C supplements Fig. 2C, but shows mesodermal lineage cells) but the description of the legend was a bit misleading. We have clarified this now.

(5) Figure 4C has not been cited or mentioned in the main text. Please check.

Thanks for pointing it out. We have cited it in Results now.

Reviewer #2 (Significance):

Strengths:

This work utilized a new single-cell multi-omics method and generated abundant epigenomics and transcriptomics datasets for cells covering multiple key developmental stages of zebrafish.

Limitations:

The data analysis was superficial and mainly focused on the correspondence between the two modalities. The discussion of developmental biology was limited.

Advance:

The zebrafish single-cell datasets are valuable. The T-ChIC method is new and interesting.

The audience will be specialized and from basic research fields, such as developmental biology, epigenomics, bioinformatics, etc.

I'm more specialized in the direction of single-cell epigenomics, gene regulation, 3D genomics, etc.

Thank you for your remarks.

Reviewer #3 (Evidence, reproducibility and clarity):

This manuscript introduces T‑ChIC, a single‑cell multi‑omics workflow that jointly profiles full‑length transcripts and histone modifications (H3K27me3 and H3K4me1) and applies it to early zebrafish embryos (4-24 hpf). The study convincingly demonstrates that chromatin-transcription coupling strengthens during gastrulation and somitogenesis, that promoter‑anchored H3K27me3 spreads in cis to enforce developmental gene silencing, and that integrating TF chromatin status with expression can predict lineage‑specific activators and repressors.

Major concerns

(1) Independent biological replicates are absent, so the authors should process at least one additional clutch of embryos for key stages (e.g., 6 hpf and 12 hpf) with T‑ChIC and demonstrate that the resulting data match the current dataset.

Thanks for pointing this out. We had, in fact, performed T-ChIC experiments in four rounds of biological replicates (independent clutch of embryos) and merged the data to create our resource. Although not all timepoints were profiled in each replicate, two timepoints (10 and 24hpf) are present in all four, and the celltype composition of these replicates from these 2 timepoints are very similar. We have added new plots in figure S2f and added (new) supplementary table (#1) to highlight the presence of biological replicates.

(2) The TF‑activity regression model uses an arbitrary R² {greater than or equal to} 0.6 threshold; cross‑validated R² distributions, permutation‑based FDR control, and effect‑size confidence intervals are needed to justify this cut‑off.

Thank you for this suggestion. We did use 10-fold cross validation during training and obtained the R²> values of TF motifs from the independent test set as an unbiased estimate. However, the cutoff of R² > 0.6 to select the TFs for classification was indeed arbitrary. In the revised version, we now report the FDR-adjusted p-values for these R² estimates based on permutation tests, and select TFs with a cutoff of padj < 0.01. We have updated our supplementary table #4 to include the p-values for all tested TFs. However, we see that our arbitrary cutoff of 0.6 was in fact, too stringent, and we can classify many more TFs based on the FDR cutoffs. We also updated our reported numbers in Fig. 4c to reflect this. Moreover, supplementary table #4 contains the complete list of TFs used in the analysis to allow others to choose their own cutoff.

(3) Predicted TF functions lack empirical support, making it essential to test representative activators (e.g., Tbx16) and repressors (e.g., Zbtb16a) via CRISPRi or morpholino knock‑down and to measure target‑gene expression and H3K4me1 changes.

We agree that independent validation of the functions of our predicted TFs on target gene activity would be important. During this revision, we analysed recently published scRNA-seq data of Saunders et al. (2023) (Saunders et al., 2023), which includes CRISPR-mediated F0 knockouts of a couple of our predicted TFs, but the scRNAseq was performed at later stages (24hpf onward) compared to our H3K4me1 analysis (which was 4-12 hpf). Therefore, we saw off-target genes being affected in lineages where these TFs are clearly not expressed (attached Fig 1). We therefore didn’t include these results in the manuscript. In future, we aim to systematically test the TFs predicted in our study with CRISPRi or similar experiments.

(4) The study does not prove that H3K27me3 spreading causes silencing; embryos treated with an Ezh2 inhibitor or prc2 mutants should be re‑profiled by T‑ChIC to show loss of spreading along with gene re‑expression.

We appreciate the suggestion that indeed PRC2-disruption followed by T-ChIC or other forms of validation would be needed to confirm whether the H3K27me3 spreading is indeed causally linked to the silencing of the identified target genes. But performing this validation is complicated because of multiple reasons: 1) due to the EZH2 contribution from maternal RNA and the contradicting effects of various EZH2 zygotic mutations (depending on where the mutation occurs), the only properly validated PRC2-related mutant seems to be the maternal-zygotic mutant MZezh2, which requires germ cell transplantation (see Rougeot et al. 2019 (Rougeot et al., 2019)) , and San et al. 2019 (San et al., 2019) for details). The use of inhibitors have been described in other studies (den Broeder et al., 2020; Huang et al., 2021), but they do not show a validation of the H3K27me3 loss or a similar phenotype as the MZezh2 mutants, and can present unwanted side effects and toxicity at a high dose, affecting gene expression results. Moreover, in an attempt to validate, we performed our own trials with the EZH2 inhibitor (GSK123) and saw that this time window might be too short to see the effect within 24hpf (attached Fig. 2). Therefore, this validation is a more complex endeavor beyond the scope of this study. Nevertheless, our further analysis of H3K27me3 de-methylation on developmentally important genes (new Fig. 2e-f, Sup. table 3) adds more confidence that the polycomb repression plays an important role, and provides enough ground for future follow up studies.

Minor concerns

(1) Repressive chromatin coverage is limited, so profiling an additional silencing mark such as H3K9me3 or DNA methylation would clarify cooperation with H3K27me3 during development.

We agree that H3K27me3 alone would not be sufficient to fully understand the repressive chromatin state. Extension to other chromatin marks and DNA methylation would be the focus of our follow up works.

(2) Computational transparency is incomplete; a supplementary table listing all trimming, mapping, and peak‑calling parameters (cutadapt, STAR/hisat2, MACS2, histoneHMM, etc.) should be provided.

As mentioned in the manuscript, we provide an open-source pre-processing pipeline “scChICflow” to perform all these steps (github.com/bhardwaj-lab/scChICflow). We have now also provided the configuration files on our zenodo repository (see below), which can simply be plugged into this pipeline together with the fastq files from GEO to obtain the processed dataset that we describe in the manuscript. Additionally, we have also clarified the peak calling and post-processing steps in the manuscript now.

(3) Data‑ and code‑availability statements lack detail; the exact GEO accession release date, loom‑file contents, and a DOI‑tagged Zenodo archive of analysis scripts should be added.

We have now publicly released the .h5ad files with raw counts, normalized counts, and complete gene and cell-level metadata, along with signal tracks (bigwigs) and peaks on GEO. Additionally, we now also released the source datasets and notebooks (Rmarkdown format) on Zenodo that can be used to replicate the figures in the manuscript, and updated our statements on “Data and code availability”.

(4) Minor editorial issues remain, such as replacing "critical" with "crucial" in the Abstract, adding software version numbers to figure legends, and correcting the SAMtools reference.

Thank you for spotting them. We have fixed these issues.

Reviewer #3 (Significance):

The method is technically innovative and the biological insights are valuable; however, several issues-mainly concerning experimental design, statistical rigor, and functional validation-must be addressed to solidify the conclusions.

Thank you for your comments. We hope to have addressed your concerns in this revised version of our manuscript.

Author response image 1. — (1) (top) expression of tbx16, which was one of the common TFs detected in our study and also targeted by Saunders et al by CRISPR. tbx16 expression is restricted to presomitic mesoderm lineage by 12hpf, and is mostly absent from 24hpf cell types. (bottom) shows DE genes detected in different cellular neighborhoods (circled) in tbx16 crispants from 24hpf subset of cells in Saunders et al. None of these DE genes were detected as “direct targets” in our analysis and therefore seem to be downstream effects. (2) Effect of 3 different concentrations of EZH2 inhibitor (GSK123) on global H3K27me3 quantified by flow cytometry using fluorescent coupled antibody (same as we used in T-ChIC) in two replicates. The cells were incubated between 3 and 10 hpf and collected afterwards for this analysis. We observed a small shift in H3K27me3 signal, but it was inconsistent between replicates.

References

Chen, Z., Djekidel, M. N., & Zhang, Y. (2021). Distinct dynamics and functions of H2AK119ub1 and H3K27me3 in mouse preimplantation embryos. Nature Genetics, 53(4), 551–563. den Broeder, M. J., Ballangby, J., Kamminga, L. M., Aleström, P., Legler, J., Lindeman, L. C., & Kamstra, J. H. (2020). Inhibition of methyltransferase activity of enhancer of zeste 2 leads to enhanced lipid accumulation and altered chromatin status in zebrafish. Epigenetics & Chromatin, 13(1), 5.

Hickey, G. J., Wike, C. L., Nie, X., Guo, Y., Tan, M., Murphy, P. J., & Cairns, B. R. (2022). Establishment of developmental gene silencing by ordered polycomb complex recruitment in early zebrafish embryos. eLife, 11, e67738.

Huang, Y., Yu, S.-H., Zhen, W.-X., Cheng, T., Wang, D., Lin, J.-B., Wu, Y.-H., Wang, Y.-F., Chen, Y., Shu, L.-P., Wang, Y., Sun, X.-J., Zhou, Y., Yang, F., Hsu, C.-H., & Xu, P.-F. (2021). Tanshinone I, a new EZH2 inhibitor restricts normal and malignant hematopoiesis through upregulation of MMP9 and ABCG2. Theranostics, 11(14), 6891–6904.

Mei, H., Kozuka, C., Hayashi, R., Kumon, M., Koseki, H., & Inoue, A. (2021). H2AK119ub1 guides maternal inheritance and zygotic deposition of H3K27me3 in mouse embryos. Nature Genetics, 53(4), 539–550.

Rougeot, J., Chrispijn, N. D., Aben, M., Elurbe, D. M., Andralojc, K. M., Murphy, P. J., Jansen, P. W. T. C., Vermeulen, M., Cairns, B. R., & Kamminga, L. M. (2019). Maintenance of spatial gene expression by Polycomb-mediated repression after formation of a vertebrate body plan. Development (Cambridge, England), 146(19), dev178590.

San, B., Rougeot, J., Voeltzke, K., van Vegchel, G., Aben, M., Andralojc, K. M., Flik, G., & Kamminga, L. M. (2019). The ezh2(sa1199) mutant zebrafish display no distinct phenotype. PloS One, 14(1), e0210217.

Saunders, L. M., Srivatsan, S. R., Duran, M., Dorrity, M. W., Ewing, B., Linbo, T. H., Shendure, J., Raible, D. W., Moens, C. B., Kimelman, D., & Trapnell, C. (2023). Embryo-scale reverse genetics at single-cell resolution. Nature, 623(7988), 782–791.

Vastenhouw, N. L., Zhang, Y., Woods, I. G., Imam, F., Regev, A., Liu, X. S., Rinn, J., & Schier, A. F. (2010). Chromatin signature of embryonic pluripotency is established during genome activation. Nature, 464(7290), 922–926.

Zeller, P., Blotenburg, M., Bhardwaj, V., de Barbanson, B. A., Salmén, F., & van Oudenaarden, A. (2024). T-ChIC: multi-omic detection of histone modifications and full-length transcriptomes in the same single cell. In bioRxiv (p. 2024.05.09.593364). https://doi.org/10.1101/2024.05.09.593364

Zeller, P., Yeung, J., Viñas Gaza, H., de Barbanson, B. A., Bhardwaj, V., Florescu, M., van der Linden, R., & van Oudenaarden, A. (2022). Single-cell sortChIC identifies hierarchical chromatin dynamics during hematopoiesis. Nature Genetics. https://doi.org/10.1038/s41588-022-01260-3

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Citations

Bhardwaj V, Viñas Gaza H, Griffa A, Zeller P, van Oudenaarden A. 2025. Single-cell multi-omic dataset mapping chromatin modifications and transcriptome during zebrafish development. NCBI Gene Expression Omnibus. GSE265874
Bhardwaj V. 2025. Replication package for: Single-cell multi-omic analysis reveals principles of transcription-chromatin interaction during embryogenesis. Zenodo. [DOI]

Supplementary Materials

Supplementary file 1. Number of cells acquired per timepoint, mark and batch.

elife-110400-supp1.xlsx^{(9.2KB, xlsx)}

Supplementary file 2. Comparison of median counts and detected genes per cell.

elife-110400-supp2.xlsx^{(9KB, xlsx)}

Supplementary file 3. Genes with differential H3K27me3 signal between selected 24hpf cell types and others.

elife-110400-supp3.xlsx^{(76.9KB, xlsx)}

Supplementary file 4. Classification of TFs based on the results of the penalized regression model.

elife-110400-supp4.xlsx^{(20.1KB, xlsx)}

MDAR checklist

elife-110400-mdarchecklist1.docx^{(87.8KB, docx)}

Data Availability Statement

The following datasets were generated:

Bhardwaj V. 2025. Replication package for: Single-cell multi-omic analysis reveals principles of transcription-chromatin interaction during embryogenesis. Zenodo.

[bib2] Ahlmann-Eltze C, Huber W. Comparison of transformations for single-cell RNA-seq data. Nature Methods. 2023;20:665–672. doi: 10.1038/s41592-023-01814-1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib3] Akdogan-Ozdilek B, Duval KL, Meng FW, Murphy PJ, Goll MG. Identification of chromatin states during zebrafish gastrulation using CUT&RUN and CUT&Tag. Developmental Dynamics. 2022;251:729–742. doi: 10.1002/dvdy.430. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib4] Argelaguet R, Clark SJ, Mohammed H, Stapel LC, Krueger C, Kapourani CA, Imaz-Rosshandler I, Lohoff T, Xiang Y, Hanna CW, Smallwood S, Ibarra-Soria X, Buettner F, Sanguinetti G, Xie W, Krueger F, Göttgens B, Rugg-Gunn PJ, Kelsey G, Dean W, Nichols J, Stegle O, Marioni JC, Reik W. Multi-omics profiling of mouse gastrulation at single-cell resolution. Nature. 2019;576:487–491. doi: 10.1038/s41586-019-1825-8. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib5] Bai D, Zhang X, Xiang H, Guo Z, Zhu C, Yi C. Simultaneous single-cell analysis of 5mC and 5hmC with SIMPLE-seq. Nature Biotechnology. 2025;43:85–96. doi: 10.1038/s41587-024-02148-9. [DOI] [PubMed] [Google Scholar]

[bib6] Baranasic D, Hörtenhuber M, Balwierz PJ, Zehnder T, Mukarram AK, Nepal C, Várnai C, Hadzhiev Y, Jimenez-Gonzalez A, Li N, Wragg J, D’Orazio FM, Relic D, Pachkov M, Díaz N, Hernández-Rodríguez B, Chen Z, Stoiber M, Dong M, Stevens I, Ross SE, Eagle A, Martin R, Obasaju O, Rastegar S, McGarvey AC, Kopp W, Chambers E, Wang D, Kim HR, Acemel RD, Naranjo S, Łapiński M, Chong V, Mathavan S, Peers B, Sauka-Spengler T, Vingron M, Carninci P, Ohler U, Lacadie SA, Burgess SM, Winata C, van Eeden F, Vaquerizas JM, Gómez-Skarmeta JL, Onichtchouk D, Brown BJ, Bogdanovic O, van Nimwegen E, Westerfield M, Wardle FC, Daub CO, Lenhard B, Müller F. Multiomic atlas with functional stratification and developmental dynamics of zebrafish cis-regulatory elements. Nature Genetics. 2022;54:1037–1050. doi: 10.1038/s41588-022-01089-w. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib7] Bartosovic M, Kabbe M, Castelo-Branco G. Single-cell CUT&Tag profiles histone modifications and transcription factors in complex tissues. Nature Biotechnology. 2021;39:825–835. doi: 10.1038/s41587-021-00869-9. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib8] Bergen V, Lange M, Peidli S, Wolf FA, Theis FJ. Generalizing RNA velocity to transient cell states through dynamical modeling. Nature Biotechnology. 2020;38:1408–1414. doi: 10.1038/s41587-020-0591-3. [DOI] [PubMed] [Google Scholar]

[bib9] Bhardwaj V, Heyne S, Sikora K, Rabbani L, Rauer M, Kilpert F, Richter AS, Ryan DP, Manke T. snakePipes: facilitating flexible, scalable and integrative epigenomic analysis. Bioinformatics. 2019;35:4757–4759. doi: 10.1093/bioinformatics/btz436. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib10] Bhardwaj V, Mourragui S. User-friendly exploration of epigenomic data in single cells using sincei. bioRxiv. 2024 doi: 10.1101/2024.07.27.605424. [DOI]

[bib11] Bhardwaj V, Sancho Gómez F. Software Heritage; 2025. https://archive.softwareheritage.org/swh:1:dir:13956aa2f66cb18a5f271d4bbb43d513b1f67003;origin=https://github.com/bhardwaj-lab/scChICflow;visit=swh:1:snp:3ec092121cb0f6fdf279408aff4ce8cf1f92312e;anchor=swh:1:rev:b61bcd4783530f0fb6d27f99bc7eb18e2d78e4ee [Google Scholar]

[bib12] Bogdanović O, van Heeringen SJ, Veenstra GJC. The epigenome in early vertebrate development. Genesis. 2012;50:192–206. doi: 10.1002/dvg.20831. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib13] Bradford YM, Van Slyke CE, Ruzicka L, Singer A, Eagle A, Fashena D, Howe DG, Frazer K, Martin R, Paddock H, Pich C, Ramachandran S, Westerfield M. Zebrafish information network, the knowledgebase for Danio rerio research. Genetics. 2022;220:iyac016. doi: 10.1093/genetics/iyac016. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib14] Cheung P, Vallania F, Warsinske HC, Donato M, Schaffert S, Chang SE, Dvorak M, Dekker CL, Davis MM, Utz PJ, Khatri P, Kuo AJ. Single-cell chromatin modification profiling reveals increased epigenetic variations with aging. Cell. 2018;173:1385–1397. doi: 10.1016/j.cell.2018.03.079. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib15] Clark SJ, Argelaguet R, Lohoff T, Krueger F, Drage D, Göttgens B, Marioni JC, Nichols J, Reik W. Single-cell multi-omics profiling links dynamic DNA methylation to cell fate decisions during mouse early organogenesis. Genome Biology. 2022;23:202. doi: 10.1186/s13059-022-02762-3. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib16] Cunningham F, Allen JE, Allen J, Alvarez-Jarreta J, Amode MR, Armean IM, Austine-Orimoloye O, Azov AG, Barnes I, Bennett R, Berry A, Bhai J, Bignell A, Billis K, Boddu S, Brooks L, Charkhchi M, Cummins C, Rin Fioretto L, Davidson C, Dodiya K, Donaldson S, El Houdaigui B, El Naboulsi T, Fatima R, Giron CG, Genez T, Martinez JG, Guijarro-Clarke C, Gymer A, Hardy M, Hollis Z, Hourlier T, Hunt T, Juettemann T, Kaikala V, Kay M, Lavidas I, Le T, Lemos D, Marugán JC, Mohanan S, Mushtaq A, Naven M, Ogeh DN, Parker A, Parton A, Perry M, Piližota I, Prosovetskaia I, Sakthivel MP, Salam AIA, Schmitt BM, Schuilenburg H, Sheppard D, Pérez-Silva JG, Stark W, Steed E, Sutinen K, Sukumaran R, Sumathipala D, Suner MM, Szpak M, Thormann A, Tricomi FF, Urbina-Gómez D, Veidenberg A, Walsh TA, Walts B, Willhoft N, Winterbottom A, Wass E, Chakiachvili M, Flint B, Frankish A, Giorgetti S, Haggerty L, Hunt SE, IIsley GR, Loveland JE, Martin FJ, Moore B, Mudge JM, Muffato M, Perry E, Ruffier M, Tate J, Thybert D, Trevanion SJ, Dyer S, Harrison PW, Howe KL, Yates AD, Zerbino DR, Flicek P. Ensembl 2022. Nucleic Acids Res. 2022;50:D988–D995. doi: 10.1093/nar/gkab1049. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib17] de la Calle Mustienes E, Gómez-Skarmeta JL, Bogdanović O. Genome-wide epigenetic cross-talk between DNA methylation and H3K27me3 in zebrafish embryos. Genomics Data. 2015;6:7–9. doi: 10.1016/j.gdata.2015.07.020. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib18] Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, Batut P, Chaisson M, Gingeras TR. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29:15–21. doi: 10.1093/bioinformatics/bts635. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib19] Dooley CM, Wali N, Sealy IM, White RJ, Stemple DL, Collins JE, Busch-Nentwich EM. The gene regulatory basis of genetic compensation during neural crest induction. PLOS Genetics. 2019;15:e1008213. doi: 10.1371/journal.pgen.1008213. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib20] Esterberg R, Fritz A. dlx3b/4b are required for the formation of the preplacodal region and otic placode through local modulation of BMP activity. Developmental Biology. 2009;325:189–199. doi: 10.1016/j.ydbio.2008.10.017. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib21] Farrell JA, Wang Y, Riesenfeld SJ, Shekhar K, Regev A, Schier AF. Single-cell reconstruction of developmental trajectories during zebrafish embryogenesis. Science. 2018;360:eaar3131. doi: 10.1126/science.aar3131. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib22] Fishman L, Nechooshtan G, Erhard F, Regev A, Farrell JA, Rabani M. Single-cell temporal dynamics reveals the relative contributions of transcription and degradation to cell-type specific gene expression in zebrafish embryos. bioRxiv. 2023 doi: 10.1101/2023.04.20.537620. [DOI]

[bib23] Fitz-James MH, Cavalli G. Molecular mechanisms of transgenerational epigenetic inheritance. Nature Reviews. Genetics. 2022;23:325–341. doi: 10.1038/s41576-021-00438-5. [DOI] [PubMed] [Google Scholar]

[bib24] Fu M, Pang L, Wu Z, Wang M, Jin J, Ai S, Li X. Single-cell multi-omics delineates the dynamics of distinct epigenetic codes coordinating mouse gastrulation. BMC Genomics. 2025;26:454. doi: 10.1186/s12864-025-11619-5. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib25] Gheldof A, Hulpiau P, van Roy F, De Craene B, Berx G. Evolutionary functional analysis and molecular regulation of the ZEB transcription factors. Cellular and Molecular Life Sciences. 2012;69:2527–2541. doi: 10.1007/s00018-012-0935-3. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib26] Giraldez AJ, Mishima Y, Rihel J, Grocock RJ, Van Dongen S, Inoue K, Enright AJ, Schier AF. Zebrafish MiR-430 promotes deadenylation and clearance of maternal mRNAs. Science. 2006;312:75–79. doi: 10.1126/science.1122689. [DOI] [PubMed] [Google Scholar]

[bib27] Gourishetti K, Balaji Easwaran V, Mostakim Y, Ranganath Pai KS, Bhere D. MicroRNA (miR)-124: a promising therapeutic gateway for oncology. Biology. 2023;12:922. doi: 10.3390/biology12070922. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib28] Guo F, Li L, Li J, Wu X, Hu B, Zhu P, Wen L, Tang F. Single-cell multi-omics sequencing of mouse early embryos and embryonic stem cells. Cell Research. 2017;27:967–988. doi: 10.1038/cr.2017.82. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib29] Guo Y, Zhao S, Wang GG. Polycomb gene silencing mechanisms: PRC2 chromatin targeting, H3K27me3 “readout”, and phase separation-based compaction. Trends in Genetics. 2021;37:547–565. doi: 10.1016/j.tig.2020.12.006. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib30] Halko N, Martinsson PG, Tropp JA. Finding structure with randomness: Probabilistic algorithms for constructing approximate matrix decompositions. arXiv. 2009 https://arxiv.org/abs/0909.4061

[bib31] Heinig M, Colomé-Tatché M, Taudt A, Rintisch C, Schafer S, Pravenec M, Hubner N, Vingron M, Johannes F. histoneHMM: Differential analysis of histone modifications with broad genomic footprints. BMC Bioinformatics. 2015;16:60. doi: 10.1186/s12859-015-0491-6. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib32] Hickey GJ, Wike CL, Nie X, Guo Y, Tan M, Murphy PJ, Cairns BR. Establishment of developmental gene silencing by ordered polycomb complex recruitment in early zebrafish embryos. eLife. 2022;11:e67738. doi: 10.7554/eLife.67738. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib33] Kaaij LJT, Mokry M, Zhou M, Musheev M, Geeven G, Melquiond ASJ, de Jesus Domingues AM, de Laat W, Niehrs C, Smith AD, Ketting RF. Enhancers reside in a unique epigenetic environment during early zebrafish development. Genome Biology. 2016;17:146. doi: 10.1186/s13059-016-1013-1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib34] Kim D, Langmead B, Salzberg S. HISAT2: graph-based alignment of next-generation sequencing reads to a population of genomes. 2.1.0Github. 2017 https://daehwankimlab.github.io/hisat2/

[bib35] Korsunsky I, Millard N, Fan J, Slowikowski K, Zhang F, Wei K, Baglaenko Y, Brenner M, Loh PR, Raychaudhuri S. Fast, sensitive and accurate integration of single-cell data with Harmony. Nature Methods. 2019;16:1289–1296. doi: 10.1038/s41592-019-0619-0. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib36] Kundaje A, Meuleman W, Ernst J, Bilenky M, Yen A, Heravi-Moussavi A, Kheradpour P, Zhang Z, Wang J, Ziller MJ, Amin V, Whitaker JW, Schultz MD, Ward LD, Sarkar A, Quon G, Sandstrom RS, Eaton ML, Wu Y-C, Pfenning AR, Wang X, Claussnitzer M, Liu Y, Coarfa C, Harris RA, Shoresh N, Epstein CB, Gjoneska E, Leung D, Xie W, Hawkins RD, Lister R, Hong C, Gascard P, Mungall AJ, Moore R, Chuah E, Tam A, Canfield TK, Hansen RS, Kaul R, Sabo PJ, Bansal MS, Carles A, Dixon JR, Farh K-H, Feizi S, Karlic R, Kim A-R, Kulkarni A, Li D, Lowdon R, Elliott G, Mercer TR, Neph SJ, Onuchic V, Polak P, Rajagopal N, Ray P, Sallari RC, Siebenthall KT, Sinnott-Armstrong NA, Stevens M, Thurman RE, Wu J, Zhang B, Zhou X, Beaudet AE, Boyer LA, De Jager PL, Farnham PJ, Fisher SJ, Haussler D, Jones SJM, Li W, Marra MA, McManus MT, Sunyaev S, Thomson JA, Tlsty TD, Tsai L-H, Wang W, Waterland RA, Zhang MQ, Chadwick LH, Bernstein BE, Costello JF, Ecker JR, Hirst M, Meissner A, Milosavljevic A, Ren B, Stamatoyannopoulos JA, Wang T, Kellis M, Roadmap Epigenomics Consortium Integrative analysis of 111 reference human epigenomes. Nature. 2015;518:317–330. doi: 10.1038/nature14248. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib37] La Manno G, Soldatov R, Zeisel A, Braun E, Hochgerner H, Petukhov V, Lidschreiber K, Kastriti ME, Lönnerberg P, Furlan A, Fan J, Borm LE, Liu Z, van Bruggen D, Guo J, He X, Barker R, Sundström E, Castelo-Branco G, Cramer P, Adameyko I, Linnarsson S, Kharchenko PV. RNA velocity of single cells. Nature. 2018;560:494–498. doi: 10.1038/s41586-018-0414-6. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib38] Lange M, Bergen V, Klein M, Setty M, Reuter B, Bakhti M, Lickert H, Ansari M, Schniering J, Schiller HB, Pe’er D, Theis FJ. CellRank for directed single-cell fate mapping. Nature Methods. 2022;19:159–170. doi: 10.1038/s41592-021-01346-6. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib39] Lause J, Berens P, Kobak D. Analytic Pearson residuals for normalization of single-cell RNA-seq UMI data. Genome Biology. 2021;22:258. doi: 10.1186/s13059-021-02451-7. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib40] Lawrence M, Huber W, Pagès H, Aboyoun P, Carlson M, Gentleman R, Morgan MT, Carey VJ. Software for computing and annotating genomic ranges. PLOS Computational Biology. 2013;9:e1003118. doi: 10.1371/journal.pcbi.1003118. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib41] Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, 1000 Genome Project Data Processing Subgroup The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib42] Liu Y, Zhu Z, Ho IHT, Shi Y, Li J, Wang X, Chan MTV, Cheng CHK. Genetic deletion of miR-430 disrupts maternal-zygotic transition and embryonic body plan. Frontiers in Genetics. 2020;11:853. doi: 10.3389/fgene.2020.00853. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib43] Liu H, Zhou J, Tian W, Luo C, Bartlett A, Aldridge A, Lucero J, Osteen JK, Nery JR, Chen H, Rivkin A, Castanon RG, Clock B, Li YE, Hou X, Poirion OB, Preissl S, Pinto-Duarte A, O’Connor C, Boggeman L, Fitzpatrick C, Nunn M, Mukamel EA, Zhang Z, Callaway EM, Ren B, Dixon JR, Behrens MM, Ecker JR. DNA methylation atlas of the mouse brain at single-cell resolution. Nature. 2021;598:120–128. doi: 10.1038/s41586-020-03182-8. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib44] Liu M, Yue Y, Chen X, Xian K, Dong C, Shi M, Xiong H, Tian K, Li Y, Zhang QC, He A. Genome-coverage single-cell histone modifications for embryo lineage tracing. Nature. 2025;640:828–839. doi: 10.1038/s41586-025-08656-1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib45] Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet.Journal. 2011;17:10. doi: 10.14806/ej.17.1.200. [DOI] [Google Scholar]

[bib46] Murphy PJ, Wu SF, James CR, Wike CL, Cairns BR. Placeholder nucleosomes underlie germline-to-embryo DNA methylation reprogramming. Cell. 2018;172:993–1006. doi: 10.1016/j.cell.2018.01.022. [DOI] [PubMed] [Google Scholar]

[bib47] Nichols RV, O’Connell BL, Mulqueen RM, Thomas J, Woodfin AR, Acharya S, Mandel G, Pokholok D, Steemers FJ, Adey AC. High-throughput robust single-cell DNA methylation profiling with sciMETv2. Nature Communications. 2022;13:7627. doi: 10.1038/s41467-022-35374-3. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib48] Pachkov M, Erb I, Molina N, van Nimwegen E. SwissRegulon: a database of genome-wide annotations of regulatory sites. Nucleic Acids Research. 2007;35:D127–D131. doi: 10.1093/nar/gkl857. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib49] Payumo AY, McQuade LE, Walker WJ, Yamazoe S, Chen JK. Tbx16 regulates hox gene activation in mesodermal progenitor cells. Nature Chemical Biology. 2016;12:694–701. doi: 10.1038/nchembio.2124. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib50] Persad S, Choo Z-N, Dien C, Sohail N, Masilionis I, Chaligné R, Nawy T, Brown CC, Sharma R, Pe’er I, Setty M, Pe’er D. SEACells infers transcriptional and epigenomic cellular states from single-cell genomics data. Nature Biotechnology. 2023;41:1746–1757. doi: 10.1038/s41587-023-01716-9. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib51] Ramírez F, Ryan DP, Grüning B, Bhardwaj V, Kilpert F, Richter AS, Heyne S, Dündar F, Manke T. deepTools2: a next generation web server for deep-sequencing data analysis. Nucleic Acids Research. 2016;44:W160–W165. doi: 10.1093/nar/gkw257. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib52] Řehůřek R, Sojka P. Software framework for topic modelling with large corpora. Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks. ELRA; 2010. pp. 45–50. [DOI] [Google Scholar]

[bib53] Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26:139–140. doi: 10.1093/bioinformatics/btp616. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib54] Salmen F, De Jonghe J, Kaminski TS, Alemany A, Parada GE, Verity-Legg J, Yanagida A, Kohler TN, Battich N, van den Brekel F, Ellermann AL, Arias AM, Nichols J, Hemberg M, Hollfelder F, van Oudenaarden A. High-throughput total RNA sequencing in single cells using VASA-seq. Nature Biotechnology. 2022;40:1780–1793. doi: 10.1038/s41587-022-01361-8. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib55] San B, Chrispijn ND, Wittkopp N, van Heeringen SJ, Lagendijk AK, Aben M, Bakkers J, Ketting RF, Kamminga LM. Normal formation of a vertebrate body plan and loss of tissue maintenance in the absence of ezh2. Scientific Reports. 2016;6:24658. doi: 10.1038/srep24658. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib56] Schep AN, Wu B, Buenrostro JD, Greenleaf WJ. chromVAR: inferring transcription-factor-associated accessibility from single-cell epigenomic data. Nature Methods. 2017;14:975–978. doi: 10.1038/nmeth.4401. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib57] Schmid M, Durussel T, Laemmli UK. ChIC and ChEC; genomic mapping of chromatin proteins. Molecular Cell. 2004;16:147–157. doi: 10.1016/j.molcel.2004.09.007. [DOI] [PubMed] [Google Scholar]

[bib58] Sedykh I, Keller AN, Yoon B, Roberson L, Moskvin OV, Grinblat Y. Zebrafish Rfx4 controls dorsal and ventral midline formation in the neural tube. Developmental Dynamics. 2018;247:650–659. doi: 10.1002/dvdy.24613. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib59] Singhal A, Buckley C, Mitra M. Pivoted Document Length Normalization. ACM SIGIR Forum. 2017;51:176–184. doi: 10.1145/3130348.3130365. [DOI] [Google Scholar]

[bib60] Smith T, Heger A, Sudbery I. UMI-tools: modeling sequencing errors in unique molecular identifiers to improve quantification accuracy. Genome Research. 2017;27:491–499. doi: 10.1101/gr.209601.116. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib61] Tibshirani R. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society Series B. 1996;58:267–288. doi: 10.1111/j.2517-6161.1996.tb02080.x. [DOI] [Google Scholar]

[bib62] van Heeringen SJ, Veenstra GJC. GimmeMotifs: a de novo motif prediction pipeline for ChIP-sequencing experiments. Bioinformatics. 2011;27:270–271. doi: 10.1093/bioinformatics/btq636. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib63] Vastenhouw NL, Zhang Y, Woods IG, Imam F, Regev A, Liu XS, Rinn J, Schier AF. Chromatin signature of embryonic pluripotency is established during genome activation. Nature. 2010;464:922–926. doi: 10.1038/nature08866. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib64] Veronezi GMB, Ramachandran S. Nucleation and spreading maintain Polycomb domains every cell cycle. Cell Reports. 2024;43:114090. doi: 10.1016/j.celrep.2024.114090. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib65] Wagner DE, Weinreb C, Collins ZM, Briggs JA, Megason SG, Klein AM. Single-cell mapping of gene expression landscapes and lineage in the zebrafish embryo. Science. 2018;360:981–987. doi: 10.1126/science.aar4362. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib66] Wolf FA, Angerer P, Theis FJ. SCANPY: large-scale single-cell gene expression data analysis. Genome Biology. 2018;19:15. doi: 10.1186/s13059-017-1382-0. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib67] Wolf FA, Hamey FK, Plass M, Solana J, Dahlin JS, Göttgens B, Rajewsky N, Simon L, Theis FJ. PAGA: graph abstraction reconciles clustering with trajectory inference through a topology preserving map of single cells. Genome Biology. 2019;20:59. doi: 10.1186/s13059-019-1663-x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib68] Wu SJ, Furlan SN, Mihalas AB, Kaya-Okur HS, Feroze AH, Emerson SN, Zheng Y, Carson K, Cimino PJ, Keene CD, Sarthy JF, Gottardo R, Ahmad K, Henikoff S, Patel AP. Single-cell CUT&Tag analysis of chromatin modifications in differentiation and tumor progression. Nature Biotechnology. 2021;39:819–824. doi: 10.1038/s41587-021-00865-z. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib69] Xiang Y, Zhang Y, Xu Q, Zhou C, Liu B, Du Z, Zhang K, Zhang B, Wang X, Gayen S, Liu L, Wang Y, Li Y, Wang Q, Kalantry S, Li L, Xie W. Epigenomic analysis of gastrulation identifies a unique chromatin state for primed pluripotency. Nature Genetics. 2020;52:95–105. doi: 10.1038/s41588-019-0545-1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib70] Xu PF, Houssin N, Ferri-Lagneau KF, Thisse B, Thisse C. Construction of a vertebrate embryo from two opposing morphogen gradients. Science. 2014;344:87–89. doi: 10.1126/science.1248252. [DOI] [PubMed] [Google Scholar]

[bib71] Yette GA, Stewart S, Stankunas K. Zebrafish polycomb repressive complex-2 critical roles are largely Ezh2- over Ezh1-driven and concentrate during early embryogenesis. bioRxiv. 2021 doi: 10.1101/2020.12.31.424918. [DOI]

[bib72] Zeller P, Yeung J, Viñas Gaza H, de Barbanson BA, Bhardwaj V, Florescu M, van der Linden R, van Oudenaarden A. Single-cell sortChIC identifies hierarchical chromatin dynamics during hematopoiesis. Nature Genetics. 2023;55:333–345. doi: 10.1038/s41588-022-01260-3. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib73] Zeller P, Blotenburg M, Bhardwaj V, Barbanson BA, Salmén F, Oudenaarden A. T-ChIC: multi-omic detection of histone modifications and full-length transcriptomes in the same single cell. bioRxiv. 2024 doi: 10.1101/2024.05.09.593364. [DOI]

[bib74] Zhang Y, Liu T, Meyer CA, Eeckhoute J, Johnson DS, Bernstein BE, Nusbaum C, Myers RM, Brown M, Li W, Liu XS. Model-based analysis of chIP-seq (MACS) Genome Biology. 2008;9:R137. doi: 10.1186/gb-2008-9-9-r137. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib75] Zhao C, Biondic S, Vandal K, Björklund ÅK, Hagemann-Jensen M, Sommer TM, Canizo J, Clark S, Raymond P, Zenklusen DR, Rivron N, Reik W, Petropoulos S. Single-cell multi-omics of human preimplantation embryos shows susceptibility to glucocorticoids. Genome Research. 2022;32:1627–1641. doi: 10.1101/gr.276665.122. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Single-cell co-mapping reveals relationship between chromatin state and gene expression in early zebrafish development

Vivek Bhardwaj

Alberto Griffa

Helena Viñas Gaza

Peter Zeller

Alexander van Oudenaarden

Roles

Abstract

Introduction

Results

Paired profiling of histone modifications and transcriptome of single cells during zebrafish embryogenesis

Figure 1. Whole-organism T-ChIC of zebrafish embryos quantifies full-length transcripts and histone modifications from the same single cell.

Figure 1—figure supplement 1. Quality control for the RNA and ChIC fraction.

Figure 1—figure supplement 2. Evaluation of the quantitative chromatin signal from woT-ChIC.

Spatiotemporal spreading of H3K27me3 associates with the silencing of gene expression during development

Figure 2. Spatio-temporal spreading of H3K27me3 correlates with gene silencing.

Figure 2—figure supplement 1. Annotation of H3K27me3-RNA woT-ChIC data (4–24 hpf).

Figure 2—figure supplement 2. H3K27me3 shows cis-spreading with time.

Global chromatin state of cells is decoupled from gene expression during early development

Figure 3. Integrative analysis of H3K27me3, H3K4me1, and transcriptome.

Figure 3—figure supplement 1. Quality control and comparison of H3K27me and H3K4me1 signal in nuclei.

Figure 3—figure supplement 2. Comparison of genomic distribution of H3K4me1 and H3K27me3 with time.

Figure 3—figure supplement 3. Derivation of latent time and lineages using RNA velocity on 4–12 hpf data.

The chromatin state of binding sites predicts the function of transcription factors during gastrulation

Figure 4. Prediction of TF activity using TF epigenetics and transcription.

Figure 4—figure supplement 1. Examples of TFs with predicted activation and repression functions and their epigenetic regulation.

Discussion

Materials and methods

Whole-organism T-ChIC of zebrafish embryos

Processing and quality control of T-ChIC data

Analysis of publicly available data

Cell clustering and annotation using RNA signal

Integrated analysis of nuclei and whole-cell data

Metacell analysis

Cell clustering using the ChIC signal

Peak calling and annotation

H3K27me3 spreading and demethylation analysis

TF activity prediction and classification

Code availability

Materials availability

Acknowledgements

Funding Statement

Contributor Information

Funding Information

Additional information

Competing interests

Author contributions

Additional files

Data availability

References

eLife Assessment

H Efsun Arda

Roles

Reviewer #1 (Public review):

Anonymous

Roles

Reviewer #2 (Public review):

Anonymous

Roles

Author response

Vivek Bhardwaj

Alberto Griffa

Helena Viñas Gaza

Peter Zeller

Alexander van Oudenaarden

Roles

Author response image 1.

Associated Data

Data Citations

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases