Skip to main content
UKPMC Funders Author Manuscripts logoLink to UKPMC Funders Author Manuscripts
. Author manuscript; available in PMC: 2025 Dec 12.
Published in final edited form as: Cell Rep. 2025 Jul 28;44(8):116047. doi: 10.1016/j.celrep.2025.116047

A spatiotemporal atlas of mouse gastrulation and early organogenesis to explore axial patterning and project in vitro models onto in vivo space

Luke TG Harland 1,2,3,*, Tim Lohoff 4,15, Noushin Koulena 5,16, Nico Pierson 5, Constantin Pape 6, Farhan Ameen 7,8,9, Jonathan Griffiths 10,11,17, Bart Theeuwes 1,2, Nicola K Wilson 1,2, Anna Kreshuk 12, Wolf Reik 4,13,17,#, Jennifer Nichols 14,#, Long Cai 5,#, John C Marioni 10,12,13,#, Berthold Göttgens 1,2,*, Shila Ghazanfar 7,8,9,19,*
PMCID: PMC7618461  EMSID: EMS211405  PMID: 40728928

Summary

During gastrulation, mouse epiblast cells form the three germ layers that establish the body plan and initiate organogenesis. While single-cell atlases have advanced our understanding of lineage diversification, spatial aspects of differentiation remain poorly defined. Here, we applied spatial transcriptomics to mouse embryos at embryonic (E) E7.25 and E7.5 days and integrated these data with existing E8.5 spatial and E6.5–E9.5 single-cell RNA-seq atlases. This resulted in a spatiotemporal atlas of over 150,000 cells with 82 refined cell-type annotations. The resource enables exploration of gene expression dynamics across anterior-posterior and dorsal-ventral axes, uncovering spatial logic guiding mesodermal fate decisions in the primitive streak. We also developed a computational pipeline to project additional single-cell datasets into this framework for comparative analysis. Freely accessible through an interactive web portal, this atlas offers a valuable tool for the developmental and stem cell biology communities to investigate mouse embryogenesis in a spatial and temporal context.


Graphical abstract.

Graphical abstract

Introduction

Gastrulation is a critical phase in early embryonic development during which a relatively simple, bi-layered arrangement of cells undergoes rapid proliferation, migration, and differentiation into various mesoderm, endoderm, and ectodermal precursors that establish the body plan and give rise to all major organs.1,2 Although significant progress has been made in profiling these early stages of murine development at both single-cell and molecular levels,39 the intricate spatial interactions between cells and tissues as specified and patterned during gastrulation remain underexplored. Spatial omics technologies, including those that allow profiling of tissues at the resolution of hundreds of genes, have yet to be applied to the study of these early stages of mouse gastrulation, representing an important gap in our understanding.

We recently generated a comprehensive, time-resolved single-cell RNA sequencing (scRNA-seq) atlas, spanning embryonic (E) 6.5 to E9.5 days of mouse development, encompassing gastrulation and early organogenesis.8 This atlas, comprising over 400,000 single-cell transcriptomes, defines 88 transcriptionally distinct cell types across embryonic and extraembryonic tissues. Using a previously published atlas3 spanning E6.5–E8.5, we identified 351 marker genes (Table S1) that distinguish cell types in mouse embryos across this time span, selected for their suitability in spatial transcriptomics via sequential fluorescent in situ hybridization (seqFISH).10 In our previous work, we examined the expression patterns of these 351 markers in sagittal sections of three E8.5 mouse embryos.11

In this study, we transformed the previous static snapshot spatial data into a temporally resolved spatial atlas of mouse gastrulation and early organogenesis by applying the same seqFISH approach to sagittal sections (five optical sections/embryo) of four additional mouse embryos captured during gastrulation at E7.25 and E7.5.12 This advanced atlas provides insights into the heterogeneous transcriptional profiles that align with spatial locations throughout mouse development. Additionally, it enables exploration of gene expression dynamics along anterior-posterior (AP) and dorsal-ventral (DV) axes, shedding light on rapid, region-specific mesodermal fate decisions within the primitive streak at E7.25. Finally, we developed a computational pipeline to project additional scRNA-seq datasets into this spatiotemporal atlas. As a proof of concept, we mapped the AP distribution of cell types in gastruloid mouse models,13 showcasing the atlas as a powerful tool for benchmarking developmental processes in vitro.

Results

Exploring gene expression patterns during mouse gastrulation using seqFISH

To explore gene expression patterns during murine gastrulation, we performed seqFISH10 on a total of 20 optical sagittal sections from four embryos (five optical sections/embryo) that were collected at E6.5 and E7.5 (Figures 1A–1E and S1A; Videos S1 and S2). Embryos collected at E6.5 (1 and 2) contained primitive streaks that extended to the distal tip of the egg cylinder, consistent with late-streak stage embryos, which are typically classified as E7.25.12 After sample preparation, imaging, cell segmentation, and mRNA dot calling, we computed normalized gene expression levels for 351 genes across 14,794 cells (Figure 1E). Before clustering, we performed Seurat rPCA integration,14 first between embryos at the same developmental stage (e.g., embryos 1 and 2) and then across stages (E7.25 and E7.5). As a quality control step after the first round of integration, we excluded clusters with abnormal RNA/feature counts as well as a cluster from the most distal region of the ectoplacental cone in E7.25 embryos (Figure S1B, red clusters). In total, 17 transcriptionally distinct clusters were identified, which we visualized in a low-dimensional UMAP (uniform manifold approximation and projection) and their spatial context (Figures 2A–2C and S1C–S1G). The cell types were annotated based on marker gene expression patterns and spatial localization (Figures 2B–2F and S1C–S1G; Table S1).

Figure 1. Spatial transcriptomic mapping of early mouse development using seqFISH.

Figure 1

(A and C) 3D illustrations of Theiler stage (TS)10 (A) and TS11 (C) mouse embryos, adapted from eMouse Atlas. Dotted red lines mark the estimated positions of sagittal optical tissue sections shown in (B) and (D). Orientation abbreviations: D, distal; V, ventral; R, right; L, left; A, anterior; P, posterior; PR, proximal.

(B and D) Tile scans of 4-μm sagittal sections from two independently sampled E7.25 embryos (B) and one E7.5 embryo (D), imaged using seqFISH with DAPI nuclear staining (white).

(E) Schematic outline of the seqFISH pipeline adapted from Lohoff et al. 2022.11 See also Table S1.

Figure 2. High-resolution spatial profiling of gene expression during mouse gastrulation.

Figure 2

(A and B) UMAP projections generated from integrated seqFISH expression data. In (A), cells are colored by their embryonic origin, and in (B), by cell type.

(C) Spatial maps of E7.25 and E7.5 embryos, with cells colored according to their cell types. The black dotted line indicates the ExE-Em boundary. BI, blood islands; EPC, ectoplacental cone; HEP, hematoendothelial progenitors.

(D) Representative visualization of normalized log expression counts of selected genes, measured by seqFISH to validate the performance in E7.25 (top) and E7.5 (bottom) embryos.

(E) Stacked bar chart showing the proportion of cell types per seqFISH embryo.

(F) Dot plot displaying the average gene expression for marker genes across different cell types identified in the E7.25 and E7.5 seqFISH embryos. See also Table S1.

Murine gastrulation begins at approximately E6.25 when the primitive streak initiates formation at the prospective posterior of the embryo near the extraembryonic-embryonic (ExE-Em) junction. By E7.25, the primitive streak extends to the distal tip of the egg cylinder, at which point the distal half of the embryo consists of three developing germ layers: an outer visceral endoderm, an inner epiblast, and an emerging middle layer of nascent mesoderm and definitive endoderm. The mesodermal and definitive endoderm layers originate from epiblast cells migrating through the primitive streak. In embryos 1 and 2, we identified an inner epiblast cell layer expressing Dnmt3a, Dnmt3b, Pou5f1, and Slc7a3 (Figures 1H–1K and S1H), as well as a primitive streak population marked by Sp5, T (Brachyury), and Lhx1 (Figure S1H). Cells expressing Lhx1, Sox17, Sfrp1, Foxa2, and Gpc4 visible along the axial midline of embryo 2 were labeled axial mesendoderm, and cells with a similar molecular signature, located in the distal tip of the egg cylinders in embryos 1 and 2, that intermingled with the overlying visceral endoderm, were annotated definitive endoderm (DE) precursors (Figures 2C–2F, S1D, S1F, and S1H; Table S1).

At E7.25, several mesodermal populations were observed, including extraembryonic (ExE) mesoderm cells expressing Dlk1 and mesodermal wings at the anterior ExE-Em boundary in embryo 2, which expressed Foxf1, Hand1, Tbx3, and Msx1 (Figures 2C–2F, S1D, S1F, and S1H). Hematoendothelial progenitors, marked by Runx1, Gata1, and Tal1, were detected in developing yolk sac (YS) blood islands (Figures 1H–1K, S1D, S1F, and S1H). Various Col4a1+ visceral endoderm populations surrounding the egg cylinder were identified, including an ExE endoderm population expressing Clic6 and Soat2 (Figures 2C–2F, S1D, S1F, and S1H). Finally, distinct ExE ectodermal populations displayed unique gene expression patterns, in line with their proximal-distal locations relative to the ExE-Em junction (Figures 2C–2F, S1D, S1F, and S1H).

By E7.5, the primitive streak has elongated to the distal tip of the egg cylinder, and the allantoic bud begins arising from the posterior-most region of the streak and extending toward the chorionic layer. In seqFISH embryo 3, a prominent allantoic bud composed of ExE mesoderm expressing Tbx4 was observed (Figures 2C–2F). Additional extraembryonic tissues in embryos 3 and 4 contained ExE mesoderm, including the visceral YS, amnion, and chorion (Figures 1H, S1C, S1E, and S1G). E7.5 embryos also contained hematoendothelial cell populations (#1 and #2) expressing Runx1, Gata1, Lmo2, and Fgf3, adjacent to ExE endoderm in YS blood islands (Figures 2C–2F, S1C, S1E, S1G, and S1I). In contrast to E7.25 embryos, distal halves of E7.5 embryos consisted of caudal epiblast cells expressing Cdx1, Hoxb1, and Nkx1-2 and reduced expression of E7.25 epiblast markers (Figures 2C–2F, S1C, S1E, S1G, and S1I) as well as rostral ectoderm cells marked by expression of Cxcl12 (Figures 2C–2F and S1C). Altogether, these analyses highlight the utility of seqFISH and our 351-gene probe set to resolve transcriptional diversity associated with spatially arranged cell types at the onset of murine gastrulation.

Generating an integrated spatiotemporal transcriptional atlas covering murine gastrulation and early organogenesis

To fully leverage these new spatial transcriptomic datasets, we integrated them with three additional E8.5 seqFISH embryo datasets11 (Figures S2A–S2D) and applied StabMap15 and reducedMNN16 to align all quality-controlled seqFISH cells (Figures S1B and S2A) with a time-resolved scRNA-seq (scRNA) atlas8 spanning E6.5 to E9.5 of murine embryogenesis (Figures 3A–3C). After integration, we performed clustering, UMAP visualization, cell-type label transfer, and imputed missing gene expression patterns in the seqFISH embryos. Clusters with poor representation across both datasets, referred to as “poor joint clusters,” were identified and filtered prior to further analyses (Figure S3A). For instance, YS endothelium and YS mesothelium clusters were composed almost entirely (>98%) of scRNA cells, in line with their absence from seqFISH samples. Additionally, we identified seqFISH cells that mis-aligned with expected embryonic stages (~12%) (Figure S3A), which may represent interesting cell populations that are either further or less differentiated (see metadata “poor stage mapping”).

Figure 3. Integration of spatial and single-cell transcriptomics to construct a spatiotemporal atlas of mouse gastrulation and early organogenesis.

Figure 3

(A) Overview of the single-cell transcriptomic datasets (seqFISH in blue and scRNA-seq in red) integrated to create a spatiotemporal transcriptional atlas of mouse gastrulation and early organogenesis (E6.5–E9.5). Gray bars represent the total number of cells per dataset.

(B and C) UMAP projection of cells from the extended gastrulation atlas (scRNA) and seqFISH into a unified reduced-dimensional space. Cells are colored by embryonic stage (A) and refined cell type (B). See also Figures S3–S6 and Table S1.

Cells were initially annotated by assigning joint clusters with the highly voted scRNA labels. These annotations were assessed and, in some cases, further refined by analyzing marker gene expression patterns and visually inspecting the spatial distribution of cell types by seqFISH embryos (see STAR Methods for details). Altogether, our approach generated a comprehensive spatiotemporal transcriptional atlas of mouse gastrulation and early organogenesis, encompassing 82 refined cell-type annotations across more than 150,000 cells (with good alignment across both seqFISH and scRNA datasets) spanning developmental stages E6.5 to E9.5 (Figures 3B and 3C). This new resource enables exploration of cellular diversity in both spatial and temporal contexts during mouse embryogenesis (Figures 4A and S3–S5), providing new single-cell resolved spatial maps during gastrulation at E7.25–E7.5 and an increase in cell-type resolution at early stages of organogenesis from ~30 to 55 cell types at E8.5.

Figure 4. Exploring cell type distributions in space across time points during mouse embryogenesis.

Figure 4

(A) Spatial maps of E7.25, E7.5, and E8.5 embryos, organized by germ layer, with cells colored by refined cell type labels.

(B) Heatmap showing the proportion of cell types across embryonic stages in the extended gastrulation atlas (scRNA) and seqFISH samples. Proportions are row-scaled, and the columns are hierarchically clustered. ExE, extraembryonic; PGC, primordial germ cell; LPM, lateral plate mesoderm; FHF, first heart field; SHF, second heart field; EMP, erythron-myeloid progenitors; MEP, myeloid-erythroid progenitor; NMP, neuro-mesodermal progenitor; Chorioall.-der., chorioallantoic-derived. See also Figures S3–S6 and Table S1.

A comparison of the cellular composition of seqFISH embryos with scRNA embryo pools from different stages revealed that E7.25 seqFISH embryos closely align with E6.5–E7.25 scRNA samples, while E7.5 and E8.5 seqFISH embryos correspond to E7.5 and E8.5–E9.5, respectively (Figure 4B). It is worth noting that our seqFISH datasets do not comprehensively capture the full medial-lateral axis of the embryos. As a result, the observed cell type proportions reflect only a subset of the total embryonic composition. The cell counts per cell type across all seqFISH embryos (embryos 1–7) are provided in Figures S3C–S3F. Additionally, the spatial distribution of the refined cell types with key marker gene expression patterns from both seqFISH and scRNA cells is presented by embryonic stage and germ layer in Figures S4–S6, Video S3; Tables S1. The integrated dataset is accessible through a user-friendly web portal (see data and code availability section), allowing researchers to perform virtual dissections, investigate both raw and imputed seqFISH gene expression patterns, and identify differentially expressed genes in this context.

Accounting for spatial location enables refined cell-type annotation

Consistent with our analyses in Figure 2, E7.25 and E7.5 seqFISH embryos contained various epiblast-derived cell populations (E6.5–E7.25 scRNA samples), including cells labeled primitive streak, nascent mesoderm (Mesp1, Lefty2, Mixl1, and Eomes),17,18 hematoendothelial progenitors (Kdr, Tal1, and Etv2),19 and Gata1+ blood progenitors (Figures 3B, 3C, 4A, 4B, and S4A–S4E, Video S5-2; Table S1). An anterior primitive streak population (Cer1, Sox17, Foxa2, and Lefty1)1,2022 was positioned in the distal domain of the primitive streak region and extended into the axial midline of embryos 1/2 (Figures 3B, 3C, 4A, 4B, and S4A–S4C; Video S3; Table S1). Notably, this population includes both anterior primitive cells as well as anterior primitive streak derivatives in seqFISH embryos, including cell populations labeled DE precursors and axial mesendoderm in Figure 1H. Therefore, the cell population labeled anterior primitive streak in the extended scRNA mouse gastrulation atlas8 likely contains anterior primitive streak cells and anterior primitive streak derivatives.

E7.25 and E7.5 seqFISH embryos also contained ectodermal cells (Ptn and Cxcl12), ExE mesodermal cells (Foxf1 and Bmp4),23,24 and primordial germ cells (Figures 4A and S4A–S4E; Table S1; Video S3). Primitive endoderm-derived populations, including visceral endoderm and ExE endoderm, were identified on the exteriors of egg cylinders (Figure 4A). Proximal and distal ExE ectodermal cells, assigned based on their spatial orientation relative to the ExE-Em junction (Figure 2C), were found in extraembryonic regions (Figures 4A and S4A–S4C, Video S3). Rostral ectoderm and caudal epiblast (Cdx1 and Nkx1-2) cells were present by E7.5 (Figures 4A and S4A–S4C; Video S3). Notably, the higher proportion of distal ExE ectoderm cells in E7.25 seqFISH embryos, compared to scRNA-seq samples, is likely due to the removal of the “sticky” ectoplacental cone prior to sequencing scRNA samples (Figure 4B). Finally, comparison of imputed gene expression patterns in early-stage seqFISH embryos with whole-mount in situ hybridization images from TS10/11 embryos in the eMouse Atlas25 highlights the accuracy and reliability of imputed gene expression patterns (Figures S4F and S4G).

Previously, we assigned ~ 30 cell-type labels to E8.5 seqFISH embryos11 using a gastrulation atlas covering stages E6.5–E8.53. In this study, aligning E8.5 seqFISH embryos with the expanded gastrulation atlas8 (E6.5–E9.5) substantially increased the number of annotated cell types at E8.5 to 55 (Figures 4B and S2B). E8.5 seqFISH embryos contained first and second heart field cardiomyocytes (Nkx2-5, Isl1),26 as well as endocardial cells #1 and #2 (Hand2, Vwf, Gata5, Pecam1, and Sox17),27,28 cardiopharyngeal progenitors (Smarcd3),29 and epicardium (Wt1 and Tcf21),30,31 which localize to distinctive heart regions in embryos 5–7 (Figures S5A–S5F; Table S1; Video S4). The allantois endothelium (Dlk1 and Bambi)3,8 was positioned in the posterior regions of embryos 5 and 7, adjacent to the allantois (Tbx4 and Hoxa10)32,33 cell population in embryo 7 (Figures S5A–S5F; Video S4). Embryo proper endothelium #1 and #2, which was depleted in cardiac tissues, contributed to intersomitic and cranial vessels and extended along the trunk (Figures S5D–S5F; Video S4).

Somitic tissues, including the sclerotome (Pax1),34 dermomyotome (Pax3),35 and endotome, were also observed in the trunk, while somitic and presomitic mesoderm (Tbx6, T, Hes7, and Dll1/3)36 localized to the posterior regions, corresponding to the differentiation front, where cells in the presomitic mesoderm transition from a proliferative state to form somites (Figures S5G–S5I; Video S4). The gut tube was composed of pharyngeal endoderm (Nkx2-3 and Pitx1),37 thyroid primordium, foregut (Foxa1 and Gata6),38,39 and hindgut (Cdx2 and Hoxa7),39,40 distributed in distinct patterns along the anterior-to-posterior axes of E8.5 embryos, while notochord (Noto and Nog)41 cells were located dorsally to the gut tube in embryos 5 and 7 (Figures S6A–S6C; Table S1; Video S5). Additionally, various neural tube and ectodermal cell types were identified in their expected anatomical locations, including spinal cord progenitors (Hoxb8),42 the optic vesicle (Lhx2, Six3, and Otx2),4345 placodal ectoderm (Six3 and Pitx1), and the otic placode (Figures S6D–S6I; Video S5).

Notably, our spatiotemporal analysis provided additional insight into the annotations of certain cell types in the scRNA atlas, including ExE ectoderm and lateral plate mesoderm (LPM) (Figure S3B). Spatially distinct ExE ectodermal cell types emerge during mouse embryogenesis to form functionally specialized placental lineages. Our spatiotemporal atlas allowed us to identify distal ExE ectoderm cells, expressing Ascl2 and Gata2, located in the ectoplacental cone region at E7.25, which likely include trophoblast stem/progenitor cells46 (Figures 4A and S4A–S4D; Table S1; Video S5). In contrast, proximal ExE ectoderm cells aligned with chorion progenitors, marked by Elf5 and Perp, found adjacent to the epiblast at E7.25 and in the chorion layer at E7.5 (Figures 4A and S4A–S4E; Video S3).

LPM gives rise to an array of critical cell types during murine embryogenesis, including those that form the cardiovascular system, such as the heart and blood vessels.47 We relabeled a subset of Hand1- and Foxf1-expressing LPM24,48 cells “ExE mesoderm and Anterior LPM,” as they localized to extraembryonic tissues at E7.25/E7.5 and were also present in the medial trunk of seqFISH embryos at E8.5, highlighting transcriptional similarities between ExE mesoderm and developing intraembryonic medial LPM (Figures S4A–S4C and S5A–S5C; Table S1). In the scRNA atlas, the relabeled “ExE mesoderm and Anterior LPM” cell population predominated at the earlier stages (E7.0–E8.25) and contained cells originally labeled LPM, mesenchyme, and nascent mesoderm. Moreover, a distinctive subset of LPM cells, marked by Cdx449 and Pitx1, was localized to the posterior of embryos 5–7 and identified in the allantoic bud of embryo 4, prompting relabeling as “ExE mesoderm and Posterior LPM” (Figures S5A–S5C; Video S4). These “ExE mesoderm and Posterior LPM” cells predominated in the scRNA datasets at later stages (E7.75–E9.25) and contained cells originally labeled LPM and allantois.

Altogether, our integrated spatiotemporal atlas provides a high-resolution framework for investigating the spatial and transcriptomic organization of embryonic cell populations, serving as a valuable resource for the research community. To support cell-type assessment, refinement, and characterization, all code and metadata are readily accessible (https://github.com/ltgharland/Spatiotemporal-Atlas-of-Mouse-Gastrulation). Table S1 details marker gene information, while Videos S3–S5 illustrate the spatial localization of cell types across seqFISH embryos, alongside imputed gene expression patterns and high-resolution clustering. We anticipate these resources will encourage continued exploration and refinement, ensuring that the spatiotemporal atlas remains a dynamic and valuable tool for developmental biology research.

Spatiotemporal analysis reveals dynamic gene expression changes along the AP and DV axes

During embryogenesis, gene expression patterns coordinate the establishment of embryonic axes, which orchestrate spatiotemporal signals for organ development and delineate the body plan. To investigate gene expression changes along these axes, we assigned normalized AP and DV coordinates to seqFISH embryos (Figures 5A and S7A–S7G). Next, using our integrated atlas, we imputed AP and DV values into the scRNA reference atlas (Figures 5B and S7H). Cells from E9.25-E9.5 scRNA samples, isolated from the anterior, medial, or posterior regions of sub-dissected embryos, were accurately assigned imputed AP values consistent with their known anatomical origins8 (Figure 5C, see Figure 3A for sub-dissection). This approach provided finer spatial resolution compared to sub-dissection alone, as evidenced by AP gradients within medial and posterior sections (Figure 5C). Moreover, it enabled the estimation of AP and DV axes in earlier-stage embryos that had not been sub-dissected (pooled, Figure 5C). The average AP and DV positions of all cell types in seqFISH embryos are shown in Figure S7I.

Figure 5. Assigning and imputing AP positional identity across spatial and suspension single-cell transcriptomic datasets during mouse gastrulation.

Figure 5

(A) Spatial maps of E7.25, E7.5, and E8.5 embryos, with cells colored by normalized AP values. Gray cells represent ExE regions or low-quality cells filtered during QC. Orientation abbreviations: D, distal; V, ventral; R, right; L, left; A, anterior; P, posterior; PR, proximal.

(B) Joint projection of seqFISH and extended gastrulation atlas (scRNA) cells, colored by normalized AP values and imputed normalized AP values, respectively (red-blue).

(C) Cells from anatomical sub-dissections of the extended gastrulation atlas (scRNA), colored by imputed AP values. Anterior, medial, and posterior sections are from E9.25–9.5 embryos.

(D) Binned imputed expression of Hoxa and Hoxb genes along the AP axis in E7.25, E7.5, and E8.5 seqFISH embryos. Curves represent locally weighted LOESS (locally estimated scatter plot smoothing) regression fits across AP bins; shading indicates 95% confidence intervals around the fitted mean. See also Figure S7.

Incorporating spatial coordinate information into the integrated spatiotemporal atlas provides the developmental biology community with a powerful tool to explore gene expression dynamics along embryonic axes in cell types and tissues of interest at specific time points. For instance, we analyzed the imputed expression patterns of Hox genes (key developmental regulators with well-established AP-biased expression) along seqFISH embryos from different embryonic stages. Predicted Hox expression patterns (Figure 5D) align with previous studies,50 highlighting the utility of our approach.

Next, we investigated gene expression changes in a tissue context along the primitive streak region of embryo 2, focusing on three key populations—primitive streak cells, nascent mesoderm, and the adjacent endoderm layer—spanning positions 0–50 along the AP axis (Figures 6A and 6B). This region was selected because mesodermal cells migrating through different AP regions along the primitive streak are known to acquire distinct fates.5153 Previous studies have explored bulk RNA-expression patterns in embryonic slices along the AP axis.54,55 However, single-cell spatially resolved experiments during this initial process of cell fate diversification have not been performed.

Figure 6. Regionalized gene expression along the AP axis highlights rapid fate allocation in the primitive streak during gastrulation.

Figure 6

(A) Spatial plot highlighting primitive streak (PS), nascent mesoderm (NM), and endoderm (END) cells in embryo 2 (red-blue, AP 0–50), with positions indicated by dotted black lines.

(B) Ridge plots showing density distribution of major cell types along the AP axis in seqFISH embryo 2. ExE, extraembryonic; Visc., visceral; PS, primitive streak; APS, anterior primitive streak; NM, nascent mesoderm; Epi., epiblast.

(C) Heatmap of smoothed, scaled predicted gene expression profiles (using Tradeseq) along the AP axis in primitive streak, nascent mesoderm, and endoderm cells, ordered by AP values and grouped by cell type (END = endoderm, PS = primitive streak, NM = nascent mesoderm). Significant associations identified by Tradeseq74 (p < 0.01, mean log fold-change [logFC] > 0.25). Colored rectangles to the right of the heatmap indicate gene clusters that were identified using hierarchical clustering. Enriched GO terms were identified using Metascape.75

(D) Visualization of imputed normalized log expression counts of selected genes, in the primitive streak region indicated in (c). See also Table S2.

Our analysis of imputed gene expression reveals that numerous genes are dynamically expressed along the AP axis in a germ layer-specific manner within the primitive streak region of embryo 2 (Figure 6C; Table S2). Genes associated with the transport of vitamins and organic anions were enriched in posterior endodermal (Posterior END) cells, while genes related to in utero embryonic development, such as Tead1, Junb/d, and Gjb3, were expressed in the anterior endoderm (Anterior END) (Figure 6C). Primitive streak cells expressed pluripotency genes, including Nanog and Pou5f1, that were downregulated in the nascent mesoderm, which displayed limited expression changes along the AP axis. By contrast, the nascent mesoderm expressed genes in the anterior related to paraxial mesoderm development and Notch signaling, including Tbx6, Mesp2, and Dll1/3, while posterior nascent mesoderm was enriched for genes associated with Wnt signaling and vasculature development including Etv2, Hand1, Foxf1, Isl1, Kdr, Msx1/2, Tbx3, and Wnt2 (Figures 6C and 6D).

These findings revealed the nature of rapidly established transcriptional programs along the AP axis in nascent mesodermal cells, distinguishing between posterior ExE mesoderm and anterior paraxial mesoderm, in line with fate biases observed in previous fate mapping studies within the primitive streak. To validate the accuracy of these predicted gene expression pattern changes from our imputed gene expression counts in seqFISH embryo 2 (Figure 6C), we explored the expression patterns of the same gene set in the tissue sections of E7.25 embryos profiled via GEO-seq (geographical position sequencing).54 This comparison confirms the reliability of our predictions in capturing spatial gene expression changes (Figure S7J). Altogether, these findings highlight the power of our integrated spatiotemporal atlas to explore gene expression dynamics at single-cell resolution across embryonic axes.

Projecting models of mouse gastrulation into in vivo spatial and temporal contexts

Determining which aspects of embryonic development in vitro models faithfully recapitulate is crucial for accurately interpreting experimental outcomes and improving differentiation protocols in stem cell biology. A key strength of our spatio-temporal atlas is its ability to integrate and project external single-cell transcriptional datasets into an in vivo spatiotemporal context. To facilitate this, we have developed a bioinformatics pipeline that enables researchers to explore their datasets within the spatial framework of mouse development. As a proof of concept, we projected scRNA-seq datasets from gastruloids, embryonic stem cell-derived models of murine gastrulation, into our atlas (Figure 7A).

Figure 7. A bioinformatics pipeline to project additional scRNA-seq datasets into a spatial and temporal context during murine embryogenesis.

Figure 7

(A) Schematic of the bioinformatics pipeline used to project additional scRNA-seq datasets from day 4 (96 h)–7 (168 h) gastruloids, into the spatiotemporal atlas of murine gastrulation and early organogenesis.

(B) Spatial maps showing gastruloid cells projected onto the seqFISH embryos in gray, with gastruloid cells colored according to their spatiotemporal cell type annotations.

(C) Gastruloid cells projected onto the spatiotemporal atlas, with each panel colored by different factors. Top row: stage; gastruloid annotations from Rossi et al., 2021; refined cell type annotations from the spatiotemporal atlas (current study). Bottom row: spatial cluster quality control (QC); imputed anterior-posterior (AP); imputed dorsal-ventral (DV) values. Please see STAR Methods section for further information about spatial QC.

(D and E): Ridge plots displaying the density distributions of selected cell types from different germ layers (D) or somitic cell types (E) along the AP axis in the scRNA atlas (red) and gastruloids (blue). Density difference plots are shown below, Δ(g-s), which highlight the regions of increased (red) or decreased (blue) gastruloid density relative to the scRNA atlas in (D). A Kolmogorov-Smirnov (KS) test was performed to assess differences in density distributions. To quantify cell type composition similarities along the AP axes per germ layer, binned Jaccard indices (mean ± standard deviation, determined by bootstrapping) are displayed above the ridge plots in 10 AP bins.

(F) Heatmap of smoothed, scaled gene expression profiles along the imputed AP axis for the somitic cell types shown in (E) for gastruloids (left) and the scRNA-seq atlas (right). Significant associations were identified using Tradeseq association test (p < 0.01, mean log fold-change > 0.6). See also Figure S7 and Table S2.

Gastruloids replicate key aspects of the AP axis development in the absence of extraembryonic tissues.5660 To assess their transcriptional and spatial alignment with in vivo development, we projected scRNA-seq datasets from day 4 (96 h) to 7 (168 h) gastruloids, grown using an enhanced induction protocol that promotes cardiovascular development13 (Figure 7A). In line with previous findings,13,61 our projections highlight the formation of first and second heart field cardiovascular progenitors and endocardial cells as well as hematoendothelial progenitors and YS endothelial cells (Figures 7B and 7C). Beyond cardiovascular progenitors, gastruloids also developed an array of mesodermal, ectodermal, and endodermal cell types (Figures 7B and 7C). These included a substantial amount of paraxial mesoderm-derived tissues, such as anterior and posterior somites, and other key cell types typically formed during mouse embryogenesis between E6.5 and E9.5 (Figures 7B and 7C).

Since the spatiotemporal atlas is constructed from seqFISH cells collected from embryonic tissue sections at E7.25, E7.5, and E8.5, without full coverage of the medio-lateral axis, it represents only a subset of the cell populations captured in the scRNA-seq atlas, which includes whole embryos and yolk sacs spanning E6.5 to E9.5 (Figure 3A). Certain regions of the spatiotemporal atlas are therefore dominated by scRNA-seq cells alone, lacking seqFISH reference points (see red cells in poor joint clusters, Figure S3A). To make users aware of this, our projection pipeline includes a spatial QC (quality control) step that flags query cells aligning with regions where seqFISH data are absent or underrepresented (see STAR Methods, spatial QC in the bioinformatics pipeline, for details). For example, a subset of gastruloid cells, including YS endothelial cells and SHF cardiomyocytes, align with scRNA-seq-dominated regions and therefore fail spatial QC as they lack seqFISH reference points, making their spatial positioning less reliable (Figure 7C, “spatial cluster QC”).

Next, we assigned imputed AP and DV values to the gastruloid cells (Figure 7C). Comparing cell densities of gastruloid versus scRNA-seq cells along imputed AP axes, stratified by germ layer, revealed that this cardiovascular gastruloid model does not fully capture the complete range of AP development in mesodermal and ectodermal germ layers (Figure 7D). More specifically, a subset of the most posterior mesodermal cell types, including posterior LPM and allantois (Figures 7C and S7K), with the most anterior ectodermal cell types, were underrepresented, highlighting a need for further protocol optimization to better induce these tissues. Nonetheless, these cardiovascular gastruloids successfully generated a diverse array of somitic cell types (Figures 7C–7E).

Taking advantage of the true single-cell resolution of our approach, we assessed the predicted AP distribution of somitic cell types and gene expression patterns in gastruloids, induced using the modified cardiovascular protocol, and compared them to in vivo somitogenesis using our scRNA-seq datasets (Figures 7E and 7F). Previous TOMO-seq (tomography sequencing) studies,57 which assess gene expression patterns in tissue slices but not at single-cell resolution, have revealed that the dynamic transcriptional patterns along the AP axis of individual 120-h gastruloids, generated using standard somitogenesis-inducing protocols, appear to align with those observed in E8.5 mouse embryos. Our analysis similarly revealed that cardiovascular gastruloids generate the full spectrum of somitic cell types typically found along the AP axis in developing embryos (Figures 7C–7E). Moreover, a suite of genes exhibited similar expression gradients along imputed AP axes in both gastruloids and embryonic somitic tissues (Figure 7F; Table S2). Many genes showed expression patterns consistent with those previously observed in TOMO-seq-analyzed gastruloids,57 thus (i) validating our approach, (ii) bench-marking the effectiveness of gastruloids in modeling key aspects of early development, and (iii) introducing a broadly relevant strategy for assessing the cellular output of in vitro models at true single-cell spatiotemporal resolution.

Discussion

Here, we present an integrated spatiotemporal atlas of mouse gastrulation and early organogenesis, introducing broadly applicable computational strategies to merge spatial and suspension transcriptomic datasets, leveraging the strengths of both technologies. By extracting per-cell coordinates along the AP and DV axes from spatial transcriptomic data, we accurately predicted the AP and DV positions of cells profiled using dissociation-based single-cell technologies. Additionally, we enriched the spatial transcriptomics dataset by imputing whole-transcriptome gene expression at the single-cell level, creating a comprehensive transcriptional resource for studying mouse embryogenesis within a spatiotemporal framework. To ensure broad accessibility, we developed a user-friendly web portal (http://shiny.maths.usyd.edu.au/SpatiotemporalMouseAtlas/) for providing researchers with an intuitive platform to explore, visualize, and analyze these complex datasets and derive new hypotheses to advance our understanding of fundamental biological processes as well as inform new stem cell differentiation strategies for disease modeling and cellular therapy applications.

Our analysis of gene expression changes in the primitive streak of a late-streak stage embryo underscores the utility of assigning axial coordinates. Previous studies that examined genome-wide expression along the AP axis in mini-pools derived from tissue slices of mouse gastrulas at similar stages identified gene expression domains associated with mesodermal fate patterning.54,55 The single-cell resolution of our dataset reveals that these significant transcriptional changes along the AP axis predominantly occur in migratory nascent mesodermal wings rather than in cells immediately exiting the primitive streak. This precise spatial resolution offers fresh insights into the dynamic shifts in gene expression during gastrulation and highlights the potential of our atlas to explore axial patterning in an unbiased manner, providing a powerful tool for understanding gene regulation during embryogenesis.

A recent study62 employed whole-mount transcriptomics using cycleHCR (hybridization chain reaction) to measure the expression of 254 genes in a mid-streak stage mouse gastrula (E6.5) in a three-dimensional (3D) context. Consistent with our findings for our earliest time point, this single snapshot study identified nine transcriptionally distinct cell clusters, including epiblast, primitive streak, nascent mesoderm, ExE mesoderm, visceral endoderm, and extraembryonic ectoderm, mapped precisely to their spatial locations. In addition, Qiu et al. presented spatial transcriptomic datasets of whole E9.5 and E11.5 mouse embryos, accompanied by a computational framework for analyzing and exploring 3D spatiotemporal data.63 While our study focuses on earlier developmental stages (E7.25, E7.5, and E8.5), integrating these complementary datasets in future computational efforts will extend the temporal window of investigation, offering a more complete spatiotemporal roadmap of early embryogenesis. Moreover, as additional spatial transcriptomic datasets emerge, particularly those covering key stages of mouse gastrulation that only measure hundreds of genes, our computational methods can be applied to align these datasets with our spatiotemporal atlas, enabling transcriptome-wide gene expression imputation and refined cell type characterization.

Mapping cardiovascular gastruloids13 onto the spatiotemporal atlas using our data integration approach provides a qualitative benchmark of in vitro murine models against the normal spatiotemporal context of mouse development. Our analysis reveals that while these gastruloids effectively recapitulate key aspects of embryogenesis, including anterior-posterior patterning of somitic tissues, they lack specific posterior LPM cell types. Additional datasets from alternative murine culture systems or in vivo perturbations, such as drug-treated, mutant, embryoid body,64 or chimera cells, can similarly be mapped and benchmarked using the atlas, although special care needs to be taken to avoid biases due to the mapping process.65 Moreover, our group and others are actively developing computational pipelines to cross-examine the early developmental processes across species.66,67 Given the sparsity of human spatial reference data during gastrulation, these methodologies will be especially valuable in extending benchmarking practices to emerging human pluripotent stem cell-derived models.6870

Looking ahead, advancing methods to integrate comprehensive spatiotemporal transcriptional atlases with time-resolved 3D embryo models, such as those generated through light-sheet imaging,71 will pave the way for creating 4D models that predict transcriptome-wide gene expression changes throughout development. Such future approaches will enable detailed exploration of gene expression dynamics across differentiating cellular lineages, providing essential boundary conditions for performing trajectory inference with suspension datasets and offering deeper insights into cell-cell communication during developmental processes. This work will likely contribute to creating a “virtual tissue” of mouse gastrulation and early organogenesis, offering an unprecedented tool for modeling and understanding embryonic development at a new level of precision.

Limitations of the study

Despite the advances we describe in generating this atlas, some further avenues must be explored. While our atlas contains spatial coordinates and whole-transcriptome level gene expression, a consistent “average” coordinate system is yet to be built over both spatial and temporal axes. Further work could leverage recent methodological advances in mapping cells between spatial and temporal coordinates that harness optimal transport, such as moscot72 or DeST-OT (developmental spatiotemporal optimal transport).73 However, using these methods would only be applicable for spatially resolved cells rather than leveraging the entire set of cells in the spatiotemporal mouse gastrulation atlas. Moreover, it is important to consider that for any abundance-based downstream analysis, cells in the spatial data may be subject to enrichment or depletion biases due to sample selection. Here, we ameliorated this potential issue by comparing cell densities between gastruloids and scRNA-seq-resolved cells rather than with spatial transcriptomic-resolved cells, though it may be that further biases remain across the entire spatio-temporal atlas, in which case further effort is required to determine the effective matched controls for comparison against a reference atlas.65 Finally, while we provide refined AP and DV coordinate predictions for scRNA-seq-resolved cells using spatial integration, these annotations are based on computational inference. Without further direct experimental validation, their spatial accuracy remains putative and should be interpreted with appropriate caution.

It is important to note that our seqFISH datasets do not fully capture the complete medio-lateral extent of each embryo. Consequently, the observed cell types represent only a subset of the total cellular composition at the developmental stages examined. Moreover, while imputed gene expression patterns—particularly in earlier stage embryos such as E7.25—show strong concordance with established in vivo expression domains, discrepancies are more apparent at later stages. These may arise from technical challenges in spatial integration and imputation, embryo-to-embryo variability, or inherent limitations in spatial transcriptomic resolution and reconstruction. These factors should be considered when interpreting individual gene expression patterns within the atlas.

Resource Availability

Lead contact

Further information and requests for resources should be directed to and will be fulfilled by the lead contact, Shila Ghazanfar (shila.ghazanfar@sydney.edu.au).

Materials availability

Reagents generated in this study are available upon request from the lead contact, and other materials in this study are commercially available.

Star★Methods

Key Resources Table

REAGENT or RESOURCE SOURCE IDENTIFIER
Antibodies
Anti-E-cadherin (clone 36) BD Biosciences Ref. 610181; RRID: AB_397580
Anti-N-cadherin (13A9) Cell Signaling Technology Ref. 14215; RRID: AB_2798427
Anti-mouse IgG secondary 
antibody (oligo-conjugated)
Custom See Lohoff et al.11
Anti-pan-cadherin Abcam Ref: ab22744; RRID: AB_447300
Anti-B-catenin (15B8) Abcam Ref: ab6301; RRID: AB_305406
Chemicals, peptides, and recombinant proteins
4% Paraformaldehyde (PFA) Thermo Scientific Ref. 28908
Acryloyl-X SE Thermo Fisher Ref: A20770
Alexa Fluor 647 Thermo Fisher Ref: A20006
Poly-D-Lysine Sigma Ref: P6407
Proteinase K NEB Ref: P8107S
SDS
Salmon Sperm DNA Thermo Fisher Ref: AM9680
Uracil-specific excision reagent NEB Ref: N5505S
DAPI Sigma Ref: D8417
Dextran Sulfate Sigma Ref: D8906
Formamide Sigma Ref: F9027
Bovine Serum Albumin (BSA) Thermo Fisher Ref: AM2616
APS Sigma Ref: A3078
TEMED Sigma Ref: T7024
Tris-HCl Invitrogen Ref: AM9856
EDTA Invitrogen Ref. 15575020
NaCl Sigma Ref: S5150
Trolox Sigma Ref. 238813
D-glucose Sigma Ref: G7528
Catalase Sigma Ref: C3155
Glucose oxidase Sigma Ref: G2133
Ethylene carbonate Sigma Ref: E26258
Dextran sulfate (EC buffer) Sigma Ref: D4911
Acryoloyl-X succinimidyl ester Thermo Fisher Ref: A20770
M2 medium Sigma Aldrich Ref. 7167
PBS Invitrogen Ref: AM9624
Rnase-free sucrose Sigma Aldrich Ref. 84097
Tissue base mold Sakura Ref. 4162
OCT compound Sakura Ref. 4583
Isopropanol VWR Ref. 20842
Critical commercial assays
QIAquick PCR Purification Kit Qiagen Ref. 28104
Deposited data
seqFISH Images This paper https://zenodo.org/records/13977985.
Processed seqFISH data This paper http://shiny.maths.usyd.edu.au/SpatiotemporalMouseAtlas/
Code repository This paper https://github.com/ltgharland/Spatiotemporal-Atlas-of-Mouse-Gastrulation
Experimental models: Organisms/strains
C57BL/6J Mice Charles River
Software and algorithms
Napari Python (https://zenodo.org/records/8115575)
U-Net Ronneberger et al.76 http://lmb.informatik.uni-freiburg.de/people/ronneber/u-net.
StarDist Schmidt et al.77
CellPose Stringer et al.78 www.github.com/mouseland/cellpose
Ilastik V1.4.0 Berg et al. https://www.ilastik.org/
scran V1.26.2
princurve Hastie & Stuetzle79
Seurat V4.2.0 Hao et al.14 https://github.com/satijalab/seurat
StabMAP V0.1.8 Ghazanfar et al.15 https://marionilab.github.io/StabMap/index.html
batchelor V1.12.3 Haghverdi et al.16 https://bioconductor.org/packages/release/bioc/html/batchelor.html
Tradeseq V1.10.0 Van den Berge et al.74 https://www.bioconductor.org/packages/release/bioc/html/tradeSeq.html
ComplexHeatmap V2.15.3 Gu et al.80 http://www.bioconductor.org/packages/devel/bioc/html/ComplexHeatmap.html
Metascape Zhou et al.75 https://metascape.org/gp/index.html
ImageJ
scater V1.24.0 Bioconductor https://www.bioconductor.org/packages/release/bioc/html/scater.html
SingleCellExperiment V 1.20.0 Bioconductor https://www.bioconductor.org/packages/release/bioc/html/SingleCellExperiment.Html
data.table v 1.14.8 CRAN https://cran.r-project.org/web/packages/data.table/index.html
dplyr v 1.1.0 CRAN https://cran.r-project.org/web/packages/dplyr/index.html
ggplot2 V3.4.1 CRAN https://cran.r-project.org/web/packages/ggplot2/index.html
ggridges V0.5.4 CRAN https://cran.r-project.org/web/packages/ggridges/index.html
R N/A https://www.r-project.org

Experimental Model and Study Participant Details

Mice and tissue preparation

Experiments were performed in accordance with EU guidelines for the care and use of laboratory animals, and under the authority of appropriate UK governmental legislation. Eight-to 12-week-old wildtype C57BL/6J (Charles River) were used and all mice used in this project were housed under a 12-h light/12-h dark cycle, with constant access to food and water. No sex selection of the used embryos was performed. Tissue sections were prepared and sectioned ready for seqFISH imaging as previously described,11 further details are provided in the method details section below.

Method Details

Library and probe design

We used the same library selection as previously described,11 using a total of 351 seqFISH barcoded genes (Table S1) and 36 additional non-barcoded sequential smFISH imaging. We used the same primary probe design, readout probe design, primary probe library construction, readout probe synthesis, encoding strategy, and coverslip functionalization as described11 descibred briefly in the next sections.

Library and probe design – Gene panel selection

To construct a spatial transcriptomic gene panel tailored to mouse gastrulation, we selected marker genes that discriminated between cell types in a single-cell RNA-seq atlas (Pijuan-Sala et al., 2019). Differentially expressed genes were identified using the find-Markers() function from the scran R package with pval.type = "any" and a minimum log fold change threshold of 0.5. This analysis was repeated at each stage from E6.5 to E8.5 in 0.25-day intervals, excluding cell types with fewer than 10 cells per stage. Genes were filtered out if the upper quartile of normalized expression for any cell type exceeded a threshold of 20 counts, to prevent optical crowding during imaging. The top five discriminative genes per cell type and stage were selected based on the Top column from findMarkers(), which reflects the minimal gene set needed to distinguish the cell type from all others. Genes were aggregated across all stages to form a comprehensive panel. To further refine the panel, we ensured that the total estimated number of transcripts per cell type remained below 300, calculated by exponentiating log-normalized expression values. This constraint was informed by prior benchmarks suggesting 200–300 transcripts per cell as an upper limit for reliable detection without optical saturation. Genes of particular interest (e.g., transcription factors) were added manually, followed by iterative re-filtering to manage total transcript load. The final panel comprised 387 genes: 351 for barcoded detection by seqFISH and 36 for non-barcoded detection by sequential smFISH (Table S1).

Library and probe design – Primary probe design

We designed 30-nt gene-specific primary probes targeting the coding sequences of the 387 selected genes using the mm10 mouse genome and UCSC annotations. Probes were required to have 45–65% GC content and avoid homopolymers of ≥5 identical bases. For each gene, 28–48 probes were designed, with genes yielding fewer than 28 usable probes excluded from the panel. All probe candidates were aligned against the mouse transcriptome using BLAST to remove off-target sequences with >15-nt matches to un-intended transcripts. To prevent cross-hybridization within the probe set, a local BLAST database of all probe sequences was built. Probes predicted to have more than seven off-target hits against other library probes were iteratively removed until all probes passed specificity criteria.

Library and probe design – Readout probe design

Readout probes of 15 nucleotide length were designed as reported in Shah et al.81 In brief, readout sequences were randomly generated to have a GC content of 40–60% and were screened by BLAST to ensure no homology to the mouse transcriptome. To reduce cross-hybridization between readout probes, all pairs of sequences with ≥10 consecutive identical bases were eliminated. The reverse complements of these validated sequences were included in the primary probes.

Library and probe design – Primary probe library construction

The final library of 15,989 primary probes (targeting 387 genes) was synthesized as an oligo pool by Twist Bioscience. Each probe consisted of a 30-nt mRNA-targeting region, four gene-specific readout sequences (15-nt each) separated by 2-nt spacers, and universal primer sequences for amplification. Two formats were used.

  • (1)

    seqFISH probes (barcoded): 5′–(primer 1)–(readout 1)–(readout 2)–(probe)–(readout 3)–(readout 4)–(primer 2)–3′

  • (2)

    smFISH probes (non-barcoded): identical readout sequence used four times: 5′–(primer 1)–(readout 1)–(readout 1)–(probe)–(readout 1)–(readout 1)–(primer 2)–3′

The probe pool was amplified by limited-cycle PCR to produce a template for in vitro transcription (NEB, E2040S), followed by reverse transcription using uracil-containing primers (Thermo Fisher, EP7051). Forward primers were cleaved enzymatically using uracil-specific excision reagent (NEB, N5505S), and single-stranded DNA was alkaline hydrolyzed with 1 M NaOH at 65°C for 15 min. After neutralization with 1 M acetic acid, probes were purified via ethanol precipitation and phenol–chloroform extraction. Probes were resuspended in hybridization buffer (40% formamide, 2× SSC, 10% dextran sulfate) at 40 nM per probe and stored at −20°C.

Library and probe design – Readout probe synthesis

Fifteen-nucleotide readout probes were synthesized by Integrated DNA Technologies (IDT) and conjugated to fluorophores (Alexa Fluor 488, Cy3B, Alexa Fluor 647). Probes were stored at −20°C in nuclease-free water.

Library and probe design – Encoding strategy

We implemented a 12-pseudocolor encoding scheme, distributing pseudocolors evenly across three fluorescent channels (Alexa Fluor 488, Cy3B, Alexa Fluor 647). The 12 pseudocolors were cycled over four imaging rounds to generate 20,736 unique barcodes. An additional round of pseudocolor imaging was performed to obtain error-correctable barcodes, yielding robust gene identification across the 351 barcoded targets.

Library and probe design – Coverslip functionalization

Glass coverslips (Thermo Scientific, 3421) were washed in nuclease-free water and immersed in 100% ethanol, air-dried, and plasma-cleaned for 5 min (Harrick Plasma, PDC-001, high setting). Coverslips were incubated in 1% bind-silane (GE, 17-1330-13) in 10% acidic ethanol (pH 3.5) for 1 h at room temperature, rinsed three times in ethanol, and heat dried at >90°C for 30 min. They were then incubated with 100 μg/mL poly-D-lysine (Sigma, P6407) in water for at least 1 h, rinsed in nuclease-free water, and air-dried. Coverslips were stored at 4°C and used within one week.

seqFISH microscopy

Three tissue sections from two experimental blocks, containing four embryos (2 from E7.25 and 2 from E7.5), were imaged as previously described,10,81 which is described in brief in the following sections. For each FOV, snapshots were acquired with 4 μm z steps for six z slices. Serial hybridization and imaging were repeated for 29 rounds.

seqFISH microscopy – Tissue preparation and hybridization

Embryos were dissected from uteri, washed in M2 medium (Sigma Aldrich, M7167), and fixed in 4% paraformaldehyde (PFA; Thermo Scientific, 28908) in 1× PBS (Invitrogen, AM9624) for 30 min at room temperature. Fixed embryos were washed and incubated in 30% RNase-free sucrose (Sigma Aldrich, 84097) in 1× PBS at 4°C until they sank. Each embryo was then embedded sagittally in optimal cutting temperature (OCT) compound (Sakura, 4583) using tissue base molds (Sakura, 4162), snap-frozen in dry ice–isopropanol (VWR, 20842), and stored at −80°C. Tissue sections (20 μm) were cut using a cryostat, collected on functionalized coverslips (see STAR Methods: Library and probe design – Coverslip functionalization), and stored at −80°C.

Tissue sections were post-fixed in 4% PFA in 1× PBS for 15 min at room temperature, permeabilized with 70% ethanol for 1 h, and cleared with 8% SDS in 1× PBS for 20 min. Sections were washed with 70% ethanol, air-dried, and blocked for at least 2 h at room temperature in blocking buffer (1× PBS, 0.25% Triton X-100, 10 mg/mL BSA [Thermo Fisher, AM2616], and 0.5 mg/mL salmon sperm DNA [Thermo Fisher, AM9680]) in a humid chamber. Tissue sections were incubated for 2 h at room temperature with primary antibodies diluted 1:200 in blocking buffer: anti-pan-cadherin (Abcam, ab22744), anti-N-cadherin (Cell Signaling, 14215), anti-β-catenin (Abcam, ab6301), and anti-E-cadherin (BD Biosciences, 610181). After three washes with PBS containing 0.1% Triton X-100 (PBS-T), sections were incubated with a CCTTACACCAACCCT-conjugated secondary antibody (anti-mouse IgG) diluted 1:500 in blocking buffer for ≥2 h at room temperature. Samples were again washed three times in PBS-T and post-fixed in 4% PFA in 1× PBS for 15 min, followed by three 10-min washes in 2× SSC (Thermo Fisher, 15557036).

Hybridization was performed in primary probe hybridization buffer (40% formamide [Sigma, F9027], 2× SSC, 10% dextran sulfate [Sigma, D8906]) containing ~2.5 nM of each primary probe, 1 nM Eef2 probe set A and B, and 1 μM locked nucleic acid oligo-d(T)30 (Qiagen). Samples were incubated in a humidified chamber at 37°C for 24–36 h. They were washed with 40% formamide wash buffer (40% formamide, 0.1% Triton X-100 in 2× SSC) for 30 min at 37°C and rinsed three times with 2× SSC. Sections were then hybridized with 200 nM tertiary probe (/5Acryd/AG GGT TGG TGT AAG GTT TAC CTG GCG TTG CGA CGA CTA A) in EC buffer (10% ethylene carbonate [Sigma, E26258], 10% dextran sulfate [Sigma, D4911], and 4× SSC) for ≥2 h. Samples were washed for 5 min in 10% formamide wash buffer (10% formamide, 0.1% Triton X-100 in 2× SSC), followed by two 5-min washes in 2× SSC. For gel embedding, samples were treated with 0.1 mg/mL Acryloyl-X SE (Thermo Fisher, A20770) in PBS for 30 min, rinsed three times in 2× SSC, and post-fixed in 4% PFA in PBS for 15 min, followed by three washes in 2x SSC. Sections were incubated for 30 min in 4% acryl-amide/bis-acrylamide (1:19 ratio) in 2× SSC, followed by gelation with degassed 4% hyrdogel solution containing 0.05% ammonium persulfate (APS; Sigma, A3078) and 0.05% TEMED (Sigma, T7024). The sample was sandwiched by a GelSlick-coated slide (Lonza, 50640) in a nitrogen chamber for 30 min at room temperature and then at 37°C for ≥3 h. After polymerization, the hydrogel was separated from the coverslip and rinsed with 2× SSC. Embedded tissue was cleared at 37°C for 3 h in digestion buffer (1:100 Proteinase K [NEB, P8107S], 50 mM Tris-HCl pH 8.0 [Invitrogen, AM9856], 1 mM EDTA [Invitrogen, 15575020], 0.5% Triton X-100, 1% SDS, and 500 mM NaCl [Sigma, S5150]). Sections were rinsed with 2× SSC and treated with 0.1 mg/mL label-X for 45 min at 37°C for chemical labeling. Samples were then re-embedded in a second 4% hydrogel (2.5-h gelation), trimmed, and mounted into custom flow cells for imaging.

seqFISH microscopy – Imaging

Three tissue sections from two experimental blocks, containing four embryos, were imaged. Flow cells were connected to an automated fluidics system. Prior to hybridization rounds, samples were stained with 10 μg/mL DAPI (Sigma, D8417) in 4× SSC to identify fields of view (FOVs). All imaging was performed in antibleaching buffer (50 mM Tris-HCl pH 8.0, 300 mM NaCl, 2× SSC, 3 mM Trolox [Sigma, 238813], 0.8% D-glucose [Sigma, G7528], 1:100 catalase [Sigma, C3155], and 0.5 mg/mL glucose oxidase [Sigma, G2133]). Sixteen hybridization rounds were imaged for the decoding of the barcoded mRNA seqFISH probes followed by a repeat of the first hybridization. Then, 12 serial hybridization rounds were imaged for 36 non-barcoded sequential smFISH probes, followed by 1 hybridization to visualize the cell segmentation staining using Cy3B conjugated to/5AmMC6/TTAGTCGTCGCAACG. Hybridization rounds were automated using a deep-well 96-well plate system. Each hybridization buffer contained three 15-nt readout probes (Alexa Fluor 647, Cy3B, Alexa Fluor 488; 50 nM each) in EC buffer. The cell segmentation round used a single Cy3B-labeled readout (/5AmMC6/TTAGTCGTCGCAACG). Each hybridization step lasted 25 min at room temperature in the dark, followed by washing with 10% formamide wash buffer, rinsing in 4× SSC, and DAPI staining (1.5 min, 10 μg/mL in 4× SSC). The flow cell was then filled with antibleaching buffer, and all selected FOVs were imaged. Imaging was performed using a Leica DMi8 microscope equipped with a Yokogawa CSU-W1 spinning disk, an Andor Zyla 4.2 Plus sCMOS camera, a Leica 63 ×1.40-NA oil objective, a motorized stage (ASI MS2000), and Semrock filters. For each FOV, images were acquired using 4 μm z-steps across six z-planes. After each round, readout probes were stripped using 55% formamide wash buffer (55% formamide, 0.1% Triton X-100 in 2× SSC) for 4 min, followed by a 4× SSC rinse. Serial hybridization and imaging were repeated for 29 rounds in total. RNA integrity was verified using colocalization of interleaved Eef2 probe sets visualized with distinct fluorophores. Integration of the automated fluidics and image acquisition system was managed using custom μManager scripts.

Imaging processing and registration

Raw image data were processed as described previously.11 Briefly, effects of chromatic aberration, tissue background and auto-fluorescence were removed, non-uniform background was corrected and background signal subtracted following the same approach as described previously.11 For each round of hybridization, Images were registered to the DAPI images of the first hybridization for each FOV using the same two-dimensional phase correlation algorithm as previously described.11

Image processing and registration – Chromatic correction, background subtraction, and alignment

Raw imaging data were processed to correct for chromatic aberration, remove background signal, and register images across hybridization rounds. To correct for chromatic aberration between fluorescence channels, 0.1 μm TetraSpeck microspheres (Thermo Scientific, T7279) were imaged, and geometric transforms were computed to align all fluorescence channels. Tissue background and autofluorescence were removed by dividing background with the fluorescence images. A flat field correction was applied by dividing the normalized background illumination with each of the fluorescence images while preserving the intensity profile of the fluorescent points. Background signal was then subtracted using ImageJ rolling ball background subtraction algorithm using a radius of 3 pixels and despeckle algorithm to remove any hot pixels. For each round of hybridization, the 405 nm channel capturing DAPI signal was used for registration. Within each field of view (FOV), all DAPI images from subsequent hybridization rounds were registered to the DAPI image from the first round using a two-dimensional phase correlation algorithm.

Transcript detection and smFISH processing

We called individual transcript barcodes using the dot matching algorithm with the same parameters as described in Lohoff et al., with more details in Shah et al.81 In brief, detected fluorescent spots across all hybridizations and channels were matched using a 2.45-pixel search radius to identify symmetric nearest neighbors. Spot sets corresponding to a unique barcode were directly assigned to the on-target barcode list. For ambiguous matches, combinations were filtered by minimizing residual spatial distance, and assigned if a single on-target barcode with a Hamming distance of 1 remained. If ambiguity persisted, the point was excluded. This matching process was repeated using each hybridization round as the seed, and only barcodes consistently detected in at least three out of four rounds were retained as valid gene calls. For the 36 genes that we probed using smFISH, we found that assignment of an optimal light intensity threshold to separate background noise from hybridized mRNA was particularly challenging due to each gene’s expression only being measured over a single round of hybridization. To address this problem, we extracted a continuous measure of gene expression per cell, taken to be the 95th percentile of the pixel intensities across all pixels within each cell’s segmented area, where a high value corresponds to some evidence of detection of expression of each gene in each cell. We did not use the smFISH-captured genes for further high-dimensional analysis.

Cell segmentation

Since we visually inspected images associated with the cell membrane channels in the first hybridization and found there was very low signal intensity associated with the cell membranes, we opted for an alternative cell segmentation strategy to what was performed previously.11 We describe the steps of this segmentation strategy below.

Human-in-the-loop cell segmentation

In this situation, we used a human-in-the-loop cell segmentation strategy, where we performed manual annotation with napari (https://zenodo.org/records/8115575) of single points for each DAPI-stained nucleus for each E7.25 and E7.5 image across all z slices. Then, for each z slice and FOV, we performed a seeded watershed algorithm to extend the manually annotated points into solid segments covering the nucleus of each cell. The watershed used nuclear boundary predictions from a U-Net76 as heightmap and foreground predictions from the same U-Net as mask. This network was initially trained on the nucleus segmentation dataset82 and then retrained several times on the segmentation results obtained via the watershed from manual seed points. Note that we tried using StarDist77 and CellPose78 for the nuclear segmentation, but found the segmentation quality of their pretrained models insufficient. To further extend nuclei to full cell segments, we trained a feature based classifier to segment the full cell and cell boundaries using ilastik83 pixel classification on a few cells using the DAPI channel and the total set of detected transcripts as input channels. We then used a watershed with nuclear segmentations as seeds and ilastik prediction as heightmap and mask to obtain the cellular segmentation.

3D segments for early embryos

Our data comprises optical sections captured 4 μm apart within 20 μm thick physical sections. These optical sections were merged into 3D reconstructions without requiring additional registration across separate physical sections (see Videos S1 and S2 which display the optical sections). Since the z slices were 4um apart, a distance that could conceivably cover a single cell over multiple z slices, we performed a matching algorithm to identify matching 2D segments assigned to a single 3D segment. We applied this matching algorithm to all seqFISH data, including the previously published E8.5 data. The matching algorithm worked by querying consecutive z slices from z slice 2 and so on, identifying for each segment the next segment with the largest overlapping area, and calling the pair of segments a match if the Jaccard index between the pixels was 0.5 or above. Finally, for each matched segment, we calculated the new 3D cell centroid as the centroid of the pixel locations across the x, y, and z planes.

Data processing - Filtering and quality control

Following cell segmentation, we assigned each transcript belonging to the barcoded seqFISH library to the corresponding cell in which the segment belonged. We then filtered cells to retain only those that i) had at least three detected genes, and ii) had between 5 and 1,000 total counts. To further check for cells that may be low quality, we performed unsupervised clustering of the log-transformed gene counts, and identified clusters with particularly low total counts to be removed from further analysis.

Data processing - Normalisation

To normalise the gene expression data, we first needed to account for the systematic effect where we observed fewer counts with higher z-slice values. To do this, we first extracted the size factors for embryo 1 and z-slice 2 using the calculation embedded within the ‘logNormCounts()’ function in the scran package, ignoring the Xist gene in the calculation of size factors. We then normalised all of the seqFISH-resolved cells using the ‘logNormCounts()’ function in scran, where size factors were quantile normalised to match the distribution of those of embryo 1 z-slice 2. In doing so, we extracted normalised logcounts for the cells, which we used for subsequent analyses.

Data processing - Defining AP- and DV-axes

To ascertain anterior-posterior (AP) axis positioning of the seqFISH-resolved embryos, using anatomical landmarks, we performed manual annotation of the AP axis, as well as manual annotation of the extraembryonic region and the embryo proper for the E7.25-E7.5 samples. Then we used the ‘princurve’ package79 to extract the relative positions along the AP axis as well as the perpendicular distances to the AP axis curve to determine the positions along the dorsal-ventral (DV) axis. To enable consistent comparison between each of the seqFISH-resolved embryos, we used z-scaling followed by reordering to start on the anterior part of the AP axis. These scaled AP values were then used for further downstream analysis.

scRNA-seq analyses - Quality control filtering after integration and clustering

ChatGPT (GPT-4, OpenAI; accessed via ChatGPT Plus in 2024–2025) was used to assist in writing and optimizing R scripts for scRNA-seq analyses. All code generated using this tool was reviewed, edited, and validated by the authors. The full analysis code is available in the accompanying repository. Initially, we applied batch correction using the rPCA method in Seurat v414 to log-normalized counts matrices from E7.25, E7.5 and E8.5 seqFISH embryos. rPCA integration was performed across different embryos (e.g., embryo 1 and embryo 2) within each embryonic stage (e.g., E7.25). To perform rPCA integration, principal components (PCs) were generated by running ScaleData and RunPCA using all seqFISH genes set as features. Integration was then carried out using FindIntegrationAnchors (with k.anchor = 5, scale = FALSE, and reduction = ‘rpca’), followed by IntegrateData (with features.to. integrate = all features (genes) and k.weight = 100). Finally, clustering was performed after applying ScaleData and RunPCA on the batch-corrected matrices. We used FindNeighbors(dims = 1:30) and FindClusters(resolution = 1) to identify clusters. As a quality control measure, we excluded clusters with abnormal RNA/feature counts, as well as a cluster from the most distal region of the ecto-placental cone in E7.25 embryos (see Figures S1B and S1D red clusters) prior to further analyses described below.

scRNA-seq analyses - integration, clustering and UMAP for E7.25/E7.5 seqFISH Embryos

We applied multiple rounds of batch correction using the rPCA method in Seurat v414 to log-normalized counts from E7.25 and E7.5 seqFISH embryos. Cells that passed the previously described quality control criteria were included in this analysis. In round 1, rPCA integration was performed across different embryos (e.g., embryo 1 and embryo 2) within each embryonic stage (e.g., E7.25). First, principal components (PCs) were generated by running ScaleData and RunPCA on all seqFISH probes. Integration was then carried out using FindIntegrationAnchors (with k.anchor = 5, scale = FALSE, and reduction = ‘rpca’), followed by IntegrateData (with features. to.integrate = all features (genes) and k.weight = 100). In round 2, batch-corrected count matrices from round 1 were re-integrated using rPCA, with k.weight = 200 and k.anchor = 5. Finally, clustering was performed on the batch-corrected matrices after applying ScaleData and RunPCA. We used FindNeighbors(dims = 1:30) and FindClusters(resolution = 1.2) to identify clusters, followed by dimensionality reduction using UMAP, generated via RunUMAP(dims = 1:30). A second round of clustering was performed on a cluster that was initially annotated ‘Anterior Primitive Streak’ using FindNeighbors(dims = 1:30) and FindClusters(resolution = 1).

This unsupervised sub-clustering revealed three transcriptionally and spatially distinct subpopulations within the original APS cluster: axial mesendoderm, definitive endoderm precursors, and a mixed cluster comprising primitive streak and axial mesendoderm cells.

scRNA-seq analyses - integration, clustering and UMAP for E8.5 seqFISH Embryos

For the E8.5 seqFISH embryos, a similar procedure was followed. After quality control, batch-corrected count matrices from within-stage integration were processed with ScaleData and RunPCA. Clustering was then performed using FindNeighbors(dims = 1:30) and FindClusters(resolution = 1), and clusters were visualized via UMAP, generated using RunUMAP(dims = 1:30).

scRNA-seq analyses - Marker gene identification

To identify marker genes for the clusters from the E7.25/E7.5 and E8.5 seqFISH embryos, we applied FindAllMarkers(slot = "data"), filtering for marker genes with an average log2 fold change (avg_log2FC) greater than log2(1.5).

scRNA-seq analyses - principal components from the extended gastrulation scRNA atlas

We used log-normalized counts from a downsampled version (10,000 cells per embryonic stage from E6.5 to E9.5) of the mouse gastrulation atlas from Imaz-Rosshandler et al., 2024, to perform three rounds of rPCA integration using Seurat v4.14 In round 1, rPCA integration was carried out across sequencing batches within each embryonic stage (e.g., E6.5) using 2,000 highly variable features identified with FindVariableFeatures. We applied FindIntegrationAnchors (k.anchor = 5, scale = FALSE), followed by IntegrateData (features.to.integrate = all features (genes), k.weight = 100). In round 2, batch-corrected count matrices from round 1 were integrated across all embryonic stages within each atlas version (original vs. extended). In round 3, integration was performed across both the original and extended atlas versions. For rounds 2 and 3, rPCA integration was executed with k.weight = 200 and k. anchor = 5, using PCs that were identified using a combined list of VariableFeatures (5716 genes) identified from each embryonic stage during round 1 of rPCA integration.

scRNA-seq analyses - principal components from the seqFISH datasets

We applied multiple rounds of batch correction using the rPCA method in Seurat v414 to log-normalized counts from E7.25, E7.5 and E8.5 seqFISH embryos. Cells that passed the previously described quality control criteria were included in this analysis. In round 1, rPCA integration was performed across different embryos (e.g., embryo 1 and embryo 2) within each embryonic stage (e.g., E7.25). First, principal components (PCs) were generated by running ScaleData and RunPCA on all seqFISH probes. Integration was then carried out using FindIntegrationAnchors (with k.anchor = 5, scale = FALSE, and reduction = ‘rpca’), followed by IntegrateData (with features.to.integrate = all features (genes) and k.weight = 100). In round 2, batch-corrected count matrices from round 1 were re-integrated using rPCA, with k.weight = 200 and k.anchor = 5. Finally, principal components were calculated on the batch-corrected matrices after applying ScaleData and RunPCA.

scRNA-seq analyses - StabMAP and reducedMNN to align seqFISH and scRNA datasets

To align all seqFISH cells that passed quality control with the downsampled scRNA cells from the extended gastrulation atlas, we applied StabMAP15 followed by reducedMNN.16 For StabMAP, we used the top 30 principal components calculated for both the scRNA and seqFISH datasets, as described above, with projectALL = TRUE. The resulting StabMAP embedding was then reweighted to ensure equal contribution from both datasets using reWeightEmbedding(). To correct for any remaining technical differences between the seqFISH and scRNA datasets, we applied reducedMNN_batchFactor() with k = 10, setting the batch factor to either seqFISH or scRNA as appropriate.

scRNA-seq analyses - UMAP and joint clustering for spatiotemporal atlas

A low-dimensional UMAP was generated to visualize both scRNA and seqFISH cells within the same space, using runUMAP applied to the batch-corrected StabMAP embedding. Joint clustering and subsequent sub-clustering were performed on this embedding with FindNeighbors (dims = 1:60) and FindClusters (resolution = 1 for clustering; resolution = 3 for sub-clustering). As a quality control step, clusters containing more than 98% or fewer than 2% of either seqFISH or scRNA cells were labeled as ‘poor joint clusters.’

scRNA-seq analyses - Label transfer across scRNA and seqFISH datasets

We applied K-nearest neighbors (K = 5) to classify the seqFISH cells based on cell type annotations from the extended mouse atlas, incorporating both extended and original atlas labels, as well as embryonic stage and subdissection labels. Additionally, we used the same K-nearest neighbors approach (K = 5) to classify the scRNA cells according to the cell types assigned to the seqFISH cells.

scRNA-seq analyses - Gene expression imputation for seqFISH

We leveraged the scRNA atlas data, which provides whole transcriptome counts, to impute full gene expression profiles for the seqFISH-resolved cells. For each seqFISH-resolved cell, we identified the K-nearest neighbors (K = 5) and calculated the mean expression vector across all genes from the extended mouse atlas dataset.

scRNA-seq analyses - Gene expression and AP/DV coordinate imputation for seqFISH and scRNA

We utilized the seqFISH atlas data, which includes AP and DV axial coordinates, to impute these coordinates in the scRNA cells. To achieve this, we identified the K-nearest neighbors (K = 5) for each scRNA cell and calculated the mean expression vector across the AP or DV coordinates from the seqFISH dataset.

scRNA-seq analyses - Refining cell type annotations using seqFISH data

After obtaining cell type annotations for the seqFISH cells, we visualized their spatial distribution to further distinguish subtypes based on both spatial positioning and gene expression. This analysis was focused on the endothelium, lateral plate mesoderm, and ExE ectoderm. We then back-mapped the scRNA-seq extended mouse atlas to correlate the newly refined cell types with the broader atlas, allowing us to manually annotate subclusters with more specific cell type labels (e.g., proximal and distal ExE ectoderm, anterior and posterior LPM, embryo proper endothelium 1 and 2, and endocardium 1 and 2) based on additional spatial localization information.

Manual curation of cell type annotations

Cell type annotations within the spatiotemporal atlas were initially assigned using a semi-automated pipeline that incorporated high-resolution clustering and label transfer from the extended gastrulation atlases. While many annotations were reliably transferred via exploration of physical localization and marker gene expression patterns, additional manual curation was performed to refine specific populations, particularly embryo proper endothelium, lateral plate mesoderm (LPM), and extraembryonic ectoderm (ExE ectoderm). Some annotations may still require further refinement, and we provide the full dataset, clustering data, and transferred labels as an open-access resource to enable researchers to make additional improvements based on their expertise.

To enhance transparency and support further evaluation, Table S1 provides detailed marker genes for each cell type, while Videos S3–S5 illustrate the spatial distribution of these annotated cell types across the seven seqFISH embryos, along with gene expression patterns for the related cell types. Moreover, to support ongoing refinement by the community, we encourage users to provide feed-back or suggestions for improved annotations to the lead contact: Shila Ghazanfar (shila.ghazanfar@sydney.edu.au).

Manual curation - Updating ExE ectoderm and LPM annotations

To improve the resolution of populations initially labeled as ExE ectoderm and lateral plate mesoderm (LPM) following the first round of majority voting, we refined annotations by back-referencing and assigning majority-vote seqFISH cell type labels. These labels provided crucial spatial localization information, allowing us to better resolve these populations. Subsequently, the labels were consolidated into ‘ExE ectoderm proximal’, ‘ExE ectoderm distal’, ‘ExE mesoderm and anterior LPM’, or ‘ExE mesoderm and posterior LPM’, in alignment with their distinct spatial positions as indicated by seqFISH labels.

Manual curation - Reclassification of embryo proper endothelium

Annotations for embryo proper endothelium were refined by integrating spatial information from seqFISH embryos and clustering data. Cells initially labeled embryo proper endothelium through majority voting were reassigned to more specific subtypes, including endocardium #1/#2, Allantois endothelium, and Embryo proper endothelium #1/#2.

Manual curation - Refining poorly labeled populations using spatial information

To improve annotation accuracy, we refined a subset of primitive streak cells based on their sub-cluster identities and spatial positioning, reassigning them as rostral ectoderm. Similarly, a sub-cluster of epiblast was relabeled ExE endoderm, and another sub-cluster initially labeled haematoendothelial progenitors were reclassified as ExE mesoderm, better reflecting anatomical localization. Cells previously assigned as parietal endoderm were reassigned to visceral endoderm, while a group of anterior primitive streak cells were relabeled notochord.

Manual curation - Resolving ambiguous labels

As the initial annotation process relied on majority voting, two populations required further refinement to resolve ambiguous labels. Cells initially labeled ‘Dorsal spinal cord progenitors, Neural tube’ were relabeled as ‘Dorsal spinal cord progenitors’, while cells assigned ‘Non-neural ectoderm, Surface ectoderm’ were renamed ‘Surface ectoderm’ to reflect their spatial identity more accurately.

Overall, these refinements aimed to ensure that cell type annotations align more accurately with the spatial and transcriptomic landscape captured in our dataset. By combining automated label transfer with expert-driven curation, we improve annotation specificity while maintaining flexibility for further refinement by the research community. All updated annotations, code, and metadata are publicly available to ensure transparency and reproducibility.

Marker gene identification and expression visualization

To identify marker genes for the refined cell types, we used the FindAllMarkers function on seqFISH cells with the imputed gene expression data (slot = "imputed gene expression") and a curated list of established marker genes from Imaz Rosshandler et al. (2024). This analysis was performed on subsets of cell types (Figures S2–S4), with markers filtered for those exhibiting an average log2 fold change (avg_log2FC) greater than log2(1.5). The original gene expression patterns for these marker genes in both seqFISH and scRNA cells are visualized in dot plots (Figures S2–S4).

Differential gene expression analyses along AP axes in the primitive streak region and somitic tissues in the gastruloids and scRNA cells

To identify genes with altered expression over the AP axes in the primitive streak region of embryo 2, or the gastruloids somitic tissues, the AP coordinates were provided as pseusotime values to Tradeseq74 (1.10.0; Van den Berge et al., 2020). Generalised additive models were fit (fitGAM, nknots = 6, cellWeights = 1) to genes that were highly variable [top 2000 highly variable genes were identified using VariableFeatures function in Seurat (4.2.0)] among the cells that were part of each cell type. The associationTest function was then used to identify which of these genes had altered expression over the AP axes. ComplexHeatmap80 (2.15.3; Gu et al., 2016) was used to visualise the expression patterns from the GAMs with the highest 300 waldStat scores (p-value < 0.01 and mean-LogFC>2). Metascape75 (Zhou et al., 2019) was used to identify Gene Ontology (GO) terms that were enriched for clusters of genes.

Bioinformatics pipeline to project in vitro models into a spatiotemporal framework

To project additional scRNA-seq datasets into the spatiotemporal atlas, we used StabMAP15 and reducedMNN.16 Principal components were calculated for the gastruloid cells after performing rPCA integration using Seurat v414 as described above in the ‘Calculating Principal Components’ sections. Briefly, multiple rounds of rPCA integration were performed, first across samples within each gastruloid timepoint and then across timepoints. Next, to align all gastruloid cells with the spatiotemporal atlas cells, we applied StabMAP followed by reducedMNN. For StabMAP, we used the top 30 principal components calculated for the scRNA, seqFISH and query gastruloid datasets, as described above, with projectALL = TRUE. The resulting StabMAP embedding was then reweighted to ensure equal contribution from all three datasets using reWeightEmbedding(). To correct for any remaining technical differences between the seqFISH, scRNA and gastruloid datasets, we applied reducedMNN_batchFactor() with k = 10, setting the batch factor to either seqFISH, scRNA or gastruloid as appropriate. Cell type labels, and imputed AP and DV values for the gastruloid cells were assigned as described in the ‘Label Transfer across scRNA and seqFISH Datasets’ and ‘Gene Expression and AP/DV Co-ordinate Imputation for seqFISH and scRNA Cells’ sections above.

Spatial QC in the bioinformatics pipeline

To construct the spatiotemporal atlas, we integrated seqFISH and scRNA-seq datasets into a shared low-dimensional embedding and performed high-resolution clustering (see UMAP Generation and Joint Clustering for spatiotemporal Atlas sub-section in scRNA-seq Analyses section). Poor joint clusters were identified as clusters where >98% of cells originated from only one dataset (see Figure S3A). scRNA-seq cells in these clusters lacked sufficient neighboring seqFISH cells, making spatial imputation (e.g., AP or DV values) unreliable. This discrepancy arises because the scRNA dataset includes tissues absent from seqFISH slices (e.g., mature yolk sac endothelium and mesothelium). When projecting additional query datasets, such as gastruloids, onto the spatiotemporal atlas, poor joint cluster label information is transferred to the query cells. Query cells receiving the poor joint cluster label = TRUE are identified as those cells failing spatial QC (Figure 4C), as their spatial information is unreliable due to alignment with scRNA-seq cells that lack spatial reference points in seqFISH. For instance, in Figure 4, some gastruloid cardiomyocytes and yolk sac endothelium fail spatial QC because their scRNA-seq counterparts lack seqFISH equivalents.

Quantification and Statistical Analysis

Statistical analysis of cell type distribution along the AP axis

To quantitatively assess differences in cell type distribution between query cell and scRNA cells along the anteroposterior (AP) axis, we implemented two complementary statistical tests into the bioinformatics pipeline.

  • (1)

    Kolmogorov-Smirnov (KS) Test: To determine whether the distribution of cells along the AP axis differs significantly between query and scRNA-seq cells, we applied a KS test. This test provides a statistical measure of divergence in AP positioning between the two datasets.

  • (2)

    Jaccard Index Analysis: To further assess similarity in cell type composition, we calculated Jaccard indices across discrete bins spanning the AP axis. This analysis identifies regions long the AP axis where cell type composition is similar or divergent in query versus scRNA datasets.

Additional Resources

Data can be interactively explored at this link: http://shiny.maths.usyd.edu.au/SpatiotemporalMouseAtlas/.

Processed data can be downloaded using the links provided on the front page of the interactive exploration Shiny app above.

Supplementary Material

Supplemental Information

Supplemental information can be found online at https://doi.org/10.1016/j.celrep.2025.116047.

Supplemental Information
Table S1
Table S2

Highlights.

  • Integrated spatiotemporal atlas of mouse embryogenesis from E6.5 to E9.5

  • Resolves 80+ refined cell types across germ layers and embryonic stages

  • Spatial gene expression across the anterior-posterior and dorsal-ventral axes

  • Enables projection of additional datasets for comparative developmental analysis

In brief.

Harland et al. present a spatiotemporal transcriptomic atlas of early mouse development, integrating spatial and single-cell transcriptional data from gastrulation through organogenesis. This resource explores axial patterning across space and time and provides a bioinformatic pipeline to benchmark in vitro models against in vivo development.

Acknowledgments

The authors thank all their colleagues, particularly at The University of Sydney, Sydney Precision Data Science Center, Charles Perkins Center, and the University of Cambridge Stem Cell Institute, for their support and intellectual engagement. The following sources of funding are gratefully acknowledged. S.G. was supported by a Royal Society Newton International Fellowship (NIF\R1\181950), Australian Research Council DECRA Fellowship (DE220100964), and Chan Zuckerberg Initiative Single Cell Biology Data Insights grant (2022-249319). J.C.M. acknowledges the core funding from EMBL and core support from Cancer Research UK (C9545/A29580). This work was supported by the Human Biomolecular Atlas Project (NIH 1OT2OD026673-01). The work at Cambridge was supported by Wellcome, including a Wellcome Collaborative Gastrulation Consortium Award (220379/B/20/Z) to B.G. and a Wellcome Early-Career Award (226309/Z/22/Z) to L.T. G.H. C.P.’s research was supported by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy - EXC 2067/1-390729940.

The funding sources mentioned above had no role in the study design, in the collection, analysis, and interpretation of data, in the writing of the manuscript, and in the decision to submit the manuscript for publication. This research was funded in whole or in part by the Wellcome Trust. For Open Access, the author has applied a CC BY public copyright license to any Author Accepted Manuscript version arising from this submission.

Footnotes

Author Contributions

T.L. and N.K. performed the experiments with input from J.N., W.R., L.C., and J.G., and T.L. performed gene selection with input from J.C.M. and L.C. N.P. performed primary image analysis of raw data with input from L.C. C.P.P. and S.G. performed cell segmentation with input from A.K. and J.C.M. L.T.G.H. and S.G. performed analysis of the processed data with input from B.T., N. W., J.C.M., and B.G. F.A. built the Shiny app interface with input from S.G. L.T.G.H. wrote the manuscript with input from B.G. and S.G.. All authors read and approved the final manuscript.

Declaration of Interests

T.L. is an employee of Forbion. J.G. and W.R. are employees of Altos Labs. W. R. is a consultant and shareholder of Biomodal. J.C.M. has been an employee of Genentech since September 2022. N.K. is an employee of Merck Co.

Declaration of Generative AI And AI-Assisted Technologies in the Writing Process

During the preparation of this work, the authors used ChatGPT to improve the clarity of writing. After using this tool/service, the authors reviewed and edited the content as needed and take full responsibility for the content of the publication.

Data and code availability

References

  • 1.Arnold SJ, Robertson EJ. Making a commitment: cell lineage allocation and axis patterning in the early mouse embryo. Nat Rev Mol Cell Biol. 2009;10:91–103. doi: 10.1038/nrm2618. [DOI] [PubMed] [Google Scholar]
  • 2.Tam PPL, Loebel DAF. Gene function in mouse embryogenesis: get set for gastrulation. Nat Rev Genet. 2007;8:368–381. doi: 10.1038/nrg2084. [DOI] [PubMed] [Google Scholar]
  • 3.Pijuan-Sala B, Griffiths JA, Guibentif C, Hiscock TW, Jawaid W, Calero-Nieto FJ, Mulas C, Ibarra-Soria X, Tyser RCV, Ho DLL, et al. A single-cell molecular map of mouse gastrulation and early organogenesis. Nature. 2019;566:490–495. doi: 10.1038/s41586-019-0933-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Ibarra-Soria X, Jawaid W, Pijuan-Sala B, Ladopoulos V, Scialdone A, Jörg DJ, Tyser RCV, Calero-Nieto FJ, Mulas C, Nichols J, et al. Defining murine organogenesis at single-cell resolution reveals a role for the leukotriene pathway in regulating blood progenitor formation. Nat Cell Biol. 2018;20:127–134. doi: 10.1038/s41556-017-0013-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Mittnenzweig M, Mayshar Y, Cheng S, Ben-Yair R, Hadas R, Rais Y, Chomsky E, Reines N, Uzonyi A, Lumerman L, et al. A single-embryo, single-cell time-resolved model for mouse gastrulation. Cell. 2021;184:2825–2842.:e22. doi: 10.1016/j.cell.2021.04.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Argelaguet R, Clark SJ, Mohammed H, Stapel LC, Krueger C, Kapourani CA, Imaz-Rosshandler I, Lohoff T, Xiang Y, Hanna CW, et al. Multi-omics profiling of mouse gastrulation at single-cell resolution. Nature. 2019;576:487–491. doi: 10.1038/s41586-019-1825-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Hadas R, Rubinstein H, Mittnenzweig M, Mayshar Y, Ben-Yair R, Cheng S, Aguilera-Castrejon A, Reines N, Orenbuch AH, Lifshitz A, et al. Temporal BMP4 effects on mouse embryonic and extra-embryonic development. Nature. 2024;634:652–661. doi: 10.1038/s41586-024-07937-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Imaz-Rosshandler I, Rode C, Guibentif C, Harland LTG, Ton MLN, Dhapola P, Keitley D, Argelaguet R, Calero-Nieto FJ, Nichols J, et al. Tracking early mammalian organogenesis - prediction and validation of differentiation trajectories at whole organism scale. Development. 2024;151:dev201867. doi: 10.1242/dev.201867. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Qiu C, Martin BK, Welsh IC, Daza RM, Le TM, Huang X, Nichols EK, Taylor ML, Fulton O, O’Day DR, et al. A single-cell time-lapse of mouse prenatal development from gastrula to birth. Nature. 2024;626:1084–1093. doi: 10.1038/s41586-024-07069-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Eng CHL, Lawson M, Zhu Q, Dries R, Koulena N, Takei Y, Yun J, Cronin C, Karp C, Yuan GC, Cai L. Transcriptome-scale super-resolved imaging in tissues by RNA seqFISH. Nature. 2019;568:235–239. doi: 10.1038/s41586-019-1049-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Lohoff T, Ghazanfar S, Missarova A, Koulena N, Pierson N, Griffiths JA, Bardot ES, Eng CHL, Tyser RCV, Argelaguet R, et al. Integration of spatial and single-cell transcriptomic data elucidates mouse organogenesis. Nat Biotechnol. 2022;40:74–85. doi: 10.1038/s41587-021-01006-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Downs KM, Davies T. Staging of gastrulating mouse embryos by morphological landmarks in the dissecting microscope. Development. 1993;118:1255–1266. doi: 10.1242/dev.118.4.1255. [DOI] [PubMed] [Google Scholar]
  • 13.Rossi G, Broguiere N, Miyamoto M, Boni A, Guiet R, Girgin M, Kelly RG, Kwon C, Lutolf MP. Capturing Cardiogenesis in Gastruloids. Cell Stem Cell. 2021;28:230–240.:e6. doi: 10.1016/j.stem.2020.10.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Hao Y, Hao S, Andersen-Nissen E, Mauck WM, 3rd, Zheng S, Butler A, Lee MJ, Wilk AJ, Darby C, Zager M, et al. Integrated analysis of multimodal single-cell data. Cell. 2021;184:3573–3587.:e29. doi: 10.1016/j.cell.2021.04.048. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Ghazanfar S, Guibentif C, Marioni JC. Stabilized mosaic single-cell data integration using unshared features. Nat Biotechnol. 2024;42:284–292. doi: 10.1038/s41587-023-01766-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Haghverdi L, Lun ATL, Morgan MD, Marioni JC. Batch effects in single-cell RNA-sequencing data are corrected by matching mutual nearest neighbors. Nat Biotechnol. 2018;36:421–427. doi: 10.1038/nbt.4091. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Costello I, Pimeisl IM, Dräger S, Bikoff EK, Robertson EJ, Arnold SJ. The T-box transcription factor Eomesodermin acts upstream of Mesp1 to specify cardiac mesoderm during mouse gastrulation. Nat Cell Biol. 2011;13:1084–1091. doi: 10.1038/ncb2304. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Harland LTG, Simon CS, Senft AD, Costello I, Greder L, Imaz-Rosshandler I, Göttgens B, Marioni JC, Bikoff EK, Porcher C, et al. The T-box transcription factor Eomesodermin governs haemogenic competence of yolk sac mesodermal progenitors. Nat Cell Biol. 2021;23:61–74. doi: 10.1038/s41556-020-00611-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Rasmussen TL, Shi X, Wallis A, Kweon J, Zirbes KM, Koyano-Nakagawa N, Garry DJ. VEGF/Flk1 signaling cascade transactivates Etv2 gene expression. PLoS One. 2012;7:e50103. doi: 10.1371/journal.pone.0050103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Probst S, Arnold SJ, Tosic J, Tosic J, Schwan C, Grün D. Spatiotemporal sequence of mesoderm and endoderm lineage segregation during mouse gastrulation. Development. 2021;148:dev193789. doi: 10.1242/dev.193789. [DOI] [PubMed] [Google Scholar]
  • 21.Scheibner K, Schirge S, Burtscher I, Büttner M, Sterr M, Yang D, Böttcher A, Ansarullah Irmler M, Beckers J, et al. Epithelial cell plasticity drives endoderm formation during gastrulation. Nat Cell Biol. 2021;23:692–703. doi: 10.1038/s41556-021-00694-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Burtscher I, Lickert H. Foxa2 regulates polarity and epithelialization in the endoderm germ layer of the mouse embryo. Development. 2009;136:1029–1038. doi: 10.1242/dev.028415. [DOI] [PubMed] [Google Scholar]
  • 23.Fleury M, Eliades A, Carlsson P, Lacaud G, Kouskoff V. FOXF1 inhibits hematopoietic lineage commitment during early mesoderm specification. Development. 2015;142:3307–3320. doi: 10.1242/dev.124685. [DOI] [PubMed] [Google Scholar]
  • 24.Mahlapuu M, Ormestad M, Enerbäck S, Carlsson P. The forkhead transcription factor Foxf1 is required for differentiation of extra-embryonic and lateral plate mesoderm. Development. 2001;128:155–166. doi: 10.1242/dev.128.2.155. [DOI] [PubMed] [Google Scholar]
  • 25.Richardson L, Venkataraman S, Stevenson P, Yang Y, Moss J, Graham L, Burton N, Hill B, Rao J, Baldock RA, Armit C. EMAGE mouse embryo spatial gene expression database: 2014 update. Nucleic Acids Res. 2014;42:D835–D844. doi: 10.1093/nar/gkt1155. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Cai CL, Liang X, Shi Y, Chu PH, Pfaff SL, Chen J, Evans S. Isl1 identifies a cardiac progenitor population that proliferates prior to differentiation and contributes a majority of cells to the heart. Dev Cell. 2003;5:877–889. doi: 10.1016/s1534-5807(03)00363-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Saba R, Kitajima K, Rainbow L, Engert S, Uemura M, Ishida H, Kokkinopoulos I, Shintani Y, Miyagawa S, Kanai Y, et al. Endocardium differentiation through Sox17 expression in endocardium precursor cells regulates heart development in mice. Sci Rep. 2019;9:11953. doi: 10.1038/s41598-019-48321-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Nemer G, Nemer M. Cooperative interaction between GATA5 and NF-ATc regulates endothelial-endocardial differentiation of cardiogenic cells. Development. 2002;129:4045–4055. doi: 10.1242/dev.129.17.4045. [DOI] [PubMed] [Google Scholar]
  • 29.Devine WP, Wythe JD, George M, Koshiba-Takeuchi K, Bruneau BG. Early patterning and specification of cardiac progenitors in gastrulating mesoderm. eLife. 2014;3:e03848. doi: 10.7554/eLife.03848. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Zhou B, Ma Q, Rajagopal S, Wu SM, Domian I, Rivera-Feliciano J, Jiang D, von Gise A, Ikeda S, Chien KR, Pu WT. Epicardial progenitors contribute to the cardiomyocyte lineage in the developing heart. Nature. 2008;454:109–113. doi: 10.1038/nature07060. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Christoffels VM, Grieskamp T, Norden J, Mommersteeg MTM, Rudat C, Kispert A. Tbx18 and the fate of epicardial progenitors. Nature. 2009;458:E8–E10. doi: 10.1038/nature07916. discussion E9-10. [DOI] [PubMed] [Google Scholar]
  • 32.Scotti M, Kmita M. Recruitment of 5′ Hoxa genes in the allantois is essential for proper extra-embryonic function in placental mammals. Development. 2012;139:731–739. doi: 10.1242/dev.075408. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Naiche LA, Arora R, Kania A, Lewandoski M, Papaioannou VE. Identity and fate of Tbx4-expressing cells reveal developmental cell fate decisions in the allantois, limb, and external genitalia. Dev Dyn. 2011;240:2290–2300. doi: 10.1002/dvdy.22731. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Wallin J, Wilting J, Koseki H, Fritsch R, Christ B, Balling R. The role of Pax-1 in axial skeleton development. Development. 1994;120:1109–1121. doi: 10.1242/dev.120.5.1109. [DOI] [PubMed] [Google Scholar]
  • 35.Buckingham M, Relaix F. The role of Pax genes in the development of tissues and organs: Pax3 and Pax7 regulate muscle progenitor cell functions. Annu Rev Cell Dev Biol. 2007;23:645–673. doi: 10.1146/annurev.cellbio.23.090506.123438. [DOI] [PubMed] [Google Scholar]
  • 36.Wittler L, Shin EH, Grote P, Kispert A, Beckers A, Gossler A, Werber M, Herrmann BG. Expression of Msgn1 in the presomitic mesoderm is controlled by synergism of WNT signalling and Tbx6. EMBO Rep. 2007;8:784–789. doi: 10.1038/sj.embor.7401030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Biben C, Wang CC, Harvey RP. NK-2 class homeobox genes and pharyngeal/oral patterning: Nkx2-3 is required for salivary gland and tooth morphogenesis. Int J Dev Biol. 2002;46:415–422. [PubMed] [Google Scholar]
  • 38.Kaestner KH. The making of the liver: developmental competence in foregut endoderm and induction of the hepatogenic program. Cell Cycle. 2005;4:1146–1148. doi: 10.4161/cc.4.9.2033. [DOI] [PubMed] [Google Scholar]
  • 39.Nowotschin S, Setty M, Kuo YY, Liu V, Garg V, Sharma R, Simon CS, Saiz N, Gardner R, Boutet SC, et al. The emergent landscape of the mouse gut endoderm at single-cell resolution. Nature. 2019;569:361–367. doi: 10.1038/s41586-019-1127-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Garriock RJ, Chalamalasetty RB, Zhu J, Kennedy MW, Kumar A, Mackem S, Yamaguchi TP. A dorsal-ventral gradient of Wnt3a/beta-catenin signals controls mouse hindgut extension and colon formation. Development. 2020;147:dev185108. doi: 10.1242/dev.185108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Tamplin OJ, Cox BJ, Rossant J. Integrated microarray and ChIP analysis identifies multiple Foxa2 dependent target genes in the notochord. Dev Biol. 2011;360:415–425. doi: 10.1016/j.ydbio.2011.10.002. [DOI] [PubMed] [Google Scholar]
  • 42.Medina-Martinez O, Ramirez-Solis R. In vivo mutagenesis of the Hoxb8 hexapeptide domain leads to dominant homeotic transformations that mimic the loss-of-function mutations in genes of the Hoxb cluster. Dev Biol. 2003;264:77–90. doi: 10.1016/j.ydbio.2003.07.020. [DOI] [PubMed] [Google Scholar]
  • 43.Sinn R, Wittbrodt J. An eye on eye development. Mech Dev. 2013;130:347–358. doi: 10.1016/j.mod.2013.05.001. [DOI] [PubMed] [Google Scholar]
  • 44.Hagglund AC, Dahl L, Carlsson L. Lhx2 is required for patterning and expansion of a distinct progenitor cell population committed to eye development. PLoS One. 2011;6:e23387. doi: 10.1371/journal.pone.0023387. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Lagutin O, Zhu CC, Furuta Y, Rowitch DH, McMahon AP, Oliver G. Six3 promotes the formation of ectopic optic vesicle-like structures in mouse embryos. Dev Dyn. 2001;221:342–349. doi: 10.1002/dvdy.1148. [DOI] [PubMed] [Google Scholar]
  • 46.Home P, Kumar RP, Ganguly A, Saha B, Milano-Foster J, Bhattacharya B, Ray S, Gunewardena S, Paul A, Camper SA, et al. Genetic redundancy of GATA factors in the extraembryonic trophoblast lineage ensures the progression of preimplantation and post-implantation mammalian development. Development. 2017;144:876–888. doi: 10.1242/dev.145318. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Prummel KD, Nieuwenhuize S, Mosimann C. The lateral plate mesoderm. Development. 2020;147:dev175059. doi: 10.1242/dev.175059. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Barnes RM, Firulli BA, Conway SJ, Vincentz JW, Firulli AB. Analysis of the Hand1 cell lineage reveals novel contributions to cardiovascular, neural crest, extra-embryonic, and lateral mesoderm derivatives. Dev Dyn. 2010;239:3086–3097. doi: 10.1002/dvdy.22428. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Gamer LW, Wright CV. Murine Cdx-4 bears striking similarities to the Drosophila caudal gene in its homeodomain sequence and early expression pattern. Mech Dev. 1993;43:71–81. doi: 10.1016/0925-4773(93)90024-r. [DOI] [PubMed] [Google Scholar]
  • 50.Forlani S, Lawson KA, Deschamps J. Acquisition of Hox codes during gastrulation and axial elongation in the mouse embryo. Development. 2003;130:3807–3819. doi: 10.1242/dev.00573. [DOI] [PubMed] [Google Scholar]
  • 51.Kinder SJ, Loebel DA, Tam PP. Allocation and early differentiation of cardiovascular progenitors in the mouse embryo. Trends Cardiovasc Med. 2001;11:177–184. doi: 10.1016/s1050-1738(01)00091-3. [DOI] [PubMed] [Google Scholar]
  • 52.Kinder SJ, Tsang TE, Quinlan GA, Hadjantonakis AK, Nagy A, Tam PP. The orderly allocation of mesodermal cells to the extra-embryonic structures and the anteroposterior axis during gastrulation of the mouse embryo. Development. 1999;126:4691–4701. doi: 10.1242/dev.126.21.4691. [DOI] [PubMed] [Google Scholar]
  • 53.Tam PP, Parameswaran M, Kinder SJ, Weinberger RP. The allocation of epiblast cells to the embryonic heart and other mesodermal lineages: the role of ingression and tissue movement during gastrulation. Development. 1997;124:1631–1642. doi: 10.1242/dev.124.9.1631. [DOI] [PubMed] [Google Scholar]
  • 54.Peng G, Suo S, Cui G, Yu F, Wang R, Chen J, Chen S, Liu Z, Chen G, Qian Y, et al. Molecular architecture of lineage allocation and tissue organization in early mouse embryo. Nature. 2019;572:528–532. doi: 10.1038/s41586-019-1469-8. [DOI] [PubMed] [Google Scholar]
  • 55.Peng G, Suo S, Chen J, Chen W, Liu C, Yu F, Wang R, Chen S, Sun N, Cui G, et al. Spatial Transcriptome for the Molecular Annotation of Lineage Fates and Cell Identity in Mid-gastrula Mouse Embryo. Dev Cell. 2016;36:681–697. doi: 10.1016/j.devcel.2016.02.020. [DOI] [PubMed] [Google Scholar]
  • 56.Suppinger S, Zinner M, Aizarani N, Lukonin I, Ortiz R, Azzi C, Stadler MB, Vianello S, Palla G, Kohler H, et al. Multimodal characterization of murine gastruloid development. Cell Stem Cell. 2023;30:867–884.:e11. doi: 10.1016/j.stem.2023.04.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.van den Brink SC, Alemany A, van Batenburg V, Moris N, Blotenburg M, Vivié J, Baillie-Johnson P, Nichols J, Sonnen KF, Martinez Arias A, van Oudenaarden A. Single-cell and spatial transcriptomics reveal somitogenesis in gastruloids. Nature. 2020;582:405–409. doi: 10.1038/s41586-020-2024-3. [DOI] [PubMed] [Google Scholar]
  • 58.Beccari L, Moris N, Girgin M, Turner DA, Baillie-Johnson P, Cossy AC, Lutolf MP, Duboule D, Arias AM. Multi-axial self-organization properties of mouse embryonic stem cells into gastruloids. Nature. 2018;562:272–276. doi: 10.1038/s41586-018-0578-0. [DOI] [PubMed] [Google Scholar]
  • 59.Turner DA, Girgin M, Alonso-Crisostomo L, Trivedi V, Baillie-Johnson P, Glodowski CR, Hayward PC, Collignon J, Gustavsen C, Serup P, et al. Anteroposterior polarity and elongation in the absence of extra-embryonic tissues and of spatially localised signalling in gastruloids: mammalian embryonic organoids. Development. 2017;144:3894–3906. doi: 10.1242/dev.150391. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.van den Brink SC, Baillie-Johnson P, Balayo T, Hadjantonakis AK, Nowotschin S, Turner DA, Martinez Arias A. Symmetry breaking, germ layer specification and axial organisation in aggregates of mouse embryonic stem cells. Development. 2014;141:4231–4242. doi: 10.1242/dev.113001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Rossi G, Giger S, Hübscher T, Lutolf MP. Gastruloids as in vitro models of embryonic blood development with spatial and temporal resolution. Sci Rep. 2022;12:13380. doi: 10.1038/s41598-022-17265-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Gandin V, Kim J, Yang LZ, Lian Y, Kawase T, Hu A, Rokicki K, Fleishman G, Tillberg P, Castrejon AA, et al. Deep-tissue transcriptomics and subcellular imaging at high spatial resolution. Science. 2025;388:eadq2084. doi: 10.1126/science.adq2084. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Qiu X, Zhu DY, Lu Y, Yao J, Jing Z, Min KH, Cheng M, Pan H, Zuo L, King S, et al. Spatiotemporal modeling of molecular holograms. Cell. 2024;187:7351–7373.:e61. doi: 10.1016/j.cell.2024.10.011. [DOI] [PubMed] [Google Scholar]
  • 64.Theeuwes B, Harland LTG, Bisia AM, Costello I, Ton MLN, Lohoff T, Clark SJ, Argelaguet R, Wilson NK, Reik W, et al. Eomes directs the formation of spatially and functionally diverse extraembryonic hematovascular tissues. Dev Cell. 2025;60:1–12. doi: 10.1016/j.devcel.2025.06.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Dann E, Cujba AM, Oliver AJ, Meyer KB, Teichmann SA, Marioni JC. Precise identification of cell states altered in disease using healthy single-cell references. Nat Genet. 2023;55:1998–2008. doi: 10.1038/s41588-023-01523-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Ton MLN, Keitley D, Theeuwes B, Guibentif C, Ahnfelt-Rønne J, Andreassen TK, Calero-Nieto FJ, Imaz-Rosshandler I, Pijuan-Sala B, Nichols J, et al. An atlas of rabbit development as a model for single-cell comparative genomics. Nat Cell Biol. 2023;25:1061–1072. doi: 10.1038/s41556-023-01174-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Mayshar Y, Raz O, Cheng S, Ben-Yair R, Hadas R, Reines N, Mittnenzweig M, Ben-Kiki O, Lifshitz A, Tanay A, Stelzer Y. Time-aligned hourglass gastrulation models in rabbit and mouse. Cell. 2023;186:2610–2627.:e18. doi: 10.1016/j.cell.2023.04.037. [DOI] [PubMed] [Google Scholar]
  • 68.Moris N, Anlas K, van den Brink SC, Alemany A, Schröder J, Ghimire S, Balayo T, van Oudenaarden A, Martinez Arias A. An in vitro model of early anteroposterior organization during human development. Nature. 2020;582:410–415. doi: 10.1038/s41586-020-2383-9. [DOI] [PubMed] [Google Scholar]
  • 69.Hamazaki N, Yang W, Kubo CA, Qiu C, Martin BK, Garge RK, Regalado SG, Nichols EK, Pendyala S, Bradley N, et al. Retinoic acid induces human gastruloids with posterior embryo-like structures. Nat Cell Biol. 2024;26:1790–1803. doi: 10.1038/s41556-024-01487-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Miao Y, Djeffal Y, De Simone A, Zhu K, Lee JG, Lu Z, Silberfeld A, Rao J, Tarazona OA, Mongera A, et al. Reconstruction and deconstruction of human somitogenesis in vitro. Nature. 2023;614:500–508. doi: 10.1038/s41586-022-05655-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.McDole K, Guignard L, Amat F, Berger A, Malandain G, Royer LA, Turaga SC, Branson K, Keller PJ. In Toto Imaging and Reconstruction of Post-Implantation Mouse Development at the Single-Cell Level. Cell. 2018;175:859–876.:e33. doi: 10.1016/j.cell.2018.09.031. [DOI] [PubMed] [Google Scholar]
  • 72.Klein D, Palla G, Lange M, Klein M, Piran Z, Gander M, Meng-Papaxanthos L, Sterr M, Saber L, Jing C, et al. Mapping cells through time and space with moscot. Nature. 2025;638:1065–1075. doi: 10.1038/s41586-024-08453-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Halmos P, Liu X, Gold J, Chen F, Ding L, Raphael BJ. DeST-OT: Alignment of spatiotemporal transcriptomics data. Cell Syst. 2025;16:101160. doi: 10.1016/j.cels.2024.12.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Van den Berge K, Roux de Bézieux H, Street K, Saelens W, Cannoodt R, Saeys Y, Dudoit S, Clement L. Trajectory-based differential expression analysis for single-cell sequencing data. Nat Commun. 2020;11:1201. doi: 10.1038/s41467-020-14766-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Zhou Y, Zhou B, Pache L, Chang M, Khodabakhshi AH, Tanaseichuk O, Benner C, Chanda SK. Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Nat Commun. 2019;10:1523. doi: 10.1038/s41467-019-09234-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Ronneberger O, Fischer P, Brox T. U-net: Convolutional Networks for Biomedical Image Segmentation. Springer International Publishing; 2015. pp. 234–241. [Google Scholar]
  • 77.Schmidt U, Weigert M, Broaddus C, Myers G. Cell Detection with Star-Convex Polygons. Springer; 2018. pp. 265–273. [Google Scholar]
  • 78.Stringer C, Wang T, Michaelos M, Pachitariu M. Cellpose: a generalist algorithm for cellular segmentation. Nat Methods. 2021;18:100–106. doi: 10.1038/s41592-020-01018-x. [DOI] [PubMed] [Google Scholar]
  • 79.Hastie T, Stuetzle W. Principal Curves. J Am Stat Assoc. 1989;84:502–516. doi: 10.1080/01621459.1989.10478797. [DOI] [Google Scholar]
  • 80.Gu Z, Eils R, Schlesner M. Complex heatmaps reveal patterns and correlations in multidimensional genomic data. Bioinformatics. 2016;32:2847–2849. doi: 10.1093/bioinformatics/btw313. [DOI] [PubMed] [Google Scholar]
  • 81.Shah S, Takei Y, Zhou W, Lubeck E, Yun J, Eng CHL, Koulena N, Cronin C, Karp C, Liaw EJ, et al. Dynamics and Spatial Genomics of the Nascent Transcriptome by Intron seqFISH. Cell. 2018;174:363–376.:e16. doi: 10.1016/j.cell.2018.05.035. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Caicedo JC, Goodman A, Karhohs KW, Cimini BA, Ackerman J, Haghighi M, Heng C, Becker T, Doan M, McQuin C, et al. Nucleus segmentation across imaging experiments: the 2018 Data Science Bowl. Nat Methods. 2019;16:1247–1253. doi: 10.1038/s41592-019-0612-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Berg S, Kutra D, Kroeger T, Straehle CN, Kausler BX, Haubold C, Schiegg M, Ales J, Beier T, Rudy M, et al. ilastik: interactive machine learning for (bio)image analysis. Nat Methods. 2019;16:1226–1232. doi: 10.1038/s41592-019-0582-9. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental Information
Table S1
Table S2

Data Availability Statement

RESOURCES