spatiAlign: an unsupervised contrastive learning model for data integration of spatially resolved transcriptomics

Chao Zhang; Lin Liu; Ying Zhang; Mei Li; Shuangsang Fang; Qiang Kang; Ao Chen; Xun Xu; Yong Zhang; Yuxiang Li

doi:10.1093/gigascience/giae042

. 2024 Jul 19;13:giae042. doi: 10.1093/gigascience/giae042

spatiAlign: an unsupervised contrastive learning model for data integration of spatially resolved transcriptomics

Chao Zhang ^1,^#, Lin Liu ^2,^#, Ying Zhang ^3,^#, Mei Li ⁴, Shuangsang Fang ^5,⁶, Qiang Kang ⁷, Ao Chen ^8,⁹, Xun Xu ^10,⁴, Yong Zhang ^11,^12,^13,^✉, Yuxiang Li ^14,^15,^16,^✉

PMCID: PMC11258913 PMID: 39028588

Abstract

Background

Integrative analysis of spatially resolved transcriptomics datasets empowers a deeper understanding of complex biological systems. However, integrating multiple tissue sections presents challenges for batch effect removal, particularly when the sections are measured by various technologies or collected at different times.

Findings

We propose spatiAlign, an unsupervised contrastive learning model that employs the expression of all measured genes and the spatial location of cells, to integrate multiple tissue sections. It enables the joint downstream analysis of multiple datasets not only in low-dimensional embeddings but also in the reconstructed full expression space.

Conclusions

In benchmarking analysis, spatiAlign outperforms state-of-the-art methods in learning joint and discriminative representations for tissue sections, each potentially characterized by complex batch effects or distinct biological characteristics. Furthermore, we demonstrate the benefits of spatiAlign for the integrative analysis of time-series brain sections, including spatial clustering, differential expression analysis, and particularly trajectory inference that requires a corrected gene expression matrix.

Keywords: spatial transcriptomics, batch effect, data integration, domain adaptation, contrastive learning

Introduction

The rapid advancements of spatially resolved transcriptomics (SRT) have revolutionized our understanding of the spatial organization and heterogeneity of cells within complex tissues and developmental processes [1]. Cutting-edge in situ capturing technologies (e.g., 10x Genomics Visium [2], Slide-seq [3], Stereo-seq [4], and Seq-scope [5]) have facilitated the simultaneous measurement of tens of thousands of genes in their spatial context, achieving unprecedented cellular or even subcellular resolution. The SRT datasets are typically acquired from different tissue sections, each potentially representing a fragmented profiling of the targeted biological system. Hence, integrating multiple datasets for joint analysis is imperative to decipher the whole biological system. However, integrative analysis presents significant challenges due to the inherent biological variability and batch effects caused by nonbiological factors such as technology differences and different experimental batches.

Prior efforts to tackle this task have conventionally focused on single-cell RNA sequencing (scRNA-seq) technologies [6, 7], which can be roughly classified into 2 main categories: methods that (i) generate a joint embedding space [8–13] and (ii) calculate a corrected feature matrix [14–17]. For example, Harmony [8] projects cells into a shared embedding by maximum diversity clustering and iteratively learning a cell-specific linear correction function that regresses out biological effects within clusters. SCALEX [13], a deep learning method, provides a truly online tool to project cells into a batch-invariant, common cell-embedding space. Although these methods prove valuable for capturing the overall characteristics of cells, such as combined clustering, they are not applicable to downstream gene-level analysis tasks, such as differentially expressed gene (DEG) analysis. In contrast, popular MNN-based methods such as Seurat v3 [16] efficiently address batch effects in gene expression, but their limitation lies in the ability to align only 2 batches at a time, and they become impractical when dealing with many batches. However, it is worth noting that these scRNA-seq data integration tools have focused on harmonizing gene expression profiles across different experimental batches and do not consider the spatial context of spots/cells.

In the field of SRT studies, embedding spatial information has proven beneficial for downstream analysis, such as spatial domain identification [18, 19], imputation [20, 21], clustering [22], and cell-type annotation [23]. More recently, works have been published to improve the integration of SRT datasets by exploiting spatial information. PRECAST leverages spatial smoothness in both the cluster label and lower-dimensional representations to estimate aligned embeddings for multiple tissue sections, effectively capturing the spatial relationship between cells/spots [24]. GraphST introduces a graph self-supervised contrastive learning model to reconstruct gene expression by minimizing the embedding distance between spatially adjacent spots [25]. However, PRECAST primarily focuses on generating a low-dimensional aligned embedding, and GraphST requires registering the spatial coordinates of samples first to ensure its integration performance; thus, their applications are limited in certain scenarios.

To address these challenges, we propose spatiAlign, an unsupervised method that leverages spatial embedding and across-domain adaptation strategies for aligning SRT datasets. spatiAlign offers 3 key advantages as follows. First, it effectively captures the underlying relationships between spots/cells in both the spatial neighborhoods and gene expression to learn latent representations with a deep graph infomax (DGI) [26] framework. Second, spatiAlign aligns biological effects by adapting the semantic similarities between spots/cells and/or pseudoclusters from one section to another without relying on external labeled data, resulting in a joint batch-corrected embedding. Third, benefiting from a symmetric decoder in DGI, spatiAlign outputs the reconstructed spatial gene expression matrices, in which gene expression is adjusted and batch effects are corrected. We validate the 3 advantages of spatiAlign with 4 applications on publicly available 10x Genomics Visium, Slide-seq, and Stereo-seq datasets of human and mouse tissues. The benchmarking analysis demonstrates spatiAlign’s superiority in learning low-dimensional representations compared with 8 established methods, including GraphST and PRECAST, which were recently developed for SRT datasets. Compared with the original spatial expression of brain region-specific markers, the counts reconstructed using spatiAlign more accurately represent their laminar organization, exhibiting distinctly clear boundaries between regions. We also validate the capability of spatiAlign to capture the unique characteristics of 3 Slide-seq mouse hippocampus slices, which contain regions with different structures. The comprehensive integrated analysis of developing mouse brain slices indicates that the aligned joint representations, which embed cellular neighborhoods, improve the identification of cell clusters. In addition, the reconstructed features from our proposed spatiAlign method facilitate the identification of DEGs under different developmental stages and the recovery of cellular trajectories.

Results

Overview of spatiAlign

spatiAlign takes as inputs multiple SRT datasets, comprising the expression of all measured genes and spatial locations of spots/cells, to achieve 2 objectives: low-dimensional semantic alignment and high-dimensional gene expression reconstruction (Fig. 1A). In low-dimensional alignment, the primary strategy underlying spatiAlign is to implement a self-supervised contrastive learning architecture (DGI-based framework) to reduce dimensions while simultaneously propagating neighboring spatial context between spots/cells (Fig. 1C). Furthermore, it employs an across-domain adaptation technique to align joint embeddings, effectively accounting for batch effects across multiple tissue sections (Fig. 1B). In high-dimensional gene expression reconstruction, we utilize a decoder included in the DGI to reverse aligned representations back into the raw gene expression space, thereby reconstructing the gene expression counts.

Figure 1: — Overview of spatiAlign. (A) spatiAlign takes as inputs multiple spatially resolved transcriptomics (SRT) datasets that consist of gene expression profiles for all measured genes and spatial locations of spots/cells. Using semantic alignment, spatiAlign generates a shared batch-corrected embedding, where biological effects are aligned. Moreover, spatiAlign reconstructs the full high-dimensional expression space, enhancing and correcting gene expression counts. In addition to SRT dataset integration and gene feature correction, spatiAlign returns a final joint embedding and enhanced gene expression matrices to facilitate downstream analysis, such as joint spatial clustering, joint differential expression analysis, and joint trajectory inference. (B) spatiAlign takes multiple SRT datasets as inputs. Latent embeddings are first generated using deep graph infomax (DGI) as feature extractors. Then, with the utilization of across-domain adaptation and memory bank strategies, spatiAlign brings similar semantic spots/cells closer together and pushes dissimilar spots/cells farther apart, irrespective of their original datasets. These self-batch and across-batch contrastive learning processes align biological effects while correcting batch effects. (C) A DGI framework takes as inputs the normalized gene expression matrix and corresponding spatial coordinates from an SRT dataset. A spatial neighboring graph (i.e., adjacent matrix A) is built to represent the spatial relationships between adjacent spots/cells. To create an augmented gene expression matrix X′, a random perturbation is applied to shuffle the original gene expression X while maintaining the spatial neighboring graph unchanged. Deep neural network (DNN)–based autoencoders are used to learn gene representations Z and Z′ by reducing the dimension of gene expression matrix X and the augmented expression matrix X′. These representations are individually fed into a variational graph autoencoder (VGAE), along with the spatial neighboring graph, which performs spatial embedding for the gene representations and outputs the final latent representations S and S′ that capture the rich information in original/augmented gene expression profiles and spatial information. Afterward, embeddings S are optimized using our self-supervised contrastive learning strategy, which ensures that spatially adjacent cells have similar embeddings while nonadjacent cells have dissimilar embeddings. Finally, the final embeddings S can be reversed back to the original feature space, resulting in a reconstructed gene expression matrix.

Formally, given a series of SRT datasets, gene expression profiles are transformed into cell/spot-gene matrices (e.g., gene expression matrix X) and spatial neighboring graphs between cells/spots (e.g., cell‒cell adjacent matrix A), where the connective relationships of cells/spots are negatively associated with Euclidean distance. We design a deep neural network (DNN)–based autoencoder to learn the low-dimensional gene representations Z from the original gene expression matrix. The adjacency matrix A and the reduced gene representations Z are fed into a variational graph autoencoder (VGAE) [27] that propagates spatial neighboring context for the gene representations, resulting in a final joint representation S (positive samples) that captures comprehensive characteristics of the gene expression profile and cellular neighborhoods. Thereafter, the adjusted gene expression matrices can be reconstructed using a symmetric decoder architecture, which reverses the joint representations S back to the original space.

To improve spatiAlign’s ability to exploit potential information in SRT datasets, augmentation-based contrastive learning is adopted [25, 28, 29]. Technically, a gene expression matrix X is augmented by randomly shuffling the gene expression vector of spots/cells to create a corrupted gene expression matrix Inline graphic while keeping the spatial neighboring graph unchanged. Then, the corrupted gene expression matrix and adjacency matrix A are fed into the aforementioned model, which utilizes the shared model weights to generate corrupted joint representations (negative samples). We then use self-supervised contrastive learning to bring the positive samples closer within the spatial neighboring context while pushing the negative samples far apart within the same neighboring context (Fig. 1C).

Using an across-domain adaptation [28, 30, 31] and deep clustering [32] strategy, spatiAlign aims to align biological effects while maximizing the preservation of biological variances in the latent embedding of spots/cells. Specifically, we use a memory bank to store the final latent representations for each dataset that will be used to measure the similarity between spots/cells or pseudoclusters for self-batch/across-batch contrastive learning. For each tissue section, spatiAlign minimizes the similarity distance between the current latent representations and the corresponding memory bank entries to bring similar semantic spots/cells closer together and push dissimilar semantic spots/cells far apart. In parallel, inspired by the idea of “label as representation,” we assume that the dimension of the final latent embedding is equal to the number of pseudoprototypical clusters, and the spots/cells vector denotes its soft label accordingly. Thus, each spot/cell is assigned to a different pseudo-cluster, and all pseudo-clusters should differ from each other. Identically, spatiAlign employs “current pseudo-cluster representation” (transposition latent representation) and “cached pseudo-cluster representation” (transposition corresponding memory bank) to bring the same pseudo-cluster spots/cells closer together and push dissimilar pseudo-cluster spots/cells far apart. In across-batch contrastive learning, cross-similarity between spots/cells, measured by the current latent representation and memory bank of other sections, is minimized to align biological effects across sections, ensuring similar semantic spots/cells closer together, regardless of which sections they are from.

spatiAlign outperforms the control methods in integrating dorsolateral prefrontal cortex datasets

We evaluated the effectiveness of spatiAlign in analyzing a series of 10x Genomics Visium datasets from the human dorsolateral prefrontal cortex (DLPFC). This dataset includes 12 tissue sections, which are organized into 3 groups. Each group (approximately 14,000 spots and 14,000 genes) consists of 4 sections that are manually annotated in the original study to 6 tissue layers (Layer_1 to Layer_6) and white matter (WM) (Fig. 2A, Supplementary Fig. S1a) [33]. We first performed graph-based clustering (Leiden) on the latent representations of spatiAlign and the other 8 benchmarked methods to assess their capability in aligning embedding space. Before comparison, we merged the Leiden clusters of each method with the ground truth using a maximum matching strategy for certain categories to produce final clustering results (Supplementary Fig. S1b–f). spatiAlign achieved the highest adjusted Rand index (ARI) [34] score with a mean of 0.5967 on all 4 sections and outperformed all the control methods (Fig. 2B). In addition, spatiAlign achieved the highest mean weighted F1 score of the local inverse Simpson’s index (LISI) [8] of 0.8402 (Fig. 2C), where sufficient mixing and variation preservation were equally evaluated. In comparison, MNN showed ineffectiveness in fusing the sections together and obtained the lowest weighted F1 score of LISI. The uniform manifold approximation and projection (UMAP) visualization for each method revealed that spatiAlign outperformed other control methods in separating clusters while simultaneously integrating slices (Fig. 2D). In particular, methods such as GraphST, SCALEX, Harmony, and Combat did not clearly separate spots belonging to distinct layers, and the batches did not mix well when using MNN. Although PRECAST appeared to separate clusters and integrate batches well, it resulted in Layer_1 being split into 2 groups.

Figure 2: — spatiAlign outperforms the control methods in integrating the human dorsolateral prefrontal cortex (DLPFC) datasets. (A) Manual annotation of sample ID 151673 from the original study. (B) Bar plots of the mean scores of the adjusted Rand index (ARI) for the combined clusters from spatiAlign and other control methods. (C) Bar plots of the weighted F1 scores of the local inverse Simpson’s index (LISI), assessing both batch mixing and cell-type separation, for the integration results from different data integration methods. (D) UMAP plots for the integrated batches and identified cell types from spatiAlign and other control methods. For the integration result of each method, dots in the right panel are colored by batch, and dots in the left panel are colored by cell type. (E) PAGA graphs of spatiAlign and other control methods. (F) Spatial visualization of spatiAlign-adjusted (top panel) and unadjusted normalized (bottom panel) expression of layer-marker genes. (G) Violin plots of the raw (left panel) and spatiAlign-adjusted (right panel) expression of layer-marker genes. The cortical layers corresponding to the layer-marker genes are highlighted with red boxes.

Furthermore, we validated the latent embeddings with the inferred trajectory from PAGA [35] (Fig. 2E). The PAGA path derived from spatiAlign embeddings exhibited a clear and nearly linear spatial trajectory from Layer_1 to Layer_6, with significant similarities observed between adjacent layers, in accordance with the developmental process of the neurons [36]. In contrast, the PAGA results of the other benchmarked methods were intermixed. Finally, we compared the spatial expression patterns of layer marker genes before and after spatiAlign (Fig. 2F). The results revealed that the expression of layer-marker genes, as reconstructed by spatiAlign, exhibited improved laminar enrichment and more refined distributions compared with the original data. For example, CXCL14 in Layer_1 and Layer_2, ETV1 in Layer_5 and Layer_6, and VAT1L in Layer_5 were consistent with previous studies [37], whereas their raw gene expression did not show discernible spatial laminar patterns. In addition, violin plots illustrating gene expression both pre- and post-spatiAlign processing highlighted the expression adjustment achieved by spatiAlign (Fig. 2G). For example, the reconstructed expression of SEMA3C significantly populated Layer 6 compared with the original data. Such expression adjustment was also observed in other sections (e.g., in sample ID 151674), thereby further validating the reliability of the reconstructed expressions (Supplementary Fig. S2c).

spatiAlign enables the alignment of multiple olfactory bulb datasets from different SRT platforms

To demonstrate the efficiency of spatiAlign in integrating datasets from different sequencing platforms, we used 3 mouse olfactory bulb datasets. One slice was profiled by 10x Genomics Visium (1,184 spots and 13,956 genes), while the other 2 slices were obtained from Stereo-seq (Fig. 3A) (1,123 spots, 20,317 genes and 812 spots, 16,370 genes), respectively. Before integration, we manually annotated each dataset (Fig. 3C) by leveraging unsupervised clustering (Supplementary Fig. S3a, b), reported marker genes (Supplementary Fig. S3c–f), and the ssDNA image (Fig. 3B). This provided a ground truth for calculating the weighted F1 score of LISI, which quantified the performance of the methods in aligning batches and separating cells from different clusters. As a result, spatiAlign achieved the highest score of 0.7935, outperforming other methods such as PRECAST (0.6863) and SCALEX (0.6099), while MNN was the poorest, with a score of 0.0485 (Fig. 3D). Next, on the UMAP plots, we illustrated the batch effects present before alignment (Fig. 3E). After integration, spatiAlign demonstrated successful batch merging, in contrast to the outputs of PRECAST, GraphST, Harmony, Combat, and other control methods, where prominent batch effects remained observable. In addition, spatiAlign found separate clusters that aligned well across the 3 sections (Fig. 3F). Even though BBKNN and SCALEX also generated separate clusters, batch effects were still visible after their integration. Hence, compared with combined clustering results produced by the control methods, those detected using spatiAlign embeddings better corresponded to the annotated ground truth and showed a higher consistency across different sections.

Figure 3: — spatiAlign integrates 3 mouse olfactory bulb datasets from Stereo-seq and 10x Genomics Visium sequencing platforms. (A) The mouse olfactory bulb datasets consisted of 3 sections, with 2 sections sequenced using Stereo-seq and the third section generated from 10x Genomics Visium. The 2 Stereo-seq datasets were sequenced on different types of chips, with spots having center-to-center distances of 500 and 715 nm, respectively (middle panel). Hence, the 2 Stereo-seq datasets were individually binned at Bin140 and Bin200 to ensure that the center-to-center distances of spots across the 3 sections were uniformly sized at 100 µm (right panel). (B) Organization of mouse olfactory bulb annotated by ssDNA image. (C) Manual annotation as a ground truth for benchmarking analysis. Spots are colored by cell type. (D) Bar plots of the weighted F1 scores of LISI for the integration results from spatiAlign and the other control methods. (E) Visualization of batch effects present in batches and cell types before integration. (F) UMAP plots for the integrated batches and identified cell types from spatiAlign and other control methods. For the integration result of each method, dots in the right panel are colored by batch, and dots in the left panel are colored by cell type. (G) Spatial visualization of spatiAlign-adjusted (top panel) and raw (middle panel) expression of marker genes, together with the associated cell types (bottom panel). spatiAlign denoised and adjusted the spatial expression pattern of marker genes compared with unadjusted normalized data.

Furthermore, we showed that the reconstructed gene expression from spatiAlign (Fig. 3G, Supplementary Fig. S4d, e, top panel) was adjusted compared with the raw gene expression (Fig. 3G, Supplementary Fig. S4d, e, middle panel). For some marker genes [38] (e.g., Cmtm5, Cdhr1, Doc2g, and Pcp4), the spatial expression pattern was clearly adjusted and more consistent with the spatial locations of the corresponding cell types (Fig. 3G, Supplementary Fig. S4d, e, bottom panel).

spatiAlign preserves heterogeneous characteristics among slices while aligning datasets

We utilized 3 mouse hippocampal slices from Slide-seq (Fig. 4A and Supplementary Table S1) to assess the performance of spatiAlign and the benchmarked methods in integrating datasets with different biological characteristics. These mouse hippocampus slices were collected from different regions in the mouse brain [3, 39, 40], which included a total of 69,528 cells and 11,376 genes in the merged dataset. As shown on the UMAP plots, spatiAlign accurately integrated disparate datasets and revealed diverse clusters of structural heterogeneity (Fig. 4B, D, G). Quantitatively, spatiAlign excelled over other control methods with an integrated LISI (iLISI) index of 0.6230, except for SCALEX. However, despite achieving the highest iLISI index, SCALEX was unable to preserve the biological difference among slices (Fig. 4C, Supplementary Fig. S5a, d).

Figure 4: — spatiAlign identifies distinct brain structures specific to each slice while integrating 3 mouse hippocampus datasets. (A) Spatial heatmap of total transcripts in the 3 mouse hippocampal slices measured by Slide-seq. (B) UMAP plot for the integrated slices from spatiAlign. (C) Bar plots of the integration LISI (iLISI) scores, evaluating batch mixing, for the integration results from spatiAlign and other control methods. (D) Spatial visualization (left) and UMAP plot (right) for the joint clustering results from spatiAlign. (E) Boxplots of global Moran’s I index for the joint clusters from spatiAlign and other control methods. (F) Boxplots of local join counts for the joint clusters from spatiAlign and other control methods. (G). The expression matrix plot of markers of the CA1, CA3, and DG regions. (H) Spatial visualization (left) and UMAP plot (right) of CA1, CA3, and DG regions that were only identified by spatiAlign. (I) Spatial visualization (left) and UMAP plot (right) of V3 and the habenula that are specific to slice 1 and slice 2. (J) Spatial expression of the marker genes Enpp2 in V3 and Tac2 in the habenula. (K) Volcano plot of differentially expressed genes (DEGs) between the habenula and rest. (L) Top 10 highly enriched Gene Ontology terms for the top 100 ranked DEGs.

Furthermore, we adopted hierarchical clustering to validate the effectiveness of each method in identifying the brain regions. The resulting cell clusters after spatiAlign displayed strong spatial aggregation with clear boundaries and higher consistency with the anatomical structures of the Allen Brain Atlas [41] (Fig. 4D and Supplementary Fig. S5b). Such observation was further evidenced by the global Moran’s I index and the local join counts, which measures spatial autocorrelation (Fig. 4E, F). Regarding finding the substructural regions, our proposed spatiAlign was the only method that identified the substructures of the hippocampus, including CA1, CA2, and dentate gyrus (DG), on all 3 slices (Fig. 4G). The successful hippocampus-related-region identification of spatiAlign had higher consistency across 3 slices than others (Fig. 4H), while GraphST detected incorrect regions due to a lack of registering spatial coordinates (Supplementary Fig. S5d). For preserving heterogeneous characteristics, we observed that the identified habenula and third ventricle (V3) regions were barely enriched on slice 3 but highly populated on the other 2 slices, as expected (Fig. 4H). Such results were in high concordance with the expression spatial pattern of the associated marker genes [42] Enpp2 for V3 and Tac2 for habenula (Fig. 4J). To validate the biological traits of heterogeneous embedding, we implemented DEG and Gene Ontology (GO) analyses on the detected habenular cell groups. We found key marker genes [42] for habenula among the highly expressed genes of the merged dataset (e.g., Gm5741, Nwd2, Gng8, and Lrrc55) (Fig. 4K). In addition, the GO enrichment analysis showed that the habenula is actively involved in the production and synthesis of ATP (Fig. 4L). This finding was in accordance with biological understandings that ATP not only plays a crucial role in energy metabolism for habenular cells but also acts as a neurotransmitter to modulate neuronal activity and synaptic transmission [43].

spatiAlign facilitates joint gene-level analysis of time-series mouse embryonic brain

Finally, we utilized a series of mouse brain datasets [4] extracted from multiple developing mouse embryos (Fig. 5A), measured by Stereo-seq, to demonstrate the benefits of spatiAlign for downstream gene-level analysis. These brain sections were collected at different embryonic days from E9.5 to E16.5, which included a total of 104,974 cells and 22,864 genes in the merged dataset. Herein, we initially evaluated the inherent batch effects present prior to alignment. Before applying spatiAlign, cells were primarily grouped by batch (Fig. 5B). In comparison, spatiAlign well aligned these datasets within its lower-dimensional representations, where the batch effects were adjusted. The cells were then clustered into coherent groups in an unsupervised manner, and we next manually labeled them by referring to the expression of marker genes reported by the atlas of the developing mouse brain [44] (Fig. 5B). These marker genes (e.g., Ccnd2 of NeuB, Col4a1 of fibroblast, Sncg of FMN, Slc1a3 of Hb VZ, and Hcrtr2 of Spall VZ) exhibited the highest expression levels in their corresponding cell types that had a relatively high fraction (Fig. 5C). In particular, we found 2 subtypes of GABAergic interneurons in the subpallial region that were characterized by the Dlx5 and Gpm6a genes, which we named SPall Gpm6a and SPall Dlx5, respectively (Fig. 5C). The validity of these annotations was also confirmed by the strong correspondence observed in the spatial distributions between cell types and relevant marker genes (Supplementary Fig. S6a).

Figure 5: — spatiAlign facilitates joint analysis of time-series mouse embryonic brain sections. (A) Unsupervised clustering of time-series brain sections extracted from the mouse embryos across E9.5 to E16.5 (E9.5, E12.5, E13.5, E14.5, E15.5, and E16.5) after integration using spatiAlign. Spots are colored by their annotation (right panel). Astro, astrocyte; Cere, cerebellum; Chor, choroid plexus; Die, diencephalon; Ery, erythrocyte; FMN, facial motor nucleus; Hb, hindbrain; Hy, hypothalamus; Mb, dorsal midbrain; NeuB, neuroblast; OB, olfactory bulb; OPC, oligodendrocyte precursor cell; SPall, subpallium; SpC, spinal cord; VZ, ventricular zone. (B) UMAP plots for batch mixing before spatiAlign (left) and after spatiAlign (middle) and the labeled combined clusters from spatiAlign (right). (C) Expression dot plots showing the gene expression specificity of typical marker genes for identified cell types. Dot size represents the proportion of expressing cells, and color indicates the average expression level in each identified cell type. (D) Top 3 highly enriched GO terms for differentially expressed genes from E9.5 to E16.5. (E) PAGA graph of spatiAlign embeddings. Each node represents a batch that is connected by weighted edges that quantify the connectivity between batches. (F) Age-specific genes traced along the PAGA graph paths. (G) Cellular trajectory across different time points inferred by the spatiAlign-corrected feature matrix (left) and raw expression (right), with black arrows representing transition trends. (H) Cellular state transitions across cell types (middle panel) and expression of reported driver genes for neuronal (top panel) and nonneuronal cells (bottom panel).

A key benefit of our proposed spatiAlign is its ability to obtain aligned gene expression with batch effects removed, thereby enabling downstream gene-level analysis. Based on the reconstructed expression features, we identified DEGs across E9.5 to E16.5 using the Wilcoxon test in SCANPY. A heatmap of the expression of the top 5 ranked DEGs (Supplementary Fig. S6b) illustrated high specificity across different developmental stages. In our observations, the detected DEGs (e.g., Id2, Lbx1, Id3, Cdh8, and Nlgn1) have been reported to play crucial roles in neuronal differentiation and maturation processes, such as neurogenesis and synaptic plasticity. Specifically, Id2, with differential expression at E9.5, has been extensively studied for its involvement in balancing neuronal proliferation and differentiation [45]. Similarly, Id3, showing specificity to E13.5, was widely recognized for its function in controlling the timing of neurogenesis in the embryo [46]. Conversely, the top-ranked DEGs identified at E16.5, such as Nlgn1, Cadm2, Nrg1, and Ccser1, have been well studied for their contributions to synapse formation, myelination, synaptic plasticity, and connectivity [47–49], suggesting the final stage of neurogenesis with synaptogenesis and the formation of synaptic connections between neurons at E16.5. The subsequent GO-based enrichment analysis (Fig. 5D) revealed distinct functional enrichments during different developmental stages in the mouse embryonic brain. Negative regulation of haemopoiesis was observed at E9.5, followed by dendrite morphogenesis at E12.5, early endosome at E13.5, synaptic cleft at E14.5, long-term synaptic potentiation at E15.5, and synaptic membrane adhesion at E16.5. These findings were in line with the major developmental events observed at different embryonic stages, encompassing the initiation of neurogenesis (E9.5) [50], early neuronal connection (E12.5) [51], increased neurogenesis (E13.5, E14.5, and E15.5) [52, 53], and the refinement of synaptic plasticity (E16.5).

We further demonstrated the effectiveness of spatiAlign for combined trajectory analysis by employing 2 distinct approaches: PAGA, a graph abstraction technique based on low-dimensional embedding space [35], and CellRank [54], a state-of-the-art cell fate mapping algorithm using a high-dimensional count matrix as input. The PAGA graph of spatiAlign embeddings (Fig. 5E) exhibited a nearly linear development trajectory from E9.5 to E16.5, as well as a high similarity between adjacent time points. Furthermore, the age-specific genes that were identified could be traced along the PAGA path (Fig. 5F). We proceeded to compare the reconstructed trajectory obtained from CellRank using 2 different inputs: the raw and spatiAlign-reconstructed feature matrices. The recovered trajectory, derived from reconstructed features (Fig. 5G, H), illustrated a clear transition path across cell types and a similar distribution across different time points, consistent with previous observations [55]. In contrast, the batch effects present in the raw count matrix may lead to infeasible and chaotic fate potentials across different batches (Supplementary Fig. S6c). Additionally, the expression patterns of reported driver genes associated with neuronal (i.e., Tuba1a [56], Tenm2 [57], Rbfox1) as well as nonneuronal (Dcn, Col1a1, Col1a2) development [58] (Fig. 5H) were consistent with the predicted cell fate, thereby validating the feasibility of the estimated pseudotime and affirming the reliability of our analysis.

Discussion

In this article, we develop spatiAlign, an advanced deep learning methodology that tackles the challenge of integrating multiple SRT datasets. spatiAlign first transforms spatial information into a neighboring adjacency matrix to perform spatial embedding that aggregates gene expression profiles together with spatial neighboring context for spot/cell representations. The obtained representations are subsequently fine-tuned through augmentation-based contrastive learning, which incorporates spatial context information to improve their informativeness and distinguishability. Next, regarding aligning biological effects, spatiAlign adopts across-domain adaptation and deep clustering strategies to bring the semantic similarity of spots/cells closer and push dissimilar spots/cells apart, regardless of which datasets they are from. Collectively, beyond SRT dataset integration and batch effect correction, spatiAlign-integrated datasets can be used for downstream analysis, such as identifying combined clusters and DEGs and trajectory inference.

Naturally, one might be concerned that achieving a sufficient mix of serial tissue sections could result in the inability to distinguish spots/cells from different clusters. Therefore, in this study, we introduce a weighted F1 score of LISI, which evaluates the integration mixing and separation of each cluster, to perform comparison analysis. We presented a series of benchmarking analyses on 4 publicly available SRT datasets with different characteristics. On the human DLPFC datasets, with the manual annotation as ground truth, spatiAlign achieves the highest ARI and weighted F1 score of LISI compared with other control methods. This quantitative assessment highlights its superiority in integrating different samples while also identifying separate clusters. Furthermore, the superior performance of spatiAlign on aggregated datasets of olfactory bulbs sequenced by different platforms demonstrates its efficacy in integrating multiple datasets with complex technical variations. In addition, the reconstructed expression of region-specific marker genes exhibits a greater spatial specificity compared with the original data. However, we point out here that our effort was not intended to develop a new imputation method over existing methods but to demonstrate that spatiAlign-reconstructed matrices denoise gene counts.

Moreover, there is concern regarding the potential loss of distinct biological characteristics during the batch alignment process. Herein, we unequivocally affirm that spatiAlign not only effectively preserves the intrinsic variation among sections but also adeptly harmonizes batches, as demonstrated through its successful application to 3 distinct brain sections characterized by heterogeneous structures. However, the benchmarked methods are unable to match the performance of spatiAlign. When applied to a time-series dataset, spatiAlign significantly facilitates downstream analysis, such as combined clustering, combined differential expression analysis, and trajectory inference. In the results, various subtypes of neurons were successfully identified, with the typical marker genes displaying the highest expression in their corresponding cell types. Upon analyzing the reconstructed full expression space, we identified DEGs and significant GO terms specific to different developmental stages that showed high consistency with previous studies on mouse brain development. Comparing the trajectories inferred from corrected expression features and the raw data, we verify that spatiAlign not only aligns multiple batches into a joint low-dimensional embedding space but also corrects the batch effects in their full expression space. This capability empowers users to perform preprocessing for methods that require a full gene expression matrix, such as CellRank.

To further explore more potentials of spatiAlign, we conducted benchmarking analyses on datasets involving liver tumors [59], which lack a stereotypical structure, and MERFISH datasets [60], which only provide profiles for a few hundred genes. As shown in Supplementary Figs. S7 and S8, spatiAlign effectively corrected batch effects and identified distinct cell clusters. In contrast, the outputs of the benchmarked methods still exhibited noticeable batch effects or resulted in fragmented cell clusters. Moreover, spatiAlign achieved the highest F1 score among all the control methods. These findings highlight the tremendous potential of spatiAlign in applications involving a wider range of SRT datasets. To unleash more potentials of spatiAlign in clinical studies, we plan to collect and integrate additional case/control datasets to investigate the comprehensive molecular mechanisms between disease and healthy conditions.

We have developed spatiAlign, a user-friendly tool designed to seamlessly integrate SRT datasets, offering a novel and effective solution. As an unsupervised domain adaptation method, spatiAlign is inherently influenced by various dataset characteristics, such as phenotypical differences and cell heterogeneity. The selection of hyperparameters during the construction of spatial neighborhood graphs also affects the performance of spatiAlign. Looking ahead, our vision is to extend spatiAlign’s capabilities to support integrative and multimodal spatial molecular dataset analysis, including epigenetics, proteomics, and microbiomics, and facilitate deeper exploration of biological phenomena and significantly contribute to scientific discoveries.

Methods

Motivation for the use of across-domain adaptation contrastive learning

As genomic sequencing technology continues to advance, an increasing number of SRT datasets are being generated from various platforms. Joint analysis of multiple datasets can be used to facilitate the extraction of maximum reliable information, but inconsistent data distributions between different sections due to batch effects may affect the reliability of downstream analysis results. To address this issue and maximize the preservation of biological variations, it is desirable to amalgamate disparate datasets and bring similar cell types closer together while keeping dissimilar cell types far apart. Across-domain adaptation contrastive learning, an unsupervised domain adaptation method, can be used for this purpose. This method can align data distributions, preserve biological variations, and remove batch effects while also incorporating spatial information of the SRT dataset into the newly generated latent embedding and reconstructed matrix.

Data preprocessing

spatiAlign utilizes a series of gene expression matrices and associated spatial coordinates as inputs. The gene expression profiles are stored in a Inline graphic matrix of unique molecular identifier (UMI) counts, where N is the number of spots/cells and D is the number of genes, and it also includes 2-dimensional spatial coordinates for each spot/cell. The raw gene expression matrices were first filtered according to criteria and for each dataset using SCANPY (version: 1.9.1), followed by normalization and log transformation of individual spots.

Spatial neighbor graph construction for the SRT dataset

To fully exploit the spatial local neighboring context, we convert the spatial coordinates into an undirected neighborhood graph Inline graphic by Euclidean distance with a predefined neighbor parameter k, where V represents the SRT dataset spots/cells and E represents the connected edges between the current spot/cell and neighboring spots/cells. The adjacency matrix of graph G is denoted by A, in which spot/cell with k nearest-neighbor spots/cells; if spot/cell Inline graphic is the neighbor of spot/cell u, ; otherwise, it is 0. Specifically, we selected the top 15 nearest neighbors for each spot/cell in the SRT gene expression spatial coordinates.

Batch-specific variations to separate using domain-specific batch normalization

Batch normalization (BN) [61] is widely used to solve the problem of internal covariate shift during DNN training. It can reduce the problems of vanishing gradients and overfitting. For a mini-batch of data Inline graphic , the BN layer can be calculated using the following parameterization:

(1)

where Inline graphic is the mean of the mini-batch, is the variance of the mini-batch, is the normalized output by the mean and variance of the mini-batch, is a small constant to avoid dividing by zero, and is the output of the BN layer, which is obtained by scaling and shifting with learned parameters Inline graphic and .

Domain-specific batch normalization (DSBN) [62] is used in unsupervised domain adaptation with multiple source datasets to separate domain-specific variations from different datasets. In spatiAlign, DSBN consists of multiple sets of BN layers that select the corresponding BN with the batch label b. DSBN can be represented as follows:

(2)

where Inline graphic and are batch-specific affine parameters for batch b.

DSBN has been proposed to capture and utilize the batch-specific information in datasets by learning affine parameters for each dataset, which enables the model to learn the batch-specific variations that exist within the datasets [13, 62].

DGI-based feature extractor for reducing dimensions and propagating spatial neighboring context

spatiAlign leverages the DGI framework to project a gene expression matrix into a latent space while simultaneously performing dimension reduction and propagating spatial neighboring spots/cells context. To reduce the dimension of higher-dimensional SRT data, we employed a DNN-based autoencoder mapping model Inline graphic , where represents the parameters of the mapping model, is a latent embedding with dimensionsd, and . The DNN-based mapping model, a feature embedding block, consists of a fully connected block and 2 stacked residual bottleneck blocks. Specifically, the fully connected block comprises a linear connected layer, a DSBN layer, an exponential linear unit (ELU) as a nonlinear activation function, and a dropout layer in sequence. Each residual bottleneck block consists of 2 stacked fully connected blocks, and the output of the residual bottleneck block is passed through an ELU layer (Fig. 1B). Notably, the feature embedding block only takes the gene expression matrix as input.

To propagate the spatial neighboring context in the reduced dimensionality space, we employ a VGAE framework. The VGAE framework takes the latent embedding Z obtained from the feature embedding model and the adjacency matrix A as input and generates Y as output. The VGAE encoder includes 2 stacked graph convolutional network (GCN) layers and uses the rectified linear unit (ReLU) as a nonlinear activation function. The first GCN layer generates a lower-dimensional spatial embedding and aggregates the spatial neighboring context, while the second GCN layer generates the mean Inline graphic and variance . The spatial embedding Y is then reparametrized from , where . The final latent representation S is generated from the feature fusion block, which includes 2 stacked fully connected layers, as well as a DSBN layer followed by each connected layer in sequence, and takes concatenated feature embedding as input, which is obtained by concatenating the reduction dimensionality embedding Z and the spatial embedding Y. The final latent embedding S is then used to reconstruct the original gene expression matrix Inline graphic in the DNN-based autoencoder and the spatial neighboring adjacency matrix in the VGAE network.

Training the DNN-based autoencoder and VGAE network minimizes the loss of the reconstructed gene expression matrix and maximizes the log-likelihood of the observed SRT sequencing latent representation S. We first employed the scale-invariant mean squared error (MSE) [63] to measure the DNN-based loss. In addition, the loss function of the VGAE includes a binary cross-entropy loss to minimize the difference between the input spatial neighboring adjacency matrix A and the reconstructed adjacency matrix Inline graphic . Additionally, a Kullback‒Leibler divergence loss was used to optimize the log-likelihood between the posterior distribution and prior distribution , where . The dimension reduction and spatial neighboring context propagation loss can be calculated as follows:

(3)

where Inline graphic is the binary cross-entropy, is the Kullback‒Leibler divergence, is the scale-invariant MSE, and are hyperparameters.

(4)

where k is the number of spots/cells in the input gene expression matrix, Inline graphic is a vector of ones of length k, is the squared norm, and is a hyperparameter.

Self-supervised contrastive learning for representation enhancement

DGI is a self-supervised learning architecture that maximizes mutual information between local neighbors of a graph to learn representations of nodes. spatiAlign takes original and corrupted gene expression matrices as inputs and generates latent representation matrices S and Inline graphic , respectively. The corrupted matrix is a rowwise random perturbation of the original matrix, and we assume that the corrupted gene expression profiles have the same neighboring adjacency matrix as the original profiles. Formally, given a spot i, we form a positive pair consisting of its representation Inline graphic and the neighboring graph spot vector g, while the corresponding corrupted representation from the corrupted matrix and the same neighboring graph spot vector g form a negative pair. A self-supervised contrastive learning method was used to train the DGI framework, and the loss function was designed to maximize the mutual information of positive pairs while minimizing the mutual information of negative pairs:

(5)

where Inline graphic is a discriminator, a bilinear layer, and follows a sigmoid layer, to distinguish the positive pairs from negative pairs.

Biological effects alignment using across-domain adaptation contrastive learning

To align biological effects using across-domain adaptation contrastive learning, we propose a criterion for forming pairs based on the assumption that datasets from multiple tissue sections share at least 1 common cell type in the current alignment setting. To achieve this, we perform in-batch instance-level contrastive learning and across-batch instance-level contrastive learning for each tissue section separately. Specifically, we maintain a memory bank Inline graphic for each tissue section, which is used to store the latent embedding and prototype spot/cell type representations within the batch.

(6)

where Inline graphic is the stored feature vector of , initialized with final latent representation S, and updated with a momentum m after each iteration for each dataset:

(7)

In-batch instance level contrastive learning

The pairwise similarity distributions Inline graphic are measured by the cosine distance between latent embedding and the corresponding memory bank to perform in-batch instance discrimination,

(8)

where Inline graphic is the temperature parameter, which can determine the concentration level of the similarity distribution. Finally, cross-entropy was employed to minimize the in-batch instance discrimination.

(9)

where i denotes the unique index of the spot of Inline graphic .

Pseudoprototypical cluster-level contrastive learning

Inspired by unsupervised contrastive clustering [32], we map each spot/cell i into an embedding space with d dimensions, where d is equal to the number of pseudoprototypical clusters. Since each spot belongs to only 1 cluster, ideally, the row of the latent embedding Inline graphic tends to be one-hot, meaning that the column of represents the cluster. Similar to in-batch instance-level contrastive learning, our method uses cosine distance to measure the similarity between latent embedding and the corresponding memory bank and maximize the pseudo cluster pair similarity using cross-entropy. Specifically, the loss function can be expressed as

(10)

where Inline graphic is the entropy of the pseudo-cluster assignment probabilities , which can help to avoid the trivial solution in which most spots are assigned to the same cluster [32].

Across-batch instance self-supervised learning

To explicitly align biological effects and ensure that spatiAlign learns discriminative representations of dissimilar cell types between different batches, we perform across-batch feature matching. Specifically, we minimize the entropy of the pairwise similarity distribution between latent embeddings in one batch and the latent embeddings stored in the memory bank of another batch. The loss function for across-batch spot/cell pair matching can be formalized as

(11)

The overall objective for spatiAlign is to minimize:

(12)

Comparisons of methods

We conducted a comparative analysis between spatiAlign and other state-of-the-art methods of data integration using 4 comprehensive representative SRT datasets that exhibit diverse characteristics.

We applied the following integration methods: (1) Harmony [8] implemented in the SCANPY package external module harmony_integrate, (2) Combat [64] implemented in the SCANPY package module combat, (3) Scanorama [11] implemented in the SCANPY package external module scanorama_integrate, (4) BBKNN [12] implemented in the SCANPY package external module bbknn, (5) MNN [15] implemented in the SCANPY package external module mnn_correct, (6) SCALEX [13] implemented in the Python package scalex, and spatial-base methods: (7) PRECAST [24] implemented in the R package PRECAST and (8) GraphST [25] implemented in the Python package GraphST. We input the preprocessed datasets into spatiAlign and several other tested methods. The first 6 methods were developed for scRNA-seq datasets, whereas PRECAST and GraphST were specifically designed for SRT datasets.

Evaluation metrics

We evaluate the performance of spatiAlign and other control methods in both data integration and the preservation of biological variation using the following metric.

F1 score of local inverse Simpson’s index

To simultaneously evaluate the separation of same-cell-type aggregation and across-batch fusion in the data integration, we calculated the LISI [8] using 2 different groupings: (i) grouping using different datasets as the batch Inline graphic and (ii) grouping using known cell types as the spot . In the data integration, a larger value of indicates sufficient mixing of the different batch datasets, while a smaller value of suggests better preservation of the biological variations between spot types. The 2 metrics can be summarized using the Inline graphic score as follows:

(13)

where Inline graphic is a smaller constant. A higher score indicates superior data integration, which effectively retains the biological variations between spot types while eliminating other noncellular biological variations across multiple batches, thereby enhancing the fidelity of the biological information.

Adjusted Rand index

To evaluate the efficacy of merge clustering when utilizing lower-dimensional gene expression representations, we utilized the ARI [34] as a performance metric. ARI represents an enhanced version of the Rand index (RI), which overcomes several of its limitations. By measuring the degree of similarity between 2 partitions, ARI provides a numerical value that ranges between −1 and 1, with a higher value indicating a higher degree of similarity between the 2 partitions being compared. Moreover, ARI attains a value of 1 when the 2 partitions under comparison are equivalent up to a permutation. Hence, ARI serves as a reliable and robust tool for evaluating the performance of merge clustering approaches.

Hierarchical clustering, Moran’s I, and the join counts index calculation

The spatial regions were identified by a hierarchical clustering algorithm with a lower-dimensional representation from different methods. The agglomerative clustering function in the scikit-learn package was implemented with 16 clusters (n_cluster = 16). Then, we calculate the global Moran’s I index for each region on each slice. First, the batch labels were encoded to one-hot vectors, and spatial coordinates were used to calculate spatial neighbors (edge weights = 1). Then, the Moran function and the join counts function in the ESDA (2.4.3) Python package was applied to calculate Moran’s I index and “black-black” statistic.

Differential expression analysis and GO enrichment analysis

We employed the FindMarkers() function of the Scanpy package to identify DEGs for the spatial domain using “T test” implementation and cutting of the adjusted P value at 0.05. To perform GO enrichment analysis for the DEGs, we utilized the ClusterProfiler (v4.8.1) R package.

Trajectory inference analysis

We used the joint low-dimensional embeddings that were generated from spatiAlign and the control methods to infer the PAGA [35] path by the scanpy.tl.paga function in SCANPY. CellRank [54] was implemented to estimate pseudotime using the CytoTraceKernel algorithm and compute_transition_matrix beyond RNA velocity because the spliced and unspliced counts were not available in the mouse embryonic brain datasets. Before applying the CytoTRACE kernel on the raw expression matrix, we performed some basic preprocessing of the data using sc.pp.normalize_per_cell and sc.pp.log1p. However, since the expression counts were already normalized before being inputted into spatiAlign, we did not perform additional normalization on the spatiAlign-adjusted expression matrix. We visualized the directed transition matrix CellRank calculated with the same sort of arrows that are used for RNA velocity. There is no RNA velocity in this study.

Availability of Source Code and Requirements

Project name: spatiAlign

Project homepage: https://github.com/STOmics/Spatialign.git

Tutorials: https://spatialign-tutorials.readthedocs.io/en/latest/index.html

Operating system(s): Platform independent

Programming language: Python 3.8 or higher

License: MIT License

RRID: SCR_024828

BiotoolsID: spatiAlign

Supplementary Material

giae042_GIGA-D-23-00259_Original_Submission

giae042_giga-d-23-00259_original_submission.pdf^{(23.4MB, pdf)}

giae042_GIGA-D-23-00259_Revision_1

giae042_giga-d-23-00259_revision_1.pdf^{(53.9MB, pdf)}

giae042_GIGA-D-23-00259_Revision_2

giae042_giga-d-23-00259_revision_2.pdf^{(53.6MB, pdf)}

giae042_GIGA-D-23-00259_Revision_3

giae042_giga-d-23-00259_revision_3.pdf^{(53.5MB, pdf)}

giae042_Response_to_Reviewer_Comments_Original_Submission

giae042_response_to_reviewer_comments_original_submission.pdf^{(93.6KB, pdf)}

giae042_Response_to_Reviewer_Comments_Revision_1

giae042_response_to_reviewer_comments_revision_1.pdf^{(75.5KB, pdf)}

giae042_Response_to_Reviewer_Comments_Revision_2

giae042_response_to_reviewer_comments_revision_2.pdf^{(47.7KB, pdf)}

giae042_Reviewer_1_Report_Original_Submission

Lambda Moses -- 10/15/2023 Reviewed

giae042_reviewer_1_report_original_submission.pdf^{(131.8KB, pdf)}

giae042_Reviewer_1_Report_Revision_1

Lambda Moses -- 1/13/2024 Reviewed

giae042_reviewer_1_report_revision_1.pdf^{(118.9KB, pdf)}

giae042_Reviewer_2_Report_Original_Submission

Stefano Monti -- 10/30/2023 Reviewed

giae042_reviewer_2_report_original_submission.pdf^{(114KB, pdf)}

giae042_Reviewer_3_Report_Original_Submission

Jose Fernandez Navarro, PhD -- 10/31/2023 Reviewed

giae042_reviewer_3_report_original_submission.pdf^{(128KB, pdf)}

giae042_Reviewer_3_Report_Revision_1

Jose Fernandez Navarro, PhD -- 1/18/2024 Reviewed

giae042_reviewer_3_report_revision_1.pdf^{(119.3KB, pdf)}

giae042_Reviewer_3_Report_Revision_2

Jose Fernandez Navarro, PhD -- 3/25/2024 Reviewed

giae042_reviewer_3_report_revision_2.pdf^{(120KB, pdf)}

giae042_Supplemental_Figures_and_Tables

giae042_supplemental_figures_and_tables.zip^{(112.6MB, zip)}

Acknowledgement

We thank China National GeneBank for providing data support for this study, Guangdong Bigdata Engineering Technology Research Center for Life Sciences for supporting this study, and Prof. Dr. Junjun Jiang, Dr. Jian Zhang, Dr. Ke Fan, and Dr. Min Xie for their help.

Contributor Information

Chao Zhang, BGI Research, Shenzhen 518083, China.

Lin Liu, BGI Research, Shenzhen 518083, China.

Ying Zhang, BGI Research, Shenzhen 518083, China.

Mei Li, BGI Research, Shenzhen 518083, China.

Shuangsang Fang, BGI Research, Shenzhen 518083, China; BGI Research, Beijing 102601, China.

Qiang Kang, BGI Research, Shenzhen 518083, China.

Ao Chen, BGI Research, Shenzhen 518083, China; BGI Research, Chongqing 401329, China.

Xun Xu, BGI Research, Wuhan 430074, China.

Yong Zhang, BGI Research, Shenzhen 518083, China; BGI Research, Wuhan 430074, China; Guangdong Bigdata Engineering Technology Research Center for Life Sciences, BGI Research, Shenzhen 518083, China.

Yuxiang Li, BGI Research, Shenzhen 518083, China; BGI Research, Wuhan 430074, China; Guangdong Bigdata Engineering Technology Research Center for Life Sciences, BGI Research, Shenzhen 518083, China.

Additional Files

Supplementary Fig. S1. Manual annotation of human DLPFC datasets and joint clustering results from spatiAlign and other control methods, related to Fig. 2. (a) Manual annotation of 4 DLPFC sections from the original study. (b) UMAP plots for joint Leiden clusters (Leiden) from spatiAlign and the control methods, together with the final clusters (Mapping) that merged Leiden clusters with the ground truth using a maximum matching strategy. (c–f) Spatial visualization of the Leiden clusters and the mapping clusters of sample ID 151673 (c), sample ID 151674 (d), sample ID 151675 (e), and sample ID 151676 (f).

Supplementary Fig. S2. Benchmarking analysis on human DLPFC datasets, related to Fig. 2. (a, b) Bar plots of integration LISI (iLISI) (a) and cell-type LISI (cLISI) (b) scores for integration results from different methods. (c) Visualization of spatiAlign-denoised (top panel) and unadjusted normalized (bottom panel) spatial expression of layer-marker genes in sample 151674.

Supplementary Fig. S3. Manual annotation of olfactory bulb datasets, related to Fig. 3. (a) UMAP plot for the left clusters of a Stereo-seq olfactory bulb dataset and its spatial visualization (b). (c) Heatmap of marker genes associated with their cell types. (d–f) Spatial pattern of marker genes and the corresponding cell types on the 3 olfactory bulb slices.

Supplementary Fig. S4. Benchmarking analysis on olfactory bulb datasets, related to Fig. 3. (a–c) Bar plots of integration LISI (iLISI) (a), cell-type LISI (cLISI) (b), and ARI (c) scores for integration results from different methods. (d, e) Spatial visualization of spatiAlign-enhanced (top panel) and raw (middle panel) spatial expression of marker genes, together with their corresponding cell types (bottom panel), on 2 olfactory bulb sections.

Supplementary Fig. S5. Integration results of 3 mouse hippocampus slices from the control methods, related to Fig. 4. (a) UMAP plots for the joint clustering result from the control methods, colored by slices (top panel) and cluster labels (bottom panel). (b) Spatial visualization of the joint clustering results from the control methods on the 3 slices. (c) Expression heatmaps of marker genes for the CA1, CA3, and DG regions in joint clusters from spatiAlign and the control methods. Clusters with high expression specificity are highlighted by red boxes. (d) Spatial visualization of the hippocampus-related regions on 3 slices identified by the control methods.

Supplementary Fig. S6. Application to time-series mouse embryonic brain, related to Fig. 5. (a) Spatial visualization of the labeled clusters and the corresponding marker genes. (b) Expression heatmap of the top 5 differentially expressed genes from E9.5 to E16.5. (c) CellRank trajectory of cell types reconstructed using the raw expression counts. (d) Estimated pseudotime scores by spatiAlign-corrected gene expression matrices.

Supplementary Fig. S7. Application to liver cancer datasets. (a) Spatial visualization of the Stereo-seq datasets from the tumor margin area of 2 liver cancer patients. On the left, the color scheme represents total UMI counts, which provide information on gene expression levels. In the middle, the color scheme represents cell types annotated in the original study. Given the extensive size of the original data, we cropped each subslice of these 2 datasets (right panel) to do the integration benchmark analysis. (b) UMAP plots for the spatiAlign and control methods, colored by slices (top panel) and cell types (bottom panel). (c–e) Bar plots of LISI. (c) Integration LISI (iLISI). (d) Category LISI (cLISI). (e) F1 score of LISI. (f) Top: Visualization of tumor boundary using Leiden clustering on spatiAlign embedding, colored by cluster types. Bottom: Manual mapping of tumor boundary area, colored by proximity to tumor (darker color indicates closer proximity).

Supplementary Fig. S8. Application to MERFISH datasets. (a, b) Spatial visualization of the 2 MERFISH datasets, colored by cell type. (c) UMAP plots for the spatiAlign and control methods, colored by slices (top panel) and cell types (bottom panel). (d–f) Bar plots of LISI. (d) F1 score of LISI. (e) Integration LISI (iLISI). (f) Category LISI (cLISI).

Supplementary Table S1. Source of mouse hippocampal slices.

Abbreviations

ARI: adjusted Rand index; BN: batch normalization; DEG: differentially expressed gene; DG: dentate gyrus; DGI: deep graph infomax; DLPFC: dorsolateral prefrontal cortex; DNN: deep neural network; DSBN: domain-specific batch normalization; ELU: exponential linear unit; GCN: graph convolutional network; GO: Gene Ontology; iLISI: integrated LISI; LISI: local inverse Simpson’s index; MSE: mean squared error; ReLU: rectified linear unit; RI: Rand index; scRNA-seq: single-cell RNA sequencing; SRT: spatially resolved transcriptomics; UMAP: uniform manifold approximation and projection; UMI: unique molecular identifier; VGAE: variational graph autoencoder; WM: white matter.

Author Contributions

Conceptualization: Chao Zhang. Project administration and supervision: Xun Xu, Ao Chen, Yong Zhang, and Yuxiang Li. Algorithm development and implementation: Chao Zhang. Public datasets collection, processing, and application: Chao Zhang, Lin Liu, Ying Zhang, Mei Li, and Shuangsang Fang. Methods comparisons: Chao Zhang, Lin Liu, and Ying Zhang. Biological interpretation: Chao Zhang, Lin Liu, and Ying Zhang. Manuscript writing and figure generation: Chao Zhang, Lin Liu, and Ying Zhang. Manuscript reviewing: Shuangsang Fang, Qiang Kang. All authors approved the manuscript.

Funding

This study was funded by National Key R&D Program of China (2022YFC3400400).

Data Availability

The public datasets are freely available as follows: the Stereo-seq data have been deposited into the CNGB Sequence Archive (CNSA) of the China National GenBank DataBase (CNGBdb) with accession number CNP0001543. The spatiotemporal dataset of the mouse embryonic brain can be download from MOSTA [4], and the 10x Genomics Visium mouse olfactory bulb data can be download from the 10x Genomics website [65]. The LIBD human dorsolateral prefrontal cortex (DLPFC) dataset and mouse breast datasets can be download from Zenodo [66]. Mouse hippocampus data can be downloaded from the slide-seq website [3, 38, 39]. All processed data are available in Zenodo [67]. Additional supporting data, also including a link to DOME-ML (Data, Optimization, Model and Evaluation in Machine Learning) annotations, are available via the GigaScience database, GigaDB [68].

Competing Interests

The authors declare no competing interests.

References

1. Marx V. Method of the Year: spatially resolved transcriptomics. Nat Methods. 2021;18(1):9–14. 10.1038/s41592-020-01033-y. [DOI] [PubMed] [Google Scholar]
2. Stahl PL, Salmen F, Vickovic S, et al. Visualization and analysis of gene expression in tissue sections by spatial transcriptomics. Science. 2016;353(6294):78–82. 10.1126/science.aaf2403. [DOI] [PubMed] [Google Scholar]
3. Rodriques SG, Stickels RR, Goeva A, et al. Slide-seq: a scalable technology for measuring genome-wide expression at high spatial resolution. Science. 2019;363(6434):1463–67. 10.1126/science.aaw1219. [DOI] [PMC free article] [PubMed] [Google Scholar]
4. Chen A, Liao S, Cheng M, et al. Spatiotemporal transcriptomic atlas of mouse organogenesis using DNA nanoball-patterned arrays. Cell. 2022;185(10):1777–92. e1721. 10.1016/j.cell.2022.04.003. [DOI] [PubMed] [Google Scholar]
5. Cho CS, Xi J, Si Y, et al. Microscopic examination of spatial transcriptome using Seq-Scope. Cell. 2021;184(13):3559–72. e3522. 10.1016/j.cell.2021.05.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
6. Luecken MD, Buttner M, Chaichoompu K, et al. Benchmarking atlas-level data integration in single-cell genomics. Nat Methods. 2022;19(1):41–50. 10.1038/s41592-021-01336-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
7. Tran HTN, Ang KS, Chevrier M, et al. A benchmark of batch-effect correction methods for single-cell RNA sequencing data. Genome Biol. 2020;21(1):12. 10.1186/s13059-019-1850-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
8. Korsunsky I, Millard N, Fan J, et al. Fast, sensitive and accurate integration of single-cell data with Harmony. Nat Methods. 2019;16(12):1289–96. 10.1038/s41592-019-0619-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
9. Welch JD, Kozareva V, Ferreira A, et al. Single-cell multi-omic integration compares and contrasts features of brain cell identity. Cell. 2019;177(7):1873–87..e1817. 10.1016/j.cell.2019.05.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
10. Li X, Wang K, Lyu Y, et al. Deep learning enables accurate clustering with batch effect removal in single-cell RNA-seq analysis. Nat Commun. 2020;11(1):2338. 10.1038/s41467-020-15851-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
11. Hie B, Bryson B, Berger B. Efficient integration of heterogeneous single-cell transcriptomes using Scanorama. Nat Biotechnol. 2019;37(6):685–91. 10.1038/s41587-019-0113-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
12. Polański K, Young MD, Miao Z, et al. BBKNN: fast batch alignment of single cell transcriptomes. Bioinformatics. 2020;36(3):964–65. 10.1093/bioinformatics/btz625. [DOI] [PMC free article] [PubMed] [Google Scholar]
13. Xiong L, Tian K, Li Y, et al. Online single-cell data integration through projecting heterogeneous datasets into a common cell-embedding space. Nat Commun. 2022;13(1):6118. 10.1038/s41467-022-33758-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
14. Johnson WE, Li C, Rabinovic A. Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics. 2007;8(1):118–27. 10.1093/biostatistics/kxj037. [DOI] [PubMed] [Google Scholar]
15. Haghverdi L, Lun AT, Morgan MD, et al. Batch effects in single-cell RNA-sequencing data are corrected by matching mutual nearest neighbors. Nat Biotechnol. 2018;36(5):421–27. 10.1038/nbt.4091. [DOI] [PMC free article] [PubMed] [Google Scholar]
16. Stuart T, Butler A, Hoffman P, et al. Comprehensive integration of single-cell data. Cell. 2019;177(7):1888–902. e1821. 10.1016/j.cell.2019.05.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
17. Lotfollahi M, Wolf FA, Theis FJ: scGen predicts single-cell perturbation responses. Nat Methods. 2019;16(8):715–21. 10.1038/s41592-019-0494-8. [DOI] [PubMed] [Google Scholar]
18. Hu J, Li X, Coleman K, et al. SpaGCN: integrating gene expression, spatial location and histology to identify spatial domains and spatially variable genes by graph convolutional network. Nat Methods. 2021;18(11):1342–51. 10.1038/s41592-021-01255-8. [DOI] [PubMed] [Google Scholar]
19. Dong K, Zhang S. Deciphering spatial domains from spatially resolved transcriptomics with an adaptive graph attention auto-encoder. Nat Commun. 2022;13(1):1739. 10.1038/s41467-022-29439-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
20. Zhao Y, Wang K, Hu G. DIST: spatial transcriptomics enhancement using deep learning. Brief Bioinform. 2023;24:(2):bbad013. 10.1093/bib/bbad013. [DOI] [PubMed] [Google Scholar]
21. Wang Y, Song B, Wang S, et al. Sprod for de-noising spatially resolved transcriptomics data based on position and image information. Nat Methods. 2022;19(8):950–58. 10.1038/s41592-022-01560-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
22. Zhao E, Stone MR, Ren X, et al. Spatial transcriptomics at subspot resolution with BayesSpace. Nat Biotechnol. 2021;39(11):1375–84. 10.1038/s41587-021-00935-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
23. Shen R, Liu L, Wu Z, et al. Spatial-ID: a cell typing method for spatially resolved transcriptomics via transfer learning and spatial embedding. Nat Commun. 2022;13(1):7640. 10.1038/s41467-022-35288-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
24. Liu W, Liao X, Luo Z, et al. Probabilistic embedding, clustering, and alignment for integrating spatial transcriptomics data with PRECAST. Nat Commun. 2023;14(1):296. 10.1038/s41467-023-35947-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
25. Long Y, Ang KS, Li M, et al. Spatially informed clustering, integration, and deconvolution of spatial transcriptomics with GraphST. Nat Commun. 2023;14(1):1155. 10.1038/s41467-023-36796-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
26. Velickovic P, Fedus W, Hamilton WL, et al. Deep graph infomax. ICLR (Poster). 2019;2(3):4. https://arxiv.org/pdf/1809.10341. [Google Scholar]
27. Kipf TN, Welling M. Variational graph auto-encoders. arXiv preprint arXiv:1611.07308 2016. https://arxiv.org/abs/1611.07308.
28. Wang R, Wu Z, Weng Z, et al. Cross-domain contrastive learning for unsupervised domain adaptation. IEEE Trans Multimedia. 2022;25:1665–73.. 10.1109/TMM.2022.3146744. [DOI] [Google Scholar]
29. You Y, Chen T, Sui Y, et al. Graph contrastive learning with augmentations. Adv Neural Inform Process Syst. 2020;33:5812–23.. https://proceedings.neurips.cc/paper_files/paper/2020/file/3fe230348e9a12c13120749e3f9fa4cd-Paper.pdf. [Google Scholar]
30. Xie S, Zheng Z, Chen L, et al. Learning semantic representations for unsupervised domain adaptation. International Conference on Machine Learning. PMLR; 2018:5423–32.. https://proceedings.mlr.press/v80/xie18c/xie18c.pdf. [Google Scholar]
31. Yue X, Zheng Z, Zhang S, et al. Prototypical cross-domain self-supervised learning for few-shot unsupervised domain adaptation. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021:13834–44.. https://arxiv.org/pdf/2103.16765. [Google Scholar]
32. Li Y, Hu P, Liu Z, et al. Contrastive clustering. Proceedings of the AAAI Conference on Artificial Intelligence. 2021:8547–55.. https://arxiv.org/pdf/2009.09687. [Google Scholar]
33. Maynard KR, Collado-Torres L, Weber LM, et al. Transcriptome-scale spatial gene expression in the human dorsolateral prefrontal cortex. Nat Neurosci. 2021;24(3):425–36. 10.1038/s41593-020-00787-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
34. Hubert L, Arabie P. Comparing partitions. J Classification. 1985;2:193–218. 10.1007/BF01908075. [DOI] [Google Scholar]
35. Wolf FA, Hamey FK, Plass M, et al. PAGA: graph abstraction reconciles clustering with trajectory inference through a topology preserving map of single cells. Genome Biol. 2019;20:1–9. 10.1186/s13059-019-1663-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
36. Gilmore EC, Herrup K. Cortical development: layers of complexity. Curr Biol. 1997;7(4):R231–34. 10.1016/S0960-9822(06)00108-4. [DOI] [PubMed] [Google Scholar]
37. Zeng H, Shen EH, Hohmann JG, et al. Large-scale cellular-resolution gene profiling in human neocortex reveals species-specific molecular signatures. Cell. 2012;149(2):483–96. 10.1016/j.cell.2012.02.052. [DOI] [PMC free article] [PubMed] [Google Scholar]
38. Wang I-H, Murray E, Andrews G, et al. Spatial transcriptomic reconstruction of the mouse olfactory glomerular map suggests principles of odor processing. Nat Neurosci. 2022;25(4):484–92. 10.1038/s41593-022-01030-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
39. Cable DM, Murray E, Zou LS, et al. Robust decomposition of cell type mixtures in spatial transcriptomics. Nat Biotechnol. 2022;40(4):517–26. 10.1038/s41587-021-00830-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
40. Stickels RR, Murray E, Kumar P, et al. Highly sensitive spatial transcriptomics at near-cellular resolution with Slide-seqV2. Nat Biotechnol. 2021;39(3):313–19. 10.1038/s41587-020-0739-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
41. Sunkin SM, Ng L, Lau C, et al. Allen Brain Atlas: an integrated spatio-temporal portal for exploring the central nervous system. Nucleic Acids Res. 2013;41(D1):D996–D1008. 10.1093/nar/gks1042. [DOI] [PMC free article] [PubMed] [Google Scholar]
42. Zeisel A, Hochgerner H, Lönnerberg P, et al. Molecular architecture of the mouse nervous system. Cell. 2018;174(4):999–1014. e1022. 10.1016/j.cell.2018.06.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
43. Robertson SJ, Edwards FA. ATP and glutamate are released from separate neurones in the rat medial habenula nucleus: frequency dependence and adenosine-mediated inhibition of release. J Physiol. 1998;508(Pt 3):691. 10.1111/j.1469-7793.1998.691bp.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
44. La Manno G, Siletti K, Furlan A, et al. Molecular architecture of the developing mouse brain. Nature. 2021;596(7870):92–96. 10.1038/s41586-021-03775-x. [DOI] [PubMed] [Google Scholar]
45. Yokota Y, Mori S. Role of Id family proteins in growth control. J Cell Physiol. 2002;190(1):21–28. 10.1002/jcp.10042. [DOI] [PubMed] [Google Scholar]
46. Lyden D, Young AZ, Zagzag D, et al. Id1 and Id3 are required for neurogenesis, angiogenesis and vascularization of tumour xenografts. Nature. 1999;401(6754):670–77. 10.1038/44334. [DOI] [PubMed] [Google Scholar]
47. Sudhof TC. Neuroligins and neurexins link synaptic function to cognitive disease. Nature. 2008;455(7215):903–11. 10.1038/nature07456. [DOI] [PMC free article] [PubMed] [Google Scholar]
48. Mei L, Nave KA. Neuregulin-ERBB signaling in the nervous system and neuropsychiatric diseases. Neuron. 2014;83(1):27–49. 10.1016/j.neuron.2014.06.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
49. Jun H, Mohammed Qasim Hussaini S, Rigby MJ, et al. Functional role of adult hippocampal neurogenesis as a therapeutic strategy for mental disorders. Neural Plast. 2012;2012:854285. 10.1155/2012/854285. [DOI] [PMC free article] [PubMed] [Google Scholar]
50. Guillemot F, Zimmer C. From cradle to grave: the multiple roles of fibroblast growth factors in neural development. Neuron. 2011;71(4):574–88. 10.1016/j.neuron.2011.08.002. [DOI] [PubMed] [Google Scholar]
51. Kriegstein A, Alvarez-Buylla A. The glial nature of embryonic and adult neural stem cells. Annu Rev Neurosci. 2009;32:149–84. 10.1146/annurev.neuro.051508.135600. [DOI] [PMC free article] [PubMed] [Google Scholar]
52. Hatten ME, Heintz N. Mechanisms of neural patterning and specification in the developing cerebellum. Annu Rev Neurosci. 1995;18:385–408. 10.1146/annurev.ne.18.030195.002125. [DOI] [PubMed] [Google Scholar]
53. Rakic P. Evolution of the neocortex: a perspective from developmental biology. Nat Rev Neurosci. 2009;10(10):724–35. 10.1038/nrn2719. [DOI] [PMC free article] [PubMed] [Google Scholar]
54. Lange M, Bergen V, Klein M, et al. CellRank for directed single-cell fate mapping. Nat Methods. 2022;19(2):159–70. 10.1038/s41592-021-01346-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
55. Qiu C, Cao J, Martin BK, et al. Systematic reconstruction of cellular trajectories across mouse embryogenesis. Nat Genet. 2022;54(3):328–41. 10.1038/s41588-022-01018-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
56. Vayer T, Chapel L, Flamary R, et al. Fused Gromov-Wasserstein distance for structured objects. Algorithms. 2020;13(9):212. 10.3390/a13090212. [DOI] [Google Scholar]
57. Peterson VM, Zhang KX, Kumar N, et al. Multiplexed quantification of proteins and transcripts in single cells. Nat Biotechnol. 2017;35(10):936–39. 10.1038/nbt.3973. [DOI] [PubMed] [Google Scholar]
58. Maynard KR, Collado-Torres L, Weber LM, et al. Transcriptome-scale spatial gene expression in the human dorsolateral prefrontal cortex. Nat Neurosci. 2021;24:425–36. 10.1038/s41593-020-00787-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
59. Wu L, Yan J, Bai Y, et al. An invasive zone in human liver cancer identified by Stereo-seq promotes hepatocyte–tumor cell crosstalk, local immunosuppression and tumor progression. Cell Res. 2023;33:585–603. 10.1038/s41422-023-00831-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
60. Allen WE, Blosser TR, Sullivan ZA, et al. Molecular and spatial signatures of mouse brain aging at single-cell resolution. Cell. 2023;186(1):194–208. e18. 10.1016/j.cell.2022.12.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
61. Ioffe S, Szegedy C. Batch normalization: accelerating deep network training by reducing internal covariate shift. International Conference on Machine Learning.PMLR,2015:448–56.. https://arxiv.org/pdf/1502.03167. [Google Scholar]
62. Chang W-G, You T, Seo S, et al. Domain-specific batch normalization for unsupervised domain adaptation. Proceedings of the IEEE/CVF conference on Computer Vision and Pattern Recognition.2019:7354–62.. https://arxiv.org/pdf/1906.03950. [Google Scholar]
63. Bousmalis K, Trigeorgis G, Silberman N, et al. Domain separation networks. Adv Neural Inform Process Syst. 2016;29. https://proceedings.neurips.cc/paper_files/paper/2016/file/45fbc6d3e05ebd93369ce542e8f2322d-Paper.pdf. [Google Scholar]
64. Kim K-Y, Kim SH, Ki DH, et al. An attempt for combining microarray data sets by adjusting gene expressions. Cancer Res Treat. 2007;39(2):74–81. 10.4143/crt.2007.39.2.74. [DOI] [PMC free article] [PubMed] [Google Scholar]
65. 10X Genomics , Adult Mouse Olfactory Bulb, 2022. https://www.10xgenomics.com/datasets/adult-mouse-olfactory-bulb-1-standard.
66. Long Y. DeepST: a versatile graph contrastive learning framework for spatially informed clustering, integration and deconvolution of spatial transcriptomics (Version 1) [Data set]. Zenodo.2022. 10.5281/zenodo.6925603. [DOI]
67. Zhang C, Liu L, Zhang Y et al., spatiAlign: an unsupervised contrastive learning model for data integration of spatially resolved transcriptomics (Version 1) [Data set]. Zenodo. 2024. 10.5281/zenodo.10453192. [DOI] [PMC free article] [PubMed]
68. Zhang C, Liu L, Zhang Y, et al. Supporting data for “spatiAlign: An Unsupervised Contrastive Learning Model for Data Integration of Spatially Resolved Transcriptomics.”. GigaScience Database. 2024. 10.5524/102533. [DOI] [PMC free article] [PubMed]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Citations

Long Y. DeepST: a versatile graph contrastive learning framework for spatially informed clustering, integration and deconvolution of spatial transcriptomics (Version 1) [Data set]. Zenodo.2022. 10.5281/zenodo.6925603. [DOI]
Zhang C, Liu L, Zhang Y et al., spatiAlign: an unsupervised contrastive learning model for data integration of spatially resolved transcriptomics (Version 1) [Data set]. Zenodo. 2024. 10.5281/zenodo.10453192. [DOI] [PMC free article] [PubMed]
Zhang C, Liu L, Zhang Y, et al. Supporting data for “spatiAlign: An Unsupervised Contrastive Learning Model for Data Integration of Spatially Resolved Transcriptomics.”. GigaScience Database. 2024. 10.5524/102533. [DOI] [PMC free article] [PubMed]

Supplementary Materials

giae042_GIGA-D-23-00259_Original_Submission

giae042_giga-d-23-00259_original_submission.pdf^{(23.4MB, pdf)}

giae042_GIGA-D-23-00259_Revision_1

giae042_giga-d-23-00259_revision_1.pdf^{(53.9MB, pdf)}

giae042_GIGA-D-23-00259_Revision_2

giae042_giga-d-23-00259_revision_2.pdf^{(53.6MB, pdf)}

giae042_GIGA-D-23-00259_Revision_3

giae042_giga-d-23-00259_revision_3.pdf^{(53.5MB, pdf)}

giae042_Response_to_Reviewer_Comments_Original_Submission

giae042_response_to_reviewer_comments_original_submission.pdf^{(93.6KB, pdf)}

giae042_Response_to_Reviewer_Comments_Revision_1

giae042_response_to_reviewer_comments_revision_1.pdf^{(75.5KB, pdf)}

giae042_Response_to_Reviewer_Comments_Revision_2

giae042_response_to_reviewer_comments_revision_2.pdf^{(47.7KB, pdf)}

giae042_Reviewer_1_Report_Original_Submission

Lambda Moses -- 10/15/2023 Reviewed

giae042_reviewer_1_report_original_submission.pdf^{(131.8KB, pdf)}

giae042_Reviewer_1_Report_Revision_1

Lambda Moses -- 1/13/2024 Reviewed

giae042_reviewer_1_report_revision_1.pdf^{(118.9KB, pdf)}

giae042_Reviewer_2_Report_Original_Submission

Stefano Monti -- 10/30/2023 Reviewed

giae042_reviewer_2_report_original_submission.pdf^{(114KB, pdf)}

giae042_Reviewer_3_Report_Original_Submission

Jose Fernandez Navarro, PhD -- 10/31/2023 Reviewed

giae042_reviewer_3_report_original_submission.pdf^{(128KB, pdf)}

giae042_Reviewer_3_Report_Revision_1

Jose Fernandez Navarro, PhD -- 1/18/2024 Reviewed

giae042_reviewer_3_report_revision_1.pdf^{(119.3KB, pdf)}

giae042_Reviewer_3_Report_Revision_2

Jose Fernandez Navarro, PhD -- 3/25/2024 Reviewed

giae042_reviewer_3_report_revision_2.pdf^{(120KB, pdf)}

giae042_Supplemental_Figures_and_Tables

giae042_supplemental_figures_and_tables.zip^{(112.6MB, zip)}

Data Availability Statement

[bib1] 1. Marx V. Method of the Year: spatially resolved transcriptomics. Nat Methods. 2021;18(1):9–14. 10.1038/s41592-020-01033-y. [DOI] [PubMed] [Google Scholar]

[bib2] 2. Stahl PL, Salmen F, Vickovic S, et al. Visualization and analysis of gene expression in tissue sections by spatial transcriptomics. Science. 2016;353(6294):78–82. 10.1126/science.aaf2403. [DOI] [PubMed] [Google Scholar]

[bib3] 3. Rodriques SG, Stickels RR, Goeva A, et al. Slide-seq: a scalable technology for measuring genome-wide expression at high spatial resolution. Science. 2019;363(6434):1463–67. 10.1126/science.aaw1219. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib4] 4. Chen A, Liao S, Cheng M, et al. Spatiotemporal transcriptomic atlas of mouse organogenesis using DNA nanoball-patterned arrays. Cell. 2022;185(10):1777–92. e1721. 10.1016/j.cell.2022.04.003. [DOI] [PubMed] [Google Scholar]

[bib5] 5. Cho CS, Xi J, Si Y, et al. Microscopic examination of spatial transcriptome using Seq-Scope. Cell. 2021;184(13):3559–72. e3522. 10.1016/j.cell.2021.05.010. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib6] 6. Luecken MD, Buttner M, Chaichoompu K, et al. Benchmarking atlas-level data integration in single-cell genomics. Nat Methods. 2022;19(1):41–50. 10.1038/s41592-021-01336-8. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib7] 7. Tran HTN, Ang KS, Chevrier M, et al. A benchmark of batch-effect correction methods for single-cell RNA sequencing data. Genome Biol. 2020;21(1):12. 10.1186/s13059-019-1850-9. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib8] 8. Korsunsky I, Millard N, Fan J, et al. Fast, sensitive and accurate integration of single-cell data with Harmony. Nat Methods. 2019;16(12):1289–96. 10.1038/s41592-019-0619-0. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib9] 9. Welch JD, Kozareva V, Ferreira A, et al. Single-cell multi-omic integration compares and contrasts features of brain cell identity. Cell. 2019;177(7):1873–87..e1817. 10.1016/j.cell.2019.05.006. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib10] 10. Li X, Wang K, Lyu Y, et al. Deep learning enables accurate clustering with batch effect removal in single-cell RNA-seq analysis. Nat Commun. 2020;11(1):2338. 10.1038/s41467-020-15851-3. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib11] 11. Hie B, Bryson B, Berger B. Efficient integration of heterogeneous single-cell transcriptomes using Scanorama. Nat Biotechnol. 2019;37(6):685–91. 10.1038/s41587-019-0113-3. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib12] 12. Polański K, Young MD, Miao Z, et al. BBKNN: fast batch alignment of single cell transcriptomes. Bioinformatics. 2020;36(3):964–65. 10.1093/bioinformatics/btz625. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib13] 13. Xiong L, Tian K, Li Y, et al. Online single-cell data integration through projecting heterogeneous datasets into a common cell-embedding space. Nat Commun. 2022;13(1):6118. 10.1038/s41467-022-33758-z. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib14] 14. Johnson WE, Li C, Rabinovic A. Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics. 2007;8(1):118–27. 10.1093/biostatistics/kxj037. [DOI] [PubMed] [Google Scholar]

[bib15] 15. Haghverdi L, Lun AT, Morgan MD, et al. Batch effects in single-cell RNA-sequencing data are corrected by matching mutual nearest neighbors. Nat Biotechnol. 2018;36(5):421–27. 10.1038/nbt.4091. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib16] 16. Stuart T, Butler A, Hoffman P, et al. Comprehensive integration of single-cell data. Cell. 2019;177(7):1888–902. e1821. 10.1016/j.cell.2019.05.031. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib17] 17. Lotfollahi M, Wolf FA, Theis FJ: scGen predicts single-cell perturbation responses. Nat Methods. 2019;16(8):715–21. 10.1038/s41592-019-0494-8. [DOI] [PubMed] [Google Scholar]

[bib18] 18. Hu J, Li X, Coleman K, et al. SpaGCN: integrating gene expression, spatial location and histology to identify spatial domains and spatially variable genes by graph convolutional network. Nat Methods. 2021;18(11):1342–51. 10.1038/s41592-021-01255-8. [DOI] [PubMed] [Google Scholar]

[bib19] 19. Dong K, Zhang S. Deciphering spatial domains from spatially resolved transcriptomics with an adaptive graph attention auto-encoder. Nat Commun. 2022;13(1):1739. 10.1038/s41467-022-29439-6. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib20] 20. Zhao Y, Wang K, Hu G. DIST: spatial transcriptomics enhancement using deep learning. Brief Bioinform. 2023;24:(2):bbad013. 10.1093/bib/bbad013. [DOI] [PubMed] [Google Scholar]

[bib21] 21. Wang Y, Song B, Wang S, et al. Sprod for de-noising spatially resolved transcriptomics data based on position and image information. Nat Methods. 2022;19(8):950–58. 10.1038/s41592-022-01560-w. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib22] 22. Zhao E, Stone MR, Ren X, et al. Spatial transcriptomics at subspot resolution with BayesSpace. Nat Biotechnol. 2021;39(11):1375–84. 10.1038/s41587-021-00935-2. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib23] 23. Shen R, Liu L, Wu Z, et al. Spatial-ID: a cell typing method for spatially resolved transcriptomics via transfer learning and spatial embedding. Nat Commun. 2022;13(1):7640. 10.1038/s41467-022-35288-0. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib24] 24. Liu W, Liao X, Luo Z, et al. Probabilistic embedding, clustering, and alignment for integrating spatial transcriptomics data with PRECAST. Nat Commun. 2023;14(1):296. 10.1038/s41467-023-35947-w. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib25] 25. Long Y, Ang KS, Li M, et al. Spatially informed clustering, integration, and deconvolution of spatial transcriptomics with GraphST. Nat Commun. 2023;14(1):1155. 10.1038/s41467-023-36796-3. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib26] 26. Velickovic P, Fedus W, Hamilton WL, et al. Deep graph infomax. ICLR (Poster). 2019;2(3):4. https://arxiv.org/pdf/1809.10341. [Google Scholar]

[bib27] 27. Kipf TN, Welling M. Variational graph auto-encoders. arXiv preprint arXiv:1611.07308 2016. https://arxiv.org/abs/1611.07308.

[bib28] 28. Wang R, Wu Z, Weng Z, et al. Cross-domain contrastive learning for unsupervised domain adaptation. IEEE Trans Multimedia. 2022;25:1665–73.. 10.1109/TMM.2022.3146744. [DOI] [Google Scholar]

[bib29] 29. You Y, Chen T, Sui Y, et al. Graph contrastive learning with augmentations. Adv Neural Inform Process Syst. 2020;33:5812–23.. https://proceedings.neurips.cc/paper_files/paper/2020/file/3fe230348e9a12c13120749e3f9fa4cd-Paper.pdf. [Google Scholar]

[bib30] 30. Xie S, Zheng Z, Chen L, et al. Learning semantic representations for unsupervised domain adaptation. International Conference on Machine Learning. PMLR; 2018:5423–32.. https://proceedings.mlr.press/v80/xie18c/xie18c.pdf. [Google Scholar]

[bib31] 31. Yue X, Zheng Z, Zhang S, et al. Prototypical cross-domain self-supervised learning for few-shot unsupervised domain adaptation. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021:13834–44.. https://arxiv.org/pdf/2103.16765. [Google Scholar]

[bib32] 32. Li Y, Hu P, Liu Z, et al. Contrastive clustering. Proceedings of the AAAI Conference on Artificial Intelligence. 2021:8547–55.. https://arxiv.org/pdf/2009.09687. [Google Scholar]

[bib33] 33. Maynard KR, Collado-Torres L, Weber LM, et al. Transcriptome-scale spatial gene expression in the human dorsolateral prefrontal cortex. Nat Neurosci. 2021;24(3):425–36. 10.1038/s41593-020-00787-0. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib34] 34. Hubert L, Arabie P. Comparing partitions. J Classification. 1985;2:193–218. 10.1007/BF01908075. [DOI] [Google Scholar]

[bib35] 35. Wolf FA, Hamey FK, Plass M, et al. PAGA: graph abstraction reconciles clustering with trajectory inference through a topology preserving map of single cells. Genome Biol. 2019;20:1–9. 10.1186/s13059-019-1663-x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib36] 36. Gilmore EC, Herrup K. Cortical development: layers of complexity. Curr Biol. 1997;7(4):R231–34. 10.1016/S0960-9822(06)00108-4. [DOI] [PubMed] [Google Scholar]

[bib37] 37. Zeng H, Shen EH, Hohmann JG, et al. Large-scale cellular-resolution gene profiling in human neocortex reveals species-specific molecular signatures. Cell. 2012;149(2):483–96. 10.1016/j.cell.2012.02.052. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib38] 38. Wang I-H, Murray E, Andrews G, et al. Spatial transcriptomic reconstruction of the mouse olfactory glomerular map suggests principles of odor processing. Nat Neurosci. 2022;25(4):484–92. 10.1038/s41593-022-01030-8. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib39] 39. Cable DM, Murray E, Zou LS, et al. Robust decomposition of cell type mixtures in spatial transcriptomics. Nat Biotechnol. 2022;40(4):517–26. 10.1038/s41587-021-00830-w. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib40] 40. Stickels RR, Murray E, Kumar P, et al. Highly sensitive spatial transcriptomics at near-cellular resolution with Slide-seqV2. Nat Biotechnol. 2021;39(3):313–19. 10.1038/s41587-020-0739-1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib41] 41. Sunkin SM, Ng L, Lau C, et al. Allen Brain Atlas: an integrated spatio-temporal portal for exploring the central nervous system. Nucleic Acids Res. 2013;41(D1):D996–D1008. 10.1093/nar/gks1042. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib42] 42. Zeisel A, Hochgerner H, Lönnerberg P, et al. Molecular architecture of the mouse nervous system. Cell. 2018;174(4):999–1014. e1022. 10.1016/j.cell.2018.06.021. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib43] 43. Robertson SJ, Edwards FA. ATP and glutamate are released from separate neurones in the rat medial habenula nucleus: frequency dependence and adenosine-mediated inhibition of release. J Physiol. 1998;508(Pt 3):691. 10.1111/j.1469-7793.1998.691bp.x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib44] 44. La Manno G, Siletti K, Furlan A, et al. Molecular architecture of the developing mouse brain. Nature. 2021;596(7870):92–96. 10.1038/s41586-021-03775-x. [DOI] [PubMed] [Google Scholar]

[bib45] 45. Yokota Y, Mori S. Role of Id family proteins in growth control. J Cell Physiol. 2002;190(1):21–28. 10.1002/jcp.10042. [DOI] [PubMed] [Google Scholar]

[bib46] 46. Lyden D, Young AZ, Zagzag D, et al. Id1 and Id3 are required for neurogenesis, angiogenesis and vascularization of tumour xenografts. Nature. 1999;401(6754):670–77. 10.1038/44334. [DOI] [PubMed] [Google Scholar]

[bib47] 47. Sudhof TC. Neuroligins and neurexins link synaptic function to cognitive disease. Nature. 2008;455(7215):903–11. 10.1038/nature07456. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib48] 48. Mei L, Nave KA. Neuregulin-ERBB signaling in the nervous system and neuropsychiatric diseases. Neuron. 2014;83(1):27–49. 10.1016/j.neuron.2014.06.007. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib49] 49. Jun H, Mohammed Qasim Hussaini S, Rigby MJ, et al. Functional role of adult hippocampal neurogenesis as a therapeutic strategy for mental disorders. Neural Plast. 2012;2012:854285. 10.1155/2012/854285. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib50] 50. Guillemot F, Zimmer C. From cradle to grave: the multiple roles of fibroblast growth factors in neural development. Neuron. 2011;71(4):574–88. 10.1016/j.neuron.2011.08.002. [DOI] [PubMed] [Google Scholar]

[bib51] 51. Kriegstein A, Alvarez-Buylla A. The glial nature of embryonic and adult neural stem cells. Annu Rev Neurosci. 2009;32:149–84. 10.1146/annurev.neuro.051508.135600. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib52] 52. Hatten ME, Heintz N. Mechanisms of neural patterning and specification in the developing cerebellum. Annu Rev Neurosci. 1995;18:385–408. 10.1146/annurev.ne.18.030195.002125. [DOI] [PubMed] [Google Scholar]

[bib53] 53. Rakic P. Evolution of the neocortex: a perspective from developmental biology. Nat Rev Neurosci. 2009;10(10):724–35. 10.1038/nrn2719. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib54] 54. Lange M, Bergen V, Klein M, et al. CellRank for directed single-cell fate mapping. Nat Methods. 2022;19(2):159–70. 10.1038/s41592-021-01346-6. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib55] 55. Qiu C, Cao J, Martin BK, et al. Systematic reconstruction of cellular trajectories across mouse embryogenesis. Nat Genet. 2022;54(3):328–41. 10.1038/s41588-022-01018-x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib56] 56. Vayer T, Chapel L, Flamary R, et al. Fused Gromov-Wasserstein distance for structured objects. Algorithms. 2020;13(9):212. 10.3390/a13090212. [DOI] [Google Scholar]

[bib57] 57. Peterson VM, Zhang KX, Kumar N, et al. Multiplexed quantification of proteins and transcripts in single cells. Nat Biotechnol. 2017;35(10):936–39. 10.1038/nbt.3973. [DOI] [PubMed] [Google Scholar]

[bib58] 58. Maynard KR, Collado-Torres L, Weber LM, et al. Transcriptome-scale spatial gene expression in the human dorsolateral prefrontal cortex. Nat Neurosci. 2021;24:425–36. 10.1038/s41593-020-00787-0. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib63] 59. Wu L, Yan J, Bai Y, et al. An invasive zone in human liver cancer identified by Stereo-seq promotes hepatocyte–tumor cell crosstalk, local immunosuppression and tumor progression. Cell Res. 2023;33:585–603. 10.1038/s41422-023-00831-1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib64] 60. Allen WE, Blosser TR, Sullivan ZA, et al. Molecular and spatial signatures of mouse brain aging at single-cell resolution. Cell. 2023;186(1):194–208. e18. 10.1016/j.cell.2022.12.010. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib59] 61. Ioffe S, Szegedy C. Batch normalization: accelerating deep network training by reducing internal covariate shift. International Conference on Machine Learning.PMLR,2015:448–56.. https://arxiv.org/pdf/1502.03167. [Google Scholar]

[bib60] 62. Chang W-G, You T, Seo S, et al. Domain-specific batch normalization for unsupervised domain adaptation. Proceedings of the IEEE/CVF conference on Computer Vision and Pattern Recognition.2019:7354–62.. https://arxiv.org/pdf/1906.03950. [Google Scholar]

[bib61] 63. Bousmalis K, Trigeorgis G, Silberman N, et al. Domain separation networks. Adv Neural Inform Process Syst. 2016;29. https://proceedings.neurips.cc/paper_files/paper/2016/file/45fbc6d3e05ebd93369ce542e8f2322d-Paper.pdf. [Google Scholar]

[bib62] 64. Kim K-Y, Kim SH, Ki DH, et al. An attempt for combining microarray data sets by adjusting gene expressions. Cancer Res Treat. 2007;39(2):74–81. 10.4143/crt.2007.39.2.74. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib70] 65. 10X Genomics , Adult Mouse Olfactory Bulb, 2022. https://www.10xgenomics.com/datasets/adult-mouse-olfactory-bulb-1-standard.

[bib65] 66. Long Y. DeepST: a versatile graph contrastive learning framework for spatially informed clustering, integration and deconvolution of spatial transcriptomics (Version 1) [Data set]. Zenodo.2022. 10.5281/zenodo.6925603. [DOI]

[bib71] 67. Zhang C, Liu L, Zhang Y et al., spatiAlign: an unsupervised contrastive learning model for data integration of spatially resolved transcriptomics (Version 1) [Data set]. Zenodo. 2024. 10.5281/zenodo.10453192. [DOI] [PMC free article] [PubMed]

[bib72] 68. Zhang C, Liu L, Zhang Y, et al. Supporting data for “spatiAlign: An Unsupervised Contrastive Learning Model for Data Integration of Spatially Resolved Transcriptomics.”. GigaScience Database. 2024. 10.5524/102533. [DOI] [PMC free article] [PubMed]

PERMALINK

spatiAlign: an unsupervised contrastive learning model for data integration of spatially resolved transcriptomics

Chao Zhang

Lin Liu

Ying Zhang

Mei Li

Shuangsang Fang

Qiang Kang

Ao Chen

Xun Xu

Yong Zhang

Yuxiang Li

Abstract

Background

Findings

Conclusions

Introduction

Results

Overview of spatiAlign

Figure 1:

spatiAlign outperforms the control methods in integrating dorsolateral prefrontal cortex datasets

Figure 2:

spatiAlign enables the alignment of multiple olfactory bulb datasets from different SRT platforms

Figure 3:

spatiAlign preserves heterogeneous characteristics among slices while aligning datasets

Figure 4:

spatiAlign facilitates joint gene-level analysis of time-series mouse embryonic brain

Figure 5:

Discussion

Methods

Motivation for the use of across-domain adaptation contrastive learning

Data preprocessing

Spatial neighbor graph construction for the SRT dataset

Batch-specific variations to separate using domain-specific batch normalization

DGI-based feature extractor for reducing dimensions and propagating spatial neighboring context

Self-supervised contrastive learning for representation enhancement

Biological effects alignment using across-domain adaptation contrastive learning

In-batch instance level contrastive learning

Pseudoprototypical cluster-level contrastive learning

Across-batch instance self-supervised learning

Comparisons of methods

Evaluation metrics

F1 score of local inverse Simpson’s index

Adjusted Rand index

Hierarchical clustering, Moran’s I, and the join counts index calculation

Differential expression analysis and GO enrichment analysis

Trajectory inference analysis

Availability of Source Code and Requirements

Supplementary Material

Acknowledgement

Contributor Information

Additional Files

Abbreviations

Author Contributions

Funding

Data Availability

Competing Interests

References

Associated Data

Data Citations

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases