Aquila: a spatial omics database and analysis platform

Yimin Zheng; Yitian Chen; Xianting Ding; Koon Ho Wong; Edwin Cheung

doi:10.1093/nar/gkac874

. 2022 Oct 16;51(D1):D827–D834. doi: 10.1093/nar/gkac874

Aquila: a spatial omics database and analysis platform

Yimin Zheng ^1,^2,^3,⁴, Yitian Chen ^5,^6,^7,⁸, Xianting Ding ⁹, Koon Ho Wong ^10,^11,^12,¹³, Edwin Cheung ^14,^15,^16,^17,^✉

PMCID: PMC9825501 PMID: 36243967

Abstract

Spatial omics is a rapidly evolving approach for exploring tissue microenvironment and cellular networks by integrating spatial knowledge with transcript or protein expression information. However, there is a lack of databases for users to access and analyze spatial omics data. To address this limitation, we developed Aquila, a comprehensive platform for managing and analyzing spatial omics data. Aquila contains 107 datasets from 30 diseases, including 6500+ regions of interest, and 15.7 million cells. The database covers studies from spatial transcriptome and proteome analyses, 2D and 3D experiments, and different technologies. Aquila provides visualization of spatial omics data in multiple formats such as spatial cell distribution, spatial expression and co-localization of markers. Aquila also lets users perform many basic and advanced spatial analyses on any dataset. In addition, users can submit their own spatial omics data for visualization and analysis in a safe and secure environment. Finally, Aquila can be installed as an individual app on a desktop and offers the RESTful API service for power users to access the database. Overall, Aquila provides a detailed insight into transcript and protein expression in tissues from a spatial perspective. Aquila is available at https://aquila.cheunglab.org.

INTRODUCTION

Highly complex biological processes, such as the development of the neurological system or the progression of diseases like diabetes and cancer, are extremely challenging to study due to the heterogeneous cellular nature of tissues and the complex interactions between cells. However, the spatial distribution of cells within tissues is positively related to their function and lineage trajectories (1,2). Thus, understanding the spatial organization of tissues and the evolutionary relationship among cells is vital for capturing the different steps of developmental processes and the complete portrait of diseases. The rapid development and adoption of spatial omics technologies recently have accelerated our understanding of these essential processes.

Many spatial omics technologies have been developed so far, with the majority geared toward interrogating the spatial transcriptome of tissues (3). Spatial transcriptome methods can be broadly divided into either sequencing- or imaging-based techniques. Sequencing-based techniques capture and preserve spatial gene expression information of tissues by in situ hybridizing RNA transcripts onto spatially immobilized arrays of indexed sequences. Visium is currently the most popular sequencing-based tool, although, at present, this technology and similar technologies like Slide-seq (4) and DBiT-seq (5) are not yet at single-cell resolution (6). In contrast, newer techniques such as spatial enhanced resolution omics sequencing (Stereo-seq) (7), Seq-Scope (8) and sci-Space (9) are at single-cell or subcellular resolution. Imaging-based spatial transcriptome technologies include sequential fluorescence in situ hybridization (seqFISH) (10), multiplexed error-robust fluorescence in situ hybridization (MERFISH) (11) and spatially resolved transcript amplicon readout mapping (STARmap) (12). These technologies require designing fluorescent-labeled probes to hybridize with known RNA targets and a super-resolution microscope to capture the fluorescent signal.

Besides measuring transcript information, spatial omics technologies can also detect the proteome of tissues (13). Compared to spatial transcriptome technologies, the throughput of spatial proteomic technologies is currently limited due to the availability of fluorescent or metal-conjugated antibodies required for detecting target proteins. As with spatial transcriptome technologies, spatial proteomic technologies can also be categorized into two main groups. For example, cyclic immunofluorescence (14) and co-detection by indexing (15) are imaging-based techniques that measure protein abundance by repeatedly staining tissues with three to five fluorescent-labeled antibodies at a time. In comparison, imaging mass cytometry (IMC) (16) and mass ion beam imaging (MIBI) (17) are mass cytometry-based technologies that stain and quantify tissues using a maximum of 40 metal-conjugated antibodies at once.

Since spatial omics technologies produce a rich amount of data, they have been widely adopted in the scientific field, resulting in many exciting new findings. For example, Stereo-seq has been applied extensively for the spatial–temporal gene expression profiling of early embryonic stages at unprecedented resolution in mice (7), flies (18) and zebrafish (19). In a multimodal study integrating scRNA-seq, Visium and MIBI, Khavari and colleagues examined the spatial architecture of squamous cell carcinoma, revealing a multifaceted immunosuppression mechanism, including the exhaustion of T cells (20). Recently, IMC was used to map the spatial and temporal landscape of the lung at subcellular resolution during the progression of coronavirus disease 2019 (21). This study showed that the abundance, hyperactivities, the interplay between different inflammatory cells and structural changes contribute to the poor immune response in the late stage of the disease (21).

The recent precipitous expansion of spatial omics has also produced many datasets, and the increase will continue as the technology matures and becomes more widely accessible. A database that users can easily access, visualize and use for analyzing different types of spatial omics datasets is immensely beneficial for the scientific community. Unfortunately, no databases currently provide users with all these features in one complete package. At present, SpatialDB (22) is the only spatial omics database available, but it hosts only 24 transcriptome datasets, lacks proteomics data and does not support 3D datasets. SpatialDB also does not offer users any type of spatial analysis. While users can turn to software such as Giotto (23) and Squidpy (24) to analyze their data, these programs are targeted toward bioinformaticians with programming skills, which are unsuitable for most biologists.

Thus, we developed Aquila, a combined spatial omics database and analysis platform that supports all types of spatial omics data, including transcriptome and proteome, 2D and 3D, and from different technologies (Figure 1A). Aquila currently hosts 107 datasets from 15 technologies with >15.7 million cells. Aquila provides users the visualization of spatial omics data in multiple formats, such as cell maps, expression maps and co-localization of markers (Figure 1B). Users can also easily retrieve any dataset and run a repertoire of integrated analyses directly in the same browser, including advanced spatial analyses that typically require coding skills (Figure 1C). Furthermore, users can submit spatial omics data of their own to Aquila and perform the same analyses as the datasets found in the database. We address data privacy issues by storing all user datasets locally without uploading them to the remote server. Finally, Aquila can also be installed as an individual application on a desktop through the progressive web application standard.

Figure 1. — Schematic design of Aquila. (A) Various types of spatial omics datasets accessible in Aquila. (B) Examples of ROI visualization (i.e. cell map, expression map and co-localization of markers) in 2D and 3D. (C) A comprehensive list of spatial analyses with examples of results.

MATERIALS AND METHODS

Data collection and preprocessing

We manually searched for spatial omics-related keywords using PubMed and Google Scholar to acquire publications with relevant datasets. Only publicly available data were collected. To handle the different types of spatial omics data, they were stored in three parts: expression matrix (matrix market format), spatial location and region of interest (ROI) annotation information. For the expression matrix, we filtered out genes or proteins expressed in <100 cells (or spots) and cells (or spots) expressing <1 gene or protein. Any negative expression values were replaced with a minimum value of 0 to ensure all the expression values were non-negative. Afterward, the expression matrix was log transformed as Inline graphic . The spatial and annotation information was left intact without any transformation. The processing scripts were written in Python using Numpy (25), Pandas (26) and Scanpy (27).

Determining the dimension of ROIs

Aquila uses the envelope of an ROI as its dimension, which is the maximum boundary of the ROI. For a series of 2D points, the envelope is calculated as Inline graphic . This equation is also extended to 3D data.

Mixing colors for co-localization of markers

Aquila uses the following algorithm to determine mixed colors by combining two or more colors. Colors are represented in RGB color space, with color A as Inline graphic and color B as , and the mixed color result as . For each channel, the mixing value is calculated as .

Co-expression analysis

Aquila uses Pearson correlation and Spearman correlation to determine the relationship between the expression of any genes or markers. Users can select 2–50 genes to perform co-expression analysis. This analysis is implemented in Rust for optimized computing pair correlation using Inline graphic and as the kernel for acceleration. Co-expression is calculated as

Inline graphic

Spearman correlation has an additional process in which input vectors are ranked by Spearman ranking, and the ranked vectors are then used for subsequent calculations. Users do not need to provide any parameters for this analysis.

Cell distribution pattern analysis

The cell distribution pattern analysis examines whether cells are distributed randomly, evenly or in a cluster. Aquila offers users three methods to choose from, including the index of dispersion (28), the Morisita index (29) and the Clarks–Evans index (30). For all three methods, the null hypothesis is that points are in a completely random state (Poisson random point process). Each method is tested against the null hypothesis to determine which pattern cells are distributed in an ROI.

The index of dispersion method uses a sampling window to sample an ROI multiple times randomly. Each time, we obtain the number of cells in a window. Inline graphic is defined as the variance and is the mean. The -test is used to determine significance. The index value is calculated as

Since Aquila uses circles as sampling windows, users will need to define the radius for the circle and the number of sampling times. By default, Aquila uses one-tenth of the minimum side of the ROI envelope as the radius and samples 1000 times. For each run, the results will be different due to the difference between samplings.

The Morisita index method is based on quadratic statistics that rasterize an ROI into a grid. The number of points in a square is denoted as Inline graphic . The -test is used to determine significance. The index value is calculated as

Users can define the dimension of the grid to control the granularity of the rasterization process. The default setting uses a 10 × 10 grid to rasterize the ROI.

The Clark–Evans index is a nonparametric statistical method in which the distribution of cells is based on the distance between neighboring points. Here, Inline graphic is defined as the density and as the number of individuals.

The index of dispersion, the Morisita index and the Clarks–Evans index are all implemented in Rust.

Ripley’s cell distribution analysis

Ripley’s K function (31) is a cell distribution pattern analysis that works by calculating the average number of points from different distance intervals. Users need to define the number of intervals for the function. In Aquila, the K function and its variants G, F and L are made available by Pointpats (v2.2.0) in the PySAL ecosystem (32).

Spatial entropy analysis

Aquila uses two entropy metrics to measure tissue heterogeneity, the Shannon entropy (33) and Leibovici entropy (34). Shannon entropy is a classic method for evaluating the chaos of a system. In contrast, Leibovici entropy considers the contribution of spatial information to a system’s entropy. It requires a distance parameter to determine the range of co-occurring events. By default, the distance is one-tenth of the minimum side of the ROI envelope. Shannon entropy is calculated as

where Inline graphic is defined as the proportion of cell type in the ROI. Leibovici entropy is determined as

where Inline graphic is equal to the proportion of different co-occurring events within the distance condition . Spatial entropy analysis is implemented in Rust.

Spatial variable gene analysis

The spatial expression pattern of a gene may or may not depend on the spatial context. Aquila identifies spatial variable genes using a GPU accelerated version of SpatialDE (35,36). The current method for detecting spatial variable genes in a regular ROI on a web server is inefficient and can take more than a few minutes. Due to this limitation, users can select no more than 50 genes per run.

Constructing neighboring networks

The results from the neighboring network analysis lay the foundation for the following analyses: spatial community, cell centrality, cell–cell interaction, spatial co-expression and spatial autocorrelation. Aquila provides two methods, KD-Tree and Delaunay triangulation, to determine the neighbors of a cell. KD-Tree employs a binary tree structure approach to analyze spatial data, making it an efficient way for spatial queries. Users set two parameters to use this method, the search radius for each cell and the number of nearest neighbors for a cell. Delaunay triangulation is a nonparametric method that can be thought of as filling a discrete set of points (i.e. cells) in a 2D plane with triangles. Each point is used as the vertex of a triangle such that no points lie in the circumcircle of any triangle. Cells are considered neighbors if they are connected to the edge of the triangle. Both methods construct a spatial network and display the results graphically with nodes representing cells and edges as the Euclidean distance between cells. With the network graph, users can examine the quality of the network and determine which cells have neighbors. They can also check the Euclidean distance between cells by hovering over the edges. Both methods are implemented in Rust using Kiddo (v0.2.3) and Delaunator (v1.0.1).

Spatial community analysis

Tissue phenotyping can be conducted by grouping cells within a local environment as a community (37). Thus, it is essential to identify these groups of cells that are spatially distinguished. Aquila includes three community detection methods, Leiden (38), Louvain (39) and Infomap (40), for clustering cells based on the above-mentioned neighboring network results. For the Louvain method, a multilevel version was implemented, which has a better computation efficiency than the original (41). The Leiden method requires users to input a resolution parameter that affects the size of each community. No input parameters are needed for either Louvain or Infomap. These three methods are executed using Python-igraph (v0.9.10).

Cell centrality analysis

Cell centrality analysis measures the importance of a cell in a network. Aquila provides three ways to determine cell centrality based on ‘degree’, ‘closeness’ and ‘betweenness’ (42). ‘Degree’ centrality works by counting the number of links connected to a node, while ‘closeness’ centrality is based on calculating the number of times a node acts as a bridge along the shortest path between two other nodes. Finally, ‘betweenness’ centrality measures the average length of the shortest path between a node and all other nodes in the network. Cell centrality analysis is implemented in Python using Python-igraph (v0.9.10).

Cell–cell interaction analysis

The spatial relationship between each cell type is determined by bootstrapping, a previously described approach used to analyze spatial single-cell data (43). For each ROI, the cell type identity is randomly reassigned, while the physical location of each cell type remains unchanged. First, a null distribution is generated for the number of cells A in the neighborhood of cell B. This result is derived from several times of bootstrapping. The null distribution is then compared to the actual distribution to determine the significance. A pseudo P-value (two-tailed) is calculated by

If a Z-score is used, the significance is calculated as

The Z-score will then be converted into a P-value. An interaction result with a P-value <0.05 is considered significant. If two cells are likely to occur in each other’s neighborhood, their relationship is considered an association (+1); otherwise, their relationship is classified as avoidance (−1). Users need to define the number of times to perform the bootstrapping (500–2000). The cell–cell interaction analysis is implemented in Rust.

Spatial co-expression analysis

The spatial co-expression analysis is similar to the co-expression analysis described above, except that here Aquila uses the adjacency list from the neighbor network result to construct an expression vector. For example, the expression of marker A is Inline graphic and that of marker B is .

Spatial autocorrelation analysis

Spatial autocorrelation analysis calculates the degree of random spatial distribution of cells. If cells tend to locate closely together, they are considered positively spatially correlated. Moran’s I (44) measures global spatial autocorrelation, while Geary’s C (45) measures local spatial autocorrelation. Moran’s I is defined as

while Geary’s C is defined as

Inline graphic is defined as the number of spatial units indexed by and , while is the spatial weight matrix derived from the neighbor network analysis and is the sum of the matrix. The spatial autocorrelation analysis is implemented in Rust.

Structure and implementations

The Aquila web user interface (UI) was generated using Next.js, a framework based on React. The web UI was designed using MUI, following the Material Design Guidelines. The visualization was created using Echarts.js with WebGL support to enable GPU acceleration. The codes for processing user datasets were implemented using pure JavaScript and executed directly in the browser using Web Workers. All user data are stored locally using IndexedDB. The central API server was constructed using the Actix framework and written in Rust. Analyses including cell neighbor network, cell distribution, co-expression analysis, spatial autocorrelation analysis, spatial entropy and cell–cell interactions were also implemented in Rust. The secondary API server was constructed using FastAPI, written in Python and implemented the following analyses: Ripley statistics, spatial community, cell centrality and spatial variable gene. SpatialDE powered the spatial variable gene analysis with GPU support. Currently, we are running Aquila on a server with an NVIDIA GTX750 GPU. Nginx was used as the reverse proxy server to register the two API servers under the same API gateway.

DATABASE CONTENT AND USAGE

Querying and selecting data

To query and select datasets, users must first click on the ‘Explore’ button on the home page, which will direct them to the ‘Dataset’ page. This page displays all the datasets as a list of cards. Each card highlights the essential information of a dataset, including species, tissue, disease, technology, molecule, the statistics of datasets and associated publications. The card also contains buttons that users can choose to view the dataset, download the dataset or add the dataset to the download list to download multiple datasets at once (Figure 2A). Aquila has a powerful filtering panel on the same page that helps users look for the dataset(s) they are interested in faster. Users can select dataset(s) based on multiple conditions such as species, technology, tissue or disease (Figure 2B). They can also find datasets by running a fuzzy search with any keywords. Searching datasets by gene names (case-insensitive) is also supported (Figure 2B). After narrowing down their search, users can sort the filtered datasets by publication name, year or the number of cells/ROIs/genes in either an ascending or a descending order. Aquila also supports viewing datasets from the ‘Publications’ page since each publication may contain multiple datasets. Once users find their dataset of interest, they can explore it in detail by selecting the ‘View’ button. After entering the details page of data, users can also select to view different ROIs in each dataset through a feature-rich data table. The default will present the first ROI in the dataset.

Figure 2. — Navigating and using Aquila. (A) An example of a dataset card displaying different essential information. (B) A panel for filtering and searching datasets with a sort button. (C) An example of an analysis panel for selecting methods and inserting parameters to run an analysis.

Visualizing spatial maps

Spatial maps are shown in 2D or 3D, depending on the dataset (Figure 1B). Users can begin by obtaining a general overview of the dataset by choosing one or more ROIs they would like to view in the ROI preview panel. They can then pick from the ROI preview panel one ROI at a time that they would like to acquire additional details. Once users have selected an ROI, they can view different contents of the ROI by toggling between ‘Cell Map’, ‘Expression Map’ and ‘Co-localization of Markers’. ‘Cell Map’ shows users the spatial distribution of the different cell types in the ROI. ‘Cell Map’ also contains general information about the ROI, including the dimension (minimum bounding box) and the number of cells. As for ‘Expression Map’, this option lets users look at the spatial expression of individual markers. Users can select or search for their markers of interest using the dropdown menu. Multiple expression maps can be viewed at once, so users can compare the spatial expression patterns of the different markers. Users can click on an ROI to obtain a larger view. Quantification of the expression distribution for the selected marker is also provided under ‘Expression Map’. Finally, by selecting ‘Co-localization of Markers’, users can view the expression of multiple markers in a cell, similar to immunofluorescent and fluorescent in situ hybridization analyses. Users can select up to two to five markers, assigning each marker a different color to see whether their expression is co-localized. All spatial maps allow users to adjust the point size and canvas size and zoom in and out to check tissue structures at different resolutions. All visualizations displayed in Aquila can be downloaded as PNGs.

Performing spatial analysis

Users can perform a comprehensive spatial analysis of any selected ROI within Aquila. Currently, 14 types of analyses are available. A concise explanation of each analysis can be found by clicking on the down arrowhead icon to the right of each analysis title. Users can choose to use the general default parameters supplied for every analysis, or they can input their own values. Users can also hover over the question mark at the end of each parameter to learn its functionalities in the popup card (Figure 2C). Many analyses come with more than one method, which allows users to try out different methods to achieve optimal results. Analyses will start immediately after selecting the ‘Run’ button, and users can visualize the results as soon as they are completed. Some spatial analyses, as described above, rely on the results from the neighboring network analysis to work and, by default, are not accessible to users. To unlock and run these other spatial analyses, users will need to perform the ‘neighboring network’ analysis, which will then embed the results into these analyses. Other analyses require cell type information to run, and thus they are not available for datasets that do not provide this information. The details of each analysis and how to choose parameters are found in the ‘Materials and Methods’ section.

Analyzing user data

Users can use Aquila to analyze their data by uploading them on the ‘Analysis’ page. Users must upload three files containing ROI annotation, cell/spot coordination and gene expression values. After these files are processed, users can then view and analyze their data in Aquila with the exact layout and functionalities as the public datasets. To protect sensitive private information from patient samples or unpublished research data from possible leaks, Aquila processes the data and stores the results locally.

CONCLUSION AND FUTURE DEVELOPMENT

In this work, we introduce Aquila, a database and analysis package for spatial omics data. Aquila provides a 2D and 3D spatial omics data visualization platform and allows users to analyze existing datasets in the database. Moreover, Aquila also supports visualizing and analyzing user data. Overall, Aquila provides users with results in a detailed and comprehensive profile from a spatial perspective.

Many spatial omics datasets provide tissue images such as H&E staining and expression information. Combining expression information with images helps users better understand the datasets and gain deeper insights into their findings. However, Aquila currently does not support tissue images, mainly due to the vast storage size needed to archive them and the resulting performance degradation in the backend database. We will seek better technical solutions in the future to incorporate tissue images. Finally, we will continuously update Aquila with new datasets and integrate additional analyses to provide a richer understanding of the data.

DATA AVAILABILITY

Aquila is freely accessible to anyone with no registration requirements. The database can be accessed and all the datasets can be downloaded via https://aquila.cheunglab.org. For usage and API documentation, please refer to https://aquila.cheunglab.org/about. All preprocessing and source codes for deploying the database are found at https://github.com/Mr-Milk/Aquila-next.

ACKNOWLEDGEMENTS

We are grateful to Nitin Narwade, Lingling Hu, Zhengqiang Miao and Jianfei Zheng for devoting their valuable time to testing and providing priceless advice for improving Aquila. We thank the Information and Communication Technology Office from the University of Macau for providing network and hardware support to host the API server for the database. We also thank all the members of the Cheung lab for critical feedback on this work.

Contributor Information

Yimin Zheng, Cancer Centre, University of Macau, Taipa 999078, Macau SAR; Centre for Precision Medicine Research and Training, University of Macau, Taipa 999078, Macau SAR; MoE Frontiers Science Center for Precision Oncology, University of Macau, Taipa 999078, Macau SAR; Faculty of Health Sciences, University of Macau, Taipa 999078, Macau SAR.

Yitian Chen, Cancer Centre, University of Macau, Taipa 999078, Macau SAR; Centre for Precision Medicine Research and Training, University of Macau, Taipa 999078, Macau SAR; MoE Frontiers Science Center for Precision Oncology, University of Macau, Taipa 999078, Macau SAR; Faculty of Health Sciences, University of Macau, Taipa 999078, Macau SAR.

Xianting Ding, Institute for Personalized Medicine, School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai 200240, China.

Koon Ho Wong, Cancer Centre, University of Macau, Taipa 999078, Macau SAR; Centre for Precision Medicine Research and Training, University of Macau, Taipa 999078, Macau SAR; MoE Frontiers Science Center for Precision Oncology, University of Macau, Taipa 999078, Macau SAR; Faculty of Health Sciences, University of Macau, Taipa 999078, Macau SAR.

Edwin Cheung, Cancer Centre, University of Macau, Taipa 999078, Macau SAR; Centre for Precision Medicine Research and Training, University of Macau, Taipa 999078, Macau SAR; MoE Frontiers Science Center for Precision Oncology, University of Macau, Taipa 999078, Macau SAR; Faculty of Health Sciences, University of Macau, Taipa 999078, Macau SAR.

FUNDING

University of Macau [MYRG2018-00033-FHS, MYRG2020-00100-FHS]; Science and Technology Development Fund [0011/2019/AKP, 0137/2020/A3]. Funding for open access charge: University of Macau.

Conflict of interest statement. None declared.

REFERENCES

1. HuBMAP Consortium The human body at cellular resolution: the NIH Human Biomolecular Atlas Program. Nature. 2019; 574:187. [DOI] [PMC free article] [PubMed] [Google Scholar]
2. Chen X., Teichmann S.A., Meyer K.B.. From tissues to cell types and back: single-cell gene expression analysis of tissue architecture. Annu. Rev. Biomed. Data Sci. 2018; 1:29–51. [Google Scholar]
3. Moses L., Pachter L.. Museum of spatial transcriptomics. Nat. Methods. 2022; 19:534–546. [DOI] [PubMed] [Google Scholar]
4. Stickels R.R., Murray E., Kumar P., Li J., Marshall J.L., Di Bella D.J., Arlotta P., Macosko E.Z., Chen F.. Highly sensitive spatial transcriptomics at near-cellular resolution with Slide-seqV2. Nat. Biotechnol. 2021; 39:313–319. [DOI] [PMC free article] [PubMed] [Google Scholar]
5. Liu Y., Yang M., Deng Y., Su G., Enninful A., Guo C.C., Tebaldi T., Zhang D., Kim D., Bai Z.. High-spatial-resolution multi-omics sequencing via deterministic barcoding in tissue. Cell. 2020; 183:1665–1681. [DOI] [PMC free article] [PubMed] [Google Scholar]
6. Ståhl P.L., Salmén F., Vickovic S., Lundmark A., Navarro J.F., Magnusson J., Giacomello S., Asp M., Westholm J.O., Huss M.. Visualization and analysis of gene expression in tissue sections by spatial transcriptomics. Science. 2016; 353:78–82. [DOI] [PubMed] [Google Scholar]
7. Chen A., Liao S., Cheng M., Ma K., Wu L., Lai Y., Qiu X., Yang J., Xu J., Hao S.. Spatiotemporal transcriptomic atlas of mouse organogenesis using DNA nanoball-patterned arrays. Cell. 2022; 185:1777–1792. [DOI] [PubMed] [Google Scholar]
8. Cho C.-S., Xi J., Si Y., Park S.-R., Hsu J.-E., Kim M., Jun G., Kang H.M., Lee J.H.. Microscopic examination of spatial transcriptome using Seq-Scope. Cell. 2021; 184:3559–3572. [DOI] [PMC free article] [PubMed] [Google Scholar]
9. Srivatsan S.R., Regier M.C., Barkan E., Franks J.M., Packer J.S., Grosjean P., Duran M., Saxton S., Ladd J.J., Spielmann M.. Embryo-scale, single-cell spatial transcriptomics. Science. 2021; 373:111–117. [DOI] [PMC free article] [PubMed] [Google Scholar]
10. Eng C.-H.L., Lawson M., Zhu Q., Dries R., Koulena N., Takei Y., Yun J., Cronin C., Karp C., Yuan G.-C.. Transcriptome-scale super-resolved imaging in tissues by RNA seqFISH+. Nature. 2019; 568:235–239. [DOI] [PMC free article] [PubMed] [Google Scholar]
11. Xia C., Fan J., Emanuel G., Hao J., Zhuang X.. Spatial transcriptome profiling by MERFISH reveals subcellular RNA compartmentalization and cell cycle-dependent gene expression. Proc. Natl Acad. Sci. U.S.A. 2019; 116:19490–19499. [DOI] [PMC free article] [PubMed] [Google Scholar]
12. Wang X., Allen W.E., Wright M.A., Sylwestrak E.L., Samusik N., Vesuna S., Evans K., Liu C., Ramakrishnan C., Liu J.. Three-dimensional intact-tissue sequencing of single-cell transcriptional states. Science. 2018; 361:eaat5691. [DOI] [PMC free article] [PubMed] [Google Scholar]
13. Lewis S.M., Asselin-Labat M.-L., Nguyen Q., Berthelet J., Tan X., Wimmer V.C., Merino D., Rogers K.L., Naik S.H.. Spatial omics and multiplexed imaging to explore cancer biology. Nat. Methods. 2021; 18:997–1012. [DOI] [PubMed] [Google Scholar]
14. Lin J.-R., Izar B., Wang S., Yapp C., Mei S., Shah P.M., Santagata S., Sorger P.K.. Highly multiplexed immunofluorescence imaging of human tissues and tumors using t-CyCIF and conventional optical microscopes. eLife. 2018; 7:e31657. [DOI] [PMC free article] [PubMed] [Google Scholar]
15. Goltsev Y., Samusik N., Kennedy-Darling J., Bhate S., Hale M., Vazquez G., Black S., Nolan G.P.. Deep profiling of mouse splenic architecture with CODEX multiplexed imaging. Cell. 2018; 174:968–981. [DOI] [PMC free article] [PubMed] [Google Scholar]
16. Giesen C., Wang H.A., Schapiro D., Zivanovic N., Jacobs A., Hattendorf B., Schüffler P.J., Grolimund D., Buhmann J.M., Brandt S.. Highly multiplexed imaging of tumor tissues with subcellular resolution by mass cytometry. Nat. Methods. 2014; 11:417–422. [DOI] [PubMed] [Google Scholar]
17. Keren L., Bosse M., Thompson S., Risom T., Vijayaragavan K., McCaffrey E., Marquez D., Angoshtari R., Greenwald N.F., Fienberg H.. MIBI-TOF: a multiplexed imaging platform relates cellular phenotypes and tissue structure. Sci. Adv. 2019; 5:eaax5851. [DOI] [PMC free article] [PubMed] [Google Scholar]
18. Wang M., Hu Q., Lv T., Wang Y., Lan Q., Xiang R., Tu Z., Wei Y., Han K., Shi C.. High-resolution 3D spatiotemporal transcriptomic maps of developing Drosophila embryos and larvae. Dev. Cell. 2022; 57:1271–1283. [DOI] [PubMed] [Google Scholar]
19. Liu C., Li R., Li Y., Lin X., Zhao K., Liu Q., Wang S., Yang X., Shi X., Ma Y.. Spatiotemporal mapping of gene expression landscapes and developmental trajectories during zebrafish embryogenesis. Dev. Cell. 2022; 57:1284–1298. [DOI] [PubMed] [Google Scholar]
20. Ji A.L., Rubin A.J., Thrane K., Jiang S., Reynolds D.L., Meyers R.M., Guo M.G., George B.M., Mollbrink A., Bergenstråhle J.et al.. Multimodal analysis of composition and spatial architecture in human squamous cell carcinoma. Cell. 2020; 182:497–514. [DOI] [PMC free article] [PubMed] [Google Scholar]
21. Rendeiro A.F., Ravichandran H., Bram Y., Chandar V., Kim J., Meydan C., Park J., Foox J., Hether T., Warren S.. The spatial landscape of lung pathology during COVID-19 progression. Nature. 2021; 593:564–569. [DOI] [PMC free article] [PubMed] [Google Scholar]
22. Fan Z., Chen R., Chen X.. SpatialDB: a database for spatially resolved transcriptomes. Nucleic Acids Res. 2020; 48:D233–D237. [DOI] [PMC free article] [PubMed] [Google Scholar]
23. Dries R., Zhu Q., Dong R., Eng C.-H.L., Li H., Liu K., Fu Y., Zhao T., Sarkar A., Bao F.. Giotto: a toolbox for integrative analysis and visualization of spatial expression data. Genome Biol. 2021; 22:1–31. [DOI] [PMC free article] [PubMed] [Google Scholar]
24. Palla G., Spitzer H., Klein M., Fischer D., Schaar A.C., Kuemmerle L.B., Rybakov S., Ibarra I.L., Holmberg O., Virshup I.. Squidpy: a scalable framework for spatial omics analysis. Nat. Methods. 2022; 19:171–178. [DOI] [PMC free article] [PubMed] [Google Scholar]
25. Harris C.R., Millman K.J., van der Walt S.J., Gommers R., Virtanen P., Cournapeau D., Wieser E., Taylor J., Berg S., Smith N.J.et al.. Array programming with NumPy. Nature. 2020; 585:357–362. [DOI] [PMC free article] [PubMed] [Google Scholar]
26. McKinney W. pandas: a foundational Python library for data analysis and statistics. Proceedings of the Workshop onPython for High Performance and Scientific Computing. 2011; 14:1–9. [Google Scholar]
27. Wolf F.A., Angerer P., Theis F.J.. SCANPY: large-scale single-cell gene expression data analysis. Genome Biol. 2018; 19:15. [DOI] [PMC free article] [PubMed] [Google Scholar]
28. Fath B.D. Encyclopedia of Ecology. 2018; Amsterdam, The Netherlands: Elsevier. [Google Scholar]
29. Morisita M. I _σ-index, a measure of dispersion of individuals. Res. Popul. Ecol. 1962; 4:1–7. [Google Scholar]
30. Clark P.J., Evans F.C.. Distance to nearest neighbor as a measure of spatial relationships in populations. Ecology. 1954; 35:445–453. [Google Scholar]
31. Ripley B.D. Spatial Statistics. 2005; Hoboken, NJ: John Wiley & Sons. [Google Scholar]
32. Rey S.J., Anselin L.. Handbook of Applied Spatial Analysis. 2010; Berlin: Springer; 175–193. [Google Scholar]
33. Shannon C.E. A mathematical theory of communication. Bell Syst. Tech. J. 1948; 27:379–423. [Google Scholar]
34. Leibovici D.G., Claramunt C., Le Guyader D., Brosset D. Local and global spatio-temporal entropy indices based on distance-ratios and co-occurrences distributions. Int. J. Geogr. Inf. Sci. 2014; 28:1061–1084. [Google Scholar]
35. Svensson V., Teichmann S.A., Stegle O.. SpatialDE: identification of spatially variable genes. Nat. Methods. 2018; 15:343–346. [DOI] [PMC free article] [PubMed] [Google Scholar]
36. Kats I., Vento-Tormo R., Stegle O.. SpatialDE2: fast and localized variance component analysis of spatial transcriptomics. 2021; bioRxiv doi:27 October 2021, preprint: not peer reviewed 10.1101/2021.10.27.466045. [DOI]
37. Jackson H.W., Fischer J.R., Zanotelli V.R., Ali H.R., Mechera R., Soysal S.D., Moch H., Muenst S., Varga Z., Weber W.P.. The single-cell pathology landscape of breast cancer. Nature. 2020; 578:615–620. [DOI] [PubMed] [Google Scholar]
38. Traag V.A., Waltman L., van Eck N.J.. From Louvain to Leiden: guaranteeing well-connected communities. Sci. Rep. 2019; 9:1–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
39. Rotta R., Noack A.. Multilevel local search algorithms for modularity clustering. J. Exp. Algorithmics. 2011; 16:2.1–2.27. [Google Scholar]
40. Rosvall M., Bergstrom C.T.. Maps of information flow reveal community structure in complex networks. PNAS. 2007; 105:1118–1123. [DOI] [PMC free article] [PubMed] [Google Scholar]
41. Blondel V., Guillaume J., Lambiotte R., Lefebvre E.. Fast unfolding of community hierarchies in large networks. J. Stat. Mech. 2008; 2008:P10008. [Google Scholar]
42. West D.B. Introduction to Graph Theory. 2001; Upper Saddle River, NJ: Prentice Hall. [Google Scholar]
43. Schapiro D., Jackson H.W., Raghuraman S., Fischer J.R., Zanotelli V.R., Schulz D., Giesen C., Catena R., Varga Z., Bodenmiller B.. histoCAT: analysis of cell phenotypes and interactions in multiplex image cytometry data. Nat. Methods. 2017; 14:873. [DOI] [PMC free article] [PubMed] [Google Scholar]
44. Moran P.A. Notes on continuous stochastic phenomena. Biometrika. 1950; 37:17–23. [PubMed] [Google Scholar]
45. Geary R.C. The contiguity ratio and statistical mapping. Inc. Stat. 1954; 5:115–146. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

[B1] 1. HuBMAP Consortium The human body at cellular resolution: the NIH Human Biomolecular Atlas Program. Nature. 2019; 574:187. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B2] 2. Chen X., Teichmann S.A., Meyer K.B.. From tissues to cell types and back: single-cell gene expression analysis of tissue architecture. Annu. Rev. Biomed. Data Sci. 2018; 1:29–51. [Google Scholar]

[B3] 3. Moses L., Pachter L.. Museum of spatial transcriptomics. Nat. Methods. 2022; 19:534–546. [DOI] [PubMed] [Google Scholar]

[B4] 4. Stickels R.R., Murray E., Kumar P., Li J., Marshall J.L., Di Bella D.J., Arlotta P., Macosko E.Z., Chen F.. Highly sensitive spatial transcriptomics at near-cellular resolution with Slide-seqV2. Nat. Biotechnol. 2021; 39:313–319. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B5] 5. Liu Y., Yang M., Deng Y., Su G., Enninful A., Guo C.C., Tebaldi T., Zhang D., Kim D., Bai Z.. High-spatial-resolution multi-omics sequencing via deterministic barcoding in tissue. Cell. 2020; 183:1665–1681. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B6] 6. Ståhl P.L., Salmén F., Vickovic S., Lundmark A., Navarro J.F., Magnusson J., Giacomello S., Asp M., Westholm J.O., Huss M.. Visualization and analysis of gene expression in tissue sections by spatial transcriptomics. Science. 2016; 353:78–82. [DOI] [PubMed] [Google Scholar]

[B7] 7. Chen A., Liao S., Cheng M., Ma K., Wu L., Lai Y., Qiu X., Yang J., Xu J., Hao S.. Spatiotemporal transcriptomic atlas of mouse organogenesis using DNA nanoball-patterned arrays. Cell. 2022; 185:1777–1792. [DOI] [PubMed] [Google Scholar]

[B8] 8. Cho C.-S., Xi J., Si Y., Park S.-R., Hsu J.-E., Kim M., Jun G., Kang H.M., Lee J.H.. Microscopic examination of spatial transcriptome using Seq-Scope. Cell. 2021; 184:3559–3572. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B9] 9. Srivatsan S.R., Regier M.C., Barkan E., Franks J.M., Packer J.S., Grosjean P., Duran M., Saxton S., Ladd J.J., Spielmann M.. Embryo-scale, single-cell spatial transcriptomics. Science. 2021; 373:111–117. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B10] 10. Eng C.-H.L., Lawson M., Zhu Q., Dries R., Koulena N., Takei Y., Yun J., Cronin C., Karp C., Yuan G.-C.. Transcriptome-scale super-resolved imaging in tissues by RNA seqFISH+. Nature. 2019; 568:235–239. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B11] 11. Xia C., Fan J., Emanuel G., Hao J., Zhuang X.. Spatial transcriptome profiling by MERFISH reveals subcellular RNA compartmentalization and cell cycle-dependent gene expression. Proc. Natl Acad. Sci. U.S.A. 2019; 116:19490–19499. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B12] 12. Wang X., Allen W.E., Wright M.A., Sylwestrak E.L., Samusik N., Vesuna S., Evans K., Liu C., Ramakrishnan C., Liu J.. Three-dimensional intact-tissue sequencing of single-cell transcriptional states. Science. 2018; 361:eaat5691. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B13] 13. Lewis S.M., Asselin-Labat M.-L., Nguyen Q., Berthelet J., Tan X., Wimmer V.C., Merino D., Rogers K.L., Naik S.H.. Spatial omics and multiplexed imaging to explore cancer biology. Nat. Methods. 2021; 18:997–1012. [DOI] [PubMed] [Google Scholar]

[B14] 14. Lin J.-R., Izar B., Wang S., Yapp C., Mei S., Shah P.M., Santagata S., Sorger P.K.. Highly multiplexed immunofluorescence imaging of human tissues and tumors using t-CyCIF and conventional optical microscopes. eLife. 2018; 7:e31657. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B15] 15. Goltsev Y., Samusik N., Kennedy-Darling J., Bhate S., Hale M., Vazquez G., Black S., Nolan G.P.. Deep profiling of mouse splenic architecture with CODEX multiplexed imaging. Cell. 2018; 174:968–981. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B16] 16. Giesen C., Wang H.A., Schapiro D., Zivanovic N., Jacobs A., Hattendorf B., Schüffler P.J., Grolimund D., Buhmann J.M., Brandt S.. Highly multiplexed imaging of tumor tissues with subcellular resolution by mass cytometry. Nat. Methods. 2014; 11:417–422. [DOI] [PubMed] [Google Scholar]

[B17] 17. Keren L., Bosse M., Thompson S., Risom T., Vijayaragavan K., McCaffrey E., Marquez D., Angoshtari R., Greenwald N.F., Fienberg H.. MIBI-TOF: a multiplexed imaging platform relates cellular phenotypes and tissue structure. Sci. Adv. 2019; 5:eaax5851. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B18] 18. Wang M., Hu Q., Lv T., Wang Y., Lan Q., Xiang R., Tu Z., Wei Y., Han K., Shi C.. High-resolution 3D spatiotemporal transcriptomic maps of developing Drosophila embryos and larvae. Dev. Cell. 2022; 57:1271–1283. [DOI] [PubMed] [Google Scholar]

[B19] 19. Liu C., Li R., Li Y., Lin X., Zhao K., Liu Q., Wang S., Yang X., Shi X., Ma Y.. Spatiotemporal mapping of gene expression landscapes and developmental trajectories during zebrafish embryogenesis. Dev. Cell. 2022; 57:1284–1298. [DOI] [PubMed] [Google Scholar]

[B20] 20. Ji A.L., Rubin A.J., Thrane K., Jiang S., Reynolds D.L., Meyers R.M., Guo M.G., George B.M., Mollbrink A., Bergenstråhle J.et al.. Multimodal analysis of composition and spatial architecture in human squamous cell carcinoma. Cell. 2020; 182:497–514. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B21] 21. Rendeiro A.F., Ravichandran H., Bram Y., Chandar V., Kim J., Meydan C., Park J., Foox J., Hether T., Warren S.. The spatial landscape of lung pathology during COVID-19 progression. Nature. 2021; 593:564–569. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B22] 22. Fan Z., Chen R., Chen X.. SpatialDB: a database for spatially resolved transcriptomes. Nucleic Acids Res. 2020; 48:D233–D237. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B23] 23. Dries R., Zhu Q., Dong R., Eng C.-H.L., Li H., Liu K., Fu Y., Zhao T., Sarkar A., Bao F.. Giotto: a toolbox for integrative analysis and visualization of spatial expression data. Genome Biol. 2021; 22:1–31. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B24] 24. Palla G., Spitzer H., Klein M., Fischer D., Schaar A.C., Kuemmerle L.B., Rybakov S., Ibarra I.L., Holmberg O., Virshup I.. Squidpy: a scalable framework for spatial omics analysis. Nat. Methods. 2022; 19:171–178. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B25] 25. Harris C.R., Millman K.J., van der Walt S.J., Gommers R., Virtanen P., Cournapeau D., Wieser E., Taylor J., Berg S., Smith N.J.et al.. Array programming with NumPy. Nature. 2020; 585:357–362. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B26] 26. McKinney W. pandas: a foundational Python library for data analysis and statistics. Proceedings of the Workshop onPython for High Performance and Scientific Computing. 2011; 14:1–9. [Google Scholar]

[B27] 27. Wolf F.A., Angerer P., Theis F.J.. SCANPY: large-scale single-cell gene expression data analysis. Genome Biol. 2018; 19:15. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B28] 28. Fath B.D. Encyclopedia of Ecology. 2018; Amsterdam, The Netherlands: Elsevier. [Google Scholar]

[B29] 29. Morisita M. I _σ-index, a measure of dispersion of individuals. Res. Popul. Ecol. 1962; 4:1–7. [Google Scholar]

[B30] 30. Clark P.J., Evans F.C.. Distance to nearest neighbor as a measure of spatial relationships in populations. Ecology. 1954; 35:445–453. [Google Scholar]

[B31] 31. Ripley B.D. Spatial Statistics. 2005; Hoboken, NJ: John Wiley & Sons. [Google Scholar]

[B32] 32. Rey S.J., Anselin L.. Handbook of Applied Spatial Analysis. 2010; Berlin: Springer; 175–193. [Google Scholar]

[B33] 33. Shannon C.E. A mathematical theory of communication. Bell Syst. Tech. J. 1948; 27:379–423. [Google Scholar]

[B34] 34. Leibovici D.G., Claramunt C., Le Guyader D., Brosset D. Local and global spatio-temporal entropy indices based on distance-ratios and co-occurrences distributions. Int. J. Geogr. Inf. Sci. 2014; 28:1061–1084. [Google Scholar]

[B35] 35. Svensson V., Teichmann S.A., Stegle O.. SpatialDE: identification of spatially variable genes. Nat. Methods. 2018; 15:343–346. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B36] 36. Kats I., Vento-Tormo R., Stegle O.. SpatialDE2: fast and localized variance component analysis of spatial transcriptomics. 2021; bioRxiv doi:27 October 2021, preprint: not peer reviewed 10.1101/2021.10.27.466045. [DOI]

[B37] 37. Jackson H.W., Fischer J.R., Zanotelli V.R., Ali H.R., Mechera R., Soysal S.D., Moch H., Muenst S., Varga Z., Weber W.P.. The single-cell pathology landscape of breast cancer. Nature. 2020; 578:615–620. [DOI] [PubMed] [Google Scholar]

[B38] 38. Traag V.A., Waltman L., van Eck N.J.. From Louvain to Leiden: guaranteeing well-connected communities. Sci. Rep. 2019; 9:1–12. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B39] 39. Rotta R., Noack A.. Multilevel local search algorithms for modularity clustering. J. Exp. Algorithmics. 2011; 16:2.1–2.27. [Google Scholar]

[B40] 40. Rosvall M., Bergstrom C.T.. Maps of information flow reveal community structure in complex networks. PNAS. 2007; 105:1118–1123. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B41] 41. Blondel V., Guillaume J., Lambiotte R., Lefebvre E.. Fast unfolding of community hierarchies in large networks. J. Stat. Mech. 2008; 2008:P10008. [Google Scholar]

[B42] 42. West D.B. Introduction to Graph Theory. 2001; Upper Saddle River, NJ: Prentice Hall. [Google Scholar]

[B43] 43. Schapiro D., Jackson H.W., Raghuraman S., Fischer J.R., Zanotelli V.R., Schulz D., Giesen C., Catena R., Varga Z., Bodenmiller B.. histoCAT: analysis of cell phenotypes and interactions in multiplex image cytometry data. Nat. Methods. 2017; 14:873. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B44] 44. Moran P.A. Notes on continuous stochastic phenomena. Biometrika. 1950; 37:17–23. [PubMed] [Google Scholar]

[B45] 45. Geary R.C. The contiguity ratio and statistical mapping. Inc. Stat. 1954; 5:115–146. [Google Scholar]

PERMALINK

Aquila: a spatial omics database and analysis platform

Yimin Zheng

Yitian Chen

Xianting Ding

Koon Ho Wong

Edwin Cheung

Abstract

INTRODUCTION

Figure 1.

MATERIALS AND METHODS

Data collection and preprocessing

Determining the dimension of ROIs

Mixing colors for co-localization of markers

Co-expression analysis

Cell distribution pattern analysis

Ripley’s cell distribution analysis

Spatial entropy analysis

Spatial variable gene analysis

Constructing neighboring networks

Spatial community analysis

Cell centrality analysis

Cell–cell interaction analysis

Spatial co-expression analysis

Spatial autocorrelation analysis

Structure and implementations

DATABASE CONTENT AND USAGE

Querying and selecting data

Figure 2.

Visualizing spatial maps

Performing spatial analysis

Analyzing user data

CONCLUSION AND FUTURE DEVELOPMENT

DATA AVAILABILITY

ACKNOWLEDGEMENTS

Contributor Information

FUNDING

REFERENCES

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases