GRACE: a comprehensive web-based platform for integrative single-cell transcriptome analysis

Hao Yu; Yuqing Wang; Xi Zhang; Zheng Wang

doi:10.1093/nargab/lqad050

NAR Genom Bioinform. 2023 Jun 9;5(2):lqad050. doi: 10.1093/nargab/lqad050

PMCID: PMC10251641 PMID: 37305171

Abstract

Large-scale single-cell RNA sequencing (scRNA-seq) has emerged as a robust method for dissecting cellular heterogeneity at single-cell resolution. However, its computational demands are increasingly high, and researchers without programming expertise urgently need a user-friendly, scalable and accessible online platform for analyzing scRNA-seq data. Here, we present GRACE (GRaphical Analyzing Cell Explorer) (http://grace.flowhub.com.cn or http://grace.jflab.ac.cn:28080), a web-based platform that enables online analysis of massive single-cell transcriptome datasets and improves interactivity and reproducibility through high-quality visualization frameworks. GRACE provides easy access to interactive visualization, customized parameters and publication-quality graphs. Furthermore, it comprehensively integrates preprocessing, clustering, developmental trajectory inference, cell-cell communication, cell-type annotation, subcluster analysis and pathway enrichment. In addition to the website platform, we provide a Docker version that can be easily deployed on private servers. The source code for GRACE is freely available at https://github.com/th00516/GRACE. Documentation and video tutorials are accessible from the website homepage (http://grace.flowhub.com.cn). GRACE allows massive scRNA-seq data to be analyzed more flexibly and is accessible to the whole scientific community. This platform fills the major gap that exists between experimental (wet lab) and bioinformatic (dry lab) research.

INTRODUCTION

With the development of single-cell technology, massive high-throughput transcriptome analysis has provided opportunities for groundbreaking discoveries (1,2). Many recently developed pipelines or packages, such as Seurat (3), Monocle (4) and Scanpy (5), integrate multiple functional modules to support bioinformatics programming tasks. However, users still require programming skills (at least R or Python) and familiarity with a command-line interface to perform customized analysis. Additionally, wet labs may lack personnel with the expertise to analyze massive single-cell data and operate high-performance computing clusters. Thus, a major gap exists between experimental and bioinformatics research.

Several GUI-based platforms, such as Galaxy (6) (https://humancellatlas.usegalaxy.eu/), GranatumX (7), cellxgene (8), UCSC Cell Browser (9), ASAP (10), ICARUS (11) and SCP (https://singlecell.broadinstitute.org/single_cell), have been developed to analyze scRNA-seq data online. However, these platforms have not fully incorporated the current comprehensive scRNA-seq analysis tools. The lack of a batch-effect-removal function limits their scalability and utility in analyzing datasets from multiple samples (12). Without functional modularity, their toolkits are also difficult to update. Thus, a robust platform is still required to provide a concise and stable environment for the integrated analysis of multiple datasets.

To address these challenges, we introduce GRACE, a web interface-based platform that supports online analysis, annotation and visualization of scRNA-seq data. It comprehensively integrates the current standard pipelines in scRNA-seq analysis, empowering experimental researchers without programming skills to analyze and visualize single-cell transcriptome datasets. GRACE is a GUI and Shiny-based service that offers a user-friendly, scalable and accessible online platform for analyzing scRNA-seq data, filling the gap between experimental and bioinformatics research.

MATERIALS AND METHODS

Framework & design

GRACE is a code-free, open-source, all-in-one web platform designed to enable non-programming experts to analyze their single-cell transcriptome datasets. It employs Shiny (https://shiny.rstudio.com) to build the GUI and Plotly's graphing library (https://plotly.com/r) for data visualization. Each function module follows a similar template to improve development efficiency. The template comprises four main parts: ‘Act’ (activity) for specific action buttons, ‘Ctl’ (control) for special operations, and ‘Plt’ (plot) and ‘Tab’ (table) for generating visualized graphs and tables, respectively. GRACE uses Anndata (https://anndata.readthedocs.io) as its internal data format to enhance data processing efficiency. Within the internal data structure, ‘X’ stores the count matrix and ‘var’ stores each cell's metadata. GRACE uses the SQLite3 database engine (https://sqlite.org/index.html) to store meta-information about each dataset, including sample IDs, species, groups, creation times and file paths. GRACE uses the nextflow (13) package to facilitate online data uploading and internal data preprocessing. Furthermore, GRACE integrates best-practice pipelines and packages to achieve canonical analysis in a flexible and scalable way, including data preprocessing, dimension reduction, cell clustering & annotation, gene-expression analysis, trajectory analysis, cell-cell communication and single-cell gene ontology analysis (Figure 1).
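As an illustration of the dataset meta-information store described above, the following minimal sketch uses Python's built-in sqlite3 module. The table name, column names, helper functions and file paths are hypothetical and chosen only for illustration; the actual GRACE schema is not described here and may differ.

```python
import sqlite3
from datetime import datetime, timezone

# Hypothetical schema: table and column names are illustrative only,
# not the actual GRACE schema. "grp" is used because GROUP is an SQL keyword.
SCHEMA = """
CREATE TABLE IF NOT EXISTS datasets (
    sample_id  TEXT PRIMARY KEY,
    species    TEXT NOT NULL,
    grp        TEXT NOT NULL,
    created_at TEXT NOT NULL,
    file_path  TEXT NOT NULL
)
"""


def register_dataset(conn, sample_id, species, group, file_path):
    """Insert one dataset record with a UTC creation timestamp."""
    created_at = datetime.now(timezone.utc).isoformat()
    conn.execute(
        "INSERT INTO datasets VALUES (?, ?, ?, ?, ?)",
        (sample_id, species, group, created_at, file_path),
    )
    conn.commit()


def datasets_in_group(conn, group):
    """Return the sample IDs registered under a group, sorted by ID."""
    rows = conn.execute(
        "SELECT sample_id FROM datasets WHERE grp = ? ORDER BY sample_id",
        (group,),
    )
    return [row[0] for row in rows]


# In-memory database with two records named after the lymphoma samples;
# the file paths are placeholders.
conn = sqlite3.connect(":memory:")
conn.execute(SCHEMA)
register_dataset(conn, "HC01", "human", "healthy", "/data/HC01.csv")
register_dataset(conn, "MA08", "human", "lymphoma", "/data/MA08.csv")
```

Keeping this meta-information in a single-file database means that dataset records can be queried by group or species without loading the expression matrices themselves.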

Figure 1. — Summary of the workflow of GRACE. The analysis workflow of GRACE is divided into three parts: back-end development (framework and embedded software), front-end visualization and functional design. The back-end development utilizes the R-shiny and R-plotly packages to build the GUI framework. A unified module template is first built to implement each functional page, and then R packages such as Seurat and Monocle2 are incorporated into each page. The GUI framework is responsible for building the front-end graphical operation interface and data visualization. In terms of functional design, GRACE aims to provide comprehensive functions, including data preprocessing, cell clustering, and trajectory analysis.

Best-practice pipelines with high performance

Compared with similar software, GRACE offers high-performance pipelines that provide a seamless transition from upstream to downstream data analysis (Supplementary Table 1). At each step of scRNA-seq analysis, GRACE integrates the canonical software packages or modules currently available, with unmodified code, to achieve robustness and reproducibility; the only exception is single-cell pathway enrichment. The specific packages and modules are detailed in Supplementary Table 2. For example, GRACE integrates SCTransform (14) and DoubletFinder (15) for data standardization and doublet removal. To improve the flexibility of sample integration, five optional methods are available for batch effect removal: RPCA in Seurat v4 (3), FastMNN (16), Harmony (17), scVI (18) and scANVI (18). All of these tools are encapsulated in a nextflow pipeline for streamlined and efficient data processing. In the ‘Feature Selection’ module, GRACE provides three options: (1) selecting the top N highly variable genes (HVGs) by calling Seurat's FindVariableFeatures function; (2) selecting genes by model-based selection with the scikit-learn package, using the LinearSVC and ExtraTreesClassifier algorithms; and (3) directly uploading a customized gene list as the selected features for downstream analysis. GRACE also provides the ‘Feature filtering’ module to filter out irrelevant or noisy genes, such as cell-cycle genes, ribosomal genes or a customized gene set uploaded by users. For ‘cell-cell communication’ analysis, GRACE incorporates CellPhoneDB (19), CellChat (20) and CellCall (21) to calculate the strength of intercellular interactions and identify differential ligand-receptor pairs.
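The idea behind the ‘Feature filtering’ step can be sketched in a few lines of Python. This is a simplified illustration rather than the GRACE implementation: the function name is hypothetical, the ribosomal pattern is a common naming heuristic for human gene symbols (it can also catch non-ribosomal genes such as RPS6KA1), and the custom exclusion list stands in for a gene set uploaded by the user.

```python
import re

# Assumption: human ribosomal protein genes are recognised by the symbol
# prefixes RPS/RPL. This is a widely used heuristic, not an exact definition.
RIBOSOMAL_PATTERN = re.compile(r"^RP[SL]")


def filter_features(genes, custom_exclude=()):
    """Drop ribosomal genes and any genes listed in a user-supplied set.

    The input order of the remaining genes is preserved.
    """
    excluded = set(custom_exclude)
    return [
        gene
        for gene in genes
        if not RIBOSOMAL_PATTERN.match(gene) and gene not in excluded
    ]


# Toy feature list; MKI67 and TOP2A stand in for a user-uploaded
# cell-cycle gene set.
features = ["CD8A", "RPS6", "RPL13A", "MKI67", "NKG7", "TOP2A"]
kept = filter_features(features, custom_exclude=["MKI67", "TOP2A"])
```

With this toy input, only CD8A and NKG7 remain as features for downstream analysis.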

To increase the efficiency of subclustering, GRACE provides an all-in-one pipeline for subclustering analyses, including reconstruction of lineage trajectories, intra-type cell communication and single-cell pathway enrichment. This section integrates several packages: Monocle2 (4), SPRING (22), Seurat v4 (3), CellPhoneDB (19), CellChat (20) and CellCall (21). In the ‘Developmental trajectory’ section, Monocle2 (4) and SPRING (22) enable trajectory inference based on reversed graph embedding or force-directed graphs, respectively. Notably, GRACE provides a ‘single-cell pathway enrichment’ module to investigate the pathways of biological processes and the expression of specific genes across different states along the developmental trajectory, which helps users interpret their data comprehensively and depict cellular heterogeneity at single-cell resolution. GRACE uses all the HVGs in the first N PCs (N can be set by the user) as input. Gene-set enrichment is first performed to classify the input genes and generate annotated terms. The background value is the number of genes in one cell relative to the total number of genes in the population. At the single-cell level, the foreground value is the number of genes in each pathway relative to the corresponding number among the population's total genes. The F-score is the ratio of the foreground value to the background value. An adjusted F-score is used to reduce false positives and minimize false negatives. The clusterProfiler package (23) performs the main steps of gene-set enrichment analysis.
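The F-score described above can be illustrated with a small Python sketch. It rests on one possible reading of the text, which is an assumption: the background is the fraction of population genes detected in a cell, and the foreground is the fraction of a pathway's genes (restricted to the population genes) detected in that cell. The function name and the toy gene sets are hypothetical, and the adjusted F-score is not reproduced because its formula is not given.

```python
def f_score(cell_genes, pathway_genes, population_genes):
    """Ratio of foreground to background for one cell and one pathway.

    Assumed definitions (one interpretation of the description):
      background = |genes detected in the cell| / |population genes|
      foreground = |pathway genes detected in the cell| / |pathway genes in population|
    """
    population = set(population_genes)
    detected = set(cell_genes) & population
    pathway = set(pathway_genes) & population
    if not detected or not pathway:
        # No detected genes or no usable pathway genes: no enrichment signal.
        return 0.0
    background = len(detected) / len(population)
    foreground = len(detected & pathway) / len(pathway)
    return foreground / background


# Toy example with placeholder gene names g0..g9.
population = [f"g{i}" for i in range(10)]
cell = ["g0", "g1", "g2", "g3", "g4"]   # 5 of 10 genes detected -> background 0.5
pathway = ["g0", "g1", "g2", "g9"]      # 3 of 4 pathway genes detected -> foreground 0.75
score = f_score(cell, pathway, population)  # 0.75 / 0.5 = 1.5
```

Under this reading, a score above 1 means the pathway's genes are detected in the cell more often than expected from the cell's overall detection rate.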

GRACE offers two automated methods for ultra-fast cell-type annotation, in addition to manual annotation using prior knowledge. The first method, ‘Reference-based Annotation’, uses the SingleR package (24) to annotate cells based on transcriptome datasets of pure cell types. It employs the ‘Human Primary Cell Atlas Data’ and ‘Mouse RNAseq Data’ references, which are built into the celldex package (24). The second method, ‘Atlas-based Annotation’, uses the Weighted Nearest Neighbor (WNN) algorithm (3) to map query scRNA-seq data onto a well-annotated single-cell transcriptome atlas. In this module, GRACE offers four pre-generated atlases that can be accessed through the link (https://azimuth.hubmapconsortium.org). Users can also supply a customized annotated reference database instead of the built-in references of the SingleR package. Data visualization is achieved using the Plotly package (https://plotly.com/r).

Dataset demonstration

The classical ‘PBMC 3k’ dataset (https://cf.10xgenomics.com/samples/cell/pbmc3k/) was converted into HDF5 format with DropletUtils (v1.14.2) (25) and fed into GRACE as a one-group dataset (25,26). The public ‘lymphoma’ scRNA-seq dataset (27) represents a two-group dataset comprising ‘HC01’ (healthy) and ‘MA08’ (lymphoma); it was saved as a CSV file. Our GitHub repository contains all test data (https://github.com/th00516/GRACE/tree/main/testdata). For atlas-based annotation, the query dataset was mapped onto an online annotated atlas, ‘PBMC 10k’ (https://support.10xgenomics.com/single-cell-gene-expression/datasets/).

Availability

All source code and the Docker image can be freely downloaded from GitHub (https://github.com/th00516/GRACE). The instructions and online video tutorials are available on the website homepage (http://grace.flowhub.com.cn). The atlas reference is available via a Dropbox sharing link (https://www.dropbox.com/sh/zvsziyryjunqglm/AABY4sPTvGQPyxI5PlWzqBWXa?dl=0).

RESULTS

Case study 1: ‘PBMC 3k’ as one-group dataset

This case study demonstrates best-practice preprocessing and cell-type annotation methods. Data preprocessing, including removal of lowly expressed genes, doublets and low-quality cells, as well as dimensional reduction, was performed automatically after dataset upload by the internal nextflow (13) pipeline (Figure S2A). For cell-type annotation, we first applied the reference-based method with the SingleR default reference after unsupervised clustering. However, one cluster remained unannotated (Figure S2B). To resolve this, we used the ‘Atlas-based Annotation’ module of GRACE to map the query dataset onto an annotated atlas (‘PBMC 10k’). This module annotated the unknown cluster as effector memory CD8+ T cells, which was confirmed by marker genes (CD8A, CD8B, GZMB and NKG7) (28) (Figure S2C–E). The co-expression analysis module further allowed investigation of potential molecular relationships within the selected subset (Figure S2F).

Case study 2: ‘lymphoma’ as two-group dataset

This case study demonstrates the functional modules used for analyzing a two-group dataset. Sample integration and batch effect correction were achieved using the RPCA algorithm (3) and the nextflow package (13) (Figure S3A). After quality control, dimension reduction and cell-type annotation (Figure S1), ‘DC’ cells were selected to examine the subclustering function. Marker gene expression and automatic cell-type annotation were used together to annotate the DC subsets (Figure S3A). The integrated Monocle2 package (4) was then used to study the trajectory of DC subsets across different lineages. Two conventional dendritic cell (cDC) lineages could be identified (29): the cDC1 lineage (expressing CLEC9A and XCR1) and the cDC2 lineage (expressing CD11b/ITGAM and CD172a/SIRPA) (Figure 2A, S3B). To better visualize these fate-defined subsets, a tree with two branches corresponding to the two major lineages was generated independently (Figure 2A, S3B). As expected, the cDC1 lineage (including subC03, subC06 and subC07) can easily be distinguished from the cDC2 lineage (Figure S3B). GRACE also integrates SPRING, which reconstructed the kNN graph for ‘DC’ cells and mapped Monocle2's pseudotime onto it (Figure 2B). The expression level of CLEC9A was mapped onto both the Monocle2 DDRTree and the SPRING kNN graph (Figure 2B, C). The red circle indicates subcluster 3 at resolution 0.2, which was annotated as DC:monocyte-derived (Figure 2B, C). Furthermore, the expression of selected genes can be viewed to study their potential roles in lineage differentiation (Figure 2C, Supplementary Figure S3B).

Figure 2. — The developmental trajectory module in GRACE integrates both Monocle2 and SPRING to provide a comprehensive analysis of single-cell transcriptome data. (A) In the Monocle2 section, the analysis was performed on the DC cluster of the integrated HC01 and MA08 samples. The main results include pseudotime, Monocle state, and the mapping of DC subclusters onto the DDRTree (Subclusters). The Monocle state and subclusters support two-group comparisons, with HC01 representing group 1 and MA08 representing group 2. (B) In the SPRING section, six groups of parameters can be set. GRACE integrates the original parameters of the SPRING Viewer, which can be used to adjust the force-directed graph. The SPRING Viewer supports displaying gene expression, such as the expression of CLEC9A. The original cell labels are used in GRACE to store cell type/cluster information, while the original custom colors are used to store pseudotime information from Monocle2. This enables joint analysis of Monocle2 and SPRING results. (C) The expression level of CLEC9A mapped onto the DDRTree corresponds to the red circle in the kNN graph in (B), which represents subcluster 3 at a resolution of 0.2. This subcluster was annotated as DC:monocyte-derived.

GRACE provides a module to perform pathway enrichment at single-cell resolution. The fold changes and P-values of each gene set can be calculated for individual cells (Figure S4A). Differences in biological processes among subtypes or along the developmental trajectory can then be observed (Figure 2C, S3C). Simultaneous visualization of gene expression in a selected pathway (for instance, CCR1, CCL2 and CXCL2 from the GO:1990869 pathway, shown on both UMAP and DDRTree plots) facilitates empirical investigation of the relationship between subsets and diverse molecular signatures (Figure S4B).

GRACE provides two ways to study cell-cell communication. First, users can analyze all known ligand-receptor pairs between any two cell types (Figure 3A). For example, the CXCL12–CXCR4 axis showed a stronger interaction between fibroblasts and B cells (Figure 3A); this axis has already been considered an attractive target for cancer therapies (30). Second, GRACE supports investigating signaling between each pair of cell types for one specific ligand-receptor pair (Figure 3A). For instance, the interaction between B cells and fibroblasts was enhanced in lymphoma tissue compared with healthy tissue (Figure 3A).
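To make the notion of an interaction score for one ligand-receptor pair concrete, the following Python sketch averages the mean ligand expression in the sender cell type and the mean receptor expression in the receiver cell type, in the spirit of the mean-expression score reported by CellPhoneDB. It is a deliberately simplified illustration: the function names are hypothetical, the expression values are toy numbers rather than results from the lymphoma dataset, and the permutation test that CellPhoneDB uses to assign P-values is omitted.

```python
from statistics import mean


def cluster_mean(expr, labels, gene, cluster):
    """Mean expression of one gene over the cells labelled with one cluster."""
    values = [expr[gene][i] for i, label in enumerate(labels) if label == cluster]
    return mean(values) if values else 0.0


def lr_score(expr, labels, ligand, receptor, sender, receiver):
    """Average of ligand mean (sender) and receptor mean (receiver).

    The score is set to zero when either partner is not expressed,
    because an interaction needs both molecules.
    """
    ligand_mean = cluster_mean(expr, labels, ligand, sender)
    receptor_mean = cluster_mean(expr, labels, receptor, receiver)
    if ligand_mean == 0 or receptor_mean == 0:
        return 0.0
    return (ligand_mean + receptor_mean) / 2


# Toy data: four cells, two per cell type; values are made up for illustration.
labels = ["fibroblast", "fibroblast", "B", "B"]
expr = {
    "CXCL12": [4.0, 2.0, 0.0, 0.0],
    "CXCR4": [0.0, 0.0, 3.0, 1.0],
}

fib_to_b = lr_score(expr, labels, "CXCL12", "CXCR4", "fibroblast", "B")  # (3.0 + 2.0) / 2 = 2.5
b_to_fib = lr_score(expr, labels, "CXCL12", "CXCR4", "B", "fibroblast")  # ligand absent in B cells -> 0.0
```

The score is directional: swapping the sender and receiver cell types generally gives a different value, which is why the control panel asks which cell type provides the ligand and which the receptor.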

Figure 3. — Demonstration of the cell-to-cell communication module, including CellPhoneDB, CellChat and CellCall. (A) Workflow of CellPhoneDB analysis. When two cell types are defined for prediction, the control panel shows which cell type provides the ‘Ligand’ and which the ‘Receptor’. The dotplot shows that the ‘healthy’ group (HC01) had significantly different ligand-receptor profiles compared with the ‘lymphoma’ group (MA08). A second control panel shows how to analyze a specific ligand-receptor pair, using the CXCL12–CXCR4 pair as an example. The dotplot shows the relative interaction of each cell group based on the CXCL12–CXCR4 pair. Circle size represents the P-value, and color represents the average expression level of the ligand-receptor pair. (B) Workflow of CellChat analysis. Before running CellChat, users need to set the species information and database type in the control panel. The CellChat results can be displayed by activated pathway. Taking the pbmc3k dataset as an example, GRACE provides the activation level of different ligand-receptor pairs between cell types (bubble plot), the strength of interactions between cell types (network and heatmap), and the differential expression of markers in the selected activated pathway between cell types (expression of markers). (C) Workflow of CellCall analysis. CellCall requires users to provide species information and an FDR threshold (control panel of CellCall). Using the pbmc3k dataset as an example, GRACE provides the activation levels of different ligand-receptor pairs between cell types (bubble plot) and the strength of the interaction from ligand to receptor between cell types (circle plot).

DISCUSSION

Single-cell transcriptome technology requires comprehensive and sophisticated computational software for data integration, processing, analysis, mining and interpretation. In this study, we developed the web-based platform GRACE, based on a GUI computing server, to bridge the gap between experimental and bioinformatics research. It provides a user-friendly framework for investigating cellular heterogeneity, discovering cell subtypes and revealing important biological processes.

This platform offers several improvements over existing tools. Among the platforms compared (Supplementary Table 1), GRACE is the only one that offers comprehensive analysis capabilities, including doublet removal, dimensionality reduction, batch effect removal, clustering, cell annotation, developmental trajectory and cell-to-cell communication analysis. It is also the only one with data management and long-term data storage capabilities, which enable users to easily modify parameters or switch to different software or algorithms. In addition to incorporating more software packages at each step to provide a greater range of analysis methods, GRACE allows users to customize parameter settings as much as possible. The fully integrated analysis offers greater versatility than module-assembly platforms such as Galaxy (6) or GranatumX (7), which only allow users to perform a few analysis tasks at a time. Furthermore, GRACE is the only platform that provides subcluster analysis, which enables in-depth investigation, particularly after cell-type annotation (Supplementary Table 1). The heterogeneity and functionality of cellular subpopulations are currently active topics in cutting-edge research (31–33). Moreover, the all-in-one format of GRACE is advantageous for linking data analysis and visualization: compared with UCSC Cell Browser (9), SCP (https://singlecell.broadinstitute.org/single_cell) and cellxgene (8), GRACE allows seamless linkage that enriches the user experience. GRACE also supports multiple data formats frequently used in scRNA-seq analysis, including tab-separated value/text (tsv/txt) and comma-separated value (csv) files downloaded directly from the GEO database, the RDS format used as input for the Seurat package (3), and the hierarchical data format (HDF5) generated by the 10x Cellranger package. Finally, many best-practice pipelines were implemented in GRACE, contributing to scalable, reproducible and straightforward development on both the online platform and local servers. In conclusion, GRACE provides easy access to interactive visualization, customized parameters and publication-ready graphs for scRNA-seq analysis.

DATA AVAILABILITY

All source code and the Docker image can be freely downloaded from GitHub (https://github.com/th00516/GRACE) and Zenodo (https://doi.org/10.5281/zenodo.7923930). The instructions and online video tutorials are available on the website homepage (http://grace.flowhub.com.cn). The atlas reference is available via a Dropbox sharing link (https://www.dropbox.com/sh/zvsziyryjunqglm/AABY4sPTvGQPyxI5PlWzqBWXa?dl=0) and on Zenodo (https://doi.org/10.5281/zenodo.7923930).

ACKNOWLEDGEMENTS

Author contributions: Z.W. and H.Y. designed the project. H.Y. and Z.W. developed the GRACE software. Z.W., X.Z. and H.Y. wrote the manuscript and organized all figures. Y.W. designed and developed the project website and edited the tutorial videos. All authors discussed the results and commented on the manuscript.

Contributor Information

Hao Yu, Medical Center of Hematology, Second Affiliated Hospital, Army Medical University, Chongqing 400037, China; State Key Laboratory of Trauma, Burn and Combined Injury, Army Medical University, Chongqing 400037, China; Bio-Med Informatics Research Center & Clinical Research Center, The Second Affiliated Hospital, Army Medical University, Chongqing 400037, China; Jinfeng Laboratory, Chongqing 401329, China.

Yuqing Wang, Medical Center of Hematology, Second Affiliated Hospital, Army Medical University, Chongqing 400037, China; State Key Laboratory of Trauma, Burn and Combined Injury, Army Medical University, Chongqing 400037, China.

Xi Zhang, Medical Center of Hematology, Second Affiliated Hospital, Army Medical University, Chongqing 400037, China; State Key Laboratory of Trauma, Burn and Combined Injury, Army Medical University, Chongqing 400037, China; Jinfeng Laboratory, Chongqing 401329, China.

Zheng Wang, Medical Center of Hematology, Second Affiliated Hospital, Army Medical University, Chongqing 400037, China; State Key Laboratory of Trauma, Burn and Combined Injury, Army Medical University, Chongqing 400037, China; Bio-Med Informatics Research Center & Clinical Research Center, The Second Affiliated Hospital, Army Medical University, Chongqing 400037, China; Jinfeng Laboratory, Chongqing 401329, China.

SUPPLEMENTARY DATA

Supplementary Data are available at NARGAB Online.

FUNDING

National Key R&D Program of China [2022YFA1103300, 2022YFA1103303, 2022YFA1103304]; National Natural Science Foundation of China [82020108004]; Natural Science Foundation of Chongqing [cstc2019jcyj-msxmX0421]; Translational Research Grant of NCRCH [2020ZKZC02]; Youth Talent Development Program from Second Affiliated Hospital, Army Medical University [2022YQB014].

Conflict of interest statement. None declared.

REFERENCES

1. Lei Y., Tang R., Xu J., Wang W., Zhang B., Liu J., Yu X., Shi S. Applications of single-cell sequencing in cancer research: progress and perspectives. J. Hematol. Oncol. 2021; 14:91.
2. Guo R., Lü M., Cao F., Wu G., Gao F., Pang H., Li Y., Zhang Y., Xing H., Liang C. et al. Single-cell map of diverse immune phenotypes in the acute myeloid leukemia microenvironment. Biomark. Res. 2021; 9:15.
3. Hao Y., Hao S., Andersen-Nissen E., Mauck W.M., Zheng S., Butler A., Lee M.J., Wilk A.J., Darby C., Zager M. et al. Integrated analysis of multimodal single-cell data. Cell. 2021; 184:3573–3587.
4. Qiu X., Mao Q., Tang Y., Wang L., Chawla R., Pliner H.A., Trapnell C. Reversed graph embedding resolves complex single-cell trajectories. Nat. Methods. 2017; 14:979–982.
5. Wolf F.A., Angerer P., Theis F.J. SCANPY: large-scale single-cell gene expression data analysis. Genome Biol. 2018; 19:15.
6. Giardine B., Riemer C., Hardison R.C., Burhans R., Elnitski L., Shah P., Zhang Y., Blankenberg D., Albert I., Taylor J. et al. Galaxy: a platform for interactive large-scale genome analysis. Genome Res. 2005; 15:1451–1455.
7. Garmire D.G., Zhu X., Mantravadi A., Huang Q., Yunits B., Liu Y., Wolfgruber T., Poirion O., Zhao T., Arisdakessian C. et al. GranatumX: a community-engaging, modularized, and flexible webtool for single-cell data analysis. Genomics Proteomics Bioinformatics. 2021; 19:452–460.
8. Megill C., Martin B., Weaver C., Bell S., Prins L., Badajoz S., McCandless B., Pisco A.O., Kinsella M., Griffin F. et al. cellxgene: a performant, scalable exploration platform for high dimensional sparse matrices. bioRxiv doi: 10.1101/2021.04.05.438318, 06 April 2021, preprint: not peer reviewed.
9. Speir M.L., Bhaduri A., Markov N.S., Moreno P., Nowakowski T.J., Papatheodorou I., Pollen A.A., Raney B.J., Seninge L., Kent W.J. et al. UCSC Cell Browser: visualize your single-cell data. Bioinformatics. 2021; 37:4578–4580.
10. Gardeux V., David F.P.A., Shajkofci A., Schwalie P.C., Deplancke B. ASAP: a web-based platform for the analysis and interactive visualization of single-cell RNA-seq data. Bioinformatics. 2017; 33:3123–3125.
11. Jiang A., Lehnert K., You L., Snell R.G. ICARUS, an interactive web server for single cell RNA-seq analysis. Nucleic Acids Res. 2022; 50:W427–W433.
12. Tran H.T.N., Ang K.S., Chevrier M., Zhang X., Lee N.Y.S., Goh M., Chen J. A benchmark of batch-effect correction methods for single-cell RNA sequencing data. Genome Biol. 2020; 21:12.
13. Di Tommaso P., Chatzou M., Floden E.W., Barja P.P., Palumbo E., Notredame C. Nextflow enables reproducible computational workflows. Nat. Biotechnol. 2017; 35:316–319.
14. Hafemeister C., Satija R. Normalization and variance stabilization of single-cell RNA-seq data using regularized negative binomial regression. Genome Biol. 2019; 20:296.
15. McGinnis C.S., Murrow L.M., Gartner Z.J. DoubletFinder: doublet detection in single-cell RNA sequencing data using artificial nearest neighbors. Cell Syst. 2019; 8:329–337.
16. Haghverdi L., Lun A.T.L., Morgan M.D., Marioni J.C. Batch effects in single-cell RNA-sequencing data are corrected by matching mutual nearest neighbors. Nat. Biotechnol. 2018; 36:421–427.
17. Korsunsky I., Millard N., Fan J., Slowikowski K., Zhang F., Wei K., Baglaenko Y., Brenner M., Loh P., Raychaudhuri S. Fast, sensitive and accurate integration of single-cell data with Harmony. Nat. Methods. 2019; 16:1289–1296.
18. Gayoso A., Lopez R., Xing G., Boyeau P., Valiollah Pour Amiri V., Hong J., Wu K., Jayasuriya M., Mehlman E., Langevin M. et al. A Python library for probabilistic analysis of single-cell omics data. Nat. Biotechnol. 2022; 40:163–166.
19. Efremova M., Vento-Tormo M., Teichmann S.A., Vento-Tormo R. CellPhoneDB: inferring cell–cell communication from combined expression of multi-subunit ligand–receptor complexes. Nat. Protoc. 2020; 15:1484–1506.
20. Jin S., Guerrero-Juarez C.F., Zhang L., Chang I., Ramos R., Kuan C.-H., Myung P., Plikus M.V., Nie Q. Inference and analysis of cell-cell communication using CellChat. Nat. Commun. 2021; 12:1088.
21. Zhang Y., Liu T., Hu X., Wang M., Wang J., Zou B., Tan P., Cui T., Dou Y., Ning L. et al. CellCall: integrating paired ligand–receptor and transcription factor activities for cell–cell communication. Nucleic Acids Res. 2021; 49:8520–8534.
22. Weinreb C., Wolock S., Klein A.M. SPRING: a kinetic interface for visualizing high dimensional single-cell expression data. Bioinformatics. 2018; 34:1246–1248.
23. Yu G., Wang L.-G., Han Y., He Q.-Y. clusterProfiler: an R Package for comparing biological themes among gene clusters. OMICS J. Integr. Biol. 2012; 16:284–287.
24. Aran D., Looney A.P., Liu L., Wu E., Fong V., Hsu A., Chak S., Naikawadi R.P., Wolters P.J., Abate A.R. et al. Reference-based analysis of lung single-cell sequencing reveals a transitional profibrotic macrophage. Nat. Immunol. 2019; 20:163–172.
25. Lun A.T.L., Riesenfeld S., Andrews T., Dao T.P., Gomes T., participants in the 1st Human Cell Atlas Jamboree, Marioni J.C. EmptyDrops: distinguishing cells from empty droplets in droplet-based single-cell RNA sequencing data. Genome Biol. 2019; 20:63.
26. Griffiths J.A., Richard A.C., Bach K., Lun A.T.L., Marioni J.C. Detection and removal of barcode swapping in single-cell RNA-seq data. Nat. Commun. 2018; 9:2667.
27. Rindler K., Jonak C., Alkon N., Thaler F.M., Kurz H., Shaw L.E., Stingl G., Weninger W., Halbritter F., Bauer W.M. et al. Single-cell RNA sequencing reveals markers of disease progression in primary cutaneous T-cell lymphoma. Mol. Cancer. 2021; 20:124.
28. Hornburg M., Desbois M., Lu S., Guan Y., Lo A.A., Kaufman S., Elrod A., Lotstein A., DesRochers T.M., Munoz-Rodriguez J.L. et al. Single-cell dissection of cellular components and interactions shaping the tumor immune phenotypes in ovarian cancer. Cancer Cell. 2021; 39:928–944.
29. Villani A.-C., Satija R., Reynolds G., Sarkizova S., Shekhar K., Fletcher J., Griesbeck M., Butler A., Zheng S., Lazo S. et al. Single-cell RNA-seq reveals new types of human blood dendritic cells, monocytes, and progenitors. Science. 2017; 356:eaah4573.
30. AlMusawi S., Ahmed M., Nateri A.S. Understanding cell-cell communication and signaling in the colorectal cancer microenvironment. Clin. Transl. Med. 2021; 11:e308.
31. Kinker G.S., Greenwald A.C., Tal R., Orlova Z., Cuoco M.S., McFarland J.M., Warren A., Rodman C., Roth J.A., Bender S.A. et al. Pan-cancer single-cell RNA-seq identifies recurring programs of cellular heterogeneity. Nat. Genet. 2020; 52:1208–1218.
32. Brown C.C., Gudjonson H., Pritykin Y., Deep D., Lavallée V.-P., Mendoza A., Fromme R., Mazutis L., Ariyan C., Leslie C. et al. Transcriptional basis of mouse and human dendritic cell heterogeneity. Cell. 2019; 179:846–863.
33. Papalexi E., Satija R. Single-cell RNA sequencing to explore immune cell heterogeneity. Nat. Rev. Immunol. 2018; 18:35–45.

Supplementary Materials

lqad050_Supplemental_Files (zip, 2.1 MB)
