Skip to main content
BMC Bioinformatics logoLink to BMC Bioinformatics
. 2023 Nov 7;24:420. doi: 10.1186/s12859-023-05490-y

Sctensor detects many-to-many cell–cell interactions from single cell RNA-sequencing data

Koki Tsuyuzaki 1,2,, Manabu Ishii 1, Itoshi Nikaido 1,3,
PMCID: PMC10631077  PMID: 37936079

Abstract

Background

Complex biological systems are described as a multitude of cell–cell interactions (CCIs). Recent single-cell RNA-sequencing studies focus on CCIs based on ligand–receptor (L–R) gene co-expression but the analytical methods are not appropriate to detect many-to-many CCIs.

Results

In this work, we propose scTensor, a novel method for extracting representative triadic relationships (or hypergraphs), which include ligand-expression, receptor-expression, and related L–R pairs.

Conclusions

Through extensive studies with simulated and empirical datasets, we have shown that scTensor can detect some hypergraphs that cannot be detected using conventional CCI detection methods, especially when they include many-to-many relationships. scTensor is implemented as a freely available R/Bioconductor package.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12859-023-05490-y.

Keywords: Single-cell RNA-sequencing, Cell–cell interaction, Hypergraph, Dimension reduction, Tensor decomposition, Non-negative Tucker2 decomposition, R/Bioconductor

Background

Complex biological systems and processes such as tissue homeostasis [1, 2], neurotransmission [3, 4], immune response [5], ontogenesis [6], and stem cell niches niche [7, 8] are composed of cell–cell interactions (CCIs). Many molecular biology studies have decomposed such systems into constituent parts (e.g., genes, proteins, and metabolites) to clarify their functions. Nevertheless, more sophisticated methodologies are required because CCIs essentially differentiate whole systems from functioning merely as the sum of their parts. Accordingly, micro-level measurements of such parts cannot always explain macro-level biological functions.

Previous studies have investigated CCIs using technologies such as fluorescence microscopy [913], microdevice-based methods such as microwells, micropatterns, single-cell traps, droplet microfluidics, and micropillars [1422], and transcriptome-based methods [2352]. In particular, the recent single-cell RNA-sequencing (scRNA-seq) studies have focused on CCIs based on ligand–receptor (L–R) gene co-expression. By investigating the detected cell types through scRNA-seq and the L–R pairs specifically expressed in the cell types, CCIs can potentially be understood at high resolution.

Despite their wide usage, the analytical methods based on L–R pairs are still not mature; such methods implicitly assume that CCIs consist of one-to-one relationships between two cell types and that the corresponding L–R co-expression is observed in a cell-type-specific manner. One study even removed ligand and receptor genes expressed in multiple cell types from their data matrix, assuming one-to-one CCIs [53]. In real empirical data, however, each ligand and receptor gene can be expressed across multiple cell types, and some studies have actually focused on many-to-many CCIs [25, 33, 36, 48, 54]. Such a difference between actual CCI patterns composed of real data and the hypothesis assumed by a model will cause severe bias in the detection of CCIs.

For the above reason, we propose scTensor, which is a novel CCI prediction method based on a tensor decomposition algorithm. Our method regards CCIs as hypergraphs and extracts some representative triadic relationships consisting of ligand-expression, receptor-expression, and related L–R pairs. The main contributions of this article are summarized as follows.

  • We developed a novel simulator to model the CCIs as hypergraphs and quantitatively evaluate the performance of scTensor and other L–R detection methods.

  • We re-implement some L–R detection methods from scratch in order to analyze the same L–R database with all of these methods and focus on only the performance of L–R detection methods, not the slight differences in data pre-processing and the L–R database used.

  • We show that scTensor’s performance with respect to its accuracy of many-to-many CCI detection, computation time, and memory usage are superior to the other L–R detection methods.

  • We describe the implementation of scTensor as an R/Bioconductor package to enable the reproducibility of data analyses as well as continuous maintenance and improvements. We provide some original visualization functions and a function to generate an HTML report in scTensor to enable detailed interpretation of the results. We have extended our framework to work with 125 species.

Results

CCI as a hypergraph

One of the simplest CCI representations is a directed graph, where each node represents a cell type and each edge represents the co-expression of all L–R pairs (Fig. 1a, left). The direction of each edge is set as the ligand expressing cell type the receptor-expressing cell type. Such a data structure corresponds to an asymmetric adjacency matrix, in which each row and column represents a ligand-expressing cell type and receptor-expressing cell type, respectively. If some combinations of cell types are regarded as interacting, the corresponding elements of the matrix are filled with 1 and otherwise 0. If the degree of CCI is not a binary relationship, weighted graphs and corresponding weighted adjacent matrices may also be used. The previous analytical methods are categorized within this approach [23, 24, 2634, 36, 40, 4346, 48, 49, 51, 52, 55].

Fig. 1.

Fig. 1

Cell–cell interaction (CCI) as a hypergraph (CaH). a Previous scRNA-seq studies have regarded CCIs as graphs, and the corresponding data structure can be expressed as an adjacency matrix (left). In this work, CCIs are regarded as context-aware edges (hypergraphs), and the corresponding data structure is a tensor (right). b The CCI-tensor is generated by users’ scRNA-Seq matrices, cell-type labels, and ligand–receptor (L–R) databases. NTD-2 is used to extract CaHs from the CCI-tensor. c Each CaH(r1,r2) is equal to the outer product of three vectors. A:r1(1) represents the ligand expression pattern, A:r2(2) represents the receptor expression pattern, and Gr1,r2,: represents the patterns of related L–R pairs

The drawback of using an adjacency matrix to describe CCIs is that multiple L–R co-expression scores are collapsed into a single value by summation or averaging. Because the average is simply a constant multiple of the sum, here we discuss only the sum. The summed value has no meaning in which L–R pairs are related to the CCI, and therefore CCIs and the related L–R pair lists cannot be detected simultaneously.

In contrast to an adjacency matrix (i.e., graph), the triadic relationship of CCIs also can be described as directed hypergraphs (i.e., CCI as hypergraph; CaH), where each node is a cell type but the edges are distinguished from each other by the different related L–R pair sets (Fig. 1a, right). Such a context-aware edge is called a “hyperedge” and is described as multiple different adjacency matrices. The set of matrices corresponds to a “tensor”, which is a generalization of a matrix to expand its order.

Overview of scTensor

Here we introduce the procedure of scTensor. Firstly, a tensor data is constructed through the following steps (Fig. 1b). A scRNA-seq matrix and the cellular labels specifying cell types are supposed to be provided by users. Firstly, the gene expression values of each cell are normalized by count per median of library size (CPMED [5658]) and logarithm transformation, for variance-stabilization, is performed to the data matrix [i.e., log10(CPMED+1)].

Next, the data matrix is converted to a cell-type-level average matrix according to the cell type labels. Combined with an L–R database, two corresponding row-vectors of an L–R pair are extracted from the matrix. The outer product (direct product) of the two vectors is calculated, and a matrix is generated. The matrix can be considered as the similarity matrix of all possible cell-type combinations for each L–R pair. Finally, for each L–R pair, the matrix is calculated, and the tensor χRJ×J×K, where J is the number of cell types and K is the number of L–R pairs, is generated as the merged matrices. In this work, this tensor is called the “CCI-tensor”.

After the construction of the CCI-tensor, we use the non-negative Tucker2 decomposition (NTD-2) algorithm [59, 60]. NTD-2 decomposes the CCI-tensor as a core tensor GRR1×R2×K, and two factor matrices A1RJ×R1 and A2RJ×R2, where R1 and R2 are the NTD-2 rank parameters. The factor matrix A1 describes the R1 of ligand gene expression patterns in each cell type and the factor matrix A2 describes R2 of receptor gene expression patterns in each cell type, and core tensor G describes the degree of association of all the combination (R1×R2) of the ligand and receptor expression patterns of each L–R pair.

The result of NTD-2 is considered the sum of some representative triadic relationships. In this work, each triadic relationship is termed CaHr1,r2, which refers to the outer product of three vectors, A:r11, A:r22, and Gr1,r2,:, where r1 (1r1R1) and r2 (1r2R2) are the indices of the columns of the two factor matrices (Fig. 1c). The CaHs are extracted in a data-driven way without the assumption of one-to-one CCIs. Therefore, this approach can also detect many-to-many CCIs according to the data complexity.

Evaluation of many-to-many CCIs detection

To examine the performance of the CCI methods in terms of detecting CaH, we validated the CCI methods using both simulated and empirical datasets (Fig. 2).

Fig. 2.

Fig. 2

Evaluation scheme. To evaluate CCI methods, 90 simulated datasets and six real empirical datasets were prepared. Four ligand–receptor (L–R) scoring methods and six binarization methods were then evaluated. For the evaluation of these methods, area under the receiver operating characteristic curve (AUCROC), area under the precision–recall curve (AUCPRC), memory usage, and computational time were determined. For the evaluation of binarization methods, F-measure, Matthews correlation coefficient (MCC), positive ratio (PR), false positive ratio (FPR), and false negative ratio (FNR) were determined

We first prepared 90 simulated datasets considering five numbers of cell types (3, 5, 10, 20, or 30), two CCI styles (one-to-one or many-to-many including one-to-many and many-to-one), three numbers of CCI types (1, 3, or 5), and three threshold values (E2, E5, or E10) for recognition of differentially expressed genes (DEGs). According to these conditions, ground truth CCIs were determined (Additional file 1).

Next, we prepared five real empirical datasets (FetalKidney [36], GermlineFemale [25], HeadandNeckCancer [54], Uterus [33], and VisualCortex [48]), each of which focused on many-to-many CCIs in their respective original papers.

There are many L–R scoring methods to quantify the degree of co-expression of ligand and receptor genes. We re-implemented four scores used in many CCI prediction methods to evaluate performance independent of software implementation (Table 1 and Additional file 1). Here, we selected as the four methods sum score (CellPhoneDB [37], Giotto [61], CrossTalkR [62], and Squidpy [63]), product score (NATME [64], FunRes [65], ICELLNET [66], and TraSig [67]) Halpern’s score [68], and Cabello - Aguilar’s score (SingleCellSignalR [69] and CellTalkDB [70]), each of which is widely used in many studies.

Table 1.

Correspondence between L–R scoring used in this study and previous tools

L–R scoring used in this study Previous tools
Sum score CellPhoneDB [37], Giotto [61], CrossTalkR [62], and Squidpy [63]
Product score NATME [64], FunRes [65], ICELLNET [66], and TraSig [67]
Halpern’s score Halpern et al.
Cabello - Aguilar’s score SingleCellSignalR [69] and CellTalkDB [70]

To differentiate significant CCIs from non-significant CCIs, many CCI methods introduce a label permutation test with a random permutation of cell-type labels to simulate the null distribution of CCIs. This process is considered a kind of binarization (1 for significant CCIs, 0 for non-significant CCIs). For scTensor, binarization was realized by median absolute deviation (MAD) thresholding against each column vector in factor matrices calculated by tensor decomposition.

To quantitatively evaluate how selectively each CCI method was able to detect the ground truth CCIs before and after binarization, nine evaluation measures were introduced. Four of them were applied both before and after binarization, and the remaining five were applied to the results only after binarization.

scTensor could selectively detect many-to-many CCIs in simulated datasets

The area under the curve of precision–recall (AUCPR) and Matthews correlation coefficient (MCC) values of 30 datasets with an E10 threshold value are shown in Fig. 3. For the details of all the evaluation results for all the conditions, see Additional files 312. Figure 3 shows that the AUCPR values can vary among the CCI methods. When the CCI-style was one-to-one (Fig. 3a, left), scTensor (NTD-2) achieved the highest AUCPR scores, and Halpern’s score obtained the second-highest AUCPR values on average. For Halpern’s score, however, binarization has significantly reduced the significant CCIs. This may be explained by Halpern’s score having the lowest FPR (Additional file 10) and the highest FNR (Additional file 11), and it suggests that Halpern’s score is a quite conservative method to detect one-to-one CCIs. When the CCI-style was set as many-to-many, both the previous and current versions of scTensor (NTD-3 and NTD-2, respectively) achieved higher AUCPR values on average (Additional file 4), compared with the other methods (Fig. 3a, right).

Fig. 3.

Fig. 3

Results of simulated datasets. a Area under the curve of precision–recall (AUCPR) of all the methods. b Matthews correlation coefficient (MCC) of the binarization methods

Figure 3b shows that the MCC values also varied among the CCI methods. When the CCI-style was one-to-one (Fig. 3b, left), Halpern’s score achieved the highest MCC values and scTensor (NTD-2) obtained the second-highest values on average (Additional file 8). When the CCI-style was many-to-many, scTensor (NTD-2) and sum score obtained the highest MCC values, compared with the other methods (Fig. 3b, right).

Characteristics of scTensor (NTD-2), Halpern’s score, and sum score

The comprehensive validation described that the three methods (scTensor (NTD-2), Halpern’s score, and sum score) performed better than the others under certain conditions. To further examine the characteristics and trends of each method, we aggregated the number of CCIs detected in three datasets in which each of the three methods excelled (Fig. 4).

Fig. 4.

Fig. 4

Analyses of three datasets in which each of the three methods excelled. Summary of the number of significant cell–cell interactions (CCIs) with a three cell types, one CCI types, one-to-one CCI style, and 1st-CCI type; b 20 cell types, five CCI types, many-to-many CCI style, 2nd-CCI type; and c 30 cell types, five CCI types, many-to-many CCI style, 5th-CCI type. The y-axis (L) and x-axis (R) indicate the ligand-expressing cell types and the receptor-expressing cell types, respectively. FN and FP indicate false negative and false positive CCIs, respectively

scTensor (NTD-2): this method performed well when the CCI style was many-to-many. For example, in Fig. 4a, most many-to-many CCIs could be detected. Although there were some false negative (FN) CCIs that were not detected, there were fewer false positive (FP) CCIs. In contrast, Halpern’s score was too conservative against this dataset and failed to detect most of the CCIs by the label permutation test. At a first glance of Fig. 4a, the sum score appears to work well with these data, but under scrutiny at the level of individual L–R pairs, sum score results contain many FP and FN CCIs (Additional file 13).

Sum score: This method performed well when the number of cell types was small and the style of CCI was restricted to one-to-one (Fig. 4b). Even though Halpern’s score and scTensor (NTD-2) were able to detect similar CCIs, Halpern’s score was quite conservative and contained many FN CCIs because it considered many CCIs to not be significant. For sum score, there seemed to be a bias toward FP CCIs. If the degree of co-expression of an L–R pair is high between two cell types, this method seems to detect FP pairs in which only one of the L–R is highly expressed. In such cases, cross-shaped patterns were observed in the heatmap in Fig. 4. In our simulated datasets, this cross-shaped pattern of FP CCIs were observed more frequently in the sum score.

Halpern’s score: In most data sets, Halpern’s score was found to be too conservative, with many FN CCIs, but in a very specific situation, that is, when the CCI-style was one-to-all (or all-to-one), it outperformed the other methods (Fig. 4c). In contrast, scTensor (NTD-2) inferred many FN CCIs among these data, while the sum score identified many FP CCIs (Additional file 13).

scTensor could selectively detect many-to-many CCIs in real datasets

Next, we applied these CCI methods to real empirical datasets (Table 2 and Additional files 312). As expected from the results of simulated datasets, scTensor (NTD-2) outperformed the other methods on these real datasets, which contain many-to-many CCIs. Regarding AUCPR (Fig. 5a) and MCC (Fig. 5b) values, scTensor (NTD-2) achieved higher values compared with the other methods, although the difficulty of detecting the CCIs was highly dependent on the dataset (Additional files 4, 8). We further investigated the real empirical datasets and found that the known CCIs reported by the original papers were reproduced by scTensor (Table 3). Additionally, some predicted many-to-many CCIs can be considered biologically plausible because the CCIs are related to the same signaling pathways of known CCIs, although the original papers did not refer to the CCIs. These results can be interactively investigated using the HTML report generated by scTensor (Additional files 1317).

Table 2.

Empirical datasets subjected to cell–cell interaction (CCI) identification

Name Organisms GEO ID # Genes # Cells # Cell types CCI-tensor size
FetalKidney [36] Homo sapiens GSE109205 3204 4131 11 11 × 11 × 1072
GermlineFemale [25] Homo sapiens GSE86146 2717 992 8 8 × 8 × 1622
HeadandNeckCancer [54] Homo sapiens GSE103322 5244 5577 26 26 × 26 × 61
Uterus [33] Mus musculus GSE118180 2566 6443 12 12 × 12 × 2282
VisualCortex [48] Mus musculus GSE102827 5365 47,209 30 30 × 30 × 274
Fig. 5.

Fig. 5

Cell–cell interaction (CCI) identification results from real empirical datasets. a Area under the curve of precision–recall (AUCPR) of all the methods. b Matthews correlation coefficient (MCC) of the binarization methods

Table 3.

Many-to-many cell–cell interactions (CCIs) detected by only scTensor

Dataset Signal pathway Ligand and receptor Reported
FetalKidney [36] Eph/ephrin EFNB2–EPHB3 Yes
FetalKidney [36] Eph/ephrin EFNB2–EPHB4 No
FetalKidney [36] Eph/ephrin EFNB2–EPHB6 No
FetalKidney [36] Eph/ephrin EFNA1–EPHA4/7 No
FetalKidney [36] Eph/ephrin EFNA5–EPHA4/7 No
GermlineFemale [25] Bone morphogenetic protein (BMP) BMP2–BMPR2 Yes
GermlineFemale [25] Bone morphogenetic protein (BMP) GDF9–BMPR2 No
GermlineFemale [25] NOTCH JAG1–NOTCH2 Yes
GermlineFemale [25] NOTCH JAG1–NOTCH3 No
HeadandNeckCancer [54] Fibroblast growth factor (FGF) FGF7–FGFR2 Yes
Uterus [33] Insulin-like growth factor (IGF) Igf1–Igf1r Yes
Uterus [33] Insulin-like growth factor (IGF) Igf2–Igf1r No
Uterus [33] Pleiotrophin (PTN) Ptn–Ptprb Yes
Uterus [33] Wingless-related integration site (Wnt) Rspo3–Lgr5 Yes
Uterus [33] Vascular endothelial growth factor (VEGF) Vegfa–Kdr Yes
Uterus [33] Vascular endothelial growth factor (VEGF) Vegfa–Flt1/Nrp1 No
VisualCortex [48] Activin receptor (ACVR) Inhba–Acvr1b Yes
VisualCortex [48] Activin receptor (ACVR) Nodal–Acvr1b No
VisualCortex [48] Activin receptor (ACVR) Nodal/Bmp5/Bmp7–Acvr2a No

Computational complexity and memory usage

We also assessed the orders of computational complexity and memory usage of all the CCI methods (Table 4). All the L–R score methods require O(N2L) order both in the computation and in memory usage, where N is the number of cell types and L is the number of L–R pairs. The label permutation tests combining any L–R scores require O(N2LP) in computation, where P is the number of random shuffles of cell-type labels. In many cases, P is typically set as a large value greater than 1000 [37, 69], making this a very time-consuming calculation.

Table 4.

Order of calculation time and memory space for cell–cell interaction (CCI) identification

Method Calculation time Memory space
Sum/product score O(N2L) O(N2L)
Permutation test of sum/product score O(N2LP) O(N2L)
Halpern’s score O(N2L) O(N2L)
Permutation test of Halpern’s score O(N2LP) O(N2L)
Cabello - Aguilar’s score O(N2L) O(N2L)
Permutation test of Cabello - Aguilar’s score O(N2LP) O(N2L)
Previous scTensor (NTD-3) O(N2L(R1+R2+R3)) O(N2L)
scTensor (NTD-2) O(N2L(R1+R2)) O(N2L)

In contrast, scTensor (NTD-2) does not perform the label permutation; instead, it simply utilizes the factor matrices after the decomposition of the CCI-tensor. Hence, the order of computational complexity is reduced to O(N2L(R1+R2)), where R1 and R2 are the number of columns or “rank” parameters for the first- and second-factor matrices, respectively. The rank parameters are typically set as small numbers (e.g., 10), this leads to a substantial computational advantage compared with the label permutation test. The computation time and memory usage when analyzing the simulated and real empirical datasets show that scTensor has an advantage in computational complexity compared with the label permutation test (Additional files 5, 6).

Method comparisons

The first method similar to scTensor is CellChat. This method uses the communication probabilities (3rd-order tensor), which is a CCI tensor constructed with the authors’ original score and is normalized so that the sum in the second mode is 1. In addition to the label permutation test on each L–R pair in the third-order tensor, NMF on the matrix data summarized by the second mode of the 3rd-order tensor is performed to detect global CCI patterns. However, this summarization reduces the order of tensor (i.e., 3rd to 2nd) and loses information on which L–R pairs contributed to the CCI. In particular, as this study has shown, the label permutation test tends to detect one-to-one CCI, whereas NMF may also detect many-vs-many CCI, making it difficult to consider many-vs-many CCI and the L–R pairs that contribute to it simultaneously, even when the two methods are combined. Therefore, models like scTensor that can handle higher-order data as is are preferable.

Inspired by our method, another method Tensor-cell2cell [71] extended our approach to higher-order CCI tensors (e.g., 4th-order tensors) to consider CCIs and the CCI contexts (e.g., disease state, organismal life stage, and tissue microenvironment) simultaneously. Other than its effectiveness for such a higher-order CCI tensor, the main differences between Tensor-cell2cell and scTensor may include the following. First, Tensor-cell2cell is implemented by Python but scTensor is implemented by R. Python offers a wide range of machine learning/deep learning packages, while R offers data preprocessing and visualization with TidyVerse and bioinformatics-related packages with Bioconductor.

Second, there are differences in tensor decomposition models; Tensor-cell2cell performs CANDECOMP/PARAFAC-type non-negative tensor decomposition but scTensor’s model is NTD-2. The difference between these models is the number of rank parameters. Tensor-cell2cell has only one rank parameter, while scTensor has rank parameters for the number of tensor orders (i.e. 3). This difference can be an advantage or a disadvantage; a small number of ranks reduces the computational time required to estimate the optimal ranks but it might make the model too simple.

Third, there are differences in the related tools. Tensor-cell2cell assumes their text file for input, it only supports major species such as mouse and human, and it seems to assume to be used with LIANA [72, 73], another CCI tool of the authors. On the other hand, scTensor supports 124 species (September 5, 2024) in the Bioconductor package LRBase, and can be combined with various other single-cell packages via the SingleCellExperiment object and Seurat (see Implementations and Fig. 6). With an understanding of these differences, users should choose the tool they want to use according to what they want to do.

Fig. 6.

Fig. 6

Implementation of the scTensor package. a scTensor is an R package that requires the input of both an scRNA-Seq expression matrix (SingleCellExperiment or Seurat) and a ligand–receptor (L–R) database (LRBase). The LRBase is retrieved from the AnnotationHub remote server, after which a LRBase object is created. b Using these objects, scTensor generates an HTML report file, and the results of cell–cell interaction (CCI) analysis can be visualized with a wide variety of plots

Implementations

scTensor is implemented as an R/Bioconductor package that is freely available. Both a scRNA-seq dataset and L–R database are required for scTensor execution. The default format for a scRNA-seq dataset is SingleCellExperiment, in which the gene IDs correspond to NCBI’s Gene database to allow links with other databases (Fig. 6a). A scRNA-seq dataset can also be converted from a Seurat object. We provided instructions for this data conversion (https://bioconductor.org/packages/release/bioc/vignettes/scTensor/inst/doc/scTensor_1_Data_format_ID_Conversion.html#case-iii-umi-count).

LRBase, which is the L–R database for scTensor, is stored on a remote server called AnnotationHub and is downloaded to the user’s machine on demand, only when called by the user (Fig. 6a). To extend out method to a wide range of organisms, in this work, we originally constructed and are providing the L–R lists for 125 organisms (https://github.com/rikenbit/lrbase-workflow/blob/master/sample_sheet/sample_sheet.csv). The details of the data processing pipeline are summarized in the README.md of lrbase-workflow (https://github.com/rikenbit/lrbase-workflow), which is a workflow for constructing the LRBase for each of the species. For data sustainability, we offer the data files, including older versions, on the AnnotationHub server. The data files are bi-annually updated in conjunction with Bioconductor updates and are provided using lrbase-workflow. Users can specify which version of the data is used for analysis, thus ensuring data reproducibility.

NTD-2 was implemented as within the function of nnTensor [74] R/CRAN package and internally imported into scTensor. scTensor constructs the CCI-tensor, decomposes the tensor by the NTD-2 algorithm, and generates an HTML report.

To enhance the biological interpretation of CaH results, we implemented some visualization functions (Fig. 6a) and these plots can be interactively investigated via web browser. A wide variety of gene-wise information is included in the report and can be linked to the L–R lists through the use of other R/Bioconductor packages; the gene annotation is assigned by biomaRt [75] (Gene Name, Description, Gene Ontology [GO], STRING, and UniProtKB), reactome.db [76] (Reactome) and MeSH.XXX.eg.db [77] (Medical Subject Headings [MeSH]), while the enrichment analysis (also known as over-representative analysis [ORA]) is performed by GOstats [78] (GO-ORA), meshr [77] (MeSH-ORA), ReactomePA [79] (Reactome-ORA), and DOSE [80] (Disease Ontology (DO)-ORA, Network of Cancer Genes (NCG)-ORA, DisGeNET-ORA).

To validate that the detected the co-expression of L–R gene pairs is also consistently detected in the other data including tissue- or cell-type-level transcriptome data, the hyperlinks to RefEx [81], Expression Atlas [82], SingleCell Expression Atlas [83], scRNASeqDB [84], and PanglaoDB [85] are embedded in the HTML report, facilitating comparisons of the L–R expression results with the data from large-scale genomics projects such as GTEx [86], FANTOM5 [87], the NIH Epigenomics Roadmap [88], ENCODE [89], and the Human Protein Atlas [90]. Additionally, in consideration of users who might want to experimentally investigate detected CCIs, we embedded hyperlinks to Connectivity Map (CMap [91]), which provides relationships between perturbations by the addition of particular chemical compounds/genetic reagents and the resulting gene expression change.

Discussion

In this work, we regarded CCIs as CaHs, which represent the triadic relationships of ligand-expressing cell types, receptor-expressing cell types, and the related L–R pairs. We implemented a novel algorithm scTensor based on a tensor decomposition algorithm for detecting such CaHs. Our evaluations using both simulated and real empirical datasets suggest that scTensor can detect many-to-many CCIs more accurately than the other conventional CCI methods. Additionally, the calculation time and memory usage performances of scTensor are also superior to those of the other CCI methods.

To extend the use of scTensor to a wide range of organisms, we also created multiple L–R datasets for 125 organisms. scTensor has been published as an R/Bioconductor package, facilitating the reproducibility of data analysis and the maintainability of datasets. We also implemented an HTML report function that simplifies checking the analysis results of scTensor. Like many CCI tools, scTensor can import an external L–R database.

In the development of many CCI tools, the authors also develop their own L–R databases and investigate the differences among various L–R databases, particulaly when comparing their method with other conventional methods [72]. This makes it difficult to distinguish whether the performance of a method is caused by differences in algorithms or databases. Although the primary CCI resources used for existing L–R tools are highly duplicated, even slight differences can influence the detection of CCIs [72]. Therefore, to separate these two comparisons and to focus only on the algorithmic differences, in this work, we compared several existing CCI algorithms, by re-implementing them and anchoring them to a common L–R database.

We were also able to examine several strengths and weaknesses of the methods other than scTensor. For example, Halpern’s score was found to be too conservative with many FN CCIs, but it was superior to the other methods with respect to the detection of one-to-all (or all-to-one) CCIs. A possible reason for this is that since the formula for this score includes the square root of the chi-square distribution with two degree of freedom (or an exponential distribution with an expected value of 2), and these distributions are known to be heavy-tailed to some extent, thus potentially inflating the number of significant L–R pairs.

The permutation test implicitly assumes that the interactions occur between very few cell types because the larger the observed L–R score is than the empirical distribution computed by label permutation, the more significant the test result is. However, if the expression levels of ligand and receptor genes are high in any cell of any cell type, the L–R scores calculated by the label permutation are will also be high, and thus, the observed value of the L–R score will be regarded as not a particularly high value in the empirical distribution; consequently, such a test result will be not significant. Therefore, detection of many-to-many CCIs by label permutation test is difficult in principle. In the extreme case of all-to-all CCIs, the current approaches (although it also includes scTensor) cannot avoid FN CCIs.

There are still some plans to improve scTensor to build on the advantages of this current framework. For example, the algorithm can be improved by utilizing acceleration techniques such as randomized algorithm/sketching methods [92], incremental algorithm/stochastic optimization [93, 94], or distributed computing on large-scale memory machines [95] for tensor decomposition, as is now available.

To reduce the memory usage of scTensor, we are developing DelayedTensor [96], which is an R/Bioconductor package to perform various tensor arithmetic and tensor decomposition algorithms based on DelayedArray [97], another R/Bioconductor package for handling out-of-core multidimensional arrays in R. We intend to reduce the memory usage of scTensor by supporting this data format.

Tensor data formats are very flexible ways to represent heterogeneous biological data structures [98], because they easily integrate supplemental information about genes or cell types in a semi-supervised manner. Such information could extend the scope of the data and thus improve the accuracy of inferences. For example, there are some attempts to use the following additional information for CCI detection as well (for more details, see Additional file 1).

  • CCI inference via receptor-receptor and extracellular matrix data [51, 99].

  • Consideration of multi-subunit complexes [37].

  • Comparison of CCIs across multiple conditions [62, 71, 100106].

  • Consideration of downstream transcriptional factors, target genes, and signaling pathways [69, 107109].

  • Integration with bulk RNA-Seq or other type of omics datasets [37, 64, 65, 107110].

  • Integration with pseudo-time [67, 111, 112].

  • Integration with spatial transcriptome data [113].

In particular, in a recent benchmark study [113], the proximity of spatial coordinates on tissue sections measured by spatial transcriptome technology and the CCI detected by L–R data were correlated, and some studies have attempted to integrate these two kinds of datasets a single model ([113] and Additional file 1). Auxiliary Information such as the proximity in spatial coordinates can be incorporated as a regularization term to extend the tensor decomposition model [114116].

Although it is beyond the scope of the present paper to cover all of the above-mentioned topics, considering these in the framework of tensor decomposition is a promising research direction, so we aim to continuously work on these through the development of updates and releases of scTensor for Bioconductor.

Conclusion

In this work, we present and evaluate scTensor, a new method for detecting CCIs based on L–R co-expression in scRNA-seq datasets. We also revealed that the widely used label permutation test has a bias that impedes the detection of many-to-many CCIs and demonstrated that the proposed method is a viable alternative.

Materials and methods

Simulated datasets

The simulated single-cell gene expression data were sampled from the negative binomial distribution NBfgcmg,ϕg, where fgc is the fold-change (FC) for gene g and cell type c, and mg and ϕg are the average gene expression and the dispersion parameter of the expression of gene g, respectively.

The mg value and gene-wise variance vg were calculated from a real scRNA-seq dataset of mouse embryonic stem cells (mESCs) measured by Quartz-Seq [117], and the gene-wise dispersion parameter ϕg was estimated as ϕg=vg-mg/mg2.

For the determination of differentially expressed genes (DEGs) and non-DEGs, fgc values were calculated based on the non-linear relationship of FC and the gene expression level log10fgc=aexp(-blog10mg+1). To estimate the parameters a and b, we detected the DEGs using edgeR. By setting the threshold values (i.e., false discovery rate) of edgeR as 10-2 (E2), 10-5 (E5), and 10-10 (E10) and using the resulting DEGs, a and b values for each threshold were estimated as (0.701, 0.363), (1.907, 0.666), and (4.429, 0.814), respectively.

For genes identified as DEGs based on a threshold according to the non-linear relationship above, the estimated fgc value was used, otherwise, 1 is specified as fgc. If a ligand gene of a cell type and a receptor gene of a cell type were both DEGs, we defined the relationship between these cell types as the ground truth CCIs and used them for quantitative evaluation.

To simulate the “dropout” phenomenon of scRNA-seq experiments, we also introduced the dropout probability pdropoutgc=exp(-cfgcmg2), which is used in ZIFA [118] (default, c=1), and the expression values were randomly converted to 0 according to the dropout probability.

To simulate various situations, we set many different CCI tensors, considering the number of cell types (3, 5, 10, 20, or 30), the style of CCIs (one-to-one or many-to-many including one-to-many and many-to-one), the number of types of CCIs (1, 3, or 5), and the DEG threshold value (E2, E5, or E10); in total, 90 synthetic CCI tensors were generated.

scRNA-seq real datasets

The gene expression matrix of human FetalKidney data was retrieved from the GEO database (GSE109205), and only highly variable genes (HVGs: http://pklab.med.harvard.edu/scw2014/subpop_tutorial.html) with low P values ( 1E−1) were extracted. The cell-type label data were provided by the authors upon our request.

The gene expression matrix of human GermlineFemale data was retrieved from the GEO database (GSE86146), and only HVGs with low P values ( 1E−7) were extracted.

The gene expression matrix and the cell-type labels of human HeadandNeckCancer data were retrieved from the GEO database (GSE103322), and only HVGs with low P values ( 1E−1) were extracted.

The gene expression matrix of mouse Uterus data was retrieved from the GEO database (GSE118180), and only HVGs with low P values ( 1E−1) were extracted. The cell-type labels were provided by the authors upon our request.

The gene expression matrix and the cell-type labels of mouse VisualCortex data were retrieved from the GEO database (GSE102827), and only HVGs with low P values ( 1E−1) were extracted.

The gene expression values of each cell are normalized by CPMED [5658] and logarithm transformation, for variance-stabilization, is performed to the data matrix. For analyzing these real datasets, known L–R pairs in DLRP [119], IUPHAR [120], and HPMR [121] were searched in the data matrix. We defined the ground truth CCIs between two cell types if the CCIs were reported by the original studies. The L–R pairs associated with the CCIs were used for quantitative evaluation.

scTensor algorithm

CCI-tensor construction

Here, data matrix YRI×H is the gene expression matrix of scRNA-seq data, where I is the number of genes and H is the number of cells. Matrix Y is converted into cell-type-wise average matrix XRI×J, where J is the number of cell types. The cell-type labels are assumed to be specified by the user’s prior analysis, such as clustering or confirmation of marker gene expression. The relationship between the X and Y is described below:

X=YA, 1

where the matrix ARH×J converts cellular-level matrix Y to cell-type-level matrix X and each element of A is

Ahj=1/njhthcellbelongstojthcelltype0otherwise, 2

where nj is the number of cells belonging to the j’s cell type.

Next, we search to determine whether any L–R pairs stored in the L–R database are both in the row names of matrix X, and if both IDs are found, corresponding J-length row-vectors of the ligand and receptor genes (xL and xR) are extracted.

Finally, a J×J matrix is calculated as the outer product of xL and xR and incrementally stored. The stacked J×J matrices can be considered as a three-dimensional array, which is also known as a three-order tensor. The following outer product in the kth L–R pair (Lk and Rk) found is stored as the frontal slice (sub-tensor) of the CCI-tensor χRJ×J×K:

χ::k=xLkxRk 3

Non-negative Tucker3 decomposition (NTD-3)

To extract the CaHs from the CCI-tensor χRJ×J×K, we utilize NTD-3 and NTD-2, which are generalizations of non-negative matrix factorization (NMF) to tensor data [59, 60]. The NMF approximates a non-negative matrix data as the product of two lower rank non-negative matrices (also known as factor matrices). Similar to NMF, NTD-3 and NTD-2 approximate a non-negative tensor data as the product of some factor matrices and a core tensor.

To extend NMF to NTD-3, we consider iterative updating A(n)G(n)A(-n)T=A(n)GA(n)n=1,2,3, which is the matricized expression of tensor decomposition. Here A(-n) is Kronecker product of the factor matrices without A(n) and G(n) is the mode-n matricization of the core tensor G. For example, if n=1, these become A(2)A(3) and G(n), respectively. By replacing X in the multiplicative update rule [60] with A(n)G(n)A(-n)Tn=1,2,3, we can obtain the update rule for A(n) as follows;

A(1)A(1)X(1)GA(1)TA(1)GA(1)GA(1)TA(2)A(2)X(2)GA(2)TA(2)GA(2)GA(2)TA(3)A(3)X(3)GA(3)TA(3)GA(3)GA(3)T. 4

Similarly, the updating rule for core tensor G is:

Gmax{χ×1A(1)T×2A(2)T×3A(3)T,ϵ}Gχ×1A(1)T×2A(2)T×3A(3)TG×1A(1)TA(1)×2A(2)TA(2)×3A(3)TA(3), 5

where ϵ is a small value included to avoid generating negative values (default value 1E−10).

Non-negative Tucker2 decomposition (NTD-2)

The NTD-3 has three rank parameters to be estimated, and it requires huge search space (R1×R2×R3). Additionally, the fewer the factor matrices, the more interpretable the results are. For these reasons, we further expanded the NTD-3 into a model called the NTD-2 [60] since v1.4.0 of scTensor.

In NTD-2, the third factor matrix A(3), which is related to L–R pairs, is replaced by an identity matrix IK, where the K diagonal elements are all 1 and the iteration step of A(3) is skipped as follows:

A(1)A(1)X(1)GA(1)TA(1)GA(1)GA(1)TA(2)A(2)X(2)GA(2)TA(2)GA(2)GA(2)T. 6

Here, GA(1)=G1[A(2)IK] and GA(2)=G2[A(1)IK].

The updating rule for core tensor G is

Gmax{χ×1A(1)T×2A(2)T,ϵ}Gχ×1A(1)T×2A(2)TG×1A(1)TA(1)×2A(2)TA(2). 7

Rank estimation of NTD-2

To extract the CaHs, scTensor estimates the NTD-2 ranks for each matricized CCI-tensor (X(n), n=1 or 2). To be able to focus only on the dimensions that are informative and are not noisy, we used an ad hoc approach for NTD-2 rank estimation.

Because NMF is performed in each matricized CCI-tensor in scTensor, we estimated each rank of NMF based on the residual sum of squares (RSS) [122] as

RSSmax-RSSkRSSmax-RSSmin>thrrank, 8

where RSSmax is the RSS by full rank NMF, RSSmin is the RSS by rank-1 NMF, RSSk is the RSS by rank-k NMF, and thrrank is the threshold value, ranging 0 to 1 (the default value is 0.8). RSS by rank-k NMF is calculated between a data matrix X and the reconstructed matrix from W and H calculated by multiplicative updating rule [60] as follows:

RSSk=X(n)-WkHkF2. 9

RSS by full-rank and rank-1 NMF is calculated by setting k as J and 1, respectively. With the estimated ranks (R1^,R2^), NTD-2 was performed, and only the pairs (r1,r2) with large core tensor values are selected as CaHs. In its default mode, scTensor selects CaHs that explain the top 20 pairs sorted by the core tensor values.

Binarization

To binarize each column vector of the factor matrices obtained by NTD-2, median absolute deviation (MAD), which is the median version of standard deviation (SD), was applied. Because we are only interested in the outliers of the elements of each vector in the positive direction, not the negative one, we focused only on the elements that deviate from the median in the positive direction as follows:

xi=1ximedianx+thrbin×MADx0otherwise. 10

Here, MADx is medianx-medianx and thrbin is the threshold value (the default value is 1.0).

L–R scoring

Several methods have been proposed to score the degree of co-expression of a given L–R pair between two cell types, as described below.

Sum score

The gene expression of a ligand gene l can be averaged over cells belonging to the sth cell type within J cell types as follows:

xlCs=1NCscCsxlc. 11

Here, CsC1,C2,,CJ and NCs is the number of cells belonging to cell type Cs.

Likewise, the gene expression of a receptor gene r is averaged over cells belonging to the tth cell type within J cell types as follows:

xrCt=1NCtcCtxrc. 12

Here, CtC1,C2,,CJ and NCt is the number of cells belonging to cell type Ct.

Using these values, the sum score is calculated as follows:

Scoresuml,Cs,r,Ct=xlCs+xrCt. 13

For example, some methods such as CellPhoneDB [37], Giotto [61], CrossTalkR [62], and Squidpy [63], essentially use this type of scoring (Table 1 and Additional file 1).

Product score

In some studies, the degree of co-expression is expressed as a product instead of a summation.

Scoreprodl,Cs,r,Ct=xlCs×xrCt 14

For example, some methods such as NATME [64], FunRes [65], ICELLNET [66], and TraSig [67], essentially use this type of scoring (Table 1 and Additional file 1).

Halpern’s score

Derived from the sum score, Halpern et al. proposed a score described below (Table 1 and Additional files 1).

In this score, Z-scaling is firstly applied to both xlCs and xrCt over J cell types as follows:

ZlCs=xlCs-meanxLstdxL 15
ZrCt=xrCt-meanxRstdxR. 16

Here, xL=xlC1,xlC2,,xlCJ and xR=xrC1,xrC2,,xrCJ. Then, the square root of the sum of squares of these values is used as the degree of co-expression as follows:

ScoreHalpernl,Cs,r,Ct=ZlCs2+ZrCt2. 17

Cabello-Aguilar’s score

Derived from the product score, Cabello - Aguilar et al. proposed a score described as follows (Table 1 and Additional file 1):

ScoreCabello-Aguilarl,Cs,r,Ct=Scoreprodμ+Scoreprod. 18

Here, μ is the averaged value of the normalized read count matrix and is added as a scaling factor to avoid division by zero. This score is used in SingleCellSignalR [69] and CellTalkDB [70].

Label permutation method

To quantify the deviation of the observed scores obtained from real data, many studies employ P values in a statistical hypothesis testing framework. Typically, the label permutation method is widely used to calculate P values. In principle, this method can be used in combination with any L–R score as described above.

Here, we consider assigning a P value to any type of Scorel,Cs,r,Ct above. In this method, the cluster labels of all the cells are randomly shuffled, and a synthetic score value is calculated. Performing this process 1000 times generates 1000 of the values. These values are used to generate the null distribution; for a combination of cell types, the proportion of the means which are “as or more extreme” than the observed mean is calculated as the P value. The label permutation test is performed as a one-tailed test; there is a focus on L–R scores with significantly higher values compared to the null distribution, and not on L–R scores with significantly lower values. Because separating significant CCIs from non-significant CCIs by hypothesis testing can be regarded as a binarization process, label permutation results were compared with the results of scTensor binarization.

Quantitative evaluation of CCIs

The CCIs detected by the various methods tested in this paper were compared with ground truth CCIs to quantitatively evaluate the performance of each method. To evaluate the results, we used the metrics below.

Evaluation of the scoring before and after binarization

Each CCI method uses each corresponding L–R score to quantify the degree of co-expression of a given L–R pair between two cell types. To quantitatively evaluate the performance of each score, area under the curve of receiver operating characteristic (AUCROC) and area under the curve of precision–recall (AUCPR) were used.

A receiver operating characteristic (ROC) curve is a plot of the true positive rate (TPR, or the sensitivity = TPTP+FN) versus the false positive rate (FPR, or 1 - specificity, where specificity = TNTN+FP) (where TP is the number of true positive CCIs, FP is the number of false positive CCIs, TN is the number of true negative CCIs, and FN is the number of false negative CCIs). The AUCROC value is the area under the ROC curve. AUCROC values range from 0 to 1, and the closer the value is to 1, the more the score indicates enrichment of the ground truch CCIs among the inferred CCIs.

A precision–recall curve is a plot of recall (i.e., sensitivity) versus precision (i.e., positive predictive value = TPTP+FP). The AUCPR value is the value of the area under the precision–recall curve. AUCPR ranges from 0 to 1, and the closer the value is to 1, the more the score indicates enrichment of the ground truch CCIs among the inferred CCIs. AUCPR is known for its robustness against class imbalance, compared with AUCROC [123125]. Hence, it seems that AUCPR is more appropriate than AUCROC because the number of significant CCIs are assumed to be less than that of non-significant CCIs in both simulated and real empirical data.

To evaluate whether the binarization was properly performed, we also applied these metrics to assess label permutation. As the label permutation test outputs P values, we utilized 1 − P value to quantify the degree of co-expression of elements of L–R pairs in the test. Because tensor decomposition is an unsupervised learning methods, we cannot distinguish which CaHs are enriched within the ground truth CCIs in advance. Additionally, we expected that scTensor could separate different styles of CCIs as multiple CaHs. Hence, we used the combination of CaHs from scTensor and ground truth CCIs with the maximum metrics values.

The calculation time and memory usage were evaluated by using the benchmark rules of Snakemake (https://snakemake.readthedocs.io/en/latest/snakefiles/rules.html?highlight=benchmark#benchmark-rules).

Evaluation of the scoring after binarization

Each CCI method uses a threshold value (e.g., P value, or MAD for scTensor) to differentiate significant CCIs from non-significant CCIs. This process is considered a kind of binarization (1 for significant CCIs, 0 for non-significant CCIs), so we evaluated how well each thresholding strategy could selectively detect the ground truch CCI by comparing the metrics below.

F-measure is the harmonic mean of precision and recall and is defined as follows:

F-measure=2precision×recallprecision+recall. 19

Matthews Correlation Coefficient (MCC) is a special case of Pearson correlation coefficient when two variables are both binary vectors. MCC is defined as follows:

MCC=TP×TN-FP×FNTP+FPTP+FNTN+FPTN+FN. 20

MCC is widely used for binary classification evaluation and especially known for its robustness against the class imbalance, compared with the other metrics such as accuracy, balanced accuracy, bookmaker informedness, markedness, and F-measure [126128]. Hence, it seems that MCC is more appropriate to use than F-measure because the number of significant CCIs are assumed to be less than that of non-significant CCIs in both simulated and real empirical data.

To distinguish whether the F-measure and MCC values correspond to the number of detected CCIs or their selectivity in focusing only the ground truth CCIs, we also compared the positive rate (PR), false positive rate (FPR), and false negative rate (FNR) values of all the methods.

Availability and requirements

R packages

Snakemake workflows

Supplementary Information

12859_2023_5490_MOESM1_ESM.xlsx (15.7KB, xlsx)

Additional file 1. List of existing L–R scoring and CCI detection methods.

12859_2023_5490_MOESM2_ESM.pdf (1.9MB, pdf)

Additional file 2. Ground truth CCIs in simulated datasets.

12859_2023_5490_MOESM3_ESM.pdf (3.6MB, pdf)

Additional file 3. AUCROC values of all methods.

12859_2023_5490_MOESM4_ESM.pdf (3.6MB, pdf)

Additional file 4. AUCPR values of all methods.

12859_2023_5490_MOESM5_ESM.pdf (3.5MB, pdf)

Additional file 5. Memory values of all methods.

12859_2023_5490_MOESM6_ESM.pdf (2.6MB, pdf)

Additional file 6. Computational time values of all methods.

12859_2023_5490_MOESM7_ESM.pdf (2.5MB, pdf)

Additional file 7. F-measure values of all binarization methods.

12859_2023_5490_MOESM8_ESM.pdf (2.4MB, pdf)

Additional file 8. MCC values of all binarization methods.

12859_2023_5490_MOESM9_ESM.pdf (2.3MB, pdf)

Additional file 9. PR values of all binarization methods.

12859_2023_5490_MOESM10_ESM.pdf (2.4MB, pdf)

Additional file 10. FPR values of all binarization methods.

12859_2023_5490_MOESM11_ESM.pdf (2.5MB, pdf)

Additional file 11. FNR values of all binarization methods.

12859_2023_5490_MOESM13_ESM.docx (224.1KB, docx)

Additional file 13. Three L-R pairs in which each of the three methods excelled.

12859_2023_5490_MOESM14_ESM.zip (130.6MB, zip)

Additional file 14. HTML report of FetalKidney.

12859_2023_5490_MOESM15_ESM.zip (165.5MB, zip)

Additional file 15. HTML report of GermlineFemale.

12859_2023_5490_MOESM16_ESM.zip (180.8MB, zip)

Additional file 16. HTML report of HeadandNeckCancer.

12859_2023_5490_MOESM17_ESM.zip (232.5MB, zip)

Additional file 17. HTML report of Uterus.

12859_2023_5490_MOESM18_ESM.zip (304.2MB, zip)

Additional file 18. HTML report of VisualCortex.

Acknowledgements

Some cell images used in figures are presented by © 2016 DBCLS TogoTV. We thank Mr. Akihiro Matsushima for their assistance with the IT infrastructure for the data analysis. We are also grateful to all members of the Laboratory for Bioinformatics Research, RIKEN Center for Biosystems Dynamics Research for their helpful advice.

Abbreviations

CCIs

Cell–cell interactions

scRNA-seq

Single-cell RNA sequencing

L–R

Ligand and receptor

CaH

CCI as hypergraph

CMPED

Count per median of library size

NTD-3

Non-negative Tucker3 decomposition

NTD-2

Non-negative Tucker2 decomposition

AUCPR

Area under the curve of precision–recall

FPR

False positive rate

FNR

False negative rate

MCC

Matthews correlation coefficient

FP

False positive

FN

False negative

GO

Gene ontology

MeSH

Medical subject headings

ORA

Over-representative analysis

DO

Disease ontology

NCG

Network of cancer genes

CMap

Connectivity map

FC

Fold-change

mESCs

Mouse embryonic stem cells

DEGs

Differentially expressed genes

HVGs

Highly variable genes

NMF

Non-negative matrix factorization

RSS

Residual sum of squares

MAD

Median absolute deviation

SD

Standard deviation

AUCROC

Area under the curve

TPR

True positive rate

TP

True positive

TN

True negative

PR

Positive rate

Author contributions

KT and IN designed the study. KT designed the algorithm and benchmark test. KT retrieved and preprocessed the test data to evaluate the proposed method, implemented the source code, and performed all analyses. MI implemented the pipeline for bi-annual automatic updates of the R/Bioconductor packages. All authors have read and approved the manuscript.

Funding

This work was supported by MEXT KAKENHI Grant Number 16K16152 to KT. This work was partially supported by JST CREST Grant Numbers JPMJCR16G3 and JPMJCR1926 to IN.

Declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interest

The authors declare that they have no competing interests.

Footnotes

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Koki Tsuyuzaki, Email: koki.tsuyuzaki@gmail.com.

Itoshi Nikaido, Email: itoshi.nikaido@riken.jp.

References

  • 1.Yu Y, Elble RC. Homeostatic signaling by cell-cell junctions and its dysregulation during cancer progression. J Clin Med. 2016;5(2):26. doi: 10.3390/jcm5020026. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Livshits G, Kobielak A, Fuchs E. Governing epidermal homeostasis by coupling cell–cell adhesion to integrin and growth factor signaling, proliferation, and apoptosis. PNAS. 2012;109(3):4886–4891. doi: 10.1073/pnas.1202120109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Chao DL, Ma L, Shen K. Transient cell–cell interactions in neural circuit formation. Nat Rev Neurosci. 2009;10:262–271. doi: 10.1038/nrn2594. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Kasukawa T, Masumoto K, Nikaido I, Nagano M, Uno KD, Tsujino K, Hanashima C, Shigeyoshi Y, Ueda HR. Quantitative expression profile of distinct functional regions in the adult mouse brain. PLoS ONE. 2011;6:23228. doi: 10.1371/journal.pone.0023228. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Miller JFAP, Mitchell GF. Cell to cell interaction in the immune response v. target cells for tolerance induction. J Exp Med. 1970;131(4):675–699. doi: 10.1084/jem.131.4.675. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Pieters T, Roy VF. Role of cell–cell adhesion complexes in embryonic stem cell biology. J Cell Sci. 2014;127:2603–2613. doi: 10.1242/jcs.146720. [DOI] [PubMed] [Google Scholar]
  • 7.Tweedell KS. The adaptability of somatic stem cells: a review. J Stem Cells Regen Med. 2017;13(1):3–13. doi: 10.46582/jsrm.1301002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Plaks V, Kong N, Werb Z. The cancer stem cell niche: how essential is the niche in regulating stemness of tumor cells? Cell Stem Cell. 2015;16:225–238. doi: 10.1016/j.stem.2015.02.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Hegerfeldt Y, Tusch M, Brocker EB, Friedl P. Collective cell movement in primary melanoma explants: plasticity of cell-cell interaction, beta1-integrin function, and migration strategies. Cancer Res. 2002;62:2125–2130. [PubMed] [Google Scholar]
  • 10.Hofschroer V, Koch KA, Ludwig FT, Friedl P, Oberleithner H, Stock C, Schwab A. Extracellular protonation modulates cell–cell interaction mechanics and tissue invasion in human melanoma cells. Sci Rep. 2017;7:42369. doi: 10.1038/srep42369. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Stein JV, Gonzalez SF. Dynamic intravital imaging of cell–cell interactions in the lymph node. Mech Allerg Dis. 2016;139(1):12–20. doi: 10.1016/j.jaci.2016.11.008. [DOI] [PubMed] [Google Scholar]
  • 12.Reinhar-King CA, Dembo M, Hammer DA. Cell–cell mechanical communication through compliant substrates. Biophys J. 2008;95:6044–6051. doi: 10.1529/biophysj.107.127662. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Dewji NN, Mukhopadhyay D, Singer SJ. An early specific cell–cell interaction occurs in the production of beta-amyloid in cell cultures. PNAS. 2006;103(5):1540–1545. doi: 10.1073/pnas.0509899103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Konry T, Sarkar S, Sabhachandani P, Cohen N. Innovative tools and technology for analysis of single cells and cell–cell interaction. Annu Rev Biomed Eng. 2016;18:259–284. doi: 10.1146/annurev-bioeng-090215-112735. [DOI] [PubMed] [Google Scholar]
  • 15.Rothbauer M, Zirath H, Ertl P. Recent advances in microfluidic technologies for cell-to-cell interaction studies. Lab Chip. 2018;18(2):249–270. doi: 10.1039/C7LC00815E. [DOI] [PubMed] [Google Scholar]
  • 16.Li R, Lv X, Zhang X, Saeed O, Deng Y. Microfluidics for cell–cell interactions: a review. Front Chem Sci Eng. 2016;10(1):90–98. doi: 10.1007/s11705-015-1550-2. [DOI] [Google Scholar]
  • 17.Wiklund M, Christakou AE, Ohlin M, Iranmanesh I, Frisk T, Vanherberghen V, Onfelt B. Ultrasound-induced cell–cell interaction studies in a multi-well microplate. Micromachines. 2014;5:27–49. doi: 10.3390/mi5010027. [DOI] [Google Scholar]
  • 18.Tauriainen J, Gustafsson K, Gothlin M, Gertow J, Buggert M, Frisk TW, Karlsson AC, Uhlin M, Onfelt B. Single-cell characterization of in vitro migration and interaction dynamics of t cells expanded with il-2 and il-7. Front Immunol. 2015;6:196. doi: 10.3389/fimmu.2015.00196. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Merouane A, Rey-Villamizar N, Lu Y, Liadi I, Romain G, Lu J, Singh H, Cooper LJN, Varadarajan N, Roysam B. Automated profiling of individual cell–cell interactions from high-throughput time-lapse imaging microscopy in nanowell grids (timing) Bioinformatics. 2015;31(19):3189–3197. doi: 10.1093/bioinformatics/btv355. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Espulgar W, Yamaguchi Y, Aoki W, Mita D, Saito M, Lee JK, Tamiya E. Single cell trapping and cell–cell interaction monitoring of cardiomyocytes in a designed microfluidic chip. Sens Actuators B Chem. 2015;207:43–50. doi: 10.1016/j.snb.2014.09.068. [DOI] [Google Scholar]
  • 21.Sarkar S, Sabhachandani P, Stroopinsky D, Palmer K, Cohen N, Rosenblatt J, Avigan D, Konry T. Dynamic analysis of immune and cancer cell interactions at single cell level in microfluidic droplets. Biomicrofluidics. 2016;10(5):054115. doi: 10.1063/1.4964716. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Dura B, Dougan SK, Barisa M, Hoehl MM, Lo CT, Ploegh HL, Voldman J. Profiling lymphocyte interactions at the single-cell level by microfluidic cell pairing. Nat Commun. 2015;6:5940. doi: 10.1038/ncomms6940. [DOI] [PubMed] [Google Scholar]
  • 23.Ramilowski JA, Goldberg T, Harshbarger J, Kloppmann E, Lizio M, Satagopam VP, Itoh M, Kawaji H, Carninci P, Rost B, Forrest ARR. A draft network of ligand-receptor-mediated multicellular signalling in human. Nat Commun. 2015;22(6):7866. doi: 10.1038/ncomms8866. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Camp JG, Sekin K, Gerber T, Loeffler-Wirth H, Binder H, Gac M, Kanton S, Kageyama J, Damm G, Seehofer D, Belicova L, Bickle M, Barsacchi R, Okuda R, Yoshizawa E, Kimura M, Ayabe H, Taniguchi H, Takebe T, Treutlein B. Multilineage communication regulates human liver bud development from pluripotency. Nature. 2017;546(7659):533–538. doi: 10.1038/nature22796. [DOI] [PubMed] [Google Scholar]
  • 25.Li L, Dong J, Yan L, Yong J, Liu X, Hu Y, Fan X, Wu X, Guo H, Wang X, Zhu X, Li R, Yan J, Wei Y, Zhao Y, Wang W, Ren Y, Yuan P, Yan Z, Hu B, Guo F, Wen L, Tang F, Qiao J. Single-cell RNA-seq analysis maps development of human germline cells and gonadal niche interactions. Cell Stem Cell. 2017;20:858–873. doi: 10.1016/j.stem.2017.03.007. [DOI] [PubMed] [Google Scholar]
  • 26.Zhou JX, Taramelli R, Pedrini E, Knijnenburg T, Hunag S. Extracting intercellular signaling network of cancer tissues using ligand-receptor expression patterns from whole-tumor and single-cell transcriptomes. Sci Rep. 2017;7(1):8815. doi: 10.1038/s41598-017-09307-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Skelly DA, Squiers GT, McLellan MA, Bolisetty MT, Robson P, Rosenthal NA, Pinto AR. Single-cell transcriptional profiling reveals cellular diversity and intercommunication in the mouse heart. Cell Rep. 2018;22(3):600–610. doi: 10.1016/j.celrep.2017.12.072. [DOI] [PubMed] [Google Scholar]
  • 28.Pavlicev M, Wagner GP, Chavan AR, Owens K, Maziarz J, Dunn-Fletcher C, Lallapur SG, Muglia L, Jones H. Single-cell transcriptomics of the human placenta: inferring the cell communication network of the maternal-fetal interface. Genome Res. 2017;27:349–361. doi: 10.1101/gr.207597.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Joost S, Jacob T, Sun X, Annusver K, La Manno G, Sur I, Kasper M. Single-cell transcriptomics of traced epidermal and hair follicle stem cells reveals rapid adaptations during wound healing. Cell Rep. 2018;25(3):585–597. doi: 10.1016/j.celrep.2018.09.059. [DOI] [PubMed] [Google Scholar]
  • 30.Kramann R, Machado F, Wu H, Kusaba T, Hoeft K, Schneider RK, Humphreys BD. Parabiosis and single-cell RNA sequencing reveal a limited contribution of monocytes to myofibroblasts in kidney fibrosis. JCI Insight. 2018;3(9):99561. doi: 10.1172/jci.insight.99561. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Cohen M, Giladi A, Gorki AD, Solodkin DG, Zada M, Hladik A, Miklosi A, Salame TM, Halpern KB, David E, Itzkovitz S, Harkany T, Knapp S, Amit I. Lung single-cell signaling interaction map reveals basophil role in macrophage imprinting. Cell. 2018;175(4):1031–1044. doi: 10.1016/j.cell.2018.09.009. [DOI] [PubMed] [Google Scholar]
  • 32.Davidson S, Efremova M, Riedel A, Mahata B, Pramanik J, Huuhtanen J, Kar G, Vento-Tormo R, Hagai T, Chen X, Haniffa MA, Shields JD, Teichmann SA. Single-cell RNA sequencing reveals a dynamic stromal niche within the evolving tumour microenvironment. bioRxiv. 2018 doi: 10.1101/467225. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Mucenski ML, Mahoney R, Adam M, Potter AS, Potter SS. Single cell RNA-seq study of wild type and hox9,10,11 mutant developing uterus. Sci Rep. 2019;9:4557. doi: 10.1038/s41598-019-40923-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Wu H, Uchimura K, Donnelly EL, Kirita Y, Morris SA, Humphreys BD. Comparative analysis and refinement of human PSC-derived kidney organoid differentiation with single-cell transcriptomics. Cell Stem Cell. 2018;23(6):869–881. doi: 10.1016/j.stem.2018.10.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Chen L, Lee JW, Chou CL, Nair AV, Battistone MA, Paunescu TG, Merkulova M, Breton S, Verlander JW, Wall SM, Brown D, Burg MB, Knepper MA. Transcriptomes of major renal collecting duct cell types in mouse identified by single-cell RNA-seq. PNAS. 2017;114(46):9989–9998. doi: 10.1073/pnas.1710964114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Menon R, Otto EA, Kokoruda A, Zhou J, Zhang Z, Yoon E, Chen YC, Troyanskaya O, Spence JR, Kretzler M, Cebrian C. Single-cell analysis of progenitor cell dynamics and lineage specification in the human fetal kidney. Development. 2018;145(16):dev164038. doi: 10.1242/dev.164038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Vento-Tormo R, Efremova M, Botting RA, Turco MY, Vento-Tormo M, Meyer KB, Park JE, Stephenson E, Polanski K, Goncalves A, Gardner L, Holmqvist S, Henriksson J, Zou A, Sharkey AM, Millar B, Innes B, Wood L, Wilbrey-Clark A, Payne RP, Ivarsson MA, Lisgo S, Filby A, Rowitch DH, Bulmer JN, Wright GJ, Stubbington MJT, Haniffa M, Moffett A, Teichmann SA. Single-cell reconstruction of the early maternal-fetal interface in humans. Nature. 2018;563(7731):347–353. doi: 10.1038/s41586-018-0698-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Biton M, Haber AL, Rogel N, Burgin G, Beyaz S, Schnell A, Ashenberg O, Su CW, Smillie C, Shekhar K, Chen Z, Wu C, Ordovas-Montanes J, Alvarez D, Herbst RH, Zhang M, Tirosh I, Dionne D, Nguyen LT, Xifaras ME, Shalek AK, von Andrian UH, Graham DB, Rozenblatt-Rosen O, Shi HN, Kuchroo V, Yilmaz OH, Regev A, Xavier RJ. T helper cell cytokines modulate intestinal stem cell renewal and differentiation. Cell. 2018;175(5):1307–1320. doi: 10.1016/j.cell.2018.10.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Kumar MP, Du J, Lagoudas G, Jiao Y, Sawyer A, Drummond DC, Lauffenburger DA, Raue A. Analysis of single-cell RNA-seq identifies cell–cell communication associated with tumor characteristics. Cell Rep. 2018;25(6):1458–1468. doi: 10.1016/j.celrep.2018.10.047. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Verma M, Asakura Y, Murakonda BSR, Pengo T, Latroche C, Chazaud B, McLoon LK, Asakura A. Muscle satellite cell cross-talk with a vascular niche maintains quiescence via VEGF and notch signaling. Cell Stem Cell. 2018;23(4):530–543. doi: 10.1016/j.stem.2018.09.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Jerby-Arnon L, Shah P, Cuoco MS, Rodman C, Su MJ, Melms JC, Leeson R, Kanodia A, Mei S, Lin JR, Wang S, Rabasha B, Liu D, Zhang G, Margolais C, Ashenberg O, Ott PA, Buchbinder EI, Haq R, Hodi FS, Boland GM, Sullivan RJ, Frederick DT, Miao B, Moll T, Flaherty KT, Herlyn M, Jenkins RW, Thummalapalli R, Kowalczyk MS, Canadas I, Schilling B, Cartwright ANR, Luoma AM, Malu S, Hwu P, Bernatchez C, Forget MA, Barbie DA, Shalek AK, Tirosh I, Sorger PK, Wucherpfennig K, Van Allen EM, Schadendorf D, Johnson BE, Rotem A, Rozenblatt-Rosen O, Garraway LA, Yoon CH, Izar B, Regev A. A cancer cell program promotes T cell exclusion and resistance to checkpoint blockade. Cell. 2018;175(4):984–997. doi: 10.1016/j.cell.2018.09.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Kelleher AM, Milano-Foster J, Behura SK, Spencer TE. Uterine glands coordinate on-time embryo implantation and impact endometrial decidualization for pregnancy success. Nat Commun. 2018;9(1):2435. doi: 10.1038/s41467-018-04848-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Yin J, Li Z, Yan C, Fang E, Wang T, Zhou H, Luo W, Zhou Q, Zhang J, Hu J, Jin H, Wang L, Zhao X, Li J, Qi X, Zhou W, Huang C, He C, Yang H, Kristiansen K, Hou Y, Zhu S, Zhou D, Wang L, Dean M, Wu K, Hu H, Li G. Comprehensive analysis of immune evasion in breast cancer by single-cell RNA-seq. bioRxiv. 2018 doi: 10.1101/368605. [DOI] [Google Scholar]
  • 44.Biase FH, Kimble KM. Functional signaling and gene regulatory networks between the oocyte and the surrounding cumulus cells. BMC Genom. 2018;19(1):351. doi: 10.1186/s12864-018-4738-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Thorsson V, Gibbs DL, Brown SD, Wolf D, Bortone DS, Ou Yang TH, Porta-Pardo E, Gao GF, Plaisier CL, Eddy JA, Ziv E, Culhane AC, Paull EO, Sivakumar IKA, Gentles AJ, Malhotra R, Farshidfar F, Colaprico A, Parker JS, Mose LE, Vo NS, Liu J, Liu Y, Rader J, Dhankani V, Reynolds SM, Bowlby R, Califano A, Cherniack AD, Anastassiou D, Bedognetti D, Rao A, Chen K, Krasnitz A, Hu H, Malta TM, Noushmehr H, Pedamallu CS, Bullman S, Ojesina AI, Lamb A, Zhou W, Shen H, Choueiri TK, Weinstein JN, Guinney J, Saltz J, Holt RA, Rabkin CE, Network CGAR, Lazar AJ, Serody JS, Demicco EG, Disis ML, Vincent BG, Shmulevich L. The immune landscape of cancer. Immunity. 2018;48(4):812–830. doi: 10.1016/j.immuni.2018.03.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Han X, Chen H, Huang D, Chen H, Fei L, Cheng C, Huang H, Yuan GC, Guo G. Mapping human pluripotent stem cell differentiation pathways using high throughput single-cell RNA-sequencing open access. BMC Genome Biol. 2018;19(1):47. doi: 10.1186/s13059-018-1426-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Costa A, Kieffer Y, Scholer-Dahirel A, Pelon F, Bourachot B, Cardon M, Sirven P, Magagna I, Fuhrmann L, Bernard C, Bonneau C, Kondratova M, Kuperstein I, Zinovyev A, Givel AM, Parrini MC, Soumelis V, Vincent-Salomon A, Mechta-Grigoriou F. Fibroblast heterogeneity and immunosuppressive environment in human breast cancer. Cancer Cells. 2018;33(3):463–479. doi: 10.1016/j.ccell.2018.01.011. [DOI] [PubMed] [Google Scholar]
  • 48.Hrvatin S, Hochbaum DR, Nagy MA, Cicconet M, Robertson K, Cheadle L, Zilionis R, Ratner A, Borges-Monroy R, Klein AM, Sabatini BL, Greenberg ME. Single-cell analysis of experience-dependent transcriptomic states in the mouse visual cortex. Nat Neurosci. 2018;21(1):120–129. doi: 10.1038/s41593-017-0029-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Suryawanshi H, Morozov P, Straus A, Sahasrabudhe N, Max KEA, Garzia A, Kustagi M, Tuschl T, Williams Z. A single-cell survey of the human first-trimester placenta and decidua. Sci Adv. 2018;4(10):4788. doi: 10.1126/sciadv.aau4788. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Puram SV, Tirosh I, Parikh AS, Patel AP, Yizhak K, Gillespie S, Rodman C, Luo CL, Mroz EA, Emerick KS, Deschler DG, Varvares MA, Mylvaganam R, Rozenblatt-Rosen O, Rocco JW, Faquin WC, Lin DT, Regev A, Bernstein BE. Single-cell transcriptomic analysis of primary and metastatic tumor ecosystems in head and neck cancer. Cell. 2017;171(7):1611–1624. doi: 10.1016/j.cell.2017.10.044. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Ximerakis M, Lipnick SL, Simmons SK, Adiconis X, Innes BT, Dionne D, Nguyen L, Mayweather BA, Ozek C, Niziolek Z, Butty VL, Isserlin R, Buchanan SM, Levine SR, Regev A, Bader GD, Levin JZ, Rubin LL. Single-cell transcriptomics of the aged mouse brain reveals convergent, divergent and unique aging signatures. bioRxiv. 2018 doi: 10.1101/440032. [DOI] [Google Scholar]
  • 52.Sivakamasundari V, Bolisetty M, Sivajothi S, Bessonett S, Ruan D, Robson P. Comprehensive cell type specific transcriptomics of the kidney. bioRxiv. 2017 doi: 10.1101/238063. [DOI] [Google Scholar]
  • 53.Peters AL, Luo Z, Li J, Mourya R, Wang Y, Dexheimer P, Shivakumar P, Aronow B, Bezerra JA. Single cell RNA sequencing reveals regional heterogeneity of hepatobiliary innate lymphoid cells in a tissue-enriched fashion. PLoS ONE. 2019;14(4):e0215481. doi: 10.1371/journal.pone.0215481. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Puram SV, Tirosh I, Parikh AS, Patel AP, Yizhak K, Gillespie S, Rodman C, Luo CL, Mroz EA, Emerick KS, Deschler DG, Varvares MA, Mylvaganam R, Rozenblatt-Rosen O, Rocco JW, Faquin WC, Lin DT, Regev A, Bernstein BE. Single-cell transcriptomic analysis of primary and metastatic tumor ecosystems in head and neck cancer. Cell. 2017;171(7):1611–1624. doi: 10.1016/j.cell.2017.10.044. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Boisset JC, Vivie J, Murano MJ, Lyubimova A, van Oudenaarden A. Mapping the physical network of cellular interactions. Nat Methods. 2018;15:547–553. doi: 10.1038/s41592-018-0009-z. [DOI] [PubMed] [Google Scholar]
  • 56.Zheng GX, Terry JM, Belgrader P, Ryvkin P, Bent ZW, Wilson R, Ziraldo SB, Wheeler TD, McDermott GP, Zhu J, Gregory MT, Shuga J, Montesclaros L, Underwood JG, Masquelier DA, Nishimura SY, Schnall-Levin M, Wyatt PW, Hindson CM, Bharadwaj R, Wong A, Ness KD, Beppu LW, Deeg HJ, McFarland C, Loeb KR, Valente WJ, Ericson NG, Stevens EA, Radich JP, Mikkelsen TS, Hindson BJ, Bielas JH. Massively parallel digital transcriptional profiling of single cells. Nat Commun. 2017;8:14049. doi: 10.1038/ncomms14049. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.van Dijk D, Sharma R, Nainys J, Yim K, Kathail P, Carr AJ, Burdziak C, Moon KR, Chaffer CL, Pattabiraman D, Bierie B, Mazutis L, Wolf G, Krishnaswamy S, Pe’er D. Recovering gene interactions from single-cell data using data diffusion. Cell. 2018;174(3):716–629. doi: 10.1016/j.cell.2018.05.061. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Shekhar K, Lapan SW, Whitney IE, Tran NM, Macosko EZ, Kowalczyk M, Adiconis X, Levin JZ, Nemesh J, Goldman M, McCarroll SA, Cepko CL, Regev A, Sanes JR. Comprehensive classification of retinal bipolar neurons by single-cell transcriptomics. Cell. 2016;166(5):1308–1323. doi: 10.1016/j.cell.2016.07.054. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Kim Y-D, Choi S. Nonnegative tucker decomposition. In: IEEE conference on computer vision and pattern recognition (2007)
  • 60.Cichocki A, Zdunek R, Phan AH, Amari S. Nonnegative matrix and tensor factorizations: applications to exploratory multi-way data analysis and blind source separation. New Jersey: Wiley; 2009. [Google Scholar]
  • 61.Dries R, Zhu Q, Dong R, Eng C-HL, Li H, Liu K, Fu Y, Zhao T, Sarkar A, Bao F, George RE, Pierson N, Cai L, Yuan G-C. Giotto: a toolbox for integrative analysis and visualization of spatial expression data. BMC Genome Biol. 2021;22(78):1–31. doi: 10.1186/s13059-021-02286-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Nagai JS, Leimkühler NB, Schaub MT, Schneider RK, Costa IG. Crosstalker: analysis and visualization of ligand-receptor networks. Bioinformatics. 2021;37(22):4263–4265. doi: 10.1093/bioinformatics/btab370. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Palla G, Spitzer H, Klein M, Fischer D, Schaar AC, Kuemmerle LB, Rybakov S, Ibarra IL, Holmberg O, Virshup I, Lotfollahi M, Richter S, Theis FJ. Squidpy: a scalable framework for spatial single cell analysis. bioRxiv. 2021 doi: 10.1101/2021.02.19.431994. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Hou R, Denisenko E, Ong HT, Ramilowski JA, Forrest ARR. Predicting cell-to-cell communication networks using NATMI. Nat Commun. 2020;11:5011. doi: 10.1038/s41467-020-18873-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Jung S, Singh K, Sol A. Funres: resolving tissue-specific functional cell states based on a cell–cell communication network model. Brief Bioinform. 2021;22(4):283. doi: 10.1093/bib/bbaa283. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Noël F, Massenet-Regad L, Carmi-Levy I, Cappuccio A, Grandclaudon M, Trichot C, Kieffer Y, Mechta-Grigoriou F, Soumelis V. Dissection of intercellular communication using the transcriptome-based framework ICELLNET. Nat Commun. 2021;12:1089. doi: 10.1038/s41467-021-21244-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Li D, Velazquez JJ, Ding J, Hislop J, Ebrahimkhani MR, Bar-Joseph Z. Inferring cell–cell interactions from pseudotime ordering of scRNA-Seq data. bioRxiv. 2021 doi: 10.1101/2021.07.28.454054. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Halpern KB, Shenhav R, Massalha H, Toth B, Egozi A, Massasa EE, Medgalia C, David E, Giladi A, Moor AE, Porat Z, Amit I, Itzkovitz S. Paired-cell sequencing enables spatial gene expression mapping of liver endothelial cells. Nat Biotechnol. 2018;36(10):962–970. doi: 10.1038/nbt.4231. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Cabello-Aguilar S, Alame M, Kon-Sun-Tack F, Fau C, Lacroix M, Colinge J. Singlecellsignalr: inference of intercellular networks from single-cell transcriptomics. Nucleic Acids Res. 2020;48(10):55. doi: 10.1093/nar/gkaa183. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Shao X, Liao J, Li C, Lu X, Cheng J, Fan X. Celltalkdb: a manually curated database of ligand-receptor interactions in humans and mice. Brief Bioinform. 2021;22(4):269. doi: 10.1093/bib/bbaa269. [DOI] [PubMed] [Google Scholar]
  • 71.Armingol E, Baghdassarian HM, Martino C, Perez-Lopez A, Knight R, Lewis NE. Context-aware deconvolution of cell–cell communication with tensor-cell2cell. Nat Commun. 2022;13:3665. doi: 10.1038/s41467-022-31369-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Dimitrov D, Türei D, Garrido-Rodriguez M, Burmedi PL, Nagai JS, Boys C, Ramirez Flores RO, Kim H, Szalai B, Costa IG, Valdeolivas A, Dugourd A, Saez-Rodriguez J. Comparison of methods and resources for cell–cell communication inference from single-cell RNA-seq data. Nat Commun. 2022;12(1):3224. doi: 10.1038/s41467-022-30755-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Baghdassarian H, Dimitrov D, Armingol E, Saez-Rodriguez J, Lewis NE. Combining liana and tensor-cell2cell to decipher cell–cell communication across multiple samples. bioRxiv. 2023 doi: 10.1101/2023.04.28.538731. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Tsuyuzaki KT, Ishii M, Nikaido I. nnTensor: an R package for non-negative matrix/tensor decomposition. JOSS. 2023;8:5015. doi: 10.21105/joss.05015. [DOI] [Google Scholar]
  • 75.Durinck S, Spellman P, Birney E, Huber W. Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRT. Nat Protocols. 2009;4:1184–1191. doi: 10.1038/nprot.2009.97. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Fabregat A, Jupe S, Matthews L, Sidiropoulos K, Gillespie M, Garapati P, Haw R, Jassal B, Korninger F, May B, Milacic M, Roca CD, Rothfels K, Sevilla C, Shamovsky V, Shorser S, Varusai T, Viteri G, Weiser J, Wu G, Stein L, Hermjakob H, D’Eustachio P. The reactome pathway knowledgebase. Nucleic Acids Res. 2018;46(D1):649–655. doi: 10.1093/nar/gkx1132. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Tsuyuzaki K, Morota G, Ishii M, Nakazato T, Miyazaki S, Nikaido I. MeSH ORA framework: R/Bioconductor packages to support MeSH over-representation analysis. BMC Bioinform. 2015;16(45):1–17. doi: 10.1186/s12859-015-0453-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Falcon S, Gentleman R. Using GOstats to test gene lists for GO term association. Bioinformatics. 2007;23(2):257–258. doi: 10.1093/bioinformatics/btl567. [DOI] [PubMed] [Google Scholar]
  • 79.Yu G, He Q. ReactomePA: an R/Bioconductor package for reactome pathway analysis and visualization. Mol BioSyst. 2016;12(12):477–479. doi: 10.1039/C5MB00663E. [DOI] [PubMed] [Google Scholar]
  • 80.Yu G, Wang L, Yan G, He Q. DOSE: an R/Bioconductor package for disease ontology semantic and enrichment analysis. Bioinformatics. 2015;31(4):608–609. doi: 10.1093/bioinformatics/btu684. [DOI] [PubMed] [Google Scholar]
  • 81.Ono H, Ogasawara O, Okubo K, Bono H. Refex, a reference gene expression dataset as a web tool for the functional analysis of genes. Sci Data. 2017;4:170105. doi: 10.1038/sdata.2017.105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Papatheodorou I, Fonseca NA, Keays M, Tang YA, Barrera E, Bazant W, Burke M, Fullgrabe A, Fuentes AM, George N, Huerta L, Koskinen S, Mohammed S, Geniza M, Preece J, Jaiswal P, Jarnuczak AF, Huber W, Stegle O, Vizcaino JA, Brazma A, Petryszak R. Expression atlas: gene and protein expression across multiple studies and organisms. Nucleic Acids Res. 2018;46(Database issue):246–51. doi: 10.1093/nar/gkx1158. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Papatheodorou I, Moreno P, Manning J, Fuentes AMP, George N, Fexova S, Fonseca NA, Füllgrabe A, Green M, Huang N, Huerta L, Iqbal H, Jianu M, Mohammed S, Zhao L, Jarnuczak AF, Jupp S, Marioni J, Meyer K, Petryszak R, Medina CAP, Talavera-López C, Teichmann S, Vizcaino JA, Brazma A. Expression atlas update: from tissues to single cells. Nucleic Acids Res. 2019;48:947. doi: 10.1093/nar/gkz947. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Cao Y, Zhu J, Jia P, Zhao Z. scRNASeqDB: a database for RNA-Seq based gene expression profiles in human single cells. Genes (Basel) 2017;8(12):368. doi: 10.3390/genes8120368. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Franzén O, Gan L-M, Björkegren JLM. Panglaodb: a web server for exploration of mouse and human single-cell RNA sequencing data. Database 2019, 046 [DOI] [PMC free article] [PubMed]
  • 86.Carithers LJ, Ardlie K, Barcus M, Branton PA, Britton A, Buia SA, Compton CC, DeLuca DS, Peter-Demchok J, Gelfand ET, Guan P, Korzeniewski GE, Lockhart NC, Rabiner CA, Rao AK, Robinson KL, Roche NV, Sawyer SJ, Segre AV, Shive CE, Smith AM, Sobin LH, Undale AH, Valentino KM, Vaught J, Young TR, Moore HM, Consortium G. A novel approach to high-quality postmortem tissue procurement: the GTEx project. Biopreservation Biobanking. 2015;13(5):311–319. doi: 10.1089/bio.2015.0032. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.RIKEN: Collection | 29 August 2017 The FANTOM5 project. https://www.nature.com/collections/jcxddjndxy
  • 88.Bernstein BE, Stamatoyannopoulos JA, Costello JF, Ren B, Milosavljevic A, Meissner A, Kellis M, Marra MA, Beaudet AL, Ecker JR, Farnham PJ, Hirst M, Lander ES, Mikkelsen TS, Thomson JA. The NIH roadmap epigenomics mapping consortium. Nat Biotechnol. 2010;28(10):1045–1048. doi: 10.1038/nbt1010-1045. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Consotium EP. An integrated encyclopedia of DNA elements in the human genome. Nature. 2012;489(7414):57–74. doi: 10.1038/nature11247. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Uhlen M, Fagerberg L, Hallstrom BM, Lindskog C, Oksvold P, Mardinoglu A, Sivertsson A, Kampf C, Sjostedt E, Asplund A, Olsson I, Edlund K, Lundberg E, Navani S, Szigyarto CA, Odeberg J, Djureinovic D, Takanen JO, Hober S, Alm T, Edqvist PH, Berling H, Tegel H, Mulder J, Rockberg J, Nilsson P, Schwenk JM, Hamsten M, von Feilitzen K, Forsberg M, Persson L, Johansson F, Zwahlen M, von Heijne G, Nielsen JFP. Proteomics tissue-based map of the human proteome. Science. 2015;347(6220):1260419. doi: 10.1126/science.1260419. [DOI] [PubMed] [Google Scholar]
  • 91.Lamb J, Crawford ED, Peck D, Modell JW, Blat IC, Wrobel MJ, Lerner J, Brunet JP, Subramanian A, Ross KN, Reich M, Hieronymus H, Wei G, Armstrong SA, Haggarty SJ, Clemons PA, Wei R, Carr SA, Lander ES, Golub TR. The connectivity map: using gene-expression signatures to connect small molecules, genes, and disease. Science. 2006;313(5795):1929–1935. doi: 10.1126/science.1132939. [DOI] [PubMed] [Google Scholar]
  • 92.Wang Y, Tung H-Y, Smola A, Anandkumar A. Fast and guaranteed tensor decomposition via sketching. In: NIPS, vol 1, pp 991–999 (2015)
  • 93.Maehara, T., Hayashi, K., Kawarabayashi, K.: Expected tensor decomposition with stochastic gradient descent. In: AAAI’16, pp 1919–1925 (2016)
  • 94.Smith, S., Park, J., Karypis, G.: An exploration of optimization algorithms for high performance tensor completion. In: SC ’16 proceedings of the international conference for high performance computing, networking, storage and analysis, vol 31 (2016)
  • 95.Shin K, Sael L, Kang U. Fully scalable methods for distributed tensor factorization. IEEE Trans Knowl Data Eng. 2017;29(1):100–113. doi: 10.1109/TKDE.2016.2610420. [DOI] [Google Scholar]
  • 96.Tsuyuzaki K. Guidelines to handle large-scale and complex tensor data in r. BioC Asia 2021 (2021). 10.7490/f1000research.1118833.1
  • 97.DelayedArray: A Unified framework for working transparently with on-disk and in-memory array-like datasets. R Package Version 0.24.0. https://bioconductor.org/packages/DelayedArray
  • 98.Tsuyuzaki, K., Nikaido, I.: Biological systems as heterogeneous information networks: a mini-review and perspectives. HeteroNAM’18 (2018)
  • 99.Baccin C, Al-Sabah J, Velten L, Helbling PM, Grünschläger F, Hernández-Malmierca P, Nombela-Arrieta C, Steinmetz LM, Trumpp A, Haas S. Combined single-cell and spatial transcriptomics reveal the molecular, cellular and spatial bone marrow niche organization. Nat Cell Biol. 2020;22(1):38–48. doi: 10.1038/s41556-019-0439-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Lagger C, Ursu E, Equey A, Avelar RA, Pisco AO, Tacutu R, Magalhães JP. scAgeCom: a murine atlas of age-related changes in intercellular communication inferred with the package scDiffCom. bioRxiv. 2021 doi: 10.1101/2021.08.13.456238. [DOI] [Google Scholar]
  • 101.Yuan Y, Cosme C, Adams TS, Schupp J, Sakamoto K, Xylourgidis N, Ruffalo M, Kaminski N, Bar-Joseph Z. Cins: cell interaction network inference from single cell expression data. bioRxiv. 2021 doi: 10.1101/2021.02.22.432206. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102.Jin S, Guerrero-Juarez CF, Zhang L, Chang I, Ramos R, Kuan C-H, Myung P, Plikus MV, Nie Q. Inference and analysis of cell–cell communication using cellchat. Nat Commun. 2021;12:1088. doi: 10.1038/s41467-021-21246-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 103.Raredon MSB, Yang J, Garritano J, Wang M, Kushnir D, Schupp JC, Adams TS, Greaney AM, Leiby KL, Kaminski N, Kluger Y, Levchenko A, Le N. Connectome: computation and visualization of cell–cell signaling topologies in single-cell systems data. Sci Rep. 2022;12(1):4187. doi: 10.1038/s41598-022-07959-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104.Solovey M, Scialdone A. Comunet: a tool to explore and visualize intercellular communication. Bioinformatics. 2020;36(15):4296–4300. doi: 10.1093/bioinformatics/btaa482. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 105.Cillo AR, Kürten CHL, Tabib T, Qi Z, Onkar S, Wang T, Liu A, Duvvuri U, Kim S, Soose RJ, Oesterreich S, Chen W, Lafyatis R, Bruno TC, Ferris RL, Vignali DAA. Immune landscape of viral- and carcinogen-driven head and neck cancer. Immunity. 2020;52(1):183–1999. doi: 10.1016/j.immuni.2019.11.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 106.Yuanxin W, Ruiping W, Shaojun Z, Shumei S, Changying J, Guangchun H, Michael W, Jaffer A, Andy F, Wang L. iTALK: an R package to characterize and illustrate intercellular communication. Nature. 2019 doi: 10.1101/507871. [DOI] [Google Scholar]
  • 107.Hu Y, Peng T, Gao L, Tan K. CytoTalk: De novo construction of signal transduction networks using single-cell transcriptomic data. Sci Adv. 2021;7(16):1356. doi: 10.1126/sciadv.abf1356. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 108.Browaeys R, Saelens W, Sayes Y. NicheNet: modeling intercellular communication by linking ligands to target genes. Nat Method. 2019;17(2):159–162. doi: 10.1038/s41592-019-0667-5. [DOI] [PubMed] [Google Scholar]
  • 109.Zhang Y, Liu T, Hu X, Wang M, Wang J, Zou B, Tan P, Cui T, Dou Y, Ning L, Huang Y, Rao S, Wang D, Zhao X. Cellcall: integrating paired ligand–receptor and transcription factor activities for cell–cell communication. Nucleic Acids Res. 2021;49(15):8520–8534. doi: 10.1093/nar/gkab638. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 110.Yu A, Li Y, Li I, Yeh C, Chiou AE, Ozawa MG, Taylor J, Plevritis SK. Reconstructing co-dependent cellular crosstalk in lung adenocarcinoma using REMI. bioRxiv. 2021 doi: 10.1101/2021.05.02.440071. [DOI] [Google Scholar]
  • 111.Pham D, Tan X, Xu J, Grice LF, Lam PY, Raghubar A, Vukovic J, Ruitenberg MJ, Nguyen Q. stLearn: integrating spatial location, tissue morphology and gene expression to find cell types, cell–cell interactions and spatial trajectories within undissociated tissues. bioRxiv. 2021 doi: 10.1101/2020.05.31.125658. [DOI] [Google Scholar]
  • 112.Wang S, MacLean A, Nie Q. SoptSC: similarity matrix optimization for clustering, lineage, and signaling inference. bioRxiv. 2018 doi: 10.1101/168922. [DOI] [Google Scholar]
  • 113.Liu Z, Sun D, Wang C. Evaluation of cell–cell interaction methods by integrating single-cell RNA sequencing data with spatial information. BMC Genome Biol. 2022;23(1):218. doi: 10.1186/s13059-022-02783-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 114.Yokota T, Zhao Q, Cichocki A. Smooth PARAFAC decomposition for tensor completion. IEEE Trans Signal Process. 2016;64(20):5423–5436. doi: 10.1109/TSP.2016.2586759. [DOI] [Google Scholar]
  • 115.Ge H, Caverlee J, Zhang N, Squicciarini A. Uncovering the spatio-temporal dynamics of memes in the presence of incomplete information. In: Proceedings of CIKM’16, pp 1493–1502 (2016). 10.1145/2983323.2983782
  • 116.Park, J.Y., Carr, K.T., Zheng, S., Yue, Y., Yu, R.: Multiresolution tensor learning for efficient and interpretable spatial analysis. In: Proceedings of ICML2020 695, pp 7499–7509 (2020)
  • 117.Sasagawa Y, Nikaido I, Hayashi T, Danno H, Uno KD, Imai T, Ueda HR. Quartz-Seq: a highly reproducible and sensitive single-cell RNA sequencing method, reveals non-genetic gene-expression heterogeneity. BMC Genome Biol. 2013;14(4):31. doi: 10.1186/gb-2013-14-4-r31. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 118.Pierson E, Yau C. Zifa: dimensionality reduction for zero-inflated single-cell gene expression analysis. BMC Genome Biol. 2015;16(241):1–10. doi: 10.1186/s13059-015-0805-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 119.Graeber TG, Eisenberg D. Bioinformatic identification of potential autocrine signaling loops in cancers from gene expression profiles. Nat Genet. 2001;29(3):295–300. doi: 10.1038/ng755. [DOI] [PubMed] [Google Scholar]
  • 120.Harding SD, Sharman JL, Faccenda E, Southan C, Pawson AJ, Ireland S, Gray AJG, Bruce L, Alexander SPH, Anderton S, Bryant C, Davenport AP, Doerig C, Fabbro D, Levi-Schaffer F, Spedding M, Davies N-I. The IUPHAR/BPS guide to pharmacology in 2018: updates and expansion to encompass the new guide to immunopharmacology. Nucleic Acids Res. 2018;46(D1):1091–1106. doi: 10.1093/nar/gkx1121. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 121.Ben-Shlomo I, Yu Hsu S, Rauch R, Kowalski HW, Hsueh AJ. Signaling receptome: a genomic and evolutionary perspective of plasma membrane receptors involved in signal transduction. Sci Signal. 2003;2003(187):9. doi: 10.1126/stke.2003.187.re9. [DOI] [PubMed] [Google Scholar]
  • 122.Hutchins LN, Murphy SM, Singh P, Graber JH. Position-dependent motif characterization using non-negative matrix factorization. Bioinformatics. 2008;24(23):2684–2690. doi: 10.1093/bioinformatics/btn526. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 123.Chicco D. Ten quick tips for machine learning in computational biology. BMC BioData Min. 2017;10:35. doi: 10.1186/s13040-017-0155-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 124.Saito T, Rehmsmeier M. The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets. PLoS ONE. 2015;10(3):0118432. doi: 10.1371/journal.pone.0118432. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 125.Davis J, Goadrich M. The relationship between precision–recall and roc curves. In: Proceedings of ICML’06, pp 233–240 (2006). 10.1145/1143844.1143874
  • 126.Chicco D, Jurman G. The advantages of the Matthews correlation coefficient (MCC) over f1 score and accuracy in binary classification evaluation. BMC Genom. 2020;21(6):1–13. doi: 10.1186/s12864-019-6413-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 127.Boughorbel S, Jarray F, El-Anbari M. Optimal classifier for imbalanced data using Matthews correlation coefficient metric. PLoS ONE. 2017;12(6):0177678. doi: 10.1371/journal.pone.0177678. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 128.Chicco D, Tötsch N, Jurman G. The Matthews correlation coefficient (MCC) is more reliable than balanced accuracy, bookmaker informedness, and markedness in two-class confusion matrix evaluation. BMC BioData Min. 2021;14(13):1–22. doi: 10.1186/s13040-021-00244-z. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

12859_2023_5490_MOESM1_ESM.xlsx (15.7KB, xlsx)

Additional file 1. List of existing L–R scoring and CCI detection methods.

12859_2023_5490_MOESM2_ESM.pdf (1.9MB, pdf)

Additional file 2. Ground truth CCIs in simulated datasets.

12859_2023_5490_MOESM3_ESM.pdf (3.6MB, pdf)

Additional file 3. AUCROC values of all methods.

12859_2023_5490_MOESM4_ESM.pdf (3.6MB, pdf)

Additional file 4. AUCPR values of all methods.

12859_2023_5490_MOESM5_ESM.pdf (3.5MB, pdf)

Additional file 5. Memory values of all methods.

12859_2023_5490_MOESM6_ESM.pdf (2.6MB, pdf)

Additional file 6. Computational time values of all methods.

12859_2023_5490_MOESM7_ESM.pdf (2.5MB, pdf)

Additional file 7. F-measure values of all binarization methods.

12859_2023_5490_MOESM8_ESM.pdf (2.4MB, pdf)

Additional file 8. MCC values of all binarization methods.

12859_2023_5490_MOESM9_ESM.pdf (2.3MB, pdf)

Additional file 9. PR values of all binarization methods.

12859_2023_5490_MOESM10_ESM.pdf (2.4MB, pdf)

Additional file 10. FPR values of all binarization methods.

12859_2023_5490_MOESM11_ESM.pdf (2.5MB, pdf)

Additional file 11. FNR values of all binarization methods.

12859_2023_5490_MOESM13_ESM.docx (224.1KB, docx)

Additional file 13. Three L-R pairs in which each of the three methods excelled.

12859_2023_5490_MOESM14_ESM.zip (130.6MB, zip)

Additional file 14. HTML report of FetalKidney.

12859_2023_5490_MOESM15_ESM.zip (165.5MB, zip)

Additional file 15. HTML report of GermlineFemale.

12859_2023_5490_MOESM16_ESM.zip (180.8MB, zip)

Additional file 16. HTML report of HeadandNeckCancer.

12859_2023_5490_MOESM17_ESM.zip (232.5MB, zip)

Additional file 17. HTML report of Uterus.

12859_2023_5490_MOESM18_ESM.zip (304.2MB, zip)

Additional file 18. HTML report of VisualCortex.


Articles from BMC Bioinformatics are provided here courtesy of BMC

RESOURCES