Briefings in Bioinformatics. 2025 Sep 25;26(5):bbaf493. doi: 10.1093/bib/bbaf493

A multi-omics integration framework using multi-label guided learning and multi-scale fusion

Yuze Li 1,#, Yinghe Wang 2,#, Tao Liang 3, Ying Li 4, Wei Du 5
PMCID: PMC12461718  PMID: 40996147

Abstract

The rapid development of high-throughput sequencing technologies has generated vast amounts of omics data, making multi-omics integration a crucial approach for understanding complex diseases. Despite the introduction of various multi-omics integration methods in recent years, existing approaches still have limitations, primarily in their reliance on manual feature selection, restricted applicability, and inability to comprehensively capture both inter-sample and cross-omics interactions. To address these challenges, we propose mmMOI, an end-to-end multi-omics integration framework that incorporates multi-label guided learning and multi-scale attention fusion. mmMOI directly processes raw high-dimensional omics data without requiring manual feature selection, thereby enhancing model interpretability and eliminating biases introduced by feature preselection. First, we introduce a multi-label guided multi-view graph neural network, which enables the model to adaptively learn omics data representations across different datasets, thereby improving generalizability and stability. Second, we design a multi-scale attention fusion network, which integrates global attention and local attention. This dual-attention mechanism allows mmMOI to more accurately integrate multi-omics data, enhance cross-omics feature representations, and improve classification performance. Experimental results demonstrate that mmMOI significantly outperforms state-of-the-art methods in classification tasks, exhibiting high stability and adaptability across diverse biological contexts and sequencing technologies. Additionally, mmMOI successfully identifies key disease-associated biomarkers, further enhancing its biological interpretability and practical relevance. The source code, datasets, and detailed hyperparameter configurations for mmMOI are available at https://github.com/mlcb-jlu/mmMOI.

Keywords: multi-omics integration, multi-label guided learning, multi-scale fusion, graph neural network

Introduction

Cancer is an extremely complex genomic disease that can occur in most human organs [1]. Previous studies have shown that genomic alterations such as somatic mutations, copy number changes, epigenetic aberrations, chromatin rearrangements, and gene fusions can all lead to cancer [2]. The treatment methods and processes for different cancer patients vary significantly. Moreover, even patients with the same type of cancer can have significantly different responses to the same drug and survival risks, suggesting a high degree of heterogeneity among subtypes of the same cancer. Therefore, clinical approaches should consider these differences to implement appropriate diagnostic and therapeutic measures [3]. The ability to accurately identify and classify these cancers and their subtypes will directly impact patients’ precision diagnosis and personalized treatment [4, 5].

With the advent of high-throughput sequencing technology, scientists have accumulated vast amounts of omics data [6, 7], including genomics [8], transcriptomics [9], epigenomics [10], metabolomics [11], and proteomics [12]. Early cancer classification predominantly relied on single-omics data, which failed to capture the intricate relationships between gene mutation, expression, and regulation, resulting in less accurate classifications [13, 14]. Integrating various omics data offers a comprehensive understanding of complex diseases and can be approached in three ways: early, late, and intermediate integration [15]. Early integration combines all omics data into a single feature set, which can increase redundancy and decrease model performance. Late integration models each data type separately and combines the results but misses inter-omics correlations. Intermediate integration merges different omics data during model construction, reducing redundancy while preserving biological correlations, thus yielding better results. These methods need to address two main challenges: the feature representation of single-omics data and the effective integration of different types of omics data.

The most common method for representing single-omics data is the autoencoder (AE). Ma et al. [16] proposed the Multi-Omics Cancer Subtype Classification (MOCSC), which uses stacked sparse denoising AEs to extract features. These features are then provided to a single-layer neural network to obtain initial predictions, which are integrated into a view-related discovery network for final training and prediction. Benkirane et al. [17] introduced the CustOmics method, a staged fusion framework. In the first stage, an AE is provided for each type of omics data to create subrepresentations, which are input into a central variational AE in the second stage. Graph neural network (GNN)-based methods have also been proposed to capture the similarity relationships between samples. Wang et al. [18] developed the Multi-Omics Graph Convolutional Network (MOGONET), which uses three independent graph convolutional networks to analyze different types of omics data and integrates the learned label spaces into tensors for final classification. Chen et al. [19] proposed the Supervised graph contrastive learning for cancer subtype identification (MCRGCN), which employs supervised graph contrastive learning on multi-omics data using various data augmentation methods to effectively learn the unique feature distributions and interactions of different omics data, thus achieving cancer subtype classification. Liang et al. [20] introduced the Explainable multi-omics integration for disease prediction and module detection (GREMI), which uses graph attention networks to learn biomolecular interaction information for feature representation. The subsequent fusion process employs a real-class probability method to adaptively classify at both the feature and omics levels.

Attention mechanisms are widely applied in multi-omics problems. Gong et al. [21] proposed the Multi-omics integration via attention-based deep learning for biomedical classification (MOADLN), which uses a self-attention mechanism for dimensionality reduction and classifies through a multi-omics correlation discovery network. Pang et al. [22] proposed the attentionMOI method, which uses a distribution-based feature denoising algorithm for feature selection and then uses multi-omics attention fusion to predict cancer prognosis and identify cancer subtypes. Li et al. [23] proposed the Multi-omics integration via graph convolutional networks for cancer subtyping (MoGCN), combining AE and GCN for cancer subtype analysis. Ouyang et al. [24] proposed the Multi-omics integration via adaptive graph learning and attention mechanism (MOGLAM), combining GCN and attention mechanisms for cancer classification and biomarker identification. This method uses a dynamic graph convolutional network with feature selection to identify important biomarkers and applies a multi-omics attention mechanism to weight the embedded representations of different omics, capturing complex common and complementary information.

Despite recent advances in multi-omics integration, existing methods still face several limitations. First, omics data preprocessing is heavily dependent on feature selection, making the final integration outcomes more influenced by initial feature selection rather than the model’s intrinsic representation learning capability. This reliance not only restricts model interpretability but also introduces the risk of label leakage. Second, many current methods are tailored for specific datasets, requiring extensive parameter tuning or structural modifications when applied to different data types, thereby limiting their generalizability. Third, most existing approaches employ single-level attention mechanisms, which fail to capture both inter-sample and cross-omics interactions sufficiently. This deficiency reduces the comprehensiveness and granularity of multi-omics data integration.

To address these challenges, we propose mmMOI, a multi-omics integration framework using multi-label guided learning and multi-scale fusion. The framework has two components: (i) the single-omics data representation learning module with a multi-view GNN based on multi-label guided learning and (ii) the multi-omics data fusion module with a multi-scale attention fusion network. Unlike traditional feature representation learning methods, we treat label prediction as an auxiliary training task. Under the guidance of partial true labels and pseudo-labels extracted via a graph convolution module, the model evaluates predictions from each view and adjusts the fusion weights accordingly. This allows the model to adaptively integrate multi-view graphs, producing a consensus graph representation enriched with global information. Next, each learned omics representation is fed into the global attention module, which assigns weights to different omics features while preserving the intra-omics relationships between patients. Finally, the local attention module refines the fused representations by capturing shared and complementary information across different omics types. In classification tasks across different cancer subtypes, mmMOI demonstrates superior performance compared to state-of-the-art approaches, consistently exhibiting high stability and adaptability across diverse biological contexts and sequencing technologies. Additionally, mmMOI effectively identifies key disease-associated biomarkers, further enhancing its biological interpretability and clinical applicability.

The primary contributions of the proposed model are summarized as follows: (i) We introduce a novel multi-omics integration framework that combines multi-label guided learning with multi-scale attention fusion, enhancing both interpretability and generalizability. (ii) A multi-label guided multi-view GNN is employed to adaptively learn representations from omics data, mitigating the risk of overfitting to specific labels. (iii) A multi-scale attention fusion network is designed to dynamically integrate different omics layers, enabling the model to accurately capture complex biological interactions.

Materials and methods

Overview of mmMOI

We present a multi-omics integration framework using multi-label guided learning and multi-scale fusion (mmMOI for short), as illustrated in Fig. 1A. For single-omics representation learning (Fig. 1B), the framework utilizes a multi-view GNN guided by multi-label learning to learn representations from each omics data type. Unlike traditional methods, this module treats representation learning as an auxiliary training task, enhancing feature extraction without overfitting specific labels. For multi-omics data fusion (Fig. 1C), the framework employs a multi-scale attention fusion network that adaptively integrates representations from different omics layers. This allows the model to capture complex interactions between various biological data types.

Figure 1.


Overview of mmMOI, including (A) The overall flow of the multi-omics integration framework, (B) The framework of the multi-view GNN guided by multi-label learning, and (C) The framework of the multi-scale attention fusion network.

Single-omics data representation

Cancer multi-omics datasets typically comprise only a few hundred samples, while feature dimensions can reach the thousands, often containing significant noise and redundant features. Consequently, representation learning is crucial for handling such high-dimensional omics data. In our study, we propose a multi-view GNN model based on multi-label auxiliary training for single-omics data representation learning. This model adaptively reduces the dimensionality of multi-omics data across different datasets and downstream tasks during the learning process. The corresponding pseudocode is shown as Algorithm 1.


Dimensionality reduction autoencoder

For any omics data $X \in \mathbb{R}^{n \times d}$, where $n$ represents the number of patients and $d$ represents the original feature dimensionality, define $f_{\mathrm{enc}}$ and $f_{\mathrm{dec}}$ as the encoder and decoder, respectively. The encoder $f_{\mathrm{enc}}$ maps $X$ to the latent space $H$, and the decoder $f_{\mathrm{dec}}$ reconstructs $H$ to $\hat{X}$ with identical dimensionality to $X$. The latent representation $H \in \mathbb{R}^{n \times d'}$ (where $d'$ is the reduced dimension) is extracted as the low-dimensional features. The AE thus reduces dimensionality from $d$ to $d'$. Based on $H$, we derive the node relationship matrix $S$ as follows:

$$ S_{ij} = \begin{cases} 1, & \mathrm{sim}(h_i, h_j) \ge \epsilon \\ 0, & \text{otherwise} \end{cases} \qquad (1) $$

where $\epsilon$ is a predefined threshold and $\mathrm{sim}(h_i, h_j)$ measures the pairwise similarity between the latent representations $h_i$ and $h_j$ of samples $i$ and $j$.
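To make the construction concrete, the following is a minimal PyTorch sketch of the dimensionality reduction AE and the thresholded relationship matrix of Eq. (1). The single-layer architecture, the threshold value, and the use of cosine similarity for $\mathrm{sim}(\cdot,\cdot)$ are illustrative assumptions, not the exact configuration released with mmMOI.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class OmicsAE(nn.Module):
    """Symmetric autoencoder reducing d-dimensional omics features to d'."""
    def __init__(self, d_in: int, d_latent: int):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(d_in, d_latent), nn.ReLU())
        self.decoder = nn.Linear(d_latent, d_in)

    def forward(self, x):
        h = self.encoder(x)       # H in R^{n x d'}
        x_hat = self.decoder(h)   # reconstruction with the input dimensionality
        return h, x_hat

def relationship_matrix(h: torch.Tensor, eps: float = 0.5) -> torch.Tensor:
    """Binary node relationship matrix of Eq. (1): S_ij = 1 iff sim(h_i, h_j) >= eps.
    Cosine similarity is used here as one plausible choice of sim(., .)."""
    h_norm = F.normalize(h, dim=1)
    sim = h_norm @ h_norm.T
    return (sim >= eps).float()
```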


Multi-label guided graph fusion

In multi-omics problems, complex interdependencies exist between samples. While AEs have nonlinear fitting capabilities, GNNs better capture such relationships using graph-structured data, leveraging node features and pairwise similarities to extract effective representations. Therefore, we feed the low-dimensional omics representations $H$ from the dimensionality reduction AEs into GNNs for feature learning. To model node relationships, we generate view-specific adjacency matrices $\{A_v\}_{v=1}^{V}$ using multiple similarity metrics. Since the $A_v$ contain complementary information, we integrate them into a consensus graph. The adjacency matrix $A_c$ of the consensus graph is defined as follows:

$$ A_c = \sum_{v=1}^{V} w_v A_v \qquad (2) $$

where the weights $w_v$ are determined by multi-label guidance. Using the training labels $Y$ and the pseudo-labels $\tilde{Y}$ obtained from consensus graph clustering, we compute view scores $s_v$ via graph encoding and convert them to weights:

$$ w_v = \frac{s_v^{\gamma}}{\sum_{u=1}^{V} s_u^{\gamma}} \qquad (3) $$

where $V$ denotes the number of views and $\gamma$ is the smooth-sharp parameter: $\gamma < 1$ has a smoothing effect, while $\gamma > 1$ amplifies the differences between views.
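The fusion of Eqs (2)-(3) reduces to a power-normalized weighting of the view adjacency matrices; a minimal sketch, assuming nonnegative view scores, is given below.

```python
import torch

def fuse_views(adjs: list[torch.Tensor], scores: torch.Tensor, gamma: float = 1.0):
    """Consensus adjacency A_c = sum_v w_v A_v (Eq. 2), with weights
    w_v = s_v^gamma / sum_u s_u^gamma (Eq. 3). gamma < 1 smooths the
    weights, gamma > 1 sharpens the differences between views."""
    w = scores.clamp(min=1e-8).pow(gamma)   # guard against zero scores
    w = w / w.sum()
    a_c = sum(w_v * a_v for w_v, a_v in zip(w, adjs))
    return a_c, w
```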


Graph encoding and view evaluation

As shown in Fig. 1B, the encoder of each view-specific graph takes as input the low-dimensional features $H$ from the AE and the view-specific adjacency matrix $A_v$, while the encoder of the consensus graph takes $H$ and the consensus adjacency matrix $A_c$. This module encodes $H$ together with the graph structural information to generate latent representations of the omics data. We implement a parameter-shared GNN that operates on both the view-specific and consensus graphs as follows:

$$ Z_v = \mathrm{GNN}(H, \hat{A}_v) \qquad (4) $$
$$ Z_c = \mathrm{GNN}(H, \hat{A}_c) \qquad (5) $$

where $\mathrm{GNN}(\cdot)$ denotes the graph convolution operation, and $\hat{A}_v$ and $\hat{A}_c$ are the normalized forms of $A_v$ and $A_c$, respectively. As shown in Fig. 1B, the consensus embedding $Z_c$ contains global information from all views and serves as the source of pseudo-labels $\tilde{Y}$. We evaluate each view using both $\tilde{Y}$ and the training labels $Y$ as follows:

$$ s_v = \mathrm{Eval}\big(\mathrm{KM}(Z_v), \tilde{Y}\big) + \mathrm{Eval}\big(\mathrm{MLP}(Z_v^{\mathrm{train}}), Y\big) \qquad (6) $$

where $\mathrm{Eval}(\cdot)$ computes evaluation metrics (e.g. accuracy), $\mathrm{KM}$ denotes the k-means algorithm, $\mathrm{MLP}$ denotes a linear classifier, $Z_v^{\mathrm{train}}$ denotes the training set embeddings, and $s_v$ is the resulting view score.
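A sketch of the parameter-shared graph encoder and the view-scoring step of Eq. (6) follows. Summing the clustering-agreement and supervised terms, and using the adjusted Rand index for the clustering term, are illustrative assumptions made here, not the authors' confirmed choices.

```python
import torch
import torch.nn as nn
from sklearn.cluster import KMeans
from sklearn.metrics import accuracy_score, adjusted_rand_score

class GCNLayer(nn.Module):
    """One graph convolution, ReLU(A_norm @ X @ W). The same instance is
    applied to every view and to the consensus graph, realizing the
    parameter sharing of Eqs (4)-(5)."""
    def __init__(self, d_in: int, d_out: int):
        super().__init__()
        self.lin = nn.Linear(d_in, d_out)

    def forward(self, x, a_norm):
        return torch.relu(a_norm @ self.lin(x))

def score_view(z_v, pseudo_labels, y_train, train_idx, mlp, n_clusters):
    """View score s_v of Eq. (6): clustering agreement with the consensus
    pseudo-labels plus training accuracy of a linear classifier. The adjusted
    Rand index replaces raw accuracy for the clustering term to sidestep
    cluster label permutation issues."""
    clusters = KMeans(n_clusters=n_clusters, n_init=10).fit_predict(
        z_v.detach().cpu().numpy())
    s_unsup = adjusted_rand_score(pseudo_labels, clusters)
    with torch.no_grad():
        pred = mlp(z_v[train_idx]).argmax(dim=1).cpu().numpy()
    s_sup = accuracy_score(y_train, pred)
    return s_unsup + s_sup
```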

[Algorithm 1. Pseudocode for single-omics data representation learning with the multi-label guided multi-view GNN; rendered as an image in the original.]

Multi-omics data fusion

Using the method proposed in the previous section, we can obtain feature representations for single-omics data. Next, we need to fuse different types of omics data to obtain a unified feature representation. We propose a multi-scale attention fusion network that focuses on both the global attention information among different omics data and the local attention information among different samples, thereby effectively fusing multiple types of omics data.

Global attention fusion network

Before fusing multi-omics data, it is crucial to address the fact that different types of omics data contribute differently to the final classification outcome [24]. The current feature representations only contain the correlation information between different patients within each type of omics data, so direct fusion may harm the final classification accuracy; it is therefore necessary to learn the contribution of each omics type and adjust the features before fusion. Inspired by Hu et al. [25] in their research on channel attention in the computer vision field, we propose a Global Attention Fusion Network (GAFN) for multi-omics data. We treat different omics data as different channels and adaptively learn the importance of each channel using an attention mechanism. Based on this importance, we assign different weight values to different omics data, thereby enhancing the representation capability of the subsequent fusion network. First, we perform a global pooling operation to compress each feature channel into a single point $g_m$, which summarizes the information of the $m$th omics feature:

$$ g_m = F_{\mathrm{pool}}(Z_m) = \frac{1}{n\,d'} \sum_{i=1}^{n} \sum_{j=1}^{d'} Z_m(i, j) \qquad (7) $$

where $Z_m \in \mathbb{R}^{n \times d'}$ is the $m$th omics representation and $d'$ represents the dimensionality of the features after compression. Collecting all channels yields the global feature vector $\mathbf{g} = [g_1, \dots, g_M]$ describing each feature channel.

Then, we use two bottleneck fully connected layers to learn from the global feature vector, allowing it to comprehensively capture channel attention information. The final omics channel attention vector $\mathbf{a}$ is calculated as follows:

$$ \mathbf{a} = \sigma\big(W_2\,\delta(W_1 \mathbf{g})\big) \qquad (8) $$

where $W_1$ and $W_2$ are learnable weights, $\delta$ is the Rectified Linear Unit (ReLU) activation function, and $\sigma$ is the sigmoid activation function.

By this point, we have learned the attention weight of each channel. However, if we simply weighted the initial omics features by the obtained channel attention vector $\mathbf{a}$, the resulting features would inevitably overemphasize the omics with richer information while neglecting the roles of the other omics. Therefore, we introduce a residual mechanism [26], which accounts for the contributions of different omics while preserving their inherent specific information. The representation $\tilde{Z}_m$ of the $m$th omics after the channel attention residual network is calculated as follows:

$$ \tilde{Z}_m = \delta(a_m \cdot Z_m + Z_m) \qquad (9) $$

where $a_m \cdot Z_m$ denotes channel-wise scaling and $\delta$ is the ReLU activation function.
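Structurally, the GAFN is a squeeze-and-excitation-style block over omics channels. The sketch below assumes a bottleneck reduction ratio $r = 2$ and scalar pooling per omics channel, both illustrative choices.

```python
import torch
import torch.nn as nn

class GlobalAttentionFusion(nn.Module):
    """SE-style channel attention over M omics 'channels' (Eqs 7-9). The
    bottleneck reduction ratio r is an illustrative choice."""
    def __init__(self, n_omics: int, r: int = 2):
        super().__init__()
        hidden = max(n_omics // r, 1)
        self.fc = nn.Sequential(
            nn.Linear(n_omics, hidden), nn.ReLU(),
            nn.Linear(hidden, n_omics), nn.Sigmoid(),
        )

    def forward(self, z_list):
        # Squeeze: global average pooling of each omics matrix to one scalar (Eq. 7).
        g = torch.stack([z.mean() for z in z_list])
        # Two bottleneck FC layers yield the channel attention vector a (Eq. 8).
        a = self.fc(g)
        # Excite with a residual connection so no omics is suppressed outright (Eq. 9).
        return [torch.relu(a[m] * z + z) for m, z in enumerate(z_list)]
```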

Local attention fusion network

The GAFN learns attention weights from a global perspective over all patients and all omics data, helping us assign different importance to different omics features. However, we have not yet performed representation learning from the perspective of all omics data for each individual patient, which may cause the final fused features to lose important common and complementary information between different omics. Inspired by Vaswani et al. [27] in their research on self-attention and multi-head attention in the natural language processing field, we propose a Local Attention Fusion Network (LAFN) for multi-omics data. For patient $i$, we form the omics feature matrix:

$$ P_i = \big[\tilde{z}_i^{1}; \tilde{z}_i^{2}; \dots; \tilde{z}_i^{M}\big] \in \mathbb{R}^{M \times d'} \qquad (10) $$

where $\tilde{z}_i^{m}$ is the $m$th omics feature of patient $i$ (the $i$th row of $\tilde{Z}_m$) and $d'$ represents the dimensionality of the omics features.

Then, we compute the query, key, and value projections as follows:

$$ Q_i = P_i W^{Q} \qquad (11) $$
$$ K_i = P_i W^{K} \qquad (12) $$
$$ V_i = P_i W^{V} \qquad (13) $$

where $W^{Q}$, $W^{K}$, and $W^{V}$ are learnable weights.

After that, we use $Q_i$ and $K_i$ combined with the softmax function to calculate the similarity weights $\alpha_i$ for patient $i$ as follows:

$$ \alpha_i = \operatorname{softmax}\!\left(\frac{Q_i K_i^{\top}}{\sqrt{d_k}}\right) \qquad (14) $$

where $\sqrt{d_k}$ is a scaling factor that prevents the computed similarity weights from becoming too large.

We can then calculate the feature matrix $O_i$ that describes each omics for patient $i$ as follows:

$$ O_i = \alpha_i V_i \qquad (15) $$

Following the concept of multi-head attention, we set up multiple self-attention blocks, with each block acting as an attention head. Through the mutual attention of different heads, we identify various associations between omics from different angles, enhancing the representation learning capability for each omics. We concatenate the $h$ attention heads, obtaining the new feature matrix $O_i'$ as follows:

$$ O_i' = O_i^{1} \oplus O_i^{2} \oplus \dots \oplus O_i^{h} \qquad (16) $$

where $\oplus$ denotes concatenation.

We then input this matrix into a feedforward neural network (NN) composed of linear layers, which yields the final representation $r_i$ of patient $i$, integrating the features of the various omics data:

$$ r_i = \operatorname{Flatten}\big(O_i' W^{O}\big) \qquad (17) $$

where $W^{O}$ is learnable, and the flatten operation transforms the matrix into a 1D vector to obtain the final representation $r_i$ of patient $i$. The features of all patients form the final feature representation matrix $R$, which is used for downstream tasks.
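Because Eqs (10)-(17) follow the standard multi-head self-attention pattern with omics types as tokens, the module can be sketched with PyTorch's built-in attention layer; this is a minimal stand-in, assuming the embedding dimension $d$ is divisible by the number of heads.

```python
import torch
import torch.nn as nn

class LocalAttentionFusion(nn.Module):
    """Multi-head self-attention across the M omics tokens of each patient
    (Eqs 10-17), followed by a linear feedforward layer and flattening."""
    def __init__(self, d: int, n_heads: int = 4):
        super().__init__()
        self.attn = nn.MultiheadAttention(embed_dim=d, num_heads=n_heads,
                                          batch_first=True)
        self.ffn = nn.Linear(d, d)

    def forward(self, p):
        # p: (n_patients, M, d), one row of each weighted omics matrix per patient.
        o, _ = self.attn(p, p, p)      # Q = K = V = P_i, Eqs (11)-(15)
        o = self.ffn(o)                # feedforward over each omics token, Eq. (17)
        return o.flatten(start_dim=1)  # (n_patients, M * d) fused representation
```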

Model optimization

Dimensionality reduction autoencoder

During training, we optimize two objectives: (i) a reconstruction loss, which measures the difference between the original features $X$ and the reconstructed features $\hat{X}$ using mean squared error (MSE); and (ii) an auxiliary classification loss, which makes the extracted low-dimensional representations $H$ more suitable for the actual classification task using cross-entropy loss on the training set. The combined loss function is:

$$ \mathcal{L}_{\mathrm{AE}} = \big\|X - \hat{X}\big\|_F^2 - \lambda \frac{1}{N} \sum_{i=1}^{N} \sum_{c=1}^{C} y_{ic} \log \hat{y}_{ic} \qquad (18) $$

where $\|\cdot\|_F$ denotes the Frobenius norm (element-wise MSE), $y_{ic}$ is the ground truth label of sample $i$ on class $c$, $\hat{y}_{ic}$ is the predicted probability of sample $i$ on class $c$, $N$ is the number of training samples ($N \le n$), $C$ is the number of classes, and $\lambda$ is a hyperparameter that balances the feature reconstruction and classification tasks.
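A minimal sketch of Eq. (18) follows; $\lambda = 0.1$ is a placeholder value, not the published setting.

```python
import torch.nn.functional as F

def ae_loss(x, x_hat, logits_train, y_train, lam: float = 0.1):
    """Combined AE objective of Eq. (18): reconstruction MSE plus a
    lambda-weighted cross-entropy on the labeled training samples."""
    recon = F.mse_loss(x_hat, x)                  # ||X - X_hat||_F^2, averaged
    clf = F.cross_entropy(logits_train, y_train)  # supervised auxiliary term
    return recon + lam * clf
```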

Multi-view graph neural network

To effectively represent different omics data, we aim to learn the intrinsic distribution information of each data type. Following existing multi-view clustering research [28], we use a KL divergence loss to optimize the model, defined as follows:

$$ \mathcal{L}_{\mathrm{KL}} = \mathrm{KL}\big(P \,\|\, Q_c\big) + \sum_{v=1}^{V} \mathrm{KL}\big(P \,\|\, Q_v\big) \qquad (19) $$

where $\mathrm{KL}(\cdot\|\cdot)$ denotes the Kullback–Leibler divergence, used to measure the distance between two distributions. The view distributions $Q_v$ and the consensus distribution $Q_c$ are computed via Student's t-distribution:

$$ q_{ij} = \frac{\big(1 + \|z_i - \mu_j\|^2\big)^{-1}}{\sum_{j'} \big(1 + \|z_i - \mu_{j'}\|^2\big)^{-1}} \qquad (20) $$

where $\mu_j$ are cluster centroids initialized by k-means on $Z_c$. The target distribution $P$ is derived from $Q_c$ as follows:

$$ p_{ij} = \frac{q_{ij}^2 \,/\, \sum_i q_{ij}}{\sum_{j'} \big(q_{ij'}^2 \,/\, \sum_i q_{ij'}\big)} \qquad (21) $$

Additionally, we use the labeled data from the training set to provide auxiliary constraints to the multi-view GNN, with the specific loss function as follows:

$$ \mathcal{L}_{\mathrm{CE}} = -\frac{1}{N} \sum_{i=1}^{N} \sum_{c=1}^{C} y_{ic} \log f(z_i)_c \qquad (22) $$

where $y_{ic}$ is the ground truth label for sample $i$ on class $c$, $f(z_i)_c$ is the predicted probability from the classifier $f$, $z_i$ represents the $i$th row of the feature matrix $Z$, and $N$ is the number of training samples. The complete loss function of the multi-view GNN is as follows:

$$ \mathcal{L}_{\mathrm{GNN}} = \mathcal{L}_{\mathrm{KL}} + \mathcal{L}_{\mathrm{CE}} \qquad (23) $$
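The clustering objective of Eqs (19)-(21) follows the familiar DEC-style soft-assignment recipe; the sketch below assumes a fixed (detached) target distribution $P$ recomputed from the consensus assignments.

```python
import torch

def soft_assign(z, centroids):
    """Student's t soft assignment Q of Eq. (20); the centroids are
    initialized by k-means on the consensus embedding Z_c."""
    dist2 = torch.cdist(z, centroids).pow(2)
    q = (1.0 + dist2).pow(-1.0)
    return q / q.sum(dim=1, keepdim=True)

def target_distribution(q):
    """Sharpened target distribution P derived from Q_c, Eq. (21)."""
    weight = q.pow(2) / q.sum(dim=0, keepdim=True)
    return weight / weight.sum(dim=1, keepdim=True)

def kl_clustering_loss(q_views, q_c):
    """Eq. (19): KL(P || Q_c) + sum_v KL(P || Q_v), with P held fixed."""
    p = target_distribution(q_c).detach()
    def kl(q):
        return (p * (p.clamp_min(1e-8).log() - q.clamp_min(1e-8).log())).sum(1).mean()
    return kl(q_c) + sum(kl(q_v) for q_v in q_views)
```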

Multi-omics data fusion network

We employ a classifier to predict class labels from the final feature representation $R$ obtained through the multi-scale attention fusion network, and we update the parameters of the fusion network by minimizing the following loss:

$$ \mathcal{L}_{\mathrm{fusion}} = -\frac{1}{N} \sum_{i=1}^{N} \sum_{c=1}^{C} y_{ic} \log g(r_i)_c \qquad (24) $$

where $y_{ic}$ is the ground truth label for sample $i$ on class $c$, $g(r_i)_c$ is the predicted probability from the classifier $g$, $r_i$ represents the $i$th row of the fused feature matrix $R$, and $N$ is the number of training samples.

Results

Evaluation datasets and baselines

In this study, we validated the mmMOI method using four cancer datasets: GBM (glioblastoma multiforme subtypes), BRCA (breast invasive carcinoma subtypes), OV (ovarian serous cystadenocarcinoma subtypes), and KIPAN (kidney cancer classification). For each cancer type, we collected three types of omics data (mRNA expression, miRNA expression, and DNA methylation) together with classification labels. The omics data for the GBM, BRCA, and OV cancers were sourced from benchmark datasets compiled by Rappoport et al. using TCGA data [29]. Class labels for the GBM, BRCA, and OV cancers and the KIPAN dataset were obtained from the UCSC cancer database [30]. The details of the four datasets are shown in Table 1.

Table 1.

Summary of datasets in our study

| Dataset | Categories | Number of features (mRNA, methy, miRNA) |
|---|---|---|
| GBM | Proneural: 72, Classical: 71, Mesenchymal: 84, Neural: 47 | 12 042, 5000, 534 |
| BRCA | Basal-like: 92, HER2-enriched: 37, Luminal A: 278, Luminal B: 110 | 15 551, 5000, 390 |
| OV | Mesenchymal: 68, Proliferative: 76, Differentiated: 66, Immunoreactive: 81 | 15 789, 5000, 349 |
| KIPAN | KICH: 65, KIRC: 201, KIRP: 294 | 15 148, 5000, 533 |

To evaluate the performance of our method on the cancer classification task, we compared it with several traditional machine learning methods and the latest deep learning methods. The comparison methods include: NN [31], RF [32], SVM [33], XGBoost [34], MOGONET [18], MoGCN [23], MOGLAM [24], CustOmics [17], AttentionMOI [22], GREMI [20], and MCRGCN [19].

Comparative experiment

As shown in Tables 2–5, we compared the proposed mmMOI model with several baseline methods across four datasets. The results indicate that the mmMOI model outperforms the best baseline in all classification tasks, with notable improvements in Accuracy, F1-macro, Precision, and Recall. To further quantify these performance gains, we conducted paired t-tests to compute the statistical significance (P-values) of the improvements across the different metrics. The experimental findings demonstrate that, on all pertinent datasets, the majority of performance improvements achieved by the proposed method relative to the baselines reached statistical significance (P < .05).
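For reference, the significance test reduces to a paired t-test over matched cross-validation runs; the sketch below uses hypothetical per-fold accuracies, not the actual experimental values.

```python
from scipy import stats

# Hypothetical per-fold accuracies for mmMOI and the strongest baseline on one
# dataset; the real values come from the repeated cross-validation runs.
mmmoi    = [0.84, 0.81, 0.86, 0.79, 0.85]
baseline = [0.81, 0.78, 0.82, 0.80, 0.82]

t_stat, p_value = stats.ttest_rel(mmmoi, baseline)  # paired t-test over folds
print(f"t = {t_stat:.3f}, P = {p_value:.4f}")       # significant if P < .05
```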

Table 2.

Classification performance of all methods on the GBM dataset

| Method | ACC | F1-macro | Precision | Recall |
|---|---|---|---|---|
| NN | 0.7520±0.036 | 0.7324±0.025 | 0.7795±0.037 | 0.7383±0.022 |
| RF | 0.6873±0.074 | 0.6494±0.084 | 0.6838±0.082 | 0.6563±0.078 |
| SVM | 0.7636±0.016 | 0.7291±0.028 | 0.8048±0.016 | 0.7307±0.023 |
| XGBoost | 0.6546±0.098 | 0.6296±0.099 | 0.6579±0.087 | 0.6301±0.104 |
| MOGONET | 0.6742±0.061 | 0.6428±0.072 | 0.6616±0.087 | 0.6491±0.064 |
| MoGCN | 0.7512±0.015 | 0.7233±0.023 | 0.7271±0.018 | 0.7454±0.033 |
| MOGLAM | 0.6868±0.065 | 0.6370±0.072 | 0.6610±0.070 | 0.6534±0.072 |
| CustOmics | 0.7902±0.056 | 0.7697±0.066 | 0.8005±0.069 | 0.7750±0.066 |
| AttentionMOI | 0.7925±0.044 | 0.7818±0.044 | 0.8033±0.050 | 0.7862±0.044 |
| GREMI | 0.7866±0.036 | 0.7736±0.043 | 0.8159±0.048 | 0.7767±0.045 |
| MCRGCN | 0.8086±0.050 | 0.7926±0.052 | 0.8020±0.052 | 0.8030±0.052 |
| mmMOI | 0.8293±0.053 | 0.8160±0.056 | 0.8400±0.053 | 0.8195±0.059 |
| Gain for best | 2.07% (0.0338) | 2.34% (0.0206) | 2.41% (0.0134) | 1.65% (0.0245) |

Table 3.

Classification performance of all methods on the BRCA dataset

| Method | ACC | F1-macro | Precision | Recall |
|---|---|---|---|---|
| NN | 0.8472±0.039 | 0.8091±0.036 | 0.8513±0.059 | 0.7918±0.02 |
| RF | 0.8410±0.030 | 0.7217±0.069 | 0.8223±0.116 | 0.7011±0.051 |
| SVM | 0.8282±0.021 | 0.7597±0.052 | 0.8789±0.044 | 0.7236±0.047 |
| XGBoost | 0.8564±0.027 | 0.7778±0.080 | 0.8347±0.109 | 0.7512±0.064 |
| MOGONET | 0.8446±0.027 | 0.7854±0.062 | 0.8291±0.050 | 0.7777±0.063 |
| MoGCN | 0.8041±0.002 | 0.6866±0.005 | 0.6847±0.003 | 0.7331±0.017 |
| MOGLAM | 0.8590±0.030 | 0.8334±0.036 | 0.8544±0.045 | 0.8433±0.034 |
| CustOmics | 0.8713±0.008 | 0.8478±0.017 | 0.8895±0.019 | 0.8306±0.026 |
| AttentionMOI | 0.8525±0.027 | 0.8076±0.049 | 0.8229±0.043 | 0.8084±0.055 |
| GREMI | 0.8692±0.035 | 0.8036±0.042 | 0.8732±0.064 | 0.7854±0.040 |
| MCRGCN | 0.8882±0.024 | 0.8540±0.021 | 0.8860±0.014 | 0.8454±0.039 |
| mmMOI | 0.9010±0.018 | 0.8747±0.028 | 0.8947±0.028 | 0.8667±0.034 |
| Gain for best | 1.28% (0.0314) | 2.07% (0.0289) | 0.87% (0.0329) | 2.12% (0.0469) |

Among the compared methods, RF, SVM, and XGBoost are classical machine learning methods, while the others are based on deep learning. Interestingly, machine learning methods performed comparably to deep learning methods on some datasets. For instance, SVM achieved the second-best Precision score on the OV dataset, indicating that machine learning methods can still be viable for specific tasks. However, it should be noted that RF and XGBoost performed poorly on the GBM and OV datasets, highlighting the instability of machine learning methods. Among the deep learning methods, a simple three-layer fully connected NN struggled to learn effective feature representations from the complex multi-omics data, resulting in poor performance across all datasets. The CustOmics method, although not outstanding among the compared methods, consistently ranked among the top performers on each dataset, demonstrating good robustness. CustOmics' two-stage training, where each omics data type is input into an AE to learn intermediate representations, further validates the efficacy of mmMOI's AE-based pretraining strategy.

The performance of MOGONET, MoGCN, MOGLAM, and AttentionMOI varied significantly across different datasets. MOGONET and MOGLAM performed well on the easily distinguishable KIPAN and BRCA datasets but poorly on the more challenging GBM and OV datasets. Conversely, AttentionMOI excelled in the GBM and OV datasets but underperformed in the BRCA and KIPAN datasets. This suggests that these methods are highly sensitive to data quality. Among the latest methods, GREMI achieved the second-best performance on the KIPAN dataset, while MCRGCN was second-best on the other three datasets. Both methods employ GNN structures, confirming the suitability of GNNs for cancer classification tasks. Notably, no baseline method achieved outstanding results across all datasets, underscoring the superior stability of our mmMOI deep learning method.

Table 4.

Classification performance of all methods on the OV dataset

| Method | ACC | F1-macro | Precision | Recall |
|---|---|---|---|---|
| NN | 0.7391±0.063 | 0.7298±0.062 | 0.8026±0.033 | 0.7326±0.066 |
| RF | 0.7182±0.031 | 0.7064±0.036 | 0.7230±0.035 | 0.7115±0.029 |
| SVM | 0.8091±0.055 | 0.8027±0.062 | 0.8309±0.056 | 0.8023±0.060 |
| XGBoost | 0.6636±0.055 | 0.6501±0.062 | 0.6691±0.049 | 0.6545±0.060 |
| MOGONET | 0.6891±0.019 | 0.6785±0.019 | 0.7169±0.024 | 0.6821±0.019 |
| MoGCN | 0.7364±0.009 | 0.7290±0.009 | 0.7308±0.008 | 0.7454±0.011 |
| MOGLAM | 0.7546±0.042 | 0.7469±0.047 | 0.7831±0.047 | 0.7508±0.041 |
| CustOmics | 0.8118±0.034 | 0.8059±0.039 | 0.8271±0.035 | 0.8079±0.037 |
| AttentionMOI | 0.8175±0.033 | 0.8093±0.038 | 0.8176±0.038 | 0.8215±0.036 |
| GREMI | 0.8045±0.105 | 0.7981±0.106 | 0.8200±0.097 | 0.7993±0.103 |
| MCRGCN | 0.8364±0.011 | 0.8239±0.013 | 0.8272±0.015 | 0.8272±0.023 |
| mmMOI | 0.8518±0.041 | 0.8497±0.040 | 0.8722±0.023 | 0.8488±0.042 |
| Gain for best | 1.54% (0.0610) | 2.58% (0.0387) | 2.58% (0.0387) | 2.16% (0.0287) |

Table 5.

Classification performance of all methods on the KIPAN dataset

| Method | ACC | F1-macro | Precision | Recall |
|---|---|---|---|---|
| NN | 0.9314±0.048 | 0.9322±0.044 | 0.9451±0.045 | 0.9248±0.042 |
| RF | 0.9452±0.025 | 0.9454±0.022 | 0.9565±0.023 | 0.9389±0.026 |
| SVM | 0.9524±0.013 | 0.9509±0.016 | 0.9620±0.021 | 0.9420±0.015 |
| XGBoost | 0.9262±0.014 | 0.9298±0.011 | 0.9413±0.119 | 0.9218±0.016 |
| MOGONET | 0.9562±0.020 | 0.9598±0.017 | 0.9648±0.023 | 0.9566±0.014 |
| MoGCN | 0.9305±0.009 | 0.9387±0.007 | 0.9410±0.007 | 0.9394±0.007 |
| MOGLAM | 0.9505±0.018 | 0.9544±0.013 | 0.9542±0.013 | 0.9569±0.017 |
| CustOmics | 0.9562±0.016 | 0.9554±0.015 | 0.9547±0.016 | 0.9582±0.020 |
| AttentionMOI | 0.9303±0.011 | 0.9281±0.009 | 0.9254±0.009 | 0.9442±0.002 |
| GREMI | 0.9619±0.019 | 0.9612±0.022 | 0.9661±0.028 | 0.9591±0.019 |
| MCRGCN | 0.9571±0.014 | 0.9419±0.018 | 0.9398±0.023 | 0.9453±0.014 |
| mmMOI | 0.9724±0.016 | 0.9721±0.017 | 0.9728±0.016 | 0.9729±0.022 |
| Gain for best | 1.05% (0.0587) | 1.09% (0.0278) | 0.67% (0.0192) | 1.38% (0.0480) |

Ablation experiments

To evaluate the effectiveness of each module and the integration of various data types in our experiment, we conducted two parts of ablation studies. The first part focuses on module-based ablation, aiming to verify whether the design of each module in our model contributes to improving the overall experimental performance. The second part centers on ablation based on different omics data types, aiming to confirm whether the fusion of multiple omics data types enhances the performance of cancer classification tasks.

Ablation study of different modules

Tables 6–9 report the results of module-based ablation experiments on the four datasets. In the w/o ML-GNN setup, the multi-view GNN guided by multi-label learning is removed and a simple AE is used for representation learning, to verify the effect of this design on single-omics representation learning. The w/o GA-FN and w/o LA-FN setups remove the global attention fusion module and the local attention fusion module, respectively, to verify the multi-omics fusion effect of attention at different scales. In each experiment, we ablated one module while keeping the others intact. The tables show that the best classification performance is obtained when all modules are used. Among the individual modules, removing the single-omics representation learning module has the greatest impact on classification performance on most datasets, indicating that a simple AE cannot capture the association information between samples, whereas the multi-view GNN guided by multi-label learning extracts this information effectively. At the same time, removing either attention fusion module degrades model performance, indicating that the multi-scale attention fusion modules better extract the specific and complementary information between multi-omics data.

Table 6.

Results of the ablation study on the GBM dataset

| Setting | ACC | F1-macro | Precision | Recall |
|---|---|---|---|---|
| w/o ML-GNN | 0.7941±0.058 | 0.7655±0.070 | 0.7986±0.074 | 0.7747±0.066 |
| w/o GA-FN | 0.8029±0.062 | 0.7892±0.064 | 0.8067±0.067 | 0.7949±0.066 |
| w/o LA-FN | 0.7766±0.051 | 0.7381±0.065 | 0.7836±0.047 | 0.7459±0.058 |
| All modules | 0.8293±0.053 | 0.8160±0.056 | 0.8400±0.053 | 0.8195±0.059 |

Table 7.

Results of the ablation study on the BRCA dataset

| Setting | ACC | F1-macro | Precision | Recall |
|---|---|---|---|---|
| w/o ML-GNN | 0.8733±0.040 | 0.8344±0.050 | 0.8692±0.067 | 0.8217±0.040 |
| w/o GA-FN | 0.8938±0.019 | 0.8665±0.032 | 0.8872±0.032 | 0.8557±0.035 |
| w/o LA-FN | 0.8815±0.024 | 0.8342±0.037 | 0.8920±0.040 | 0.8148±0.038 |
| All modules | 0.9010±0.018 | 0.8747±0.028 | 0.8947±0.028 | 0.8667±0.034 |

Table 8.

Results of the ablation study on the OV dataset

| Setting | ACC | F1-macro | Precision | Recall |
|---|---|---|---|---|
| w/o ML-GNN | 0.8173±0.038 | 0.8133±0.038 | 0.8289±0.034 | 0.8139±0.040 |
| w/o GA-FN | 0.8455±0.041 | 0.8421±0.040 | 0.8650±0.027 | 0.8419±0.042 |
| w/o LA-FN | 0.8273±0.045 | 0.8225±0.048 | 0.8470±0.032 | 0.8223±0.049 |
| All modules | 0.8518±0.041 | 0.8497±0.040 | 0.8722±0.023 | 0.8488±0.042 |

Table 9.

Results of the ablation study on the KIPAN dataset

| Setting | ACC | F1-macro | Precision | Recall |
|---|---|---|---|---|
| w/o ML-GNN | 0.9619±0.014 | 0.9603±0.015 | 0.9647±0.015 | 0.9582±0.023 |
| w/o GA-FN | 0.9695±0.012 | 0.9704±0.014 | 0.9716±0.013 | 0.9704±0.019 |
| w/o LA-FN | 0.9624±0.015 | 0.9630±0.016 | 0.9686±0.018 | 0.9597±0.019 |
| All modules | 0.9724±0.016 | 0.9721±0.017 | 0.9728±0.016 | 0.9729±0.022 |

Ablation study of different omics data

mmMOI integrates three types of omics data for cancer classification. To explore the contribution of each omics type to the final classification and to verify the necessity of multi-omics fusion, we conducted ablation experiments on the omics data types. We tested the classification performance of mmMOI using single-omics data (mRNA, methy, and miRNA) and pairwise combinations (mRNA+methy, mRNA+miRNA, and methy+miRNA). The experimental results on the GBM, BRCA, OV, and KIPAN datasets are shown in Fig. 2. In all classification tasks, models trained with all three omics types outperformed those trained with two types or only one type, showcasing our method's ability to fuse multi-omics data by integrating various types of omics information. Therefore, using all three types of omics data for fusion is necessary. Additionally, mRNA expression data consistently outperformed DNA methylation and miRNA expression data across the three cancer subtype classification datasets. Notably, on the BRCA and OV datasets, models using only mRNA expression data surpassed models using two omics types and came close to models using all three. In the KIPAN dataset, by contrast, miRNA expression data gave better results. Hence, we conclude that mRNA expression contributes most to distinguishing cancer subtypes in the three subtype classification tasks, whereas DNA methylation and miRNA expression data contain more noise; in the kidney cancer classification task, miRNA expression data play the more important role, and the other two omics types contain more noise. Nevertheless, effective information can still be extracted after feature fusion on all datasets, indicating that our model captures important information from omics data while minimizing the impact of redundant and noisy information.

Figure 2.


Comparison of classification results across different omics data via the mmMOI model.

t-SNE visualization

To more intuitively evaluate the representation learning ability of our model, we used t-distributed Stochastic Neighbor Embedding (t-SNE) [35] to perform dimensionality reduction and visualization analysis on the GBM, BRCA, OV, and KIPAN datasets. Note that the omics representation features used for this visualization are the final embeddings learned by the model, i.e. those used for the final classification. To visually demonstrate the performance of our model in multi-omics fusion representation learning, we compared the t-SNE visualizations of our method with those of three other models with strong classification performance: MOGLAM, AttentionMOI, and MCRGCN. Figures 3–6 show the visualization results of each method on the four datasets. The embeddings learned by our method produce good clustering on all four datasets. Moreover, compared with MOGLAM, AttentionMOI, and MCRGCN, the embeddings learned by our method form clusters with smaller intra-class dispersion and larger inter-class separation after dimensionality reduction. This indicates that the strategies adopted by our model for multi-omics fusion representation are successful.
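The visualization step itself is standard; the sketch below uses scikit-learn's t-SNE on random placeholder data standing in for the learned fused embeddings and subtype labels.

```python
import numpy as np
import matplotlib.pyplot as plt
from sklearn.manifold import TSNE

# Placeholder stand-ins for the learned fused embeddings and subtype labels.
rng = np.random.default_rng(0)
embeddings = rng.normal(size=(200, 64))
labels = rng.integers(0, 4, size=200)

coords = TSNE(n_components=2, perplexity=30, random_state=0).fit_transform(embeddings)
plt.scatter(coords[:, 0], coords[:, 1], c=labels, cmap="tab10", s=12)
plt.title("t-SNE of fused multi-omics embeddings")
plt.show()
```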

Figure 3.


Visualization of embedding representations of four methods on the GBM dataset using t-SNE: (a) mmMOI, (b) MOGLAM, (c) AttentionMOI, (d) MCRGCN.

Figure 6.


Visualization of embedding representations of four methods on the KIPAN dataset using t-SNE: (a) mmMOI, (b) MOGLAM, (c) AttentionMOI, (d) MCRGCN.

Figure 4.


Visualization of embedding representations of four methods on the BRCA dataset using t-SNE: (a) mmMOI, (b) MOGLAM, (c) AttentionMOI, (d) MCRGCN.

Figure 5.


Visualization of embedding representations of four methods on the OV dataset using t-SNE: (a) mmMOI, (b) MOGLAM, (c) AttentionMOI, (d) MCRGCN.

Prognostic analysis of selected biomarkers

To further evaluate the clinical utility of the biomarkers identified by our model, we leveraged the International Cancer Genome Consortium (ICGC) dataset as an independent clinical cohort to systematically validate their prognostic significance. Figure 7 illustrates representative prognostic biomarkers across different cancer types. Among them, podoplanin (PDPN) was found to be positively correlated with tumor malignancy and is mainly expressed in the mesenchymal subtype, which carries the worst GBM prognosis [36]. Maguire et al. found that ACSBG1 is significantly upregulated in breast cancer cells, thereby promoting the progression of obesity-related breast cancer; this effect is mediated through the regulation of long-chain fatty acid metabolism, which enhances the energy reserves of cancer cells and supports tumor growth [37]. Elsharkawi et al. demonstrated that, in ovarian cancer tissues, the methylation levels at multiple CpG sites within the PCDH17 promoter were significantly higher than those in normal or benign lesions, suggesting that silencing of PCDH17 through promoter methylation may be involved in the initiation and progression of ovarian cancer [38]. Zhao et al. demonstrated that SHC1 regulates polymerase I and transcript release factor (PTRF) expression through the epidermal growth factor receptor (EGFR) signaling pathway and can be detected in the exosomes present in the urine of patients with clear cell renal cell carcinoma. Analysis of urinary exosomes indicated that aberrant expression of PTRF/CAVIN1 is closely related to the development of clear cell renal cell carcinoma, suggesting that it may not only function intracellularly but also serve as a noninvasive diagnostic biomarker for early screening of the disease [39]. Refer to the supplementary data for additional prognostic biomarker analysis results.

Figure 7.


The Kaplan–Meier survival curve of representative biomarkers in different datasets.
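Survival analyses of this kind are commonly produced with the lifelines package; the following sketch uses synthetic survival times in place of the ICGC cohort data to illustrate the Kaplan–Meier comparison between high- and low-expression groups.

```python
import numpy as np
import matplotlib.pyplot as plt
from lifelines import KaplanMeierFitter
from lifelines.statistics import logrank_test

# Synthetic survival data standing in for the ICGC cohort: patients split by
# median expression of a candidate biomarker.
rng = np.random.default_rng(0)
time_hi, time_lo = rng.exponential(20, 80), rng.exponential(35, 80)
event_hi, event_lo = rng.integers(0, 2, 80), rng.integers(0, 2, 80)

kmf = KaplanMeierFitter()
ax = kmf.fit(time_hi, event_hi, label="high expression").plot_survival_function()
kmf.fit(time_lo, event_lo, label="low expression").plot_survival_function(ax=ax)
res = logrank_test(time_hi, time_lo, event_observed_A=event_hi,
                   event_observed_B=event_lo)
ax.set_title(f"Kaplan-Meier curves, log-rank P = {res.p_value:.3f}")
plt.show()
```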

Enrichment analysis of selected biomarkers

To systematically evaluate the potential roles of the biomarkers automatically identified by our model in tumor initiation and progression, we performed Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway and Gene Ontology (GO) enrichment analyses on the sets of differentially expressed genes output by the model for each dataset. Figure 8 summarizes the top 10 enriched KEGG pathways across the four tumor types (GBM, BRCA, OV, and KIPAN). The three most significant pathways for each dataset are outlined below.

Figure 8.


The pathway enrichment of selected biomarkers in different datasets.

In the GBM dataset, the top three enriched pathways were Neuroactive ligand–receptor interaction, Neuroactive ligand signaling, and the cAMP signaling pathway. Each of these pathways is closely associated with extracellular signal recognition by neuronal cells and downstream second-messenger regulation, suggesting pivotal roles in glioma cell proliferation, migration, and resistance to therapy.

For the BRCA dataset, differentially expressed genes were predominantly enriched in the PI3K–Akt signaling pathway, the MAPK signaling pathway, and the Ras signaling pathway. These canonical oncogenic cascades are well established as key drivers of breast cancer, mediating cell proliferation, inhibition of apoptosis, and control of the cell cycle, thereby supporting the model’s capacity to capture core tumorigenic mechanisms.

In the OV dataset, the most significantly enriched pathways were human papillomavirus (HPV) infection, neuroactive ligand–receptor interaction, and proteoglycans in cancer. The enrichment of proteoglycan-related processes highlights their established roles in stromal remodeling, cell adhesion, and migration within the ovarian tumor microenvironment. In contrast, the enrichment of the HPV infection pathway suggests a possible, yet underexplored, viral-associated mechanism in particular ovarian cancer subtypes.

Finally, in the KIPAN dataset, enriched pathways included the MAPK signaling pathway, Salmonella infection, and Chemical carcinogenesis–receptor activation. The central role of the MAPK cascade in mediating cellular responses to extracellular stress and growth factors is well documented. In contrast, the enrichment of infection-related pathways may reflect the influence of the immune or inflammatory microenvironment on renal tumor biology.

Overall, these KEGG enrichment results not only align with the established pathogenetic mechanisms of each cancer type but also substantiate the biological relevance and translational potential of the biomarkers identified by our model. The full results of the GO enrichment analysis are provided in the supplementary data.
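Enrichment analyses of this kind can be reproduced with, for example, the gseapy interface to Enrichr; the sketch below uses a hypothetical gene list, and the library names follow Enrichr's catalogue rather than the exact settings used in this study.

```python
import gseapy as gp

# Hypothetical biomarker list output by the model for one dataset.
genes = ["PDPN", "ACSBG1", "PCDH17", "SHC1", "EGFR"]
enr = gp.enrichr(gene_list=genes,
                 gene_sets=["KEGG_2021_Human", "GO_Biological_Process_2021"],
                 organism="human", outdir=None)
print(enr.results[["Term", "Adjusted P-value"]].head(10))  # top enriched terms
```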

Discussion and conclusion

In recent years, the rapid advancement of high-throughput sequencing technologies has made the integration of multi-omics data an essential tool for unraveling the complexities of diseases. However, existing methods still face several limitations, including a heavy reliance on feature selection, restricted applicability across different datasets, and the inability to effectively capture intricate interactions at both the sample and omics levels. To address these challenges, we propose mmMOI, a multi-omics fusion framework based on multi-label guided learning and multi-scale fusion, for cancer classification. mmMOI is an end-to-end supervised learning method comprising two main components: a single-omics representation learning module and a multi-omics data fusion module. The single-omics representation learning module employs a multi-view GNN based on multi-label guided learning; its multi-view adaptive aggregation design, guided by real labels, captures sample similarity relationships better than traditional methods. The multi-omics fusion module comprises a global attention module and a local attention module, which enable the fused features to better retain the complementary information between different omics and the internal relationships between different patients.

Comparative experiments on four cancer datasets demonstrate that mmMOI achieves superior classification accuracy and stability compared to state-of-the-art methods. Furthermore, the model exhibits high stability and adaptability across different biological contexts and sequencing technologies. Additional visual dimensionality reduction analyses further validate the strong discriminative power of mmMOI’s learned omics features across various data types, confirming its effectiveness in multi-omics integration. Ablation experiments reveal that each core component of mmMOI makes a positive contribution to the final classification outcomes. Most importantly, mmMOI successfully identifies key disease-associated biomarkers, reinforcing its biological interpretability and highlighting its potential applicability in disease diagnosis and treatment. In future research, we aim to expand the applicability of mmMOI to a broader range of cancer datasets and other multi-omics classification tasks.

Key Points

  • We present a novel framework that integrates multi-omics data by combining multi-label guided learning and multi-scale fusion techniques.

  • A multi-view graph neural network guided by multi-label learning is used to learn representations from each omics data type, enhancing feature extraction without overfitting to specific labels.

  • A multi-scale attention fusion network adaptively integrates the representations from different omics layers, allowing the model to capture complex interactions between biological data types.

  • The method has been tested on benchmark datasets and consistently outperforms other state-of-the-art methods.

  • Ablation experiments assess the contribution of each component and the impact of different fusion strategies.

Conflict of interest: None declared.

Supplementary Material

mmMOI_supplementary_bbaf493

Contributor Information

Yuze Li, Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, Qianjin Street 2699, 130012 Jilin, China.

Yinghe Wang, Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, Qianjin Street 2699, 130012 Jilin, China.

Tao Liang, Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, Qianjin Street 2699, 130012 Jilin, China.

Ying Li, Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, Qianjin Street 2699, 130012 Jilin, China.

Wei Du, Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, Qianjin Street 2699, 130012 Jilin, China.

Funding

The authors acknowledge financial support from the National Natural Science Foundation of China (grant no. 62372494) and the Natural Science Foundation of Jilin Province (grant no. 20240302086GX).

Data availability

The source code, datasets, and detailed hyperparameter configurations for mmMOI are available at https://github.com/mlcb-jlu/mmMOI.
