Skip to main content
Journal of Cancer Research and Clinical Oncology logoLink to Journal of Cancer Research and Clinical Oncology
. 2023 Jul 19;149(14):12881–12896. doi: 10.1007/s00432-023-05045-x

Establishment of a prognostic model based on m6A regulatory factors and stemness of hepatocellular carcinoma using RNA-seq data and scRNA-seq data

Yan Liang 1,#, Sen Chen 1,2,#, Jinghe Xie 3,#, Guanrong Yan 3, Tingting Guo 1,2, Tianyang Li 4, Shoupei Liu 1,2, Weiping Zeng 1,5, Shuai Zhang 4, Keqiang Ma 6, Honglin Chen 1, Yimeng Ou 7,, Bailin Wang 8,, Weili Gu 4,, Yuyou Duan 1,2,9,10,
PMCID: PMC10587019  PMID: 37466793

Abstract

Background

Hepatocellular carcinoma (HCC) with high incidence and mortality is one of the most common malignant cancers worldwide. Increasing evidence has reported that N6-methyladenosine (m6A) modification has been considered as a major contribution to the occurrence and development of tumors.

Method

In our study, we comprehensively analyzed the connection between m6A regulatory factors and cancer stem cells (CSCs) of HCC to establish a clinical tool for predicting its outcome. First, we concluded that the expression level of m6A regulatory factors was related with the stemness of hepatocellular carcinoma. Subsequently, we gained a ten hub regulatory factors that were associated with prognosis of hepatocellular carcinoma by overall survival (OS) analysis using ICGC and TCGA datasets, and these regulatory factors included YTHDF1, IGF2BP1, METTL3, IGF2BP3, HNRNPA2B1, IGF2BP2, RBM15B, HNRNPC, RBMX, and LRPPR. Next, we found that these ten hub m6A regulatory factors were highly expressed in CSCs, and CSCs related pathways were also enriched by the gene set variation analysis (GSVA). Then, correlation, consensus clustering and PCA analysis were performed to reveal potential therapeutic benefits of HCC. Moreover, univariate Cox regression (UNICOX), LASSON and multivariate Cox regression (MULTICOX) analyses were adopted to establish HCC prognosis prediction signature.

Results

Four regulatory factors RBM15B, LRPPRC, IGF2BP1, and IGF2BP3 were picked as valuable prognostic indicators.

Conclusion

In summary, these ten hub regulatory factors would be useful therapeutic targets for HCC treatment, and RBM15B/LRPPRC/IGF2BP1/IGF2BP3 prognostic indicators can be used to guide therapy for HCC patients.

Supplementary Information

The online version contains supplementary material available at 10.1007/s00432-023-05045-x.

Keywords: HCC, m6A modification, CSCs, Stemness, Prognostic model

Introduction

The major histological subtype of liver cancer is HCC which accounted for 80–90% of primary liver cancer and the third most common reason for tumor-related mortality worldwide (Pan et al. 2011; Yang et al. 2019). Both high recurrence and metastasis rates contribute to the poor prognosis of HCC (Siegel et al. 2021). In clinical, transplantation of liver is the only treatment for terminal cancer, 5-year survival was gained by about 60%, but due to the limited donor, surgical removal of the lesions and transcatheter hepatic artery embolization with conventional chemotherapy treatment is the main choice of clinical treatment (Alqahtani, et al. 2019). It is necessary to establish a scoring model for the prognosis of HCC patients to provide the theoretical basis for the treatment selection and prognostic evaluation of clinicians.m6A RNA modification, the most dominant RNA modification, has been shown to play a crucial role in regulating RNA metabolism (Yang et al. 2018). m6A regulatory factors consists of “writers” that generate the m6A markers, “erasers” that can remove the m6A modification that has occurred, and “readers” that identify the m6A modification and further promote the transcription of downstream genes. m6A modification affects RNA metabolism in many ways, such as processing, nuclear output, translation, and decay of RNA (Yang et al. 2019; Wang et al. 2021; Shi et al. 2017; Roignant and Soller 2017). m6A RNA methylation plays many roles in cancer through a variety of mechanisms and offers more possibilities for early diagnosis and treatment of cancer. In addition, m6A modification also plays a critical role in CSCs (Wang et al. 2021). m6A RNA modification has been certified to be involved in the production and maintenance of CSCs, control of tumor progression and treatment resistance (Yang et al. 2018; Roignant and Soller 2017; Corbett 2018). In recent years, genetic signatures of m6A regulatory factors have been constructed to predict the prognosis of a variety of tumors, including gastric cancer (Guan et al. 2020), adrenocortical carcinoma (Jin et al. 2021), and liver cancer (Li, et al. 2021a). However, it has not been clarified the exact association between m6A and CSCs of HCC.

Giving the RNA-seq data available online, we tried to analyze whether the expression patterns of m6A regulatory factors can predict prognosis in HCC patients. Tathiane M. Malta et al. used an innovative single-class logistic regression (OCLR) machine learning algorithm to generate mRNAsi index that reflects the gene expression characteristics of stem cells through multi-platform analysis of stem cell transcriptome, methylation and transcription factor binding sites (Malta et al. 2018). The stemness index based on mRNA expression (mRNAsi index) is a measure of how similar tumor cells to stem cells, the higher the mRNAsi index, the stronger the stemness of tissues is. For understanding the association between m6A regulatory factors expression and tissue stemness, we accorded to the median data of mRNAsi index and split HCC patients into two groups with high-mRNAsi and low-mRNAsi, respectively, using TCGA (The Cancer Genome Atlas) dataset. In addition, we collected 23 m6A regulatory factors in this study and selected m6A differential regulatory factors (DEGs) from patients with high-mRNAsi and low-mRNAsi for further study in TCGA dataset. Furthermore, we selected m6A DEGs that had prognostic value to be candidate genes in both TCGA and ICGC (The International Cancer Genome Consortium). We found that these hub m6A regulatory factors were highly expressed in liver cancer stem cells (LCSCs) using scRNA-seq data (GSE149614) and RT-qPCR. GSVA analysis also showed that the hub m6A regulatory factors’ expression was associated with the CSCs related pathways. Moreover, we performed the analyses of correlation, function, Consensus, PCA, survival, and ROC to investigate the biological functions and mechanisms of these regulatory factors in HCC patients, highlighting the importance of these candidate hub regulatory factors. In addition, we observed the prognostic role of m6A regulatory genes in HCC, and build a prognostic model in ICGC, and verified it in TCGA. Finally, the overall survival of HCC patients could be better estimated using the prognostic model. In summary, our work suggests that the m6A regulatory factors are a potential prognostic indicator for HCC patients.

Materials and methods

HCC data source and preprocessing

The data of gene expression data and clinical annotations were gained from ICGC and TCGA datasets. Two cohorts were enrolled in our work: TCGA-LIHC (371 samples) and ICGC-LIRI-JP (243 samples). For mRNA expression data, FPKM-normalized, log2-transformed data were gained, respectively, from web sites Genomic Data Commons Data Portal (GDC Data Portal) (RRID: SCR_014514), ICGC Data Portal (RRID: SCR_021722). Clinical annotations in ICGC with 232 HCC patients and in TCGA with 240 HCC patients were also downloaded for further analysis. The data of Copy number variation (CNV) were gained from TCGA. In addition, the scRNA-seq (GSE149614) of HCC was downloaded from Gene Expression Omnibus (GEO) (RRID: SCR_005012) (Zheng et al. 2018).

23 m6A regulatory factors were identified from PubMed (RRID: SCR_004846), including 8 writers (METTL3, METTL14, WTAP, METTL16, RBM15B, ZC3H13 and VIRMA) (Roundtree et al. 2017; Chen et al. 2019), 3 erasers (ALKBH5, ALKBH7 and FTO) (Jia et al. 2011; Zheng et al. 2013), and 12 readers (IGF2BP1, IGF2BP2, IGF2BP3, YTHDF1, YTHDF2, YTHDF3, HNRNPA2B1, YTHDC1, YTHDC2, HNRNPC, LRPPRC and RBMX) (Chen et al. 2019; Du et al. 2021; Casella et al. 2019). We gained the gene expression matrix of corresponding m6A regulatory factors from TCGA and ICGC data for further investigation.

Identification of m6A regulatory factors CNV and expression difference between high-mRNAsi and low-mRNAsi groups

The m6A regulatory factors expression matrix and CNV data were applicated to the differentially CNV genes and the DEGs between high-mRNAsi and low-mRNAsi patients. According to the median data, we first classified HCC patients into high-mRNAsi and low-mRNAsi group to identify among the 371 samples in TCGA. Then R package “limma” was used to determine m6A DEGs by comparing high-mRNAsi specimens with low-mRNAsi specimens. 20 m6A DEGs (adjusted P < 0.05) in TCGA were identified for further study.

The Kaplan–Meier curve based on m6A DEGs in HCC patients

Among 17 m6A differential regulatory factors in TCGA and 13 m6A regulatory factors DEGs in ICGC, we figured out the potential prognostic m6A differential regulatory factors by Survival analysis and Kaplan–Meier curves using the “survival” R package (adjusted p < 0.05). The R package “survminer” and a two-sided log-rank test were used to identify the optimal cutoff value. To gain more accurate results, a Venn plot was used to obtain the overlapped prognostic m6A different regulatory factors in either TCGA or ICGC. Finally, ten shared hub regulatory factors were left for further analysis.

scRNA-seq data analyses

scRNA sequence data of different sites of HCC was downloaded, including ten HCC patients from four relevant sites: primary tumor, metastatic lymph node, portal vein tumor thrombus (PVTT) and non-tumor liver tissue. After aggregation of the primary tumor data of the 10 patients (Du, et al. 2021), R package “Seurat” was used for cell filtration, normalization, then we found the top-2000 highly variable genes for dimensional reduction and 6 clusters: carcinoma-associated fibroblasts (CAFs), endothelial cells (ECs), bursa dependent lymphocyte (B cells), macrophages, thymus-dependent lymphocyte (T cells) and tumors.

Moreover, we classified the cluster of tumors into three groups: 3_Positive LCSCs, 3_Negative liver cancer cells, and other cells. The Seurat functions “DotPlot” were applicated to show the expression of ten hub m6A regulatory factors in 3_Positive LCSCs and 3_Negative liver cancer cells.

Human primary HCC cells, LCSCs and HCC cell lines

The methods of LCSCs isolation and culture were adopted in this study: tumor stem cells derived from liver cancer cell lines were cultured by low adsorption suspension into pellets, and Triple+ (CD133+CD24+EPCAM+) flow separation method (Zheng et al. 2018). Human primary tumor stem cells derived from clinical liver cancer tumor tissues were cultured on trophoblast by our research group (Park et al. 2015).

The primary HCC and LCSCs from HCC samples were gained from liver HCC patients who underwent surgery at Guangzhou First People’s Hospital. The method and human samples used in our study were approved by the ethics committee at Guangzhou First People’s Hospital of South China University of Technology and strict adherence to the Declaration of Helsinki. Separation of the primary HCC cells has been established as previously described (Xie et al. 2021), and the isolation of LCSCs was followed after the separation of HCC cells which were washed with Dulbecco’s modified Eagle medium Nutrient Mixture F-12 (HAM) for 3 times and resuspended in HAM added with 0.5% bovine serum albumin (BSA), 20 ng/mL Epidermal Growth Factor (EGF), 4 ng/mL Basic Fibroblast Growth Factor (bFGF) and 1% penicillin/streptomycin. We seeded these cells onto mouse embryonic fibroblasts (MEF) and incubated in a 5% CO2 and 37 °C incubator. The cells grew clonally on MEF and the medium was refreshed every day.

The cells of Hep3B and Huh7 were gained from ATCC and grew in a 5% CO2 and 37 °C incubator. The culture media were Dulbecco’s modified Eagle medium (DMEM) with 10% fetal bovine serum and 1% penicillin/streptomycin. Hep3B and Huh7 suspension cells (Hep3B SP and Huh7 SP) were cultured in HAM with 2% B27, 20 ng/ml EGF, 4 ng/ml bFGF, and 1% penicillin/streptomycin. Cells were cultured in a 5% CO2 and 37 °C incubator.

Flow cytometry

Single-cell suspensions (Hep3B SP and Huh7 SP) were prepared and incubated with CD24 antibody (BioLegend Cat#311,106, RRID: AB_314855), EPCAM antibody (BioLegend Cat#324,212, RRID: AB_756086), and CD133 antibody (Biolegend, #S16016B, RRID: AB_2734479) for 30 min in 4 °C, followed by flow cytometry analysis. LCSCs (LCSC1 and LCSC2) were prepared and incubated with ALDH antibody (STEMCELL, Cat#01700) for 30 min in 37 °C, followed by flow cytometry analysis.

RNA extraction and quantitative real-time PCR (RT-qPCR)

Total RNA was extracted using TRIzol reagent (TakaRa, Cat#9109) based on the manufacturer’s protocol. cDNA synthesis was operated using the PrimeScript™ Reverse Transcriptase kit (TakaRa, Cat#9767). The primers in Table S1 and Table S2 were used to perform qPCR.

Consensus cluster analyses and PCA analyses

R package “ConsensusClusterPlus” and “PCA” was used to determine the function of ten hub m6A regulatory factors to divide HCC patients into various subgroups and study the gene expression patterns in the corresponding subgroups. After performing consensus clustering cumulative distribution function (CDF) for k = 2 to 9 in both TCGA and ICGC, k = 2 was shown and chosen for further study. To verify the significance of the cluster results of ten hub m6A regulators, we performed survival analysis and clinical information in TCGA and ICGC datasets.

GSVA and functional annotation

R package “GSVA” was used to perform GSVA enrichment analysis, getting different biological processes between stratified subgroups. GSVA, using a non-parametric and unsupervised method, is a common way to estimate the variation in biological process activities and pathways in the samples of gene expression dataset. We obtained function sets of “c2.cp.kegg.v7.4.symbols.gmt” and “h.all.v7.4.symbols.gmt” from MSigDB dataset. Adjusted p < 0.05 and logFC > 0.1 was considered as statistically significant, and we got the top 100 different pathways between 2 models in KEGG and top 30 different pathways between 2 models in HALLMARK with “limma” R package.

Establishment and verification of the prognostic model

UNICOX Analysis of ten candidate m6A regulatory factors (adjusted p < 0.05) was performed to confirm prognostic genes in ICGC. The LASSO analysis (adjusted p < 0.05) was applicated to construct a prognostic mode at the least and best risk factors. Then using the MULTICOX Analysis, we chose four regulators to build a prognostic risk signature. Finally, survival analysis, ROC analyses and risk score distribution in both TCGA and ICGC datasets were performed to verify the prognostic model.

Establishment of a novel nomogram for individuals

We performed nomogram analyses with R “rms” to combinate this prognostic signature with relevant clinical information.

Results

Landscape of genetic variations in m6A regulatory factors related to stemness of HCC

The specific flowchart of our study is shown in Fig. 1. 23 m6A regulatory factors were finally identified in our work. mRNAsi index was utilized to analyze liver hepatocellular carcinoma (LIHC) cases in TCGA, which was selected to quantify the stem-like indices and well-calculated stemness index for HCC patients. We divided the LIHC patients into high-mRNAsi and low-mRNAsi group based on the median data of mRNAsi index. It has been confirmed that the expression level of m6A regulatory factors is correlated with CNV, and the change of CNV may be one of the main reasons causing the expression disorder of m6A regulatory factors (Li et al. 2021b; Li et al. 2019). In this study, we found that the CNV of 23 m6A regulatory factors showed a prevalent alteration and the CNV alteration frequency in the high-mRNAsi patients were higher than those in the low-mRNAsi patients (Fig. 2A). As the study showing that the CNV was correlated with stemness of HCC, most of m6A regulatory factors in high-mRNAsi patients tend to gain and loss copy number (Fig. 2B, C). Then, we compared the m6A regulatory DEGs between low- and high-mRNAsi patients. Interestingly, 23 m6A regulatory factors were remarkably highly expressed in high-mRNAsi patients among 371 samples (Fig. 2D). Excluding METTL14, ZC3H13 and ALKBH7, other 20 m6A regulatory factors were regarded as significance (adjusted p < 0.05) (Fig. 2D). We concluded that m6A regulatory factors expression was correlated with the stemness of HCC tissues, and the expression lever of m6A regulatory factors was higher in HCC tissues with high stemness.

Fig. 1.

Fig. 1

The whole flowchart was shown to reveal the analyses process of prognostic model of HCC and underlying associations with LCSCs based on ICGC-LIRI-JP and TCGA-LIHC databases

Fig. 2.

Fig. 2

Expression and genetic landscapes of m6A DEGs in HCC patients with high- or low-mRNAsi in TCGA. A CNV frequency of in high- and low-mRNAsi in TCGA cohort. B, C Gain or loss copy number of 23 m6A regulatory factors in HCC patients with high- and low-mRNAsi in TCGA. D Expression level of 23 m6A regulatory factors in high- or low-mRNAsi in TCGA. E The overlapped prognostic m6A regulatory factors in TCGA and ICGC cohorts. Ten hub m6A regulatory factors were gained for detail analysis

Survival analysis of m6A regulatory factors in TCGA and ICGC

To investigate the correlation between m6A DEGs which was related to stemness and prognosis of HCC patients, we found out 17 m6A DEGs in TCGA and 13 m6A DEGs in ICGC were considered significant (adjusted p < 0.05) by overall survival (OS) analysis in 20 m6A regulatory factors. After determining the shared regulatory factors in both TCGA and ICGC, ten hub regulatory factors were selected (Fig. 2E). The survival analysis of ten hub regulatory factors, including YTHDF1, IGF2BP1, METTL3, IGF2BP3, HNRNPA2B1, IGF2BP2, RBM15B, HNRNPC, RBMX and LRPPR were presented (Figure S1, S2). Furthermore, we found that higher expression of ten hub regulatory factors were correlated with poor prognosis for HCC patients, consistent with the tumorigenicity of m6A regulatory factors in the development of malignant tumors (Figure S1, S2).

m6A regulatory factors were highly expressed in LCSC

Based on association between the expression and stemness of HCC, it promoted us to consider the relationship between m6A regulatory factors and the LCSCs which is known as tumor-initiating cells and is a small subpopulation of malignant tumor cells with stemness characteristics (Ahmed et al. 2017; Makena et al. 2020). Moreover, CSCs maintain the self-renewal ability by the assistance of m6A demethylation actively (Ma and Ji 2020). We analyzed the association between ten hub regulatory factors and three markers of CSCs. As a result, most of ten hub regulatory factors were positively related to CD24, EPCAM, and PROM1 (CD133), respectively, either in TCGA or ICGC patients (Fig. 3A, B). Then, we obtained a dataset (GSE149614), containing ten HCC samples from four relevant sites including: primary tumors, metastatic lymph nodes, portal vein tumor thrombus (PVTT) and non-tumor livers. After normalization, we collected ten HCC tumor samples in the dataset. Undergoing cell filtration, dimensional reduction, and clustering analysis, we found six clusters including: carcinoma-associated fibroblasts (CAFs), endothelial cells (ECs), macrophages, thymus bursa dependent lymphocytes (B cells), dependent lymphocytes (T cells), and tumors by markers of differential cells (Fig. 3C, Figure S3). Afterwards we calculated and selected the components of tumors for further analysis. CD24, EPCAM, and CD133, the markers of cancer stem cells (CSCs) reflected the stemness. Provided that the scRNA-seq data, we proposed to study the relationship between these ten hub m6A regulatory factors and these three markers of CSCs. Thus, we clustered the cell populations into three groups with 3_Positive (CD24+EPCAM+CD133+) LCSCs, 3_Negative (CD24EPCAMCD133) liver cancer cells and other cells, respectively (Fig. 3D, Figure S4A–S4C).

Fig. 3.

Fig. 3

Stemness analyses of ten hub m6A regulators in TCGA, ICGC and scRNA-seq datasets. A, B Correlation analysis of ten hub m6A regulatory factors and three CSC markers in TCGA and ICGC database. *P < 0.05, **P < 0.01, ***P < 0.001. C UMAP plot of different cell types, pink represented tumor cells. D UMAP plot of 3_Positive (CD24+EPCAM+CD133+) LCSCs, 3_ Negative (CD24EPCAMCD133) liver cancer cells and other cells in cluster of tumor cells. E Expression levels of ten m6A regulatory factors in 3_Positive (CD24+EPCAM+CD133+) LCSCs and 3_Negative (CD24EPCAMCD133) liver cancer cells in dot diagram. F, G RT-qPCR analysis of ten m6A regulatory factors in 3_Negative (CD24EPCAMCD133) liver cancer cells (Hep3B 3-, Huh7 3-) and 3_Positive (CD24+EPCAM+CD133+) LCSCs (Hep3B 3 + , Huh7 3 +). H, I RT-qPCR analysis for ten m.6A regulatory factors in primary liver cancer cells (LCC1, LCC2) and LCSCs (LCSC1, LCSC2)

Then, we discovered that m6A regulatory factors were almost highly expressed in the tumors of patients with 3_Positive LCSCs (Fig. 3E), this triggered us to consider that the high-expression m6A regulators may heighten the stemness of HCC cells and then led to poorer prognosis. In order to confirm our speculation, we sorted out 3_Positive (CD24+EPCAM+CD133+) LCSCs and 3_Negative (CD24EPCAMCD133) liver cancer cells by flow cytometry in both Hep3B and Huh7 cells (Figure S4D, S4F, S4G), then we discovered that the expression of these ten m6A regulatory factors were highly expressed in 3_Positive (CD24+EPCAM+CD133+) LCSCs from Hep3B and Huh7 cells by RT-qPCR (Fig. 3F–G). For further verification, we isolated the LCSCs from human samples such as LCSC1 and LCSC2 (Figure S4E) which expressed stemness characteristics. Their expression of stemness marker ALDH was more than 70% and the expression ratios of stemness genes and stemness markers were the same or even higher than those of Hep3B SP and Huh7 SP cells (Figure S5A−E). Employing RT-qPCR analysis, we found that these ten hub regulatory factors were higher expressed in LCSCs (LCSC1 and LCSC2) than liver cancers (LCC1 and LCC2) (Fig. 3H, I). These results showed that these ten hub regulatory factors might be the significant factors to influence the stemness of HCC.

HCC patients were divided into different subgroups by ten hub m6A regulatory factors

Based on the important association between these ten hub regulatory factors with stemness characteristics, we classified HCC patients with qualitatively different molecular subgroups accorded to the gene expression of 10 hub regulatory factors and 2 distinct molecular subgroups were eventually identified by unsupervised clustering, which included 213 patients in subgroup A and 158 patients in subgroup B in TCGA dataset (Fig. 4A, B), while 104 patients in subgroup A and 139 patients in subgroup B in ICGC dataset (Fig. 4F, G). The results from ICGC dataset confirmed the correctness those from TCGA dataset. Then, we adopted PCA analyses to better visualize the subgroups, including TCGA-A, TCGA-B, ICGC-A, and ICGC-B. The whole gene expression matrix of patients with HCC could be categorized into two distinct groups by PCA analyses, confirming the robustness of the existence of these two subgroups in both TCGA and ICGC datasets (Fig. 4C, H). In addition, we found that the subgroups separated by these ten hub m6A regulators had a marked different prognosis of patients with HCC. In TCGA dataset, TCGA-B subgroup in which the expression of ten hub regulatory factors was higher had a worse prognosis than TCGA-A subgroup did (Fig. 4D, E). As well in ICGC dataset, the ICGC-A had worse prognosis when compared with ICGC-B that was correlated with higher expression of ten hub regulatory factors (Fig. 4I, G). Furthermore, we studied the relationship of clinicopathological characteristics and different subgroups in ICGC and TCGA. The TCGA-B with higher expression level of m6A regulatory factors was at higher stage of T (adjusted p < 0.05) (Fig. 4E), while the ICGC-A subgroup with higher gene expression was significantly associated with gender (adjusted p < 0.05), and with worse survival status (adjusted p < 0.001) (Fig. 4G). We concluded that m6A regulators expression data in ICGC were more correlated with the survival and prognosis of HCC patients.

Fig. 4.

Fig. 4

Consensus cluster and PCA by m6A regulatory factors. A, B CDF for k = 2 was shown and chosen for further study from k = 2 to 9 in TCGA. C PCA analyses of the gene expression matrix of patients with HCC in TCGA. D Survival analyses for HCC patients in TCGA. E Heatmap with clinicopathologic characteristics of the two subgroups (subgroup A and subgroup B) divided by consensus cluster of ten hub m6A regulators in TCGA. F, G CDF for k = 2 to 9 in ICGC, k = 2 was showed and chose for further study. H PCA analyses of gene expression matrix of HCC patients in ICGC. I Survival analyses for HCC patients in ICGC. G Heatmap of clinicopathologic characteristics of the two subgroups (subgroup A and subgroup B) defined by consensus expression of ten hub m6A regulatory factors in ICGC

Function characteristics in different subgroups

To better study the function of ten hub m6A regulatory factors, correlation analysis was conducted that ten hub regulatory factors exhibited strong correlation with each other with strict criteria (adjusted p < 0.05) in TCGA and ICGC (Fig. 5A, B). In addition, the relativity of hub regulatory factors in TCGA was more close than those in ICGC, thus TCGA database was used as the main database for functional analysis. On the other hand, to determine how the molecular subtypes affected the prognosis, we performed GSVA of the two subgroups accorded to gene expression of ten hub m6A regulatory factors in KEGG and HALLMARK annotation. As shown in Fig. 5C, compared with the cluster A, clusters B which highly expressed ten hub regulatory factors was active in methylation processes, cell cycle and tumor malignancy. In contrast, cluster A was enriched in a variety of metabolic processes. Giving various pathways in the KEGG annotation were enriched such as “bladder cancer”, “thyroid cancer”, “colorectal cancer”, “glioma”, “renal cell carcinoma”, and “pathways in cancer” (Fig. 5C). Therefore, we concluded that ten hub regulatory factors played critical role in a variety of cancers and may be promising therapeutic targets.

Fig. 5.

Fig. 5

Correlation and function analysis of prognostic m6A DEGs in TCGA and ICGC. A, B The associations of ten overlapped DEGs with prognostic relevance in TCGA and ICGC. The closer relationship between the overlapped regulatory factors, the closer correlation value is to 1, the redder the square. C, D The GSVA of ten hub regulatory factors based on KEGG and HALLMARK annotation in TCGA. The functionality of the red and green logo was what we focus on

Furthermore, as we had showed the expression of these ten hub regulatory factors was more highly expressed in high-mRNAsi HCC patients, indicating the potential function of carcinogenesis was correlated with CSCs. And another result supported this conclusion that to show that the developmental signaling pathways including Notch signing pathway, Hedgehog signaling, and Wnt beta catenin signaling were ranked as the top (Fig. 5C), which sustained CSC alive (Clara et al. 2020). Consistently, HALLMARK annotation for the m6A DEGs among clusters A and B, the Notch, Hedgehog, and Wnt beta catenin signaling pathway also ranked as the top and is shown in Fig. 5D. These function results showed that these ten hub regulatory factors might affect the survival and development of CSCs in HCC patients.

A prognostic model was established in ICGC dataset and verified in TCGA dataset

Giving the close relationship between ten hub regulators and prognosis of HCC, we adopted a UNICOX analyses based on the expression date of these ten m6A regulatory DEGs in ICGC to study its prognostic value (Fig. 6A). The results indicated that five out of ten hub regulatory factors were closely correlated with clinical outcomes (adjusted p < 0.05). These five regulatory factors, including RBM15B, LRPPRC, IGF2BP1, RBMX, and IGF2BP3, were all oncogenes with hazard ratio (HR) > 1(Fig. 6A). Then, we performed LASSON analyses to better identify the hub m6A regulatory factors with the strongest prognostic power. Consequently, only four regulatory factors were selected to build the prognostic model with corresponding coefficients from the LASSO algorithm (Fig. 6B). In addition, we utilized the MULTICOX analysis to confirm the consequence and gain the corresponding coefficients, and the specific formula to calculate the RiskScore was as follows: RiskScore = 2.73 × RBM15B + 4.58 × LRPPRC + 1.23 × IGF2BP1 + 3.00 × IGF2BP3 (Fig. 6C). To explore the prognostic function of this four-regulator prognostic model, we calculate the RiskScore based on this risk score model and, respectively, separated HCC patients into low-risk and high-risk clusters accorded to the median RiskScore in both the discovery cohort (ICGC) and the validation cohort (TCGA). As the OS curve analyses had demonstrated, high-risk HCC patients had worse prognosis compared with HCC patients with low-risk (adjusted p < 0.001) (Fig. 6D, F). Then, we adopted a ROC curve analyses and evaluated the area under ROC curve (AUC). The AUC of the discovery cohort in ICGC were 0.731, 0.769, and 0.771 for the 1-, 2-, and 3-year OS, while the AUC in TCGA were 0.758, 0.642, and 0.643 for the 1-, 2-, and 3-year OS, indicating great predictive ability for survival status (Fig. 6E, G). Ultimately, the risk score distribution of HCC patients, related survival status and heatmap displayed the expression of four m6A prognostic regulatory factors in both high-risk and low-risk was exhibited in Fig. 6H, I). As a result, we discovered that the higher the RiskScore was, the higher prognostic regulators’ expression, the worse the survival outcome.

Fig. 6.

Fig. 6

Construction and validation of the m6A regulatory factors model. A UNICOX analyses of ten m6A regulators for patients with HCC in ICGC which is as training dataset in this study (adjusted p < 0.05). B Establishment of the LASSON regression model. C MULTICOX analyses of RBM15B, LRPPRC, IGF2BP1, and IGF2BP3 in ICGC and the prognostic model was established with concordance index being 0.75. As following was the formula: RiskScore = 2.73 × RBM15B + 4.58 × LRPPRC + 1.23 × IGF2BP1 + 3.00 × IGF2BP3. D, F OS analyses of patients with HCC accorded to the RiskScore model in ICGC and in the verification dataset (TCGA). E, G The ROC analyses of ICGC and TCGA were calculated by RiskScore model at 1, 2, and 3 years. H, I Heatmaps and distribution of four regulatory factors in the high- and low-risk subgroups in ICGC and TCGA datasets, respectively. The green curve was low-risk and the red curve was high-risk. The green circle means alive and the red circle means dead. The RiskScore of patients with HCC in verification dataset was gained by risk score model and split in two groups based on the middle score of the RiskScore in training dataset

Construction of a prognostic nomogram for HCC patients

Giving that a synthetic analysis could better predict the clinical prognosis for HCC patients, UNICOX and MULTICOX analysis were performed to figure out the association between the HCC patients and clinically relevant information. The result showed that RiskScore, the stages of disease progression, and the gender, were the critical risk factors of HCC patients (adjusted p < 0.05) (Fig. 7A, B). We made a nomogram based on gender, age, prior malignancy, stage, and RiskScore to provide a quantitative method to forecast the clinical outcome of individuals (Fig. 7C). As shown in Fig. 7C, the novel nomogram could comprehensively forecast the 1-, 2-, and 3-year survival outcome of HCC patients. Furthermore, we found that the RiskScore was the most significant factor of all clinical variables.

Fig. 7.

Fig. 7

An innovative nomogram for ICGC risk score model. A UNICOX analyses of clinical characteristics. B MULTICOX analyses of clinical characteristics. C A nomogram according to RiskScore model and clinical-related characteristics

Discussion

Hepatocellular carcinoma (HCC) ranking third in cancer deaths worldwide is a serious disease (Siegel et al. 2021; Sung et al. 2021). The recurrence rate of HCC remains high, although many patients can be early diagnosis and treatment (Siegel et al. 2021). For patients whose disease has entered the middle and late stages, the prognosis is much less optimistic. Moreover, CSCs are known to work in the development of cancer, relapse and resistance after treatment. The characterization of CSCs and its molecular basis may provide many benefits for cancer treatment and therapy. Furthermore, m6A RNA modifications can be modified by “writes” and removed by “erases” (Fu et al. 2014). Differential expression of specific m6A regulatory factors was correlated with abnormally regulated RNAs in cancers; however, the same m6A regulatory factors may play different roles in different cancers (Chen et al. 2019; Huang et al. 2020). It is also well known that m6A RNA modification has a crucial function in regulating the generation, maintenance, and drug resistance with CSCs (Cui et al. 2017). The biological basis and functional characteristics of m6A regulatory factors may be innovative indicators and targets.

Previous study has shown that m6A regulatory factors could exert effectively in various tumors and m6A methylation is highlighted in relation to CSCs and cancer biology. For example, Jeremy et al. found that m6A methylation was generally upregulated in glioblastoma and the “writers” YTHDF2 specifically stabilizes MYC mRNA in CSCs of glioblastoma (Dixit et al. 2021). Likewise, Zhang CZ et al. suggested that YTHDF2 regulated the expression of OCT4 through m6A methylation, promoting the phenotype LCSCs and tumor metastasis (Zhang et al. 2020). Studies have shown that FTO plays a key role in the self-renewal and immune evasion of CSCs and indicates the great prospects of cancer therapy by targeting FTO (Su et al. 2020). Giving the study of m6A in CSCs, we found that m6A mechanism might serve as effective strategies and new target for elimination of CSCs. In this study, we focused on the comprehensive analyses of TCGA and ICGC databases. The analyses of the m6A regulatory DEGs in high-mRNAsi and low-mRNAsi were conducted in TCGA, and relevant OS analyses of hub regulators were performed in ICGC and were almost confirmed in TCGA. The data of scRNA-seq with HCC were also performed to verify the positive association between m6A regulatory factors and LCSCs. Furthermore, besides the consensus analysis and nomogram, we also successfully build a prognostic model of m6A regulators for HCC which was established in ICGC verified in TCGA databases, offering a prospective m6A-related target for patients with HCC.

Ten overlapped m6A regulatory DEGs had been gained in both TCGA and ICGC in our study, these ten hub m6A regulatory factors were upregulated in high-mRNAsi groups and showed worse prognosis in HCC patients. As a result, we figured out that m6A regulatory factors might be a prospective target for CSCs in HCC, then by analyzing the scRNA-seq data of HCC and followed by RT-qPCR validation, we found that the ten hub m6A regulatory factors were positively related to the CSCs markers CD24, EPCAM, and CD133 and highly expressed in LCSCs. Besides the positive correlation with CSCs markers, the results of GSVA function analysis of KEGG and HALLMARK enrichment also showed that the enriched terms were not only related to cell cycle, immune modifications and tumor malignancy, but also to the developmental signaling pathways of CSCs. The results of function analysis showed that these ten hub regulatory factors might affect the survival and development of CSCs in HCC patients. Moreover, we used UNICOX, LASSO, and MULTICOX analysis to select four risk factors (RBM15B, LRPPRC, IGF2BP1 and IGF2BP3) for establishing prognostic model. RBM15B, RNA-binding protein, performs as a key regulator of m6A methylation of RNA. Jaffrey SR’s team demonstrated that RBM15 and related RBM15B interacted with WTAP to form the complex to target mRNA and also were critical for XIST-mediated gene silencing (Patil et al. 2016). Leucine-rich PPR motif protein (LRPPRC), a member of the PPR family, was a known genetic mutation. The team from Liu et al. found that SNHG17 could inhibit C-MYC ubiquitination by interacting with LRPPRC in HCC and promote HCC proliferation through snHG17-LRPPRC-C-MYC regulatory axis, which provides a potential target for cancer therapy (Liu et al. 2021). As the member of the insulin-like growth factor2 mRNA binding protein family, IGF2BP1 can promote the stability of MGAT5 mRNA by upregulating m6A modification of MGAT5, thus maintaining the LCSCs phenotype (Yang et al. 2021). IGF2BP3, the protein encoded by this gene was primarily found in the nucleolus, and also known as that HBV-pgRNA (pregenomic RNA) could be a potential biomarker to predict clinical outcome and recurrence of HCC (Ding et al. 2021), thus the pgRNA-IGF2BP3 axis had a crucial function in HBV-associated liver cancer. In addition, IFN-α-2A could augment the level of m6A RNA modification to reduce pgRNA stability, thereby controlling the occurrence of HBV-associated HCC. The built RiskScore with those four factors RBM15B, LRPPRC, IGF2BP1, and IGF2BP3 were likely to divide HCC patients into two prognosis-related clusters and further verified by survival, ROC, UNICOX and MULTICOX analysis in this study. In summary, we analyzed RNA-seq data of m6A regulatory factors employing ICGC and TCGA datasets. In addition, the RiskScore feature and prognostic graph were inferred to comprehensively forecast the clinical outcome for patients with HCC. These consequences suggest that ten hub regulators are promising targets for potential therapeutic strategies, and clinical and experimental validation are needed to further verify the clinical function and more potential applications of this model.

Supplementary Information

Below is the link to the electronic supplementary material.

Acknowledgements

Not applicable.

Abbreviations

scRNA-seq

Single-cell RNA-sequencing

RNA-seq

RNA-sequencing

HCC

Hepatocellular carcinoma

CSCs

Cancer stem cells

m6A

N6-Methyladenosine

OS

Overall survival

GSVA

Gene set variation analysis

APC

Principal component analysis

UNICOX

Univariate cox regression

LASSON

Least absolute shrinkage and selection operator

MULTICOX

Multivariate cox regression

OCLR

Single-class logistic regression

mRNAsi index

The stemness index based on mRNA expression

TCGA

The cancer genome Atlas

ICGC

The International Cancer Genome Consortium

LCSCs

Liver cancer stem cells

CNV

Copy number variation

GEO

Gene Expression Omnibus

CAFs

Carcinoma-associated fibroblasts

ECs

Endothelial cells

B cells

Bursa-dependent lymphocyte

T cells

Thymus-dependent lymphocyte

HAM

Dulbecco’s modified Eagle medium Nutrient Mixture F-12

BSA

Bovine serum albumin

EGF

Epidermal growth factor

bFGF

Basic fibroblast growth factor

MEF

Mouse embryonic fibroblasts

DMEM

Dulbecco’s modified Eagle medium

CDF

Consensus clustering cumulative distribution function

LIHC

Liver hepatocellular carcinoma

AUC

Area under ROC curve

Author contributions

YD, WG, BW and YO supervised the study. YL, SC, TG, TL, SL, WZ, SZ and KM prepared the samples, performed the experiments and collected the data. YL, GY, JX and HC collected and discussed the method of analysis. YL conducted the prognosis analysis and statistical analysis. YL wrote the manuscript, and YD reviewed and revised the manuscript. All the authors read and approved the manuscript.

Funding

This work was supported in part by Research Starting Funding of South China University of Technology (D6181910, D6201880, K5180910, and K5204120), by Research Starting Funding of the Second affiliated Hospital of South China University of Technology (KY09060026), by Research Agreement between South China University of Technology and Guangzhou First People’s Hospital (D9194290), by the Key Project of Guangzhou Science and Technology Program (No. 202102010027), and by GuangZhou Basic and Applied Basic Research Foundation (202201010538).

Availability of data and materials

No data were used to support this study.

Declarations

Conflict of interest

All the authors declare that they have no conflict of interests.

Ethical approval and consent to participate

The ethics committee at Guangzhou First People’s Hospital of South China University of Technology approved this study.

Consent for publication

All the authors have reviewed the manuscript and agree to publish it in its current form.

Footnotes

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Yan Liang, Sen Chen and Jinghe Xie have contributed equally to this work.

Contributor Information

Yimeng Ou, Email: yimengou@126.com.

Bailin Wang, Email: wangbailin888@sina.com.

Weili Gu, Email: lili-6423@163.com.

Yuyou Duan, Email: yuyouduan@scut.edu.cn.

References

  1. Ahmed M et al (2017) Concise review: emerging drugs targeting epithelial cancer stem-like cells. Stem Cells 35(4):839–850 [DOI] [PubMed] [Google Scholar]
  2. Alqahtani A et al (2019) Hepatocellular carcinoma: molecular mechanisms and targeted therapies. Medicina (kaunas) 55(9):526 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Casella G et al (2019) mRNA methylation in cell senescence. Wiley Interdiscip Rev RNA 10(6):e1547 [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Chen XY, Zhang J, Zhu JS (2019) The role of m(6)A RNA methylation in human cancer. Mol Cancer 18(1):103 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Clara JA et al (2020) Targeting signalling pathways and the immune microenvironment of cancer stem cells - a clinical update. Nat Rev Clin Oncol 17(4):204–232 [DOI] [PubMed] [Google Scholar]
  6. Corbett AH (2018) Post-transcriptional regulation of gene expression and human disease. Curr Opin Cell Biol 52:96–104 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Cui Q et al (2017) m(6)A RNA methylation regulates the self-renewal and tumorigenesis of glioblastoma stem cells. Cell Rep 18(11):2622–2634 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Ding WB et al (2021) HBV/Pregenomic RNA increases the stemness and promotes the development of HBV-related HCC through reciprocal regulation with insulin-like growth factor 2 mRNA-binding protein 3. Hepatology 74(3):1480–1495 [DOI] [PubMed] [Google Scholar]
  9. Dixit D et al (2021) The RNA m6A reader YTHDF2 maintains oncogene expression and is a targetable dependency in glioblastoma stem cells. Cancer Discov 11(2):480–499 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Du J et al (2021) m6A regulator-mediated methylation modification patterns and characteristics of immunity and stemness in low-grade glioma. Brief Bioinform. 10.1093/bib/bbab013 [DOI] [PubMed] [Google Scholar]
  11. Fu Y et al (2014) Gene expression regulation mediated through reversible m6A RNA methylation. Nat Rev Genet 15(5):293–306 [DOI] [PubMed] [Google Scholar]
  12. Guan K et al (2020) Expression status and prognostic value of M6A-associated genes in gastric cancer. J Cancer 11(10):3027–3040 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Huang H, Weng H, Chen J (2020) m(6)A modification in coding and non-coding RNAs: roles and therapeutic implications in cancer. Cancer Cell 37(3):270–288 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Jia G et al (2011) N6-methyladenosine in nuclear RNA is a major substrate of the obesity-associated FTO. Nat Chem Biol 7(12):885–887 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Jin Y et al (2021) Analysis of m6A-related signatures in the tumor immune microenvironment and identification of clinical prognostic regulators in adrenocortical carcinoma. Front Immunol 12:637933 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Li Y et al (2019) Molecular characterization and clinical relevance of m(6)A regulators across 33 cancer types. Mol Cancer 18(1):137 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Li Y et al (2021a) Analysis of m6A RNA methylation-related genes in liver hepatocellular carcinoma and their correlation with survival. Int J Mol Sci 22(3):1474 [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Li Y et al (2021b) Molecular characterization, biological function, tumor microenvironment association and clinical significance of m6A regulators in lung adenocarcinoma. Brief Bioinform. 10.1093/bib/bbaa225 [DOI] [PubMed] [Google Scholar]
  19. Liu JY et al (2021) LncRNA SNHG17 interacts with LRPPRC to stabilize c-Myc protein and promote G1/S transition and cell proliferation. Cell Death Dis 12(11):970 [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Ma Z, Ji J (2020) N6-methyladenosine (m6A) RNA modification in cancer stem cells. Stem Cells 38(12):1511 [DOI] [PubMed] [Google Scholar]
  21. Makena MR et al (2020) Cancer stem cells: Road to therapeutic resistance and strategies to overcome resistance. Biochim Biophys Acta Mol Basis Dis 1866(4):165339 [DOI] [PubMed] [Google Scholar]
  22. Malta TM et al (2018) Machine learning identifies stemness features associated with oncogenic dedifferentiation. Cell 173(2):338-354.e15 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Pan H, Fu X, Huang W (2011) Molecular mechanism of liver cancer. Anticancer Agents Med Chem 11(6):493–499 [DOI] [PubMed] [Google Scholar]
  24. Park SC et al (2015) Clonogenically culturing and expanding CD34+ liver cancer stem cells in vitro. Stem Cells Dev 24(13):1506–1514 [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Patil DP et al (2016) m(6)A RNA methylation promotes XIST-mediated transcriptional repression. Nature 537(7620):369–373 [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Roignant JY, Soller M (2017) m(6)A in mRNA: an ancient mechanism for fine-tuning gene expression. Trends Genet 33(6):380–390 [DOI] [PubMed] [Google Scholar]
  27. Roundtree IA et al (2017) Dynamic RNA modifications in gene expression regulation. Cell 169(7):1187–1200 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Shi H et al (2017) YTHDF3 facilitates translation and decay of N(6)-methyladenosine-modified RNA. Cell Res 27(3):315–328 [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Siegel RL et al (2021) Cancer statistics, 2021. CA Cancer J Clin 71(1):7–33 [DOI] [PubMed] [Google Scholar]
  30. Su R et al (2020) Targeting FTO suppresses cancer stem cell maintenance and immune evasion. Cancer Cell 38(1):79-96.e11 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Sung H et al (2021) Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin 71(3):209–249 [DOI] [PubMed] [Google Scholar]
  32. Wang X et al (2021) RALYL increases hepatocellular carcinoma stemness by sustaining the mRNA stability of TGF-β2. Nat Commun 12(1):1518 [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Xie J et al (2021) ITGB1 drives hepatocellular carcinoma progression by modulating cell cycle process through PXN/YWHAZ/AKT pathways. Front Cell Dev Biol 9:711149 [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Yang Y et al (2018) Dynamic transcriptomic m(6)A decoration: writers, erasers, readers and functions in RNA metabolism. Cell Res 28(6):616–624 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Yang JD et al (2019) A global view of hepatocellular carcinoma: trends, risk, prevention and management. Nat Rev Gastroenterol Hepatol 16(10):589–604 [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Yang Y et al (2021) IGF2BP1 Promotes the liver cancer stem cell phenotype by regulating MGAT5 mRNA stability by m6A RNA methylation. Stem Cells Dev 30(22):1115–1125 [DOI] [PubMed] [Google Scholar]
  37. Zhang C et al (2020) YTHDF2 promotes the liver cancer stem cell phenotype and cancer metastasis by regulating OCT4 expression via m6A RNA methylation. Oncogene 39(23):4507–4518 [DOI] [PubMed] [Google Scholar]
  38. Zheng G et al (2013) ALKBH5 is a mammalian RNA demethylase that impacts RNA metabolism and mouse fertility. Mol Cell 49(1):18–29 [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Zheng H et al (2018) Single-cell analysis reveals cancer stem cell heterogeneity in hepatocellular carcinoma. Hepatology 68(1):127–140 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

No data were used to support this study.


Articles from Journal of Cancer Research and Clinical Oncology are provided here courtesy of Springer

RESOURCES