Skip to main content
Cancers logoLink to Cancers
. 2023 Jan 13;15(2):509. doi: 10.3390/cancers15020509

The Role of Intratumor Microbiomes in Cervical Cancer Metastasis

Lu Jiang 1,, Baofeng Duan 1,, Peng Jia 1, Yan Zhang 1,*, Xin Yan 1,*
Editor: W Martin Kast1
PMCID: PMC9856768  PMID: 36672459

Abstract

Simple Summary

Microbiomes are thought to be an essential characteristic of tumors, influencing their development and progression. We found and validated certain microbiomes associated with tumor metastasis in cervical cancer samples. Furthermore, we attempted to elucidate the mechanism of the interaction between microbiomes and host cells utilizing a multiomics study. Finally, we developed an excellent prognostic prediction model for cervical cancer employing these microbiomes and their linked differentially expressed genes. This study conducted novel research concerning the link between tumor microbiomes and the host, highlighting the role of microbiomes in cervical cancer metastasis.

Abstract

Background: Intratumor microbiomes can influence tumorigenesis and progression. The relationship between intratumor microbiomes and cervical cancer metastasis, however, remains unclear. Methods: We examined 294 cervical cancer samples together with information on microbial expression, identified metastasis-associated microbiomes, and used machine learning methods to validate their predictive ability on tumor metastasis. The tumors were subsequently typed based on differences in microbial expression. Differentially expressed genes in different tumor types were combined to construct a tumor-prognostic risk score model and a multiparameter nomogram model. In addition, we performed a functional enrichment analysis of differentially expressed genes to infer the mechanism of action between microbiomes and tumor cells. Results: Based on the 15 differentially expressed microbiomes, machine learning models were able to correctly predict the risk of cervical cancer metastasis. In addition, both the risk score and the nomogram model accurately predicted tumor prognosis. Differences in the expression of endogenous genes in tumors can influence the distribution of the intracellular microbiomes. Conclusions: Intratumoral microbiomes in cervical cancer are associated with tumor metastasis and influence disease prognosis. A change in gene expression within tumor cells is responsible for differences in the microbial populations within the tumor.

Keywords: cervical cancer, metastasis, microbiome, machine learning, nomogram, ferroptosis

1. Introduction

In terms of incidence and mortality, cervical cancer ranks as the third most common malignancy of the female reproductive system [1]. Persistent infection with high-risk human papillomaviruses (HPV) is closely associated with the occurrence and progression of cervical cancer [2]. In most cases, cervical cancer can spread extensively through the lymphatic vessels. Once lymph nodes or distant organs are involved, the prognosis worsens [3]. According to a growing body of research, the cervicovaginal microbiome plays a significant role in the persistence, recurrence, and progression of HPV infection [4]. Nevertheless, previous studies have focused primarily on the impact of the intravaginal microbiome on HPV infection and cervical cancer pathogenesis, neglecting the role of the intratumor microbiome in tumor development and prognosis, and in particular the relationship between distant metastasis and the microbiome within cervical cancer tumors [5,6].

Tumor microbiome refers to the genome of microorganisms (bacteria, archaea, fungi, and viruses) present in the tumor parenchyma and the microenvironment surrounding the tumor [7]. It has been demonstrated that intratumor microbiomes are more often parasitized by tumor cells and immune cells within the tumor microenvironment [8,9]. Using electron microscopy, Fu et al. [10] found that approximately 3% of tumor cells contained bacteria, and approximately 97.25% of the bacteria were intracellular parasites, which could promote lung metastasis of breast cancer cells. It is believed that the tumor microbiome is one of the most important characteristics of tumors, and is present in a wide range of solid tumors and influences tumorigenesis and progression by promoting host genomic mutations and immune modulation [11].

As tumor tissue is a low microbial abundance environment, the identification of microbiomes is a primary concern when studying the intratumor microbiome. Over 11,000 cancer cases were catalogued for the Cancer Genome Atlas (TCGA), which is a huge, comprehensive database of cancer molecular information [12]. A workflow developed by Poore et al. [13] enables corrected microbial abundances to be derived from high-throughput sequencing data of human cancer cells, and this method was applied to create a dataset of pancancerous tumor microbial abundances derived from whole genome or RNA sequencing data for the TCGA cohort. We identified the microbiomes associated with tumor metastasis using data from multiple genomics, systematically evaluated the connection between microbiomes and host gene expression, and demonstrated the significant potential of microbiomes in the diagnosis, pathogenesis, and prognosis of cervical cancer metastasis.

2. Materials and Methods

2.1. Preparation of Data

Microbiome data, mRNA-seq data, and clinical information for cervical cancer samples were downloaded from the TCGA–CESC cohort (https://portal.gdc.cancer.gov/; accessed on 1 November 2022). In particular, Poore et al. [13] derived microbiome data from secondary analyses of sequencing data. With regard to the total sequencing read length, approximately 7.2% of sequences of nonhuman origin were identified, of which 35.2% were identified as bacterial, viral, or archaeal, and annotated to genus-level operational taxonomic units (OTUs) by Kraken. A log2 counts per million (CPM) expression matrix was created using Voom normalization and SNM correction, available from CBioPortal (https://cbioportal-datahub.s3.amazonaws.com/cesc_tcga_pan_can_atlas_2018.tar.gz; accessed on 1 November 2022). The American Joint Committee on Cancer (AJCC) and the International Federation of Gynecology and Obstetrics (FIGO) staging systems are commonly used for staging cervical cancer. The patients were categorized according to their age, AJCC stage, and FIGO stage as well as other clinical characteristics. A total of 306 primary tumor samples were collected from the TCGA cervical squamous cell carcinoma and endocervical adenocarcinoma (CESC) cohort, of which 294 contained microbiological information. We classified patients into metastatic and nonmetastatic groups according to their N and M stages. N staging represents lymph node involvement, with N0 representing negative lymph nodes and N1 representing pelvic lymph node metastases. M staging represents distant metastases, with M0 representing non-distant metastases and M1 representing distant metastases [14]. Accordingly, patients who met both N0 and M0 were classified as nonmetastatic, and those who met N1 or M1 (any T stage) were classified as metastatic.

Additionally, the GSE52903 dataset in the Gene Expression Omnibus (GEO) database (https://ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE52903; accessed on 15 November 2022) was downloaded in order to obtain gene expression profiles and clinical information for all tumor samples as an external validation dataset. This comprised 72 samples, 17 of which were normal controls and 55 of which were tumor samples [15].

2.2. Identification of Tumor Metastasis-Associated Microbiomes and Evaluation of Machine Learning Classification Models

Firstly, the low-abundance microorganisms (less than two samples with expression log CPM > 0) were removed, and the limma package was then applied between the progressive and nonprogressive tumor groups to identify microbiomes at the genus level that were differentially expressed in the two groups based on screening criteria of log2 fold change (FC) ≥ 0.4 and p value < 0.05. Differentially expressed microbiomes were utilized as an explanatory variable, and cervical cancer metastases were chosen as the response. After dividing all 142 samples into training and test sets, three machine learning models were built and tested using the caret and DALEX packages to plot cumulative residual curves using random forest, support vector machine, and generalized linear models. The Mleval package was used to plot ROC curves for the tenfold cross-validation case to select the best training model. The importance of variables was calculated using the wrapper method, and the best combination of variables was filtered using recursive feature elimination (RFE) [16].

2.3. Construction and Evaluation of Nomogram

After obtaining the optimal combination of variables, we created a nomogram model using the rms package to calculate cervical cancer metastasis risk scores based on the sum of projections of each variable and corresponding to the likelihood of metastases. Calibration curves, decision curve analysis (DCA), and clinical impact curves were employed to assess the reliability of the model.

2.4. Tumor Typing Based on Metastasis-Associated Microbiomes

Consequently, 294 samples with survival-related information were retained. In order to define differentially expressed microbiomes (DEMs) associated with survival, univariate Cox regression analysis was performed on all screened metastasis-associated microbiomes using the survival package. The ConsensusClusterPlus package was used to cluster all 294 samples, which were consistently clustered into two tumor subtypes. On the basis of overall survival time and status, Kaplan–Meier curves were used to compare survival differences between the two groups.

2.5. Immune Infiltration of Tumors with Different DEM Clusters

CIBERSORT is a deconvolution algorithm that enables the calculation of the proportion of immune cell types in a given tissue by combining the genomes of marker cells from different immune cell subpopulations [17]. We calculated immune cell infiltration in tumors of different DEM clusters using the CIBERSORT website (https://cibersortx.stanford.edu; accessed on 15 November 2022) and compared the expression levels of common immune checkpoints across different clusters.

2.6. Gene Expression Analysis and Functional Enrichment of Tumors with Different DEM Clusters

Differentially expressed genes (DEGs) of different DEM clusters were screened based on the RNA-seq expression of 297 samples using the limma package with the screening criteria of |log2 fold change (FC)| ≥ 0.5 and p value < 0.05. On the basis of these DEGs, Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses were performed using the clusterProfiler package.

2.7. Development and Validation of a Risk Scoring System Based on DEM and DEG

We divided all 294 samples into training and test sets and carried out univariate Cox regression analysis and LASSO regression screening variables utilizing all screened DEMs and DEGs as risk factors. We then used a multivariate Cox regression model with the STEP method for iteration, producing an AIC (Akaike information criterion) minimum multivariate Cox regression model with risk score = i=1nexpression of risk factor i×coefficient. To assess the predictive validity of the risk score model, we compared the risk scores of different DEM clusters in the training and test sets, and we used the survivalROC package to plot time-dependent ROC curves of risk scores on overall survival (OS) at 1, 3, and 5 years. In addition, Kaplan-Meier curves were plotted by dividing all populations into high- and low-risk groups in accordance with their risk scores. As an external validation set, we calculated individual risk scores for samples from the GSE52903 dataset in order to determine the predictive validity of the risk scores.

2.8. Validation and Development of a Prognostic Nomogram for Cervical Cancer

Traditionally, cervical cancer prognosis is determined by pathological stage and type. In order to predict the 1-year, 3-year, and 5-year OS rates of patients, we added risk scores to FIGO stage, AJCC stage, and age to create a nomogram model of cervical cancer prognosis. To verify the reliability of the nomogram model, we plotted calibration, time-dependent ROC, and DCA curves.

2.9. The Relationship between Risk Score and Immune Infiltration

Based on the results of the CIBERSORT algorithm, we assessed the correlation between risk score and immune cell abundance. In addition, the ESTIMATE algorithm was used to determine the stromal, immune, and ESTIMATE scores in each sample, which were compared between the high- and low-risk groups [18]. A heatmap was produced to show the correlation between the expression levels of independent risk factors in the risk assessment model and the abundance of 22 immune cells.

2.10. Statistical Analysis

Data processing, analysis, and presentation were carried out using R software (version 4.1.2) and its relevant packages. Sangerbox (http://www.sangerbox.com/tool, accessed on 2 January 2023) and ImageGP (http://www.ehbio.com/ImageGP/, accessed on 2 January 2023) were also used to visualize the results [19,20]. A two-sided p value of 0.05 was considered significant.

3. Results

3.1. Identification of Microbiomes Associated with Cervical Cancer Metastasis

Figure 1 illustrates the flow chart of this study. First, we classified all the samples according to metastasis status, and then obtained the differentially expressed microbiome. Next, we evaluated the predictive value of these microbiomes for metastasis and constructed a microbial classification system for tumors. Through the systematic analysis of tumors with different microbial classifications, we screened the prognosis-related risk factors, and finally built a fairly reliable prognosis prediction model. There were a total of 294 samples with microbial information, 64 of which were classified as metastasis groups, and 78 as non-metastasis groups, and clinical information for each group is presented in Table 1. There were 1406 microbiomes classified at the genus level, which comprised 62 archaea, 138 viruses, and 1206 bacteria. Of these, 1396 microorganisms remained after the low-abundance microbiomes were removed. Of the 15 differentially expressed microbiomes screened, 7 and 8 were highly expressed in the metastasis and non-metastasis groups, respectively. Figure 2a is a volcano plot showing the distribution of differentially expressed microbiomes, and Figure 2b, c illustrates the relative abundance of the 15 differentially expressed microbiomes. Based on Spearman correlation analysis, Figure 2d shows that Klebsiella and Robiginitomaculum had the highest positive correlation, whereas Micromonospora and Kobuvirus had the highest negative correlation.

Figure 1.

Figure 1

Flow chart illustrating the analysis of tumor metastasis-associated microbiomes in the TCGA–CESC.

Table 1.

Comparison of clinical baselines for different metastasis status subgroups.

Characteristic Non-Metastasis Metastasis All *
n 78 64 294
M, n (%)
M0 78 (100%) 25 (39.1%) 111 (37.8%)
M1 0 (0%) 9 (14.1%) 9 (3.1%)
MX 0 (0%) 30 (46.8%) 174 (59.1%)
N, n (%)
N0 78 (100%) 2 (3.1%) 129 (43.9%)
N1 0 (0%) 57 (89.1%) 57 (19.4%)
NX 0 (0%) 5 (7.8%) 108 (36.7%)
T, n (%)
T1 56 (71.8%) 30 (46.9%) 134 (45.6%)
T2 20 (25.6%) 22 (34.4%) 71 (24.2%)
T3 2 (2.6%) 7 (10.9%) 20 (6.8%)
T4 0 (0%) 3 (4.7%) 8 (2.7%)
Tis 0 (0%) 1 (1.6%) 1 (0.3%)
TX 0 (0%) 1 (1.6%) 60 (20.4%)
age, mean ± SD 47.26 ± 12.88 45.36 ± 11.93 48.22 ± 13.91

* All refers to all samples with microbial information; Tis: tumor in situ.

Figure 2.

Figure 2

Identification and comparison of microbiomes associated with cervical cancer metastasis. (a) Volcano plot of differentially expressed microbiomes, with red dots indicating high expression in the metastasis group, and blue dots indicating high expression in the non-metastasis group; (b) heatmap showing the relative abundance of 15 microbiomes in each sample; (c) box plots comparing the relative abundance of 15 microbiomes between the metastasis and non-metastasis groups; and (d) analysis of Spearman’s correlation between 15 microbiomes. (p < 0.05 *; p < 0.01 **; p < 0.001 ***).

3.2. Model Construction and Feature Selection in Machine Learning

The ability of the screened metastasis-associated microbiomes to predict cervical cancer metastasis was tested using three machine learning models: random forest (RF), generalized linear model (GLM), and support vector machine (SVM). According to the cumulative residual curve (Figure 3a) and the receiver operating characteristic (ROC) curve (Figure 3b), the generalized linear model was the most effective machine learning model. In Figure 3c, we ranked the importance of all the microbial features, and the feature–accuracy curve (Figure 3d) indicated that the model was most accurate when all 15 microbial features were included.

Figure 3.

Figure 3

Model construction and feature screening using machine learning algorithms. (a) Diagram of the reverse cumulative distribution of residuals in the RF, GLM, and SVM models; (b) ROC curves and AUC values for the three models; (c) ranking of feature importance; and (d) feature–accuracy curves for generalized linear models. RF, random forest; GLM, generalized linear model; SVM, support vector machine.

3.3. The Nomogram Model for Prediction of Cervical Cancer Metastasis

Machine learning models have high predictive power but poor practical utility, so we constructed a nomogram model (Figure 4a) based on the screened microbial features and then validated it. Calibration curves revealed that microbial features accurately predicted cervical cancer metastasis (Figure 4b). At certain probability thresholds, the DCA demonstrated that the nomogram model was able to achieve a higher net benefit than individual microbial features, and thus is more applicable to clinical practice (Figure 4c). The clinical impact curves illustrate the comparison of the cost–benefit ratio between predicted and true metastasis status for different risk thresholds (Figure 4d).

Figure 4.

Figure 4

Construction and validation of nomogram models. (a) Nomogram model based on 15 microbial features; (b) calibration curve of the nomogram; (c) clinical decision curve (DCA) of the nomogram; and (d) clinical impact curve of the nomogram showing the prediction accuracy and cost-benefit ratio of the nomogram under different risk thresholds.

3.4. DEM-Based Tumor Typing and Prognosis, Immune Infiltration

To classify cervical cancer into different types using microbial features, we selected the simplest possible typing method. Based on the expression of all 15 microbiomes, univariate Cox regression analysis found that five microbiomes, Methylobacter, Robiginitomaculum, Klebsiella, Micromonospora, and Microbispora were associated with survival. Of these, Methylobacter showed a negative association with mortality risk, and the other four had a positive association with mortality risk. Figure 5a shows the forest plot. We defined these five microorganisms associated with survival as differentially expressed microbiomes (DEMs), and their relative abundance was used to categorize all patients into two clusters (Figure 5b). The K–M curves indicated that DEM cluster 2 had a significantly better prognosis than DEM cluster 1 (Figure 5c). Figure 5d shows the expression of five microbiomes in different DEM clusters. DEM cluster 1 was characterized by Robiginitomaculum, Microbispora, Klebsiella, and Micromonospora, whereas DEM cluster 2 was characterized by Methylobacter. In DEM cluster 2, cytotoxic T-lymphocyte-associated protein 4 (CTLA-4) and programmed cell death protein 1 (PD-1) expression levels were significantly higher than in DEM cluster 1, indicating that DEM cluster 2 would be more likely to benefit from immunotherapy (Figure 5e). In addition, the immune microenvironment of the tumor differed slightly between the DEM subtypes, such as higher levels of CD8+ T-cell infiltration in DEM cluster 2 and Treg cell infiltration in DEM cluster 1, but none of these differences were statistically significant (Figure 5f).

Figure 5.

Figure 5

DEM-based tumor typing and prognosis, immune infiltration. (a) Forest plot of univariate Cox regression analysis of five microorganisms associated with survival; (b) consistent clustering of all cervical cancer samples into two subtypes based on DEM; (c) K–M survival curves of two DEM clusters; (d) heatmap showing the expression of five DEMs and their relationship with DEM clusters, age, FIGO stage, and TNM stage; (e) DEM clusters with different expressions of three immune checkpoints, PD1, PDL1, and CTLA4; and (f) immune cell infiltration of tumors with different DEM clusters (the number above the boxplot represents the p value). (p < 0.05 *; p < 0.01 **).

3.5. The DEGs and Functional Enrichment Derived from the DEM Cluster

Differential expression analysis was conducted using mRNA-seq data to identify 23 DEGs in cervical cancer tumors with different DEM clusters. The results showed that 20 genes were highly expressed in DEM cluster 1 and 3 genes were highly expressed in DEM cluster 2 (Figure 6a). We then performed functional enrichment analysis on all 23 DEGs, and GO analysis indicated that ferritin heavy chain 1 (FTH1), egl-9 family hypoxia-inducible factor 1 (EGLN1), and cytochrome P450 family 51 subfamily A member 1 (CYP51A1) were enriched in the iron and ferrous ion-binding pathways. The KEGG analysis revealed that FTH1 and PCBP1 were enriched in the ferroptosis pathway (Figure 6b). A heatmap of the correlation between all differentially expressed genes and differently expressed microbiomes was plotted (Figure 6c).

Figure 6.

Figure 6

DEG and functional enrichment analysis of different DEM clusters. (a) Volcano plot of DEG distribution. Red dots indicate high expression in DME cluster 1, and blue dots indicate high expression in DEM cluster 2; (b) analysis of DEGs based on GO and KEGG functional enrichment; and (c) heatmap of correlation between DEGs and all differentially expressed microbiomes, microbiomes in bold and marked with ⋆ are DEMs. (p < 0.05 *; p < 0.01 **; p < 0.001 ***).

3.6. The Development and Validation of a Cervical Cancer Prognostic Risk Score Model

In order to construct a prognostic scoring system for cervical cancer, we used gene expressions in all DEMs and DEGs with a total of 28 variables as predictors. All patients were randomly assigned to a training set (n = 147) and a test set (n = 147) in a 1:1 ratio, and the two groups were matched according to the percentage of deaths. In the training set, the 28 variables were sequentially screened using univariate Cox regression (Figure 7a), LASSO regression (Figure 7b,c), and multivariate Cox regression analysis to determine the best model. As a result, we obtained an optimal risk score model with risk score = 0.298 × expression of FTH1 + 0.548 × expression of EGLN1, which had a C-index = 0.722.

Figure 7.

Figure 7

The construction and validation of a prognostic risk score model. (a) Univariate Cox regression in the training set to screen for survival-related variables, risk factors for survival are on the right side of the dotted line, and protective variables are on the left.; (b) cross-validation curves of LASSO regression; (c) path coefficient plots of LASSO regression; (d,e) comparison of prognostic risk scores for different DEM clusters; (f,g) K–M curves for patients with different risk score classes; (h,i) ROC curves for predictive validity of risk score models for 1-year, 3-year, and 5-year OS rates; (j) scatter plots showing survival status, risk score distribution, and heatmaps of expressions for two key variables for all patients; (k) prognostic K–M curves for patients with different risk score classes in the GSE52903 cohort; and (l) ROC curves for predictive validity of the risk score model in the GSE52903 cohort.

Subsequently, by comparing the prognostic risk scores of the different DEM clusters in both the training and test sets, we found that DEM cluster 1 had a significantly higher prognostic risk score than DEM cluster 2 (Figure 7d,e). According to their risk scores, we divided all the patients into a high-risk group and a low-risk group, and the KM curve analysis revealed that the prognosis of the high-risk group was significantly worse in both training and testing (Figure 7f,g). The areas under the ROC curve for the predictive validity of the risk score model for the 1-year, 3-year, and 5-year OS rate in the test set were 0.718, 0.775, and 0.794 (Figure 7h), respectively, and the areas under the ROC curve in the training set were 0.752, 0.712, and 0.741 (Figure 7i), respectively, indicating that the risk score model was fairly accurate in predicting OS rate. The expression of the two genes included in the risk score model was plotted as a heatmap, and the risk score and survival status scatter plots showed that as the risk score increased, mortality increased, and OS rate gradually decreased (Figure 7j).

Moreover, to demonstrate the generalizability of the risk score model, we calculated risk scores for 55 tumor patients in the GSE52903 cohort. The KM curves revealed that patients with high risk had significantly worse prognoses than those with low risk (p = 0.023) (Figure 7k). According to the ROC curves, the areas under the AUC curve for the model’s 1-year, 3-year, and 5-year OS rate predictions were 0.723, 0.606, and 0.571, respectively (Figure 7l).

3.7. Nomogram Model for Cervical Cancer Prognosis and Validation

In order to make the risk scoring system more practical, we incorporated FIGO stage, TNM stage, age, and risk score to build a nomogram model of cervical cancer prognosis for predicting 1-year, 3-year, and 5-year OS rate (Figure 8a). To validate the model, we used patient number 4, with a FIGO stage IV, T3NXMX, age less than 45 years, and a risk score grouped as high-risk, resulting in a total score of 531, predicting a survival rate of 50.7% at 1 year, 7.86% at 3 years, and 2.24% at 5 years. Notably, the survival rate of this patient was 2.27 years, indicating the high predictive validity of this model. Calibration curves showed a good agreement between actual and model-predicted risks (Figure 8b). ROC curves were plotted separately for 1-year, 3-year, and 5-year OS rates using independent predictors and the nomogram model, and these showed that the nomogram model had greater predictive validity than the independent predictors (Figure 8c–e). As can be seen from the clinical decision curves, the nomogram model consistently achieved greater net benefits at most probability thresholds (Figure 8f–h).

Figure 8.

Figure 8

Development and validation of the nomogram model for the prognosis of cervical cancer. (a) Scoring system of the nomogram model, where red dots indicate the scores of each index, total score, and predicted survival probability for patient number 4; (b) calibration curves of the nomogram model; (ce) ROC curves of the 1-year, 3-year, and 5-year OS rate predictive validity of the nomogram model and other independent predictors; (fh) clinical decision curves for the 1-year, 3-year, and 5-year OS rates using the nomogram model and other independent predictors.

3.8. The Relationship between Prognostic Risk Score and Immune Cell Infiltration

The expression of 22 immune cells within the tumor was determined using the CIBERSORT algorithm and correlated with the risk score. It was found that the risk score correlated positively with the infiltration of resting memory CD4+ T cells, M0 macrophages, and activated mast cells, and negatively with the infiltration of CD8+ T cells, activated CD4+ memory T cells, Tfh (follicular helper T cells), and resting mast cells (Figure 9a). Two key genes (FTH1 and EGLN1) were also associated with different immune cells (Figure 9b). In addition, both the Immune score and ESTIMATE score were lower in the high-risk group than in the low-risk group (Figure 9c).

Figure 9.

Figure 9

Relationship between risk score and immune infiltration. (a) Analysis of immune cell types associated with risk score (R represents Pearson correlation coefficient; p represents the significance levels); (b) heatmap of correlation between expression levels of FTH1 and EGLN1 and immune cells; and (c) immunity scores observed within different groups of risk scores. (p < 0.05 *; p < 0.01 **; p < 0.001 ***).

4. Discussion

The definitive cause of cervical cancer is persistent infection with high-risk HPV, and the vaginal microbiome can have a significant impact on HPV infection and cervical precancer by altering pH levels and lactate and hydrogen peroxide concentrations in the vagina and by directly interacting with cervical epithelial cells [21,22,23]. An in vitro experiment showed that lactic acid, a metabolite of Lactobacillus iners, activates the Wnt pathway through the lactate-Gpr81 complex, thereby increasing the level of core fucosylation in epithelial cells and inhibiting the proliferation and migration of cervical cancer cells [24]. In contrast to open cavities such as the lower genital and gastrointestinal tracts, there is a low abundance of microorganisms in tumor tissue. However, they are still a significant component of the tumor microenvironment, and the specificity of these microbiomes within specific tumors suggests that they might be associated with tumorigenesis and progression [11]. Microbiomes may contribute to tumor development in three different ways: (1) directly promoting tumorigenesis by increasing mutations, (2) modulating oncogenes or oncogenic pathways, and (3) inhibiting or promoting tumor progression by modulating the immune system of the host [25]. Poore et al. developed a workflow that allowed us to hybridize the data in TCGA with microbiomics, in which approximately 2% of all sequences in the CESC project were identified as being of microbial origin [13]. Using the results of the above research, microbial abundance was found to be a better predictor of prognosis in cervical cancer than clinical factors [26]. Understanding the mechanism of cervical cancer metastasis can assist in identifying high-risk cervical cancer cases early and reducing their lethality. In recent years, research on the mechanisms of cervical cancer metastasis has focused on oncogenes and their associated signaling pathways, and some recent studies have also demonstrated the regulatory role of noncoding RNAs [27]. Fu et al. [10] found that intracellular bacteria could promote lung metastasis of breast cancer cells, the mechanism of which may be that intracellular bacteria invade tumor cells, remodel the cytoskeleton via the RhoA–ROCK pathway, enhance the tolerance of circulating tumor cells to intravascular mechanical pressure, and reduce cell death during metastasis. If microbes are involved in the metastasis of breast cancer, could the same be true of the microbiomes within cervical tumors?

According to our findings, some intratumor microbiomes associated with cervical cancer can effectively predict metastases and are closely related to the prognosis of cervical cancer. Among them, we identified 15 genera of microbiomes as being associated with cervical cancer metastasis. A machine learning model and nomogram model of these 15 microbiomes were capable of accurately predicting the risk of cervical cancer metastasis. Overall survival rate was also positively correlated with five of these microbiomes. The risk of death from cervical cancer was positively associated with Robiginitomaculum, Klebsiella, Micromonospora, and Microbispora, but negatively associated with Methylobacter. It is possible to predict the prognosis of cervical cancer by evaluating the relative abundance of these five microbiomes and the corresponding tumor classification. Klebsiella has been linked to the incidence and development of a variety of malignancies. An in vitro investigation demonstrated that Klebsiella pneumoniae might increase the generation of reactive oxygen species and the expression of HIF-mRNA in airway epithelial cells, resulting in epithelial mesenchymal transition, which is frequently the foundation of tumor cell metastasis [28]. In clinical observation, the prevalence of Klebsiella was much higher in esophageal squamous cell carcinoma tissues than in healthy esophageal tissues, and K. pneumoniae in the bile of patients with pancreatic ductal adenocarcinoma can lead to gemcitabine resistance and worsen prognosis [29,30]. Furthermore, Klebsiella may also be involved in the development of bladder and colorectal cancers [31,32]. The other four bacteria are not abundant in cervical cancer tissue, and few reports have been published that relate it to human disease.

However, do these low-abundance microbes enter the tumor or immune cells at random? Do tumor cells play an active selection role in this process? This area has rarely been explored in previous studies. By combining transcriptomic and microbiomic analyses of tumors, our study demonstrated that microbial signatures predisposed to tumor metastasis were also associated with the gene expression profiles of tumor cells. Specifically, tumors with high expression of genes such as FTH1, EGLN1, and BCPB1 had more microbiomes associated with tumor metastasis. Most of these genes were enriched in pathways such as ferroptosis and iron transport. Ferroptosis is an iron-dependent programmed cell death that disrupts the structural integrity of cell membranes through the accumulation of lipid peroxides, which in turn leads to cell death [33]. As tumor cells are metabolically active and produce large amounts of reactive oxygen species (ROS), they require more iron reserves to maintain high levels of iron death, and iron depletion inhibits tumor cell growth and metastasis through this mechanism [33,34]. FTH1 encodes the ferritin heavy chain, a key subunit of ferritin that plays an important role in catalyzing the Fe2+ oxidation reaction, and FTH1 overexpression can lead to iron overload and inhibit ferroptosis [35]. An in vitro experiment demonstrated a significant increase in ferrous iron levels and ROS levels in macrophages after infection with bacteria, and the transportation of iron into bacterial vesicles was also found to induce bacterial death, which may be an effective defense mechanism for cellular clearance of pathogen infection [36]. Taking the results of our study into account, altered endogenous ferroptosis regulators may decrease bacterial clearance by inhibiting ferroptosis. This, in turn, allows bacteria to survive in these cells and perform pro-tumorigenic and metastatic functions. The upregulation of FTH1 in tumor cells or immune cells, for example, may contribute to a reduction in the clearance of bacteria by inhibiting ferroptosis.

Based on differentially expressed genes from different DEM clusters, we constructed a risk score model for cervical cancer prognosis and tested the predictive value of the model. Based on the results, the model had high predictive power for the 1-year, 3-year, and 5-year cervical cancer OS rates and could be extrapolated to the GSE dataset for validation. In addition to being associated with prognosis, the risk score was also related to the mode of immune infiltration. We found a reduced abundance of cells with antitumor immunity, such as CD8+ T cells and Tfh cells, together with a lower immune score and ESTIMATE score in the high-risk group, suggesting that higher tumor purity and absence of immune cells is another high-risk factor for tumor metastasis [18,37].

In this study, we propose a potential novel mechanism of interaction between tumor progression- and metastasis-associated microbiomes and host cells; however, some limitations are involved. Firstly, only primary tumors were sampled, and samples from metastatic lesions were not available for examination of the enrichment of intratumor microbiomes. Secondly, although the scoring system was validated using an external cohort, it was still derived from a public database and requires additional data with larger sample sizes in order to be prospectively validated. In addition, the data in the microbial database are normalized gene expressions, which can only provide information about the relative abundance of microorganisms within the tumor tissue rather than the absolute number, which is a prerequisite for understanding the role of microorganisms in cancer cells. As a final point, further in vivo and ex vivo experiments are required to determine the mechanism of action of tumor cells and microbiomes. There are still outstanding questions in the field of tumor microbiomes, such as, by what mechanisms microbiomes bind to and invade tumor/immune cells, whether tumor/immune cells passively invade or actively recruit microbiomes to acquire novel capabilities, and what other roles microbiomes within tumor cells play in the process of tumor metastasis. The answers to these questions will help us to find new targets for tumor treatment.

5. Conclusions

By comparing the gene sequences of microbial origin in the TCGA–CESC database with different metastatic status, we obtained 15 differentially expressed microbiomes, which we then used to construct machine learning and nomogram models to predict the risk of cervical cancer metastasis. Five of these microbiomes (Robiginitomaculum, Klebsiella, Micromonospora, Microbispora, and Methylobacter) were associated with cervical cancer prognosis, and, dependent on their expression, we built tumor microbiome clusters. Additionally, based on differentially expressed genes in patients with different DEM clusters, we constructed a model for prognostic risk scoring of cervical cancer patients, which achieved accurate prediction in the training set, test set, and external cohort. Lastly, we hypothesized that the differential expression of endogenous genes in tumor tissues could influence the type and distribution of intracellular microbiomes through functional enrichment analysis. For example, upregulation of the FTH1 gene inhibits the destructive effect of ferroptosis on intracellular microbiomes, resulting in a microbial state that is more favorable for tumor metastasis. These findings expand the field of tumor microbial study and contribute to the identification of new targets for tumor therapy as well as to the reduction of tumor metastasis and mortality rates through the use of microbiomes and differentially expressed genes to predict tumor prognosis.

Author Contributions

Conceptualization, X.Y. and Y.Z.; methodology, L.J.; software, L.J. and B.D.; validation, L.J., B.D. and P.J.; formal analysis, L.J. and B.D.; investigation, L.J. and B.D.; resources, L.J., B.D. and P.J.; data curation, P.J.; writing—original draft preparation, L.J. and B.D.; writing—review and editing, X.Y. and Y.Z.; visualization, L.J., B.D. and P.J.; supervision, X.Y. and Y.Z.; project administration, Y.Z. All authors have read and agreed to the published version of the manuscript.

Institutional Review Board Statement

TCGA and GEO belong to public databases. Ethical approval was obtained from the patients included in the database. Users can download relevant data free of charge for research and publish relevant articles. Our study is based on open-source data, therefore there are no ethical issues or other conflicts of interest.

Informed Consent Statement

Informed consent was obtained from the patients involved in the TCGA and GEO databases.

Data Availability Statement

These data were derived from the following resources available in the public domain: (https://cbioportal-datahub.s3.amazonaws.com/; accessed on 1 November 2022) and (https://ncbi.nlm.nih.gov/geo/ query/acc.cgi?acc= GSE52903; accessed on 15 November 2022).

Conflicts of Interest

The authors declare no conflict of interest.

Funding Statement

This study received funding from National High Level Hospital Clinical Research Funding (Interdepartmental Clinical Research Project of Peking University First Hospital) (Grant No.2022CR19) and National Key R&D Program of China (Grant No.2020AAA0105200).

Footnotes

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

References

  • 1.Siegel R.L., Miller K.D., Fuchs H.E., Jemal A. Cancer statistics, 2022. CA Cancer J. Clin. 2022;72:7–33. doi: 10.3322/caac.21708. [DOI] [PubMed] [Google Scholar]
  • 2.Schiffman M., Doorbar J., Wentzensen N., de Sanjosé S., Fakhry C., Monk B.J., Stanley M.A., Franceschi S. Carcinogenic human papillomavirus infection. Nat. Rev. Dis. Prim. 2016;2:16086. doi: 10.1038/nrdp.2016.86. [DOI] [PubMed] [Google Scholar]
  • 3.Cohen P.A., Jhingran A., Oaknin A., Denny L. Cervical cancer. Lancet. 2019;393:169–182. doi: 10.1016/S0140-6736(18)32470-X. [DOI] [PubMed] [Google Scholar]
  • 4.Mitra A., MacIntyre D.A., Marchesi J.R., Lee Y.S., Bennett P.R., Kyrgiou M. The vaginal microbiota, human papillomavirus infection and cervical intraepithelial neoplasia: What do we know and where are we going next? Microbiome. 2016;4:58. doi: 10.1186/s40168-016-0203-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Liu J., Luo M., Zhang Y., Cao G., Wang S. Association of high-risk human papillomavirus infection duration and cervical lesions with vaginal microbiota composition. Ann. Transl. Med. 2020;8:1161. doi: 10.21037/atm-20-5832. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.So K.A., Yang E.J., Kim N.R., Hong S.R., Lee J.H., Hwang C.S., Shim S.H., Lee S.J., Kim T.J. Changes of vaginal microbiota during cervical carcinogenesis in women with human papillomavirus infection. PLoS ONE. 2020;15:e0238705. doi: 10.1371/journal.pone.0238705. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Qian X.B., Chen T., Xu Y.P., Chen L., Sun F.X., Lu M.P., Liu Y.X. A guide to human microbiome research: Study design, sample collection, and bioinformatics analysis. Chin. Med. J. 2020;133:1844–1855. doi: 10.1097/CM9.0000000000000871. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Nejman D., Livyatan I., Fuks G., Gavert N., Zwang Y., Geller L.T., Rotter-Maskowitz A., Weiser R., Mallel G., Gigi E., et al. The human tumor microbiome is composed of tumor type-specific intracellular bacteria. Science. 2020;368:973–980. doi: 10.1126/science.aay9189. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Livyatan I., Nejman D., Shental N., Straussman R. Characterization of the human tumor microbiome reveals tumor-type specific intra-cellular bacteria. Oncoimmunology. 2020;9:1800957. doi: 10.1080/2162402X.2020.1800957. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Fu A., Yao B., Dong T., Chen Y., Yao J., Liu Y., Li H., Bai H., Liu X., Zhang Y., et al. Tumor-resident intracellular microbiota promotes metastatic colonization in breast cancer. Cell. 2022;185:1356–1372.e26. doi: 10.1016/j.cell.2022.02.027. [DOI] [PubMed] [Google Scholar]
  • 11.Hanahan D. Hallmarks of Cancer: New Dimensions. Cancer Discov. 2022;12:31–46. doi: 10.1158/2159-8290.CD-21-1059. [DOI] [PubMed] [Google Scholar]
  • 12.Hutter C., Zenklusen J.C. The Cancer Genome Atlas: Creating Lasting Value beyond Its Data. Cell. 2018;173:283–285. doi: 10.1016/j.cell.2018.03.042. [DOI] [PubMed] [Google Scholar]
  • 13.Sepich-Poore G.D., Zitvogel L., Straussman R., Hasty J., Wargo J.A., Knight R. The microbiome and human cancer. Science. 2021;371:eabc4552. doi: 10.1126/science.abc4552. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Olawaiye A.B., Baker T.P., Washington M.K., Mutch D.G. The new (Version 9) American Joint Committee on Cancer tumor, node, metastasis staging for cervical cancer. CA Cancer J. Clin. 2021;71:287–298. doi: 10.3322/caac.21663. [DOI] [PubMed] [Google Scholar]
  • 15.Medina-Martinez I., Barrón V., Roman-Bassaure E., Juárez-Torres E., Guardado-Estrada M., Espinosa A.M., Bermudez M., Fernández F., Venegas-Vega C., Orozco L., et al. Impact of gene dosage on gene expression, biological processes and survival in cervical cancer: A genome-wide follow-up study. PLoS ONE. 2014;9:e97842. doi: 10.1371/journal.pone.0097842. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Jin X., Wang J., Ge L., Hu Q. Identification of Immune-Related Biomarkers for Sciatica in Peripheral Blood. Front. Genet. 2021;12:781945. doi: 10.3389/fgene.2021.781945. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Chen B., Khodadoust M.S., Liu C.L., Newman A.M., Alizadeh A.A. Profiling Tumor Infiltrating Immune Cells with CIBERSORT. Methods Mol. Biol. 2018;1711:243–259. doi: 10.1007/978-1-4939-7493-1_12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Yoshihara K., Shahmoradgoli M., Martínez E., Vegesna R., Kim H., Torres-Garcia W., Treviño V., Shen H., Laird P.W., Levine D.A., et al. Inferring tumour purity and stromal and immune cell admixture from expression data. Nat. Commun. 2013;4:2612. doi: 10.1038/ncomms3612. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Shen W., Song Z., Zhong X., Huang M., Shen D., Gao P., Qian X., Wang M., He X., Wang T., et al. Sangerbox: A comprehensive, interaction-friendly clinical bioinformatics analysis platform. iMeta. 2022;1:e36. doi: 10.1002/imt2.36. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Chen T., Liu Y.-X., Huang L. ImageGP: An easy-to-use data visualization web server for scientific researchers. iMeta. 2022;1:e5. doi: 10.1002/imt2.5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Amabebe E., Anumba D.O.C. The Vaginal Microenvironment: The Physiologic Role of Lactobacilli. Front. Med. 2018;5:181. doi: 10.3389/fmed.2018.00181. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Rokos T., Holubekova V., Kolkova Z., Hornakova A., Pribulova T., Kozubik E., Biringer K., Kudela E. Is the Physiological Composition of the Vaginal Microbiome Altered in High-Risk HPV Infection of the Uterine Cervix? Viruses. 2022;14:2130. doi: 10.3390/v14102130. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Frąszczak K., Barczyński B., Kondracka A. Does Lactobacillus Exert a Protective Effect on the Development of Cervical and Endometrial Cancer in Women? Cancers. 2022;14:4909. doi: 10.3390/cancers14194909. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Fan Q., Wu Y., Li M., An F., Yao L., Wang M., Wang X., Yuan J., Jiang K., Li W., et al. Lactobacillus spp. create a protective micro-ecological environment through regulating the core fucosylation of vaginal epithelial cells against cervical cancer. Cell Death Dis. 2021;12:1094. doi: 10.1038/s41419-021-04388-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Wong-Rolle A., Wei H.K., Zhao C., Jin C. Unexpected guests in the tumor microenvironment: Microbiome in cancer. Protein Cell. 2021;12:426–435. doi: 10.1007/s13238-020-00813-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Hermida L.C., Gertz E.M., Ruppin E. Predicting cancer prognosis and drug response from the tumor microbiome. Nat. Commun. 2022;13:2896. doi: 10.1038/s41467-022-30512-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Cheng T., Huang S. Roles of Non-Coding RNAs in Cervical Cancer Metastasis. Front. Oncol. 2021;11:646192. doi: 10.3389/fonc.2021.646192. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Leone L., Mazzetta F., Martinelli D., Valente S., Alimandi M., Raffa S., Santino I. Klebsiella pneumoniae Is Able to Trigger Epithelial-Mesenchymal Transition Process in Cultured Airway Epithelial Cells. PLoS ONE. 2016;11:e0146365. doi: 10.1371/journal.pone.0146365. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Weniger M., Hank T., Qadan M., Ciprani D., Michelakos T., Niess H., Heiliger C., Ilmer M., D’Haese J.G., Ferrone C.R., et al. Influence of Klebsiella pneumoniae and quinolone treatment on prognosis in patients with pancreatic cancer. Br. J. Surg. 2021;108:709–716. doi: 10.1002/bjs.12003. [DOI] [PubMed] [Google Scholar]
  • 30.Hu M., Bai W., Zhao C., Wang J. Distribution of esophagus flora in esophageal squamous cell carcinoma and its correlation with clinicopathological characteristics. Transl. Cancer Res. 2020;9:3973–3985. doi: 10.21037/tcr-20-1954. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Kaur C.P., Vadivelu J., Chandramathi S. Impact of Klebsiella pneumoniae in lower gastrointestinal tract diseases. J. Dig. Dis. 2018;19:262–271. doi: 10.1111/1751-2980.12595. [DOI] [PubMed] [Google Scholar]
  • 32.Mansour B., Monyók Á., Makra N., Gajdács M., Vadnay I., Ligeti B., Juhász J., Szabó D., Ostorházi E. Bladder cancer-related microbiota: Examining differences in urine and tissue samples. Sci. Rep. 2020;10:11042. doi: 10.1038/s41598-020-67443-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Jiang X., Stockwell B.R., Conrad M. Ferroptosis: Mechanisms, biology and role in disease. Nat. Rev. Mol. Cell Biol. 2021;22:266–282. doi: 10.1038/s41580-020-00324-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.López-Filloy M., Cortez F.J., Gheit T., Cruz Y.C.O., Cruz-Talonia F., Chávez-Torres M., Arteaga-Gómez C., Mancilla-Herrera I., Montesinos J.J., Cortés-Morales V.A., et al. Altered Vaginal Microbiota Composition Correlates With Human Papillomavirus and Mucosal Immune Responses in Women With Symptomatic Cervical Ectopy. Front. Cell. Infect. Microbiol. 2022;12:884272. doi: 10.3389/fcimb.2022.884272. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Tian Y., Lu J., Hao X., Li H., Zhang G., Liu X., Li X., Zhao C., Kuang W., Chen D., et al. FTH1 Inhibits Ferroptosis Through Ferritinophagy in the 6-OHDA Model of Parkinson’s Disease. Neurotherapeutics. 2020;17:1796–1812. doi: 10.1007/s13311-020-00929-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Ma R., Fang L., Chen L., Wang X., Jiang J., Gao L. Ferroptotic stress promotes macrophages against intracellular bacteria. Theranostics. 2022;12:2266–2289. doi: 10.7150/thno.66663. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Bruni D., Angell H.K., Galon J. The immune contexture and Immunoscore in cancer prognosis and therapeutic efficacy. Nat. Rev. Cancer. 2020;20:662–680. doi: 10.1038/s41568-020-0285-7. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

These data were derived from the following resources available in the public domain: (https://cbioportal-datahub.s3.amazonaws.com/; accessed on 1 November 2022) and (https://ncbi.nlm.nih.gov/geo/ query/acc.cgi?acc= GSE52903; accessed on 15 November 2022).


Articles from Cancers are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)

RESOURCES