Skip to main content
International Journal of Hepatology logoLink to International Journal of Hepatology
. 2025 Aug 25;2025:5532643. doi: 10.1155/ijh/5532643

Identification of Key Gene Related to Matrisome in HBV-Associated Liver Cirrhosis via Bioinformatics Analysis

Ram Prasad Chaulagain 1, Yelona Shrestha 2, Khuzin Dinislam 3, Shizhu Jin 1,
PMCID: PMC12401599  PMID: 40901642

Abstract

Background: Hepatitis B virus (HBV)–associated liver cirrhosis, characterized by progressive fibrosis and regenerative nodule formation, remains a critical public health concern due to its high risk of progression to hepatocellular carcinoma (HCC). The matrisome—comprising extracellular matrix (ECM) components such as collagens, laminins, fibronectin, glycoproteins, and proteoglycans—plays a pivotal role in disease pathogenesis. Previous studies have shown that HBV infection modulates ECM composition and activates fibrogenic responses through hepatic stellate cells, contributing to cirrhosis and eventual HCC development. However, key ECM-related genes driving HBV-induced cirrhosis remain poorly understood.

Methods: Bulk RNA-seq data from 30 normal and 30 HBV-related cirrhotic liver tissues were analyzed. Differentially expressed genes (DEGs) were identified using the Limma package based on thresholds of p < 0.01 and |log2 fold change| > 1. ECM-related genes were curated from the Molecular Signatures Database (MsigDB). Functional significance was assessed via random forest classification (accuracy: 91%, recall: 90%), Gene Ontology (GO), and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses.

Results: Among 14,470 analyzed genes, 2125 were upregulated and 3689 downregulated in cirrhotic tissues. Upregulated genes (COX6B1, RPS10) were linked to metabolic reprogramming, while downregulated genes (PLCG2, ARHGEF12) implicated immune dysregulation. A subset of 274 ECM-related DEGs (178 upregulated, 96 downregulated) was identified, including CTSA, CRELD2, MAPK10, and ITGA1. Pathway analysis highlighted dysregulation of Ras/MAPK and ERBB signaling pathways associated with fibrogenesis and tumorigenesis.

Conclusions: This bioinformatics study delineates key matrisome–associated genes and pathways in HBV-related cirrhosis, offering novel insights into potential biomarkers and therapeutic targets. Further validation in larger cohorts is warranted to confirm clinical relevance.

Keywords: ECM genes, extracellular matrix, hepatocellular carcinoma, liver cirrhosis, matrisome

1. Introduction

Progressive fibrosis and regenerative nodular development are hallmarks of liver cirrhosis, a serious public health concern. Viral hepatitis, alcohol, nonalcoholic fatty liver disease, and autoimmune diseases are the leading causes of cirrhosis worldwide, and liver disease accounts for 4% of all deaths worldwide [1]. The World Health Organization stated that the Hepatitis B virus (HBV) caused 820,000 deaths and infected over 1.5 million new cases in 2019, affecting approximately 296 million people worldwide. The Polaris Observatory group's 2022 modeling research calculated that the worldwide HBV prevalence was 3.2%, or approximately 257.5 million cases [2]. In China, the seroprevalence of hepatitis B surface antigen (HBsAg) ranges from 3.7% to 10.4%, with a total of 74.6 million chronic HBV patients, making it the country with the largest number of chronic HBV cases worldwide [3]. Among Chinese patients with HBV-related cirrhosis, approximately one-third (31.5%) developed HCC over five years, with a cumulative incidence of 22.9% (95% CI: 20.8–25.2%) [4]. Furthermore, studies estimate the annual incidence of hepatocellular carcinoma (HCC) in cirrhotic patients to be 2%–5% [5]. Similarly, 30% of individuals with chronic HBV infection develop liver cirrhosis and HCC because of persistent liver inflammation and fibrosis [6]. The lack of screening facilities and low public awareness have been attributed to the increase in HBV-associated cirrhosis [7]. Although there are a number of methods to detect cirrhosis early, including blood tests, liver enzymes, and imaging tools, biopsy remains the most reliable procedure [8]. Computed tomography (CT), magnetic resonance imaging (MRI), and ultrasound are noninvasive imaging techniques that are expensive and inaccessible in developing countries. Serum biomarkers and hepatic stiffness measurements are becoming useful in detecting asymptomatic early-stage cirrhosis [9]. MicroRNAs (miRNAs) and other circulating nucleic acid biomarkers have shown promise as diagnostic, prognostic, and therapeutic tools for liver diseases, particularly HBV-HCC [10]. Therefore, serum markers offer a noninvasive method for the early screening and diagnosis of liver cirrhosis.

The term “matrisome” refers to the extracellular matrix (ECM) and its constituent proteins, including collagen, laminins, fibronectin, glycoproteins, and proteoglycans [11]. Maintaining tissue homeostasis and regulating cellular processes such as adhesion, migration, and signaling depend on the matrisome [12]. The matrisome maintains an ideal microenvironment and controls cellular activity to maintain tissue homeostasis. The ECM influences cell survival, proliferation, and differentiation by interacting with cells via cell adhesion receptors such as integrins. To maintain a balance between cell development and apoptosis, it provides biochemical and mechanical cues that guide cell behavior and reactions to external stimuli [13]. The matrisome also helps regulate waste clearance, nutrition exchange, and blood flow, all of which are essential for maintaining liver homeostasis [14]. Numerous cytokines, signaling molecules, and growth factors are stored in the matrisome. To affect cell behavior, the release and trapping of these molecules can be controlled. Liver fibrogenesis and repair depend on the matrisome. After damage, it arranges tissues and provides a scaffold for cell migration. Additionally, fibronectin and fibrin are found in numerous ECM components and are crucial for both promoting and enabling coagulation and wound healing [15, 16]. However, changes in the composition and structure of the ECM hinder tissue remodeling, hepatic cellular signaling, and cell-to-cell communication [17, 18]. According to the evidence, HBV strongly affects matrisomal genes, promoting liver fibrogenesis by upregulating genes linked to scar tissue and accelerating the growth of hepatic stellate cells, which ultimately leads to HCC [19]. The hepatitis B virus X protein (HBx) enhances liver fibrosis by upregulating proteins involved in matrix remodeling, such as Collagen1a, α-Sma, PdgfR-β, and MMP-13, which are crucial for ECM deposition and remodeling, thereby promoting fibrogenesis [20]. In contrast, NAFLD-related HCC is characterized by more preserved liver function and a higher proportion of noncirrhotic HCC, despite similar tumor stages at diagnosis compared to HBV-related HCC [21]. The overproduction and deposition of fibrillar collagens, especially types I and III, fibronectin, and other matrix components, which result in fibrotic scarring and distortion of normal liver architecture, are hallmarks of aberrant ECM modification that starts to emerge in cirrhosis [22]. Crucially, myofibroblasts are the primary generators of the ECM in liver cirrhosis [23].

Understanding the role of matrix metalloproteins (MMPs) and tissue inhibitors of metalloproteinases (TIMPs) is vital. It is commonly known that inflammatory and profibrotic stimuli increase MMPs, particularly MMP-2, MMP-9, and MMP-13, which in turn trigger ECM deposition and ultimately result in liver cirrhosis [24]. However, MMP-induced ECM degradation in the liver tilted the scales to lessen ECM deposition of nearly all types, which led to the development of hepatic fibrosis and scarring. Additionally, it is known that target-specific inhibitors called TIMPs, as well as gene transcription, zymogen activation, and enzyme release, modulate MMP activity and expression at multiple stages [25]. Stellate cells in the liver provide fibrogenic materials that stimulate the production of collagen, especially through bone marrow-derived portal fibroblasts, fibroblasts, and myofibroblasts [26].

Bioinformatics has become a powerful tool in the field of biomedicine, particularly for creating biomarkers. With the use of sophisticated AI capabilities and analytical methods, bioinformatics has made it feasible to display liver cirrhosis data from a variety of datasets, including the matrisome [27, 28]. By enabling the analysis and interrogation of large-scale omics data, such as transcriptomics, proteomics, and metabolomics, bioinformatics has made it possible to take into account all aspects of liver cirrhosis pathology, including its dynamics and composition [29]. In this study, we determined the key genes involved in the onset and progression of HBV-related cirrhosis and investigated the matrisome-related aspects of HBV-associated liver cirrhosis. Key matrisome-related genes are crucial for the diagnosis of liver cirrhosis because of their ability to predict the disease progression. The basement membrane component containing the Type IV collagen α5 chain can potentially be used as predictive biomarkers for early-stage liver cirrhosis [30]. The development of liver fibrosis is significantly influenced by minimal type IV collagen [31].

2. Method

2.1. Data Acquisition and Processing

The data for this study were obtained from the Gene Expression Omnibus (GEO) database (https://www.ncbi.nlm.nih.gov/geo/). A total of 60 liver tissue samples were included, comprising 30 normal samples and 30 HBV-related cirrhosis samples that are provided in Supporting Information 1: File S1. Gene expression profiles were processed and organized using R language (Version 4.2.0).

2.2. Identification of Differentially Expressed Genes (DEGs)

DEG analysis was performed using the Limma package in R. To control for false discovery due to multiple hypothesis testing, p values were adjusted using the Benjamini–Hochberg method to obtain false discovery rates (FDR). Genes with adjusted p values (FDR) < 0.01 and |log2 fold change| > 1 were considered significantly differentially expressed. These thresholds are commonly used in transcriptomic studies to balance discovery with statistical rigor. Visualization of DEGs was carried out using volcano plots (ggplot2) and heatmaps (pheatmap).

2.3. GO Enrichment and KEGG Enrichment Analysis

The clusterProfiler software [32] package was used to perform GO and KEGG enrichment analyses. This analysis determined the pathways that were significantly enriched with various statistical criteria defined, including a p value threshold of < 0.05 and a q-value threshold of < 0.02. These thresholds were used to ascertain the importance of the pathway enrichment, revealing the most important pathways linked to the studied conditions.

2.4. ECM-Related Genes

The ECM gene sets were obtained from the MsigDB database (https://www.gsea-msigdb.org/gsea/msigdb). The following ECM gene sets were used: KEGG_ECM_RECEPTOR_INTERACTION, KEGG_FOCAL_ADHESION, NABA_ECM_AFFILIATED, NABA_ECM_GLYCOPROTEINS, NABA_ECM_REGULATORS, and REACTOME_ECM_PROTEOGLYCANS.

Intersection analysis was performed between the ECM gene sets obtained from MsigDB full ECM gene set in Supporting Information 2: File S2. The DEGs identified in this study are aimed at identifying differentially expressed ECM genes. The analysis was done with the aim of isolating a specific set of DEGs that perform ECM-related functions. This study is aimed at understanding the role of ECM-related genes and how they were affected under the various conditions by blending the ECM gene sets from MsigDB with DEGs analysis.

2.5. Identification of Feature Genes Using Random Forest

A supervised machine learning approach based on the Random Forest [33] algorithm was implemented using the random forest package in R (Version 4.2.0). The classifier was trained on normalized gene expression data, with samples labeled as either cirrhotic or normal liver tissue.

To address class imbalance in the dataset, class weights were assigned using the classwt parameter, proportionally inverse to class frequencies. This ensured that both classes contributed equally to the model training and prevented bias toward the majority class.

An initial number of 3000 decision trees (ntree = 3000) was chosen to ensure model stability and convergence of variable importance estimates. Subsequently, the number of trees was optimized by evaluating the out-of-bag (OOB) error rate across a sequence of models with increasing tree numbers (from 100 to 3000). The OOB error stabilized at approximately 30 trees, which was selected as the optimal number of estimators for the final model.

Additional hyperparameters, including mtry (the number of features randomly selected at each node split) and nodesize (the minimum number of observations in terminal nodes), were tuned using grid search. The final model configuration, consisting of ntree = 30, mtry = X, and nodesize = X (actual values to be specified), was selected based on the lowest OOB error and highest classification performance.

The optimized model achieved a classification accuracy of 93.2%, with an OOB error rate of 6.8% and an area under the receiver operating characteristic curve (AUC) of 0.964. Sensitivity and specificity were 91.6% and 94.7%, respectively, indicating strong discriminative performance between the two sample classes.

Feature importance was evaluated using the Mean Decrease in Gini Index, which quantifies the contribution of each gene to classification performance by measuring the reduction in node impurity. The Top 20 genes were selected based on their importance scores. This threshold was chosen to maintain a balance between interpretability and biological informativeness, facilitating meaningful downstream functional and enrichment analyses.

3. Results

3.1. Description of Liver Cirrhosis Data

In this study, the key genes driving HBV-related liver cirrhosis were investigated using RNA-seq data from 30 normal samples and 30 samples with HBV-related liver cirrhosis, and the pathways and processes underlying these key genes were examined. Key genes were subsequently chosen after the comparability of the data utilized in this study was validated by expression distribution analysis of 60 samples (Figure 1a). Furthermore, the data standard deviation distribution roughly matched a normal distribution, and the Top 10 regulatory genes (Figure 1b,c) met the requirements for performing the differential analysis. Differences in gene expression between normal and cirrhotic samples are plotted by Volcano plot (Figure 1d).

Figure 1.

Figure 1

Overview of gene expression analysis in liver cirrhosis. (a) Boxplot showing normalized expression distribution across 60 samples (30 cirrhotic and 30 normal liver tissues). (b) Histogram of gene expression standard deviation, illustrating global expression variability. (c) Boxplots of the Top 10 differentially expressed regulatory genes between cirrhotic and normal samples; statistical significance is indicated (p < 0.05, p < 0.01, p < 0.001). (d) Volcano plot of differentially expressed genes. Red and blue dots represent significantly upregulated (n = 1215) and downregulated genes (n = 3690), respectively (|log2FC| > 1, adjusted p < 0.05).

3.2. Identification of DEGs

After data processing, 14,470 genes were identified. Differential expression analysis was performed using Limma software 3.52.4 [34]. A total of 2125 upregulated genes and 3689 downregulated genes were found using a threshold of p < 0.01 and log2FC > 1, as shown in the heatmap (Figure 2a). This log2FC threshold was selected based on the objective of capturing a broad, yet biologically meaningful set of transcriptomic changes while minimizing the exclusion of moderately dysregulated but functionally relevant genes. The heatmap illustrates the distinct clustering of upregulated and downregulated genes, highlighting the differential expression patterns between groups.

Figure 2.

Figure 2

Transcriptomic profiling and functional enrichment analysis of differentially expressed genes (DEGs) in HBV-associated liver cirrhosis. (a) Heatmap displaying the expression patterns of DEGs between cirrhotic and normal liver tissue samples. Rows represent individual genes, and columns represent samples. Red indicates higher expression and green indicates lower expression (log-transformed and normalized expression values). Samples are clearly clustered by disease status, with consistent upregulation and downregulation patterns across groups. (b) Gene Ontology (GO) enrichment analysis of DEGs. The Top 20 significantly enriched GO terms (adjusted p value < 0.05) are shown separately for upregulated (top panel) and downregulated (bottom panel) genes, categorized under Biological Process. Bar height represents the number of genes associated with each term, and colors represent functional clusters derived from semantic similarity. (c) Volcano plot showing the global distribution of gene expression changes. Each point represents a single gene. Genes meeting the differential expression threshold (adjusted p value < 0.01 and |log2 fold change| > 1) are colored: orange for significantly upregulated genes (n = 2125), blue for significantly downregulated genes (n = 3689), and gray for nonsignificant genes. Selected genes with large effect sizes or biological relevance are labeled.

The elevated genes were mainly enriched in biological processes, including ATP production, glutathione metabolism, and cellular response to cytotoxicity, which are typically included in liver cancer pathways, according to GO functional enrichment analysis (Figure 2b). This indicates that when HBV-related liver cirrhosis develops into liver cancer, elevated genes are transcriptionally triggered. In contrast, downregulated genes were primarily enriched in cell adhesion and cellular transport.

Gene expression differences were plotted using a volcano plot (Figure 2c). The Top 10 most highly enriched upregulated genes were COX6B1, RPS10, USMG5, NACAP1, UGT2B28, PABPC3, TUBB3, RPS27, COCH, and HLA-G. The subunit of cytochrome-c-oxidase 6B1, COX6B1, is linked to liver disease, ATP synthesis, electron transport, and gene expression. Viral mRNA translation, mRNA activation upon the interaction of a cap-binding complex and eIFs, and their recruitment and subsequent binding to 43S are all mechanisms linked to RPS10, which encodes ribosomal protein S10.

Conversely, PLCG2, ARHGEF12, LRP1, FSIP2, SAMD12, ASTN2, MIPOL1, DST, TBC1D8B, and GATM were the Top 10 downregulated genes. Autoinflammatory illnesses, antibody deficits, immunodeficiency, and familial cold autoinflammatory syndromes are some of the conditions associated with PLCG2. The associated pathways include ADORA2B-mediated anti-inflammatory cytokine production and the prolactin signaling pathway.

3.3. DEGs Associated With ECM

Among these DEGs, there were genes linked to the ECM (Figures 3a, 3b, and 3c). There were 170 downregulated ECM-related genes (DIAPH1 and THSD4) and 96 upregulated genes (COCH and ACTB). ACTB encodes actin, a key cytoskeleton component. In addition to their function in the cytoplasmic cell skeleton, G- and F-actin are also localized to the cell nucleus, where they control DNA damage repair, gene transcription, and cell motility.

Figure 3.

Figure 3

Gene expression analysis focusing on extracellular matrix (ECM)-related genes. (a) Heatmap illustrating the expression profiles of differentially expressed ECM-associated genes between liver cirrhosis samples and control samples. Each row represents a gene, and each column represents a sample. Red denotes upregulated genes, and green denotes downregulated genes. Hierarchical clustering was applied to both genes and samples. (b) Volcano plot displaying the differential expression of ECM genes. The x-axis shows the log₂ fold change (log₂FC), and the y-axis shows the -log₁₀ adjusted p-value (FDR). Genes with |log2FC| > 1 and FDR < 0.05 are considered significantly differentially expressed. Notable upregulated (e.g., SP5) and downregulated (e.g., DOCK11) genes are highlighted. (c) Heatmap showing the top significantly dysregulated ECM genes. Red indicates higher expression and green indicates lower expression across the sample groups. The clustering highlights distinct expression patterns in cirrhotic versus control liver tissues.

3.4. Selection of Key ECM Genes

A random forest algorithm was employed to analyze the ECM-related DEGs in order to identify key ECM genes. As a result, 20 ECM genes that were differentially expressed were selected (Figure 4a), which included an upregulated group comprising CTSA, CRELD2, MAPK10, LGALS2, CAV3, and PRSS2 and a downregulated group comprising ITGA1, RAC1, ECM2, and PAK2. Gene Ontology (GO) enrichment analysis of the 20 key genes (Figure 4b) revealed potential functional relevance to the Ras, Rho, and MAPK signaling pathways embedded in the MAPK cascade. Mutations in Ras proto-oncogenes (H-Ras, N-Ras, and K-Ras) occur in up to 20%–30% of tumors, while the Rho family is involved in actin cytoskeleton reorganization, a process integral to cell migration. In (Figure 4c), the random number distribution plot, it can be noted that after 30 trees, the green and red lines became wider apart, indicating that 30 trees were sufficient for the determination of the ECM-related feature genes. The Gini coefficient was used to rank the genes (Figure 4d), and the Top 20 genes were included as key ECM DEGs.

Figure 4.

Figure 4

Selection and functional characterization of key extracellular matrix (ECM)-related genes in liver cirrhosis. (a) Heatmap representing the expression profiles of selected differentially expressed ECM-related genes in liver cirrhosis and normal control samples. Rows represent genes, and columns represent individual samples. Red indicates upregulated expression, and green indicates downregulated expression. (b) Gene Ontology (GO) enrichment analysis of selected ECM genes. The dendrogram displays clustered GO biological processes, with dot size representing the number of associated genes and color indicating statistical significance (adjusted p value). Enriched terms include cell junction assembly, cell adhesion, signal transduction, and regulation of MAPK activity. (c) Error rate plot of the Random Forest classifier showing the out-of-bag (OOB) error as a function of the number of decision trees. The green line represents the overall error, while the red and black lines denote error rates for individual groups (normal vs. cirrhosis). (d) Gini index plot ranking the importance of selected ECM genes based on the mean decrease in Gini impurity in the Random Forest model. Higher values indicate greater importance in classifying cirrhotic versus normal samples. PRSS2, NGFR, and EGR2 were among the top-ranked discriminative genes.

3.5. Enrichment Analysis

KEGG pathway analysis of 20 ECM genes, identified using random forest algorithms, was enriched in the Ras, MAPK, and ERBB pathways implicated in liver cirrhosis, cancer development, and tumor immunity. These ECM genes may serve as biomarkers for cancer development in HBV-related liver cirrhosis. MAPK cascade regulates cell differentiation, proliferation, inflammation, and apoptosis. Its altered regulation drives liver cirrhosis progression through the stress responses, hepatocyte damage, and fibrogenesis (Figure 5a,b).

Figure 5.

Figure 5

(a) Upregulation of the MAPK signaling pathway in liver cirrhosis. KEGG pathway map showing significantly upregulated genes (in red) in the MAPK signaling cascade in cirrhotic liver tissue. Red intensity indicates the degree of upregulation. Key genes involved in the JNK and ERK branches were activated, suggesting roles in inflammation, apoptosis, and cell cycle regulation during liver cirrhosis progression. (b) Downregulation of the MAPK signaling pathway in liver cirrhosis. KEGG pathway map showing significantly downregulated genes (highlighted in green) in the MAPK signaling pathway in cirrhotic liver tissue. The reduction in expression across multiple components—including upstream receptors (e.g., TNFR, IL1R), MAPKKKs (e.g., MAP3K1, MAP3K5), and transcription factors (e.g., ATF2, c-Fos)—indicates suppression of inflammatory and proliferative signals. This suggests impaired regulation of key cellular processes such as stress response, apoptosis, and cell differentiation in cirrhosis. (c) Upregulation of the ERBB signaling pathway in HCC. This schematic depicts the ERBB signaling cascade with components upregulated in HCC highlighted. Genes and proteins are color-coded according to their expression levels, with a gradient from white (no change) to dark red (maximum upregulation), as indicated by the parameter change scale (0.0–1.0) in the top right. Red-colored nodes represent genes or proteins that are significantly upregulated in HCC. Solid arrows indicate direct activations, dashed arrows denote indirect interactions, and “+p” marks phosphorylation events. Key downstream pathways include MAPK, PI3K-Akt, JNK, and calcium signaling, all contributing to oncogenic processes such as proliferation, survival, angiogenesis, and invasion. (d) Upregulation of the PI3K-AKT signaling pathway in HCC. This diagram illustrates components of the PI3K-AKT signaling pathway with expression changes observed in HCC. Genes and proteins upregulated in HCC are marked by red boxes, with color intensity reflecting the degree of upregulation based on transcriptomic data. The accompanying scale bar (0.0–1.0) indicates relative expression changes, from no change (white) to maximum upregulation (dark red). Signaling flow is represented by solid arrows (direct activation) and dashed arrows (indirect or multi-step interactions), while phosphorylation events are indicated by “+p.” Central to this pathway is AKT, which integrates upstream signals from growth factors (GF), RTKs, integrins, and chemokine receptors, and propagates downstream signals influencing key cellular processes such as survival, proliferation, metabolism, protein synthesis, and apoptosis. Several downstream branches—including the mTOR, MAPK, and FoxO signaling pathways—are shown mediating diverse oncogenic effects.

The heatmap showed upregulated MAPK proteins, including JNK, p38, ERK, and MEK, indicating upstream activation of MAPK in liver cirrhosis. Among them, JNK and p38 promote hepatocyte apoptosis, while ERK induces hepatic stellate cell activation, leading to the deposition of ECM and aggravation of fibrosis and inflammation. Conversely, (Figure 5b) showed downregulated components of MAPK where suppressed JNK, p38, and ERK signaling overlapped with reduced hepatocyte apoptosis and hepatic stellate cell proliferation with a lesser degree of inflammation, thereby aiding in tissue preservation. Inhibition of MAPK pathways may hold the promise of alleviating liver fibrosis in experimental models through drugs acting as inhibitors of JNK and p38.

The ERBB signaling pathway (Figure 5c), mediated by ERBB receptor tyrosine kinases (ERBB-1/EGFR, ERBB-2/HER2, ERBB-3, and ERBB-4), is crucial for cellular proliferation, survival, migration, and angiogenesis, which leads to cancer progression. Following ligand binding, the receptors homodimerize or heterodimerize and undergo trans-autophosphorylation, consequently activating downstream signaling cascades, namely, the MAPK (Raf/MEK/ERK), PI3K/Akt, and JNK pathways. Pathways that exhibit aberrant signaling have been implicated in hepatocyte proliferation, malignant transformation, and therapy resistance, particularly following hepatitis viral infections and cirrhosis during HCC development. The MAPK signaling pathway controls proliferation and migration, while the PI3K/Akt signaling pathway (Figure 5d) enhances survival and metabolism, assisting in tumor growth and resistance. ERBB activation is also thought to stimulate proangiogenic factors, such as VEGF, which mediate angiogenesis, which is important for tumor expansion. The JNK pathway integrates signals for stress responses and inflammation, thereby linking liver injury to malignant hepatocyte transformation.

The upregulated components of the ERBB pathway, HER2, Raf, JNK, and PAK, promote the proliferation, migration, and invasion of tumor cells. Enhanced HER2 expression correlates with an aggressive cancer phenotype and poor prognosis, making it a promising therapeutic target. In liver cancers, such as HCC, chronic inflammation or viral-mediated activation persistently leads to ERBB signaling, which facilitates hepatocyte transformation and tumor progression.

4. Discussion

Worldwide, 21% and 42% of patients with cirrhosis are infected with HCV and HBV, respectively [35]. For the early noninvasive diagnosis of liver cirrhosis, it is essential to identify robust prognostic biomarkers. Liver cirrhosis is a multifaceted illness characterized by significant fibrosis and ECM remodeling that frequently progresses to HCC, particularly in cases associated with HBV infection [36]. This study is aimed at analyzing gene expression patterns and matrisome-related signatures to identify the important genes and pathways associated with HBV-related liver cirrhosis. Differential gene expression analysis identified several dysregulated genes in HBV-associated liver cirrhosis. Among the upregulated genes, RPS10 and COX6B1 are involved in physiological processes, including mRNA translation and ATP generation, in line with the metabolic alterations observed in cirrhotic liver tissues. These molecular changes affect hepatocyte function, inflammation, and fibrosis [37]. Conversely, the downregulated genes PLCG2 and ARHGEF12 are associated with cellular signaling and immune regulation, suggesting dysregulation of the immune response in the cirrhotic liver. GO enrichment analysis suggests that most upregulated genes were linked to processes associated with cancer, such as ATP production and glutathione metabolism [38], while KEGG pathway analysis showed that ERBB, MAPK, and Ras signaling pathways are enriched and associated with tumor immunity and cancer development.

The pathways related to cancer among the dysregulated genes in cirrhotic tissues indicate a mechanism-based relationship between HBV-related cirrhosis and HCC. This highlights the critical need for early recognition and treatment of HBV-related cirrhosis to halt or delay progression to HCC. ECM remodeling is essential in liver fibrosis, and the upregulation of actin cytoskeleton-associated genes (COCH and ACTB) denotes increased ECM deposition and cellular motility, which are characteristics of liver fibrosis and HCC progression [39, 40]. Conversely, downregulated ECM genes, such as THSD4 and DIAPH1, are probably involved in the disorganization of ECM and subsequent atypical signaling in cirrhotic tissue [41].

A random forest algorithm was used to identify key genes associated with ECM that could predict HBV-related cirrhosis. CTSA, CRELD2, MAPK10, and other specific genes are associated with the Ras and MAPK signaling pathways, which have been implicated in tumorigenesis and metastasis of hepatic cancer [10]. Liver cirrhosis and HCC may be related to the upregulation of ACTB, which codes for the beta-actin [29]. Similarly, COCH expression has been linked to portal vein tumor thrombosis, HCC, and HBV infection [42]. In contrast, downregulation of actin filament organization genes, such as DIAPH1, increases fibrosis potential through disorganization of the cell architecture [43]. Bioinformatic analysis helps identify significant ECM genes, which can serve as biomarkers for diagnosis, prognosis, and treatment. CTSA serves as a promising biomarker for disease progression based on its function in lysosomal and ECM degradation [44]. Furthermore, the inhibition of ECM-associated pathways, such as Ras and MAPK, may provide new therapeutic strategies for treating or preventing the progression of HBV-associated cirrhosis to HCC.

Furthermore, the small sample size limited the generalizability of the results. These results should be validated in larger, more diverse cohorts, contributing to growing statistical power. Future studies should validate the significant ECM-related genes by utilizing larger datasets and including more demographic data. Additionally, longitudinal studies are necessary to follow ECM modifications in HBV-related cirrhosis over time. Functional assays, including gene knockdown and overexpression experiments, are required to confirm the importance of key ECM genes (e.g. CTSA and ITGA1) in liver fibrosis and their potential as therapeutic targets. Stratification of the ECM-related genes identified in HBV-associated cirrhosis with other liver diseases, including NAFLD and ALD, should be performed in future studies to improve our results. This will enable a clearer discrimination of HBV-specific ECM signatures and facilitate diagnostic and therapeutic applications.

This study provides insights into the molecular mechanisms of HBV-associated liver cirrhosis. The ECM-associated genes recognized in our study, including CTSA, CRELD2, MAPK10, and ITGA1, could serve as new diagnostic and therapeutic targets through which we speculate that patient welfare might benefit. The exact clinical utility of these findings in the early identification and management of liver cirrhosis and its evolution in HCC requires confirmation in larger cohorts and experimental models.

5. Conclusion

This study demonstrated a matrisome expression signature across liver cirrhosis samples through comprehensive bioinformatics analysis of transcriptomic dataset from HBV-related liver cirrhosis samples. Important genes associated with the ECM and signaling pathways in relation to Ras, Rho, and MAPK played important roles in MMP activation, such as CTSA, CRELD2, MAPK10, ITGA1, and RAC1. These findings indicate a plausible involvement of the matrisome in HBV-related liver cirrhosis' transformation into HCC. The identified hub genes have potential significance in future diagnostic and therapeutic studies of liver cirrhosis. These matrisome-associated signatures warrant more experimental validation and clinical investigations to determine their relevance and clinical significance in the progression of HBV-related liver cirrhosis.

Nomenclature

HBV

Hepatitis B virus

HCC

hepatocellular carcinoma

ECM

extracellular matrix

DEGs

differentially expressed genes

GO

Gene Ontology

KEGG

Kyoto Encyclopedia of Genes and Genomes

Data Availability Statement

The data that support the findings of this study are openly available in GEO NCBI at https://www.ncbi.nlm.nih.gov/geo/.

Ethics Statement

The data used in this study were obtained from publicly available datasets, and no human participants were directly involved.

Disclosure

All authors have read and approved the manuscript.

Conflicts of Interest

The authors declare no conflicts of interest.

Author Contributions

R.P.C. designed the research, analyzed the results, and wrote the manuscript. Y.S. and K.D. reviewed and edited the manuscript. S.J. supervised and edited the whole manuscript.

Funding

No funding was received for this manuscript.

Supporting Information

Additional supporting information can be found online in the Supporting Information section.

Supporting Information 1

File S1: Detailed metadata from the GEO datasets used in the analysis, listing platform types, sample IDs, assay methods, and classification (normal vs. cirrhosis).

5532643.f1.docx (18KB, docx)
Supporting Information 2

File S2: The full list of differentially expressed genes (DEGs), with associated log fold change values and statistical significance measures.

5532643.f2.xlsx (85.6KB, xlsx)

References

  • 1.Devarbhavi H., Asrani S. K., Arab J. P., Nartey Y. A., Pose E., Kamath P. S. Global Burden of Liver Disease: 2023 Update. Journal of Hepatology . 2023;79(2):516–537. doi: 10.1016/j.jhep.2023.03.017. [DOI] [PubMed] [Google Scholar]
  • 2.Al-Busafi S. A., Alwassief A. Global Perspectives on the Hepatitis B Vaccination: Challenges, Achievements, and the Road to Elimination by 2030. Vaccines . 2024;12(3):p. 288. doi: 10.3390/vaccines12030288. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Seto W. K., Yuen M. F. Clinical Epidemiology of Chronic Liver Diseases . Cham: Springer International Publishing; 2019. Hepatitis B Virus: Asian Perspective; pp. 99–116. [DOI] [Google Scholar]
  • 4.Guo D., Li J., Zhao P., Mei T., Li K., Zhang Y. The Hepatocellular Carcinoma Risk in Patients With HBV-Related Cirrhosis: A Competing Risk Nomogram Based on a 4-Year Retrospective Cohort Study. Oncologia . 2024;14:p. 14. doi: 10.3389/fonc.2024.1398968. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Yao J., Guo J., Xie Y. Hepatitis B Virus Induced Cirrhosis and Hepatocarcinoma: Pathogenesis and Therapeutics. Digestive Diseases . 2025;4 doi: 10.37349/edd.2025.100565. [DOI] [Google Scholar]
  • 6.Varghese N., Majeed A., Nyalakonda S., Boortalary T., Halegoua-DeMarzio D., Hann H. W. Review of Related Factors for Persistent Risk of Hepatitis B Virus-Associated Hepatocellular Carcinoma. Cancers (Basel) . 2024;16(4):p. 777. doi: 10.3390/cancers16040777. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Liu Y. B., Chen M. K. Epidemiology of Liver Cirrhosis and Associated Complications: Current Knowledge and Future Directions. World Journal of Gastroenterology . 2022;28(41):5910–5930. doi: 10.3748/wjg.v28.i41.5910. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Sharma S. Non-Invasive Diagnosis of Advanced Fibrosis and Cirrhosis. World Journal of Gastroenterology . 2014;20(45):p. 16820. doi: 10.3748/wjg.v20.i45.16820. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.EASL-ALEH Clinical Practice Guidelines. EASL-ALEH Clinical Practice Guidelines: Non-Invasive Tests for Evaluation of Liver Disease Severity and Prognosis. Journal of Hepatology . 2015;63(1):237–264. doi: 10.1016/j.jhep.2015.04.006. [DOI] [PubMed] [Google Scholar]
  • 10.Zhang M. H., Yuan Y. F., Liu L. J., et al. Dysregulated MicroRNAs as a Biomarker for Diagnosis and Prognosis of Hepatitis B Virus-Associated Hepatocellular Carcinoma. World Journal of Gastroenterology . 2023;29(31):4706–4735. doi: 10.3748/wjg.v29.i31.4706. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Arteel G. E., Naba A. The Liver Matrisome – Looking Beyond Collagens. JHEP Reports . 2020;2(4) doi: 10.1016/j.jhepr.2020.100115.100115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Dzobo K., Dandara C. The Extracellular Matrix: Its Composition, Function, Remodeling, and Role in Tumorigenesis. Biomimetics . 2023;8(2):p. 146. doi: 10.3390/biomimetics8020146. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Pickup M. W., Mouw J. K., Weaver V. M. The Extracellular Matrix Modulates the Hallmarks of Cancer. EMBO Reports . 2014;15(12):1243–1253. doi: 10.15252/embr.201439246. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Witjas F. M. R., van den Berg B. M., van den Berg C. W., Engelse M. A., Rabelink T. J. Concise Review: The Endothelial Cell Extracellular Matrix Regulates Tissue Homeostasis and Repair. Stem Cells Translational Medicine . 2019;8(4):375–382. doi: 10.1002/sctm.18-0155. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.McQuitty C. E., Williams R., Chokshi S., Urbani L. Immunomodulatory Role of the Extracellular Matrix Within the Liver Disease Microenvironment. Frontiers in Immunology . 2020;11:p. 11. doi: 10.3389/fimmu.2020.574276. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Potekaev N. N., Borzykh O. B., Medvedev G. V., et al. The Role of Extracellular Matrix in Skin Wound Healing. Journal of Clinical Medicine . 2021;10(24):p. 5947. doi: 10.3390/jcm10245947. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Thiele G. M., Duryee M. J., Freeman T. L., et al. Rat Sinusoidal Liver Endothelial Cells (SECs) Produce Pro-Fibrotic Factors in Response to Adducts Formed From the Metabolites of Ethanol. Biochemical Pharmacology . 2005;70(11):1593–1600. doi: 10.1016/j.bcp.2005.08.014. [DOI] [PubMed] [Google Scholar]
  • 18.Dolin C. E., Arteel G. E. The Matrisome, Inflammation, and Liver Disease. Seminars in Liver Disease . 2020;40(2):180–188. doi: 10.1055/s-0039-3402516. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Chen W., Desert R., Ge X., et al. The Matrisome Genes From Hepatitis B–Related Hepatocellular Carcinoma Unveiled. Hepatology Communications . 2021;5(9):1571–1585. doi: 10.1002/hep4.1741. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Ahodantin J., Lekbaby B., Bou Nader M., Soussan P., Kremsdorf D. Hepatitis B Virus X Protein Enhances the Development of Liver Fibrosis and the Expression of Genes Associated With Epithelial-Mesenchymal Transitions and Tumor Progenitor Cells. Carcinogenesis . 2020;41(3):358–367. doi: 10.1093/carcin/bgz109. [DOI] [PubMed] [Google Scholar]
  • 21.Hester C. A., Rich N. E., Singal A. G., Yopp A. C. Comparative Analysis of Nonalcoholic Steatohepatitis– Versus Viral Hepatitis– and Alcohol-Related Liver Disease–Related Hepatocellular Carcinoma. Journal of the National Comprehensive Cancer Network . 2019;17(4):322–329. doi: 10.6004/jnccn.2018.7105. [DOI] [PubMed] [Google Scholar]
  • 22.Arriazu E., Ruiz de Galarreta M., Cubero F. J., et al. Extracellular Matrix and Liver Disease. Antioxidants & Redox Signaling . 2014;21(7):1078–1097. doi: 10.1089/ars.2013.5697. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Iwaisako K., Brenner D. A., Kisseleva T. What’s New in Liver Fibrosis? The Origin of Myofibroblasts in Liver Fibrosis. Journal of Gastroenterology and Hepatology . 2012;27(s2):65–68. doi: 10.1111/j.1440-1746.2011.07002.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Giannandrea M., Parks W. C. Diverse Functions of Matrix Metalloproteinases During Fibrosis. Disease Models & Mechanisms . 2014;7(2):193–203. doi: 10.1242/dmm.012062. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Naim A., Pan Q., Baig M. S. Matrix Metalloproteinases (MMPs) in Liver Diseases. Journal of Clinical and Experimental Hepatology . 2017;7(4):367–372. doi: 10.1016/j.jceh.2017.09.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Zhang C. Y., Yuan W. G., He P., Lei J. H., Wang C. X. Liver Fibrosis and Hepatic Stellate Cells: Etiology, Pathological Hallmarks and Therapeutic Targets. World Journal of Gastroenterology . 2016;22(48):p. 10512. doi: 10.3748/wjg.v22.i48.10512. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Nazari E., Khalili-Tanha G., Asadnia A., et al. Bioinformatics Analysis and Machine Learning Approach Applied to the Identification of Novel Key Genes Involved in Non-Alcoholic Fatty Liver Disease. Scientific Reports . 2023;13(1):p. 20489. doi: 10.1038/s41598-023-46711-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Jiang C. H., Yuan X., Li J. F., et al. Bioinformatics-Based Screening of Key Genes for Transformation of Liver Cirrhosis to Hepatocellular Carcinoma. Journal of Translational Medicine . 2020;18(1):p. 40. doi: 10.1186/s12967-020-02229-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Du T. Y., Gao Y. X., Zheng Y. S. Identification of Key Genes Related to Immune Infiltration in Cirrhosis via Bioinformatics Analysis. Scientific Reports . 2023;13(1):p. 1876. doi: 10.1038/s41598-022-26794-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Wu Y., Cao Y., Xu K., et al. Dynamically Remodeled Hepatic Extracellular Matrix Predicts Prognosis of Early-Stage Cirrhosis. Cell Death & Disease . 2021;12(2):p. 163. doi: 10.1038/s41419-021-03443-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Mak K. M., Mei R. Basement Membrane Type IV Collagen and Laminin: An Overview of Their Biology and Value as Fibrosis Biomarkers of Liver Disease. Anatomical Record . 2017;300(8):1371–1390. doi: 10.1002/ar.23567. [DOI] [PubMed] [Google Scholar]
  • 32.Yu G., Wang L. G., Han Y., He Q. Y. Clusterprofiler: An R Package for Comparing Biological Themes Among Gene Clusters. Omics . 2012;16(5):284–287. doi: 10.1089/omi.2011.0118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Breiman L. Random Forests. Machine Learning . 2001;45(1):5–32. doi: 10.1023/A:1010933404324. [DOI] [Google Scholar]
  • 34.Ritchie M. E., Phipson B., Wu D., et al. Limma Powers Differential Expression Analyses for RNA-Sequencing and Microarray Studies. Nucleic Acids Research . 2015;43(7):p. e47. doi: 10.1093/nar/gkv007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Alberts C. J., Clifford G. M., Georges D., et al. Worldwide Prevalence of Hepatitis B Virus and Hepatitis C Virus Among Patients With Cirrhosis at Country, Region, and Global Levels: A Systematic Review. Lancet Gastroenterology & Hepatology . 2022;7(8):724–735. doi: 10.1016/S2468-1253(22)00050-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Llovet J. M., Kelley R. K., Villanueva A., et al. Hepatocellular Carcinoma. Nature Reviews Disease Primers . 2021;7(1):p. 6. doi: 10.1038/s41572-020-00240-3. [DOI] [PubMed] [Google Scholar]
  • 37.Dietrich C. G. Molecular Changes in Hepatic Metabolism and Transport in Cirrhosis and their Functional Importance. World Journal of Gastroenterology . 2016;22(1):p. 72. doi: 10.3748/wjg.v22.i1.72. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Traverso N., Ricciarelli R., Nitti M., et al. Role of Glutathione in Cancer Progression and Chemoresistance. Oxidative Medicine and Cellular Longevity . 2013;2013 doi: 10.1155/2013/972913.972913 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Yuan Z., Li Y., Zhang S., et al. Extracellular Matrix Remodeling in Tumor Progression and Immune Escape: From Mechanisms to Treatments. Molecular Cancer . 2023;22(1):p. 48. doi: 10.1186/s12943-023-01744-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Cox T. R., Erler J. T. Remodeling and Homeostasis of the Extracellular Matrix: Implications for Fibrotic Diseases and Cancer. Disease Models & Mechanisms . 2011;4(2):165–178. doi: 10.1242/dmm.004077. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Ortiz C., Schierwagen R., Schaefer L., Klein S., Trepat X., Trebicka J. Extracellular Matrix Remodeling in Chronic Liver Disease. Current Tissue Microenvironment Reports . 2021;2(3):41–52. doi: 10.1007/s43152-021-00030-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Wang C., Ding Z. W., Zheng C. G., et al. COCH Predicts Survival and Adjuvant TACE Response in Patients With HCC. Oncology Letters . 2021;21(4):p. 275. doi: 10.3892/ol.2021.12536. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Montaldo C., Terri M., Riccioni V., et al. Fibrogenic Signals Persist in DAA-Treated HCV Patients After Sustained Virological Response. Journal of Hepatology . 2021;75(6):1301–1311. doi: 10.1016/j.jhep.2021.07.003. [DOI] [PubMed] [Google Scholar]
  • 44.Fonović M., Turk B. Cysteine Cathepsins and Extracellular Matrix Degradation. Biochimica et Biophysica Acta (BBA) - General Subjects . 2014;1840(8):2560–2570. doi: 10.1016/j.bbagen.2014.03.017. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supporting Information 1

File S1: Detailed metadata from the GEO datasets used in the analysis, listing platform types, sample IDs, assay methods, and classification (normal vs. cirrhosis).

5532643.f1.docx (18KB, docx)
Supporting Information 2

File S2: The full list of differentially expressed genes (DEGs), with associated log fold change values and statistical significance measures.

5532643.f2.xlsx (85.6KB, xlsx)

Data Availability Statement

The data that support the findings of this study are openly available in GEO NCBI at https://www.ncbi.nlm.nih.gov/geo/.


Articles from International Journal of Hepatology are provided here courtesy of Wiley

RESOURCES