Skip to main content
Cureus logoLink to Cureus
. 2023 Sep 11;15(9):e45063. doi: 10.7759/cureus.45063

Identifying Crucial Biomarkers in Osteoporosis and Ulcerative Colitis Through Bioinformatics Analysis of Co-expressed Genes

Zhengyan Wang 1, Xukai Wang 1,, Jing Yan 2, Ying Wang 1, Xingxing Yu 1, Yanpeng Wang 1
Editors: Alexander Muacevic, John R Adler
PMCID: PMC10567515  PMID: 37842511

Abstract

Osteoporosis (OP) and ulcerative colitis (UC), prevalent immune diseases, exert a substantial socioeconomic impact globally. This study identifies biomarkers for these diseases, paving the way for in-depth research. Initially, the Gene Expression Omnibus (GEO) database was employed to analyze datasets GSE35958 and GSE87466. This analysis aimed to pinpoint co-expression differential genes (DEGs) between OP and UC. Subsequently, the Metascape database facilitated the Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis of these DEGs' co-expression. For network construction and visualization, the STRING11.5 database along with Cytoscape 3.7.2 (Cytoscape Team, USA) were utilized to create a protein-protein interaction (PPI) network. Moreover, Cytoscape’s cytoHubba plugin was instrumental in identifying the central genes, known as hub genes. In the datasets GSE35958 and GSE87466, 156 co-expressed DEGs were discovered. The PPI network, constructed using STRING11.5 and Cytoscape 3.7.2, comprises 96 nodes and 222 connections. Notably, seven hub genes were identified, namely COL6A1, COL6A2, BGN, NID1, PLAU, TGFB1, and PLAUR. These DEGs were predominantly enriched in pathways such as extracellular matrix organization and collagen-containing extracellular matrix, as per GO analysis. For diagnostic model construction and hub gene validation, datasets GSE56815 and GSE107499 from the GEO database were employed. The top five hub genes were validated. In conclusion, the hub genes identified in this study played a significant role in the early diagnosis, prevention, and treatment of OP and UC. Furthermore, they provide fresh insights into the underlying mechanisms of these diseases’ development and progression.

Keywords: ulcerative colitis (uc), gene expression omnibus (geo) database, differentially expressed genes (degs), computational biology, osteoporosis (op)

Introduction

Osteoporosis (OP), the most prevalent metabolic bone disorder, is marked by a continuous loss of bone mass and tissue deterioration [1]. The bone remodeling process is out of balance, resulting in a decrease in bone strength and fractures [2]. OP is common in elderly and postmenopausal women, and the pathogenesis factors and mechanisms are mainly related to the immune system, genetics, estrogen levels in the body, smoking, and other lifestyles [3]. OP is a silent bone disease that is difficult to diagnose and treat in a timely manner because there are often no symptoms until low-energy fractures occur.

Mucous membrane inflammation is the hallmark of ulcerative colitis (UC), a chronic immune-mediated, recurrent inflammatory bowel disease (IBD) [4]. This inflammation begins in the rectum and, in severe cases, extends throughout the colon [5]. The etiology and mechanism of pathogenesis are unknown but mainly involve environmental, immune system, and other factors. Clinical symptoms are mainly abdominal pain, diarrhea, and rectal discharge of pus. In addition, extraintestinal manifestations related to the eyes, liver, and joints may be present [6]. If left untreated, it can lead to persistent bowel damage and increase the risk of surgery [7]. The incidence and prevalence of OP and UC are increasing year on year globally, creating a serious socioeconomic burden and consuming significant medical resources. Therefore, we need to pay attention to it.

OP and UC share commonalities in inflammation, immunity, and other factors, suggesting the possibility of shared or similar core genes, pathways, or biological processes. While previous studies have touched upon related aspects, there remains a gap in understanding the specific correlation between the two diseases. This study aims to bridge this gap by identifying differential genes and potential biomarkers through bioinformatics analysis, which could be instrumental in the prevention and treatment of OP and UC. Four datasets, namely GSE35958 [8], GSE87466 [9], GSE56815 [10], and GSE107499 were downloaded from the Gene Expression Omnibus (GEO) database. The first two datasets, GSE35958 and GSE87466, were employed for identifying co-expressed differential genes, while GSE56815 and GSE107499 were utilized for constructing disease diagnosis models. The analysis began with Gene Set Enrichment Analysis (GSEA) of GSE35958 and GSE87466. Subsequently, the GEO2R tool within the GEO database was used to filter out differentially expressed genes. Separate Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses were conducted. The Venn 2.0 website facilitated the identification of co-expressed differential genes, which were then subjected to enrichment analysis. Utilizing Cytoscape 3.7.2 (Cytoscape Team, USA), the data was visualized, and a protein-protein interaction (PPI) network of co-expressed differential genes was constructed using the STRING (Search Tool for the Retrieval of Interacting Genes/Proteins) database. The cytoHubba plugin within Cytoscape 3.7.2 was employed to pinpoint the Hub genes. Lastly, Statistical Package for the Social Sciences (IBM SPSS Statistics for Windows, IBM Corp., Version 25.0, Armonk, NY) was used to construct a disease diagnostic model, which aided in validating the Hub genes. The identification of Hub genes for OP and UC through this study is expected to offer fresh perspectives for subsequent research and elucidate the biological mechanisms involved.

Materials and methods

Construction of flowchart and technical route

We first produced the specific flowchart and technical route of this study shown in Figure 1. We downloaded the GSE35958 (OP) and GSE87466 (UC) datasets from the GEO database and conducted separate GSEA analyses on each of them. Subsequently, we performed differential expression analysis and functional enrichment analysis for the expression profiles of these two datasets. Next, GO and KEGG enrichment analyses were carried out based on the differentially expressed genes shared between the GSE35958 (OP) and GSE87466 (UC) datasets, and a PPI network graph was constructed using these differentially expressed genes. Additionally, we employed the cytoHubba plugin in Cytoscape 3.7.2 to identify hub genes and conducted enrichment analysis on these hub genes. Finally, we validated the hub genes using external independent datasets, GSE56815 (OP) and GSE107499 (UC), to ascertain the accuracy and reliability of our findings.

Figure 1. Flow chart.

Figure 1

GSEA: Gene Set Enrichment Analysis, OP: osteoporosis, UC: ulcerative colitis, KEGG: Kyoto Encyclopedia of Genes and Genomes, GO: Gene Ontology, PPI: protein-protein interaction

Data source

The GEO database is utilized to store quality articulation datasets. We downloaded four datasets from GEO, namely GSE35958, GSE87466, GSE56815, and GSE107499 (Table 1). The GPL570 platform ([HG-U133_Plus_2] Affymetrix Human Genome U133 Plus 2.0 Array) serves as the foundation for GSE35958. The GPL13158 platform ([HT_HG-U133_Plus_PM] Affymetrix HT HG-U133+ PM Array Plate) serves as the foundation for GSE87466. The GPL96 platform ([HG-U133A] Affymetrix Human Genome U133A Array) serves as the foundation for GSE56815. The GPL15207 platform, ([PrimeView] Affymetrix Human Gene Expression Array), is the foundation of GSE107499.

Table 1. Datasets from the GEO database.

GEO: Gene Expression Omnibus, OP: osteoporosis, UC: ulcerative colitis, BMD: bone mineral density

Data set Expression Platforms References
GSE35958 [8]   GPL570 PMID:23028809
OP 5    
Normal 4    
GSE87466 [9]   GPL13158 PMID:29401083 
UC 28    
Normal 21    
GSE56815 [10]   GPL96 PMID:29330445 
Low BMD 21    
High BMD 20    
GSE107499    GPL15207 None
UC 34    
Normal 22    

Enrichment analysis of OP and UC

GSEA is a computational method employed to determine the statistical significance of predefined sets of genes. This approach is particularly useful in identifying subtle changes in gene expression that may be overlooked by other methods. Another essential tool in the analysis is GO, which is widely used to categorize genes based on their biological processes (BP), cellular components (CC), and molecular functions (MF). Additionally, the KEGG is a comprehensive database that contains a wealth of information on genomes, biological pathways, diseases, drugs, and chemical substances. KEGG is particularly renowned for its pathway maps representing molecular interactions and reaction networks. We used the GSEA database and the Metascape database for GSE35958 and GSE87466 for GSEA enrichment analysis and GO enrichment analysis.

Identify co-expressed differential genes

To perform differential genetic analysis and download the data, we make use of the GEO2R tool in the GEO database. With a standard for dataset organization | logFC | ≥ 1.0 and adjusted P Value < 0.05. Co-expressed differential genes were found using the Venn 2.0 database. The Metascape database was used for the subsequent GO and KEGG enrichment analyses, and an adjusted P value < 0.05 was considered statistically significant.

PPI network construction of differentially expressed genes

The revealing of PPI networks can identify core protein genes and reveal both specific and non-specific interactions between proteins. Using the STRING 11.5 database, functional relationships between proteins were predicted, treating PPI networks of differential genes with scores of ≥ 0.4 as functional links, and building visual networks using Cytoscape 3.7.2. Following that, we identified and constructed densely connected modules from the PPI network by utilizing the MCODE plugin in the Cytoscape 3.7.2 software, which has a standard K-core of 2, a degree cutoff of 2, a maximum depth of 100, and a node score cutoff of 0.2. The module's genes were then subjected to enrichment analyses using GO and KEGG.

Selection and analysis of hub genes

In this study, the cytoHubba plugin within the Cytoscape 3.7.2 software was employed to pinpoint hub genes, which are genes that hold central positions within the PPI networks and are often key players in biological functions. Four algorithms, namely Degree, DMNC, MCC, and MNC, were utilized to select the top 20 hub genes. These algorithms are effective in identifying genes with high connectivity and centrality within the network, which are often indicative of their importance in biological processes. Subsequently, Venn 2.0 was used to identify the overlapping genes that were common across all four algorithms. This step is crucial in ensuring that only the most consistently significant genes are considered for further analysis. Finally, GO and KEGG enrichment analyses were conducted on these selected genes. This enrichment analysis helps in understanding the biological processes, cellular components, molecular functions, and pathways in which these hub genes are involved. By employing a combination of network analysis and enrichment studies, this approach provides a comprehensive understanding of the key genes and their roles in the context of OP and UC.

Validation and construction of diagnostic models

In this study, datasets GSE56815 and GSE107499 were employed as validation datasets. The characteristics of the hub genes, particularly those with scores in the top five, were used to generate corresponding expression profiles. This step is crucial for validating the significance and reliability of the identified hub genes. SPSS 25.0, a widely used statistical analysis software, was then utilized to analyze the model's Receiver Operating Characteristic (ROC) and the Area Under the Receiver Operating Characteristic (AUC). The ROC curve is a graphical representation that illustrates the diagnostic ability of a binary classifier system as its discrimination threshold is varied. The AUC represents the degree or measure of separability, indicating how well the model is capable of distinguishing between classes. An AUC value close to 1 indicates that the model has good class separation capabilities, whereas an AUC value close to 0.5 suggests that the model is no better than random guessing. Analyzing the ROC and AUC is essential for evaluating the performance and predictive accuracy of the diagnostic model built on the hub genes.

Results

Enrichment analysis of OP and UC

In this study, a total of 2337 differentially expressed genes were identified in OP based on the criteria of adjusted P-value < 0.05 and an |logFC| ≥ 1.0 (Figure 2A). Similarly, 1661 differentially expressed genes were identified in UC (Figure 2B). The Metascape database was employed to conduct GO and KEGG enrichment analyses for both OP and UC (Table 2). For OP, the results of GO and KEGG enrichment analyses are depicted in Figures 3A-3D. The primary BP involved positive regulation of cellular protein localization, positive regulation of intracellular protein transport, and regulation of intracellular protein transport. CC mainly targeted focal adhesion, cell-substrate adherens junction, and cell-substrate junction. MF primarily involved transcription coactivator activity, cadherin binding, and cell adhesion molecule binding. KEGG analysis for OP revealed that the main pathways involved were Adherens junction, Focal adhesion, and Proteoglycans in cancer. For UC, the GO and KEGG enrichment analyses are shown in Figure 3E-3H. The main aspects of BP included leukocyte migration, organization of the extracellular matrix, and organization of the extracellular structure. CC primarily targeted the collagen-containing extracellular matrix, the plasma membrane's exterior, and the secretory granule membrane. MF mainly involved glycosaminoglycan binding, structural components of the extracellular matrix, and cytokine activity. KEGG analysis for UC indicated that the primary pathways were rheumatoid arthritis (RA), viral protein interaction with cytokine and cytokine receptors, and cytokine-cytokine receptor interaction. Additionally, Figure 4 presents the results of the GSEA enrichment analysis for both OP and UC. This comprehensive analysis provides insights into the biological functions and pathways associated with the differentially expressed genes in OP and UC, contributing to a better understanding of the underlying mechanisms of these diseases.

Table 2. GO and KEGG analyses of GSE35958 and GSE87466.

KEGG: Kyoto Encyclopedia of Genes and Genomes, GO: Gene Ontology, DEGs: differential genes

DEGs Ontology ID Description Count p.adjust
GSE35958 BP GO:0033157 regulation of intracellular protein transport 61 7.35513E-12
  BP GO:0090316 positive regulation of intracellular protein transport 47 1.71622E-11
  BP GO:1903829 positive regulation of cellular protein localization 76 5.05755E-11
  CC GO:0005925 focal adhesion 117 9.1193E-22
  CC GO:0005924 cell-substrate adherens junction 117 9.13433E-22
  CC GO:0030055 cell-substrate junction 117 1.51679E-21
  MF GO:0050839 cell adhesion molecule binding 112 1.57448E-09
  MF GO:0045296 cadherin binding 81 1.16719E-08
  MF GO:0003713 transcription coactivator activity 74 7.48203E-07
  KEGG hsa04510 Focal adhesion 59 5.60412E-09
  KEGG hsa04520 Adherens junction 29 9.47447E-08
  KEGG hsa05205 Proteoglycans in cancer 56 1.2674E-07
GSE87466 BP GO:0050900 leukocyte migration 115 9.03478E-27
  BP GO:0030198 extracellular matrix organization 93 1.43215E-24
  BP GO:0043062 extracellular structure organization 100 2.23921E-24
  CC GO:0062023 collagen-containing extracellular matrix 94 5.79425E-23
  CC GO:0009897 external side of plasma membrane 76 8.33238E-14
  CC GO:0030667 secretory granule membrane 62 9.53361E-13
  MF GO:0005539 glycosaminoglycan binding 57 1.02347E-13
  MF GO:0005201 extracellular matrix structural constituent 44 7.09352E-12
  MF GO:0005125 cytokine activity 51 2.98427E-11
  KEGG hsa05323 Rheumatoid arthritis 35 5.20137E-12
  KEGG hsa04061 Viral protein interaction with cytokine and cytokine receptor 35 3.29002E-11
  KEGG hsa04060 Cytokine-cytokine receptor interaction 63 1.291E-09

Figure 2. Screening the datasets for differentially expressed genes.

Figure 2

(A) A volcanic map of the differential genes identified in the GSE35958 (OP) dataset. (B) A volcanic map of the differential genes identified in the GSE87466 (UC) dataset.

OP: osteoporosis, UC: ulcerative colitis

Figure 3. GO and KEGG analysis of the datasets.

Figure 3

(A) BP, (B) CC, (C) MF, and (D) KEGG of the GSE35958 (OP) dataset. (E) BP, (F) CC, (G) MF, and (H) (KEGG) of the GSE87466 (UC) dataset.

BP: biological processes, CC: cellular components, MF: molecular function, KEGG: Kyoto Encyclopedia of Genes and Genomes, GO: Gene Ontology, OP: osteoporosis, UC: ulcerative colitis

Figure 4. GSEA analysis of the datasets.

Figure 4

(A) GSEA of GSE35958 (OP) dataset. (B) GSEA of GSE87466 (UC) dataset.

GSEA: Gene Set Enrichment Analysis, OP: osteoporosis, UC: ulcerative colitis

Identify co-expressed differential genes

Utilizing the Venn 2.0 database, this study identified a total of 156 co-expressed differential genes shared between OP and UC (Figure 5A). Subsequent GO and KEGG enrichment analyses were conducted on these co-expressed differential genes (Figures 5B-5E) (Table 3). In terms of BP, the co-expressed differential genes were primarily involved in the organization of the extracellular matrix, organization of the extracellular structure, and positive regulation of cell adhesion. For CC, the genes were mainly associated with focal adhesion, cell-substrate adherens junction, and collagen-containing extracellular matrix. In the MF category, the genes were predominantly involved in growth factor binding, serving as extracellular matrix structural components, and exhibiting phosphoric diester hydrolase activity. KEGG analysis revealed that the co-expressed differential genes were most significantly involved in signaling pathways such as Proteoglycans in cancer, Focal adhesion, and the AGE-RAGE signaling pathway in diabetic complications. This analysis of co-expressed differential genes provides valuable insights into the shared biological functions and pathways between OP and RA, which can be instrumental in understanding the common mechanisms underlying these diseases and potentially developing targeted therapeutic strategies.

Table 3. GO and KEGG analyses of co-expression of differential genes.

KEGG: Kyoto Encyclopedia of Genes and Genomes, GO: Gene Ontology

Ontology ID Description p.adjust
BP GO:0030198 extracellular matrix organization 5.94e-07
BP GO:0043062 extracellular structure organization 2.93e-06
BP GO:0045785 positive regulation of cell adhesion 6.10e-06
BP GO:0043434 response to peptide hormone 4.73e-04
BP GO:0007159 leukocyte cell-cell adhesion 4.73e-04
CC GO:0062023 collagen-containing extracellular matrix 4.34e-05
CC GO:0005925 focal adhesion 0.008
CC GO:0005924 cell-substrate adherens junction 0.008
CC GO:0030055 cell-substrate junction 0.008
CC GO:0005788 endoplasmic reticulum lumen 0.010
MF GO:0019838 growth factor binding 0.063
MF GO:0005201 extracellular matrix structural constituent 0.091
MF GO:0008081 phosphoric diester hydrolase activity 0.091
MF GO:0016702 oxidoreductase activity, acting on single donors with incorporation of molecular oxygen, incorporation of two atoms of oxygen 0.091
MF GO:0016597 amino acid binding 0.091
KEGG hsa05205 Proteoglycans in cancer 2.39e-05
KEGG hsa04510 Focal adhesion 4.68e-04
KEGG hsa04933 AGE-RAGE signaling pathway in diabetic complications 4.68e-04
KEGG hsa04610 Complement and coagulation cascades 0.001
KEGG hsa05416 Viral myocarditis 0.014

Figure 5. Screening and analyzing for co-expression differential genes.

Figure 5

(A) Venn of GSE35958 (OP) and GSE87466 (UC). (B) BP of co-expression of differential genes. (C) CC of co-expression of differential genes. (D) MF of co-expression of differential genes. (E) KEGG of co-expression of differential genes.

OP: osteoporosis, UC: ulcerative colitis, BP: biological processes, CC: cellular components, MF: molecular function, KEGG: Kyoto Encyclopedia of Genes and Genomes

PPI network construction of differentially expressed genes

The study employed the STRING 11.5 database and Cytoscape 3.7.2 software to construct and visualize the PPI network, which is depicted in Figure 6. The PPI network comprises 96 nodes and 222 edges, representing proteins and their interactions, respectively. To identify core modules within the PPI network, the MCODE plugin in Cytoscape 3.7.2 was used. The top four rated core modules were extracted and are displayed in Figure 7. Furthermore, the cytoHubba plugin in Cytoscape 3.7.2 was utilized to identify the top 20 hub genes using four algorithms: Degree, DMNC, MCC, and MNC. These are illustrated in Figure 8. Venn 2.0 was then used to select the overlapping genes common to the four algorithms, resulting in a total of seven hub genes, namely COL6A1, COL6A2, BGN, NID1, PLAU, TGFB1, and PLAUR (Figure 9A). The GO and KEGG enrichment analyses of these hub genes are presented in Figure 9B and Table 4. In terms of BP, the primary activities were bone morphogenesis, extracellular structure organization, and extracellular matrix organization. CC mainly involved the sarcolemma, endoplasmic reticulum lumen, and collagen-containing extracellular matrix. MF primarily consisted of extracellular matrix binding and structural constituents of the extracellular matrix that confer tensile strength. KEGG analysis revealed that the hub genes were significantly involved in pathways such as Proteoglycans in cancer, complement and coagulation cascades, and extracellular matrix (ECM)-receptor interaction. These findings are crucial as they highlight the key genes and pathways that may play a significant role in the pathogenesis of OP and UC, and can be potential targets for therapeutic interventions.

Table 4. GO and KEGG analyses of hub genes.

KEGG: Kyoto Encyclopedia of Genes and Genomes, GO: Gene Ontology

Ontology ID Description p.adjust
BP GO:0030198 extracellular matrix organization 5.01e-05
BP GO:0043062 extracellular structure organization 5.01e-05
BP GO:0060349 bone morphogenesis 0.002
BP GO:0070208 protein heterotrimerization 0.002
BP GO:0002062 chondrocyte differentiation 0.002
CC GO:0062023 collagen-containing extracellular matrix 2.42e-06
CC GO:0042383 sarcolemma 1.82e-04
CC GO:0005788 endoplasmic reticulum lumen 0.001
CC GO:0005581 collagen trimer 0.003
CC GO:0035579 specific granule membrane 0.003
MF GO:0005201 extracellular matrix structural constituent 6.42e-06
MF GO:0030020 extracellular matrix structural constituent conferring tensile strength 0.001
MF GO:0050840 extracellular matrix binding 0.002
MF GO:0034713 type I transforming growth factor beta receptor binding 0.023
MF GO:0048407 platelet-derived growth factor binding 0.023
KEGG hsa05205 Proteoglycans in cancer 0.007
KEGG hsa04610 Complement and coagulation cascades 0.016
KEGG hsa04512 ECM-receptor interaction 0.016
KEGG hsa04974 Protein digestion and absorption 0.016
KEGG hsa04510 Focal adhesion 0.049

Figure 6. PPI network of co-expression of differential genes.

Figure 6

Red is the upregulated gene. Green is the downregulated gene.

PPI: protein-protein interaction

Figure 7. Hub genes based on four algorithms of Cytoscape 3.7.2 based on cytoHubba plugin.

Figure 7

(A) Degree algorithm, (B) DMNC algorithm, (C) MCC algorithm, (D) MNC algorithm.

Figure 8. Screening for core module.

Figure 8

(A-D) The core module of Cytoscape 3.7.2 based on MCODE plugin.

Figure 9. Screening and analyzing for hub genes.

Figure 9

(A) Hub genes of Venn based on four algorithms of Cytoscape 3.7.2 based on cytoHubba plugin. (B) GO and KEGG of 7 Hub genes.

KEGG: Kyoto Encyclopedia of Genes and Genomes, GO: Gene Ontology

Validation and construction of diagnostic models

The GSE56815 and GSE107499 datasets were used and the first five genes were selected for diagnostic model construction and validation. GSE56815(OP) results from a display (Figure 10), COL6A1 has an AUC of 0.742; COL6A2 has an AUC of 0.755; BGN has an AUC of 0.682; NID1 has an AUC of 0.689. PLAU has an AUC of 0.747; GSE107499 (UC) results from a display (Figure 11), COL6A1 has an AUC of 0.849; COL6A2 has an AUC of 0.810; BGN has an AUC of 0.909; NID1 has an AUC of 0.890; PLAU has an AUC of 0.845. These results suggest that these five genes are reliable targets for OP and UC.

Figure 10. ROC validation of GSE56815.

Figure 10

Diagnostic model of (A) COL6A1, (B) COL6A2, (C) BGN, (D) NID1, (E) PLAU, and (F) general map based on GSE56815 (OP).

ROC: Receiver Operating Characteristic, OP: osteoporosis

Figure 11. ROC validation of GSE107499.

Figure 11

Diagnostic model of (A) COL6A1, (B) COL6A2, (C) BGN, (D) NID1, (E) PLAU, and (F) general map based on GSE107499 (UC).

ROC: Receiver Operating Characteristic, UC: ulcerative colitis

Discussion

OP and UC are conditions that often present with subtle early symptoms, making them difficult to diagnose in a timely manner. The global prevalence of both OP and UC is on the rise, posing a significant socioeconomic burden and causing distress and strain on the lives of patients. Interestingly, OP has been identified as one of the common extraintestinal complications of UC [11]. The prevalence of osteoporosis among patients with UC is increasing. While inflammation is believed to have a negative impact on bone metabolism, the exact mechanisms are not well understood. It is unclear whether bone loss in UC patients is a direct consequence of inflammation or if it is attributed to other factors such as poor mineral absorption, vitamin D deficiency, decreased physical activity, nutritional issues, or the use of medications like steroids and glucocorticoids. Research has indicated that inflammatory cytokines, such as TNF-α, can be produced in UC by activating T lymphocytes, which in turn can cause OP. Furthermore, TNF-α has been found to synergize with RANKL in acting directly on osteoclasts and their precursors. This regulates the OPG/RANKL/RANK pathway, leading to bone loss by promoting osteoclast production. In this study, 156 co-expressed differential genes were identified between OP and UC. Through the use of the cytoHubba plugin in Cytoscape 3.7.2, seven hub genes were pinpointed: COL6A1, COL6A2, BGN, NID1, PLAU, TGFB1, and PLAUR. Enrichment analysis revealed that these genes are primarily involved in the organization of the extracellular matrix, protein heterotrimerization, and chondrocyte morphogenesis in endochondral bone morphogenesis, among other biological processes. The identification of these hub genes and the understanding of their roles in biological processes can provide valuable insights into the mechanisms underlying the development of OP and UC. This, in turn, can pave the way for the development of more effective diagnostic tools and therapeutic interventions for these conditions.

Collagens are a family of proteins that play a crucial role in the structural integrity of various tissues in the body. COL6A1 (collagen type VI alpha 1) and COL6A2 (collagen type VI alpha 2) are members of the collagen family and are widely distributed in several tissues, including the skin, blood vessels, skeletal muscles, and lungs [12,13]. COL6A1 and COL6A2 contribute to the maintenance of the integrity of various tissues. Collagens, including COL6A1 and COL6A2, typically have a triple-helix domain, which is a common structural feature of these extracellular matrix proteins. Specifically, COL6A1 is a major structural component of microfibrils [14]. Mutations in the genes encoding COL6A1 and COL6A2 have been associated with certain diseases, including Bethlem Myopathy and Ullrich Congenital Muscular Dystrophy. These diseases are characterized by muscle weakness and joint abnormalities. Furthermore, COL6A1 and COL6A2 are involved in several biological pathways, including collagen chain trimerization and extracellular matrix organization. The extracellular matrix is essential for providing structural support to cells and tissues, and collagens like COL6A1 and COL6A2 are vital components of this matrix. Understanding the roles of COL6A1 and COL6A2 in tissue integrity and their involvement in diseases can be valuable for the development of therapeutic strategies for conditions associated with abnormalities in collagen and extracellular matrix [15,16].

BGN is an extracellular matrix protein that is encoded by the BGN gene. It is initially produced as a proprotein, which undergoes proteolytic processing to form the mature protein. BGN plays a crucial role in various biological processes, including bone growth, muscle development and regeneration, and the assembly of collagen fibers in different tissues [17,18]. In recent years, it has been hypothesized that BGN is a signaling molecule that mediates various stages of tumorigenesis [19]. Epithelial cell morphology, growth, differentiation, and migration are all mediated by BGN in a significant way. The primary function of BGN is to preserve the extracellular matrix's structural integrity [20]. BGN may have diagnostic and prognostic value for ovarian, prostate, gastric, and colorectal cancers, according to studies. This protein also regulates inflammation and innate immunity [21,22].

NID1 is a sulfated multifunctional glycoprotein widely found in basement membranes [23]. During inflammation, NID1 encourages the chemotaxis of neutrophils. Differentiation, proliferation, and apoptosis are all regulated by interactions between cells and basement membranes [24]. NID1 is expressed in various tissues and also binds to COL6A1 and Perlecan. It may play a role in cell-extracellular matrix interactions [25].

PLAU (Plasminogen Activator, Urokinase) is a Protein Coding gene. A secreted serine protease that converts plasminogen into plasmin is encoded by this gene [26]. The plasminogen activation system (PAS) is mainly involved in tissue remodeling and fibrinolysis and inflammation in various physiological states. PAS is most well-known for its ability to break down fibrin, but it is also upregulated in chronic inflammatory conditions like arthritis and atherosclerosis [27,28]. Studies have shown that various components of PAS are related to the pathophysiology of RA. Urokinase plasminogen activator (UPA), especially pro-inflammatory mediators, appears to play an important role in RA-associated bone and cartilage destruction [29]. Although we have integrated many bioinformatics methods and statistical methods, they still have their limitations. First, this is a retrospective study, so new clinical samples and data are lacking. Second, the identified genes were not verified by in vivo and in vitro experiments. In the end, the study is simply based on the GEO database. Data from other databases are not included.

Conclusions

In this study, we performed multiple bioinformatics analyses and identified key target genes (COL6A1, COL6A2, BGN, NID1, and PLAU) for OP and UC. The treatment of OP and UC may have potential targets in these key target genes. As a result, additional insights into the early diagnosis and treatment of OP and UC may result from a more in-depth examination of these genes.

The authors have declared that no competing interests exist.

Human Ethics

Consent was obtained or waived by all participants in this study

Animal Ethics

Animal subjects: All authors have confirmed that this study did not involve animal subjects or tissue.

References

  • 1.Update on osteoporosis screening and management. Anam AK, Insogna K. Med Clin North Am. 2021;105:1117–1134. doi: 10.1016/j.mcna.2021.05.016. [DOI] [PubMed] [Google Scholar]
  • 2.Interaction between bone and immune cells: implications for postmenopausal osteoporosis. Fischer V, Haffner-Luntzer M. Semin Cell Dev Biol. 2022;123:14–21. doi: 10.1016/j.semcdb.2021.05.014. [DOI] [PubMed] [Google Scholar]
  • 3.Determinants of imminent fracture risk in postmenopausal women with osteoporosis. Barron RL, Oster G, Grauer A, Crittenden DB, Weycker D. Osteoporos Int. 2020;31:2103–2111. doi: 10.1007/s00198-020-05294-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Ulcerative colitis: an update. Segal JP, LeBlanc JF, Hart AL. Clin Med (Lond) 2021;21:135–139. doi: 10.7861/clinmed.2021-0080. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.A comprehensive review and update on ulcerative colitis() Gajendran M, Loganathan P, Jimenez G, et al. Dis Mon. 2019;65:100851. doi: 10.1016/j.disamonth.2019.02.004. [DOI] [PubMed] [Google Scholar]
  • 6.Ulcerative colitis. Kobayashi T, Siegmund B, Le Berre C, et al. Nat Rev Dis Primers. 2020;6:74. doi: 10.1038/s41572-020-0205-x. [DOI] [PubMed] [Google Scholar]
  • 7.Ulcerative colitis: making the diagnosis. Kaenkumchorn T, Wahbeh G. Gastroenterol Clin North Am. 2020;49:655–669. doi: 10.1016/j.gtc.2020.07.001. [DOI] [PubMed] [Google Scholar]
  • 8.The transcriptional profile of mesenchymal stem cell populations in primary osteoporosis is distinct and shows overexpression of osteogenic inhibitors. Benisch P, Schilling T, Klein-Hitpass L, et al. PLoS One. 2012;7:0. doi: 10.1371/journal.pone.0045142. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Molecular comparison of adult and pediatric ulcerative colitis indicates broad similarity of molecular pathways in disease tissue. Li K, Strauss R, Ouahed J, et al. J Pediatr Gastroenterol Nutr. 2018;67:45–52. doi: 10.1097/MPG.0000000000001898. [DOI] [PubMed] [Google Scholar]
  • 10.A novel approach for correction of crosstalk effects in pathway analysis and its application in osteoporosis research. Zhou Y, Gao Y, Xu C, Shen H, Tian Q, Deng HW. Sci Rep. 2018;8:668. doi: 10.1038/s41598-018-19196-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.The genetically predicted causal relationship of inflammatory bowel disease with bone mineral density and osteoporosis: evidence from two-sample Mendelian randomization. Xu D, Chen Y, Gao X, et al. Front Immunol. 2023;14:1148107. doi: 10.3389/fimmu.2023.1148107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Collagen VI at a glance. Cescon M, Gattazzo F, Chen P, Bonaldo P. J Cell Sci. 2015;128:3525–3531. doi: 10.1242/jcs.169748. [DOI] [PubMed] [Google Scholar]
  • 13.H3K27 acetylation activated-COL6A1 promotes osteosarcoma lung metastasis by repressing STAT1 and activating pulmonary cancer-associated fibroblasts. Zhang Y, Liu Z, Yang X, et al. Theranostics. 2021;11:1473–1492. doi: 10.7150/thno.51245. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Collagen VI disorders: insights on form and function in the extracellular matrix and beyond. Lamandé SR, Bateman JF. Matrix Biol. 2018;71-2:348–367. doi: 10.1016/j.matbio.2017.12.008. [DOI] [PubMed] [Google Scholar]
  • 15.Identification and characterization of novel collagen VI non-canonical splicing mutations causing Ullrich congenital muscular dystrophy. Martoni E, Urciuolo A, Sabatelli P, et al. Hum Mutat. 2009;30:0–72. doi: 10.1002/humu.21022. [DOI] [PubMed] [Google Scholar]
  • 16.COL6A1 mutation leading to Bethlem myopathy with recurrent hematuria: a case report. Bao M, Mao F, Zhao Z, et al. BMC Neurol. 2019;19:32. doi: 10.1186/s12883-019-1263-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Biglycan neo-epitope (BGN(262)), a novel biomarker for screening early changes in equine osteoarthritic subchondral bone. Adepu S, Ekman S, Leth J, Johansson U, Lindahl A, Skiöldebrand E. Osteoarthritis Cartilage. 2022;30:1328–1336. doi: 10.1016/j.joca.2022.07.005. [DOI] [PubMed] [Google Scholar]
  • 18.OPG-Fc treatment partially rescues low bone mass phenotype in mature Bgn/Fmod deficient mice but is deleterious to the young mouse skeleton. Kram V, Jani P, Kilts TM, Li L, Chu EY, Young MF. J Struct Biol. 2020;212:107627. doi: 10.1016/j.jsb.2020.107627. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Prognostic value and association with epithelial-mesenchymal transition and molecular subtypes of the proteoglycan biglycan in advanced bladder cancer. Schulz GB, Grimm T, Sers C, et al. Urol Oncol. 2019;37:530. doi: 10.1016/j.urolonc.2019.05.011. [DOI] [PubMed] [Google Scholar]
  • 20.BGN mutations in X-linked spondyloepimetaphyseal dysplasia. Cho SY, Bae JS, Kim NK, et al. Am J Hum Genet. 2016;98:1243–1248. doi: 10.1016/j.ajhg.2016.04.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.BGN and COL11A1 regulatory network analysis in colorectal cancer (CRC) reveals that BGN influences CRC cell biological functions and interacts with miR-6828-5p. Chen D, Qin Y, Dai M, et al. Cancer Manag Res. 2020;12:13051–13069. doi: 10.2147/CMAR.S277261. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.The influence of TGF-β3, EGF, and BGN on SOX9 and RUNX2 expression in human chondrogenic progenitor cells. Janssen JN, Batschkus S, Schimmel S, Bode C, Schminke B, Miosge N. J Histochem Cytochem. 2019;67:117–127. doi: 10.1369/0022155418811645. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Genomic sequences and structural organization of the human nidogen gene (NID) Zimmermann K, Hoischen S, Hafner M, Nischt R. Genomics. 1995;27:245–250. doi: 10.1006/geno.1995.1038. [DOI] [PubMed] [Google Scholar]
  • 24.Interferon-induced protein 35 inhibits endothelial cell proliferation, migration and re-endothelialization of injured arteries by inhibiting the nuclear factor-kappa B pathway. Jian D, Wang W, Zhou X, et al. Acta Physiol (Oxf) 2018;223:0. doi: 10.1111/apha.13037. [DOI] [PubMed] [Google Scholar]
  • 25.Nidogen 1 and 2 gene promoters are aberrantly methylated in human gastrointestinal cancer. Ulazzi L, Sabbioni S, Miotto E, et al. Mol Cancer. 2007;6:17. doi: 10.1186/1476-4598-6-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Novel aspects and new roles for the serine protease plasmin. Syrovets T, Simmet T. Cell Mol Life Sci. 2004;61:873–885. doi: 10.1007/s00018-003-3348-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.The urokinase plasminogen activation system in rheumatoid arthritis: pathophysiological roles and prospective therapeutic targets. Buckley BJ, Ali U, Kelso MJ, Ranson M. Curr Drug Targets. 2019;20:970–981. doi: 10.2174/1389450120666181204164140. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Eleven genomic loci affect plasma levels of chronic inflammation marker soluble urokinase-type plasminogen activator receptor. Dowsett J, Ferkingstad E, Rasmussen LJ, et al. http://10.1038/s42003-021-02144-8. Commun Biol. 2021;4:655. doi: 10.1038/s42003-021-02144-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Identification of prognostic aging-related genes associated with immunosuppression and inflammation in head and neck squamous cell carcinoma. Yang J, Jiang Q, Liu L, et al. Aging (Albany NY) 2020;12:25778–25804. doi: 10.18632/aging.104199. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Cureus are provided here courtesy of Cureus Inc.

RESOURCES