Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2024 Aug 15;14:18939. doi: 10.1038/s41598-024-69080-5

Exploring the molecular mechanisms and shared potential drugs between rheumatoid arthritis and arthrofibrosis based on large language model and synovial microenvironment analysis

Zhaoquan Wei 1,#, Xi Chen 2,#, Youshi Sun 3,#, Yifei Zhang 1, Ruifang Dong 1, Xiaojing Wang 4, Shuangtao Chen 1,
PMCID: PMC11327321  PMID: 39147768

Abstract

Rheumatoid arthritis (RA) and arthrofibrosis (AF) are both chronic synovial hyperplasia diseases that result in joint stiffness and contractures. They shared similar symptoms and many common features in pathogenesis. Our study aims to perform a comprehensive analysis between RA and AF and identify novel drugs for clinical use. Based on the text mining approaches, we performed a correlation analysis of 12 common joint diseases including arthrofibrosis, gouty arthritis, infectious arthritis, juvenile idiopathic arthritis, osteoarthritis, post infectious arthropathies, post traumatic osteoarthritis, psoriatic arthritis, reactive arthritis, rheumatoid arthritis, septic arthritis, and transient arthritis. 5 bulk sequencing datasets and 4 single-cell sequencing datasets of RA and AF were integrated and analyzed. A novel drug repositioning method was found for drug screening, and text mining approaches were used to verify the identified drugs. RA and AF performed the highest gene similarity (0.77) and functional ontology similarity (0.84) among all 12 joint diseases. We figured out that they share the same key pathogenic cell including CD34 + sublining fibroblasts (CD34-SLF) and DKK3 + sublining fibroblasts (DKK3-SLF). Potential therapeutic target database (PTTD) was established with the differential expressed genes (DEGs) of these key pathogenic cells. Based on the PTTD, 15 potential drugs for AF and 16 potential drugs for RA were identified. This work provides a new perspective on AF and RA study which enhances our understanding of their pathogenesis. It also shed light on their underlying mechanism and open new avenues for drug repositioning studies.

Keywords: Rheumatoid arthritis, Arthrofibrosis, Synovial microenvironment, Arthritis, Sublining fibroblasts

Subject terms: Rheumatoid arthritis, Connective tissue diseases

Introduction

Synovial microenvironment plays an indispensable role in inflammatory joint disease13. It comprises a lining layer rich in lining fibroblasts (LLF) and macrophages, supported by a sublining connective tissue layer containing sublining fibroblasts (SLF) and an intricate vascular network. During joint inflammation, significant alterations in cellular composition occur within this microenvironment4, with the activation of macrophages and fibroblasts leading to cytokine production. This inflammatory process associated with the synovial tissue has been already observed in various forms of joint diseases including gouty arthritis, infectious arthritis, juvenile idiopathic arthritis, osteoarthritis, post infectious arthropathies, post traumatic osteoarthritis, psoriatic arthritis, rheumatoid arthritis, etc. 57.

Despite extensive research, most studies have focused on individual diseases8, with limited systemic investigation into joint disorders from the perspective of the synovial microenvironment. This limitation has hindered our comprehensive understanding of joint diseases. Moreover, as the literature on joint diseases rapidly expands, it becomes unfeasible to manually read and summarize all of it. Consequently, these demands prompt us to employ a natural language processing (NLP) algorithm to extract information from the abundant literature.

Recently, ChatGPT, a large language model that has opened a new era in NLP and achieved ideal results in various tasks9,10. This model can precisely extract and capture key information or knowledge from a large scale of unstructured textual data without any pre-training11. Its applications span bioinformatics, computational biology, and systems biology, based on data mining, machine learning, and NLP techniques1214. The extracted information often leads to new discoveries and the formulation of novel hypotheses through the construction of knowledge networks and integration with knowledge graphs15,16.

In this study, we performed a correlation analysis of various joint diseases using the GPT-3.5-Turbo model and revealed the highest similarity between rheumatoid arthritis (RA) and arthrofibrosis (AF) across 12 common joint diseases. Rheumatoid arthritis (RA) is a long-term autoimmune disease which causes severe inflammation of multiple joints. Arthrofibrosis (AF) refers to a fibrotic deposition complication after surgery and trauma that affects many joints. Both diseases exhibit similar symptoms and histopathological features, including chronic synovial hyperplasia, joint stiffness, pain, and excessive collagen and fibrous tissue production within the synovium. Both conditions are marked by synovial linings infiltrated with diverse immune cells and heterogeneous fibroblasts.

In fact, RA has been widely correlated with multiple fibrosis diseases in different organs, including idiopathic pulmonary fibrosis (IPF), cystic fibrosis (CF), liver fibrosis, interstitial lung disease (ILD), myocardial fibrosis, subretinal fibrosis, and lobular fibrosis in labial salivary glands3943. Up to 20–40% of RA patients develop pulmonary fibrosis, and 3% develop liver fibrosis. In recent years, several studies have also noted the association between RA and AF. However, no specialized research has been conducted on their correlation.

Therefore, this study aims to perform a comprehensive analysis between RA and AF and identify novel drugs for clinical use. Based on bulk sequencing and single-cell sequencing data, we demonstrated a presently unexplored tight connection and similarity between RA and AF. We explored the common pathogenic cell types and established a potential target database based on differentially expressed genes (DEGs). Additionally, we developed a novel drug repositioning method and identified drugs for RA and AF treatment.

Methods

Diseases similarity analysis

We applied the GPT-3.5-Turbo model for disease-related genes extraction9,10. In detail, we downloaded 373,644 literatures of 12 different common joint diseases from PubMed, including arthrofibrosis, gouty arthritis, infectious arthritis, juvenile idiopathic arthritis, osteoarthritis, post infectious arthropathies, post traumatic osteoarthritis, psoriatic arthritis, reactive arthritis, rheumatoid arthritis, septic arthritis, and transient arthritis (Supplementary Table 1). Genes information was extracted from the abstracts. Then we performed correlation analysis and network analysis with R package networkD3 (Version 0.4, https://github.com/reymont/networkD3) and GOSemSim (Version 2.16.1) with Resnik’s method17.

Single-cell data collection and integration

We have enrolled 4 different knee joint synovium single-cell RNA sequencing datasets of rheumatoid arthritis and 1 dataset of arthrofibrosis1821. These data were processed and integrated as the previous study described22. In details, all of these datasets were processed with a standard Seurat (Version 3.3) for R (V.3.6) protocol and integrated with Harmony(Version 0.1)23,24. We obtained a total of 51,430 synovial cells, with 7621 cells from arthrofibrosis patients, 32,296 from rheumatoid arthritis patients, and 11,513 from healthy controls. We used the first 20 PCs for dimensional reduction and cell clustering. Cell annotation was performed manually based on cell markers from the CellMarker 2.0 database25,26.

Bulk sequencing data collection and preprocessing

All the bulk sequencing data were downloaded from the NCBI Gene Expression Omnibus web resource (GEO, http://www.ncbi.nlm.nih.gov/geo) including 4 rheumatoid arthritis datasets (GSE55457, GSE55584, GSE55235, GSE12021) and one arthrofibrosis dataset (GSE135854)2729. These datasets were integrated and normalized to remove the batch effect. The differential expressed genes (DEGs) were calculated with the R package limma (Version 3.46.0, https://CRAN.R-project.org/package = limma) of R software (Version 4.2.0)30. The dimensional reduction analysis was performed with the R package tinyarray (Version 2.2.7, https://CRAN.R-project.org/package=tinyarray).

Deconvolution analysis of bulk RNA sequencing data

The cell type compositions in bulk RNA sequencing data were calculated with deconvolution analysis by R packages MuSiC (Version 0.2.0)31. This cell types information was obtained from the integrated and annotated single-cell RNA sequencing data. We used the standard protocol and default parameter setting in this analysis.

Gene functional enrichment analysis

We performed Gene ontology (GO) and KEGG analysis with the R package ClusterProfiler (Version 4.0)32. Gene signatures analysis was conducted by the R packages UCell (Version 1.3)33 and irGSEA (Version 1.1.3 https://github.com/chuiqin/irGSEA/). Common gene Venn plots were performed by the Venny (Version 2.1, https://bioinfogp.cnb.csic.es/tools/venny/index.html). The Sankey plot of common KEGG pathways was depicted with the R packages networkD3 (Version 0.4, https://github.com/reymont/networkD3).

Pseudotime trajectory analysis

The R package Monocle 3 (Version 1.2.9) was used to analyze the pseudotime trajectory in fibroblast cells. These fibroblast cells were ordered after the overall ‘trajectory’ of gene expression changes has been identified. Default settings were used for all tasks.

Cell communication analysis

A cell communication analysis of single-cell sequencing data was carried out using the R package CellChat (Version 1.4.0)34, involving 1,939 validated molecular interactions.

Potential therapeutic target database development

We have integrated the differential expressed genes of pathogenic cell types including DKK3-SLF, CD55-SLF, F-Mφ, and IFN-Mφ (|LogFc|> 1, adjust p < 0.05). Then the DEGs were overlapped with bulk sequencing DEGs of AF and RA separately. The common genes of AF and RA were eventually enrolled in the arthrofibrosis potential therapeutic target database (AF-PTTD) and rheumatoid arthritis potential therapeutic target database (RA-PTTD).

Drug repositioning and statistical analysis

Rheumatoid arthritis related drugs and their associated genes were collected by the R package DSEATM (Version 1.0.0) with the Mesh Term ID “D001172”35. There are few studies specific on the arthrofibrosis drugs, so we used the fibrosis drugs and related genes instead (Mesh Term ID: “D005355”). Potential arthrofibrosis target genes and rheumatoid arthritis were integrated with scRNA and bulk RNA differential expressed genes. We can set T to represent the potential target genes set with a length of M.

T = [t1, t2,…, tM].

Set D represent the drug genes with a length of N.

D = [d1, d2,…, dN].

Then the matching score can be represented as follows:

Matchingscore=-tiLogFcx,i1,MdjLogFcx,j1,NM+Ndx

Drugs with matching score over 0 were defined as potential drug candidates (PDC), and those with a score over 1 as highly potential drugs (HPD).The smooth curve of matching target gene numbers changing over different log fold change cut-off (logFc) were fitted with locally weighted regression method (LOESS). The number of supporting literatures were extracted from PubMed with the same protocol mentioned above. The supporting literature was defined as the literatures that mentioned the drug in the repositioning target diseases. The correlation scatter plot of the matching score and literature score was depicted by the R packages ggstatsplot (Version 0.9.4)36.

Results

Similarity analysis of 12 different joint diseases

The disease similarity analysis study enrolled 373,644 literatures on 12 common joint diseases (Supplementary Table 1). A total of 3042 disease associated genes were extracted and a network was constructed by the top 5 genes with the highest-ranking score (Fig. 1A). Rheumatoid arthritis and arthrofibrosis shared the highest number of common genes across the 12 diseases, with the gene similarity calculation revealing a correlation coefficient of 0.77 (Fig. 1B). GO term similarity calculation was consistent with the gene similarity (0.84) (Fig. 1C). These results indicated that the pathogenesis of rheumatoid arthritis may resembles arthrofibrosis’.

Figure 1.

Figure 1

Similarity analysis of 12 different joint diseases. (A) Top ranking diseases-associated gene network. (B) Heatmap of genes similarity between rheumatoid arthritis and arthrofibrosis. (C) Heatmap of GO terms similarity between rheumatoid arthritis and arthrofibrosis.

Cellular contribution analysis of rheumatoid arthritis and arthrofibrosis

A total of 51,430 synovial cells from 5 different single-cell datasets (4 RA and 1 AF) were successfully integrated together (Fig. 2A). Then these cells were manually annotated into seven different cell types including fibroblast cells, myofibroblast cells, endothelial cells, mononuclear phagocytes, mast cells, T cells, and B cells (Fig. 2B, Supplementary Table 2). The number of fibroblast cells was increased in both rheumatoid arthritis and arthrofibrosis synovial microenvironment (Fig. 2C). Myofibroblasts showed a pronounced increase in AF compared to RA and control groups, while immune cells such as mononuclear phagocytes, T cells, and B cells were more prevalent in RA.

Figure 2.

Figure 2

Cellular contribution analysis of rheumatoid arthritis and arthrofibrosis. (A, B) Unsupervised clustering of single-cell RNA sequencing, visualized by uniform manifold approximation and projection (UMAP) showing major clusters. Each point in the figure represents a single cell. (C) Cell proportion calculation among rheumatoid arthritis patients group, arthrofibrosis group and health control group. (D) Dimension reduction of bulk sequencing data by PCA. (E) Violin plot of estimated cell proportion including myofibroblast, fibroblast, mononuclear phagocytes, and T cells. (F) Semi-Violin plot of 5 common hallmark gene sets in rheumatoid arthritis and arthrofibrosis. (G) Venn plot of KEGG pathway of rheumatoid arthritis bulk sequencing data (RA bulk), rheumatoid arthritis single cell RNA sequencing data (RA sc), arthrofibrosis bulk sequencing data (AF bulk), and arthrofibrosis single cell RNA sequencing data (AF bulk). (H) Sankey plot of common KEGG pathway and common genes in rheumatoid arthritis single cell RNA sequencing data (RA sc), arthrofibrosis bulk sequencing data (AF bulk), and arthrofibrosis single cell RNA sequencing data (AF bulk).

Bulk sequencing analysis also revealed the similarity of RA and AF. Principal component analysis (PCA) of bulk tissue data showed RA and AF clustering closely compared to controls (Fig. 2D). The deconvolution calculation of cell type proportion confirmed the increased fibroblast cell count in both RA and AF (Fig. 2E). Changes in myofibroblast cells, mononuclear phagocytes, and T cells were consistent with single-cell data. B cells were not detected in this analysis due to the small sample size.

We also revealed that RA and AF share the key biological processes (Fig. 2F), with elevated gene sets related to inflammatory response, apical junction, and TNF-α signaling pathways. KEGG analysis figured out that they share as many as 25 common pathways (Fig. 2G). This was a relatively large proportion. The Sankey plot depicted the most important common pathways including the MAPK signaling pathway, Cytokine-cytokine receptor interaction, Toll-like receptor signaling pathway, TGF-β signaling pathway, ECM receptor interaction, GAP junction, and regulation of actin cytoskeleton (Fig. 2H). These are all important in the pathogenesis of both AF and RA.

Fibroblast subcluster analysis

Based on the previous studies22,37, fibroblasts can be further annotated into 3 subclusters including CD55 + lining fibroblasts (CD55-LLF), CD34 + sublining fibroblasts (CD34-SLF), and DKK3 positive sublining fibroblasts (DKK3-SLF) (Fig. 3A). CD55-LLF fibroblasts located in the synovium lining layer (LL). It expressed high level of CD55, PRG4, MMP3, FN1, and low level of THY1 (Fig. 3B). CD34-SLF, found in the sub-lining layer (SL), showed high expression of CD34, CXCL12, CXCL14, CCL2, and the highest level of THY1. The third fibroblast cell type, DKK3-SLF was also in sub-lining layer which expressed high levels of DKK3, POSTN, and intermediate levels of THY1. The DKK3-SLF and CD34-SLF are pathogenic cells that both increased in RA and AF (Fig. 3C).

Figure 3.

Figure 3

Fibroblast subcluster analysis. (A) Dimensional reduction and functional enrichment of fibroblast subcluster cells. (B) Marker genes of different fibroblast subcluster (C) Bar plot of different cell proportion in single cell data. (D) Violin plot of estimated cell proportion of bulk sequencing data. (E) Different cell stage proportion of fibroblast subcluster cells. (F) Pseudotime trajectory analysis of fibroblast cells. The black arrow indicates the differentiation direction of fibroblasts. (G) The signaling pattern of AF and RA.

The cell proportion estimation of bulk sequencing data supported these findings (Fig. 3D), showing higher proportions of DKK3-SLF and CD34-SLF in RA and AF compared to controls. Moreover, active proliferation of DKK3-SLF in both RA and AF was indicated by the increased proportions in S and G2M stages (Fig. 3E).

The pseudotime analysis showed a clear trajectory of CD55-LLF developed into CD34-SLF and DKK3-SLF (Fig. 3F). CD34-SLF was majorly increased in RA and DKK3-SLF in AF. Cell chat analysis also showed that DKK3-SLF and CD34-SLF are the major contributors in cell interaction (Fig. 3G). These results revealed that the pathogenic fibroblast cells were very similar in AF and RA, both in composition and behavior.

Mononuclear phagocytes subcluster analysis

The integrated mononuclear phagocytes were also divided into 7 different subclusters including conventional type 1 dendritic cells (cDC1), conventional type 2 dendritic cells (cDC2), M1-like fibrotic macrophage (M1-like F-Mφ), M2-like fibrotic macrophage (M2-like F-Mφ), interferon-stimulated macrophages (IFN-Mφ), S100A8/9 high macrophages (S100A8/9-Mφ), inflammatory macrophages (I-Mφ), immune regulated macrophages (IR-Mφ), transitional macrophages (T-Mφ) and others (Fig. 4A). The key pathogenic cell types F-Mφ and IFN-Mφ were elevated in both RA and AF (Fig. 4B). Functional annotation showed that F-Mφ were highly associated with extracellular matrix and collagen trimer (Fig. 4C), while IFN-Mφ were correlated with leukocyte migration and neutrophil activation.

Figure 4.

Figure 4

Mononuclear phagocytes subcluster analysis. (A) Dimensional reduction of mononuclear phagocytes subcluster cells. (B) Bar plot of different cell proportion in single cell data. (C) Functional enrichment of different mononuclear phagocytes subcluster cells. (D) M1/M2 polarization score of different mononuclear phagocytes subcluster cells. (E) M1 polarized cell proportion of different groups. (F) String plot of mononuclear phagocytes interaction of AF and RA. (G) Dimensional reduction of mononuclear phagocytes interaction in AF and RA.

To address the commonly used M1/M2 classification of macrophage, we calculated the M1/M2 polarization score using previously defined M1- and M2-associated marker genes38. F-Mφ and IFN-Mφ were enriched in M1 genes while other macrophage subsets such as IR-Mφ and cDC1 expressed mostly M2 genes (Fig. 4D). AF and RA exhibited similar M1 polarized cell proportions, which far exceed the control group (Fig. 4E).

We also performed cell chat analysis in the mononuclear phagocytes subcluster. The cell interaction patterns of AF and RA were also very similar to each other (Fig. 4F and G). Moreover, the interaction analysis between fibroblasts and macrophages indicated extensive crosstalk involving TGF-β, PDGF and collagen related pathways (Supplementary Fig. 1A, B). Compared with the control group, RA and AF are similar in this cell crosstalk.

T cells subcluster analysis

T cells were further annotated into 9 different subclusters including T follicular helper (Tfh), natural killer T cells (NKT), Naive CD8 + T cells, Naive CD4 + T cells, CD8 + effector T cells, CD4 + T helper cells 1 (Th1), CD4 + T helper cells 2 (Th2), CD4 + T helper cells 17(Th17), and CD4 + regulatory T cells (Treg) (Fig. 5A). T cells subcluster similarity analysis showed that the T cells in AF resembled those in RA (Fig. 5B). Th1 and Th2 were both increased in RA and AF (Fig. 5C). Cell chat analysis also revealed the similarity of cell interactions between AF and RA (Fig. 5D and E).

Figure 5.

Figure 5

T cells subcluster analysis. (A) Dimensional reduction of T cell subcluster cells. (B) Heatmap of T cells subcluster similarity analysis. (C) Bar plot of different T cells proportion in single cell data. (D) String plot of T cells interaction of AF and RA. (E) Dimensional reduction of T cells interaction in AF and RA.

Drug repositioning of AF and RA

In this study, we have developed a novel drug repositioning method based on the integration of common differential expressed genes of pathogenic cell types including DKK3-SLF, CD55-SLF, F-Mφ and IFN-Mφ. There were 395 rheumatoid arthritis related drugs and 266 fibrosis related drugs enrolled in the screening (Supplementary Table 3 and Supplementary Table 4). 15 of the rheumatoid arthritis related drugs were successfully repositioned in AF based on the matching score calculation (Supplementary Table 5). The matching score and target gene numbers of each potential drug candidate were shown in Fig. 6A. The top 6 drugs selected as the highly potential drugs (HPD) for AF treatment included Triazines, Isoxazoles, Anti-Inflammatory Agents (Non-Steroidal), Propionates, Auranofin, and Penicillamine (Fig. 6B). The matching target genes of top 6 drugs with different changes (logFc) were depicted in Fig. 6C.

Figure 6.

Figure 6

RA drugs repositioned in AF. (A) Scatter plot of matching score and target gene numbers of each drug. Red point represents the match score over 1 while the blue points were less than 1. (B) Fitting plot of highly potential drugs’ matching gene numbers changing over logFc. (C) Dot plot of match gene of 6 top screening drugs. (D) Scatter plot of supporting literature and all drug-related literatures. (E) Correlation scatter plot of literature score and match score.

To validate these drugs, we searched PubMed to get the number of literatures that support the repositioning of each drug. The literature score was calculated with the number of all literature and supporting literature shown in Fig. 6D. The correlation analysis revealed that the matching score was highly correlated with the literature score (R = 0.86, Fig. 6E). This confirmed that the matching scores were reliable criteria for drug repositioning evaluation.

Meanwhile, 16 fibrosis-related drugs were successfully repositioned in RA (Supplementary Table 6). The matching score and target gene numbers of each drug were also shown in Fig. 7A. The top 6 HPD were Emodin, Histone Deacetylase Inhibitors, Adenosine A2 Receptor Agonists, Benzamide, Azo Compounds, 2-methyl-2H-pyrazole-3-carboxylic acid (2-methyl-4-o-tolylazophenyl) amide (Fig. 7B and C). Literature studies confirmed the high correlation of matching score and literature score (Fig. 7D and E).

Figure 7.

Figure 7

Fibrosis drugs repositioned in RA. (A) Scatter plot of matching score and target gene numbers of each drug. Red point represents the match score over 1 while the blue points were less than 1. (B) Fitting plot of matching target gene numbers changing over logFc. (C) Dot plot of match gene of 6 top screening drugs. (D) Scatter plot of supporting literature and all drug-related literatures. (E) Correlation scatter plot of literature score and match score.

Discussion

This is the first study to describe a comprehensive connection between rheumatoid arthritis and arthrofibrosis, providing important insights into their pathogenesis and guides for clinical treatment.

Fibroblasts were dominant cells in both RA, AF and normal synovium44,45. Our study showed that the DKK3-SLF and CD34-SLF were the most important cell type in pathogenesis of both RA and AF, with increased proliferation in both conditions. Mizoguchi and colleagues reported that the DKK3-SLF was associated with severe synovitis and synovial hypertrophy by ultrasound19. Rapheal et al. also found a positive correlation between DKK3-SLF and clinical parameters of RA including visual analog scale (VAS), C-reactive protein level (CRP) and tender joint count (TJC)22. These may reveal that the DKK3-SLF was the key effector cells in the synovial hypertrophy. Based on the expression of chemotactic factors and inflammatory cytokines, CD34-SLF played an important role in monocyte recruitment and regulation of inflammation22.

Moreover, RA and AF show similarities in mononuclear phagocytes subcluster analysis. It was widely accepted that the M1 polarized macrophages secrete principally proinflammatory cytokines such as TNF-α and IL-1, which contribute to the disease progression both in AF and RA27,46,47. Our study confirmed that AF and RA have the relatively same M1 polarized proportion, which is far beyond the healthy control group.

Recently, many studies are focusing on the crosstalk between the synovial fibroblasts and macrophages, which might be a key to understand the mechanism of these two diseases4850. The pathogenic fibroblast cell types such as DKK3-SLF and CD34-SLF produce excessive cytokines and chemokines such as IL-6 and CCL2. This may contribute to the M1 polarization and increased TGF-β and PDGF secretion. These products amplified the inflammatory response of both macrophages and fibroblasts, which led to fibrotic pathogenesis and synovium hyperplasia. Such pathological circuits have been widely accepted in many fibrosis diseases such as lung fibrosis and may also be involved in synovial fibrosis of RA and AF as well51,52.

The novel drug repositioning method employed in this study, based on DEGs of key pathogenic cell types, offers a more precise approach to drug screening. The identified drugs have shown potential in clinical settings, with some already used for RA and AF treatment. For example, emodin was widely reported to ameliorate rheumatoid arthritis5358. Histone Deacetylase Inhibitors was believed as the new hope for rheumatoid arthritis5964. Silymarin was also reported to reduce the clinical syndromes in RA patients based on a non-randomized single-arm clinical trial65. Auranofin and penicillamine were both used for lung fibrosis and liver fibrosis prevention and treatment6672. Azathioprine and Sulfhydryl Compounds can ameliorate the idiopathic pulmonary fibrosis7376. Isoxazoles elicited an increase in adult mouse myocardial cell cycle activity while concurrently mitigating myocardial fibrosis 77. The positive correlation between matching scores and supporting literature further validates the reliability of this repositioning method.

However, several limitations should be acknowledged. First, the study primarily focuses on the transcriptomic data of the synovial microenvironment in RA and AF. These findings may not be fully generalizable to all RA and AF patients due to variations in disease presentation, patient demographics, genetic backgrounds, etc. Second, extracting data from existing literature may introduce bias if the extracted information is incomplete or skewed towards certain research trends. Lastly, although the study identifies potential therapeutic targets and drugs through computational analysis, it lacks experimental validation of these findings. Without laboratory or clinical confirmation, the practical applicability of these results remains uncertain. Future research could involve expanding the datasets, incorporating more diverse patient samples, and conducting experimental validations to confirm the computational predictions.

Conclusion

This study uncovers a previously unexplored connection between RA and AF through integrated sequencing data and text mining. We also developed a potential targets database and identified several potential drugs based on the knowledge. This work provides a new perspective on AF and RA study which enhances our understanding of the pathogenesis.

Data and code availability

These data and code can be found at: https://github.com/chenxi199506/RAandAF.

Supplementary Information

Author contributions

S.T.C. and X.C designed the experiments; X.C. and Z.Q.W. performed the experiments; X.C., C.L., and Z.Q.W. analyzed the data; X.C. wrote the manuscript; S.T.C. and Y.S.S. revised the manuscript. All authors read and approved the final manuscript.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

These authors contributed equally: Zhaoquan Wei, Xi Chen and Youshi Sun.

Supplementary Information

The online version contains supplementary material available at 10.1038/s41598-024-69080-5.

References

  • 1.Buckley, C. D. et al. Location, location, location: How the tissue microenvironment affects inflammation in RA. Nat. Rev. Rheumatol.17, 195–212 (2021). 10.1038/s41584-020-00570-2 [DOI] [PubMed] [Google Scholar]
  • 2.Qu, F., Guilak, F. & Mauck, R. L. Cell migration: Implications for repair and regeneration in joint disease. Nat. Rev. Rheumatol.15, 167–179 (2019). 10.1038/s41584-018-0151-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Margheri, F. et al. The protease systems and their pathogenic role in juvenile idiopathic arthritis. Autoimmun. Rev.18, 761–766 (2019). 10.1016/j.autrev.2019.06.010 [DOI] [PubMed] [Google Scholar]
  • 4.McGarry, T. et al. Hypoxia, oxidative stress and inflammation. Free Radic. Biol. Med.125, 15–24 (2018). 10.1016/j.freeradbiomed.2018.03.042 [DOI] [PubMed] [Google Scholar]
  • 5.Chen, L. et al. Nanomedicine strategies for anti-inflammatory treatment of noninfectious arthritis. Adv. Healthc. Mater.10, e2001732 (2021). 10.1002/adhm.202001732 [DOI] [PubMed] [Google Scholar]
  • 6.Ristic, B. et al. Pharmacologic inducers of the uric acid exporter ABCG2 as potential drugs for treatment of gouty arthritis. Asian J. Pharm. Sci.15, 173–180 (2020). 10.1016/j.ajps.2019.10.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Generali, E. et al. Nature versus nurture in the spectrum of rheumatic diseases: Classification of spondyloarthritis as autoimmune or autoinflammatory. Autoimmun. Rev.17, 935–941 (2018). 10.1016/j.autrev.2018.04.002 [DOI] [PubMed] [Google Scholar]
  • 8.Jing, W. et al. Role of reactive oxygen species and mitochondrial damage in rheumatoid arthritis and targeted drugs. Front. Immunol.14, 1107670 (2023). 10.3389/fimmu.2023.1107670 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Thirunavukarasu, A. J. et al. Large language models in medicine. Nat. Med.29(8), 1930–1940 (2023). 10.1038/s41591-023-02448-8 [DOI] [PubMed] [Google Scholar]
  • 10.Venerito, V. et al. AI am a rheumatologist: A practical primer to large language models for rheumatologists. Rheumatology62(10), 3256–3260 (2023). 10.1093/rheumatology/kead291 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Lenharo, M. ChatGPT gives an extra productivity boost to weaker writers. Nature. 10.1038/d41586-023-02270-9 (2023). 10.1038/d41586-023-02270-9 [DOI] [PubMed] [Google Scholar]
  • 12.Ueda, D. et al. ChatGPT’s diagnostic performance from patient history and imaging findings on the diagnosis please quizzes. Radiology308, e231040 (2023). 10.1148/radiol.231040 [DOI] [PubMed] [Google Scholar]
  • 13.Grünebaum, A. & Chervenak, F. A. The sky is the limit to explore ChatGPT’s impact in obstetrics & gynecology. Am. J. Obstet. Gynecol.S0002–9378(23), 00465–00469 (2023). [DOI] [PubMed] [Google Scholar]
  • 14.Desaire, H. et al. Distinguishing academic science writing from humans or ChatGPT with over 99% accuracy using off-the-shelf machine learning tools. Cell Rep. Phys. Sci.4, 101426 (2023). 10.1016/j.xcrp.2023.101426 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Mika, A. P. et al. Assessing ChatGPT responses to common patient questions regarding total hip arthroplasty. J. Bone Joint Surg. Am.105, 1519–1526 (2023). 10.2106/JBJS.23.00209 [DOI] [PubMed] [Google Scholar]
  • 16.Alpert, J. S. & Chen, Q. M. Pharmacogenomics of statins: A view from ChatGPT. Am. J. Med.S0002–9343(23), 00430–00438 (2023). [DOI] [PubMed] [Google Scholar]
  • 17.Yu, G. et al. GOSemSim: An R package for measuring semantic similarity among GO terms and gene products. Bioinformatics26, 976–978 (2010). 10.1093/bioinformatics/btq064 [DOI] [PubMed] [Google Scholar]
  • 18.Stephenson, W. et al. Single-cell RNA-seq of rheumatoid arthritis synovial tissue using low-cost microfluidic instrumentation. Nat. Commun.9, 791 (2018). 10.1038/s41467-017-02659-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Mizoguchi, F. et al. Functionally distinct disease-associated fibroblast subsets in rheumatoid arthritis. Nat. Commun.9, 789 (2018). 10.1038/s41467-018-02892-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Zhang, F. et al. Defining inflammatory cell states in rheumatoid arthritis joint synovial tissues by integrating single-cell transcriptomics and mass cytometry. Nat. Immunol.20, 928–942 (2019). 10.1038/s41590-019-0378-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Alivernini, S. et al. Distinct synovial tissue macrophage subsets regulate inflammation and remission in rheumatoid arthritis. Nat. Med.26, 1295–1306 (2020). 10.1038/s41591-020-0939-8 [DOI] [PubMed] [Google Scholar]
  • 22.Micheroli, R. et al. Role of synovial fibroblast subsets across synovial pathotypes in rheumatoid arthritis: A deconvolution analysis. RMD Open8, e001949 (2022). 10.1136/rmdopen-2021-001949 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Hao, Y. et al. Integrated analysis of multimodal single-cell data. Cell184(3573–3587), e29 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Korsunsky, I. et al. Fast, sensitive, and accurate integration of single cell data with Harmony. Nat. Methods16, 1289–1296 (2019). 10.1038/s41592-019-0619-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Clarke, Z. A. et al. Tutorial: guidelines for annotating single-cell transcriptomic maps using automated and manual methods. Nat. Protoc.16, 2749–2764 (2021). 10.1038/s41596-021-00534-0 [DOI] [PubMed] [Google Scholar]
  • 26.Zhang, X. et al. Cell marker: A manually curated resource of cell markers in human and mouse. Nucleic Acids Res.47, D721–D728 (2019). 10.1093/nar/gky900 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Bayram, B. et al. Molecular pathology of human knee arthrofibrosis defined by RNA sequencing. Genomics112, 2703–2712 (2020). 10.1016/j.ygeno.2020.03.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Woetzel, D. et al. Identification of rheumatoid arthritis and osteoarthritis patients by transcriptome-based rule set generation. Arthritis Res. Ther.16, R84 (2014). 10.1186/ar4526 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Huber, R. et al. Identification of intra-group, inter-individual, and gene-specific variances in mRNA expression profiles in the rheumatoid arthritis synovial membrane. Arthritis Res. Ther.10, R98 (2008). 10.1186/ar2485 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Ritchie, M. E. et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res.43, e47 (2015). 10.1093/nar/gkv007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Wang, X. et al. Bulk tissue cell type deconvolution with multi-subject single-cell expression reference. Nat. Commun.10, 380 (2019). 10.1038/s41467-018-08023-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Wu, T. et al. clusterProfiler 4.0: A universal enrichment tool for interpreting omics data. Innovation2, 100141 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Andreatta, M. & Carmona, S. J. UCell: Robust and scalable single-cell gene signature scoring. Comput. Struct. Biotechnol. J.19, 3796–3798 (2021). 10.1016/j.csbj.2021.06.043 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Jin, S. et al. Inference and analysis of cell-cell communication using cell chat. Nat. Commun.12, 1088 (2021). 10.1038/s41467-021-21246-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Luo, Z. H. et al. DSEATM: drug set enrichment analysis uncovering disease mechanisms by biomedical text mining. Brief. Bioinform.23, bbac228 (2022). 10.1093/bib/bbac228 [DOI] [PubMed] [Google Scholar]
  • 36.Patil, I. Visualizations with statistical details: The ‘ggstatsplot’ approach. J. Open Source Softw.6, 3167 (2021). 10.21105/joss.03167 [DOI] [Google Scholar]
  • 37.Korsunsky, I. et al. Cross-tissue, single-cell stromal atlas identifies shared pathological fibroblast phenotypes in four chronic inflammatory diseases. Med3, 481–518 (2022). 10.1016/j.medj.2022.05.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Martinez, F. O. et al. Transcriptional profiling of the human monocyte-to-macrophage differentiation and polarization: New molecules and patterns of gene expression. J. Immunol.177, 7303–7311 (2006). 10.4049/jimmunol.177.10.7303 [DOI] [PubMed] [Google Scholar]
  • 39.Olsson-White, D. A. et al. Assessment of liver fibrosis markers in people with rheumatoid arthritis on methotrexate. Intern. Med. J.52, 566–573 (2022). 10.1111/imj.15125 [DOI] [PubMed] [Google Scholar]
  • 40.Ntusi, N. A. B. et al. Diffuse myocardial fibrosis and inflammation in rheumatoid arthritis: Insights from CMR T1 mapping. JACC Cardiovasc. Imaging8, 526–536 (2015). 10.1016/j.jcmg.2014.12.025 [DOI] [PubMed] [Google Scholar]
  • 41.Katona, K. et al. Image analysis of fibrosis in labial salivary glands of patients with systemic autoimmune diseases. Close correlation of lobular fibrosis to seropositive rheumatoid arthritis and increased anti-CCP and RF titres in the serum. Pathology (Phila.)50, 418–425 (2018). [DOI] [PubMed] [Google Scholar]
  • 42.Matsuo, T. & Matsuo, N. Progressive subretinal fibrosis in patients with rheumatoid arthritis and renal dysfunction. Ophthalmologica212, 289–294 (1998). 10.1159/000027310 [DOI] [PubMed] [Google Scholar]
  • 43.Doyen, V. et al. Rheumatoid arthritis and cystic fibrosis. Rev. Mal. Respir.22, 667–671 (2005). 10.1016/S0761-8425(05)85617-7 [DOI] [PubMed] [Google Scholar]
  • 44.Schiavone Panni, A. et al. Stiffness in total knee arthroplasty. J. Orthop. Traumatol.10, 111–118 (2009). 10.1007/s10195-009-0054-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Smith, M. D. The normal synovium. Open Rheumatol. J.5, 100–106 (2011). 10.2174/1874312901105010100 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Tardito, S. et al. Macrophage M1/M2 polarization and rheumatoid arthritis: A systematic review. Autoimmun. Rev.18, 102397 (2019). 10.1016/j.autrev.2019.102397 [DOI] [PubMed] [Google Scholar]
  • 47.Usher, K. M. et al. Pathological mechanisms and therapeutic outlooks for arthrofibrosis. Bone Res.7, 9 (2019). 10.1038/s41413-019-0047-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Knab, K., Chambers, D. & Krönke, G. Synovial macrophage and fibroblast heterogeneity in joint homeostasis and inflammation. Front. Med.9, 862161 (2022). 10.3389/fmed.2022.862161 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Kemble, S. & Croft, A. P. Critical role of synovial tissue-resident macrophage and fibroblast subsets in the persistence of joint inflammation. Front. Immunol.12, 715894 (2021). 10.3389/fimmu.2021.715894 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Cheng, L. et al. New insights from single-cell sequencing data: Synovial fibroblasts and synovial macrophages in rheumatoid arthritis. Front. Immunol.12, 709178 (2021). 10.3389/fimmu.2021.709178 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Xu, Q. et al. The role of macrophage-fibroblast interaction in lipopolysaccharide-induced pulmonary fibrosis: an acceleration in lung fibroblast aerobic glycolysis. Lab. Investig. J. Tech. Methods Pathol.102, 432–439 (2022). 10.1038/s41374-021-00701-7 [DOI] [PubMed] [Google Scholar]
  • 52.Li, Y. et al. S100A4+ macrophages are necessary for pulmonary fibrosis by activating lung fibroblasts. Front. Immunol.9, 1776 (2018). 10.3389/fimmu.2018.01776 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Xiang, M. X. et al. Emodin-8-O-β-D-glucoside from Polygonum amplexicaule D. Don var. Sinense Forb. promotes proliferation and differentiation of osteoblastic MC3T3-E1 cells. Mol. Basel Switz.16, 728–737 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Luo, S. et al. Emodin ameliorates ulcerative colitis by the flagellin-TLR5 dependent pathway in mice. Int. Immunopharmacol.59, 269–275 (2018). 10.1016/j.intimp.2018.04.010 [DOI] [PubMed] [Google Scholar]
  • 55.Hwang, J. K. et al. Emodin suppresses inflammatory responses and joint destruction in collagen-induced arthritic mice. Rheumatol. Oxf. Engl.52, 1583–1591 (2013). 10.1093/rheumatology/ket178 [DOI] [PubMed] [Google Scholar]
  • 56.Hashiguchi, M. et al. Effect of aloe-emodin on the proliferation and apoptosis of human synovial MH7A cells; a comparison with methotrexate. Mol. Med. Rep.15, 4398–4404 (2017). 10.3892/mmr.2017.6541 [DOI] [PubMed] [Google Scholar]
  • 57.Ha, M. K. et al. Emodin inhibits proinflammatory responses and inactivates histone deacetylase 1 in hypoxic rheumatoid synoviocytes. Biol. Pharm. Bull.34, 1432–1437 (2011). 10.1248/bpb.34.1432 [DOI] [PubMed] [Google Scholar]
  • 58.Cheng, L., Chen, J. & Rong, X. Mechanism of emodin in the treatment of rheumatoid arthritis. Evid. Based Complement. Altern. Med. ECAM2022, 9482570 (2022). 10.1155/2022/9482570 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Cantley, M. D. et al. Histone deacetylase inhibitors as suppressors of bone destruction in inflammatory diseases. J. Pharm. Pharmacol.64, 763–774 (2012). 10.1111/j.2042-7158.2011.01421.x [DOI] [PubMed] [Google Scholar]
  • 60.Choo, Q. Y., Ho, P. C. & Lin, H. S. Histone deacetylase inhibitors: New hope for rheumatoid arthritis?. Curr. Pharm. Des.14, 803–820 (2008). 10.2174/138161208784007699 [DOI] [PubMed] [Google Scholar]
  • 61.Choo, Q. Y. et al. The histone deacetylase inhibitors MS-275 and SAHA suppress the p38 mitogen-activated protein kinase signaling pathway and chemotaxis in rheumatoid arthritic synovial fibroblastic E11 cells. Mol. Basel Switz.18, 14085–14095 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Choo, Q.-Y. et al. Histone deacetylase inhibitors MS-275 and SAHA induced growth arrest and suppressed lipopolysaccharide-stimulated NF-kappaB p65 nuclear accumulation in human rheumatoid arthritis synovial fibroblastic E11 cells. Rheumatol. Oxf. Engl.49, 1447–1460 (2010). 10.1093/rheumatology/keq108 [DOI] [PubMed] [Google Scholar]
  • 63.Grabiec, A. M. et al. Histone deacetylase inhibitors suppress inflammatory activation of rheumatoid arthritis patient synovial macrophages and tissue. J. Immunol.2010(184), 2718–2728 (1950). [DOI] [PubMed] [Google Scholar]
  • 64.Grabiec, A. M. et al. Histone deacetylase inhibitors suppress rheumatoid arthritis fibroblast-like synoviocyte and macrophage IL-6 production by accelerating mRNA decay. Ann. Rheum. Dis.71, 424–431 (2012). 10.1136/ard.2011.154211 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Shavandi, M. et al. Silymarin (Livergol®) decreases disease activity score in patients with rheumatoid arthritis: A non-randomized single-arm clinical trial. Iran. J. Allergy Asthma Immunol.16, 99–106 (2017). [PubMed] [Google Scholar]
  • 66.Hu, M. et al. Deubiquitinase inhibitor auranofin attenuated cardiac hypertrophy by blocking NF-κB activation. Cell. Physiol. Biochem.45, 2421–2430 (2018). 10.1159/000488230 [DOI] [PubMed] [Google Scholar]
  • 67.Kim, H. Y. et al. Auranofin prevents liver fibrosis by system Xc-mediated inhibition of NLRP3 inflammasome. Commun. Biol.4, 824 (2021). 10.1038/s42003-021-02345-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Lee, S. M. et al. Auranofin attenuates hepatic steatosis and fibrosis in non-alcoholic fatty liver disease via NRF2 and NF-κB signaling pathways. Clin. Mol. Hepatol.28(4), 827 (2022). 10.3350/cmh.2022.0068 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Maydaniuk, D. et al. New auranofin analogs with antibacterial properties against Burkholderia clinical isolates. Antibiot. Basel Switz.10, 1443 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Roy, D. N. et al. Combination therapy with andrographolide and d-penicillamine enhanced therapeutic advantage over monotherapy with d-penicillamine in attenuating fibrogenic response and cell death in the periportal zone of liver in rats during copper toxicosis. Toxicol. Appl. Pharmacol.250, 54–68 (2011). 10.1016/j.taap.2010.09.027 [DOI] [PubMed] [Google Scholar]
  • 71.Kazemi, K. et al. Effect of D-penicillamine on liver fibrosis and inflammation in Wilson disease. Exp. Clin. Transplant. Off. J. Middle East Soc. Organ Transplant.6, 261–263 (2008). [PubMed] [Google Scholar]
  • 72.Selman, M. et al. Colchicine, D-penicillamine, and prednisone in the treatment of idiopathic pulmonary fibrosis: A controlled clinical trial. Chest114, 507–512 (1998). 10.1378/chest.114.2.507 [DOI] [PubMed] [Google Scholar]
  • 73.Broen, J. C. A. & van Laar, J. M. Mycophenolate mofetil, azathioprine and tacrolimus: Mechanisms in rheumatology. Nat. Rev. Rheumatol.16, 167–178 (2020). 10.1038/s41584-020-0374-8 [DOI] [PubMed] [Google Scholar]
  • 74.Izumi, S., Iikura, M. & Hirano, S. Prednisone, azathioprine, and N-acetylcysteine for pulmonary fibrosis. N. Engl. J. Med.367, 870 (2012). [DOI] [PubMed] [Google Scholar]
  • 75.Wong, A. W. et al. Prescribing patterns and tolerability of mycophenolate and azathioprine in patients with nonidiopathic pulmonary fibrosis fibrotic interstitial lung disease. Ann. Am. Thorac. Soc.19, 863–867 (2022). 10.1513/AnnalsATS.202108-954RL [DOI] [PubMed] [Google Scholar]
  • 76.Yildirim, Z. et al. Attenuation of bleomycin-induced lung fibrosis by oral sulfhydryl containing antioxidants in rats: Erdosteine and N-acetylcysteine. Pulm. Pharmacol. Ther.18, 367–373 (2005). 10.1016/j.pupt.2005.02.001 [DOI] [PubMed] [Google Scholar]
  • 77.Russell, J. L. et al. Targeting native adult heart progenitors with cardiogenic small molecules. ACS Chem. Biol.7, 1067–1076 (2012). 10.1021/cb200525q [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

These data and code can be found at: https://github.com/chenxi199506/RAandAF.


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES