Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2025 Mar 24.
Published in final edited form as: Sci Transl Med. 2024 Apr 10;16(742):eadk3506. doi: 10.1126/scitranslmed.adk3506

Synovial fibroblast gene expression is associated with sensory nerve growth and pain in rheumatoid arthritis

Zilong Bai 1, Nicholas Bartelo 1, Maryam Aslam 2, Elisabeth A Murphy 2, Caryn R Hale 2,3, Nathalie E Blachere 2,4, Salina Parveen 2, Edoardo Spolaore 5, Edward DiCarlo 5, Ellen M Gravallese 6, Melanie H Smith 5; Accelerating Medicines Partnership RA/SLE Network†,, Mayu O Frank 2, Caroline S Jiang 2, Haotan Zhang 1, Christina Pyrgaki 2, Myles J Lewis 7, Shafaq Sikandar 7, Costantino Pitzalis 7,8, Joseph B Lesnak 9, Khadijah Mazhar 9, Theodore J Price 9, Anne-Marie Malfait 10, Rachel E Miller 10, Fan Zhang 11, Susan Goodman 5, Robert B Darnell 2,4, Fei Wang 1,*, Dana E Orange 2,5,*
PMCID: PMC11931728  NIHMSID: NIHMS2056864  PMID: 38598614

Abstract

It has been presumed that rheumatoid arthritis (RA) joint pain is related to inflammation in the synovium; however, recent studies reveal that pain scores in patients do not correlate with synovial inflammation. We developed a machine-learning approach (graph-based gene expression module identification or GbGMI) to identify an 815-gene expression module associated with pain in synovial biopsy samples from patients with established RA who had limited synovial inflammation at arthroplasty. We then validated this finding in an independent cohort of synovial biopsy samples from patients who had early untreated RA with little inflammation. Single-cell RNA sequencing analyses indicated that most of these 815 genes were most robustly expressed by lining layer synovial fibroblasts. Receptor-ligand interaction analysis predicted cross-talk between human lining layer fibroblasts and human dorsal root ganglion neurons expressing calcitonin gene–related peptide (CGRP+). Both RA synovial fibroblast culture supernatant and netrin-4, which is abundantly expressed by lining fibroblasts and was within the GbGMI-identified pain-associated gene module, increased the branching of pain-sensitive murine CGRP+ dorsal root ganglion neurons in vitro. Imaging of solvent-cleared synovial tissue with little inflammation from humans with RA revealed CGRP+ pain-sensing neurons encasing blood vessels growing into synovial hypertrophic papilla. Together, these findings support a model whereby synovial lining fibroblasts express genes associated with pain that enhance the growth of pain-sensing neurons into regions of synovial hypertrophy in RA.

INTRODUCTION

Inflammatory pain can be driven by cytokines, bradykinins, and prostanoids, which bind specific receptors on primary nociceptor neurons to cause heightened sensation of pain (1). However, pain is not always proportional to inflammation, and clinical scenarios in which pain is dissociated from inflammation are useful to study the noninflammatory drivers of pain.

Rheumatoid arthritis (RA) is a chronic disease characterized by inflammation in the synovium, the tissue that lines the joint cavity. Remarkable progress has been made in developing an array of conventional synthetic, targeted synthetic, and biologic disease-modifying antirheumatic drugs (csDMARDs, tsDMARDs, and bDMARDs, respectively), which target relevant immune mediators (2). However, up to 20% of patients with RA are “difficult to treat”; that is, they do not improve despite treatment with at least two bDMARDs or tsDMARDs, with different mechanisms of action, after failing a csDMARD (3, 4). It has been assumed that synovial inflammation is the cause of RA joint pain. However, recent studies have revealed that pain can be dissociated from inflammation in RA (38). Patients with RA and limited synovial inflammation, also known as “fibroid,” “low inflammatory,” “pauci-immune,” or “fibroblast cell type abundance phenotype” synovium, have as much pain as those with extreme inflammation (3, 4, 913). Patients with low synovial inflammation tend to receive less benefit from treatment with anti-inflammatory drugs such as tumor necrosis factor inhibitors and DMARDs (12, 14).

Here, we hypothesized that a focused analysis of low inflammatory synovium might identify factors beyond inflammation that relate to joint pain. One challenge is that patient-reported outcome data, such as report of pain severity, are notoriously noisy, and tissue transcriptomic data are still relatively expensive, with only tens to hundreds of samples available for most studies. Given these limitations, existing analytic approaches are generally not sufficiently powered to identify gene modules that relate to patient reported outcomes. To address this, we developed a machine-learning approach, called graph-based gene expression module identification (GbGMI), to uncover the relationship between gene expression and pain. We then sought to determine which cells express pain-associated genes and how they might influence nociceptors in vitro and to validate these findings by visualizing neurons in low inflammatory RA synovium in vivo.

RESULTS

Pain is not related to inflammation in patients with RA with low inflammatory synovium

We categorized patients as high or low inflammatory using our previously reported histology scoring algorithm (9). RA pain scores were not different between patients with high and low inflammatory synovium (Fig. 1A). Pain scores were associated with the grade of synovial inflammation as measured by the density of cells per unit of tissue (in cells per square millimeter) in patients with high inflammatory synovium but not in patients with low inflammatory synovium (Fig. 1B).

Fig. 1. Pain is related to synovial inflammation in patients with RA with high, but not low, synovial inflammation.

Fig. 1.

(A) RA pain scores are shown compared to synovial tissue inflammatory classification in n=139 patients. (B) RA pain scores are shown according to cell density (in cells per square millimeter) of H&E (hematoxylin and eosin)–stained synovial tissue, in samples classified as high (n=35,r=0.40,P=0.048) or low inflammatory (n=104,r=0.08,P=0.53). ns, not significant in Mann-Whitney test. r= Spearman’s rank correlation coefficient. P= two-tailed P value.

We tested for genes that were highly associated with pain using the usual RNA sequencing (RNA-seq) analysis platform, limma (15). We failed to identify any individual genes that were correlated with pain, suggesting that the relationship of gene expression with pain could be multifactorial or nonlinear. We next hypothesized that there might be groups of genes whose expression varies in association with pain. We developed an iterative machine-learning GbGMI computational framework to uncover a group of genes whose expression is correlated with a given univariate clinical feature. GbGMI was given a multimodal input comprising a gene expression matrix M for m genes and n patients and an n-dimensional clinical feature vector a. GbGMI first calculates the patient-to-patient similarity structure according to the given clinical feature and compares that to the gene expressions using the Laplacian score. GbGMI then determines the optimal number of genes that together associate with the clinical feature through statistical tests between the t-distributed stochastic neighborhood embedding (t-SNE)–based summary scores of the selected genes and this clinical feature (see Materials and Methods and fig. S1).

GbGMI correctly identifies genes associated with synovial inflammation

We sought to test the validity of the approach by testing whether it would correctly identify genes known to be associated with inflammation as measured by cell density, which is highly associated with many individual genes as measured by limma (15). GbGMI identified a module of 2713 genes whose gene expression summary score correlates with synovial tissue cell density (Fig. 2, A to D). The positive control for this analysis was principal component one (PC1) of bulk synovial RNA-seq gene expression data, which was previously shown to associate with the extent of synovial inflammation and highly correlate with synovial cell density (16). The negative control was a gene expression summary score for a group of the top 5000 most variably expressed genes. As expected, the gene expression summary score of the top 5000 most variably expressed genes was not significantly correlated with synovial histologic cell density (P=0.21) (Fig. 2E), whereas PC1 scores of gene expression were significantly correlated with synovial histologic cell density (Spearman ρ=0.4, P=0.01) (Fig. 2F). The gene expression summary score of the GbGMI module of 2713 genes had a further improved correlation to synovial histologic cell density (Spearman ρ=0.59,P=0.0001) (Fig. 2G). This analysis indicated that GbGMI was a useful method that outperformed principal components analysis (PCA) in identifying a module of genes that associate with the grade of synovial inflammation measured by cell density.

Fig. 2. A synovial gene expression signature that correlates with synovial histologic cell density was identified using GbGMI or PCA.

Fig. 2.

(A) An expression heatmap is shown of the top 5000 most variably expressed genes in 38 patients. Gene expression amounts (rows) are represented as z scores for all patients. Patients (columns) are sorted by their mean nuclei densities. (B) Shown is a similarity matrix of synovial histologic cell densities. (C) Laplacian scores are shown for the top 5000 most variably expressed genes measuring how their expression varied compared with synovial histologic cell density similarity structure. Each dot represents a gene, sorted by Laplacian score in ascending order. (D) Shown is the −log(P value) of the correlation of the top k–ranked groups of genes with nucleus density similarity structure. Each dot represents a group of genes. (E) Synovial histologic cell density is shown according to PC1 score of the top 5000 most variably expressed genes in 38 patients (Spearman ρ=0.21, P=0.21). (F) Synovial histologic cell density is shown according to the summary score of the 5000 genes over the 38 patients (Spearman ρ=0.4, P=0.01). (G) Synovial histologic cell density is shown according to the summary score of the 2713 GbGMi-identified genes over the 38 patients (Spearman ρ=0.59, P=0.0001). Statistics presented in (E) to (G) indicate Spearman correlation coefficient and P value.

GbGMI identifies pain-associated synovial gene expression in patients with established RA

We next applied GbGMI to define a module of genes associated with pain in patients with low inflammatory synovium. Most of the 6582 genes that distinguish high and low inflammatory synovium are increased in high inflammatory synovium and are enriched for pathways representing infiltrating immune cells. To uncover genes associated with pain but not inflammation, we focused our analysis on 2227 genes that exhibited increased expression in low inflammatory synovium relative to high inflammatory synovium (9) and on pain scores that document the extent of pain in the joint that was sampled [Hip Osteoarthritis Outcome Score/Knee Osteoarthritis Outcome Score (HOOS/KOOS)] (Fig. 3A). The patient-reported pain scores a were transformed into a matrix of pairwise similarity scores between patients S (Fig. 3B). We next calculated the Laplacian score (17) for each of the 2227 low inflammatory genes on the basis of its expression values (that is, a row vector in M) and S (Fig. 3C). We then tested which number of top-ranked genes collectively best correlated with pain among patients with RA with low synovial inflammation and identified an 815-gene module, which we refer to as the GbGMI-identified pain-associated genes (Fig. 3D). Although the summary score of all 2227 low inflammatory genes did not correlate with pain, summary scores of the GbGMI-identified pain-associated genes were significantly correlated with the patient-reported HOOS/KOOS pain in patients with low inflammatory synovium (P=0.001) (Fig. 3E). This correlation was not as pronounced when including all patients with RA irrespective of inflammatory subset (Fig. 3F). Similar correlations were identified when the GbGMI-identified pain-associated genes were compared to Visual Analog Score (VAS) report of pain (fig. S2). We conducted sensitivity analysis on GbGMI using 22 patient subsets subsampled with a leave-one-out strategy. Despite the relatively small sample size, GbGMI demonstrated robustness; on these subsampled data, GbGMI yielded pain-association gene rankings consistent with the original ranking based on the overall 22 patients with low inflammation (smallest ρ=0.75, largest P=0, Spearman’s test; fig. S3A) and identified pain-associated gene modules significantly associated with the original 815-gene module (largest P=1.22×107, Fisher’s exact test; fig. S3B, last row and column).

Fig. 3. GbGMI identified a pain-associated synovial gene expression in patients with established and early RA.

Fig. 3.

(A) An expression heatmap is shown of 2227 genes with increased expression in low inflammatory synovium. Patients (columns) were grouped by inflammatory amounts: high (n=12) versus low (n=22, low and mixed inflammatory subtypes identified in (9). (B) Similarity structure is shown of HOOS/KOOS pain scores over patients. (C) Laplacian scores of the input 2227 genes measure how their expression amounts over patients relate to the pain score–based similarity structure. Each dot represents a gene sorted by Laplacian score in ascending order. (D) Significance of the correlation between the top k–ranked genes and HOOS/KOOS pain scores is shown. Each dot represents a group of genes. (E) Established RA HOOS/KOOS pain score according to the summary score of the 815 GbGMi-identified genes in low inflammatory samples (Kendall τ=0.5, P=0.001). (F) Comparison between the same pair of scores as (E) but for all samples, irrespective of inflammatory status (Kendall τ=0.23, P=0.06). (G) Early RA VAS pain scores are shown according to gene expression summary score of 738 GbGMi-identified pain-associated genes in patients with low inflammatory (fibroid or unassigned) synovial pathotype (Kendall τ=0.35,P=0.02). (H) Comparison between the same pair of scores as (G) but for all patients, irrespective of synovial pathotype (Kendall τ=0.15, P=0.04). z score was calculated from gene expression values over patients. Statistics presented in (E) to (H) indicate Kendall correlation coefficient and P value.

GbGMI-identified pain-associated synovial genes are also associated with pain in patients with early RA

Overfitting is a concern in using a graph-based machine learning approach to identifying groups of genes that associate with pain. It is possible that the GbGMI-identified pain-associated genes correlated with pain in the dataset in which they were found but not in other external datasets. We sought to test whether the pain-associated gene module identified in patients with established disease was also associated with pain in a second, independent Pathobiology of Early Arthritis Cohort (PEAC) dataset (11) of synovial biopsy samples from patients with early (mean of 6 months of symptoms) untreated RA. A total of 2018 of the 2227 low inflammatory genes and 738 of the 815 pain-associated genes found in the established RA dataset were also detected in this dataset. The 738 pain-associated genes were also correlated with VAS pain in patients with early RA with low inflammatory (fibroid or undefined) synovium (Fig. 3G). However, the 2018 low inflammatory genes were not. In this early RA cohort, the 738 genes were also associated with pain when samples from all patients were included as well, irrespective of synovial inflammatory subset (fibroid, undefined, myeloid, and lymphoid), although the association was again not as robust as was seen in those with low inflammatory synovium (Fig. 3H). The range of GbGMI summary scores decreased when all samples were included (Fig. 3, G and H). The association of the GbGMI-identified genes with pain was robust in the low inflammatory samples but persisted even when all patients were included, suggesting that these genes may play a role, albeit less pronounced, in pain in high inflammatory synovium, where inflammatory mediators are likely to also contribute.

GbGMI-identified pain-associated genes are enriched with neurogenesis pathways and predominantly expressed by synovial fibroblasts

We next sought to understand the biological meaning and the direction of the association of the 815 GbGMI genes with pain in patients with RA with low synovial inflammation. Limma was performed to detect genes whose expression correlated with pain. Genes were ranked by limma according to this correlation. Although limma did not identify any significant [false discovery rate (FDR) < 0.05] individual genes correlated with pain (fig. S4A), as a group, expression of the 815 GbGMI-identified pain genes was significantly decreased as the HOOS/KOOS pain score increased (adjusted P=7.38×1012, Kolmogorov-Smirnov test) (fig. S4, B and C). This indicated a positive correlation with pain severity. The 815 pain-associated genes were enriched for pathways such as nervous system development, neurogenesis, and neuron differentiation (Fig. 4A) and included ephrin (EPHA3, EPHA6, and EPHA7) and semaphorin (SEMA3B, SEMA3E, SEMA4C, SEMA5A, and SEMA6D) family members. The 1412 nonpain-associated genes included CD55, PRG4, CSPG4, and MERTK, genes known to be involved in the normal functions of lining macrophages and fibroblasts (18). These genes were enriched in molecular function and ribosomal RNA (rRNA) processing but not neuron axonal growth pathways (Fig. 4A and data file S1). We next examined which cells expressed the GbGMI pain-associated genes. We compared expression in both bulk RNA-seq data from sorted cell types, which offers high-depth coverage of RNA but less cell type resolution, and single-cell RNA-seq (scRNA-seq) data, which offers higher cell type resolution to cell subtypes but less depth of coverage, from the Accelerating Medicines Partnership dataset (19). Comparison of the pain-associated genes across sorted bulk synovial B cells, fibroblasts, monocytes, and T cells indicated that the fibroblasts exhibited the highest expression of pain-associated genes (Fig. 4, B and C). We reasoned that pain-associated genes might be more robustly expressed in fibroblasts because of a relative enrichment in fibroblasts, compared with immune cells, in low inflammatory samples. However, when looking only at fibroblasts, the pain-associated genes were increased compared with the nonpain-associated genes (Fig. 4B). The differences in pathways enriched in pain-associated and nonpain-associated genes as well as the difference in relative expression within fibroblasts indicated that the GbGMI method did not select a random group of fibroblast genes. Further analysis of the scRNA-seq dataset also confirmed that the fibroblast subsets exhibited the highest expression of pain-associated genes (Fig. 4D). Gene expression analysis among fibroblast subsets indicated that, compared with the other fibroblast subsets, lining CD55+ fibroblasts (SC-F4) exhibited the highest expression of GbGMI-identified pain-associated genes (Fig. 4E).

Fig. 4. The GbGMI-identified pain-associated gene signature is expressed by synovial lining layer fibroblasts.

Fig. 4.

(A) g:Profiler pathway enrichment analysis is shown of 815 pain-associated genes and 1412 nonpain-associated low inflammatory genes. ncRNA, long noncoding RNA; miRNA, microRNA. (B) Mean gene expression z scores of pain-associated genes and nonpain-associated genes detected in sorted bulk B cells (CD45+, CD3, and CD19+), fibroblasts (CD45, CD31, and PDPN+), monocytes (CD45+ and CD14+), and T cells (CD45+ and CD3+) are shown. z score is calculated on the basis of TPM normalized counts. ***P < 0.001. (C) Per-sample gene expression z scores of 769 pain-associated genes detected in sorted cell types from (B) are shown. (D) An expression heatmap is shown of 797 pain genes with nonzero variance in expression values across a subset (n=4354) of RA synovial cells (SC) in 18 unique cell populations (of B cells, SC-B1 to SC-B4; fibroblasts, SC-F1 to SC-F4; monocytes, SC-M1 to SC-M4; and T cells, SC-T1 to SC-T6), which were identified from the 5265 scRNA-seq profiles by an integrated analysis based on canonical-correlation analysis (CCA) from the Accelerating Medicine Partnership (19). z score is calculated on the basis of log2(CPM + 1)–transformed UMI counts over the RA synovial cells. (E) Volcano plots of 794 pain genes in scRNA-seq profiles (ImmPort accession #SdY998) (19) with nonzero variance in expression values across the subset (n=1532) of RA synovial fibroblasts in three sublining subsets, CD34+ (SC-F1), HLA-DRAhi (SC-F2), and DKK3+ (SC-F3), and one lining subset (SC-F4) are shown. Each volcano plot shows the differential expression analysis (using Seurat function FindMarkers) of the genes in each RA synovial fibroblast subtype compared with the other three, where x axis shows log2(fold change) and y-axis −log(adjusted P value). The significantly increased genes are red, significantly decreased genes are blue, and nonsignificantly differentially expressed genes are gray. The horizontal and vertical red lines respectively indicate the threshold of significance [log(adjustedP=0.05)] and the separation threshold between increased and decreased gene expression log2(fold change) = 0.

Ligand-receptor analysis predicts interactions between lining fibroblasts and human dorsal root ganglion nociceptors

Given that the pain-associated genes were enriched in neuron axonal growth pathways, we next explored predicted interactions of pain-associated synovial fibroblast genes with dorsal root ganglion (DRG) neurons likely to innervate the joint. We performed receptor-ligand interaction analysis to identify predicted receptor-ligand pairs using the pain-associated genes expressed by four synovial fibroblast subtypes in human RA synovial tissue and genes expressed in a human DRG (hDRG) single-nucleus RNA-seq (snRNA-seq) dataset (Fig. 5A) (20). Lining fibroblasts (SC-F4) were predicted to have the highest number of ligand-receptor interactions (39 SC-F4 ligands to hDRG receptors) (fig. S5A). Lining fibroblasts (SC-F4) were predicted to interact with several CGRP+ peptidergic and nonpeptidergic neuron subtypes (fig. S5, B and C, and data file S2). Comparison of the expression of 14 ligand or receptor encoding pain-associated genes of SC-F4 revealed a gradient of pain-associated genes that were relatively lowly expressed in SC-F1 cells and most highly expressed in SC-F4 cells, with HBEGF, CTGF, and NTN4 among the most robustly expressed (Fig. 5B).

Fig. 5. Filtering on synovial fibroblast genes predicted to influence DRG sensory neurons.

Fig. 5.

(A) Shown are predicted ligand-receptor interactions between synovial fibroblasts and neurons in the human DRG (hDRG). The circos plot shows the two unidirectional interactomes between the four synovial fibroblast subtypes and 10 hDRG neuron subtypes. The outermost layer indicates the RA synovial fibroblast subtypes of the cells (in colored squares) or hDRG neuron subtypes (in colored round dots) expressing corresponding ligand or receptor genes. The middle layer shows whether a gene is ligand coding or receptor coding in its associated interactions. The inner layer contains gene names. The two tissue-wise directions are distinguished by the colors of connections between gene names. The number of connections associated with the ligand/receptor genes in each fibroblast subtype or neuron subtype and those in each unidirectional tissue-wise relation are summarized in the corresponding legends. (B) An expression heatmap is shown of 14 pain-associated ligand/receptor encoding marker genes of synovial lining fibroblast (SC-F4) cells with nonzero variance in expression values across a subset (n=4354) of RA synovial cells in 18 unique cell populations (of B cells, SC-B1 to SC-B4; fibroblasts, SC-F1 to SC-F4; monocytes, SC-M1 to SC-M4; and T cells, SC-T1 to SC-T6), which were identified from the 5265 scRNA-seq profiles by an integrated analysis based on CC A from the Accelerating Medicine Partnership (19). z score was calculated using log2(CPM + 1) transformed UMI counts over the RA synovial cells. The genes are ranked top-down by their log fold change in differential expression analysis (DEA) of lining fibroblast (SC-F4) versus other fibroblasts. (C) A Sankey plot is shown of top 25 unique fibroblast to hDRG ligand-receptor interactions from the 815 GbGMI pain-associated genes ranked by Laplacian score value. (D) A chord plot is shown depicting pathway analysis of hDRG receptors identified by the ligand receptor interactome by GbGMI pain-associated genes. The top enriched pathways suggest promotion of axon growth, including axon guidance, extracellular matrix (ECM)–receptor interaction, regulation of actin cytoskeleton, and Rap1 signaling.

We repeated this receptor-ligand interaction analysis using a bulk hDRG spatial RNA-seq dataset with neuronal expression confirmed in independently generated human snRNA-seq dataset, offering increased depth of coverage and a larger number of human samples from organ donors of both sexes (2123) and a mouse DRG scRNA-seq dataset (24). From the 815 GbGMI fibroblast pain-associated genes, 158 unique ligand-receptor interactions were identified with receptors present on hDRG neurons (data file S3). The top 25 unique ligand-receptor interactions from synovial fibroblasts to hDRGs were ranked on the basis of the gene’s Laplacian score (Fig. 5C). This demonstrated possible activation of several receptors on hDRGs involved in axon growth, including integrins (ITGA5, ITGAM, ITGB2, and ITGB5) and neogenin (NEO1) that is targeted by netrins. Biological process and molecular function Gene Ontology (GO) terms were identified for the hDRG receptors in the interactome using Enrichr (25, 26). This demonstrated activation of hDRG receptors responsible for axon guidance, nervous system development, and dendrite function (data file S4). Pathway analysis (iPathwayGuide software; https://advaitabio.com/ipathwayguide) on hDRG receptor genes was used to determine biological pathways that could be activated in hDRG neurons by fibroblast pain-associated genes. This again revealed activation of several biological pathways within hDRGs, with axon guidance being the most enriched driven heavily by ephrin A and B signaling (Fig. 5D). This analysis also revealed enrichment of extracellular matrix receptor interactions, regulation of actin cytoskeleton, focal adhesion, and Ras-associated protein 1 (RAP1) signaling, all of which point to axon growth. To further explore possible cross-talk between DRGs and synovial fibroblasts, we also performed a ligand-receptor interactome between ligands from hDRG neurons and GbGMI synovial fibroblast pain-associated genes that encode for a receptor. This revealed 132 unique interactions between ligands from hDRGs and receptors from synovial fibroblasts (data file S5). Plotting the top 25 interactions ranked by Laplacian score (fig. S5D) revealed activation of protein tyrosine phosphatase receptors (PTPRS) and transforming growth factor–β (TGFBR1), both of which play roles in mediating cell growth and differentiation. GO term analysis of the fibroblast receptors revealed activation chemokine/cytokine signaling, immune cell migration, and fibroblast growth factor binding (data file S4).

Products of synovial fibroblasts influence adult DRG sprouting and branching in response to injury

We next sought to test whether any of the pain-associated synovial fibroblast genes found in this analysis might directly influence the growth of nociceptors in the synovium. CGRP+ nerve fibers have been previously identified in synovium (27). NTN4 was of interest because it was within the GbGMI-identified pain-associated gene module in this dataset, was highly expressed by synovial fibroblasts, and was identified in the hDRG interactome (data file S2). Although netrin-4 (NTN4) has only 30% sequence homology to NTN1, which plays a role in axon guidance during embryogenesis (28), NTN4 has been shown to augment embryonic olfactory bulb sprouting and thalamocortical branching (2931). We cultured adult mouse–dissociated DRG neurons with no supplements (medium alone); human nerve growth factor (huNGF), as a positive control, because it is known to produce robust effects on axon sprouting (32, 33); or human NTN4 (huNTN4) and measured survival, sprouting, and branching of CGRP+ DRG neurons. CGRP status of the DRG neurons was assigned using immunofluorescent stains (fig. S6). The percentage of neurons with sprouting and the numbers of branches were quantified using Sholl analysis (Fig. 6A). The medium control in this experiment was Neurobasal Plus medium containing B27, which contained the protein transferrin. RA synovial fibroblasts express transferrin, but it was not included in the GbGMI-identified pain gene list, and its expression does not correlate with patient report of pain (fig. S7A) and therefore served as an internal control. Whereas there was no effect of either huNGF or huNTN4 on CGRP+ neuron survival compared to CGRP+ neurons cultured in medium control, huNTN4 augmented sprouting and branching in vitro (Fig. 6B). This result was reproduced when mouse neurons were cultured with mouse Ntn4 (fig. S7B), which has over 90% homology to huNTN4. However, synovial fibroblasts secrete an array of many different factors predicted to interact with nociceptors, and, considering the complexity of neuron axon guidance and the multitude of factors known to attract or repel neuron growth, we reasoned that it would be useful to determine whether the sum of all factors secreted by synovial fibroblasts resulted in nociceptive neuron attraction, repulsion, or neither. We cultured CGRP+ neurons in RA synovial fibroblast–conditioned medium. Compared with medium alone, RA synovial fibroblast–conditioned medium augmented CGRP+ neuron sprouting and branching (Fig. 6C).

Fig. 6. Synovial fibroblast products affect DRG neurons in vitro.

Fig. 6.

(A) Shown are representative images of the method used for neurite quantification of sprouting and Sholl analysis of branching of dissociated DRG neurons. (B) Quantification of CGRP+ DRG neurons cultured with medium alone, huNGF, or huNTN4 is shown. Left: Survival was measured by the number of Map2b+B3tub+ cells of >10 μm. Each dot represents the sum of five ×10 magnification views from one experiment. Data from four experiments are presented. ns indicates not significant in Kruskal-Wallis test. Middle: The sum of sprouting neurons divided by the total number of neurons cultured with medium alone, huNGF, or huNTN4 is shown. Neurons with at least three axon branches greater than two times the size of the soma were classified as sprouting. Each dot represents the sum of five ×10 magnification views from one experiment. Data from four experiments are presented. *P < 0.05, ns indicates not significant in Kruskal-Wallis test. Right: Branching is shown as measured by the number of shell intersections of neurites, in DRG neurons cultured with medium alone (no treatment), huNGF, or huNTN4. Each dot represents the median with confidence interval of 40 neurons imaged from four experiments (10 neurons per experiment). ****P < 0.0001 in two-way ANO VA group*radius interaction with post hoc Dunnett’s multiple comparisons of each treatment group with the no treatment group. (C) Quantification of CGRP+ DRG neurons cultured with medium alone, medium supplemented with huNGF, or RA synovial fibroblast–conditioned medium is shown. Left: Survival was measured by the number of Map2b+B3tub+ CGRP+ cells of >10 μm. Each dot represents the sum of five ×10 magnification views from one experiment. data from three experiments are presented. ns indicates not significant in Kruskal-Wallis test. Middle: The sum of sprouting neurons divided by the total number of neurons cultured with medium alone, medium supplemented with hunGF, or RA synovial fibroblast–conditioned medium is shown. neurons with at least three axon branches greater than two times the size of the soma were classified as sprouting. Each dot represents the sum of five ×10 magnification views from one experiment. data from three experiments are presented. **P < 0.01 and ***P < 0.001 in Kruskal-Wallis tests. right: Branching is shown as measured by the number of shell intersections of neurites in DRG neurons cultured with medium alone, medium supplemented with hunGF, or RA synovial fibroblast–conditioned medium. Each dot represents the median with confidence interval of 45 neurons imaged from three experiments (15 neurons per experiment). *P < 0.05 and **P < 0.01 in two-way ANOVA group*radius interaction with post hoc Dunnett’s multiple comparisons of each treatment group with the no treatment group.

CGRP+ axons accompany blood vessels in RA synovial papillary hypertrophic processes

We next sought to delineate whether the structure of nociceptive neurons in the synovium of humans with RA are altered in vivo. We performed immunolabeling-enabled three-dimensional imaging of solvent-cleared organs (iDISCO) (34). Synovial papillary hypertrophy is a common feature of RA synovium, and blood vessels can be seen growing into these processes by gross inspection of unstained tissue (Fig. 7A) and hematoxylin and eosin (H&E)–stained synovial sections (Fig. 7B). SC-F4 fibroblasts, the cell type with the most robust expression of genes associated with patient report of pain end enriched for neural guidance pathway genes, line these papillary processes (35). Anti-CD31 antibody staining of cleared RA synovium revealed a profuse tangle of blood vessels growing into each papillary outgrowth (Fig. 7C and movies S1 and S2). High-magnification three-dimensional views inside the papillary processes revealed that CGRP+ axons accompanied many of the blood vessels that had grown into the papillary processes (Fig. 7D). CGRP staining was just outside the CD31 stain, consistent with a meshwork of CGRP+ axons wrapping around blood vessels (Fig. 7E), a neurovascular pattern previously observed with scanning electron microscopy (36). The presence of CGRP+ axons in papillary processes of a synovial pannus indicated abnormal axonal sprouting in vivo because papillary processes represent abnormal synovial growth. We conclude that neoangiogenesis into abnormal papillary processes toward lining fibroblasts in the low inflammatory RA synovium was accompanied by neoneurogenesis of CGRP+ nociceptive axons.

Fig. 7. RA synovial papillary hypertrophy processes contain CGRP+ nociceptive neurites encasing CD31+ vessels.

Fig. 7.

(A) Gross macroscopic image of RA synovial papillary hypertrophy with visible blood vessels is shown. (B) H&E-Stained section of RA synovial papillary hypertrophy is shown with minimal lymphocytic infiltration. Scale bar, 50 μm. (C) Whole-mount imaging (iDISCO) is shown from low inflammatory RA synovium stained with anti-CD31 antibodies (magenta) and anti-CGRP antibodies (cyan) (see also movie S1). Scale bar, 300 μm. (D) Whole-mount imaging is shown from inside a papillary process stained with anti-CD31 antibodies (magenta) and anti-CGRP antibodies (cyan) (see also movie S2). Scale bars, 200 μm. (E) Three consecutive optical cross sections of synovial tissue stained with anti-CD31 antibodies (magenta) and anti-CGRP antibodies (cyan) are shown. Scale bars, 50 μm.

DISCUSSION

Recent studies have shown that RA synovial tissue spans a spectrum of synovial pathotypes, from highly inflamed synovium to fibroblast-enriched synovium (13). Here, we show that patients with RA can have relatively limited synovial inflammation but highly abnormal synovium with extensive papillary hypertrophy, angiogenesis, and high severity of pain. We demonstrated that synovial lining fibroblast products can augment the growth of CGRP+ nociceptive axons/neurites and CGRP+ axons extend into synovial papillary processes, placing nociceptors spatially adjacent to lining fibroblasts. These observations support a model whereby altered synovial lining fibroblasts secrete factors, including NTN4, that facilitate growth of CGRP+ axons/neurites (fig. S8). These data are consistent with recent studies in osteoarthritis (OA), which identified aberrant CGRP+ axonal sprouting into normally aneural cartilage (3740).

Whereas RA tends to affect all three knee compartments similarly, OA tends to affect the medial side more severely. Compared with synovial fibroblasts from the nonpainful side, synovial fibroblast–conditioned medium from the painful side of knee OA increased neuron survival and longest branch length in vitro (41). Our studies also led to the discovery that NTN4 augments branching of injured pain-sensitive CGRP+ nerves in vitro. NTN4 binds the extracellular matrix molecule, laminin, specifically laminin γ1, and, in doing so, markedly weakens matrix stiffness (30). Laminins are large glycoproteins that are abundant in synovium, and it is possible that a compliant extracellular matrix facilitates CGRP+ axon sprouting or branching. NTN4 also binds cell surface receptors, such as neogenin (42), which are expressed by human nociceptors (21, 22). Avenues for future research include in vivo animal studies to test the function of NTN4, as well as other pain-associated genes, and exploration of their potential as therapeutic targets.

The GbGMI approach is a combination of feature selection and data dimensionality reduction that enabled us to uncover a group of genes that together change their expression in association with patient report of pain. Reducing the dimensionality of the data allowed us to detect associations with variable clinical data that were likely underpowered when all genes were included in the analysis as individual tests. Other data dimensionality reduction approaches, such as clustering and PCA, have been used in a similar way to detect clinical associations that would not be measurable in a gene-by-gene analysis (9). One key difference between the GbGMI approach and hierarchical clustering or PCA is that the latter groups genes in an unsupervised manner, on the basis of which genes tend to be expressed together in the dataset. In contrast, our GbGMI approach groups genes on the basis of their association with the variable of interest and was developed specifically to mitigate the challenge of relating patient-reported outcome data, which is notoriously variable, to tissue transcriptomic data, which is often inadequately powered for statistical analysis. Our approach might be useful in other scenarios where investigators are searching for clinical associations in a complex dataset with a limited sample size.

As shown in our statistical analysis, each individual gene expression did not correlate with pain. On the other hand, the coordinated expression within the GbGMI-identified pain-associated gene module suggests functional relatedness. Future studies are needed to further unravel how genes identified here relate to neuron growth and patient experience of pain either independently or in conjunction with other genes. However, our ligand-receptor interaction analysis strongly suggests that changes in gene expression within the joint drive interactions with nociceptors to promote pain. We identified numerous ligands that bind to receptors on hDRG neurons to mediate neurite/axon growth including netrins, ephrins, integrins, and semaphorins. These receptors signal through pathways known to produce pain, including phosphatidylinositol 3-kinase (PI3K)/AKT signaling (43). We also identified possible cross-talk between ligands expressed by hDRGs that bind to receptors encoded in the 815 GbGMI-identified pain-associated genes expressed by fibroblasts, suggesting that neurons may play a key role in influencing fibroblast differentiation through neuronal-released factors including transforming growth factor–β, fibroblast growth factors, and chemokines.

Our study has several limitations. Pain is a multidimensional experience and was measured here only by patient report, using two different questionnaires (HOOS/KOOS and VAS), and the tissue studied was limited to the synovium. Given the small sample size, other factors known to contribute to experience of RA pain, such as sex, age, disease activity, as well as central sensitization and lower overall pain tolerance, were not explored here. In addition, the experiments on neuron growth in vitro tested the effects of human protein on mouse neurons. Future studies are needed to explore the effects of fibroblast products on human neurons. There are also many other genes associated with patient report of pain in this dataset that warrant additional study. These factors include heparin binding epidermal growth factor (HBEGF), betacellulin (BTC), and connective tissue growth factor (CTGF), all of which have been shown to augment neuronal sprouting in response to injury and/or have been associated with pain in other clinical conditions or animal models (4446). In addition, CGRP+ afferents perform other functions in the periphery besides sensing and relaying pain centrally, such as altering blood flow (47). Similarly, neural guidance molecules also have important functions outside the nervous system, such as affecting angiogenesis, lung branching morphogenesis, immunomodulation, and tumorigenesis, and these nonneuronal effects of this family of molecules warrant further study in the context of arthritis. Last, fluorescence of the synovial lining layer cannot be discerned from technical edge artifacts in the tissue-cleared images.

Together, this work identifies a group of genes associated with patient report of pain in low inflammatory synovium in both early untreated and established RA. These findings have the potential to be leveraged to develop next-generation therapeutic approaches to alleviating pain in RA, particularly in the low inflammatory pathotype that may be less responsive to current therapeutics that target adaptive immune inflammation.

MATERIALS AND METHODS

Study design

This study involved 139 patients with RA undergoing arthroplasty at the Hospital for Special Surgery (HSS) in New York, meeting either the American College of Rheumatology (ACR)/European League Against Rheumatism 2010 Classification criteria (48) and/or the ACR 1987 criteria for RA (49). Patient data, including KOOS and HOOS questionnaires, were collected with approval from relevant institutional review boards (detailed in the “Study approval” section) with signed informed consent from all participants. RNA was extracted from 39 bulk synovial tissue samples previously sequenced (9) (ImmPort accession #SDY1299), which included three gene expression clusters correlating with varying degrees of synovial inflammation: low (n=14), intermediate (n=11), and high (n=14) (9). Among these 39 patients, 38 had mean nucleus density data (16); of these, 26 had low inflammatory synovium, among which 22 had HOOS/KOOS pain scores. External validation of pain-associated genes used bulk RNA-seq data from 87 patients with early RA obtained from the PEAC. The study aimed to uncover the source of joint pain in RA because it does not always correlate with synovial inflammation. We developed a graph-based machine-learning approach to identify a set of pain-associated genes in patients with RA and limited synovial inflammation. Further analyses were conducted to reveal which cell subtype in synovium primarily expressed the identified pain-associated genes. Studies through in vitro and in vivo approaches were conducted to test whether and how the identified pain-associated genes can suggest a potential mechanism for enhancing nociceptor growth into synovial pannus, thereby contributing to RA-associated pain. No sample size calculation, randomization, or blinding was performed.

Study approval

This study includes data from 139 patients with RA undergoing arthroplasty at the HSS in New York. All patients met either the ACR/European League Against Rheumatism 2010 Classification criteria (48) and/or the ACR 1987 criteria for RA (49). Patient demographic data and RA disease activity scores and treatments are listed in data file S9. Patient data including KOOS (50) and HOOS (51) questionnaires were collected. RA pain scores indicate response to the question, “how much pain have you felt due to your rheumatoid arthritis during the last week?” with responses ranging from 1 to 10. Condition pain scores indicate response to the question, “how much pain have you had because of your condition over the past week? Please indicate how severe your pain has been,” with responses ranging from 1 to 10. This study was approved by the HSS Institutional Review Board (approval no. 2014–233), the Rockefeller University Institutional Review Board (approval no. DOR0822), and the Biomedical Research Alliance of New York (approval no. 15–08-114–385). All participating patients provided their signed informed consent.

H&E histologic scoring

Synovial samples were obtained from the most grossly inflamed (dull and opaque) area of synovium. If there were no obviously inflamed areas, then samples were obtained from standard locations: the femoral aspects of the medial and lateral gutters and the central supratrochlear region of the suprapatellar pouch. Each tissue biopsy was sectioned at 5-μm thickness and stained with Harris modified hematoxylin solution and eosin Y (H&E) manufactured by Epredia. An expert musculoskeletal pathologist scored 14 synovial histologic features in a single section for each patient: lymphocytic inflammation, mucoid change, fibrosis, fibrin, germinal centers, lining hyperplasia, neutrophils, detritus, plasma cells, binucleated plasma cells, Russell bodies, sublining giant cells, synovial lining giant cells, and mast cells. Detailed methods for scoring these features are described in prior studies (9) and available at www.hss.edu/pathology-synovitis, and the classification algorithm is available at ImmPort https://immport.org/shared/study/SDY1299. Spearman correlation was used to compare the relationship of patient reports of pain, which was not normally distributed, to cell density.

Gene expression established arthritis cohort

RNA was extracted from 39 bulk synovial tissue samples collected from an established RA cohort and previously sequenced as described in (9) (ImmPort accession #SDY1299). Briefly, these libraries were prepared using TruSeq mRNA Stranded Library kits, 50-bp paired-end reads were sequenced on a HiSeq 2500 platform, and reads were aligned to hg19 using STAR (52). Samples with >0.1% globin mRNA were excluded from further analysis. After quality control, ComBaT in the Bioconductor SVA package (53) was used for batch effect correction, and DESeq2 (54) was used to normalize the data. Consensus clustering identified three gene expression clusters characterized by different amounts of synovial inflammation: low (n=14), intermediate (n=11), and high (n=14) (9). Of these 39 patients with RA, 38 had mean nucleus density data for the benchmarking analysis (16), and 26 had low inflammatory synovium, for which all 26 had scores for lymphocytic inflammation and lining hyperplasia and 22 had scores of fibrosis and HOOS/KOOS pain scores. We used limma (15) to test for gene expression correlates of fibrosis, lymphocytic inflammation, lining hyperplasia, and pain.

Gene expression early arthritis cohort

For external validation of the pain-associated genes identified by our method, we downloaded the bulk RNA-seq of patients with early RA from the PEAC on ArrayExpress (www.ebi.ac.uk/arrayexpress/experiments/E-MTAB-6141/samples/). Using only samples with sufficient data quality, reads were pseudoaligned to hg38 with Kallisto to generate a counts matrix. Counts were normalized for read depth [counts into log2(CPM + 1) (counts per million)] and batch-corrected using removeBatchEffect from limma (15).

Gene expression in bulk sorted cells and single cells

In the analyses related to RA synovial cell types, we used the sorted-population bulk RNA-seq gene expression data of the synovial T cells (CD45+, CD3+, and CD14), monocytes (CD45+, CD3, and CD14+), B cells (CD45+, CD3, CD14, and CD19+), and fibroblasts (CD45, CD31, and PDPN+) collected by fluorescence-activated cell sorting (BD FACSAria Fusion) directly in buffer RLT (QIAGEN)(19). Normalized read counts per gene as transcripts per million (TPMs) were used (ImmPort accession #SDY998) (10).

We used the scRNA-seq data preprocessed in (19), where the gene expression amounts were quantified by counting UMIs (unique molecular identifiers) and transformed into log2(CPM + 1). From the scRNA-seq profiles for 32,391 genes and 5265 cells left after rigorous quality control, 18 unique cell populations were identified by an integrated strategy based on canonical correlation analysis. Specifically, in fibroblasts: CD34+ sublining fibroblasts (SC-F1), human leukocyte antigen (HLA)–DRAhi sublining fibroblasts (SC-F2), Dickkopf-related protein (DKK3)+ sublining fibroblasts (SC-F3), and CD55+ lining fibroblasts (SC-F4); in monocytes: interleukin-1B (IL-1B)+ proinflammatory monocytes (SC-M1), nuclear protein 1 (NUPR1)+ monocytes (SC-M2), C1QA+ monocytes (SC-M3), and interferon-activated monocytes (SC-M4); in T cells: three CD4+ clusters: CCR7+ T cells (SC-T1), FOXP3+ regulatory T cells (SC-T2), and programmed cell death protein 1 (PDCD1)+ T peripheral helper (TPH) and T follicular helper (TFH) (SC-T3) cells, and three CD8+ clusters, GZMK+ T cells (SC-T4), GNLY+GZMB+ cytotoxic lymphocytes (SC-T5), and GZMK+GZMB+ T cells (SC-T6); in B cells: naïve IGHD+CD27 (SC-B1), IGHG3+ CD27+ memory B cells (SC-B2), autoimmune-associated B cell cluster (SC-B3) with high expression of integrin alpha-X (ITGAX) (also known as CD11c), and a plasmablast cluster (SC-B4) with high expression of immunoglobulin genes and X-box-binding protein 1 (XBP1) (19). The top 200 marker genes for each subpopulation identified by differential expression analysis in the original publication were used as signature genes for individual cell types within RA synovium.

Description of our GbGMI framework

We developed GbGMI, a graph-based machine-l earning framework of algorithms, to identify a gene expression module that strongly correlates with pain in low inflammatory RA synovial tissues (fig. S1). We use MRm×n to represent the input gene expression matrix of genes and patients. We use a numeric vector aRn to represent the pain scores reported by these patients. We quantified the quality Q of a selected gene subset through the correlation between their collective expression and the pain score and then searched for a gene subset with optimal quality Q as a feature selection task (5558). Because an exhaustive search through all possible subsets of an input gene set to optimize Q is computationally intractable (59), we adapted and integrated the feature scoring strategy used in the filter feature-selection/ranking approaches (6062) and the feature subset scoring strategy used in the wrapper feature-selection approaches (58, 63). Specifically, we generated a gene prioritization list Ls according to how well each individual gene expression respects the geometric structure over patients built according to their pain scores. Then, the quality Qk of the kth candidate gene subset comprising the top k genes in Ls was evaluated for k ranging from 1 to m. The first k* where Qk peaked was used as the cutoff point on Ls. This subset of k* genes is the output pain-associated gene module.

We next adapted Laplacian score algorithm to prioritize genes. Given the input gene expression matrix and pain-score vector, we ranked the genes by the way each gene expression vector (that is, a row vector in the gene expression matrix M) respected a given geometric structure over the patients encoded in an n×n similarity matrix S based on the pain score vector a, by adapting the Laplacian score algorithm (17), instead of assuming independent observations in the correlation tests (fig. S1, step A). To compute S, a Gaussian kernel, which empirically outperformed other types of kernels (64, 65), was adopted to map the Euclidean distance between the pain scores of each pair of patients and into a similarity measure S(i,j)[0,1]

S(i,j)=e|a(i)a(j)|2h (1)

where h corresponds to the bandwidth or smoothing factor in a kernel metric definition. This formulation forces the similarity measure between any two patients with significantly different pain scores to be close to 0 while pushing the similarity measure between the patients of pain scores within a certain range (depending on the smoothing factor) to be closer to 1. This promotes the locality we want to focus on. We set h to control the local neighborhood of patients on graph S according to the theoretical range of the pain scores (for example, h=100 for HSS HOOS/KOOS pain score or PEAC VAS characteristic because either ranges between 0 and 100).

The Laplacian score of each gene was then computed to evaluate how well this gene’s expression on these patients preserves S (fig. S1, step B). This is different from the original publication (17, 66) that aimed to preserve the input feature space (for example, the input low inflammatory genes), wherein we aimed to select features (for example, the k* pain-associated genes). The Laplacian matrix of S is defined by L=DS, where D is a diagonal matrix with Dii indicating the degree of node (that is, patient) i in the weighted graph S [that is, D=diagS1. For the rth gene, let Mr,* be its n-dimensional gene expression vector across the patients (that is, the rth row vector in the gene expression matrix M), its Laplacian score ls(r) is computed as

ls(r)=m^rTLmr^m^rTDmr^,mr^=Mr,*TMr,*D11TD11 (2)

where the symbol 1 denotes a column vector whose all elements are 1’s with dimensionality determined by context. The gene prioritization list Ls is generated by sorting the input m genes according to their Laplacian scores in ascending order. The smaller the Laplacian score of a gene, the better its expression data respect the geometric structure defined by S over the patients [according to objective function analysis (17)]. Each top k subset on Ls forms a candidate gene subset for selection. The rows of M were reordered according to Ls (fig. S1, step C).

Quantification of candidate k-gene subset quality Qk

The Qk of the kth candidate gene subset was measured on the basis of the association between their collective expression pattern and the pain-score vector a. Given the gene expression submatrix M1:k,* of this candidate k-gene subset, Qk was computed through two steps: First, project the k-gene expression vector of each patient into a univariate summary score that preserves the patient-to-patient similarity structure in the original k-dimensional feature (gene) space. This addresses the dimensionality mismatch between the multigene expression and the univariate patient-level pain score (9, 17). The resulting summary score vector of the n patients is denoted sk (fig. S1, step D). Second, quantify Qk using the statistical significance of correlation test [for example, the − log (P value) of Kendall’s correlation test] between sk and a over the same patients. We chose the first k*, where Qk* peaked, as the cutoff point on the sorted gene prioritization list Ls. This subset of k* genes is the pain-associated gene module identified by our GbGMI framework (fig. S1, step E).

For fig. S1 (step D), we used the t-SNE (66). The resulting summary score vector sk for the n patients respects how the gene expression data were arranged in the k-gene feature space of this candidate gene subset. We hereon briefly sketch the instantiation of t-SNE with the variables involved in our study. Following the SNE framework (67), the directional similarity of patient j to patient i based on their multigene expression vectors M1:k,i and M1:k,j is

p(ji)=expM1:k,iM1:k,j2/2σi2ΣliexpM1:k,iM1:k,l2/2σi2 (3)

where the variance of the Gaussian kernel σi2 is chosen such that the perplexity of the conditional probability distribution over all points ji defined by

PerpPi=2HPi,HPi=Σjp(ji)log2p(ji) (4)

matches a prespecified value. The perplexity in this context can be interpreted as an estimation about the number of close neighbors of each patient on graph S. Therefore, we specified the perplexity on the basis of the rounded mean degree of the similarity graph S built from a. The symmetric SNE was used for mathematical and computational convenience in the t-SNE formulation by defining the following undirected similarities

p(i,j)=p(ji)+p(ij)2n (5)

where n is the number of patients. Because Σi,jp(i,j)=1, this is a valid probability distribution on the set of all pairs (i,j). The t-SNE step in this algorithm uses the t-distribution with one degree of freedom (also known as Cauchy distribution) as the one-dimensional similarity kernel applied to pairs of summary scores defined by

q(i,j)=(1+s(i)s(j)2)1Σkl(1+s(k)s(l)2)1 (6)

The main idea of this t-SNE-based step is to arrange the patients in a one-dimensional space such that the similarities q(i,j) between s(i) and s(j) match p(i,j) as close as possible in terms of the Kullback-Leibler divergence. Thus, the loss function is

L=Σi,jp(i,j)logp(i,j)q(i,j) (7)

a summary score vector sk is computed for each candidate k-gene subset, rendering a set of m such vectors skk=1,2,,m.

Our framework GbGMI can be applied to identify a subset of genes that collectively associate with some other patient-level numeric attribute beyond the main focus of this paper (the HOOS/KOOS pain score). Our framework is also flexible and adaptive in different contexts by replacing specific computational components with other design choices (for example, other embeddings instead of t-SNE for computing summary scores of selected gene expression).

DRG dissection and digestion

Before extraction of the DRG neurons, chambered coverslips (ibidi, 80286) were coated overnight at 37°C with poly-l-lysine (Sigma-Aldrich, P4832) and then for 2 hours at 37°C with mouse sarcoma basement membrane laminin (Sigma-Aldrich, L2020) diluted 1:50 with 1× phosphate-buffered saline (PBS) and with one also coated with huNTN4 (0.2 μg/ml; R&D Systems, 1254-N4). For each assay, sensory DRG neurons were harvested from two female 6- to 8-week-old C57BL/6 mice (the Jackson Laboratory), under a dissection microscope using forceps, and placed into a waiting 15-ml conical tube on ice containing 1× L-15 medium (Thermo Fisher Scientific, 21083027). DRGs were spun down at 950 rpm for 2 min. Medium was aspirated and replaced with 1 ml of L-15 containing dispase II (10 mg/ml; Sigma-Aldrich, 04942078001) and collagenase IV (10 mg/ml; Thermo Fisher Scientific, 17104019). DRGs were then placed at 37°C for 20 min. Enzyme solution was then carefully aspirated and replaced with 2 ml of L-15. Pellet was resuspended thoroughly with a 1000-μl pipette. Twenty-five microliters of deoxyribonuclease I (10 mg/ml) was then added, and once again, the cells were placed at 37°C for 20 min. The cells were then spun down for 5 min at 950 rpm and resuspended in 5 ml of L-15. After another 5 min of centrifugation, the cells were resuspended in 1 ml of L-15 and layered on top of 15% ice-cold bovine serum albumin (Sigma-Aldrich, A7906) and spun down at room temperature for 8 min at 1179 rpm to remove myelin. The cell pellet was resuspended in 1× Neurobasal Plus medium (Thermo Fisher Scientific, A3582901) containing B27 (Thermo Fisher Scientific, 17504001) diluted 1:50, GlutaMAX (Thermo Fisher Scientific, 35050061) diluted 1:100, and gentamicin sulfate (Abbott Laboratories) and then plated onto the precoated slides. Human β-NGF (0.1 μg/ml; R&D Systems, 256-GF) was added to the medium of the positive control slide chamber.

Harvesting fibroblast supernatant

A total of 500,000 synovial fibroblasts from low inflammatory RA, between five and eight passages, were plated in each well of six-well cell culture plates and covered with 2 ml of Neurobasal medium (Thermo Fisher Scientific, 21103049) with added 2% B-27 supplement (Thermo Fisher Scientific, 17504044), 1% GlutaMAX supplement (Thermo Fisher Scientific, 35050061), and 0.04% gentamicin sulfate. Medium was changed, and cells were washed in PBS after 24 hours at 37°C to allow for recovery from freeze-thaw. Fresh medium was added, and cells were left at 37°C overnight. After 24 hours, medium was removed and centrifuged at 1300 rpm for 5 min to remove cells and cell debris. Supernatant was then collected and stored at −80°C.

DRG neuron culture

A total of 50,000 neurons harvested from mice were plated on poly-l-lysine and 2% laminin (Sigma-Aldrich, L2020) coated two-well chamber slides with either medium alone, fibroblast supernatant + medium (1:1), huNGF, huNTN4, or mouse Ntn4. To remove precipitates formed during freeze-thaw, medium and supernatant samples were first filtered using 50-ml MilliporeSigma Steriflip sterile disposable vacuum filters with 0.22-μm pore membranes. Neurons were cultured for 24 hours at 37°C before fixation in 4% paraformaldehyde.

Neurite imaging, quantification, and comparison

Each chambered slide was examined under a Keyence fluorescence microscope, and the accompanying software was used to capture a stitched image of a large 5 × 5 area randomly selected on the plate at ×10 magnification. Five of these images were captured per plate. To quantify sprouting versus nonsprouting DRG neurons, each image was examined in ImageJ and used the Cell Counter plugin to keep track of the total number of sprouting neurons, identified as having at least three neurites extended from the cell body where the extensions were more than twice the diameter of the cell body and exhibited some degree of branching. From across these images, n=10 neurons that exhibited branching were selected, and their branching was quantified using the Sholl Analysis plugin. Each analysis used a start radius of 30 pixels, an end radius of 225 pixels, and a step size of 7. Mixed-model repeated-measures analysis was used to analyze Sholl data. The model included group, radius (categorical), and group*radius interaction as fixed effects. A significant group*radius interaction indicated group differences and branching.

iDISCO/imaging

Synovial tissue obtained was fixed overnight in 4% paraformaldehyde and washed three times with PBS for 5 min. Samples were stored long-term in PBS with 0.03% sodium azide and were stained following iDISCO protocol. Briefly, samples were treated as indicated using alternative nonmethanol pretreatment protocol before immunolabeling. All steps assumed an n=1. Antibodies used for staining were used at 1:150 and include anti-CD31 antibody polyclonal guinea pig (Synaptic Systems, 351004), anti-CGRP antibody polyclonal goat (ab36001, Abcam), Alexa Fluor 488 donkey anti-mouse immunoglobulin G (IgG) (H + L) ReadyProbes (Invitrogen), and Alexa Fluor 568 donkey F(ab′)2 anti-rabbit IgG (H + L; Abcam). After immunolabeling, samples were embedded in agar before clearing with dibenzyl ether (DBE). Samples were imaged by light sheet microscopy, submerged in DBE on an Ultramicroscope II (LaVision/Miltenyi BioTec). Images were captured using a 4× objective with 2.5-μm Z-slices. Acquired images were visualized using Imaris ×64 software (version 9.1 Bitplane), and three-dimensional reconstructions were recorded as mp4 video files. Optical slices were generated using the orthoslicer or oblique slicer tool.

Statistical analysis

Statistical analysis was performed with functions or libraries from the R software package unless stated otherwise. A P value, raw or adjusted depending on the context, below 0.05 was considered significant. For comparison between pain scores in two patient groups, data were analyzed by unpaired, two-sided Mann-Whitney. A moderated t test from limma (15) was used for testing individual gene expression correlates of fibrosis, lymphocytic inflammation, lining hyperplasia, and pain. To identify differentially expressed genes, a one-way analysis of variance (ANOVA) test was performed to compare individual gene expression among different pain level–based groupings. The FDR was used in multiple hypothesis testing to correct for multiple comparisons where applicable. For assessing the correlation between two patient-level scores (of the three: gene expression summary score, pain score, or cell density), Spearman’s or Kendall’s rank correlation tests were used. For comparison between group level pain–associated gene expression and synovial tissue gene expression, Kolmogorov-Smirnov test was used. To compare the quantification among different groups of DRG neurons, data were analyzed by unpaired, two-sided Kruskal-Wallis test or two-way ANOVA test, followed by Dunn’s posttest (*P < 0.05, **P < 0.01, ***P < 0.001, and ****P < 0.0001). A hypergeometric test was performed to estimate statistical significance in pathway or gene set enrichment analysis, adjusted for multiple testing using g:SCS correction (68).

Supplementary Material

Supplementary Movie 1
Download video file (15.3MB, mp4)
Supplementary Movie 2
Download video file (6.8MB, mp4)
supplementary MdAr reproducibility checklist
Data File S1
Data File S2
Data File S3
Data File S4
Data File S5
Data File S6
Data File S7
Data File S8
Data File S9
Supplemental Materials and Methods

Acknowledgments:

We thank the participants who provided synovial tissue and blood samples for this study. We thank A. North and V. Sharma from the Rockefeller University’s Bio-Imaging Resource Center, RRID:SCR_017791, for help with imaging and image analysis.

Funding:

This project was funded by the National Science Foundation 1750326 (to Z.B. and F.W.); the Arthritis Foundation (to S.G.); the NIH R01AR078268 (to D.E.O.), UC2AR081025 (to D.E.O. and S.G.), R01AR077019 (to R.E.M.), R01AR064251 (to A.-M.M.), R01AR060364 (to A.-M.M.), P30AR079206 (to A.-M.M.), and U19-NS130608 (to T.J.P.); Rockefeller University UL1TR001866 (to D.E.O.); Arthritis National Research Foundation Award (to F.Z.), UK Medical Research Council G0800648 (to C. Pitzalis), Versus Arthritis 20022 (to C. Pitzalis); FOREUM 232719 (to S.S.); EFIC Grünenthal Grant (to S.S.); Versus Arthritis 21734 (to S.S.); and the Accelerating Medicines Partnership Program: Rheumatoid Arthritis and Systemic Lupus Erythematosus (AMP RA/SLE) Network. The AMP Program is a public-private partnership that includes AbbVie Inc., the Arthritis Foundation, Bristol-Myers Squibb Company, the Foundation for the National Institutes of Health, GlaxoSmithKline, Janssen Research and Development LLC, the Lupus Foundation of America, the Lupus Research Alliance, Merck Sharp & Dohme Corp., the National Institute of Allergy and Infectious Diseases, the National Institute of Arthritis and Musculoskeletal and Skin Diseases, Pfizer Inc., the Rheumatology Research Foundation, Sanofi, and Takeda Pharmaceuticals International Inc. Funding for AMP RA/SLE work was provided through grants from the National Institutes of Health (UH2-AR067676, UH2-AR067677, UH2-AR067679, UH2-AR067681, UH2-AR067685, UH2-AR067688, UH2-AR067689, UH2-AR067690, UH2-AR067691, UH2-AR067694, and UM2-AR067678).

Footnotes

Competing interests: The authors declare that they have no competing interests.

AMP RA/SLE Network: In addition to AMP RA/SLE Network members who are authors, the following AMP RA/SLE Network members are collaborators who have contributed to investigation and data curation of this manuscript: Jennifer Albrecht12, Jennifer H. Anolik12, William Apruzzese4, Brendan F. Boyce12, David L. Boyle13, Michael B. Brenner4, S. Louis Bridges Jr.3, Christopher D. Buckley14, Jane H. Buckner15, Vivian P. Bykerk1,3, James Dolan4, Laura T. Donlin1,3, Andrew Filer14, Gary S. Firestein13, Chamith Y. Fonseka4, Peter K. Gregersen16, Joel M. Guthridge17, Maria Gutierrez-Arcelus4, V. Michael Holers9, Laura B. Hughes18, Lionel B. Ivashkiv1,3, Eddie A. James15, Judith A. James17, A.Helena Jonsson9,4, Stephen Kelly19, James A. Lederer4, Yvonne C. Lee20, Arthur M. Mandelin II20, Mandy J. McGeachy21, Joseph R. Mears4, Nida Meednu12, Larry Moreland9,21, Harris Perlman20, Javier Rangel-Moreno12, Deepak A. Rao4, Soumya Raychaudhuri4, Christopher Ritchlin12, William H. Robinson22, Mina Rohani-Pichavant22, Jennifer Seifert9, Kamil Slowikowski4, Darren Tabechian12, Paul J. Utz22, Gerald F. M. Watts4, Kevin Wei4

12University of Rochester Medical Center, Rochester, NY 14642, USA. 13University of California, San Diego, La Jolla, CA 92093, USA. 14University Hospitals Birmingham NHS Foundation Trust and University of Birmingham, Birmingham B15 2GW, UK. 15Benaroya Research Institute at Virginia Mason, Seattle, WA 98101–2795, USA. 16Feinstein Institute for Medical Research, Northwell Health, Manhasset, New York, NY 11030, USA. 17Oklahoma Medical Research Foundation, Oklahoma City, OK 73104, USA. 18University of Alabama at Birmingham, Birmingham, AL 35294, USA. 19Barts Health NHS Trust, London E1 2ES, UK. 20Northwestern University Feinberg School of Medicine, Chicago, IL 60611, USA. 21University of Pittsburgh School of Medicine, Pittsburgh, PA 15261, USA. 22Stanford University School of Medicine, Palo Alto, CA, 94305.

Data and materials availability:

All data associated with this study are present in the paper or the Supplementary Materials. The bulk synovial RNA-seq dataset is available at ImmPort (SDY1299). The PEAC dataset is available at https://peac.hpc.qmul.ac.uk. The AMP dataset is available at https://immunogenomics.io/ampsle1/ or ImmPort (SDY997). The raw sequence data are deposited in the sequence read archive under accession SRP135960 in the National Center for Biotechnology Information (NCBI) library. Our code is available at DOI: 10.5281/zenodo.10783506 and target URL https://zenodo.org/doi/10.5281/zenodo.10783505.

REFERENCES AND NOTES

  • 1.Marchand F, Perretti M, McMahon SB, Role of the immune system in chronic pain. Nat. Rev. Neurosci. 6, 521–532 (2005). [DOI] [PubMed] [Google Scholar]
  • 2.Alivernini S, Firestein GS, McInnes IB, The pathogenesis of rheumatoid arthritis. Immunity 55, 2255–2270 (2022). [DOI] [PubMed] [Google Scholar]
  • 3.Buch MH, Eyre S, McGonagle D, Persistent inflammatory and non-inflammatory mechanisms in refractory rheumatoid arthritis. Nat. Rev. Rheumatol. 17, 17–33 (2021). [DOI] [PubMed] [Google Scholar]
  • 4.Nagy G, Roodenrijs NM, Welsing PM, Kedves M, Hamar A, van der Goes MC, Kent A, Bakkers M, Blaas E, Senolt L, Szekanecz Z, Choy E, Dougados M, Jacobs JW, Geenen R, Bijlsma HW, Zink A, Aletaha D, Schoneveld L, van Riel P, Gutermann L, Prior Y, Nikiphorou E, Ferraccioli G, Schett G, Hyrich KL, Mueller-Ladner U, Buch MH, McInnes IB, van der Heijde D, van Laar JM, EULAR definition of difficult-to-treat rheumatoid arthritis. Ann. Rheum. Dis. 80, 31–35 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Lee YC, Frits ML, Iannaccone CK, Weinblatt ME, Shadick NA, Williams DA, Cui J, Subgrouping of patients with rheumatoid arthritis based on pain, fatigue, inflammation, and psychosocial factors. Arthritis Rheum. 66, 2006–2014 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.McWilliams DF, Rahman S, James RJE, Ferguson E, Kiely PDW, Young A, Walsh DA, Disease activity flares and pain flares in an early rheumatoid arthritis inception cohort; characteristics, antecedents and sequelae. BMC Rheumatol. 3, 49 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Pollard LC, Choy EH, Gonzalez J, Khoshaba B, Scott DL, Fatigue in rheumatoid arthritis reflects pain, not disease activity. Rheumatology 45, 885–889 (2006). [DOI] [PubMed] [Google Scholar]
  • 8.Stebbings S, Herbison P, Doyle TCH, Treharne GJ, Highton J, A comparison of fatigue correlates in rheumatoid arthritis and osteoarthritis: Disparity in associations with disability, anxiety and sleep disturbance. Rheumatology 49, 361–367 (2010). [DOI] [PubMed] [Google Scholar]
  • 9.Orange DE, Agius P, DiCarlo EF, Robine N, Geiger H, Szymonifka J, McNamara M, Cummings R, Andersen KM, Mirza S, Figgie M, Ivashkiv LB, Pernis AB, Jiang CS, Frank MO, Darnell RB, Lingampali N, Robinson WH, Gravallese E, Accelerating Medicines Partnership in Rheumatoid Arthritis and Lupus Network, Bykerk VP, Goodman SM, Donlin LT, Identification of three rheumatoid arthritis disease subtypes by machine learning integration of synovial histologic features and RNA sequencing data. Arthritis Rheum. 70, 690–701 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Dennis G, Holweg CT, Kummerfeld SK, Choy DF, Setiadi AF, Hackney JA, Haverty PM, Gilbert H, Lin WY, Diehl L, Fischer S, Song A, Musselman D, Klearman M, Gabay C, Kavanaugh A, Endres J, Fox DA, Martin F, Townsend MJ, Synovial phenotypes in rheumatoid arthritis correlate with response to biologic therapeutics. Arthritis Res. Ther 16, ar4555 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Lewis MJ, Barnes MR, Blighe K, Goldmann K, Rana S, Hackney JA, Ramamoorthi N, John CR, Watson DS, Kummerfeld SK, Hands R, Riahi S, Rocher-Ros V, Rivellese F, Humby F, Kelly S, Bombardieri M, Ng N, DiCicco M, van der Heijde D, Landewé R, van der Helm-van Mil A, Cauli A, McInnes IB, Buckley CD, Choy E, Taylor PC, Townsend MJ, Pitzalis C, Molecular portraits of early rheumatoid arthritis identify clinical and treatment response phenotypes. Cell Rep. 28, 2455–2470.e5 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Humby F, Lewis M, Ramamoorthi N, Hackney JA, Barnes MR, Bombardieri M, Setiadi AF, Kelly S, Bene F, DiCicco M, Riahi S, Rocher V, Ng N, Lazarou I, Hands R, van der Heijde D, Landewé RBM, van der Helm-van Mil A, Cauli A, McInnes I, Buckley CD, Choy EH, Taylor PC, Townsend MJ, Pitzalis C, Synovial cellular and molecular signatures stratify clinical response to csDMARD therapy and predict radiographic progression in early rheumatoid arthritis patients. Ann. Rheum. Dis. 78, 761–772 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Zhang F, Jonsson AH, Nathan A, Millard N, Curtis M, Xiao Q, Gutierrez-Arcelus M, Apruzzese W, Watts GFM, Weisenfeld D, Nayar S, Rangel-Moreno J, Meednu N, Marks KE, Mantel I, Kang JB, Rumker L, Mears J, Slowikowski K, Weinand K, Orange DE, Geraldino-Pardilla L, Deane KD, Tabechian D, Ceponis A, Firestein GS, Maybury M, Sahbudin I, Ben-Artzi A, Mandelin AM, Nerviani A, Lewis MJ, Rivellese F, Pitzalis C, Hughes LB, Horowitz D, DiCarlo E, Gravallese EM, Boyce BF, Moreland LW, Goodman SM, Perlman H, Holers VM, Liao KP, Filer A, Bykerk VP, Wei K, Rao DA, Donlin LT, Anolik JH, Brenner MB, Raychaudhuri S, Deconstruction of rheumatoid arthritis synovium defines inflammatory subtypes. Nature 623, 616–624 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Nerviani A, Di Cicco M, Mahto A, Lliso-Ribera G, Rivellese F, Thorborn G, Hands R, Bellan M, Mauro D, Boutet M-A, Giorli G, Lewis M, Kelly S, Bombardieri M, Humby F, Pitzalis C, A pauci-immune synovial pathotype predicts inadequate response to TNFα-blockade in rheumatoid arthritis patients. Front. Immunol. 11, 845 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W, Smyth GK, limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 43, e47 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Guan S, Mehta B, Slater D, Thompson JR, DiCarlo E, Pannellini T, Pearce-Fisher D, Zhang F, Raychaudhuri S, Hale C, Jiang CS, Goodman S, Orange DE, Rheumatoid arthritis synovial inflammation quantification using computer vision. ACR Open Rheumatol 4, 322–331 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.He X, Cai D, Niyogi P, “Laplacian score for feature selection,” 2005; https://proceedings.neurips.cc/paper_files/paper/2005/file/b5b03f06271f8917685d14cea7c6c50a-Paper.pdf.
  • 18.Alivernini S, MacDonald L, Elmesmari A, Finlay S, Tolusso B, Gigante MR, Petricca L, Di Mario C, Bui L, Perniola S, Attar M, Gessi M, Fedele AL, Chilaka S, Somma D, Sansom SN, Filer A, McSharry C, Millar NL, Kirschner K, Nerviani A, Lewis MJ, Pitzalis C, Clark AR, Ferraccioli G, Udalova I, Buckley CD, Gremese E, McInnes IB, Otto TD, Kurowska-Stolarska M, Distinct synovial tissue macrophage subsets regulate inflammation and remission in rheumatoid arthritis. Nat. Med. 26, 1295–1306 (2020). [DOI] [PubMed] [Google Scholar]
  • 19.Zhang F, Wei K, Slowikowski K, Fonseka CY, Rao DA, Kelly S, Goodman SM, Tabechian D, Hughes LB, Salomon-Escoto K, Watts GFM, Jonsson AH, Rangel-Moreno J, Meednu N, Rozo C, Apruzzese W, Eisenhaure TM, Lieb DJ, Boyle DL, Mandelin AM II, Accelerating Medicines Partnership Rheumatoid Arthritis and Systemic Lupus Erythematosus (AMP RA/SLE) Consortium, Boyce BF, DiCarlo E, Gravallese EM, Gregersen PK, Moreland L, Firestein GS, Hacohen N, Nusbaum C, Lederer JA, Perlman H, Pitzalis C, Filer A, Holers VM, Bykerk VP, Donlin LT, Anolik JH, Brenner MB, Raychaudhuri S, Defining inflammatory cell states in rheumatoid arthritis joint synovial tissues by integrating single-cell transcriptomics and mass cytometry. Nat. Immunol 20, 928–942 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Jung M, Dourado M, Maksymetz J, Jacobson A, Laufer BI, Baca M, Foreman O, Hackos DH, Riol-Blanco L, Kaminker JS, Cross-species transcriptomic atlas of dorsal root ganglia reveals species-specific programs for sensory function. Nat. Commun. 14, 366 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Bhuiyan SA, Xu M, Yang L, Semizoglou E, Bhatia P, Pantaleo KI, Tochitsky I, Jain A, Erdogan B, Blair S, Cat V, Mwirigi JM, Sankaranarayanan I, Tavares-Ferreira D, Green U, McIlvried LA, Copits BA, Bertels Z, Del Rosario JS, Widman AJ, Slivicki RA, Yi J, Woolf CJ, Lennerz JK, Whited JL, Price TJ, Gereau RW 4th, Renthal W, Harmonized cross-species cell atlases of trigeminal and dorsal root ganglia. bioRxiv 2023.07.04.547740 [Preprint] (2023); 10.1101/2023.07.04.547740. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Tavares-Ferreira D, Shiers S, Ray PR, Wangzhou A, Jeevakumar V, Sankaranarayanan I, Cervantes AM, Reese JC, Chamessian A, Copits BA, Dougherty PM, Gereau IV RW, Burton MD, Dussor G, Price TJ, Spatial transcriptomics of dorsal root ganglia identifies molecular signatures of human nociceptors. Sci. Transl. Med. 14, eabj8186 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.North RY, Li Y, Ray P, Rhines LD, Tatsui CE, Rao G, Johansson CA, Zhang H, Kim YH, Zhang B, Dussor G, Kim TH, Price TJ, Dougherty PM, Electrophysiological and transcriptomic correlates of neuropathic pain in human dorsal root ganglion neurons. Brain 142, 1215–1226 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Sharma N, Flaherty K, Lezgiyeva K, Wagner DE, Klein AM, Ginty DD, The emergence of transcriptional identity in somatosensory neurons. Nature 577, 392–398 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Kuleshov MV, Jones MR, Rouillard AD, Fernandez NF, Duan Q, Wang Z, Koplev S, Jenkins SL, Jagodnik KM, Lachmann A, McDermott MG, Monteiro CD, Gundersen GW, Maayan A, Enrichr: A comprehensive gene set enrichment analysis web server 2016 update. Nucleic Acids Res. 44, W90–W97 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Chen EY, Tan CM, Kou Y, Duan Q, Wang Z, Meirelles GV, Clark NR, Ma’ayan A, Enrichr: Interactive and collaborative HTML5 gene list enrichment analysis tool. BMC Bioinformatics 14, 128 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Mapp PI, Walsh DA, Garrett NE, Kidd BL, Cruwys SC, Polak JM, Blake DR, Effect of three animal models of inflammation on nerve fibres in the synovium. Ann. Rheum. Dis. 53, 240–246 (1994). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Serafini T, Kennedy TE, Galko MJ, Mirzayan C, Jessell TM, Tessier-Lavigne M, The netrins define a family of axon outgrowth-promoting proteins homologous to C. elegans UNC-6. Cell 78, 409–424 (1994). [DOI] [PubMed] [Google Scholar]
  • 29.Koch M, Murrell JR, Hunter DD, Olson PF, Jin W, Keene DR, Brunken WJ, Burgeson RE, A novel member of the netrin family, beta-netrin, shares homology with the beta chain of laminin: Identification, expression, and functional characterization. J. Cell Biol. 151, 221–234 (2000). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Reuten R, Patel TR, McDougall M, Rama N, Nikodemus D, Gibert B, Delcros J-G, Prein C, Meier M, Metzger S, Zhou Z, Kaltenberg J, McKee KK, Bald T, Tüting T, Zigrino P, Djonov V, Bloch W, Clausen-Schaumann H, Poschl E, Yurchenco PD, Ehrbar M, Mehlen P, Stetefeld J, Koch M, Structural decoding of netrin-4 reveals a regulatory function towards mature basement membranes. Nat. Commun. 7, 13515 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Hayano Y, Sasaki K, Ohmura N, Takemoto M, Maeda Y, Yamashita T, Hata Y, Kitada K, Yamamoto N, Netrin-4 regulates thalamocortical axon branching in an activity-dependent fashion. Proc. Natl. Acad. Sci. U.S.A. 111, 15226–15231 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Lindsay RM, Nerve growth factors (NGF, BDNF) enhance axonal regeneration but are not required for survival of adult sensory neurons. J. Neurosci. 8, 2394–2405 (1988). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Denk F, Bennett DL, McMahon SB, Nerve growth factor and pain mechanisms. Annu. Rev. Neurosci. 40, 307–325 (2017). [DOI] [PubMed] [Google Scholar]
  • 34.Renier N, Wu Z, Simon DJ, Yang J, Ariel P, Tessier-Lavigne M, iDISCO: A simple, rapid method to immunolabel large tissue samples for volume imaging. Cell 159, 896–910 (2014). [DOI] [PubMed] [Google Scholar]
  • 35.Mizoguchi F, Slowikowski K, Wei K, Marshall JL, Rao DA, Chang SK, Nguyen HN, Noss EH, Turner JD, Earp BE, Blazar PE, Wright J, Simmons BP, Donlin LT, Kalliolias GD, Goodman SM, Bykerk VP, Ivashkiv LB, Lederer JA, Hacohen N, Nigrovic PA, Filer A, Buckley CD, Raychaudhuri S, Brenner MB, Functionally distinct disease-associated fibroblast subsets in rheumatoid arthritis. Nat. Commun. 9, 789 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Iijima T, Zhang J-Q, Three-dimensional wall structure and the innervation of dental pulp blood vessels. Microsc. Res. Tech. 56, 32–41 (2002). [DOI] [PubMed] [Google Scholar]
  • 37.Gonçalves Dos Santos G, Jimenéz-Andrade JM, Woller SA, Muñoz-Islas E, Ramírez-Rosas MB, Ohashi N, Ferreira Catroli G, Fujita Y, Yaksh TL, Corr M, The neuropathic phenotype of the K/BxN transgenic mouse with spontaneous arthritis: Pain, nerve sprouting and joint remodeling. Sci. Rep. 10, 15596 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Obeidat AM, Miller RE, Miller RJ, Malfait A-M, The nociceptive innervation of the normal and osteoarthritic mouse knee. Osteoarthr. Cartil 27, 1669–1679 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Aso K, Shahtaheri SM, Hill R, Wilson D, McWilliams DF, Nwosu LN, Chapman V, Walsh DA, Contribution of nerves within osteochondral channels to osteoarthritis knee pain in humans and rats. Osteoarthr. Cartil 28, 1245–1254 (2020). [DOI] [PubMed] [Google Scholar]
  • 40.Zhu S, Zhu J, Zhen G, Hu Y, An S, Li Y, Zheng Q, Chen Z, Yang Y, Wan M, Skolasky RL, Cao Y, Wu T, Gao B, Yang M, Gao M, Kuliwaba J, Ni S, Wang L, Wu C, Findlay D, Eltzschig HK, Ouyang HW, Crane J, Zhou F-Q, Guan Y, Dong X, Cao X, Subchondral bone osteoclasts induce sensory innervation and osteoarthritis pain. J. Clin. Invest. 129, 1076–1093 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Nanus DE, Badoume A, Wijesinghe SN, Halsey AM, Hurley P, Ahmed Z, Botchu R, Davis ET, Lindsay MA, Jones SW, Synovial tissue from sites of joint pain in knee osteoarthritis patients exhibits a differential phenotype with distinct fibroblast subsets. EBioMedicine 72, 103618 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Lejmi E, Leconte L, Pédron-Mazoyer S, Ropert S, Raoul W, Lavalette S, Bouras I, Feron J-G, Maitre-Boube M, Assayag F, Feumi C, Alemany M, Jie TX, Merkulova T, Poupon M-F, Ruchoux M-M, Tobelem G, Sennlaub F, Plouët J, Netrin-4 inhibits angiogenesis via binding to neogenin and recruitment of Unc5B. Proc. Natl. Acad. Sci. U.S.A. 105, 12491–12496 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Chen S-P, Zhou Y-Q, Liu D-Q, Zhang W, Manyande A, Guan X-H, Tian Y-K, Ye D-W, Omar DM, PI3K/Akt pathway: A potential therapeutic target for chronic pain. Curr. Pharm. Des. 23, 1860–1868 (2017). [DOI] [PubMed] [Google Scholar]
  • 44.Borges JP, Mekhail K, Fairn GD, Antonescu CN, Steinberg BE, Modulation of pathological pain by epidermal growth factor receptor. Front. Pharmacol. 12, 642820 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Wang Y, Zhang F, Zhang Y, Shan Q, Liu W, Zhang F, Zhang F, Yi S, Betacellulin regulates peripheral nerve regeneration by affecting Schwann cell migration and axon elongation. Mol. Med. 27, 27 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Wangzhou A, Paige C, Neerukonda SV, Naik DK, Kume M, David ET, Dussor G, Ray PR, Price TJ, A ligand-receptor interactome platform for discovery of pain mechanisms and therapeutic targets. Sci. Signal. 14, abe1648 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Kee Z, Kodji X, Brain SD, The role of calcitonin gene related peptide (CGRP) in neurogenic vasodilation and its cardioprotective effects. Front. Physiol 9, 1249 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Aletaha D, Neogi T, Silman AJ, Funovits J, Felson DT, Bingham III CO, Birnbaum NS, Burmester GR, Bykerk VP, Cohen MD, Combe B, Costenbader KH, Dougados M, Emery P, Ferraccioli G, Hazes JMW, Hobbs K, Huizinga TWJ, Kavanaugh A, Kay J, Kvien TK, Laing T, Mease P, Ménard HA, Moreland LW, Naden RL, Pincus T, Smolen JS, Stanislawska-Biernat E, Symmons D, Tak PP, Upchurch KS, Vencovský J, Wolfe F, Hawker G, 2010 rheumatoid arthritis classification criteria: An American College of Rheumatology/European League Against Rheumatism collaborative initiative. Arthritis Rheum. 62, 2569–2581 (2010). [DOI] [PubMed] [Google Scholar]
  • 49.Arnett FC, Edworthy SM, Bloch DA, McShane DJ, Fries JF, Cooper NS, Healey LA, Kaplan SR, Liang MH, Luthra HS, The American Rheumatism Association 1987 revised criteria for the classification of rheumatoid arthritis. Arthritis Rheum. 31, 315–324 (1988). [DOI] [PubMed] [Google Scholar]
  • 50.Roos EM, Roos HP, Lohmander LS, Ekdahl C, Beynnon BD, Knee Injury and Osteoarthritis Outcome Score (KOOS)—Development of a self-administered outcome measure. J. Orthop. Sports Phys. Ther. 28, 88–96 (1998). [DOI] [PubMed] [Google Scholar]
  • 51.Nilsdotter AK, Lohmander LS, Klässbo M, Roos EM, Hip disability and osteoarthritis outcome score (HOOS) – Validity and responsiveness in total hip replacement. BMC Musculoskelet. Disord. 4, 1–8 (2003). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, Batut P, Chaisson M, Gingeras TR, STAR: Ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Leek JT, Storey JD, Capturing heterogeneity in gene expression studies by surrogate variable analysis. PLOS Genet. 3, 1724–1735 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Love MI, Huber W, Anders S, Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Saeys Y, Inza I, Larrañaga P, A review of feature selection techniques in bioinformatics. Bioinformatics 23, 2507–2517 (2007). [DOI] [PubMed] [Google Scholar]
  • 56.Liu S, Xu C, Zhang Y, Liu J, Yu B, Liu X, Dehmer M, Feature selection of gene expression data for cancer classification using double RBF-kernels. BMC Bioinformatics 19, 396 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Uzma F, Al-Obeidat A, Tubaishat B, Shah Z, Gene encoder: A feature selection technique through unsupervised deep learning-based clustering for large gene expression data. Neural Comput. Appl. 34, 8309–8331 (2022). [Google Scholar]
  • 58.Dhal P, Azad C, A comprehensive survey on feature selection in the various fields of machine learning. Appl. Intell. 52, 4543–4581 (2022). [Google Scholar]
  • 59.Koller MSD, Toward optimal feature selection, in ICML’96: Proceedings of the Thirteenth International Conference on International Conference on Machine Learning, Saitta L, Ed. (Morgan Kaufmann Publishers Inc., 1996), pp. 284–292. [Google Scholar]
  • 60.Wang H, Khoshgoftaar TM, Gao K, A comparative study of filter-based feature ranking techniques, in 2010 IEEE International Conference on Information Reuse & Integration (IEEE, 2010), pp. 43–48. [Google Scholar]
  • 61.Ang JC, Mirzal A, Haron H, Hamed HNA, Supervised, unsupervised, and semi-supervised feature selection: A review on gene selection. IEEE/ACM Trans. Comput. Biol. Bioinformatics 13, 971–989 (2013). [DOI] [PubMed] [Google Scholar]
  • 62.Solorio-Fernández S, Carrasco-Ochoa JA, Martínez-Trinidad JF, A review of unsupervised feature selection methods. Artif. Intell. Rev. 53, 907–948 (2019). [Google Scholar]
  • 63.El Aboudi N, Benhlima L, Review on wrapper feature selection approaches, in 2016 International Conference on Engineering & MIS (ICEMIS) (IEEE, 2016), pp. 1–5. [Google Scholar]
  • 64.Gönen M, Alpaydin E, Multiple kernel learning algorithms. J. Mach. Learn. Res. 12, 2211–2268 (2011). [Google Scholar]
  • 65.van der Maaten L, Hinton G, Visualizing data using t-SNE. J. Mach. Learn. Res. 9, 2579–2605 (2008). [Google Scholar]
  • 66.Wang B, Zhu J, Pierson E, Ramazzotti D, Batzoglou S, Visualization and analysis of single-cell RNA-seq data by kernel-based similarity learning. Nat. Methods 14, 414–416 (2017). [DOI] [PubMed] [Google Scholar]
  • 67.Hinton GE, Roweis S, Stochastic neighbor embedding, in Proceedings of the 15th International Conference on Neural Information Processing Systems (MIT Press, 2002), pp. 857–864. [Google Scholar]
  • 68.Reimand J, Kull M, Peterson H, Hansen J, Vilo J, g:Profiler—A web-based toolset for functional profiling of gene lists from large-scale experiments. Nucleic Acids Res. 35, W193–W200 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Korotkevich G, Sukhov V, Budin N, Shpak B, Artyomov MN, Sergushichev A, Fast gene set enrichment analysis. bioRxiv 060012 [Preprint] (2021); 10.1101/060012. [DOI] [Google Scholar]
  • 70.Raudvere U, Kolberg L, Kuzmin I, Arak T, g:Profiler: A web server for functional enrichment analysis and conversions of gene lists (2019 update). Nucleic Acids Res. 47, W191–W198 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Kanehisa M, Goto S, Sato Y, Furumichi M, Tanabe M, KEGG for integration and interpretation of large-scale molecular data sets. Nucleic Acids Res. 40, D109–D114 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Fabregat A, Jupe S, Matthews L, Sidiropoulos K, Gillespie M, Garapati P, Haw R, Jassal B, Korninger F, May B, Milacic M, Roca CD, Rothfels K, Sevilla C, Shamovsky V, Shorser S, Varusai T, Viteri G, Weiser J, Wu G, Stein L, Hermjakob H, D’Eustachio P, The reactome pathway knowledgebase. Nucleic Acids Res. 46, D649–D655 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Kelder T, van Iersel MP, Hanspers K, Kutmon M, Conklin BR, Evelo CT, Pico AR, WikiPathways: Building research communities on biological pathways. Nucleic Acids Res. 40, D1301–D1307 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP, Issel-Tarver L, Kasarskis A, Lewis S, Matese JC, Richardson JE, Ringwald M, Rubin GM, Sherlock G, Gene Ontology: Tool for the unification of biology. Nat. Genet 25, 25–29 (2000). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Ramilowski JA, Goldberg T, Harshbarger J, Kloppmann E, Lizio M, Satagopam VP, Itoh M, Kawaji H, Carninci P, Rost B, Forrest ARR, A draft network of ligand–receptor-mediated multicellular signalling in human. Nat. Commun. 6, 1–12 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Binder JX, Pletscher-Frankild S, Tsafou K, Stolte C, O’Donoghue SI, Schneider R, Jensen LJ, COMPARTMENTS: Unification and visualization of protein subcellular localization evidence. Database 2014, bau012 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Braschi B, Denny P, Gray K, Jones T, Seal R, Tweedie S, Yates B, Bruford E, Genenames. org: The HGNC and VGNC resources in 2019. Nucleic Acids Res. 47, D786–D792 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Carbon S, Ireland A, Mungall CJ, Shu S, Marshall B, Lewis S, AmiGO Hub, Web Presence Working Group, AmiGO: Online access to ontology and annotation data. Bioinformatics 25, 288–289 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Szklarczyk D, Gable AL, Lyon D, Junge A, Wyder S, Huerta-Cepas J, Simonovic M, Doncheva NT, Morris JH, Bork P, Jensen LJ, von Mering C, STRING v11: Protein–protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res. 47, D607–D613 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Yates B, Braschi B, Gray KA, Seal RL, Tweedie S, Bruford EA, Genenames.org: The HGNC and VGNC resources in 2017. Nucleic Acids Res. 45, D619–D625 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Hao Y, Hao S, Andersen-Nissen E, Mauck III WM, Zheng S, Butler A, Lee MJ, Wilk AJ, Darby C, Zager M, Hoffman P, Stoeckius M, Papalexi E, Mimitou EP, Jain J, Srivastava A, Stuart T, Fleming LM, Yeung B, Rogers AJ, McElrath JM, Blish CA, Gottardo R, Smibert P, Satija R, Integrated analysis of multimodal single-cell data. Cell 184, 3573–3587.e29 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Thomas PD, Ebert D, Muruganujan A, Mushayahama T, Albou L-P, Mi H, PANTHER: Making genome-scale phylogenetics accessible to all. Protein Sci. 31, 8–22 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Zeisel A, Hochgerner H, Lönnerberg P, Johnsson A, Memic F, van der Zwan J, Häring M, Braun E, Borm LE, La Manno G, Codeluppi S, Furlan A, Lee K, Skene N, Harris KD, Hjerling-Leffler J, Arenas E, Ernfors P, Marklund U, Linnarsson S, Molecular architecture of the mouse nervous system. Cell 174, 021 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Movie 1
Download video file (15.3MB, mp4)
Supplementary Movie 2
Download video file (6.8MB, mp4)
supplementary MdAr reproducibility checklist
Data File S1
Data File S2
Data File S3
Data File S4
Data File S5
Data File S6
Data File S7
Data File S8
Data File S9
Supplemental Materials and Methods

Data Availability Statement

All data associated with this study are present in the paper or the Supplementary Materials. The bulk synovial RNA-seq dataset is available at ImmPort (SDY1299). The PEAC dataset is available at https://peac.hpc.qmul.ac.uk. The AMP dataset is available at https://immunogenomics.io/ampsle1/ or ImmPort (SDY997). The raw sequence data are deposited in the sequence read archive under accession SRP135960 in the National Center for Biotechnology Information (NCBI) library. Our code is available at DOI: 10.5281/zenodo.10783506 and target URL https://zenodo.org/doi/10.5281/zenodo.10783505.

RESOURCES