Abstract
Rheumatoid arthritis (RA) is an autoimmune disease involving antigen-specific T and B cells. Here, we perform single-cell RNA and repertoire sequencing on paired synovial tissue and blood samples from 12 seropositive RA patients. We identify clonally expanded CD4 + T cells, including CCL5+ cells and T peripheral helper (Tph) cells, which show a prominent transcriptomic signature of recent activation and effector function. CD8 + T cells show higher oligoclonality than CD4 + T cells, with the largest synovial clones enriched in GZMK+ cells. CD8 + T cells with possibly virus-reactive TCRs are distributed across transcriptomic clusters. In the B cell compartment, NR4A1+ activated B cells, and plasma cells are enriched in the synovium and demonstrate substantial clonal expansion. We identify synovial plasma cells that share BCRs with synovial ABC, memory, and activated B cells. Receptor-ligand analysis predicted IFNG and TNFRSF members as mediators of synovial Tph-B cell interactions. Together, these results reveal clonal relationships between functionally distinct lymphocyte populations that infiltrate the synovium of patients with RA.
Subject terms: Autoimmunity, Adaptive immunity, Lymphocytes
Activated B cells and T cells accumulate within joints of patients with rheumatoid arthritis. Here, the authors use single-cell transcriptome and repertoire profiling to identify clonally expanded synovial B cells and T cells and define their phenotypes and predicted cell-cell interactions.
Introduction
Synovial inflammation in rheumatoid arthritis (RA) involves a complex set of interactions between immune and non-immune cell subsets. A core feature of the immune response in seropositive RA is an adaptive immune response against citrullinated proteins involving both antigen-specific B cells and T cells1 The activation of B cells in RA has long been appreciated, given the characteristic production of disease-associated autoantibodies, including rheumatoid factor (RF) and anti-cyclic-citrullinated peptide (anti-CCP) antibodies2–5. B cells may be activated locally within RA synovium, as studies on synovial tissue have provided evidence of somatic hypermutation (SHM) and clonal expansion6,7. Furthermore, CCP- and RF-specific B cells may undergo distinct activation pathways8. Synovial B cells can contribute antibody-independent functions as well, including antigen presentation and cytokine secretion, which involve interactions with other cell types9–12.
Populations of T cells have likewise been strongly implicated in the initiation and maintenance of synovial inflammation in RA13. Genetic associations indicate a critical role for antigen presentation to CD4 + T cells via MHC class II in the development of RA14,15. Cellular profiling studies of RA synovial tissue and fluid have highlighted a large population of T peripheral helper (Tph) cells, as well as T follicular helper (Tfh) cells, both of which provide help to B cells through the production of IL-21 and CD40L16–19. Tph cells differ from Tfh cells in their migratory patterns, expressing chemokine receptors such as CCR2 and CCR5 to home to sites of peripheral inflammation such as the rheumatoid joint20,21. Large populations of CD8 + T cells also accumulate within RA synovium, including a prominent granzyme K-expressing population, which may contribute to synovial inflammation through inflammatory cytokine production rather than cytotoxicity22.
Complementing studies of specific subsets of immune cells, a holistic picture of both immune and non-immune populations in RA is emerging through single-cell RNA-sequencing (scRNA-seq) atlases of synovial tissue samples23–26. These studies have highlighted the diversity of cell states present in the inflamed tissue of these patients, as well as how the presence and effects of these states may differ among patient subpopulations25,26. For lymphocytes, analyses of the T cell receptors (TCR) or B cell receptors (BCR) can provide unique insights into the expansion and developmental relationships of lymphocyte subsets, leveraging the feature that each new lymphocyte generates a unique TCR/BCR that is shared with its progeny. Further, BCR genes undergo somatic hypermutation during antigen selection which can reveal B cell developmental lineages. Studies tracking TCRs across tissues or longitudinally in RA patients have identified shared T cell clones in different joints27,28, clonal expansion of specific cell subsets29, persistence of expanded clones over time30, and overrepresented gene rearrangements that may suggest shared antigenic targets28,31,32; however, reactivity of expanded TCRs from RA synovial CD4 + T cells to citrullinated peptides has been difficult to demonstrate33. Studies of BCR repertoires of RA patients have indicated somatic hypermutation in RA synovial B cells and identified potential specificities across B cells collected from synovial tissue or fluid6,34–36. A comprehensive examination of both the TCR and BCR repertoires of synovial tissue lymphocyte populations and across tissue and blood at the single-cell level has not been described. Such studies have the potential to directly link clonal features to the functional roles, developmental relationships, and cell–cell interactions of specific lymphocyte phenotypes, as has been achieved in studies of cancer immunotherapy and infectious disease37–40.
Here, we use 5’ droplet-based scRNA-seq on T and B cells of synovial tissue and matched peripheral blood samples from 12 patients with RA to simultaneously study their transcriptomes and antigen receptor repertoires. Our study provides a high-resolution landscape of the clonal relationships within and between cell states, and further between inflamed synovial tissue and peripheral blood.
Results
Single-cell profiling of synovial tissue and peripheral blood lymphocytes
We collected synovial tissue (n = 12) and matched peripheral blood (n = 10) from individuals with RA that comprised a subset of a larger cohort analyzed as part of the Accelerating Medicines Partnership Program: Rheumatoid Arthritis and Systemic Lupus Erythematosus (AMP RA/SLE) Network, prioritizing samples with high synovial cell yields and evident lymphocyte populations by flow cytometry26. Donors had a mean age of 63.6 years (range 28–80) and were predominantly female (n = 11). Patients in the cohort had a Clinical Disease Activity Index (CDAI) classification of moderate or high with a mean score of 26.4 (range 10.5–64.0), and were mainly lymphoid in pathotype, but otherwise had a range of disease duration, treatment, and cell-type abundance phenotype (CTAP) (Supplementary Fig. 1A and Supplementary Data 1)26,41,42.
Tissue samples were previously disaggregated into single-cell suspensions for an unbiased analysis of the cell states present in RA synovium using RNA and cell-surface protein profiling26. Cryopreserved synovial cells remaining after the initial analysis were thawed and sorted to isolate CD45 + CD3+ and CD45 + CD19+ populations, which were subsequently encapsulated into droplets and used to generate gene expression, cell-surface protein, and TCR/BCR single-cell sequencing libraries (Fig. 1A and Supplementary Fig. 1B). Sorted CD3 + T cells and CD19 + B cells were loaded in order to maximize the numbers of both cell lineages captured (Fig. 1E) and minimize sample bias based on varying lymphocyte abundance (Supplementary Fig. 1C, D). Cryopreserved PBMCs were thawed and sorted in parallel. Following a unified single-cell analysis pipeline of all samples, we recovered a total of 84,750 cells. After applying QC criteria, 83,159 cells remained, which we used to perform an initial round of unsupervised clustering at low resolution (Fig. 1B and Supplementary Fig. 2). On average, we obtained 2888 cells per synovial tissue sample (range 804–5188) and 4851 cells per blood sample (range 3442–6601) (Fig. 1C and Supplementary Fig. 1A).
Through parallel examination of gene expression and protein detection, we identified several populations of T cells expressing markers such as CD3E and IL7R, as well as a cluster containing B and plasma cells expressing CD79A and MS4A1 (CD20) (Fig. 1D and Supplementary Fig. 3A, B). Cells from both lineages could be found in all samples (Fig. 1E, F). We further identified a cluster containing proliferating T and B cells, characterized by the expression of markers including MKI67 and TYMS. Lastly, we identified two rare populations of contaminating cells expressing markers of fibroblasts (PRG4 and FN1) and monocytes (S100A8 and LYZ), which were excluded from subsequent analyses (Fig. 1D and Supplementary Fig. 3A, B).
Of the 81,708 lymphocytes captured across all samples, we obtained paired TCR or BCR information for 73,185 cells. In the synovial tissue, an average of 84.7% of lymphocytes per sample had an associated TCR or BCR (range 72.9–96.1%), while in the blood, 93.1% of lymphocytes per sample had this information on average (range 89.3–96.8%) (Fig. 1G and Supplementary Fig. 3C).
Differential abundances of CD4 + T cell populations in synovial tissue and blood
Subclustering of the 35,301 CD4 + T cells obtained from synovial tissue and blood samples identified 14 CD4 + T cell subsets (Fig. 2A, B, Supplementary Fig. 4A, and Supplementary Data 2). We identified two clusters expressing the B cell chemoattractant CXCL13, one of which was composed solely of Tph cells, while the other included some cells with detectable CXCR5, suggesting the cluster contained a mixture of Tfh and Tph cells (Supplementary Fig. 4B, C)16,24,26. Both clusters expressed genes associated with B cell help, such as MAF, and both scored highly for a Tfh cell gene signature (Supplementary Fig. 4B, D and Supplementary Data 3). While the Tfh/Tph cluster displayed higher expression of markers such as IL7R and CD69 compared to the Tph cluster, the Tph cluster had significantly elevated expression of PDCD1, CTLA4, LAG3, and others. Further, the pure Tph subset produced higher gene scores of TCR signaling and T cell activation compared to other CD4+ clusters, suggesting that the cells in this cluster are activated (Fig. 2C, Supplementary Fig. 4D, E, and Supplementary Data 3).
Several other clusters of memory cells with distinctive expression of chemokines, chemokine receptors, and granzymes were present, including CCR7+ (which may represent central memory cells), IL7R+ CCL5+, GZMA+CCL5+, and GZMK+ populations. In addition, we identified a population of cytotoxic CD4+T cells, marked by the expression of GNLY, PRF1, and GZMB, as well as a cluster characterized by a strong interferon-response signature. A small population likely containing a mix of naive and memory cells was distinguished primarily by increased expression of members of the GIMAP family (e.g., GIMAP4, GIMAP5), which has been associated with survival and quiescence in lymphocytes (Supplementary Data 2)43.
Subclustering also revealed two populations of regulatory T cells (Tregs). Both expressed FOXP3, CTLA4, and TIGIT, though one population was marked by stronger expression of IL2RA (encoding CD25) and IL32, while the other displayed higher CCR7 and TCF7, suggesting the presence of populations of effector and central memory-like Tregs, respectively44. A large cluster containing naive T cells expressing SELL, TCF7, and CCR7 displayed the strongest expression signature for a previously identified naive T cell gene set45 (Supplementary Fig. 4B–E). Lastly, an actively proliferating cluster of cells, as well as a subset with elevated mitochondrially encoded RNAs, could be detected (Supplementary Fig. 4B–E). In total, T cell populations identified by this present clustering analysis were well aligned with those identified in analyses of a larger set of synovial tissue samples (of which the 12 synovial samples studied here are a subset), where clustering was performed through a combination of RNA and cell-surface protein profiling, (Supplementary Fig. 4F)26.
We next identified T cell subsets that had differential representation in either synovial tissue or peripheral blood. Several memory/effector cell populations were enriched in synovium compared to blood, including the Tph, Tfh/Tph, CD25-high Treg, IL7R+CCL5+ memory, GZMA+CCL5+ memory, GZMK+ memory, and proliferating clusters (Fig. 2D and Supplementary Fig. 4G). In contrast, blood samples contained increased abundances of the naive, GIMAP+, and interferon-stimulated clusters. On average, 45% of the CD4+ population for each blood sample was composed of cells from the naive cluster (range 18–66%), while only 5% of synovial tissue CD4 + T cells had a naive phenotype (range 2–15%). Thus, a range of memory/effector T cell populations with distinct transcriptomic signatures are enriched in RA synovium compared to blood.
Clonally expanded Tph cells display increased effector and cytotoxic signatures
We then sought to examine the TCR repertoire of the CD4+ populations across synovial tissue and blood. Among the CD4+ populations, the GNLY+ cytotoxic cluster displayed the highest clonal expansion and comprised nearly all clones larger than 20 cells. Though detected clone size can be impacted by the number of cells analyzed, it is notable that the GNLY+ cluster was of lower abundance compared to other CD4 clusters (Fig. 2B), and clonal expansion was still detectable. Clonal expansion, defined as two or more cells with an identical TCR, was also identified among the Tph and GZMA + CCL5+ memory clusters (Fig. 2E and Supplementary Fig. 4H, I). An analysis of clonal sharing among the synovial tissue clusters revealed a high degree of clonal overlap between the Tph and Tfh/Tph clusters, suggesting that cells in these clusters are developmentally related despite their transcriptomic differences (Fig. 2F, G). By subdividing the Tfh/Tph cluster into cells with and without detectable CXCR5 transcript, we identified a set of clones represented in both CXCR5+ Tfh cells and cells within the Tph cluster (Supplementary Fig. 4J). In addition, the synovial Tph cluster shared the highest clonal overlap with the cycling cluster compared to all other subsets, highlighting the active proliferation of the Tph population within the synovial tissue. We also identified a small number of T cells from blood with TCRs that matched synovial Tph or Tfh/Tph cells; these blood T cells were most frequently in the blood Tph or Tph/Tfh clusters (Supplementary Fig. 4K). The pattern of clonal sharing across blood CD4 + T cell clusters overall appeared generally similar to that in synovium, but with a lower degree of overlap, and with the GZMA + CCL5+ subset showing the highest clonal overlap with the proliferating blood CD4 + T cells (Fig. 2F, G).
After identifying the Tph population as highly represented in the synovial tissue, clonally expanded, and actively proliferating, we sought to identify features that differentiate clonally expanded and unexpanded Tph cells in the tissue. Across donors, an average of 12.5% of Tph cells belonged to an expanded TCR clone (range 4–32.5%) (Fig. 2H). By examining transcriptional differences between these cells and Tph cells belonging to an unexpanded clone, clonally expanded Tph cells showed significantly increased expression of signatures of effector function and cytotoxicity, including elevated expression of IFNG, PRF1, CD40LG, and CCL5 (Fig. 2I, J). In contrast, no difference in the Tfh gene signature score was present between expanded and unexpanded cells, suggesting that clonally expanded Tph cells do not lose their B cell-helping functions. Interestingly, a similar comparison using cells from the Tfh/Tph cluster yielded no significant differences between expanded and unexpanded cells, further suggesting a unique set of features among cells in the Tph clusters (Fig. 2K).
Expanded CD8 + T cell clones across tissue types differ by patterns of GZMK and GZMB expression
Next, we isolated and subclustered the CD8 + T cells present in the synovial tissue and blood samples, which revealed nine CD8 + T cell populations that could be found across samples (Fig. 3A, B, Supplementary Fig. 5A, and Supplementary Data 2). Among these, three populations were distinguished by expression patterns of GZMK and GZMB, all of which had strong corresponding matches to clusters identified in a recent study of synovial tissue (Supplementary Fig. 5B, C)26. One of these clusters solely expressed GZMK, while another expressed GZMK and low levels of GZMB. Both of these populations also had elevated GZMA and CCL5, while the GZMK/B+ cluster was further differentiated through increased expression of markers, including CCL4 and HLA-DRA, and decreased IL7R. The third population, characterized by expression of GZMB only, appeared highly cytotoxic through gene module analysis and elevated expression of GNLY and PRF1 (Supplementary Fig. 5B–D). We also isolated a population of likely resident memory CD8 + T (Trm) cells, characterized by increased ZNF683 and XCL1, which was supported more broadly through examination of a previously published Trm gene list and consistent with a recent description of synovial Trm cells (Supplementary Data 3)46,47. We identified one naive CD8+ population with high expression of SELL and LEF1 and a naive gene module signature (Supplementary Fig. 5B–D). Another population of likely naive cells was also identified, characterized by increased expression of IGTB1 and LMNA. Lastly, we found separate clusters of CD8 + T cells with elevated interferon-response, proliferation, or mitochondrial gene modules (Supplementary Fig. 5B–D).
An analysis of cluster representation between synovial tissue and peripheral blood samples found the GZMK/B+ cluster to be highly increased in synovial tissue samples (Fig. 3C and Supplementary Fig. 5F), consistent with a recent report22. Clusters solely expressing GZMK or GZMB were not significantly different in abundance between tissue compartments, though the GZMK+ memory population trended higher in synovial tissue, while the GZMB TEMRA cluster appeared slightly elevated in the blood (Supplementary Fig. 5F). Additionally, the Trm, proliferating, and mitochondrial-high clusters had elevated representation in synovial tissue samples. In contrast, the naive and ITGB1-elevated populations could both be found at higher frequencies in the blood (Fig. 3C and Supplementary Fig. 5F).
After characterizing the cell states present among the CD8+ population, we sought to connect these clusters’ clonal attributes. Broadly, we found a much larger degree of clonal expansion in the CD8+ compartment compared to the CD4+ subsets. We found the strongest clonal expansion among the GZMK/B+ memory and GZMB + TEMRA clusters, and also noted cells in the ISG-high and proliferating clusters to belong to expanded clones (Fig. 3D and Supplementary Fig. 5G). Comparing the top 50 largest clones across the synovial tissue and blood revealed striking differences in cluster composition. Cells from the top 50 clones in the synovial tissue belonged overwhelmingly to the GZMK/B+ memory population. By contrast, the top 50 clones in the blood belonged to the GZMB + TEMRA cluster, with minor representation of other populations (Fig. 3E, F). This analysis of the largest clones, and a broader examination of the clonal overlap between subclusters in either tissue, showed sharing between the GZMK/B+ memory cluster and proliferating cells of the synovial tissue, while the GZMB + TEMRA was found to overlap with the proliferating component of the blood (Supplementary Fig. 5H, I). To further analyze this, we divided the proliferating cluster into subsets and mapped these cells onto the other CD8+ cells of this dataset. Compared to the general cluster proportions, the tissue and blood proliferating cells mapped heavily to the GZMK/B+ (201/261 tissue proliferating cells) and GZMB + TEMRA (24/38 blood proliferating cells) clusters, respectively, suggesting an active, though likely differing, role of these two populations in their distinct tissues (Fig. 3G).
A further examination of clonal overlap across tissues revealed a small degree of sharing between the tissue and blood components of either the GZMK/B+ memory or GZMB + TEMRA clusters separately, but clonal overlap between synovial tissue GZMK/B+ cells and blood GZMB + TEMRA cells was essentially absent (Supplementary Fig. 5J). Of the top 50 clones in both tissue compartments, only six could be found in both, nearly all of which were larger in the blood than the synovial tissue. Notably, four of these clones retained a GZMB+ skew even in the synovial tissue (Fig. 3E, F). Together, these results suggest functionally distinct roles for the expanded GZMK/B+ cells of the synovial tissue and the GZMB + TEMRA cells of the blood. The findings corroborate recent work suggesting that the GZMK/B+ cells likely do not arrive in the synovial tissue as GZMB+ cytotoxic T cells that subsequently alter their phenotype, and instead may enter the synovium as GZMK+ cells or expand locally in the tissue as they adopt a phenotype that includes GZMK expression22.
Predicted virus cross-reactive CD8 + T cells do not display altered expansion or phenotypic characteristics
Viral infection has long been connected with the potential for the development and maintenance of autoimmune diseases48. One possibility that may instigate this phenomenon is the presence of cross-reactive epitopes between a virus and endogenous proteins, which may drive the activation and expansion of a set of T cells. We sought to identify potentially viral-specific T cells in the synovial tissue to evaluate the extent of potential viral cross-reactivity among RA synovial CD8 T cells. To accomplish this, we gathered databases of previously identified CMV, EBV, and influenza A-specific T cells49,50. We then identified exact matches with the beta chain CDR3 sequence and HLA between the RA synovial CD8 + T cells analyzed here and those in the database (Supplementary Fig. 6A). Although three of the RA patients (RA04, RA05, and RA07) expressed less common HLA alleles (Supplementary Data 5), precluding the ability to find matches with previously-discovered specific clones, we were able to identify matching clones in the majority of patients (Fig. 3H and Supplementary Fig. 6B). Within and between patients, we further found these matches to be directed against multiple different epitopes for each of the viruses tested (Supplementary Fig. 6C). When examining which cluster these matching T cells belong to, we found a spread across clusters, with no cluster being significantly overrepresented compared to non-matching cells (Supplementary Fig. 6D). Further, we noted that few matching clones were expanded (16/56), with the largest matching clone comprising only seven cells. For comparison, analysis of CD8 + T cells from blood yielded similar results, with a comparable number of viral-reactive T cells, and a similarly broad distribution of cell phenotypes represented by viral-reactive cells (Fig. 3J and Supplementary Fig. 6E, F).
Requiring an exact match of clone sequence and HLA can provide strong evidence of the capacity for reactivity against a virus; however, identifying shared motifs between viral-specific T cells and those in our dataset may allow the identification of a larger set of T cells with the potential to detect viral epitopes. Thus, we employed GLIPH2 to identify motifs within the beta chain CDR3 sequence enriched in viral-specific T cells and our CD8 + T cells51. After running the GLIPH2 algorithm individually with each virus (CMV, EBV, and influenza) for RA synovial T cells, we filtered motifs that contained both virus and RA clones, and only those with an HLA match (Supplementary Fig. 6G). We obtained a larger number of clones belonging to a viral-associated GLIPH motif (369 unique clones) compared to our exact matches (56 unique clones), which was variable across donors (Fig. 3I and Supplementary Fig. 6H, I). To strengthen the association of the hits within our data, we sought to identify a relationship between patient age and the percent of CD8 + T cells associated with potential viral reactivity. Though not significantly correlated, we found trends between donor age and the percent of potential CMV- and EBV-reactive clones, consistent with the dynamics of the anti-viral repertoire with age52,53 (Supplementary Fig. 6J). Similar to results with exact matches, a breakdown of the cluster makeup of motif-matching cells and non-matching cells again revealed no significant differences among synovial CD8 + T cells, with similar results also obtained for CD8 + T cells from blood (Fig. 3J). Further, we identified no differences in the clone size distributions between these two groups, and also found no clones belonging to viral GLIPH motifs among the 50 largest synovial tissue clones (Fig. 3K).
Identification of activated innate T cell populations in RA synovium
Populations of innate T cells have previously been associated with RA, including natural killer (NK), γδ, and mucosal-associated invariant T (MAIT) cells, though clear roles for many of these subsets in RA remain elusive54–57. We identified and subclustered innate T cells in the synovial tissue and blood samples, resulting in seven subsets representing multiple innate lineages (Fig. 4A, Supplementary Fig. 7A, B, and Supplementary Data 2). Two populations of γδ T cells, including a Vδ1 subset characterized by expression of TRDV1 and a Vδ2 subset expressing TRDV2 and TRGV9 (the TCR γ variable gene commonly paired with TRDV2), were retrieved58. The Vδ1 population had an elevated expression of GZMB and TIGIT, while Vδ2 cells had higher levels of TYROBP. MAIT cells were also identified among the innate cells, expressing markers including SLC4A10, AQP3, and ZBTB16 (Fig. 4B and Supplementary Fig. 7C). Two populations of NK cells were detected, including CD56-dim and CD56-bright subsets, that aligned with corresponding NK populations in a scRNA-seq reference (Supplementary Fig. 7D)26. These CD3- subsets were not intended to be included in the sorting scheme used in this study; thus, their frequency may not reflect the true representation in these samples. In addition to these clusters, we also found a population characterized by the expression of ZNF683 (encoding Hobit), which appears to contain both γδ and NK cells and a population of innate cells with elevated mitochondrial gene expression (Supplementary Fig. 7C). When comparing the frequency of these populations between synovial tissue and peripheral blood, only the Vδ1 subset had significantly increased representation in the synovium, comprising a mean of 2.1% of T cells in the tissue (range 0.1–4.5%) and 0.5% (range 0.1–1.4%) in the blood (Fig. 4C and Supplementary Fig. 7E).
Leveraging the dataset’s repertoire information, we confirmed the presence of MAIT cells through an examination of the TCR alpha chain rearrangements. The MAIT population was characterized by its use of TRAV1-2, often accompanied by TRAJ33, TRAJ20, or TRAJ12 rearrangements59. The pairing of the TRAV1-2 and TRAJ33/20/12 gene rearrangements could be detected in over half of the cells with an associated TCR in the cluster, while it was largely absent across other T cells in the dataset (Fig. 4D and Supplementary Fig. 7F, G). We then sought to clonally track these MAIT cells between tissue and blood and found expanded clones uniquely represented in either tissue, as well as a subset of clones that were present in both synovium and blood (Fig. 4E and Supplementary Fig. 7H).
To characterize transcriptional differences between the clonally expanded and non-expanded MAIT across these tissues, we examined signatures of activation. Synovial MAIT cells had elevated activation scores compared to MAIT cells in the blood (Fig. 4F). A similar pattern was observed with Vδ1 and Vδ2 cells (Supplementary Fig. 7I). Because MAIT cells are known to become activated through both TCR-dependent and -independent mechanisms60, understanding how this relates to differences between tissues is relevant to better decipher the potential role of these cells in RA synovium. Gene signatures of TCR-dependent and -independent MAIT activation from multiple independent sources showed significantly enriched scores for both mechanisms in synovium compared to blood61,62. Both clonally expanded and non-expanded MAIT cells from synovial tissue had higher signatures for both mechanisms compared to their blood counterparts, but no significant difference could be detected between the expanded and non-expanded subsets of either tissue, suggesting subsets of MAIT cells in synovial tissue may become activated by either TCR-dependent and independent mechanisms (Fig. 4G, H, Supplementary Fig. 7J, K, and Supplementary Data 3). Together, these transcriptomic and repertoire data provide strong evidence for the presence of a defined MAIT cell population within RA synovium that appears activated.
Activated B cells are enriched in the synovium
Analogous to the T cell analysis, we characterized 27,869 B cells through subclustering to obtain eight B cell and two plasma cell populations (Fig. 5A–C, Supplementary Fig. 8A–C, and Supplementary Data 2). We annotated four of these populations as naive subsets based on relatively higher expression of naive markers such as IgD and TCL1A, strong mapping to naive B cells from blood and tonsil in the published literature (Fig. 5E) and recently identified in RA synovial tissue using RNA and surface protein expression (Supplementary Fig. 8B)26. Two of the naive B clusters were distinguished by relative levels of IgD expression (Naive-IgD-low, Naive-IgD-high). The Naive-IgD-low is also distinguished by higher FCER2 (CD23). The other two naive clusters had elevated expression of HSPA1B (Naive-HSPA1B+) or higher mapped mitochondrial reads (Naive-MT-high) and had weaker mapping to a naive B cell state (Fig. 5E and Supplementary Fig. 8B, D). These latter two clusters were the dominant naive population in the synovium. Naive-HSPA1B+ has an activated phenotype with upregulation of NR4A1 and DNAJB1 in addition to HSPA1B. Given the upregulation of ZEB2 and ITGAX (CD11c) (Supplementary Fig. 8D, E), this subset resembles an activated naive B cell described as expanded in the blood of patients with lupus63. The other six clusters are non-naive and dominate the synovial B cells. On average, 64.5% of the B cells for each blood sample were composed of cells from the naive clusters (IQR 59.5–86.7%), while only 9.1% of synovial tissue B cells had a naive phenotype (IQR 5.5–12.0%). We identified a cluster of memory B cells expressing memory markers, including CD27, TNFRSF13B, S100A10, and S100A4 (Fig. 5B and Supplementary Fig. 8C–E). This memory cluster maps to a peripheral blood memory B cell signature (Fig. 5E) and the switched memory population described by ref. 26 (Supplementary Fig. 8B). Finally, over 60% of blood and over 75% of synovial memory B cells are class-switched based on V(D)J sequencing (Fig. 5H).
Additional B cell populations included age-associated B cells (ABCs), activated-, and LILRA4 + B cells. Age-associated B cells (ABCs) expressed the canonical markers CD11c and FCRL5 as previously described (Fig. 5B and Supplementary Fig. 8C)24,26,63,64, as well as other ABC makers such as TBX21 (T-bet) and ZEB2 (Fig. 5B and Supplementary Fig. 8D). Interestingly, one of the top differentially expressed genes in the ABC cluster is IFI30, highlighting activation of interferon signaling as a potential driver of ABCs in situ (Supplementary Fig. 8C). The B-cell activated cluster was annotated based on high expression of NR4A1, related early response and germinal center light zone (GC LZ) genes (Fig. 5B, E and Supplementary Data 3). Notably, both ABCs and activated B cells showed evidence of class-switch recombination (Fig. 5H, I), consistent with previous reports7,63. Another small B cell population was named clonal-LILRA4+ based on high BCR clonality and high LILRA4 and G-protein signaling molecules. This small population only appeared in three PBL samples, with 87 out of 89 total cells belonging to one sample, and was omitted from downstream analyses. We used XBP1 and CD27 expression to identify plasmablast (XPB1+CD27int) and plasma cells (XBP1+CD27hiIgA + IgG+). Plasmablasts also mapped to a GC dark zone (GC DZ) signature (Fig. 5E), likely reflective of a high proliferation state (see Supplementary Fig. 9A for specific genes). Both populations exhibited high levels of class-switch recombination and SHM (Fig. 5F–I and Supplementary Data 3).
Next, we tested each population for enrichment in the blood versus synovium. All populations were found in both the blood and synovium and across multiple donors (Supplementary Fig. 8A). Activated B cells and plasma cells were significantly more abundant in the synovium. In contrast, Naive-IgD-low, Naive-IgD-high, B-memory, and Naive-MT-high clusters were significantly enriched in the blood (Fig. 5D and Supplementary Fig. 10D). We next sought to identify features that differentiate synovial from blood B cells by performing GSEA (Supplementary Fig. 10B). Many of the B cell populations in the synovium showed enrichment for GO pathways associated with cell activation and cytokine-mediated signaling (Supplementary Fig. 10B). The former is consistent with the overall activated state of B cells in the synovium. It is interesting that the most significant enrichment for cytokine-mediated signaling in any synovial B cell state is within the plasmablasts, suggesting that cytokine signaling is a critical mediator of plasmablast generation in the tissue (Supplementary Fig. 10B).
Accumulation of somatic hypermutation and class switch in synovial B cells
A unique feature of this dataset is the ability to match the precise BCR sequence to the transcriptomically defined B cell states in both synovium and paired blood. Thus, we can determine which populations acquire SHM, the nature of those mutations, and the differences between tissue and blood. As expected, memory B cells and plasmablasts/plasma cells had significant levels of SHM. Of additional interest, activated B cells and ABCs also had higher levels of SHM compared to the Naive-IgD-high population as a reference (Fig. 5F).
We then compared the rate of SHM between blood and synovial cells for each population (Fig. 5G) Interestingly, three naive-like B cell populations—Naive-IgD-low, Naive-HSPA1B+, and Naive-MT-high— had significantly higher mutation rates in the synovium compared to blood, suggesting a spectrum of in situ naive B cell activation as we recently described7. Notably, ABCs and activated B cell populations also exhibited significantly higher mutation in the synovium, suggesting in situ activation and selection. The only population that had significantly higher mutation rates in the blood were plasma cells. Naive-IgD-high, memory, and plasmablasts did not show significance for tissue-specific mutation differences (Fig. 5G).
Additionally, we examined the amount of class-switching occurring within each cell population and between blood and synovial cells. Synovial B cell states exhibited significantly more class-switched BCRs (IgG or IgA+, IgD−) compared to their blood counterparts across all populations, except for naive-IgD-high B cells (Fig. 5H). All cell populations showed a statistically significant increase in the amount of class-switched BCRs compared to the least class-switched population: naive-IgD-high B cells (Fig. 5I). Of note, over 50% of the synovial ABCs showed evidence of class-switching consistent with the emerging concept that ABCs generated during normal immune response are a subset of memory B cells65,66. It is also of interest that synovial ABCs, plasmablasts, and plasma cells have very distinct IgH isotype usage compared to their blood counterparts, with far fewer IgA and more IgG (Fig. 5H).
Evidence of in situ antigen exposure and clonal expansion in synovial B cells
Using the BCRs recovered in our dataset, we identified groups of clonally related B cells by quantifying the similarity between their IgH CDR3 sequences. As a first approach to this analysis, we employed a stringent 96.5% sequence homology to identify clones. This was done in order to identify identical clones and thus establish developmental relationships between cell states. Cell populations that were more highly mutated produced larger B cell clones, as has been reported in analyses of blood B cell repertoires from healthy individuals (Fig. 6A, B)67. For example, plasmablast and plasma cells, which were recovered from nearly all samples (Supplementary Fig. 8A), had a higher proportion of large clones (20–100 cells) than other populations, consistent with antigen-driven clonal expansion (Fig. 6A, B). For plasmablasts/plasma cells, clonal expansion was more prominent in the blood than in the synovium (Fig. 6B). This suggests that plasma cells in the synovium are experiencing different selection pressures than those circulating in the periphery, also supported by the distinct IgH isotype expression (Fig. 5H). In the synovium, B cell populations other than plasma cells had mostly singletons or, within the activated B cells, ABCs, and MT-high, smaller clones (2–5 cells). Overall, the presence of both clonal expansion and higher SHM is consistent with stimulation and provision of T cell help within these more activated B cell populations. Further, consistent with an antigen-experienced B cell repertoire, activated B cells and ABCs had shorter CDR3 length and higher charge overall in both the synovium and blood (Supplementary Fig. 10A)68. In the blood, clones were also mostly singletons but with a proportion of small, medium, and even large clones detected in the activated, memory, MT-high, ABCs, and Naive-HSPA1B+ B cells (Fig. 6B).
To assess clonal relationships between cell types, co-occurrence of expanded clone members between cell types was reported for each clone that contained a member within two different cell types. Though the vast majority of clones were contained within a specific population, we did identify clones that were shared between populations. Figure 6C depicts the clonal sharing across populations within each compartment. Within the synovium, plasma cells and plasmablasts share a large number of clones (24 clones, Fig. 6C), strongly supporting a developmental relationship between newly generated plasmablasts and more mature plasma cells. Notably, we also observed shared clones between ABCs and activated B cells, as well as between both these B cell states and the plasma cells. A small number of clones are also shared between the MT-high B cell state and both activated B cells and ABCs (Fig. 6C, D). Overall, this data suggests a developmental relationship from Naive-MT-high (a naive B cell population already showing some signs of antigenic stimulation based on higher SHM rates compared to resting naive—Fig. 5F) alternatively down an ABC vs. activated B cell pathway, as well as between these cell states and then downstream to plasma cells. It is also of interest that memory B cells share clones with activated B cells and plasma cells, consistent with memory B cells participating in synovial immune reactions (Fig. 6C, D). Within the peripheral blood, there is substantially more clonal sharing between multiple cell states (Fig. 6C). This is also evident when examining the cell population composition of the 50 most dominant clones in the blood compared to the synovium (Fig. 6E, F). As an example, the 5th most expanded blood clone was observed in the ABC, plasmablast, Naive-IgD-low, MT-high, and memory B cells (Fig. 6F). In contrast, the 50 most dominant synovial clones are heavily weighted toward plasma cells (Fig. 6E). Though clones were shared between the synovium and blood, these clones were not expanded (Fig. 6G). Tissue-trafficking clones were found within plasma cells, ABCs, and activated and memory populations, consistent with some trafficking of these B cells between the two compartments (Fig. 6H). Interestingly, clones shared between blood and tissue had distinct phenotypes (Fig. 6H). As an example, there are multiple cases of memory B cells in the blood becoming activated B cells or ABCs or plasma cells in the synovium, again strongly suggesting that memory B cells participate in synovial immune reactions.
In order to identify features that differentiate clonally expanded from unexpanded B cells in the synovium, we performed GSEA. We focused this analysis on plasmablasts and plasma cells, given the limited numbers of clones in other cell states. Clonally expanded plasmablasts showed increased expression of signatures of BCR signaling, cytokine-mediated signaling, and response to cytokines as compared to non-clonally expanded cells (Supplementary Fig. 11).
We repeated the analysis of clonally related B cells using a less stringent IgH CDR3 DNA sequence similarity of 80%69. The focus here was to generate and analyze clonal lineages in order to begin to map in situ immune reactions. Using this approach, more frequent small clones were detected in multiple cell states in both blood and synovium (compare Fig. 6B to Supplementary Fig. 12A, B). There were also more shared clones between cell states in the synovium, with sharing now detected from the naïve IgD-low and frequently across three or more populations (compare Fig. 6D to Supplementary Fig. 12C). As with 96.5% homology analysis, the most highly expanded synovial clones were in the plasmablast and plasma cell compartments (compare Fig. 6E to Supplementary Fig. 12D). With clonal lineages defined at 80% homology, more tissue-trafficking clones were identified (compare Fig. 6G, H to Supplementary Fig. 12E, F), again with examples of transitions of cell states between blood and tissue clones. Examples of B cell lineage trees highlight the developmental relationships from naïve IgD-low to ABC, activated, and plasma cells (Supplementary Fig. 12G, H).
Identification of altered T cell-B cell communication patterns in synovium
Given the association of BCR signaling, B cell activation, and response to cytokines with the enriched and clonally expanded B cells in the synovial analysis, we next sought to systematically investigate potential T cell-B cell interactions. We constructed cell–cell communications networks using CellChat70, with an initial focus on communication differences between the CD4 + T cell subpopulations and total B cells to assess the viability of the technique to first decipher well-characterized interactions. The proliferating, Tph, and Tfh/Tph subsets had the largest numbers of significant interactions identified with B cells (Fig. 7A). Detection of significant CXCL13-CXCR5 interactions between Tph-B cell and Tfh/Tph-B cell pairs is consistent with prior reports and supports the performance of the analysis method16. A significant IFNG-IFNGR interaction was further detected between Tph-B but not Tfh/Tph-B pairs, consistent with the increased expression of IFNG in the Tph cluster compared to the Tfh/Tph cluster (Supplementary Fig. 13A, B). We then generated an inferred communication network using CD4 + T subpopulations and B cell subpopulations. Examination of the cumulative incoming and outgoing interactions for each population identified the Tph population as having elevated signals for both directions, while the ABC population had the strongest outgoing signal of any cluster (Supplementary Fig. 13B). A pairwise analysis of significant interactions between these CD4 + T and B cell populations found the proliferating cluster to have the largest number of predicted interactions with all B cell subsets, though this may reflect a heterogeneous nature of proliferating cells. Aside from the proliferating cluster, elevated interactions between Tph cells and the ABC, plasmablast, memory, and clonal B cell populations were identified. The number of significant interactions for each of these pairs was higher than with any other CD4+ population, suggesting an elevated signaling potential of the Tph cluster (Fig. 7B).
Next, we leveraged the cross-tissue nature of our dataset to compare T cell-B cell signaling differences between synovial tissue and blood. We generated a separate communication network for each, both of which returned roughly similar numbers of significant interactions detected with a slight elevation in the synovium (Supplementary Fig. 13D). Within the synovial tissue, the TNF, CXCL chemokines, IL-2, and IFN type II signaling pathway families were elevated, together highlighting characteristics of an inflammatory and immune-activated state of the tissue. Among the signaling pathways underrepresented in the tissue compared to signaling in the blood were CCL, SELPLG, ICAM, and ITGB pathway families, which are often expressed in cells migrating in the blood (Fig. 7C). Finally, we sought to identify cell–cell interactions elevated in the synovial tissue between Tph or Tfh/Tph cells and either ABC, Memory, or Activated B cells. In all analyzed pairs, BTLA-TNFRSF14 (HVEM) and CXCL13-CXCR5 interactions were elevated in the synovium compared to the blood. In contrast, ITGB2-ICAM2 and LGALS9-CD44/45 interactions were higher in most blood pairs. Comparing different interactions between Tph-B subsets and Tfh/Tph-B subsets, LTA-TNFRSF14 (HVEM) was identified in all synovial Tph-B cell pairings but absent in Tfh/Tph-B cell pairings. In the synovium, IFNG-IFNGR was a significant interaction in both Tph/ABC and Tph/Activated B cell pairings, and LTA-TNFRSF1B (TNFR2) was specific to the Tph/ABC pairing only (Fig. 7D).
Discussion
By leveraging paired single-cell RNA and TCR/BCR sequencing and paired synovium and blood samples, this work provides a detailed assessment of the relationship between the immune repertoire and cell state composition, gene expression, blood and synovial lymphocyte trafficking, and cell–cell interactions in RA synovium. These data provide insight into the developmental relationships between specific synovial lymphocyte populations, for example, demonstrating clonal links connecting ABC and activated B cells with plasma cells, while separating clonally distinct Tph/Tfh vs CCL5 + CD4 T cells and GZMK+ vs GZMB + CD8 + T cells (Fig. 8).
Within the CD4 T cell compartment, the Tph cells express among the highest effector and activation signatures compared to other subsets and are significantly enriched in synovium compared to the blood, consistent with prior observations16,71. Here, we leverage the paired repertoire information of this population, identifying it as one of the most clonally expanded CD4 T cell populations and clonally related to both proliferating cells and Tfh cells. We acknowledge that a potential limitation of our approach is variability in the number of synovial T cells (and B cells) isolated and analyzed from different samples, which can affect the clone sizes detected. Nonetheless, we observed that Tph cells from multiple synovial tissue samples were clonally expanded and related to CXCR5+ Tfh cells, consistent with recent work tracking clonal relationships of Tph cells in synovial fluid29. Analyses of synovial tissue performed here suggest that Tfh cells, as contained within the Tfh/Tph cluster, may be more abundant in synovial tissue than in synovial fluid, in which PD-1hi CD4 + T cells are predominantly CXCR5- Tph cells. Further, our identification of some shared clones between Tph in synovium and in blood, even with small numbers of total T cells analyzed, supports the notion that a portion of Tph cells in blood have both transcriptomic and clonal relationships to Tph cells in synovium.
The gene expression patterns in clonally expanded Tph cells, combined with predicted cell–cell interactions, suggest additional roles for Tph cells, where upon T cell activation and clonal expansion, they upregulate factors related to effector function and cytotoxicity, such as GZMB, IFNG, and CCL5. This result may suggest a direct contribution by this population in promoting tissue inflammation or injury through cytotoxic activity in addition to its well-characterized B cell helper function (Fig. 8). Additionally, the IFNG production by activated and clonally expanded Tph cells could drive ABC production, while CCL5 production could attract additional effector T cells and myeloid cells.
While analyses of CD8 T cells have largely focused on the expression of cytotoxic features such as granzyme B, recent observations have highlighted a prominent CD8 T cell population in RA synovium with distinct expression of granzyme K22. Our analyses further underscore a key distinction between GZMK+ and GZMB + CD8 + T cells in RA synovium and blood. Whereas in the blood GZMB-expressing cytotoxic cells formulate the largest clones, CD8+ cells that express GZMK comprise the majority of the most expanded cells in the tissue. A striking finding through the current and prior analyses was the near-absence of clonal overlap between GZMK+ cells and GZMB+ cells across tissues, suggesting that GZMK+ cells do not arrive at the synovium as GZMB-expressing cytotoxic cells and may instead receive antigenic stimulation locally that drives clonal expansion and functions such as cytokine production22.
The presence of T cell cross-reactivity between virus and self has previously been associated with the capacity for driving autoimmunity, including in RA, yet the potential for virus-reactive T cells to contribute to synovial inflammation has remained uncertain72–75. Our analyses identify multiple synovial T cell clones that match those previously identified to be viral-reactive, a result consistent with the recent demonstration of viral reactivity among TCRs from RA synovial CD4 T cells33. Our paired RNA-seq/TCR analysis enabled interrogation of the phenotypes of potentially viral-reactive CD8 T cells, yet no broad-scale difference in the cluster composition or clonal characteristics of these cells was apparent. Because our methodology relied on utilizing viral-specific clones that had previously been identified on defined HLA alleles, there are likely a number of “false-negative” clones in the dataset that could be viral-reactive yet did not have a match in the examined databases. Future work to better define the viral-reactive capacity of T cells within the joint may rely on isolating viral-specific cells using viral peptides bound to tetramers and evaluating them for cross-reactivity to synovial antigens. Still, this work shows the presence of likely cross-reactive T cells within the synovium, yet with no enrichment for specific activated or effector phenotypes.
Populations of innate T cells are thought to contribute to RA76,77. Here, we define the subsets of innate T cells present in the synovium, including populations of γδ T and MAIT cells, with selective enrichment of Vdelta1 γδ T cells in the synovium compared to blood, consistent with their enrichment in other tissues including gut and skin78,79. Leveraging paired TCR information, this is the first single-cell RNA sequencing study in RA to confirm the presence of MAIT cells by analyzing VDJ gene rearrangements. Our detection of shared MAIT cell clones in synovium and blood suggests that these cells may traffic into and out of synovium. Synovial MAIT cells showed an increased activation signature compared to those in blood, as did γδ T cells, suggesting an active role for these cells in inflammatory arthritis.
One of the striking findings of our study is the enrichment of activated B cell populations in the synovium with evidence of clonal expansion and clonal sharing between different B cell states. We achieved unprecedented resolution of discrete B cell states, and in contrast to a previous study36, our data demonstrated clonal sharing between multiple B cell states extending beyond memory B cell and PC pools. We further extend the observation from previous studies34,36 that synovial plasma cells are generated from locally activated B cells, including activated B cells, ABCs, and memory B cells. The finding that the ABCs are a precursor of plasma cells in the synovium is consistent with a recent report80. Further, our work highlights the likely important signals promoting synovial B cell activation and selection, including antigen (reflected in upregulation of BCR signaling), cytokines (most notably IFNG), and direct cell–cell interactions, mainly involving Tph/Tfh cells.
Our study highlighted a spectrum of B cell activation unique to the synovium. We identified multiple naïve-like B cell states, with surprising evidence of antigen encounter/activation based on higher mutation rates. The majority of the synovial B cells in our study are non-naive, dominated by a B-activated cluster expressing NR4A17, ABCs, and plasmablasts/plasma cells. The higher mutation rates in these synovial enriched B cell subsets compared to their blood counterparts suggests that these cells are under different selection pressures. Although it is possible that more activated, somatically mutated B cells preferentially home to the synovium, we favor the hypothesis that activation and SHM occurs in situ, also supported by synovial clonal expansion and clonal sharing across cell states. Of note, the synovial B cell SHM is lower than germinal center B cells in the tonsil and similar to B cell mutation rates reported in other inflamed tissues81–83. We have hypothesized that activated B cells are involved in initiating ectopic lymphoid structures (ELS), as evidenced by their higher expression of ELS-inducing cytokines like LT and IL67. The data presented here further highlight that NR4A1+ activated B cells may go down an extrafollicular pathway as they share clones with ABCs and have a lower mutation rate compared to tonsil germinal center B cells. The availability of paired analysis of T and B cells from the same synovial samples allowed us to directly link clonally expanded T cell populations with synovial tissue expanded B cells for the first time. The most striking predicted interactions were identified between proliferating T cells and Tph with activated B cells, ABCs, and plasmablasts. Notably, Tph and ABCs had among the largest numbers of incoming and outgoing interactions, suggesting an elevated signaling potential for these populations. Signaling pathways enriched in the synovium and identified in the Tph-ABC interaction included cytokines (e.g. IFNG- produced by Tph and IFNGR on ABCs, LTA-TNFRSF14 (HVEM), and BTLA-TNFRSF14 (HVEM) interactions) and chemokines (see schematic in Fig. 8). Notably, GSEA also identified cytokine-mediated signaling as prominent in clonally expanded synovial B cells, including within the ABC population. This subset of B cells has been previously reported to be expanded in autoimmune disease63 with accumulation in inflamed tissue24,26, but the signals that promote ABCs in the synovium remain unclear. Our data strongly point to Tph cells as a key driver. This is in accord with a recent report in juvenile idiopathic arthritis demonstrating that clonally expanded IL21 and IFNG co-expressing Tph promote CD11c+ double negative B cell differentiation84.
Together, these findings across T and B cells highlight the altered cell state composition and clonal characteristics that may work together to maintain inflammation in RA. Our study utilized a cross-sectional cohort unified in high disease activity, but otherwise heterogeneous across treatment history, disease duration, and cell-type abundance phenotype (CTAP). Further studies with a larger cohort of patients will be necessary to connect clonal characteristics such as those identified here with patient stratifications, which may serve to increase our knowledge of the inherent cellular and molecular heterogeneity of the disease. It will also be of interest to define the specificity of the B and T cells that are clonally expanded, including their reactivity to citrullinated antigens. Citrullinated peptides may be the main antigenic drivers in the synovium as has been suggested by other studies6,85. As the understanding of pathogenic roles of B and T cell subsets in RA continues to evolve, this dataset will be a useful resource to generate or test insights related to the antigen receptor repertoires of synovial lymphocytes. The work further highlights specific lymphocyte populations, including Tph cells, ABC, and activated B cells, that show both transcriptomic signatures of antigen activation and clonal expansion, marking these cell populations as promising therapeutic targets that might be selectively targeted to blunt the pathologic adaptive immune response in RA.
Methods
Sample processing
For this study, patients were recruited and consented through the Accelerating Medicines Partnership (AMP) Network for RA and SLE26. Samples were collected from 15 clinical sites who are part of the AMP Network. The study was performed with informed consent in accordance with protocols approved by the Institutional Review Board at Stanford University. Written informed consent was obtained from all participants. Synovial tissue samples and matched peripheral blood mononuclear cells were cryopreserved after collection as described86. Stored synovial tissue samples were then thawed and disaggregated into single-cell suspensions by mincing and digesting with 100 µg/mL LiberaseTL (Roche) and 100 µg/mL DNaseI (Roche) in RPMI (Life Technologies) for 15 min, with occasional inversion during disaggregation. Disaggregated cells were passed through a 70 µm cell strainer and washed prior to antibody staining with anti-CD235a antibodies (clone 11E4B-7-6 (KC16), Beckman Coulter) and Fixable Viability Dye eFluor 780 (eBioscience/Thermo Fisher). Live non-erythrocyte cells (viability dye- CD235-) were collected by fluorescence-activated cell sorting (BD FACSAria Fusion) and were initially cryopreserved in Cryostor CS10 (Sigma-Aldrich). The disaggregated synovial tissue cells and matched cryopreserved peripheral blood mononuclear cells were then thawed in batches, and both T and B cells were collected by fluorescence-activated cell sorting (BD FACSAria II) (Supplementary Fig. 1B). A description of the antibodies used in this study can be found in Supplementary Data 4. A maximum of 16,000 of B and T cells combined were sorted from each sample. We sorted 8000 B cells and 8000T cells for the majority of blood samples. For synovial samples, the number of B cells and T cells sorted varied depending on cell abundance (B cell: 107-7472 cells, T cell: 2616-13636 cells) (Supplementary Fig. 1C, D). B cells and T cells from each sample were pooled prior to loading on a Chromium NextGEM Chip G (10X Genomics).
Single-cell library preparation and sequencing
Cells collected from cell sorting were encapsulated into oil droplets using a Chromium NextGEM Chip G (10X Genomics). Following reverse transcription and cDNA amplification, 5’ gene expression, immune repertoire, and feature barcode libraries were constructed following manufacturer protocols (v1.1). The libraries were finally pooled for sequencing on an Illumina Novaseq 6000 using an S4 flow cell. Gene expression libraries were sequenced to obtain a read depth of 100,000 reads per cell, feature barcode libraries were sequenced at 5000 reads per cell, and immune repertoire libraries were sequenced at 5000 reads per cell. FASTQ file demultiplexing for gene expression libraries was performed using the mkfastq function in CellRanger (10X Genomics, v4.0). Following this, alignment to a reference genome (GRCh38) and counting was completed using the count function to generate expression matrices for each sample. Immune repertoire FASTQ files were separately demultiplexed, and the vdj function was used to perform sequence assembly and clonotype calling for TCR and BCR sequences in each sample.
Initial quality control
Gene count matrices were imported in R for downstream analysis. Quality control was first performed jointly on all cells collected and sequenced in this experiment. Several metrics were explored to assess the quality of each cell. First, low-quality cells were distinguished from high-quality cells in each tissue compartment. Cells were kept for downstream analysis if they had at least 1000 mapped reads in either the blood or the synovium. After initial filtering, we generated several metrics to identify doublets using software packages scDblFinder87 and scds88, as well as marking cells that coexpressed at least 1 TCR and 1 BCR. At this point, a single synovial sample (AMP ID# 300_0415) was discarded due to only 12 cells passing these initial QC thresholds. Log-normalization was then applied to the gene expression counts for the remaining cells. Final QC thresholding was performed on the log-normalized counts, keeping cells with greater than 500 genes detected and less than 20 percent of detected reads coming from mitochondrial genes. Additional quality control was performed for T and B cells separately in downstream analysis.
Broad cell type identification
After initial QC, unsupervised clustering was performed on the remaining cells to identify major cell types present in the data. Principal components were first generated in order to reduce the dimensionality of the feature space before clustering. Using Seurat’s clustering functionalities on the first 30 principal components, a 20 nearest-neighbors network graph was computed. Then, we performed Louvain clustering with a resolution parameter of 0.3, and visualized the cells in 2D space using uniform manifold approximation and projection (UMAP). Differentially expressed genes between clusters were identified (Student’s T-test) by including only genes exhibiting a greater-than 0.25 log-fold difference between clusters. In order to annotate each cluster with a biologically meaningful name, genes with the highest log-fold changes were considered, as well as marker genes that are cell-type specific.
T cell subclustering
Cells broadly labeled as “T cells” and “Proliferating” from the combined object were subset, and Harmony89 was then used to perform batch correction at the level of the patient and tissue using theta = 2 and max.iter.cluster = 20. Using the top 50 Harmony embeddings, Louvain clustering was performed, which was then visualized in UMAP space. Nearest neighbors were identified using the Harmony embeddings, and clustering was iteratively performed with a resolution of 1.4 finally selected. Broad T cell markers (e.g., CD4, CD8, TRDC) were used along with differential gene expression (Wilcoxon rank-sum test) to identify the T cell lineages for each of these initial clusters. At this point, a small number of remaining contaminating cells (such as B cells from the Proliferating cluster), were removed. Based on gene expression, clusters of CD4, CD8, and innate T cells were separated and individually clustered using a similar strategy with slightly different parameters for each subset (CD4: 40 Harmony embeddings, 0.5 cluster resolution; CD8: 40 Harmony embeddings, 0.4 cluster resolution; innate T: 10 Harmony embeddings, 0.4 cluster resolution).
B cell subclustering
Cells labeled as “B cells” from the broad clustering, we further characterized B cell subpopulations. Before reclustering the B cells, we discard cells marked as doublets according to scDblFinder and B cells which simultaneously coexpressed at least 1 BCR and at least 1 TCR. On the remaining cells, Seurat’s default normalization and scaling was performed and principal components were generated. Harmony was used to perform batch correction at the patient level using theta = 2 and max.iter.cluster = 20. From here, the 20 nearest-neighbors network graph was generated using the first 30 harmonized principal components. We then applied Louvain clustering using a resolution parameter of 0.5 to identify clusters of similar cells to visualize in the UMAP space. Differential gene expression was then performed (Student’s t-test) to provide markers for cluster annotation. Before selecting the final set of input parameters, results were explored at multiple resolutions, variable number of included principal components, and using both harmonized and non-harmonized principal components.
Gene signature analysis
Gene signatures used in this study were obtained from the sources listed in Supplementary Data 3. The AddModuleScore function in Seurat was used to assign a value for each cell for each signature, corresponding to the average expression of the signature subtracted by the aggregate expression of randomly-selected control genes.
Dataset reference mapping
A reference object was built using Symphony90 with cells from ref. 26 that correspond to the populations being assayed (CD4 T, CD8 T, innate T, or B cells), integrating at the sample level and using the first 20 PCs. Data from the current study was then projected onto this reference using the mapQuery and knnPredict functions, to generate confidence scores for each reference cluster’s mapping, which we further visualized using pheatmap. The most-likely identities for each cluster were then cross-referenced with DEG lists from our dataset to aid in generating final cluster identities.
Single-cell TCR receptor profiling
For each sample, the filtered_contig_annotations.csv file output from cellranger was used to identify TCR sequences obtained for each cell barcode using scRepertoire91. Using the filterMulti argument, only the top 2 expressed chains were retained when a cell barcode was associated with more than two chains (e.g., ɑɑβ or ɑββ). This step also worked to remove TCR information when only 1 chain was available for a specific cell barcode (e.g., ɑ only or β only). These TCRs and their cell barcodes were then matched with corresponding cell barcodes obtained from the sample’s RNA library. Combined, 45,096 cells in the initial T cell subset had available TCR information, including 22,634 cells obtained from synovial tissue and 22,462 cells obtained from blood. Clones were further characterized into discrete groups for their extent of clonal expansion for downstream analysis.
Single-cell BCR receptor profiling
Analogous to the TCR profiling, output from CellRanger was used to identify BCR sequences for each cell barcode. For downstream analysis, we included BCR that were considered high-confidence, full-length, length of CDR3 amino acid at least 5, and length of CDR3 DNA at least 15. We also excluded BCR that were associated with more than two heavy chains or more than two light chains. BCR passing QC were then matched with cell barcodes from the sample’s single-cell RNA library. Combined, 38,482 cells in the B cell subset had associated BCR information, of which, 12,357 cells were from the synovial tissue and 26,125 were from blood. We assigned each BCR sequence to its closest sequence in the IMGT database using the Change-O tool92. The degree of somatic hypermutation in each B cell was quantified by determining the number of V and J substitutions in each B cell’s IgH CDR3 sequence when compared to its closest IMGT sequence. Clones were characterized in B cells according to the similarity of their CDR3 DNA regions. A similarity threshold of 96.5%, CD-HIT93 was used to define discrete clonal groups for downstream analysis. Clonal analysis was also repeated using a similarity threshold of 80%.
Clonal lineage tree analysis
In exploring the evolutionary relationships among BCR sequences within B cell clonal lineages, we utilized alignment and identity scores for both variable (V) and junctional (J) gene segments, along with somatic hypermutation (shm) levels within these gene segments. Additionally, we calculated the combined V and J somatic hypermutation rate (V-J shm rate), obtained by dividing the sum of V-shm and J-shm by the sequence length excluding the CDR3 region, all in comparison to the germline. A distance matrix was computed using the Euclidean method. Subsequently, we applied the ward.D method for hierarchical clustering to construct the dendrogram plot.
Mixed-effect modeling
A number of mixed-effect models were fit in our analysis, which all generally took a similar form. To adjust for patient-specific effects in our data, mixed-effect models were fit using sample_ID as a scalar random effect and fixed effects for other covariates of interest. Lme494 was used to obtain point estimates for all mixed-effect models, with 95% confidence intervals.
Gene set enrichment analysis
Gene set enrichment analysis was performed using fast gene set enrichment analysis (FGSEA)95. FGSEA calculates an enrichment score for each gene set, given a ranked vector of gene-level statistics. A null distribution of the enrichment score is estimated through random sampling of gene sets. P values are estimated as the number of random gene sets with more extreme enrichment scores than the gene set of interest, divided by the number of random gene sets generated. Multiple testing correction is then performed to get adjusted P values. We first performed differential expression for our comparisons of interest within each population (SYN vs PBL and clonal vs non-clonal) to obtain a ranked vector of gene-level statistics. We then sampled 1000 gene random sets to estimate the null distribution of each gene set’s enrichment score and calculate adjusted p values.
Cell–cell interactions
Cell–cell interaction inference was performed using CellChat70, which uses a manually curated database of ligand–receptor interactions gathered from KEGG signaling pathways and published literature. Interaction networks are constructed by identifying differentially expressed genes related to these interactions, computing the average expression of each ligand–receptor pair across cell cluster pairings, and finally calculating a communication probability value based on permutation testing. For finding communication probabilities, we used the tool’s Tukey triMean method, which performs a weighted average of the median and upper and lower quartiles, and further used 100 bootstraps (nboot = 100) to calculate p values.
HLA imputation
To obtain estimated two-field HLA alleles in each donor, we performed HLA imputation from SNP genotype data. We genotyped donors from this study by using the Illumina multi-ethnic genotyping array. We performed quality control of genotype by sample call rate >0.99, variant call rate >0.99, minor allele frequency >0.01, and PHWE > 1.0 × 10−6. We extracted the extended MHC region (28–34 Mb on chromosome 6) and performed haplotype phasing with SHAPEIT2 software96(Delaneau, Marchini, and Zagury 2011). We then performed HLA imputation by using a multi-ancestry HLA reference panel version 297 and minimac3 software98. From the imputed dosage of two-field HLA alleles of each HLA gene in each donor, we defined the most likely set of two two-field alleles (Supplementary Data 5).
Virus reactivity analysis
Publically-available datasets of TCRs were obtained from VDJdb50 and McPAS49, and were subsequently filtered to retain only TCRs associated with Epstein-Barr virus (EBV), cytomegalovirus (CMV), and influenza (FLU). As the large majority of data uploaded to these databases are from bulk TCR sequencing with only the β chain information available, we subsequently focused only on this chain in our single-cell TCR data. For exact matching, a “match” was considered when the CDR3 region, as well as the MHC allele, were the same. For GLIPH2 analysis, the same lists of EBV, CMV, and FLU reactive cells were input alongside patient CD8 TCRs as input. Results were filtered to retain GLIPH groups with previously identified TCRs presented on an MHC that matched an HLA allele from the patient. GLIPH group results were also filtered for stringency, with a Fisher score <0.01.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Supplementary information
Source data
Acknowledgements
We thank the participants who provided synovial tissue and blood samples for this study. We also acknowledge the Flow Cytometry Core and Genomics Research Center at the University of Rochester for performing the flow sorting, scRNA library preparation, sequencing, and primary data analysis for this study. This work was supported by the Accelerating Medicines Partnership Program: Rheumatoid Arthritis and Systemic Lupus Erythematosus (AMP RA/SLE) Network. The AMP Program is a public-private partnership that includes AbbVie, the Arthritis Foundation, Bristol-Myers Squibb Company, the Foundation for the National Institutes of Health, GlaxoSmithKline, Janssen Research and Development, the Lupus Foundation of America, the Lupus Research Alliance, Merck, the National Institute of Allergy and Infectious Diseases, the National Institute of Arthritis and Musculoskeletal and Skin Diseases, Pfizer, the Rheumatology Research Foundation, Sanofi, and Takeda Pharmaceuticals. Funding for AMP RA/SLE work was provided through grants from the National Institutes of Health (UH2-AR067676, UH2-AR067677, UH2-AR067679, UH2-AR067681, UH2-AR067685, UH2-AR067688, UH2-AR067689, UH2-AR067690, UH2-AR067691, UH2-AR067694, and UM2-AR067678). F.Z. is supported by the PhRMA Foundation Faculty Starter Grant for Translational Medicine. K.W. is supported by a Burroughs Wellcome Fund Career Awards for Medical Scientists, a Doris Duke Charitable Foundation Clinical Scientist Development Award, and a Rheumatology Research Foundation Innovative Research Award. D.A.R. is supported by a Burroughs Wellcome Fund Career Awards for Medical Scientists.
Author contributions
G.S.F., D.L.B., D.T., and C.P. recruited patients and obtained synovial tissues. L.W.M., S.M.G., V.M.H., A.F., V.P.B., and J.H.A. contributed to the procurement of samples and overall design of the AMP study. N.M., A.H.J., K.W., A.N., L.T.D., S.R., and M.B.B. contributed to the design and implementation of the tissue disaggregation, cell sorting, and single-cell sequencing pipeline. G.S.D., A.W., R.W., F.Z., A.H.J., S.S., J.T., and A.M. conducted the computational analysis. A.M., D.A.R., and J.H.A. supervised the study. G.S.D., A.W., N.M., J.C.E., D.A.R., and J.H.A. wrote and edited the initial manuscript draft. AMP RA/SLE Network members contributed to this study by managing patient recruitment, curating patient clinical data, obtaining and processing synovial tissue samples, managing biorepositories, providing website support, and/or providing input on data analysis and interpretation. All authors participated in revising the final manuscript.
Peer review
Peer review information
Nature Communications thanks Vivianne Malmström, Hiroyuki Yoshitomi and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Data availability
Single-cell RNA and TCR/BCR sequencing data generated in this study are available via the ARK Portal (10.7303/syn47217489.1) The data were available under controlled access due to data privacy laws. To access the data, users need to complete and submit a signed Data Use Certificate (DUC) to the ARK Portal at https://arkportal.synapse.org/Data%20Access. Additional data are provided in the Supplementary Information and as Source Data files. Source data are provided with this paper.
Code availability
The source code to reproduce analyses used in this study is available at https://github.com/dunlapg/amp2repertoire/tree/main.
Competing interests
A.H.J. receives research support from Amgen unrelated to the submitted work. K.Wei received a sponsored-research agreement from Gilead Sciences, 10X Genomics, and served as a consultant for Mestag Therapeutics and Santa Ana Bio. S.M.G. reports research support from Novartis and is a consultant for UCB unrelated to this work. G.S.F reports receiving grant support from Eli Lilly. V.M.H. is a co-founder of Q32 Bio and previously received research support from Janssen and was a consultant for Celgene and BMS, outside the submitted work. A.F. reports personal fees from Abbvie, Roche, and Janssen and grant support from Roche, UCB, Nascient, Mestag, GlaxoSmithKline and Janssen unrelated to this work. M.B.B. is a founder of Mestag Therapeutics and a consultant for GlaxoSmithKline, 4FO Ventures, and Scailyte AG. S.R is a founder of Mestag Therapeutics, a scientific adviser for Janssen and Pfizer, and a consultant for Gilead and Rheos Medicines, D.A.R. reports sponsored research from Janssen, Merck, and Bristol-Myers Squibb unrelated to the current work, and reports personal fees from Pfizer, Janssen, Merck, Scipher Medicine, GlaxoSmithKline, and Bristol-Myers Squibb and is co-inventor on a patent using T peripheral helper cells as a biomarker of autoimmune diseases. A.M.M. is a promotional speaker for Abbie and Pfizer, have developed educational content and speak for the National Psoriasis Foundation, and served as a consultant for CVS Caremark which none of this is relevant to this study. The remaining authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Garrett Dunlap, Aaron Wagner, Nida Meednu.
These authors jointly supervised this work: Andrew McDavid, Deepak A. Rao, Jennifer H. Anolik.
A list of authors and their affiliations appears at the end of the paper.
Contributor Information
Deepak A. Rao, Email: darao@bwh.harvard.edu
Jennifer H. Anolik, Email: jennifer_anolik@urmc.rochester.edu
Accelerating Medicines Partnership Program: Rheumatoid Arthritis and Systemic Lupus Erythematosus (AMP RA/SLE) Network:
Jennifer Albrecht, William Apruzzese, Jennifer L. Barnas, Joan M. Bathon, Ami Ben-Artzi, Brendan F. Boyce, S. Louis Bridges, Jr, Debbie Campbell, Hayley L. Carr, Arnold Ceponis, Adam Chicoine, Andrew Cordle, Michelle Curtis, Kevin D. Deane, Edward DiCarlo, Patrick Dunn, Lindsy Forbess, Laura Geraldino-Pardilla, Ellen M. Gravallese, Peter K. Gregersen, Joel M. Guthridge, Diane Horowitz, Laura B. Hughes, Kazuyoshi Ishigaki, Lionel B. Ivashkiv, Judith A. James, Joyce B. Kang, Gregory Keras, Ilya Korsunsky, Amit Lakhanpal, James A. Lederer, Yuhong Li, Zhihan J. Li, Katherine P. Liao, Holden Maecker, Arthur M. Mandelin, II, Ian Mantel, Mark Maybury, Mandy J. McGeachy, Joseph Mears, Alessandra Nerviani, Dana E. Orange, Harris Perlman, Javier Rangel-Moreno, Karim Raza, Yakir Reshef, Christopher Ritchlin, Felice Rivellese, William H. Robinson, Laurie Rumker, Ilfita Sahbudin, Karen Salomon-Escoto, Dagmar Scheel-Toellner, Jennifer A. Seifert, Anvita Singaraju, Melanie H. Smith, Paul J. Utz, Kathryn Weinand, Dana Weisenfeld, Michael H. Weisman, Qian Xiao, and Zhu Zhu
Supplementary information
The online version contains supplementary material available at 10.1038/s41467-024-49186-0.
References
- 1.Weyand CM, Goronzy JJ. The immunology of rheumatoid arthritis. Nat. Immunol. 2021;22:10–18. doi: 10.1038/s41590-020-00816-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Roosnek E, Lanzavecchia A. Efficient and selective presentation of antigen-antibody complexes by rheumatoid factor B cells. J. Exp. Med. 1991;173:487–489. doi: 10.1084/jem.173.2.487. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Schroder AE, et al. Differentiation of B cells in the nonlymphoid tissue of the synovial membrane of patients with rheumatoid arthritis. Proc. Natl Acad. Sci. USA. 1996;93:221–225. doi: 10.1073/pnas.93.1.221. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Goldbach-Mansky R, et al. Rheumatoid arthritis associated autoantibodies in patients with synovitis of recent onset. Arthritis Res. 2000;2:236–243. doi: 10.1186/ar93. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Marston B, Palanichamy A, Anolik JH. B cells in the pathogenesis and treatment of rheumatoid arthritis. Curr. Opin. Rheumatol. 2010;22:307–315. doi: 10.1097/BOR.0b013e3283369cb8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Corsiero E, et al. Single cell cloning and recombinant monoclonal antibodies generation from RA synovial B cells reveal frequent targeting of citrullinated histones of NETs. Ann. Rheum. Dis. 2016;75:1866–1875. doi: 10.1136/annrheumdis-2015-208356. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Meednu N, et al. Dynamic spectrum of ectopic lymphoid B cell activation and hypermutation in the RA synovium characterized by NR4A nuclear receptor expression. Cell Rep. 2022;39:110766. doi: 10.1016/j.celrep.2022.110766. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Lu DR, et al. T cell-dependent affinity maturation and innate immune pathways differentially drive autoreactive B cell responses in rheumatoid arthritis. Arthritis Rheumatol. 2018;70:1732–1744. doi: 10.1002/art.40578. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.O’Neill SK, et al. Antigen-specific B cells are required as APCs and autoantibody-producing cells for induction of severe autoimmune arthritis. J. Immunol. 2005;174:3781–3788. doi: 10.4049/jimmunol.174.6.3781. [DOI] [PubMed] [Google Scholar]
- 10.Wu F, et al. B cells in rheumatoid arthritis: pathogenic mechanisms and treatment prospects. Front. Immunol. 2021;12:750753. doi: 10.3389/fimmu.2021.750753. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Sun W, et al. B cells inhibit bone formation in rheumatoid arthritis by suppressing osteoblast differentiation. Nat. Commun. 2018;9:5127. doi: 10.1038/s41467-018-07626-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Meednu N, et al. Production of RANKL by memory B cells: a link between B cells and bone erosion in rheumatoid arthritis. Arthritis Rheumatol. 2016;68:805–816. doi: 10.1002/art.39489. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Panayi GS, Lanchbury JS, Kingsley GH. The importance of the T cell in initiating and maintaining the chronic synovitis of rheumatoid arthritis. Arthritis Rheum. 1992;35:729–735. doi: 10.1002/art.1780350702. [DOI] [PubMed] [Google Scholar]
- 14.Gregersen PK, Silver J, Winchester RJ. The shared epitope hypothesis. an approach to understanding the molecular genetics of susceptibility to rheumatoid arthritis. Arthritis Rheum. 1987;30:1205–1213. doi: 10.1002/art.1780301102. [DOI] [PubMed] [Google Scholar]
- 15.Raychaudhuri S, et al. Five amino acids in three HLA proteins explain most of the association between MHC and seropositive rheumatoid arthritis. Nat. Genet. 2012;44:291–296. doi: 10.1038/ng.1076. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Rao DA, et al. Pathologically expanded peripheral T helper cell subset drives B cells in rheumatoid arthritis. Nature. 2017;542:110–114. doi: 10.1038/nature20810. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Yoshitomi H, Ueno H. Shared and distinct roles of T peripheral helper and T follicular helper cells in human diseases. Cell Mol. Immunol. 2021;18:523–527. doi: 10.1038/s41423-020-00529-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Manzo A, et al. Mature antigen-experienced T helper cells synthesize and secrete the B cell chemoattractant CXCL13 in the inflammatory environment of the rheumatoid joint. Arthritis Rheum. 2008;58:3377–3387. doi: 10.1002/art.23966. [DOI] [PubMed] [Google Scholar]
- 19.Kobayashi S, et al. A distinct human CD4+ T cell subset that secretes CXCL13 in rheumatoid synovium. Arthritis Rheum. 2013;65:3063–3072. doi: 10.1002/art.38173. [DOI] [PubMed] [Google Scholar]
- 20.Rao DA. T cells that help B cells in chronically inflamed tissues. Front. Immunol. 1924;9:2018. doi: 10.3389/fimmu.2018.01924. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Fortea-Gordo P, et al. Two populations of circulating PD-1hiCD4 T cells with distinct B cell helping capacity are elevated in early rheumatoid arthritis. Rheumatology. 2019;58:1662–1673. doi: 10.1093/rheumatology/kez169. [DOI] [PubMed] [Google Scholar]
- 22.Jonsson AH, et al. Granzyme K(+) CD8 T cells form a core population in inflamed human tissue. Sci. Transl. Med. 2022;14:eabo0686. doi: 10.1126/scitranslmed.abo0686. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Stephenson W, et al. Single-cell RNA-seq of rheumatoid arthritis synovial tissue using low-cost microfluidic instrumentation. Nat. Commun. 2018;9:791. doi: 10.1038/s41467-017-02659-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Zhang F, et al. Defining inflammatory cell states in rheumatoid arthritis joint synovial tissues by integrating single-cell transcriptomics and mass cytometry. Nat. Immunol. 2019;20:928–942. doi: 10.1038/s41590-019-0378-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Alivernini S, et al. Distinct synovial tissue macrophage subsets regulate inflammation and remission in rheumatoid arthritis. Nat. Med. 2020;26:1295–1306. doi: 10.1038/s41591-020-0939-8. [DOI] [PubMed] [Google Scholar]
- 26.Zhang F, et al. Deconstruction of rheumatoid arthritis synovium defines inflammatory subtypes. Nature. 2023;623:616–624. doi: 10.1038/s41586-023-06708-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Musters A, et al. In rheumatoid arthritis, synovitis at different inflammatory sites is dominated by shared but patient-specific T cell clones. J. Immunol. 2018;201:417–422. doi: 10.4049/jimmunol.1800421. [DOI] [PubMed] [Google Scholar]
- 28.Klarenbeek PL, et al. Inflamed target tissue provides a specific niche for highly expanded T-cell clones in early human autoimmune disease. Ann. Rheum. Dis. 2012;71:1088–1093. doi: 10.1136/annrheumdis-2011-200612. [DOI] [PubMed] [Google Scholar]
- 29.Argyriou A, et al. Single cell sequencing identifies clonally expanded synovial CD4(+) T(PH) cells expressing GPR56 in rheumatoid arthritis. Nat. Commun. 2022;13:4046. doi: 10.1038/s41467-022-31519-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Ishigaki K, et al. Quantitative and qualitative characterization of expanded CD4+ T cell clones in rheumatoid arthritis patients. Sci. Rep. 2015;5:12937. doi: 10.1038/srep12937. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Chang CM, et al. Characterization of T-cell receptor repertoire in patients with rheumatoid arthritis receiving biologic therapies. Dis. Markers. 2019;2019:2364943. doi: 10.1155/2019/2364943. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Liu X, et al. T cell receptor beta repertoires as novel diagnostic markers for systemic lupus erythematosus and rheumatoid arthritis. Ann. Rheum. Dis. 2019;78:1070–1078. doi: 10.1136/annrheumdis-2019-215442. [DOI] [PubMed] [Google Scholar]
- 33.Turcinov S, et al. Diversity and clonality of T cell receptor repertoire and antigen specificities in small joints of early rheumatoid arthritis. Arthritis Rheumatol. 2023;75:673–684. doi: 10.1002/art.42407. [DOI] [PubMed] [Google Scholar]
- 34.Scheel T, et al. V-region gene analysis of locally defined synovial B and plasma cells reveals selected B cell expansion and accumulation of plasma cell clones in rheumatoid arthritis. Arthritis Rheum. 2011;63:63–72. doi: 10.1002/art.27767. [DOI] [PubMed] [Google Scholar]
- 35.Titcombe PJ, et al. Pathogenic citrulline-multispecific B cell receptor clades in rheumatoid arthritis. Arthritis Rheumatol. 2018;70:1933–1945. doi: 10.1002/art.40590. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Hardt U, et al. Integrated single cell and spatial transcriptomics reveal autoreactive differentiated B cells in joints of early rheumatoid arthritis. Sci. Rep. 2022;12:11876. doi: 10.1038/s41598-022-15293-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Luoma AM, et al. Molecular pathways of colon inflammation induced by cancer immunotherapy. Cell. 2020;182:655–671 e22. doi: 10.1016/j.cell.2020.06.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Yost KE, et al. Clonal replacement of tumor-specific T cells following PD-1 blockade. Nat. Med. 2019;25:1251–1259. doi: 10.1038/s41591-019-0522-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Collora JA, et al. Single-cell multiomics reveals persistence of HIV-1 in expanded cytotoxic T cell clones. Immunity. 2022;55:1013–1031 e7. doi: 10.1016/j.immuni.2022.03.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Mathew NR, et al. Single-cell BCR and transcriptome analysis after influenza infection reveals spatiotemporal dynamics of antigen-specific B cells. Cell Rep. 2021;35:109286. doi: 10.1016/j.celrep.2021.109286. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Canhao H, et al. Common evaluations of disease activity in rheumatoid arthritis reach discordant classifications across different populations. Front. Med. 2018;5:40. doi: 10.3389/fmed.2018.00040. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Humby F, et al. Synovial cellular and molecular signatures stratify clinical response to csDMARD therapy and predict radiographic progression in early rheumatoid arthritis patients. Ann. Rheum. Dis. 2019;78:761–772. doi: 10.1136/annrheumdis-2018-214539. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Barnes MJ, et al. Loss of T cell and B cell quiescence precedes the onset of microbial flora-dependent wasting disease and intestinal inflammation in Gimap5-deficient mice. J. Immunol. 2010;184:3743–3754. doi: 10.4049/jimmunol.0903164. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Shevyrev D, Tereshchenko V. Treg heterogeneity, function, and homeostasis. Front. Immunol. 2019;10:3100. doi: 10.3389/fimmu.2019.03100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Abbas AR, et al. Deconvolution of blood microarray data identifies cellular activation patterns in systemic lupus erythematosus. PLoS ONE. 2009;4:e6098. doi: 10.1371/journal.pone.0006098. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Zhao J, et al. Single-cell RNA sequencing reveals the heterogeneity of liver-resident immune cells in human. Cell Discov. 2020;6:22. doi: 10.1038/s41421-020-0157-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Chang MH, et al. Arthritis flares mediated by tissue-resident memory T cells in the joint. Cell Rep. 2021;37:109902. doi: 10.1016/j.celrep.2021.109902. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Fujinami RS, et al. Molecular mimicry, bystander activation, or viral persistence: infections and autoimmune disease. Clin. Microbiol. Rev. 2006;19:80–94. doi: 10.1128/CMR.19.1.80-94.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Tickotsky N, et al. McPAS-TCR: a manually curated catalogue of pathology-associated T cell receptor sequences. Bioinformatics. 2017;33:2924–2929. doi: 10.1093/bioinformatics/btx286. [DOI] [PubMed] [Google Scholar]
- 50.Shugay M, et al. VDJdb: a curated database of T-cell receptor sequences with known antigen specificity. Nucleic Acids Res. 2018;46:D419–D427. doi: 10.1093/nar/gkx760. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Huang H, et al. Analyzing the Mycobacterium tuberculosis immune response by T-cell receptor clustering with GLIPH2 and genome-wide antigen screening. Nat. Biotechnol. 2020;38:1194–1202. doi: 10.1038/s41587-020-0505-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Khan N, et al. Cytomegalovirus seropositivity drives the CD8 T cell repertoire toward greater clonality in healthy elderly individuals. J. Immunol. 2002;169:1984–1992. doi: 10.4049/jimmunol.169.4.1984. [DOI] [PubMed] [Google Scholar]
- 53.Looney RJ, et al. Role of cytomegalovirus in the T cell changes seen in elderly individuals. Clin. Immunol. 1999;90:213–219. doi: 10.1006/clim.1998.4638. [DOI] [PubMed] [Google Scholar]
- 54.Liu MF, et al. Distribution of double-negative (CD4- CD8-, DN) T subsets in blood and synovial fluid from patients with rheumatoid arthritis. Clin. Rheumatol. 1999;18:227–231. doi: 10.1007/s100670050089. [DOI] [PubMed] [Google Scholar]
- 55.Andreu JL, et al. Selective expansion of T cells bearing the gamma/delta receptor and expressing an unusual repertoire in the synovial membrane of patients with rheumatoid arthritis. Arthritis Rheum. 1991;34:808–814. doi: 10.1002/art.1780340705. [DOI] [PubMed] [Google Scholar]
- 56.Yamin R, et al. High percentages and activity of synovial fluid NK cells present in patients with advanced stage active rheumatoid arthritis. Sci. Rep. 2019;9:1351. doi: 10.1038/s41598-018-37448-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Koppejan H, et al. Altered composition and phenotype of mucosal-associated invariant T cells in early untreated rheumatoid arthritis. Arthritis Res. Ther. 2019;21:3. doi: 10.1186/s13075-018-1799-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Lawand M, Dechanet-Merville J, Dieu-Nosjean MC. Key features of gamma-delta T-cell subsets in human diseases and their immunotherapeutic implications. Front. Immunol. 2017;8:761. doi: 10.3389/fimmu.2017.00761. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Mori L, Lepore M, De Libero G. The immunology of CD1- and MR1-restricted T cells. Annu. Rev. Immunol. 2016;34:479–510. doi: 10.1146/annurev-immunol-032414-112008. [DOI] [PubMed] [Google Scholar]
- 60.Hinks TSC, Zhang XW. MAIT cell activation and functions. Front. Immunol. 2020;11:1014. doi: 10.3389/fimmu.2020.01014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Leng T, et al. TCR and inflammatory signals tune human MAIT cells to exert specific tissue repair and effector functions. Cell Rep. 2019;28:3077–3091 e5. doi: 10.1016/j.celrep.2019.08.050. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Lamichhane R, et al. Human liver-derived MAIT cells differ from blood MAIT cells in their metabolism and response to TCR-independent activation. Eur. J. Immunol. 2021;51:879–892. doi: 10.1002/eji.202048830. [DOI] [PubMed] [Google Scholar]
- 63.Jenks SA, et al. Distinct effector B cells induced by unregulated Toll-like receptor 7 contribute to pathogenic responses in systemic lupus erythematosus. Immunity. 2018;49:725–739 e6. doi: 10.1016/j.immuni.2018.08.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Wang S, et al. IL-21 drives expansion and plasma cell differentiation of autoreactive CD11c(hi)T-bet(+) B cells in SLE. Nat. Commun. 2018;9:1758. doi: 10.1038/s41467-018-03750-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Johnson JL, et al. The transcription factor T-bet resolves memory B cell subsets with distinct tissue distributions and antibody specificities in mice and humans. Immunity. 2020;52:842–855 e6. doi: 10.1016/j.immuni.2020.03.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Song W, et al. Development of Tbet- and CD11c-expressing B cells in a viral infection requires T follicular helper cells outside of germinal centers. Immunity. 2022;55:290–307 e5. doi: 10.1016/j.immuni.2022.01.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.DeWitt WS, et al. A public database of memory and naive B-cell receptor sequences. PLoS ONE. 2016;11:e0160853. doi: 10.1371/journal.pone.0160853. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Meffre E, et al. Immunoglobulin heavy chain expression shapes the B cell receptor repertoire in human B cell development. J. Clin. Invest. 2001;108:879–886. doi: 10.1172/JCI13051. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Soto C, et al. High frequency of shared clonotypes in human B cell receptor repertoires. Nature. 2019;566:398–402. doi: 10.1038/s41586-019-0934-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Jin S, et al. Inference and analysis of cell-cell communication using CellChat. Nat. Commun. 2021;12:1088. doi: 10.1038/s41467-021-21246-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Aldridge J, et al. T helper cells in synovial fluid of patients with rheumatoid arthritis primarily have a Th1 and a CXCR3(+)Th2 phenotype. Arthritis Res. Ther. 2020;22:245. doi: 10.1186/s13075-020-02349-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Wucherpfennig KW, Strominger JL. Molecular mimicry in T cell-mediated autoimmunity: viral peptides activate human T cell clones specific for myelin basic protein. Cell. 1995;80:695–705. doi: 10.1016/0092-8674(95)90348-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Pender MP, et al. Defective T-cell control of Epstein-Barr virus infection in multiple sclerosis. Clin. Transl. Immunol. 2017;6:e126. doi: 10.1038/cti.2016.87. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Ashton MP, et al. Incomplete immune response to coxsackie B viruses associates with early autoimmunity against insulin. Sci. Rep. 2016;6:32899. doi: 10.1038/srep32899. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Lanz TV, et al. Clonally expanded B cells in multiple sclerosis bind EBV EBNA1 and GlialCAM. Nature. 2022;603:321–327. doi: 10.1038/s41586-022-04432-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Bank I. The role of gamma delta T cells in autoimmune rheumatic diseases. Cells. 2020;9:462. doi: 10.3390/cells9020462. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Fathollahi A, et al. The role of NK cells in rheumatoid arthritis. Inflamm. Res. 2021;70:1063–1073. doi: 10.1007/s00011-021-01504-8. [DOI] [PubMed] [Google Scholar]
- 78.Bos JD, et al. T-cell receptor gamma delta bearing cells in normal human skin. J. Invest. Dermatol. 1990;94:37–42. doi: 10.1111/1523-1747.ep12873333. [DOI] [PubMed] [Google Scholar]
- 79.Deusch K, et al. A major fraction of human intraepithelial lymphocytes simultaneously expresses the gamma/delta T cell receptor, the CD8 accessory molecule and preferentially uses the V delta 1 gene segment. Eur. J. Immunol. 1991;21:1053–1059. doi: 10.1002/eji.1830210429. [DOI] [PubMed] [Google Scholar]
- 80.Wing E, et al. Double-negative-2 B cells are the major synovial plasma cell precursor in rheumatoid arthritis. Front. Immunol. 2023;14:1241474. doi: 10.3389/fimmu.2023.1241474. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Pascual V, et al. Analysis of somatic mutation in five B cell subsets of human tonsil. J. Exp. Med. 1994;180:329–339. doi: 10.1084/jem.180.1.329. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Machado HE, et al. Diverse mutational landscapes in human lymphocytes. Nature. 2022;608:724–732. doi: 10.1038/s41586-022-05072-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Uzzan M, et al. Ulcerative colitis is characterized by a plasmablast-skewed humoral response associated with disease activity. Nat. Med. 2022;28:766–779. doi: 10.1038/s41591-022-01680-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Fischer J, et al. Effect of clonally expanded PD-1(high) CXCR5-CD4+ peripheral T helper cells on B cell differentiation in the joints of patients with antinuclear antibody-positive juvenile idiopathic arthritis. Arthritis Rheumatol. 2022;74:150–162. doi: 10.1002/art.41913. [DOI] [PubMed] [Google Scholar]
- 85.Kristyanto H, et al. Persistently activated, proliferative memory autoreactive B cells promote inflammation in rheumatoid arthritis. Sci. Transl. Med. 2020;12:eaaz5327. doi: 10.1126/scitranslmed.aaz5327. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Donlin LT, et al. Methods for high-dimensonal analysis of cells dissociated from cyropreserved synovial tissue. Arthritis Res. Ther. 2018;20:139. doi: 10.1186/s13075-018-1631-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Germain PL, et al. Doublet identification in single-cell sequencing data using scDblFinder. F1000Res. 2021;10:979. doi: 10.12688/f1000research.73600.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Bais AS, Kostka D. scds: computational annotation of doublets in single-cell RNA sequencing data. Bioinformatics. 2020;36:1150–1158. doi: 10.1093/bioinformatics/btz698. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Korsunsky I, et al. Fast, sensitive and accurate integration of single-cell data with Harmony. Nat. Methods. 2019;16:1289–1296. doi: 10.1038/s41592-019-0619-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Kang JB, et al. Efficient and precise single-cell reference atlas mapping with Symphony. Nat. Commun. 2021;12:5890. doi: 10.1038/s41467-021-25957-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Borcherding N, Bormann NL, Kraus G. scRepertoire: an R-based toolkit for single-cell immune receptor analysis. F1000Res. 2020;9:47. doi: 10.12688/f1000research.22139.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Gupta NT, et al. Change-O: a toolkit for analyzing large-scale B cell immunoglobulin repertoire sequencing data. Bioinformatics. 2015;31:3356–3358. doi: 10.1093/bioinformatics/btv359. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Fu L, et al. CD-HIT: accelerated for clustering the next-generation sequencing data. Bioinformatics. 2012;28:3150–3152. doi: 10.1093/bioinformatics/bts565. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Bates D, et al. Fitting linear mixed-effects models using lme4. J. Stat. Softw. 2015;67:1–48. doi: 10.18637/jss.v067.i01. [DOI] [Google Scholar]
- 95.Gennady, K. et al. Fast gene set enrichment analysis. Preprint at bioRxiv10.1101/060012 (2021).
- 96.Delaneau O, Marchini J, Zagury JF. A linear complexity phasing method for thousands of genomes. Nat. Methods. 2011;9:179–181. doi: 10.1038/nmeth.1785. [DOI] [PubMed] [Google Scholar]
- 97.Luo Y, et al. A high-resolution HLA reference panel capturing global population diversity enables multi-ancestry fine-mapping in HIV host response. Nat. Genet. 2021;53:1504–1516. doi: 10.1038/s41588-021-00935-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98.Das S, et al. Next-generation genotype imputation service and methods. Nat. Genet. 2016;48:1284–1287. doi: 10.1038/ng.3656. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Single-cell RNA and TCR/BCR sequencing data generated in this study are available via the ARK Portal (10.7303/syn47217489.1) The data were available under controlled access due to data privacy laws. To access the data, users need to complete and submit a signed Data Use Certificate (DUC) to the ARK Portal at https://arkportal.synapse.org/Data%20Access. Additional data are provided in the Supplementary Information and as Source Data files. Source data are provided with this paper.
The source code to reproduce analyses used in this study is available at https://github.com/dunlapg/amp2repertoire/tree/main.