Abstract
The skin lesion erythema migrans (EM) is an initial sign of the Ixodes tick–transmitted Borreliella spirochetal infection known as Lyme disease. T cells and innate immune cells have previously been shown to predominate the EM lesion and promote the reaction. Despite the established importance of B cells and antibodies in preventing infection, the role of B cells in the skin immune response to Borreliella is unknown. Here, we used single-cell RNA-Seq in conjunction with B cell receptor (BCR) sequencing to immunophenotype EM lesions and their associated B cells and BCR repertoires. We found that B cells were more abundant in EM in comparison with autologous uninvolved skin; many were clonally expanded and had circulating relatives. EM-associated B cells upregulated the expression of MHC class II genes and exhibited preferential IgM isotype usage. A subset also exhibited low levels of somatic hypermutation despite a gene expression profile consistent with memory B cells. Our study demonstrates that single-cell gene expression with paired BCR sequencing can be used to interrogate the sparse B cell populations in human skin and reveals that B cells in the skin infection site in early Lyme disease expressed a phenotype consistent with local antigen presentation and antibody production.
Keywords: Immunology, Infectious disease
Keywords: Adaptive immunity, Bacterial infections
Introduction
Lyme disease (LD), an Ixodes tick–transmitted infection with spirochetes of the Borrelia burgdorferi sensu lato complex (genus Borreliella; ref. 1), is now the most common vector-borne disease in the Northern Hemisphere (2, 3). The earliest sign of infection reported in approximately 70% of patients is the appearance of an erythema migrans (EM) skin lesion at the initial tick bite site (3). EM presents as an expanding, erythematous macule that usually appears 1–2 weeks after infection, and arises from the immune response to B. burgdorferi spirochetes as they disseminate outwardly from the tick bite site (3). If not controlled by the local immune response in the skin, B. burgdorferi disseminates via the lymphatics and blood to cause disease in other areas of the skin and internal organs, including the heart, musculoskeletal, and nervous systems (3).
Borreliella spirochetes are widely held to be extracellular pathogens against which phagocytes, antibody, and complement are critical for host defense (4). B. burgdorferi–specific antibodies prevent infection in both animal models and humans and contribute to a reduction in pathogen burden and resolution of disease manifestations in mouse models (4). In humans, a robust circulating plasmablast response was found to correlate with more rapid resolution of LD symptoms after antibiotic treatment for EM (5). Within the EM lesion itself, however, B cells and plasma cells are not a dominant cell type. Histologic studies of EM reveal superficial and deep perivascular mononuclear infiltrates of lymphocytes and macrophages, occasional plasma cells, and, less commonly, neutrophils and mast cells (6, 7). By IHC, CD3+ T cells predominate the immune infiltrate in comparison with CD68+ macrophages or CD20+ B cells (6). The prevalence of B cells in EM lesions, however, may relate to the duration of the lesion prior to sampling. In EM lesions arising from infection with Borreliella strains in Europe, the proportion of infiltrate due to B cells was found to increase with the density of the infiltrate and size of the EM lesion, which typically reflects lesion duration (8). A study using a blister technique to extract cells from EM lesions found an increased frequency of T cells in comparison with their prevalence among PBMCs (9). In contrast, B cells were found at a frequency that was considerably lower than their prevalence in PBMCs, although this could be influenced by the method for cell extraction. By flow cytometry, both CD4+ and CD8+ T cells appeared antigen experienced, as assessed by CD45RO and CD27 expression, with enrichment of CD4+ T memory and effector subsets and CD8+ effector/cytolytic subsets. Blister fluid also contained elevated levels of proinflammatory cytokines IL-6, TNF-α, and IFN-γ as well as the anti-inflammatory cytokine IL-10, consistent with an earlier report of elevated mRNA expression of TNF-α, IFN-γ, and IL-10 detected by in situ hybridization using cytokine-specific RNA probes (6). More recently, evaluation of the whole skin transcriptome of EM lesions identified a strong IFN signature, consistent with cell-mediated immunity coordinated by T cells and innate immune populations as the main feature of the local immune response (10). The specific contribution of B cells to the cutaneous response, however, remains an open question. Understanding B cell responses in the EM lesion is key to identifying host factors that may be important determinants of localized versus systemic infection with this extracellular pathogen.
Here, we leveraged new advances in single-cell RNA-Seq (scRNA-Seq) analysis to characterize the B cell response in the EM lesion. Using the 10× Genomics Chromium scRNA-Seq platform, we defined the transcriptomes of single cells from whole-skin digests of EM and autologous uninvolved skin biopsies from patients with LD. Single-cell transcriptomics was combined with B cell receptor (BCR) and T cell receptor (TCR) repertoire sequencing to determine the origin and trafficking of these immune populations between the circulation and the skin. We found that clonally expanded B cells bearing plasma cell as well as memory B cell signatures were recruited to the site of the EM lesion, accompanied by activated T cells and myeloid subsets. Compared with B cells from uninvolved skin, EM B cells upregulated MHC class II genes and displayed a biased VH-gene usage compared with circulating B cells, suggesting an antigen-specific response. A disproportionate fraction of memory B cells utilized IgM receptors and displayed a low frequency of somatic hypermutation (SHM). This IgM-expressing B cell subset did not appear to be naive, but rather it was characterized by a TLR activation signature. Notably, clonal relatives of these B cells were found in the circulation. Taken together, our findings support a role for B cells, including IgM-expressing memory B cells, in local antigen presentation and antibody production in the skin response to B. burgdorferi infection.
Results
Subject characteristics and sample collection.
To identify and immunophenotype cell subsets in EM lesions at a single-cell level, we obtained biopsies of EM lesions from 10 subjects with early LD separated into 2 cohorts based on year of enrollment (Table 1). All subjects satisfied the CDC LD case definition; 9 of the 10 tested positive for B. burgdorferi antibodies using the C6 peptide enzyme immunoassay (Table 1; refs. 11–13). One subject in cohort 1 had disseminated infection manifesting as multiple EM without clinically apparent disease in extracutaneous sites, whereas the remainder had a single EM. Biopsies of uninvolved skin were obtained from 5 of 6 subjects in cohort 1 and from 2 of 4 subjects in cohort 2 (Table 1).
Table 1. Subject characterization.
Skin cell cluster identification and preliminary characterization using single-cell transcriptomics.
Single-cell suspensions of fresh EM and uninvolved skin samples from subjects in cohort 1 were analyzed by 10× Genomics scRNA-Seq with paired transcriptomes and repertoires without selecting for specific cell populations (Figure 1 and Supplemental Table 1; supplemental material available online with this article; https://doi.org/10.1172/jci.insight.148035DS1). Single-cell transcriptomes (Supplemental Figure 1, A–C) were pooled across subjects to identify cell clusters, to which identities were assigned (Supplemental Figure 2A) based on the expression of key marker genes for stromal and immune cells (Supplemental Figure 3 and Supplemental Table 2). As expected, there was increased representation of immune cell subsets, especially lymphocytes, in the EM skin versus adjacent uninvolved skin (Figure 2, A and B). On average, T cells constituted 35.6% of EM cells versus 10.5% of uninvolved skin cells (paired ratio t test, P = 0.012). B cells, which sparsely populate normal skin (14), accounted for 2.66% of EM lesion cells in comparison with 0.31% of cells from uninvolved skin samples (Figure 2B). Myeloid cells (paired ratio t test, P = 0.019), dendritic cells (paired ratio t test, P = 0.020), and NK cells (paired ratio t test, P = 0.006) were also found to be enriched in the EM lesion in comparison with uninvolved skin. The infiltration of immune cells consequently decreased proportions of nonimmune cell subsets. Fibroblasts constituted 16.3% of cells in the EM lesion versus 36.3% of uninvolved skin and keratinocytes constituted 11.2% of EM cells versus 24.6% of uninvolved skin cells (Figure 2, A and B).
Because the estimation of cell frequencies is dependent on the accuracy of cell cluster identification, we used a second method, the Seq-Well nanowell-based platform, to perform scRNA-Seq of EM and uninvolved skin samples from subjects in cohort 2 (Supplemental Table 3; ref. 15). After cluster identification, similar analyses were performed quantifying the frequency of different subsets and gene expression profiles (Supplemental Figure 4, A and B). The frequency of B cells was increased (Supplemental Figure 4, C and D) in 1 of 2 subjects that had paired samples (4.98% vs. 0.62% in EM vs. uninvolved skin, respectively), whereas no significant elevation was observed in the second. On average, EM biopsies from all 4 subjects in cohort 2 contained 2.0% B cells, similar to the 2.7% observed in cohort 1.
To more precisely define the B cell and T cell clusters, we examined their association with BCR and TCR expression by BCR and TCR repertoire sequencing using cohort 1 samples (Figure 2, B and C). These analyses could only be performed on cohort 1 samples because Seq-Well does not yet permit BCR and TCR sequencing. Both B cells (paired ratio t test, P = 0.021, 1.47% in EM, 0.16% in uninvolved) and T cells (paired ratio t test, P = 0.023, 32.5% in EM, 8.5% in uninvolved) identified using this new definition were statistically more abundant in the EM lesion in comparison with uninvolved skin (Figure 2C). Differences between the frequency of B cells and T cells in uninvolved compared with EM skin also were significant when calculated as a ratio relative to the frequency of keratinocytes, irrespective of whether the definition of B cells and T cells was based on marker genes (Figure 2D) or repertoire sequencing (Supplemental Figure 5). Taken together, our results show that B cells were in greater abundance in EM compared with uninvolved skin, along with T cells and innate immune cell subsets. T cells were the dominant lymphocyte subset, consistent with observations from earlier studies (8, 9).
B cells in EM lesions upregulated MHC class II expression.
To begin to phenotype B cells within the EM lesions, we investigated the genes most differentially expressed between B cells from 6 EM samples and 2 uninvolved skin samples by absolute difference using pseudo-bulk gene expression profiles. We were unable to perform paired testing for significance because no B cells with associated IGH were identified in the other 3 uninvolved skin samples. In the samples in which B cells were detected, 9 MHC class II genes were expressed by B cells, 4 of which were in the set of the top 20 upregulated genes (Figure 3A). Overrepresentation analysis using the top 40 differentially expressed genes (DEGs) and the Reactome pathways database identified a significant signature for MHC class II antigen presentation (P = 1.2 × 10–5), along with a signature for IFN signaling (P = 2.1 × 10–5) and IFN-γ signaling (P = 7.4 × 10–6; Figure 3B and https://doi.org/10.5281/zenodo.4711465). Visualization of the expression of the MHC class II gene HLA-DRA confirmed its low expression in the B cell cluster in uninvolved skin (Figure 3C) compared with its prominent upregulation in EM (Figure 3D).
T cells displayed a signature of trafficking and receiving costimulation.
We also examined the differences between T cell subsets in EM compared with uninvolved skin. Due to T cell heterogeneity, we first assigned T cells with an associated TCRB V(D)J sequence to either a CD4+ or CD8+ cell cluster computationally (Supplemental Table 4), followed by Tregs and dividing T cell subsets using a set of lineage-defining marker genes (Supplemental Figure 6, A–C, and Supplemental Table 5; ref. 16). The presence of CD4+ T cells in the 5 EM samples that had paired uninvolved skin samples permitted paired significance testing of pseudo-bulk gene expression profiles. Using a FDR threshold of 0.1, a total of 2944 genes were differentially expressed by CD4+ T cells in EM lesions compared with matched uninvolved skin (Supplemental Figure 7A). Signatures enriched among upregulated genes in EM skin included costimulation by the CD28 family (P = 0.006) and chemokine receptors bind chemokines, consistent with interactions between CD4+ T cells and antigen-presenting cells (APCs) in the skin microenvironment (Supplemental Figure 7D and https://doi.org/10.5281/zenodo.4711465). No significantly DEGs were identified for CD8+ T cells (https://doi.org/10.5281/zenodo.4711465; 4 sets of paired samples) or Tregs (https://doi.org/10.5281/zenodo.4711465; 4 sets of paired samples) in EM versus uninvolved skin after FDR correction (Supplemental Figure 7, B and C; top 20 DEGs shown based on absolute expression differences). However, when considering the top 40 EM-associated genes, several inflammation-related gene sets were enriched in CD8+ T cells and Tregs (Supplemental Figure 8, A and B).
The pattern of gene expression among the most highly changed genes in the EM skin versus uninvolved skin was similar between cohort 1 and cohort 2. We found a significant correlation when comparing the absolute expression differences of the top 20 upregulated and downregulated genes in cohorts 1 and 2 for CD4+ T cells (Supplemental Figure 9A) as well as CD8+ T cells (Supplemental Figure 9B), Tregs (Supplemental Figure 9C), and B cells (Supplemental Figure 9D). The same positive correlation was also observed in myeloid cells (Supplemental Figure 9E and https://doi.org/10.5281/zenodo.4711465). In cohort 2, we again observed B cell upregulation of the MHC class II gene HLA-DQA2 and CD69, a marker of both cell activation and resident tissue cells (17). The findings that EM B cells increased the expression of genes involved in antigen presentation and interaction with T cells, particularly MHC class II genes, and EM T cells upregulated genes associated with costimulation and interactions with APCs support a role for B cells as APCs in the EM lesion.
Memory B cells, including unmutated class-switched B cells, were enriched in EM lesions.
B cells in the skin belonged to 1 of 2 clusters, either a memory cell cluster or a plasma cell cluster, as identified by automated cell annotation (Supplemental Table 6) and based on the marker genes MS4A1 (CD20) for memory B cells and PRDM1 (BLIMP-1) for plasma cells (Supplemental Table 7 and Supplemental Figure 10, A and B). The frequencies of memory B cells and plasma cells (Figure 4, A and B), as well as CD4+ T cells, CD8+ T cells, Tregs, and dividing T cells (Figure 4, C and D), were quantified as a total fraction of the cells present in the biopsy sample. An enrichment for memory B cells (paired ratio t test, P = 0.010; Figure 4B) and all T cell subsets (paired ratio t test for CD4+ T cells, CD8+ T cells, Tregs, and dividing T cells were P = 0.019, P = 0.017, P = 0.014, P = 0.017, respectively; Figure 4D) was observed. Although the mean frequency of plasma cells was also higher in EM skin, this was not statistically significant (paired ratio t test, P = 0.126; Figure 4B) and appeared to be driven by a single subject due to the presence of 1 expanded clone; requantification excluding this subject confirmed no significant influx of plasma cells into the lesion (paired ratio t test, P = 0.261).
To better define the differences and respective functions between these 2 B cell subsets present in EM, we compared their BCR repertoire properties. We observed that memory B cells were more likely to be IgM (paired t test, P = 0.031; Figure 4E), and these IgM memory B cells displayed a lower frequency of SHM compared with IgM plasma cells (paired t test, P = 0.038; Figure 4F). Moreover, a subpopulation of memory B cells were unmutated (defined as <1% mutation frequency in V and J segments), particularly among IgM-expressing B cells (consistently more than plasma cells, paired t test, P = 0.046; Figure 4G and Supplemental Table 8). A subpopulation of IgG-switched plasma cells were also unmutated (5 cells across all samples or 15.8% of plasma cells; Supplemental Table 9). IgA-switched and IgG-switched cells were detected in both memory B and plasma cell populations. IgA-switched memory B cells were present in EM lesions in all subjects and among plasma cells in 4 subjects; IgG-switched cells were detected among all plasma cells in EM lesions and were only absent from the memory B cells of 1 subject. Overall, the memory B cell infiltrate in EM lesions contained a higher frequency of IgM-expressing cells than the plasma cell infiltrate.
Evidence for antigen-driven selection among B cells and T cells.
Given that we found B cells expressing IgA- and IgG-switched BCRs in the EM infiltrate, which suggests prior antigen experience, we examined the VH gene usage in the BCR repertoire of EM lesions. VH gene usage reflects the underlying antigen specificity of B cells and differences would suggest active selection of B cells found in the tissue based on specificity. We compared the VH gene usage distribution of B cells in the EM lesion with the repertoire observed from bulk sequencing of PBMCs (Supplemental Table 10) in these same subjects. We reasoned that PBMCs would be a suitable reference because the VH gene usage of the circulating B cell repertoire has been well characterized and the blood is the most proximal location from which nonresident B cells might enter the skin.
Switched B cells (IgA or IgG) in the EM lesion expressed less IGHV4-34 (paired t test, P = 0.002; Supplemental Figure 11A) and less IGHV3 overall (paired t test, P = 0.004; Supplemental Figure 11B). However, with the exception of IGHJ1 (paired t test, P = 0.001, usage of this J gene occurred in less than 2% of the circulating repertoire), no differences in the use of IGHJ genes were observed (Supplemental Figure 11C). To confirm that the VH gene usage differences were not due to the different technologies used to profile the skin and PBMCs (single cell vs. bulk, respectively), we verified our findings using a parallel single-cell PBMC sequencing sample from 1 subject with EM (Supplemental Figure 11, D–F). These analyses confirmed that B cells in the EM lesion were clonally expanded with a distinct VH gene repertoire featuring decreased use of IGHV3 family genes.
We found additional evidence for antigen-driven selection of B cells and T cells within the EM lesion by evaluating their clonal expansion (Supplemental Figure 12, A–C). In the case of B cells, clones were expanded in EM skin compared with paired uninvolved skin. Because traditional diversity metrics could not be used due to the sparsity of B cells in uninvolved skin samples, we quantified clonality by calculating the mean size of B cell clones in each sample. The mean number of cells of the largest B cell clone in uninvolved tissue was 1 B cell (i.e., no expansions in samples where B cells were found), whereas in EM tissue, the mean size of the largest B cell clone was 39 B cells (1-tailed paired Wilcoxon’s test P = 0.029; Supplemental Table 11). T cells were also more clonally expanded in the EM skin. We found that the mean size of the largest CD8+ T cell clone was larger in EM compared with uninvolved skin (1-tailed paired Wilcoxon’s test, 20 vs. 6 cells, P = 0.063; Supplemental Table 12), with a similar trend also observed for CD4+ T cell clones (1-tailed paired Wilcoxon’s test, 23 vs. 9, P = 0.094; Supplemental Table 13). EM B cells were also expanded compared with a control blood B cell single-cell repertoire available from 1 subject (4.9% for the largest clone in skin and 1.4% for the largest clone in blood, consistent with sizes reported in the literature) using methods that correct for different sampling depths (18). The presence of these clonal expansions is consistent with antigen-driven selection in situ.
EM-associated B cells and T cells were traced to the circulation.
To assess whether B cells and T cells present in EM skin were derived from a localized response in situ or active trafficking from the circulation, we examined the bulk repertoire of PBMCs (Supplemental Table 10) for clonal relatives of these EM cells in each subject. By quantifying the extent to which members of the same clonal family were found in both EM skin and blood, we found the overlap was highly significant for both B cells (paired t test, P = 0.014; Supplemental Figure 13A) and T cells (P = 6.1 × 10–5; Supplemental Figure 13B), suggesting a meaningful clonal connection between the 2 sites. EM B cells that were clonally related to circulating B cells (referred to as “mobile”) had similar repertoire properties to those without such relatives (referred to as “resident”). Specifically, no significant differences between mobile and resident B cells were found for isotype usage or SHM frequency (Supplemental Figure 14, A and B, respectively). We noted that up to 20% of the resident IgG B cells were unmutated, whereas no unmutated IgGs were found among mobile B cells, but this difference was not significant (Supplemental Figure 14C).
Because the PBMC repertoire was profiled using bulk repertoire sequencing, clones were defined for B cells and T cells using only IGH or TCRB, respectively, in these analyses. Although we have previously shown that IGH alone is sufficient to determine most B cell clonal relationships, up to 20% of clones defined this way may be subdivided based on light chain expression (19). To confirm the results obtained with bulk receptor sequencing, we examined 1 subject from whom both skin and blood B cell repertoires were obtained with paired heavy and light chain receptors. We observed that 76.5% of the clones shared between EM and the circulation and detected by examination of the heavy chain alone were preserved when the light chain was also incorporated in the clonal definition. Similar results were obtained for TCRB (85.6%). This single-cell analysis confirmed the presence of mobile B cells at the EM lesion, and that the mobile and resident B cells exhibited similar repertoire properties (Supplemental Figure 14, D–F).
B cell and T cell subsets displayed different levels of connectivity with the circulation (Figure 5). Compared with memory B cells, a higher frequency of plasma cell clones was mobile (paired ratio t test P = 0.007; Figure 5, A and B). Among T cell subsets, the frequency of mobile Treg clones was less than other T cell subsets (P = 0.006, ANOVA; Figure 5D). Tregs were less easily traced to the circulation than dividing T cells and CD8+ T cell clones (paired ratio t test P = 0.006, P = 0.003, respectively; Figure 5D); CD4+ T cells were also significantly less mobile compared with CD8+ T cells (P = 0.043; Figure 5, C and D).
We next compared the gene expression pattern of mobile and resident cells in the EM lesion. Mobile CD4+ T cells were characterized by upregulation of IL-2RB1 (Supplemental Figure 15, A and D) and TBX21 (https://doi.org/10.5281/zenodo.4711465). CD8+ T cells traced to the circulation were characterized by genes associated with CTL effector differentiation like EOMES, GZMK, and NKG7 as well as chemokine genes like CCL5 and CCL4 (Supplemental Figure 15B and https://doi.org/10.5281/zenodo.4711465). They were associated with an IFN γ signaling signature (P = 5.1 × 10–12; Supplemental Figure 15E). Memory B cells traced to the circulation were enriched for a signature consistent with cell surface interactions at the vascular wall (Supplemental Figure 15, C and F, and https://doi.org/10.5281/zenodo.4711465). No significant differences in gene expression were identified in our analysis of mobile versus resident Tregs or plasma cells. Thus, B cells that have clonal relatives in the circulation were characterized by a trafficking signature, whereas mobile CD4+ and CD8+ T cells displayed a skew toward effector activation and trafficking.
EM-associated unmutated IgM memory B cells were activated and part of diversified clones in the circulation.
Given previous work showing that T-independent B cell production of antibodies can protect against B. burgdorferi infection (20, 21), we sought to better characterize the phenotype of the unmutated IgM memory B cell subset. We found that unmutated IgM memory B cells, defined as having less than 1% mutation frequency, could not be detected in uninvolved skin from any subject. To confirm that unmutated IgM memory B cells were not misclustered naive B cells, we examined the expression of genes that may distinguish these subsets. We analyzed the entire gene expression profile of the unmutated IgM memory B cell subset (https://doi.org/10.5281/zenodo.4711465) compared with other memory B cells for genes associated with those reported previously for naive B cells (16). We did not find any naive B cell genes among the set of significantly DEGs. For example, among the marker genes CD27, CD23, and IL-4R, only IL-4R was observed to be more highly expressed among unmutated IgM memory B cells compared with other memory B cells, but this was not significant after FDR correction (IL-4R, paired t test P = 0.005, P = 0.053 after FDR correction; Figure 6, A and B). CD27 expression was not downregulated in the unmutated IgM memory B cell subset (P = 0.838).
The gene signature associated with the unmutated IgM memory B cells compared with all other memory B cells included signaling by the BCR (P = 1.6 × 10–4) and antigen-activated BCR (P = 2.2 × 10–5; Figure 6C). These unmutated IgM memory B cells also were associated with signatures related to the activation of BCR signaling (antigen activates BCR, P = 0.03) and TLRs (TLR10 cascade, TLR5 cascade, and MyD88 cascade initiated on plasma membrane). TLR10 was also observed to be the most highly expressed TLR in EM B cells but not in myeloid or dendritic cells (Supplemental Tables 14 and 15). The expression of TLR10 also correlated with the expression of an IFN-induced tryptophan catabolism gene kyureninase (KYNU; P < 2.2 × 10–16, Spearman’s correlation = 0.49), which has been associated with immunosuppression and previously identified as upregulated in a bulk transcriptome analysis of EM (10). Overall, the gene signatures of unmutated IgM memory B cells in the EM lesion suggested an activated phenotype, and they displayed a skewed VH and JH gene usage, suggesting they were selected by antigen. Compared with circulating unmutated IgM B cells, EM unmutated IgM memory B cells used less IGHV3-23 (paired t test P = 0.004), IGHV4-59 (P = 0.002), and IGHJ6 (P = 0.0005); IGHJ6 is a JH gene associated with long CDR3s and self-reactivity in naive B cell subsets (Supplemental Figure 16, A–C). These findings were also confirmed when comparing the EM B cell repertoire with the circulating B cell repertoire that was analyzed at the single-cell level (Supplemental Figure 16, D–F).
Although the unmutated IgM memory B cells were not clonally expanded within the EM lesion, 5 of these cells (2.3% of EM IgM B cells) had clonal relatives in the circulation (Supplemental Figure 17). Of these 5 clones, 4 included IgG-switched relatives in the circulation, 3 of which were also unmutated. Interestingly, although the IgM memory B cells in the EM lesion were unmutated, diversifying mutations were found in clonal relatives from the circulation in 3 of the 5 clones. Thus, unmutated IgM memory B cells, which are found specifically in the EM but not uninvolved skin, appeared to be activated cells that were related to expanded, class-switched, and diversified B cells in the circulation.
Discussion
The EM lesion results from local skin infection with Ixodes tick–transmitted B. burgdorferi spirochetes. The act of tick feeding and the associated immunomodulatory effects of tick saliva disrupt normal wound healing and create a protective niche for B. burgdorferi to establish infection in the blood-meal host skin (22, 23). This introduction of B. burgdorferi spirochetes at the bite site and their subsequent expansion elicits the first clinical sign of LD, EM, that we sought to elucidate at single-cell resolution.
The EM lesion has historically been characterized through histopathology and IHC as comprising perivascular and interstitial infiltrates dominated by IFN-γ–producing lymphocytes, macrophages, DCs, and occasional plasma cells (3, 8). Our results confirm and extend these earlier findings by providing the first detailed description at the single-cell level of B cells within the EM lesion. The enhanced representation of B cells was found in all Lyme EM samples in comparison with uninvolved skin. These cells expressed high levels of MHC class II and genes associated with antigen presentation and inflammatory activation, and results were validated using 2 independent cohorts and single-cell sequencing approaches. EM B cells comprised both memory B cell and plasma cell populations, with approximately half possessing identifiable circulating relatives. Memory B cells were disproportionately IgM+ and displayed lower overall SHM frequencies compared with plasma cells. A large fraction possessed unmutated germline receptors that were associated with a TLR/MyD88 signature. These IgM+ B cells also belonged to clones with circulating relatives but displayed a distinct VH gene repertoire compared with relatives in the blood, suggesting active selection for specific BCRs in situ. These results reveal that B cells recruited to the EM lesion may undergo expansion when in contact with antigens present in the skin microenvironment, which may include B. burgdorferi or tick-associated antigens as well as those of the host microbiome. As B. burgdorferi components are known to activate MyD88-dependent TLR signaling, these results support the hypothesis that specialized IgM memory B cells may act as early APCs, serving an important and previously uncharacterized role in the adaptive immune response against B. burgdorferi that arises at its portal of entry in the skin.
The presence of resident plasma cells and CD20+ B cells in healthy skin has been observed previously although their functional roles in this tissue remain poorly understood given their extremely low frequency (14). Although B cells were abundant in EM lesions in our study, we found a low frequency of B cells in uninvolved skin, consistent with these previous reports. None of the B cells in uninvolved skin were clonally expanded and only one IgG1-switched plasma cell could be traced to the circulation. By comparison, approximately 40% of B cells in EM lesions could be traced to the circulation, and approximately 80% were memory cells rather than plasma cells, including a plurality that expressed IgM. Previous literature has suggested that infiltrating IgM B cells may be functionally relevant in inflammation, including IgM ASCs in human skin (24, 25); we believe the B cells present in EM lesions may have played a similar role. The overall character of the B cell infiltrate in EM lesions appeared to be highly distinct when compared with the few B cells typically present in uninvolved skin.
The B cells in uninvolved skin expressed lower levels of CD20, reflecting a mainly plasma cell composition. We also directly demonstrated that memory B cells were observed to be more abundant in EM lesions. The memory B cells present in EM have an antigen-presenting signature (vs. circulating plasmablasts, which may also sometimes have an antigen-presenting phenotype) and signatures of IFN and cytokine signaling. Memory B cells, particularly those expressing MHC class II and costimulatory molecules, have previously been shown to be more abundant in animal models of inflammatory disorders at barrier sites, where they exhibit APC function to T cells and mediate local antibody production (14, 24, 26, 27). We also observed that B cells that could be traced to the circulation displayed signatures consistent with trafficking; some B cell clones could be found in both the skin and the circulation, providing supporting evidence that some of these B cells are not tissue resident. Studies have also shown trafficking of memory B cells to the lung in the context of influenza pneumonia; these B cells can transition to produce IgG-switched antibodies that clear infection (28). Our group has observed the trafficking of B cells, including plasmablast and nonplasmablast subsets, in B cell responses to HSV in the female reproductive tract (29). By tracing B cells using their BCRs, this study provides rigorous human data that antigen-presenting memory B cells directly infiltrated the Lyme EM lesion.
VH gene differences indicate a possible enrichment of B cells in the EM lesion with distinct specificities compared with those from the circulation. This suggests that selected B cells enter the EM lesion based on the specificities of their BCRs. Skewing away from VH3 family genes, as found in the study, has also been observed in the context of myasthenia gravis and specific tissue-like memory B cell subsets in multiple sclerosis by our group; recent studies have also observed this skewing in the context of SARS-CoV-2 infection (18, 30, 31). Other studies of autoimmune diseases, including those driven by IFN signaling such as systemic lupus erythematosus, have noted a similar shift in the global BCR repertoire (32–34). A recent study on circulating plasmablast responses in LD has provided parallel evidence that selection for B. burgdorferi specificity can be detected at the repertoire level although the exact VH genes differentially used by these B cells was not reported (5). In this regard, a European study examining the B cell repertoire in peripheral blood of patients with EM found an overlap between IgM and IgG CDR3 sequences during acute and convalescent infection samples compared with healthy controls, suggesting that B cell immune responses associated with B. burgdorferi infection may be detectable in the circulating repertoire (35).
Our finding that unmutated IgM-expressing memory B cells appear to be interacting with antigen suggests a potential source for T-independent IgM that may be important for early protective humoral immunity. The identity of these B cells is therefore of particular interest. There is evidence of a role for innate B cells in mice infected with B. burgdorferi, but whether similar populations contribute to the human response is unknown (20). IgM B cells that are predominant in EM lesions lack the hallmark features of an innate B cell population. Perhaps the most important evidence concerning this possibility is the distinctly non-naive gene expression profile of infiltrating unmutated IgM memory B cells and their expression of TLR gene programs. These B cells also use less JH6, which is consistent with a non-naive origin, and display a signature of prior antigen contact, suggesting an antigen-experienced phenotype. Although some studies have suggested that innate-like B cell populations in humans possess a distinct non-naive (antigen-experienced) phenotype (36, 37), markers previously found to be associated with these subsets were neither detectable (CD43) nor upregulated (CD5 and NOTCH2) in the subset of unmutated IgM memory B cells in our analysis. These B cells may instead represent a potentially extranodal B cell response that may precede a more robust lymph node response, analogous to what has been reported in B. burgdorferi infection in mice (38), and may be a source of IgM ASCs, a subset that has recently been observed in human skin (25). As discussed above, the observation of a TLR/MyD88 signature among this subset of B cells is consistent with literature suggesting the relevance of T-independent B cells (20, 21) and the pattern recognition system, including on B cells, in optimal serological responses to B. burgdorferi (39, 40). B. burgdorferi components that could drive T-independent B cell responses include its abundant lipoproteins that contain the TLR2-stimulating tripalmitoyl-S-glyceryl-cysteine moiety (e.g., in vivo expressed outer surface protein C and VlsE) as well as B. burgdorferi peptidoglycan (41, 42).
We also showed high levels of TLR10 mRNA in B cells compared with other subsets, including plasmacytoid DCs, on which it is known to be expressed at low levels (43). TLR1 and TLR10 were the most abundant TLR transcripts in B cells, whereas TLR2 and TLR4 transcripts were the most abundant in myeloid cells. TLR10 is functional in humans and possesses both activating and inhibitory signaling potential depending on the cell system examined (44, 45). Influenza virus infection of primary human monocytes in vitro enhances inflammatory cytokine production, whereas anti-TLR10 antibodies can suppress primary human B cell proliferation and production of cytokines and chemokines in response to BCR stimulation conditions mimicking T-independent or T-dependent engagement (46). A modulatory role for TLR10 on B cell function in LD remains to be elucidated in the temporal evolution of the lesion, although evidence exists for its role in controlling immune responses against other pathogens (46, 47). In this regard, we found a correlation between TLR10 expression and expression of KYNU, previously identified as upregulated in a bulk transcriptome analysis of EM and associated with immunosuppression (10).
Our findings related to T cells, particularly the identification of a Th1 signature and evidence of local IFN-γ production associated with CD8+ T cell subsets clonally traced to the circulation, are consistent with previous studies implicating these subsets in the control of local infection including against B. burgdorferi (3, 4, 39). However, we did not identify a notable role for Tregs because they lacked a clear differential gene expression signature in EM compared with uninvolved skin. Tregs did not significantly upregulate IL-10 in EM compared with uninvolved skin, and the expression levels of this transcript in Tregs were not elevated compared with myeloid subsets. These results are in contrast to the model proposed by Marques et al. for a role for expression Tregs based on bulk transcriptomics of EM lesions (10).
A single EM is a recognized early manifestation of LD, but some patients present with multiple EM, indicating disseminated infection. One subject in our cohort presented with multiple EM and some insights could be gleaned from analyzing this sample. The primary EM from this subject displayed the lowest fraction of infiltrating lymphocytes with the fewest cells from the T cell cluster present in the tissue relative to other subjects from the cohort. Although a robust B cell cluster infiltrate was present (second highest for memory B and plasma cells), it was distinct in terms of isotype usage and SHM frequency. The B cells in the EM lesion of this subject possessed the highest fraction of IgG-switched receptors among memory B cells, the highest fraction of IgA-switched receptors among plasma cells, and the highest mean SHM for IgA-switched receptors among plasma cells. IgA-serum reactivity to B. burgdorferi has been observed in up to a third of subjects with early disease and IgA-positive subjects were more likely to have disseminated disease (48). More extensive studies are warranted to determine whether a reduced T cell response and low frequency of unmutated IgM antigen-presenting memory B cells, as seen in the subject with disseminated disease in our study, represent a suboptimal skin immune response that is permissive to B. burgdorferi dissemination.
Our findings may have important significance for the study of host defense against infection with Treponema pallidum, the spirochetal agent of syphilis, and other epidemic treponematoses that enter through the skin (49). Sexually transmitted T. pallidum infection gives rise to the primary syphilitic chancre at its portal of entry through mucosal surfaces, especially in oral or genital areas. Histopathology of the lesion in humans reveals abundant T cells and macrophages as well as varying numbers of plasma cells (50, 51). B cells are present but have not been characterized. T. pallidum differs from B. burgdorferi in that it has a paucity of outer membrane proteins (52) and a vaccine directed at the rare outer membrane proteins has been elusive. Study of the syphilitic chancre using scRNA-Seq and BCR sequencing may therefore provide insight into the role of B cells in the pathogenesis of syphilis at its earliest stage of infection and aid in vaccine design.
Conclusions from our study may be limited by the modest sample size (n = 10) and the predominance of older female subjects. However, we show that our transcriptomics findings were agnostic to the type of scRNA-Seq performed because 2 approaches (10× Genomics and Seq-Well) produced the same qualitative results in terms of differential gene expression for key cell subsets like B cells, CD4+ T cells, and CD8+ T cells. We did not examine T cells bearing γδ TCR because there was poor recovery of these receptors from repertoire sequencing using the emulsion-based platform. Although comparison was made between autologous uninvolved skin biopsies with lesional skin in our studies, we cannot exclude a potential contribution of cells in the EM biopsies that may be trafficking to the site but still within the vasculature.
In summary, our single-cell analysis supports a role for B cells in antigen presentation and antibody production in the cutaneous response to B. burgdorferi infection in humans. The finding of unmutated IgM memory B cells in the EM lesion, possibly a consequence of TLR-mediated responses, suggests that these cells could have contributed to early defense against this pathogen prior to a more robust T-dependent response. The expression of TLR10 on these cells could have also contributed to paradoxical local immunosuppression that drives resolution of the lesion in the absence of antimicrobial intervention. Additional studies are warranted to examine the contributions of B cell subpopulations to host defense and resolution of clinical signs like EM.
Methods
Further information can be found in Supplemental Methods.
Subject selection.
Ten adult subjects with EM and fulfilling the CDC case definition for LD (10) were recruited from study sites affiliated with the Yale New Haven Hospital in New Haven County, Connecticut, USA, and the Mansfield Family Practice. These subjects made up 2 cohorts (n = 6 cohort 1, n = 4 cohort 2) enrolled between the months of May and October in 2 consecutive years (2018, 2019). All but 2 subjects (no. 9) tested positive on the Immunetics C6 Lyme EIA (purchased through Thermo Fisher Scientific), which was performed according to the manufacturer’s protocol and interpreted using the company’s recommended Lyme index values of greater than or equal to 1.10 (positive), 0.91–1.09 (equivocal), or less than or equal to 0.90 (negative; ref. 13).
Sample preparation.
Blood samples and skin punch biopsies of the leading edge of the EM lesion were collected from all 10 subjects, whereas control skin from uninvolved skin at least 2 cm away from the EM leading edge was collected from 7 subjects. PBMCs in whole blood were isolated by Ficoll-Paque PLUS (GE Healthcare) gradient centrifugation. After red blood cell lysis using ACK buffer (Thermo Fisher Scientific, A1049201), cells were washed and resuspended in 90% human sera AB (GeminiBio) and 10% DMSO for cyropreservation for bulk BCR and TCR sequencing. Skin biopsies were obtained with a 3-mM disposable biopsy punch (McKesson and Integra Life Sciences), and samples were placed immediately into MACS Tissue Storage Solution (Miltenyi Biotec, 130-100-008) for transport to the laboratory. Tissues were immediately processed into single-cell suspensions using the Whole Skin Dissociation Kit (Miltenyi Biotec, 130-101-540) according to the manufacturer’s recommendation. Briefly, the tissue was washed in DMEM media (Thermo Fisher Scientific, 11960), placed in the enzyme solution, and incubated in a 37°C water bath for 3 hours. Thereafter, the tissue cells were dissociated using the MACS Dissociator (Miltenyi Biotec, 130-093-235), preprogrammed for skin cell isolation (program h-skin-01). The cells were then washed in DMEM, filtered with a 70-μm filter, pelleted and resuspended in DMEM, and counted. Cells were processed immediately for single-cell library preparation as described below.
Bulk repertoire preparation, sequencing, and preprocessing.
RNA was prepared from frozen PBMCs using the RNeasy Mini Kit (QIAGEN) per the manufacturer’s instructions. BCR and TCR RNA libraries were prepared using reagents from New England Biolabs as part of the NEBNext Immune Sequencing Kit (a gift of Eileen Dimalanta and Chen Song, New England Biolabs, Ipswich, Massachusetts). These steps were performed as previously described with the inclusion of an additional set of TCRB-specific primers on the 3′ end (53, 54). Libraries were sequenced by 325 cycles for read 1 and 275 cycles for read 2 base-pair paired-end sequencing with a 20% PhiX spike on the Illumina MiSeq platform according to manufacturer’s recommendations. Bulk repertoire data processing and analysis was carried out using tools from the Immcantation framework (http://immcantation.org) as described in ref. 18. Preprocessing was carried out using pRESTO v0.5.11 and the IMGT human germline IGHV reference database (IMGT/GENE-DB v3.1.19; retrieved May 30, 2019) as described in refs. 53, 54. BCR isotypes were assigned by local alignment of the 3’ end of each sequence to a set of constant region sequences. Data were deposited in the NCBI’s Gene Expression Omnibus database (GEO GSE169440).
Emulsion-based single-cell library preparation and gene expression analysis.
Single-cell preparations were loaded into the Chromium Controller (10× Genomics) for emulsion generation, and libraries were prepared using the Chromium Single Cell 5′ Reagent Kit for version 1 chemistry per the manufacturer’s protocol. Libraries were sequenced on the NovaSeq 6000 with 100 × 100 or 150 × 150 paired-end reads for gene expression and BCR/TCR libraries, respectively. Base calls were converted to fastq sequences and demultiplexed using the cellranger v3.1.0 mkfastq function and aligned to the coding sequences of the GCRhg38 genome supplied by 10× Genomics. Sparse count matrices, barcode assignments, and feature calls were made using the cellranger count subcommand.
An average of 128 million reads were sequenced and an average of around 7000 cells were identified per sample (Supplemental Table 1). The resulting output was loaded into Seurat v.3.1.5 for analysis (55). Cells with fewer than 400 or more than 4000 genes detected or mitochondrial content above 15% of all transcripts were discarded (genes expressed in fewer than 5 cells in all samples were also excluded). A total of 65,363 cells were identified. Single-cell expression values were then scaled using log normalized count values (56). Highly variable genes were identified using FindVariableGenes. The IntegrateData function (using the first 30 dimensions, the default parameter) was used to correct for batch effects. The first 20 dimensions were then selected for clustering analysis based on the location of the inflection point from PCElbowPlot. Clusters from total cells were assigned using the FindClusters function based on a shared nearest neighbor algorithm using a resolution of 0.4 (value at which most major clusters could be resolved; ref. 55). Clusters were annotated based on the expression of a panel of marker genes (Supplemental Figure 3 and Supplemental Table 2). This procedure was repeated for cells from the T cell and B cell clusters that were associated with a TCR or BCR sequence from single-cell repertoire sequencing, respectively. We defined cells as associated with a TCR when repertoire sequencing identified an associated TCRB V(D)J sequence, and as associated with a BCR when the repertoire sequencing identified both IGH and IGK/L sequences. We did not require a paired TCRA because only 60.3% of TCRB were paired with TCRA, given low TCRA expression. In contrast, 96.5% of IGH were found to be paired with IGK/L. To identify clusters, a resolution of 0.7 was used for T cells, whereas a resolution of 0.1 was used for B cells. The Seurat RunUMAP function was run to visualize clusters.
To assign T cell and B cell cluster annotations, cell clusters were first annotated using a “basis set” of published marker genes called immunoStates (Supplemental Table 5 and Supplemental Table 7; ref. 16). The mean expression of each gene (raw transcript counts) was computed for each cluster; each cluster was then assigned to the immunoState subset with the highest Spearman’s correlation coefficient. Tregs and dividing cell immunoStates markers were absent from the immunoStates matrix. These annotations were identified manually using marker genes alone, that is, based on the fraction of each cluster expressing FOXP3 or CDK1 above a specified threshold (Supplemental Figure 6 and Supplemental Table 5). Assignments from the 2 clusters identified from B cell single-cell data were assigned using the immunoStates matrix and then confirmed based on the fractional expression of CD20 and BLIMP1 (Supplemental Figure 4 and Supplemental Table 7).
Bulk BCR and TCR repertoire V(D)J gene annotation, additional sequence filtering, clonal assignment, germline reconstruction and analysis of SHM, overlap, and phylogenetics.
Reconstructed V(D)J sequences from single-cell sequencing were extracted using the cellranger vdj function from fastq reads. V(D)J germline genes were reassigned using IgBLAST v.1.6.1 also using the September 12, 2018, version of the IMGT gene database for both bulk and single-cell repertoires. After V(D)J annotation, nonfunctional sequences were removed from further analysis for both bulk and single-cell repertoires. For single-cell sequencing–derived repertoires, cells with multiple IGH, TCRB, or TCRA V(D)J sequences were assigned to the most abundant V(D)J sequence by UMI count. In cases where multiple IGH, TCRB, or TCRA V(D)J sequences from the same cell shared the same number of UMI counts, cells were assigned to the most abundant V(D)J sequence by number of paired sequencing reads. V(D)J sequences without matching single-cell transcriptomes were excluded. Any TCRD or TCRG assigned V(D)J sequences from single-cell sequencing was discarded. IGH V(D)J sequences not paired with any matching light chain sequences from single-cell sequencing were also excluded. Bulk and single-cell repertoires were then concatenated and V(D)J sequences were assigned into clonal groups using Change-O v0.4.6 (57). Sequences were first grouped based on IGHV gene, IGHJ gene, and junction lengths. Within these groups, sequences differing by less than a length normalized Hamming distance of 0.18 were defined as clones by single-linkage clustering. This distance threshold was determined based on the average of the local minima between the 2 modes of within-sample bimodal distance-to-nearest histograms using a kernel density estimate as previously described (57). Germline sequences were reconstructed for each clonal cluster with D segment and N/P regions masked (with Ns) using the CreateGermlines.py function within Change-O. To identify TCR clones, clones were assigned if T cells shared the same full-length aligned V(D)J sequence.
To validate the identification of clones based on sharing between bulk and single-cell repertoires, sharing between single-cell repertoires was also quantified for a PBMC and EM sample from subject no. 5. This was done by repeating the same alignment, filtering, and single-linkage clustering approaches described above on these single-cell–only repertoires. In addition, the IGH sequences were then further grouped based on whether the cells shared any common combinations of IGKV with IGKJ and a specific junction length or IGLV with IGLJ and a specific junction length.
Mutations relative to the germline sequence were quantified using the observedMutations function from SHazaM v1.0.1 in R v3.6.1 (58). Clonal overlap was computed using a Bray-Curtis metric implemented using the scipy.spatial.distance.braycurtis function in scipy v1.1.0 (59). Clonal abundance was computed using the estimateAbundance function from Alakazam v1.0.2, with uniform downsampling (to the number of V(D)J from the sample with the fewest sequences to account for different sequencing depth) and 200 replicates of bootstrapping from the inferred complete clonal abundance (18). B cell lineage tree topologies and branch lengths were estimated by maximum parsimony using the dnapars program distributed as part of PHYLIP (v3.697; ref. 60) and the R package dowser v0.0.1. Trees were visualized using the R packages ape v5.4 (61) and ggtree v2.0.4 (62).
To compute background overlap for IGH sequences to justify the presence of significant clonal overlap, clones were reassigned across all subjects using the same clonal threshold of 0.18 described earlier or based on identical V(D)J sequences for TCRB sequences. Bray-Curtis overlap was computed for V(D)J sequences from single-cell–derived IGH repertoires and bulk repertoires from the same subject (“Intra-Subject) or different subjects. The average overlap for comparisons with different subjects was computed (Inter-Subject). A paired 1-tailed t test was performed to assess the significance of the log-transformed clonal overlap relative to the background.
Differential gene expression analysis.
Single-cell log normalized expression values were imputed using the ALRA algorithm to account for dropout during differential gene expression analysis (63). All differential gene expression analyses were performed using pseudobulk averages of gene expression across single-cell transcriptomes; individual gene expression values were first Z-score normalized across samples from the same subject before pseudobulk averages were computed (for all differential gene expression analysis; refs. 53, 54). Student’s t tests with pairing were performed to assign P values for hypothesis testing. FDR correction was performed using Storey’s method implemented within the q value package v2.18.0 in R from Bioconductor after visualization of the P value distribution (64). More than 2 cells must be present in each sample to calculate a pseudobulk average for that sample. A q value (FDR corrected P value) threshold of 0.1 was used for assigning significance for tests of significance from gene expression analysis.
EM compared with uninvolved skin B cell gene expression analysis differed from other calculations in that no paired significance testing was performed given that B cells were identified in only 2 uninvolved skin samples versus all 6 EM skin samples. Therefore, differential expression was computed without paired significance testing or normalization across samples. The minimum absolute differences between EM and uninvolved samples was computed and used to select the top and bottom genes (of note, unpaired significance testing [Student’s t test] did not yield significant genes after FDR calculation). A disproportionate fraction of the EM lesion B cell infiltrate in subject no. 6 was observed to be composed of a single expanded, predominantly plasma cell, IgG-switched clone causing the plasma cell fraction of this EM sample to be much higher than that of other samples; at 184 B cell members, this clone was substantially larger than the next largest clone of the data set containing 8 B cell members and was therefore an outlier in terms of size. This clone was excluded from gene expression analysis of this sample to maintain consistent memory B cell and plasma cell ratios across EM samples for gene expression analysis and to reduce noise.
When no significant genes were identified (comparisons of Treg and CD8+ T cells from uninvolved and EM skin), the top genes based on absolute difference were analyzed. Gene ontologies were assigned using the enrichr package v2.1 in R, which implements a Wilcoxon’s test for significance using a P value threshold of 0.05 for significance (65). A q value (corrected P value) threshold is reported in cases where this was also less than 0.05. The “Reactome_2016” gene set was used for all pathway enrichment analysis.
Code and data availability.
Code used in this study will be made available in a publicly accessible repository at https://bitbucket.org/kleinstein/projects/src/master/Jiang2021/ Data for gene expression analysis are available in a publicly accessible repository at https://doi.org/10.5281/zenodo.4711465
Statistics.
R v3.6.1 was used for all statistical analysis. Dataframe handling and plotting was performed using functions from the tidyverse v1.3.0 in R and pandas v0.24.2 and scipy v1.1.0 in python v3.7.5. All parametric statistical testing (aside from differential gene expression analysis and gene set analysis, described above, was performed in R using the t test functions for paired 2-tailed Student’s t tests unless otherwise specified. A significance threshold of P values of less than 0.05 was used. All tests of the proportions of different cell subsets or frequencies of cells out of 1 were first subjected to a log transformation and are specified by text, i.e., a ratio 2-tailed t test was performed. All other tests, including differences in SHM, isotype usage, and gene expression, did not involve a log conversion and are specified by text.
Study approval.
This research was conducted under human research protocols approved by the IRB of Yale University. Written informed consent was received from all participants prior to inclusion in the study.
Author contributions
LKB, RRM, DAH, AKS and SHK designed the research studies. LKB, KRD, and AAB participated in subject recruitment and sample collection. All authors participated in data acquisition, analysis and/or interpretation. AAB, KR, and RJ were responsible for the preparation of samples. KBH, HM, RJ, and IF performed bioinformatic analysis. RJ, IF, LKB, and SHK drafted the manuscript, and all authors edited and approved the content for submission.
Supplementary Material
Acknowledgments
We thank J. Christopher Love for critical discussions regarding SeqWell and review of the manuscript draft. We thank Eileen Dimalanta and Chen Song from New England Biolabs for providing reagents used for the preparation of bulk BCR libraries. We also thank Allison Nelson for assistance with subject recruitment and Jialing Mao, Ming Li, and Nicole Pettas for technical assistance with sample processing. We also thank Zane Cooke, Saadiya Dalal, and staff at Mansfield Family Practice for their clinical assistance. This project was supported by the National Institute of Allergy and Infectious Diseases of the NIH through grants U19-AI089992, R01-AI104739, U24-AI118672, and the Harold W. Jockers Professorship to LKB. AKS was supported in part by a Sloan Fellowship in Chemistry and the Beckman Young Investigator Program.
Version 1. 06/01/2021
In-Press Preview
Version 2. 06/22/2021
Electronic publication
Funding Statement
Unrestricted endowed professorship awarded to Linda Bockenstedt
Footnotes
Conflict of interest: SHK receives consulting fees from Northrop Grumman. DAH has received funding for his laboratory from Bristol Myers Squibb and Genentech. A child of DAH is an employee of Sanofi. AKS has received compensation for consulting and scientific advisory board membership from Honeycomb Biotechnologies, Cellarity, Repertoire Immune Medicines, Ochre Bio, Third Rock Ventures, Relation Therapeutics, and Dahlia Biosciences.
Copyright: © 2021, Jiang et al. This is an open access article published under the terms of the Creative Commons Attribution 4.0 International License.
Reference information: JCI Insight. 2021;6(12):e148035.https://doi.org/10.1172/jci.insight.148035.
Contributor Information
Ruoyi Jiang, Email: ruoyi.jiang@yale.edu.
Hailong Meng, Email: hailong.meng@yale.edu.
Khadir Raddassi, Email: khadir.raddassi@yale.edu.
Ira Fleming, Email: IRA.FLEMING@CUANSCHUTZ.EDU.
Kenneth B. Hoehn, Email: kenneth.hoehn@yale.edu.
Kenneth R. Dardick, Email: kdardick@gmail.com.
Alexia A. Belperron, Email: alexia.belperron@yale.edu.
Ruth R. Montgomery, Email: ruth.montgomery@yale.edu.
Alex K. Shalek, Email: shalek@fas.harvard.edu.
David A. Hafler, Email: david.hafler@yale.edu.
Steven H. Kleinstein, Email: steven.kleinstein@yale.edu.
Linda K. Bockenstedt, Email: linda.bockenstedt@yale.edu.
References
- 1. Whitman W, Rainey F, Kampfer P, Trujillo M, Chun J, DeVos P, et al. eds. Bergey’s Manual of Systematics of Archaea and Bacteria. John Wiley & Sons, Inc; 2019. [Google Scholar]
- 2.Schwartz AM, et al. Surveillance for Lyme disease - United States, 2008-2015. MMWR Surveill Summ. 2017;66(22):1–12. doi: 10.15585/mmwr.ss6622a1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Steere AC, et al. Lyme borreliosis. Nat Rev Dis Primers. 2016;2:16090. doi: 10.1038/nrdp.2016.90. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Bockenstedt LK, et al. Immune response to Borrelia: lessons from Lyme disease spirochetes. Curr Issues Mol Biol. 2020;42:145–190. doi: 10.21775/cimb.042.145. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Blum LK, et al. Robust B cell responses predict rapid resolution of Lyme disease. Front Immunol. 2018;9:1634. doi: 10.3389/fimmu.2018.01634. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Mullegger RR, et al. Differential expression of cytokine mRNA in skin specimens from patients with erythema migrans or acrodermatitis chronica atrophicans. J Invest Dermatol. 2000;115(6):1115–1123. doi: 10.1046/j.1523-1747.2000.00198.x. [DOI] [PubMed] [Google Scholar]
- 7.Silberer M, et al. Downregulation of class II molecules on epidermal Langerhans cells in Lyme borreliosis. Br J Dermatol. 2000;143(4):786–794. doi: 10.1046/j.1365-2133.2000.03776.x. [DOI] [PubMed] [Google Scholar]
- 8.Boer A, et al. Erythema migrans: a reassessment of diagnostic criteria for early cutaneous manifestations of borreliosis with particular emphasis on clonality investigations. Br J Dermatol. 2007;156(6):1263–1271. doi: 10.1111/j.1365-2133.2007.07918.x. [DOI] [PubMed] [Google Scholar]
- 9.Salazar JC, et al. Coevolution of markers of innate and adaptive immunity in skin and peripheral blood of patients with erythema migrans. J Immunol. 2003;171(5):2660–2670. doi: 10.4049/jimmunol.171.5.2660. [DOI] [PubMed] [Google Scholar]
- 10.Marques A, et al. Transcriptome assessment of erythema migrans skin lesions in patients with early Lyme disease reveals predominant interferon signaling. J Infect Dis. 2017;217(1):158–167. doi: 10.1093/infdis/jix563. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. CDC. Lyme Disease (Borrelia burgdorferi) 2017 Case Definition. https://wwwn.cdc.gov/nndss/conditions/lyme-disease/case-definition/2017/ Updated January 2017. Accessed May 21, 2021.
- 12.Liang FT, et al. Sensitive and specific serodiagnosis of Lyme disease by enzyme-linked immunosorbent assay with a peptide based on an immunodominant conserved region of Borrelia burgdorferi vlsE. J Clin Microbiol. 1999;37(12):3990–3996. doi: 10.1128/JCM.37.12.3990-3996.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Wormser GP, et al. Single-tier testing with the C6 peptide ELISA kit compared with two-tier testing for Lyme disease. Diagn Microbiol Infect Dis. 2013;75(1):9–15. doi: 10.1016/j.diagmicrobio.2012.09.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Debes GF, McGettigan SE. Skin-associated B cells in health and inflammation. J Immunol. 2019;202(6):1659–1666. doi: 10.4049/jimmunol.1801211. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. doi: 10.1101/689273. Hughes TK, et al. Highly efficient, massively-parallel single-cell RNA-seq reveals cellular states and molecular features of human skin pathology [preprint]. Posted on bioRxiv January 2, 2019. [DOI]
- 16.Vallania F, et al. Leveraging heterogeneity across multiple datasets increases cell-mixture deconvolution accuracy and reduces biological and technical biases. Nat Commun. 2018;9(1):4735. doi: 10.1038/s41467-018-07242-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Szabo PA, et al. Single-cell transcriptomics of human T cells reveals tissue and activation signatures in health and disease. Nat Commun. 2019;10(1):4706. doi: 10.1038/s41467-019-12464-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Vander Heiden JA, et al. Dysregulation of B cell repertoire formation in myasthenia gravis patients revealed through deep sequencing. J Immunol. 2017;198(4):1460–1473. doi: 10.4049/jimmunol.1601415. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Zhou JQ, Kleinstein SH. Cutting edge: Ig H chains are sufficient to determine most B cell clonal relationships. J Immunol. 2019;203(7):1687–1692. doi: 10.4049/jimmunol.1900666. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Belperron AA, et al. Marginal zone B-cell depletion impairs murine host defense against Borrelia burgdorferi infection. Infect Immun. 2007;75(7):3354–3360. doi: 10.1128/IAI.00422-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.McKisic MD, Barthold SW. T-cell-independent responses to Borrelia burgdorferi are critical for protective immunity and resolution of lyme disease. Infect Immun. 2000;68(9):5190–5197. doi: 10.1128/IAI.68.9.5190-5197.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Nuttall PA, Labuda M. Tick-host interactions: saliva-activated transmission. Parasitology. 2004;129(Suppl):S177–S189. doi: 10.1017/s0031182004005633. [DOI] [PubMed] [Google Scholar]
- 23.Nuttall PA. Tick saliva and its role in pathogen transmission. Wien Klin Wochenschr. doi: 10.1007/s00508.019.1500-y. [published online May 6, 2019]. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Geherin SA, et al. The skin, a novel niche for recirculating B cells. J Immunol. 2012;188(12):6027–6035. doi: 10.4049/jimmunol.1102639. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Wilson RP, et al. IgM plasma cells reside in healthy skin and accumulate with chronic inflammation. J Invest Dermatol. 2019;139(12):2477–2487. doi: 10.1016/j.jid.2019.05.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Magri G, et al. Human secretory IgM emerges from plasma cells clonally related to gut memory B cells and targets highly diverse commensals. Immunity. 2017;47(1):118–134. doi: 10.1016/j.immuni.2017.06.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Young JS, et al. Donor B cells in transplants augment clonal expansion and survival of pathogenic CD4+ T cells that mediate autoimmune-like chronic graft-versus-host disease. J Immunol. 2012;189(1):222–233. doi: 10.4049/jimmunol.1200677. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Joo HM, et al. Broad dispersion and lung localization of virus-specific memory B cells induced by influenza pneumonia. Proc Natl Acad Sci U S A. 2008;105(9):3485–3490. doi: 10.1073/pnas.0800003105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Oh JE, et al. Migrant memory B cells secrete luminal antibody in the vagina. Nature. 2019;571(7763):122–126. doi: 10.1038/s41586-019-1285-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Fraussen J, et al. Phenotypic and Ig repertoire analyses indicate a common origin of IgD-CD27- double negative B cells in healthy individuals and multiple sclerosis patients. J Immunol. 2019;203(6):1650–1664. doi: 10.4049/jimmunol.1801236. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Galson JD, et al. Deep sequencing of B cell receptor repertoires from COVID-19 patients reveals strong convergent immune signatures. Front Immunol. 2020;11:605170. doi: 10.3389/fimmu.2020.605170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Bashford-Rogers RJM, et al. Analysis of the B cell receptor repertoire in six immune-mediated diseases. Nature. 2019;574(7776):122–126. doi: 10.1038/s41586-019-1595-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Crow MK. Type I interferon in the pathogenesis of lupus. J Immunol. 2014;192(12):5459–5468. doi: 10.4049/jimmunol.1002795. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Mietzner B, et al. Autoreactive IgG memory antibodies in patients with systemic lupus erythematosus arise from nonreactive and polyreactive precursors. Proc Natl Acad Sci U S A. 2008;105(28):9727–9732. doi: 10.1073/pnas.0803644105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Kirpach J, et al. Detection of a low level and heterogeneous B cell immune response in peripheral blood of acute borreliosis patients with high throughput sequencing. Front Immunol. 2019;10:1105. doi: 10.3389/fimmu.2019.01105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Descatoire M, et al. Identification of a human splenic marginal zone B cell precursor with NOTCH2-dependent differentiation properties. J Exp Med. 2014;211(5):987–1000. doi: 10.1084/jem.20132203. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Quach TD, et al. Distinctions among circulating antibody-secreting cell populations, including B-1 cells, in human adult peripheral blood. J Immunol. 2016;196(3):1060–1069. doi: 10.4049/jimmunol.1501843. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Elsner RA, et al. Suppression of long-lived humoral immunity following borrelia burgdorferi infection. PLoS Pathog. 2015;11(7):e1004976. doi: 10.1371/journal.ppat.1004976. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Whiteside SK, et al. IL-10 deficiency reveals a role for TLR2-dependent bystander activation of T cells in Lyme arthritis. J Immunol. 2018;200(4):1457–1470. doi: 10.4049/jimmunol.1701248. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Woods A, et al. MyD88 negatively controls hypergammaglobulinemia with autoantibody production during bacterial infection. Infect Immun. 2008;76(4):1657–1667. doi: 10.1128/IAI.00951-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Hirschfeld M, et al. Cutting edge: inflammatory signaling by Borrelia burgdorferi lipoproteins is mediated by toll-like receptor 2. J Immunol. 1999;163(5):2382–2386. [PubMed] [Google Scholar]
- 42.Jutras BL, et al. Borrelia burgdorferi peptidoglycan is a persistent antigen in patients with Lyme arthritis. Proc Natl Acad Sci U S A. 2019;116(27):13498–13507. doi: 10.1073/pnas.1904170116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Hornung V, et al. Quantitative expression of toll-like receptor 1-10 mRNA in cellular subsets of human peripheral blood mononuclear cells and sensitivity to CpG oligodeoxynucleotides. J Immunol. 2002;168(9):4531–4537. doi: 10.4049/jimmunol.168.9.4531. [DOI] [PubMed] [Google Scholar]
- 44.Jiang S, et al. TLR10 is a negative regulator of both MyD88-dependent and -independent TLR signaling. J Immunol. 2016;196(9):3834–3841. doi: 10.4049/jimmunol.1502599. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Oosting M, et al. Human TLR10 is an anti-inflammatory pattern-recognition receptor. Proc Natl Acad Sci U S A. 2014;111(42):E4478–E4484. doi: 10.1073/pnas.1410293111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Lee SM, et al. Toll-like receptor 10 is involved in induction of innate immune responses to influenza virus infection. Proc Natl Acad Sci U S A. 2014;111(10):3793–3798. doi: 10.1073/pnas.1324266111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Mulla MJ, et al. Cutting-edge report: TLR10 plays a role in mediating bacterial peptidoglycan-induced trophoblast apoptosis. Am J Reprod Immunol. 2013;69(5):449–453. doi: 10.1111/aji.12065. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.D’Arco C, et al. Borrelia burgdorferi-specific IgA in Lyme disease. EBioMedicine. 2017;19:91–97. doi: 10.1016/j.ebiom.2017.04.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Giacani L, Lukehart SA. The endemic treponematoses. Clin Microbiol Rev. 2014;27(1):89–115. doi: 10.1128/CMR.00070-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Carlson JA, et al. The immunopathobiology of syphilis: the manifestations and course of syphilis are determined by the level of delayed-type hypersensitivity. Am J Dermatopathol. 2011;33(5):433–460. doi: 10.1097/DAD.0b013e3181e8b587. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Lukehart SA, et al. Characterization of lymphocyte responsiveness in early experimental syphilis. II. Nature of cellular infiltration and Treponema pallidum distribution in testicular lesions. J Immunol. 1980;124(1):461–467. [PubMed] [Google Scholar]
- 52.Radolf JD, Kumar S. The Treponema pallidum outer membrane. Curr Top Microbiol Immunol. 2018;415:1–38. doi: 10.1007/82_2017_44. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Jiang R, et al. Single-cell repertoire tracing identifies rituximab-resistant B cells during myasthenia gravis relapses. JCI Insight. 2020;5(14):136471. doi: 10.1172/jci.insight.136471. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Vander Heiden JA, et al. pRESTO: a toolkit for processing high-throughput sequencing raw reads of lymphocyte receptor repertoires. Bioinformatics. 2014;30(13):1930–1932. doi: 10.1093/bioinformatics/btu138. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Butler A, et al. Integrating single-cell transcriptomic data across different conditions, technologies, and species. Nat Biotechnol. 2018;36(5):411–420. doi: 10.1038/nbt.4096. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Zheng GX, et al. Massively parallel digital transcriptional profiling of single cells. Nat Commun. 2017;8:14049. doi: 10.1038/ncomms14049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Gupta NT, et al. Hierarchical clustering can identify B cell clones with high confidence in Ig repertoire sequencing data. J Immunol. 2017;198(6):2489–2499. doi: 10.4049/jimmunol.1601850. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Nouri N, Kleinstein SH. A spectral clustering-based method for identifying clones from high-throughput B cell repertoire sequencing data. Bioinformatics. 2018;34(13):i341–i349. doi: 10.1093/bioinformatics/bty235. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Virtanen P, et al. SciPy 1.0: fundamental algorithms for scientific computing in Python. Nat Methods. 2020;17(3):261–272. doi: 10.1038/s41592-019-0686-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60. doi: 10.1101/2020.05.30.124446. Hoehn KB, et al. Phylogenetic analysis of migration, differentiation, and class switching in B cells [preprint]. Posted on bioRxiv May 31, 2020. [DOI] [PMC free article] [PubMed]
- 61.Paradis E, Schliep K. ape 5.0: an environment for modern phylogenetics and evolutionary analyses in R. Bioinformatics. 2019;35(3):526–528. doi: 10.1093/bioinformatics/bty633. [DOI] [PubMed] [Google Scholar]
- 62.Yu GC, et al. GGTREE: an R package for visualization and annotation of phylogenetic trees with their covariates and other associated data. Methods Ecol Evol. 2017;8(1):28–36. doi: 10.1111/2041-210X.12628. [DOI] [Google Scholar]
- 63. doi: 10.1101/397588. Linderman GC, et al. Zero-preserving imputation of scRNA-seq data using low-rank approximation [preprint]. Posted on bioRxiv August 22, 2018. [DOI]
- 64.Storey JD, Tibshirani R. Statistical significance for genomewide studies. Proc Natl Acad Sci U S A. 2003;100(16):9440–9445. doi: 10.1073/pnas.1530509100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Kuleshov MV, et al. Enrichr: a comprehensive gene set enrichment analysis web server 2016 update. Nucleic Acids Res. 2016;44(W1):W90–W97. doi: 10.1093/nar/gkw377. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.