Skip to main content
Nature Portfolio logoLink to Nature Portfolio
. 2024 Apr 24;21(5):846–856. doi: 10.1038/s41592-024-02255-0

De novo identification of CD4+ T cell epitopes

Paul M Zdinak 1,2,3, Nishtha Trivedi 1,2,#, Stephanie Grebinoski 1,3,#, Jessica Torrey 1,2, Eduardo Zarate Martinez 1,2,4, Salome Martinez 1,2, Louise Hicks 1,2, Rashi Ranjan 1,2, Venkata Krishna Kanth Makani 1,2, Mary Melissa Roland 1,2, Lyubov Kublo 1,2, Sanya Arshad 1,2, Mark S Anderson 5, Dario A A Vignali 1,6,7, Alok V Joglekar 1,2,7,
PMCID: PMC11093748  PMID: 38658646

Abstract

CD4+ T cells recognize peptide antigens presented on class II major histocompatibility complex (MHC-II) molecules to carry out their function. The remarkable diversity of T cell receptor sequences and lack of antigen discovery approaches for MHC-II make profiling the specificities of CD4+ T cells challenging. We have expanded our platform of signaling and antigen-presenting bifunctional receptors to encode MHC-II molecules presenting covalently linked peptides (SABR-IIs) for CD4+ T cell antigen discovery. SABR-IIs can present epitopes to CD4+ T cells and induce signaling upon their recognition, allowing a readable output. Furthermore, the SABR-II design is modular in signaling and deployment to T cells and B cells. Here, we demonstrate that SABR-IIs libraries presenting endogenous and non-contiguous epitopes can be used for antigen discovery in the context of type 1 diabetes. SABR-II libraries provide a rapid, flexible, scalable and versatile approach for de novo identification of CD4+ T cell ligands from single-cell RNA sequencing data using experimental and computational approaches.

Subject terms: Adaptive immunity, Antigen processing and presentation, Autoimmunity


SABR-IIs are engineered bifunctional receptors that enable CD4+ T cell antigen discovery.

Main

A hallmark of the adaptive immune system is the ability to raise antigen-specific responses. This is accomplished for αβT cells through the T cell receptor (TCR), which comprises TCRα and TCRβ chains1. Specifically, TCRs from CD4+ T cells recognize peptide epitopes on MHC-II or human leukocyte antigen (HLA)-II. The estimated size of the mature TCR repertoire is 108–1010 unique TCRs in mice and 109–1012 unique TCRs in humans24. Recognition of foreign antigens such as those from SARS-CoV-2 and tumor neoantigens by CD4+ T cells leads to their protective function5,6. On the other hand, recognition of self-antigens such as insulin in type 1 diabetes (T1D), leads to pathogenic CD4+ T cell responses7,8. Furthermore, regulatory T cells can bind to self-antigens and prevent autoimmunity9. The specificity of CD4+ T cells is key to their function, highlighting a need for antigen discovery approaches tailored for MHC-II and HLA-II10.

Traditionally, antigen-specific CD4+ T cells have been studied using functional assays that measure proliferation, cytokine release or cytotoxicity1114. These assays are sensitive but are limited to investigating tens of peptides simultaneously. Techniques such as barcoded tetramers can efficiently detect antigen-specific T cells but are limited to the interrogation of 100s of specificities simultaneously1520 and are further limited by the instability of multimers and lower affinities of CD4+ TCRs21,22. Unbiased approaches such as yeast display and combinatorial peptide libraries have been used to identify epitopes de novo, but these methods often identify nonphysiological epitopes (altered peptide ligands or mimotopes), are highly laborious, and in the case of yeast display, rely on soluble TCR generation2326. Cell-based methods are emerging approaches for TCR-directed antigen discovery. These methods preserve physiological TCR–pMHC interactions, can present large and defined epitope libraries and do not require substantial a priori knowledge of antigen specificity2732. The interchangeability between approaches for MHC-I and MHC-II is not trivial. The utility of cell-based, MHC/HLA-II, antigen discovery was demonstrated by Kisielow et al. using pMHC–TCR (MCR-TCR)28,33,34, which allowed for the identification of cognate epitopes by iterative screening against libraries encoded through complementary DNA or defined libraries34. More recently, TScan-II was deployed for antigen discovery of CD4+ T cells but requires separately engineered antigen-presenting cells (APCs)35.

With the increasingly widespread use of single-cell RNA sequencing (scRNA-seq) to interrogate T cell responses, it is paramount that T cell antigen discovery methods can be scaled to investigate tens to 100s of TCRs rapidly36. Recently, several algorithms for computational antigen discovery have been reported, including grouping of lymphocyte interactions by paratope hotspots (GLIPH/GLIPH2), distance measure on space of TCRs that permits clustering and visualization (tcrdist/tcrdist3) and clonotype neighbor graph analysis (CoNGA)3739. These algorithms identify TCR specificity groups comprising TCRs that share sequence similarity and/or motifs and are therefore predicted to share antigenic specificity. Recently, ‘reverse epitope discovery’ has been explored to leverage large datasets for comparison of TCR amino acid similarity40. Ultimately, Rosati et al. were able to identify public, immunodominant CD4+ T cell responses across 59 individuals; however, it remains challenging to predict the antigens of private clonotypes in private datasets, highlighting the need for high-throughput methods that synergize both experimental and computational approaches10.

Here we showcase a combination of several methodological advances in applying experimental and computational tools for antigen discovery. First, we report a modular cell-based method for antigen discovery using signaling and antigen-presenting bifunctional receptors to encode MHC/HLA-II molecules presenting covalently linked peptides (SABR-IIs) for mouse and human CD4+ T cells. Second, we show de novo identification of epitope specificities of TCRs derived from scRNA-seq data in a mouse model of T1D. Finally, we demonstrate that experimental antigen discovery can be amplified post hoc by computational approaches. Together, we have developed an experimental and computational workflow to rapidly de-convolute the specificity of scRNA-seq-derived CD4+ T cells de novo.

Results

Signaling and antigen-presenting bifunctional receptors II

We have previously described SABRs, which are chimeric receptors containing an extracellular pMHC complex attached to an intracellular CD28-CD3ζ signaling domain. We demonstrated that SABRs can read out TCR–pMHC interactions, allowing the construction of SABR libraries for antigen discovery for class I HLA alleles27. We sought to expand this platform to allow antigen discovery for MHC/HLA-II with seamless integration with class I alleles. Here, we created SABRs to present epitopes in MHC-II alleles, by covalently linking the epitope to the β-chain of MHC-II that is attached to the CD28-CD3ζ signaling domains downstream, along with a 2A peptide-linked MHC-II α-chain (Fig. 1a,b). To test whether SABR-IIs could present epitopes to TCRs and induce a signal, we expressed them using lentiviral vectors in NFAT–GFP Jurkat cells, which express green fluorescent protein (GFP) upon NFAT activation and translocation downstream of CD3ζ activation (a kind gift from Y. Chen and A. Weiss). We constructed murine SABR-IIs presenting epitopes in I-Ab, I-Ad and I-Ag7 (Ova, ISQAVHAAHAEINEAGR41; ATEG, ATEGRVRVNSAYQDK42; and 2.5mimo, YVRPLWVRME43, respectively). We co-incubated the SABR-II-expressing NFAT–GFP Jurkat cells with a separate population of Jurkat cells expressing either the BDC2.5 TCR (recognizes I-Ag7-2.5mimo), OT-II TCR (recognizes I-Ab-Ova), 5-4-E8 TCR (recognizes I-Ad-ATEG) or no TCR. Robust GFP and CD69 expression in SABR-II-expressing NFAT–GFP Jurkat cells was observed 18–20 h later in only the correctly paired assays (Fig. 1c and Extended Data Fig. 1a,b). The signal from the NFAT–GFP reporter offered minimal background in absence of a cognate TCR and correlated with surface SABR expression in the presence of a cognate TCR (Extended Data Fig. 1c–e). To demonstrate the application of SABR-IIs for human antigen discovery, we generated SABR-IIs to present the InsB9:23 epitope (SHLVEALYLVCGERG) in HLA-DQ8 (DQA1*0301:DQB1*0302, an HLA-II allele that is associated with increased risk of T1D and celiac disease44,45). We confirmed the ability of the DQ8-InsB9:23 SABR-II to present the epitope to two previously described, T1D patient-derived TCRs GSE.6H9 and GSE.20D11 (ref. 46). As expected, a high frequency of GFP+CD69+ cells were found only when the TCRs interacted with the InsB9:23 epitope and not a control hen egg lysozyme epitope (Fig. 1d and Extended Data Fig. 1f,g).

Fig. 1. SABR-IIs identify cognate TCR–pMHC interactions for antigen discovery.

Fig. 1

a, A schematic of SABR-II constructs. b, Signaling directionality between pMHC:TCR (left) and a SABR-II:TCR (right) c, Representative and summary plots for GFP and CD69 expression from SABR-II-expressing NFAT–GFP Jurkat cells after culture with TCR-expressing Jurkat cells. The bar graph indicates the mean of two technical replicates (dots). d, SABR co-incubation of Jurkat cells expressing either the GSE.6.H9 or GSE.20.D11 TCR against NFAT–GFP Jurkat cells expressing InsB9:23 or hen egg lysozyme (HEL) in HLA-DQ8 SABR-IIs. Mean and s.d. are plotted from three biological replicates. e, Schematic (top) of the ES metric used for putative hit-calling in SABR-II screens. Cartoon ES plot (bottom) of a SABR screen where putative hits (dots/circles) will fall in high- (green) and low-confidence (orange) zones based on positive control TCR–pMHC interactions. f,g, ES plots from I-Ag7 SABR-II library screens of the BDC2.5 TCR from eight biological replicates. The green and orange lines indicate the high- and low-confidence ES zones, respectively. In f, each dot represents the mean for each epitope with s.d. In g, the bar represents the mean with each biological replicate plotted as a point for the top 22 putative hits (x axis). h, ES plot for screen of the GSE.20.D11 TCR against both HLA-A2.1 and HLA-DQ8 SABR libraries simultaneously. Reads were mapped to the DQ8 library and ES was calculated for these epitopes. The InsB9:23 epitope is highlighted by the larger red dot.

Source data

Extended Data Fig. 1. SABR expression and signaling.

Extended Data Fig. 1

A. Representative expression of murine TCRβ (Bdc2.5 TCR) and I-Ag7-2.5mimo SABR-II after transduction of Jurkat and NFAT-GFP-Jurkat cells respectively. B. Representative flow cytometry plots of SABR-II expressing NFAT-GFP-Jurkat cells after co-incubation with TCR-expressing Jurkats. The TCRs and SABRs are indicated by rows and column names respectively. C. TCR3 expressing or mock Jurkats were co-incubated against NFAT-GFP-Jurkats cells expressing the I-Ag7 SABR presenting the QVEQLELNAARDPN HIP (SABR1). The GFP MFI was plotted (y-axis) as dots with s.d. (error bars) from technical duplicates against the I-Ag7 levels binned by half logs of the I-Ag7 labeling MFI (x-axis, bins depicted in D). D. Gating strategy to generate the half log bins of I-Ag7 expression in panel C E. Representative pseudo color dot plot depicting the I-Ag7 levels (x-axis) of SABR1 NFAT-GFP-Jurkats co-incubated against TCR3 expressing Jurkats depicted in panel A. The pseudo coloring depicts the GFP MFI for each given event. F. Representative expression of the HLA-DQ8 SABR-II after transduction of NFAT-GFP-Jurkat cells. G. Representative flow cytometry plots of SABR-II expressing NFAT-GFP Jurkat cells after co-incubation with TCR-expressing Jurkats. The respective TCRs and SABRs are indicated by rows and columns respectively.

Source data

To test the compatibility between human and mouse cells for the function of SABR-IIs, we performed co-incubation assays using 5KC cells (a mouse thymoma line, which was a kind gift from M. Nakayama). We observed that SABR-II–TCR interactions were retained irrespective of the host species (Extended Data Fig. 2a,b). Furthermore, we demonstrated that SABR-IIs consisting of B cell signaling domains (CD79A and CD79B), could also signal through NFAT (Extended Data Fig. 2c,d). As a further demonstration of the modularity of the SABR-II design and its potential for deployment in professional APCs, we expressed SABR-IIs containing either the CD28-CD3ζ or CD79A/B domains in Daudi B cells. We observed that the cognate interaction of both the SABRs with their TCRs resulted in upregulation of surface FAS on Daudi cells, showing that the SABR-II platform can signal in professional APCs4749 (Extended Data Fig. 2e,f).

Extended Data Fig. 2. Modularity of SABR-IIs.

Extended Data Fig. 2

A. Schematic (color coded by cell type) for corresponding co-incubation assays demonstrating SABR-IIs in both human (Jurkat) and murine (5KC) cell lines. B. OT-II TCR-expressing Jurkat cells against NFAT-GFP-Jurkats expressing the I-Ab-OVA SABR (left) and OT-II expressing 5KC cells against 5KC cells expressing the I-Ab-OVA SABR (right). Bars represent means from two biological replicates (dots). C. Schematic for corresponding co-incubation assays in NFAT-GFP-Jurkat cells demonstrating SABR-IIs with CD28-CD3z or CD79A and CD79B signaling domains. D. Representative flow plots (left) of OT-II TCR Jurkats co-incubated against NFAT-GFP-Jurkats expressing SABR-IIs with either CD28-CD3z(top) or CD79A and CD79B signaling domains. GFP expression was quantified after 24 hr co-incubation (right) with bars depicting the mean and s.d. (error bars) from 3 biological replicates (dots). E. C. Schematic for corresponding co-incubation assays in Daudi Cells demonstrating SABR-IIs with CD28-CD3z or CD79A and CD79B signaling domains. F. Representative dot plots (left) of Bdc2.5 TCR Jurkats co-incubated against Daudi cells expressing SABR-IIs with either CD28-CD3ζ or CD79A and CD79B signaling domains. The percentage of SABR-II+ Daudi cells expressing FAS was quantified by flow cytometry after 72 hr with bars depicting the mean and s.d. (error bars) from 3 biological replicates (dots).

Source data

We then asked whether SABR-IIs could be used to present a library of epitopes for CD4+ T cell antigen discovery. To that end, we constructed a SABR-II library to present epitopes derived from pancreatic islets in I-Ag7 by curating a list of 4,075 published epitopes from the Immune Epitope Database (iedb.org)50 and a study by Wan et al.51 (Supplementary Table 1). Of note, this defined library consisted of unmodified epitopes from endogenous proteins, synthetic mimotopes, deamidated epitopes and hybrid insulin peptides (HIPs) that arise from post-translational fusion and are not genetically encoded in vivo52,53. The epitope library was inserted into the I-Ag7 SABR-II backbone through pooled oligonucleotide synthesis, amplification and ligation-free cloning (Extended Data Fig. 3a). The I-Ag7 SABR-II-library was then expressed in NFAT–GFP Jurkat cells. We confirmed that after sequencing, the library accounted for a mean of 708 reads per epitope (Extended Data Fig. 3b). As a proof of concept, we performed co-incubation assays with Jurkat cells expressing the BDC2.5 TCR and sorted the top 1–2% of GFP+CD69+ cells at a rate of ~20 min per replicate with three replicates per TCR. We extracted the genomic DNA from sorted cells, amplified the SABR portion of the integrated proviruses and subjected the amplicons to Illumina sequencing (Extended Data Fig. 3c,d). The 1–2% sort gate represents >50-fold enrichment of cognate epitopes with minimal loss of signal (Extended Data Fig. 3e–h). Sequence reads were aligned to the I-Ag7 SABR-II backbone and the corresponding epitopes were scored based on their read counts. For each TCR under investigation, an enrichment score (ES) was determined for all the epitopes in a library. In each experiment, three replicates of a sort with TCR-expressing Jurkat cells were performed and reads were counted post-sequencing. In addition, three replicates of the unsorted library were also sequenced. A linear regression model was built using the unsorted library counts and used to determine the expected abundance of each epitope in the library. The ES was calculated based on the difference between the measured and the expected abundance of each epitope on a per-TCR basis (Fig. 1e). Based on ES values, two quantitative thresholds were used to determine putative cognate epitopes of a given TCR. A high-confidence zone containing clear outliers with a high ES and a low-confidence zone containing weak outliers with a moderately high ES were determined (Fig. 1e). This two-tiered strategy was used to call putative hits from screens. All the top-scoring epitopes for the BDC2.5 TCR were known BDC2.5 ligands containing the WXRM(D/E) motif (Fig. 1f,g, enriched ligands in red), a well-characterized trait of the BDC2.5 TCR43,5456. Across several independent experiments there was limited variation in ES values for the same TCRs and several epitopes fell into high- or low-confidence zones consistently (Extended Data Fig. 4a). Using a different TCR that was isolated from NOD mice, 4-8Ins57, which recognized the InsB9:23 epitope (SHLVEALYLVCGERG), we observed a similar pattern of ES for cognate epitopes (Extended Data Fig. 4b).

Extended Data Fig. 3. SABR-II library screening.

Extended Data Fig. 3

A. Schematics of the SABR-II library cloning and PCR strategy used for targeted reamplification of gDNA from sorted SABR-II library cells. B. The average read count in the libraries across 8 independent experiments is shown (blue dots) with s.d. (error bars). The x-axis denotes the epitope number (ordered in a descending order of mean read counts). C. Representative flow cytometry plots for SABR-II screen sorts. NFAT-GFP-Jurkat cells expressing the SABR-II library were labeled with cell trace violet, gated, and subsequently used to select top 1–2% of GFP/CD69 double positive cells for sorting. D. PCR indexing strategy for epitopes from gDNA for sequencing of SABR-II screens. E-H. NFAT-GFP-Jurkat cells expressing the I-Ag7 library, which contains no TCR3 target epitopes, were labeled with cell trace far red (CTFR+), and NFAT-GFP-Jurkat cells expressing the TCR3 targeted single SABR-II (QVEQLELNAARDPN) were labeled with cell trace violet (CTV+). The two cell types were mixed at decreasing ratios of the TCR3 targeted SABR-II and incubated against TCR3 expressing Jurkats at a 1:1 Jurkat to NFAT-GFP-Jurkat ratio. E. Gating strategy to identify sensitivity and enrichment of target cells in SABR-II library sort gate. Cells were partitioned by cell trace label then separately analyzed for proportion that fall into the sort gate (1–2% CD69+GFP+ NFAT-GFP-Jurkat cells) in the far-right panel. F. Relative proportion of TCR3 targeted SABR-II cells and library cells in the assay that are captured in the sort gate where bars indicate mean from 3 technical replicates. The x-axis indicates the level of spike in of the TCR3 target single SABR-II. G. Left panel shows the fold enrichment (mean of 3 technical replicates) of spiked in TCR3 target single SABR-II cells after sorting. The x-axis indicates the proportion of target cells spike in as in panel F. The right panel shows the pre- and post-sort proportion of spiked in TCR3 target single SABR-II cells. H. Gating strategy to specifically identify specificity of sort gate. Cells are partitioned by cell trace labeling after their appearance in the sort gate. Right panel shows quantification of the proportion of the TCR3 targeted single SABR-II cells that makeup the total sorted cells vs the untargeted library where bars indicated the mean percentage of total sorted cells from two technical duplicates (dots) across the % of target cells spiked in pre-sort (x-axis).

Source data

Extended Data Fig. 4. Enrichment score plots from I-Ag7 Library Validations.

Extended Data Fig. 4

A. The enrichment score plots for each of 8-independent replicate screen of the BDC2.5 TCR against the I-Ag7-SABR-II library. The resulting high and low confidence thresholds are denoted by green and orange lines respectively. B. The enrichment score for each of 3-independet replicate screen of the 4–8Ins TCR against the SABR-II library. The same high and low confidence thresholds as maintained from experiments in panel A. For every plot each dot represents a single epitope spread across the x-axis and red dots indicate putative cognate epitopes selected for validation.

Source data

To test whether HLA-DQ8 SABR-II could be used for antigen discovery, we curated a list of insulin B, insulin C and HIP epitopes published by Wiles et al.58 and cloned them into the DQ8 SABR-II backbone using the same pooled cloning strategy as the I-Ag7 SABR-II library (Supplementary Table 2). Furthermore, we combined SABR-I (the HLA-A*0201 library reported in our previous work27) and SABR-II libraries at a cellular level and screened against the GSE.20D11 TCR. As expected, the cognate epitope of the GSE.20D11 TCR, SHLVEALYLVCGERG (red), was enriched at a high confidence level from a combined class I and II library (Fig. 1h). This demonstrates that a combined library approach using the SABR platform can be implemented to increase throughput. Together, these results demonstrate the ability of SABR-IIs to successfully read out pMHC-II–TCR interactions across species and cell types and serve as a method for CD4+ TCR antigen discovery.

Single-cell profiling of islet-infiltrating CD4+ T cells

We sought to apply SABR-II libraries to identify the specificities of islet-infiltrating CD4+ T cells in NOD mice. Although NOD mice recapitulate many features of T1D and share several autoantigens with individuals with T1D53,5961, the overall antigenic landscape of islet-infiltrating CD4+ T cells in NOD mice remains undefined. Therefore, we performed scRNA-seq with V(D)J enrichment on T cells from individual pancreatic islets of 6-, 8- and 10-week-old NOD mice. We sorted Thy1.2+TCRβ+ T cells from 3–4 mice at each time point, combined them using TotalSeq cell-hashing oligonucleotides and proceeded to scRNA-seq using the 10x Genomics platform. In total, T cells from 11 mice were sequenced in three batches and the data were pooled for analysis. Hierarchical clustering in Seurat62, followed by bioinformatic gating on CD4+ T cells and re-clustering, revealed seven distinct CD4+ T cell clusters with no obvious bias between mice (Fig. 2a and Extended Data Fig. 5a,b). Next, we integrated TCR clonotypes with the transcriptomes using scRepertoire63 and identified the clonally expanded populations of CD4+ T cells (Fig. 2b). Clonal expansion was categorized as single (one clone per TCR), low (2–9 clones per TCR) or medium (≥10 clones per TCR). Clonal expansion was evident in clusters 0 and 3–6 (Fig. 2c). Generally, clonal expansion correlated with the expression of activation and exhaustion markers (Nkg7, Ccl5, Lag3 and Tigit), whereas naive T cell markers (Sell and Ccr7) coincided with un-expanded populations. We reasoned that clonally expanded cells within the islets were the most likely to target islet antigens and contribute to β-cell destruction. Therefore, we used clonal expansion as the sole criterion for selecting TCRs for antigen discovery. Overall, clonally expanded TCRs showed a slight skew toward certain Vα and Vβ alleles (Extended Data Fig. 5c–e), as has been reported previously64,65. Notably, expanded clones did not segregate solely based on their gene expression as indicated by the high degree of clonal sharing between CD4+ TCR clusters determined by the Morisita–Horn Index (Fig. 2d and Extended Data Fig. 5f). Clonally expanded TCRs showed increased expression of Lag3, similar to a restrained CD8+ T cell phenotype that was reported previously in NOD mice66. Further investigations into the transcriptional signatures of expanded T cells were reported previously67. Specifically, we identified 35 clonally expanded TCRs for screening, corresponding to 19 TCRs from three 8-week-old mice and 16 TCR from two 10-week-old mice (Supplementary Table 3). We reconstructed the TCRs using a home-brewed Python script that reconstructs full TCRα/β chains using the IMGT TCR allele dataset (Methods)68. The reconstructed TCR genes were synthesized through commercial vendors and subcloned into the pMIG-II–IRES–GFP vector containing a partial Cβ-chain derived from the BDC2.5 TCR. TCRs in the pMIG-II vector were then packaged intro retroviruses and expressed in Jurkat cells. Surface expression was confirmed by staining for murine TCRβ followed by flow cytometry. For TCRs with low transduction levels, we enriched the TCRβ+ population using either fluorescence-activated cell sorting or magnetic selection and proceeded with antigen discovery with SABR-II libraries (Fig. 2e).

Fig. 2. Single-cell RNA sequencing of islet-infiltrating CD4+ T cells.

Fig. 2

a, Uniform Manifold Approximation and Projection (UMAP) representations of islet-infiltrating CD4+ T cells from 6-, 8- and 10-week-old NOD mice. Hierarchical clusters generated by Seurat are shown in different colors and numbered. b, Overlay of clonal expansion on the gene expression cluster UMAP plot. Gray dots represent cells with unique clonotypes, light blue dots represent low (2–9 clonotypes) expansion, dark blue dots represent high (≥10 clonotypes) expansion. c. Dot plot of expression of select T cell markers by cluster. Left bar graph depicts cell number on x axis with colors to denote clone size from b and differential gene expression of select genes across clusters. d, Morisita–Horn index plot comparing all TCR sequences across each cluster. e, Schematic (top) of TCR cloning strategy into pMIG-II backbone along with representative flow cytometry plots (bottom) of murine TCR levels before and after enrichment in Jurkat cells.

Source data

Extended Data Fig. 5. Single-cell RNA-Sequencing of islet-infiltrating CD4+ T cells from NOD mice.

Extended Data Fig. 5

A. Hierarchical clustering of total T cells across 11 mice from 6-, 8-, and 10-week-old time points. B. Hierarchical clustering of CD4+ T cells from individual mice across 6-, 8-, and 10-week time points. C. Projection of top 40 expanded CD4+ T cell clones from 8-, and 10-week-old NOD mice onto Seurat clusters using scRepertoire. D. Distribution of top 40 expanded CD4+ TCR sequences across all mice. E. TRAV (top) and TRBV (bottom) usage from top 40 expanded CD4+ TCR sequences across all mice. F. Morisita-Horn index comparing expanded TCR clones across each mouse individually.

Source data

Identifying cognate epitopes of CD4+ TCRs de novo

We performed systematic screening of the cloned TCRs against the I-Ag7 SABR-II library. Several TCRs along with a positive control (such as BDC2.5 or 4-8Ins) were screened individually against the library for each sort (Extended Data Fig. 6a and Supplementary Table 4). High- and low-confidence ES zones for each screened TCR were defined by the ES values of the control TCR’s cognate epitopes. For all putative cognate epitopes, single SABR-IIs were constructed, expressed in NFAT–GFP Jurkat cells and used for co-incubation with the corresponding TCRs. Co-incubation assays that yielded a GFP signal higher than that obtained in assays with no TCR were determined to be positive and the epitopes deemed true cognate ligands (Extended Data Fig. 6b,c). Using this strategy, we obtained epitopes in the high-confidence zone for eight TCRs (Fig. 3a and Extended Data Fig. 6b,d). Among numerous altered peptide ligands (APLs), these TCRs recognized the physiological InsC-ChgA HIP (LQTLALWSRMD and analogs, recognized by TCR5, TCR6B, TCR9 and TCR34), InsC-Iapp HIP (LQTLALNAARDP and analogs, recognized by TCR4 and TCR15) and InsB9:23 (SHLVEALYLVCGERG and analogs recognized by TCR24 and TCR37). These cognate high-confidence hits were validated using single SABR-II co-incubation assays (Fig. 3b and Extended Data Fig. 7a). Further validations using in vitro mouse interleukin-2 (mIL-2) secretion by TCR-expressing 5KC reporter cells13 or CD25 expression by TCR-expressing splenic CD4+ T cells upon stimulation with the cognate epitope were performed (Extended Data Fig. 7b,c). Furthermore, low-confidence hits were called for ten TCRs and tested in co-incubation assays. Upon co-incubations, two out of the ten TCRs (TCR11 and TCR30) showed confirmation of reactivity, both recognizing InsB9:23 (SHLVEALYLVCGERG and analogs; Extended Data Fig. 7d and Fig. 3b). Notably, visualization of the cells corresponding to each de-convoluted TCR clone did not reveal overt differences in the transcriptional phenotype of cells recognizing the three different antigens (Fig. 3c). Taken together, these results indicate that SABR-II libraries can successfully identify cognate epitopes of CD4+ TCRs among thousands of epitopes for TCR-directed antigen discovery, starting simply from a TCR sequence with little a priori knowledge.

Extended Data Fig. 6. Representative SABR-II screens and hit validation.

Extended Data Fig. 6

A. Representative flow sort gating for cell trace violet labeled NFAT-GFP-Jurkat cells expressing the I-Ag7-SABR-II library after co-incubation with Jurkat cells expressing TCRs. Top 1–2% of cells expressing GFP and CD69 were sorted as is shown in the two rightmost panels (gate is constant across panels). B. ES plots from a single sort of 6-TCRs individually. High and low confidence thresholds are denoted by green and orange lines respectively. The colored arrows indicate putative hits tested for validation with single SABR-II assays depicted in panel C. C. Single SABR-II co-incubations for validation of putative hits from the screens in panel B where single SABR-II expressing NFAT-GFP-Jurkat cells were co-incubated against Jurkats expressing the TCR of interest and assayed 18–20 hr later by flow cytometry. Bars indicate means from 2 technical replicates (dots). Top panel depicts non-validated epitope, middle and bottom depict validated synthetic altered peptide ligands (APLs) and physiological epitopes. D. ES plots for TCRs screened against the I-Ag7-SABR-II library that yielded high-confidence putative hits grouped by the highest non-APL epitope. The same high and low confidence thresholds are used from plots generated in Extended Data Fig. 4.

Source data

Fig. 3. De novo identification of cognate epitopes for expanded CD4+ T cells.

Fig. 3

a, Representative SABR-II screen results for three TCRs are shown. Putative non-APL hits are indicated with the epitope sequences and larger red dots. Each dot represents an epitope in the library. The dotted lines indicate threshold for calling putative hits. b, Single SABR-II co-incubation assays with TCR-expressing Jurkat cells against NFAT–GFP Jurkat cells expressing SABR-IIs presenting a single epitope (as indicated). GFP+CD69+ cells in co-incubation assays at 18 h are quantified. Bars show mean and s.d. from three biological replicates. c, Projection of antigen-specific CD4+ T cell clones onto the UMAP plots of islet-infiltrating CD4+ T cells. TCRs are color coded and their cognate epitopes are indicated in the plot title.

Source data

Extended Data Fig. 7. Putative hit validation for high and low confidence hits.

Extended Data Fig. 7

A. Representative flow plots from the single SABR-II validations of the putative hits for high-confidence, non-APL hits. B. Murine IL-2 ELISA from 24 hr co-incubation of 5KC cells expressing TCR37 with NFAT-GFP-Jurkat cells expressing the InsB9:23 epitope where bars represent mean of IL-2 sectretion into supernatant from two technical duplicates (dots). C. CD25 expression measured on primary murine CD4+ T cells expressing TCR15 after 24 hr co-incubation with Bone Marrow Dendritic Cells pulsed with either 1μg InsC-IAPP peptide (LQTLALNAARDP) or no peptide. D. ES plots with arrows denoting epitopes tested in the corresponding single SABR-II co-incubations. The same high and low confidence thresholds are taken from Extended Data Fig. 4. The inset plots show single SABR-II validation assays where bars indicate the mean from 2–3 technical replicates (dots). The arrow colors match the epitopes within each inset plot.

Source data

TCR similarity predictions amplify antigen discovery

We hypothesized that computational grouping of TCR specificities may reveal closely related TCRs that potentially recognize the same epitope(s), similar to the reverse epitope discovery approach (Fig. 4a). In the absence of experimental antigen discovery, grouping of TCRs is not informative of reactivity; however, we hypothesized that TCRs that co-clustered with SABR-II de-convoluted TCRs bind to the same antigens. To test this, we used three TCR-similarity search algorithms: GLIPH2 (refs. 38,69), tcrdist3 (ref. 37) and CoNGA39. All three algorithms take slightly different approaches to group TCR sequences and generate clusters of TCR sequences that share high sequence similarity. In addition, CoNGA considers the transcriptional similarities among T cell clones. Using CoNGA, we defined TCR clusters for two TCRs, TCR4 and TCR6B, and identified analogs that slightly differed in sequences. Moreover, for TCR30, we were able to identify six TCR analogs that co-clustered in CoNGA analysis as well as GLIPH2. For TCR11, we first identified a gene expression (GEX) cluster that had ~50 TCRs that clustered based on gene expression. Using tcrdist3, we calculated the relative distance of each of these TCRs from TCR11 and selected the top seven clonotypes for expression. Together, 16 TCRs were identified as analogs of the experimentally de-convoluted TCRs (Extended Data Fig. 8). These TCRs were cloned and expressed in Jurkat cells. We performed co-incubation assays using single SABR-IIs and observed that 5 of 16 TCRs recognized the same epitopes as the parental TCRs (Fig. 4b). As a result, we were able to identify the cognate epitopes of five additional TCRs from our dataset that had otherwise not been selected for SABR-II screening based on our clonal expansion cutoff. Notably, the computationally identified and experimentally validated TCRs shared similar phenotypes as the experimentally de-convoluted TCRs (Fig. 4c). Therefore, we demonstrated that computational TCR similarity determinations could amplify experimental antigen discovery, leading to the deconvolution of 16 private TCRs de novo.

Fig. 4. Computational prediction of antigen specificity amplifies SABR-II antigen discovery.

Fig. 4

a, Schematic of the implementation for computational prediction and validation of shared antigen specificity for TCRs from private datasets. Solid red and blue lines indicate experimentally de-convoluted TCRs, dashed blue and red lines indicate potential analogs that may share specificity with the experimental TCRs and gray lines indicate remaining TCRs. b, Single SABR assays for analogs of TCR4, TCR6B, TCR11 and TCR30 are shown. Bar plots depict mean of percentage GFP+CD69+ SABR-II NFAT–GFP Jurkat cells plotted from two technical replicates. c, Projection of antigen-specific TCRs onto the UMAP after validation of predicted antigen specificity (right) compared to original clones (left).

Source data

Extended Data Fig. 8. CoNGA, TCRdist results and identification of TCR analogs.

Extended Data Fig. 8

The top three row of the CoNGA panel shows the GEX and TCR clusters with phenotype marker expression in each cluster. The TCR logo panel shows TCR clusters with logo representations of the TCRs that form these clusters. The purple, red, and blue rectangles indicate the TCR4 CoNGA cluster, TCR30 CoNGA cluster, and TCR11 gene expression clusters respectively. Each cluster’s TCRs which were selected for validation are listed in the tables below with the parental TCR. TCRs which lack GEX clusters in the TCR30 analog table were selected for validation by similarity determined through GLIPH2. TCR6B is not depicted in the figure because of the minimum clone size requirement for CoNGA, TCR11 mouse IDs are lost during CoNGA processing.

Identifying new HIP epitopes using SABR-II libraries

Given the predominance of HIP-reactive TCRs, we hypothesized that there may be other TCRs that respond to HIPs that were not encoded in our initial library configuration. While the initial I-Ag7 SABR-II library consisted of a number of HIPs, HIP formation is thought to be more widespread in pancreatic β-cells52,70. Therefore, we sought to construct a defined, HIP-focused library to probe whether there were any undiscovered HIP-reactive TCRs that could be recognized by the clonally expanded TCR in our dataset. To test this, we utilized a published proteomic dataset, which predicted that several proteins that were highly expressed in secretory granules of β-cells may contribute to HIP formation58. Using their predictions, we built a theoretical HIP library, in which all possible ‘left’ halves of the insulin C chain derived from natural cleavage products were fused to ‘right’ halves derived from secretory granule proteins (Fig. 5a). This 2,561-epitope library (12–25 amino acids per epitope) consisted of only HIPs and a small number of positive control epitopes (Supplementary Table 5). We screened the top three clonally expanded TCRs (TCR1, TCR2 and TCR3) against this library, as these TCRs had not been de-convoluted using the original library. We did not observe any putative hits for TCR1 and TCR2; however, TCR3 yielded several high- and low-confidence hits, all of which have not previously been reported (Fig. 5b). To confirm that the HIP itself was important for the cognate interaction, we cloned single 14-mer epitopes into SABR-IIs consisting of seven amino acids of the left portion of the HIP and seven amino acids of the right portion of the HIP. In this way, no nine amino acids from either peptide sequence alone could occupy the binding pocket of I-Ag7, ensuring that TCR reactivity spanned the HIP junction71. Upon single SABR co-incubations, all but one of the tested hits for TCR3 showed reactivity (Fig. 5c). Of note, in all the epitopes that were tested, the ‘left’ half derived from insulin C was conserved, whereas the ‘right’ halves were derived from several other proteins (Fig. 5d). These results show that defined theoretical SABR-II libraries can be deployed for determining non-contiguous epitope reactivity as well as TCR promiscuity. Moreover, the promiscuous binding of TCR3 to HIPs corroborates evidence from other NOD mouse-derived TCRs reacting to multiple HIPs72.

Fig. 5. HIP target library for identification of TCR with new HIP specificity.

Fig. 5

a, Schematic of HIPs generated by the post-translational fusion between insulin and other secretory granule proteins. b, ES plot of TCR3 screen against the SABR-II HIP library. Six putative hits were selected (numbered and highlighted by the larger dots) for single SABR validation across the high (green dashed line) and low (orange dashed line) confidence zones. c, Single SABR assays of 14 amino acid HIPs spanning the fusion junctions incubated against TCR3 or TCR4 expressing Jurkat cells. Bar plots depict mean of percentage GFP+CD69+ SABR-II NFAT–GFP Jurkat cells plotted from two technical replicates (dots). d, The amino acid sequences of HIPs and their right-half source proteins for epitopes in c are indicated.

Source data

Technical advances afforded by SABR-II screens

Finally, we sought to address two important aspects of antigen discovery techniques. First, we assessed whether SABR-II screens can directly read out the strength of TCR–pMHC binding. To that end, we selected six known BDC2.5 ligands across a range of ES values (Extended Data Fig. 9a) and we measured the functional avidity of their recognition by BDC2.5 TCR in vitro. Bone-marrow-derived dendritic cells were pulsed with a range of concentrations of peptides corresponding to the epitopes and used to present the peptides to BDC2.5 TCR-expressing 5KC cells. Secretion of mIL-2 was measured by ELISA and used to determine the functional avidity as EC50 (concentration of the peptide needed to induce half-maximal mIL-2) (Extended Data Fig. 9b). We observed that there was a modest negative correlation between the EC50 values of the epitopes and their ES values (Extended Data Fig. 9c). These results indicate that ES values can provide a semi-quantitative readout of the strength of interactions between TCRs and their cognate epitopes. Second, we evaluated whether we could increase the throughput of SABR-II screens by multiplexing TCRs and libraries at the cellular level. We combined the two previously described I-Ag7 libraries in equal proportions according to their size and used it as a single library. We also employed a dropout strategy, in which a mixture of seven TCRs was screened in replicate, where one TCR was left out in each replicate. After single enrichment, we determined the mean ES of all replicates that contained a given TCR and used it to identify the cognate epitope of that TCR (Extended Data Fig. 10 and Supplementary Table 6). Using this strategy, we were able to successfully recapitulate the results for four out of four TCRs previously identified in individual screens. The use of such a strategy will greatly enhance the throughput or SABR-II screens by reducing the hands-on sort time from 1 h per TCR to 20 min per TCR. These results show features that have been uniquely demonstrated by SABR-II screens and should increase the throughput of antigen discovery.

Extended Data Fig. 9. SABR-II screens are semi-quantitative readouts of functional avidity.

Extended Data Fig. 9

A. Selected BDC2.5 epitopes for functional avidity measurements. As shown by blue arrows, 6 epitopes across range of ES were chosen from mean ES (bars) across 8 independent experiments (dots). B. Individual biological replicates of peptide pulsing experiments with BDC2.5 TCR-expressing 5KCs against selected peptides. The y-axis shows normalized murine IL-2 secretion across a range of peptide concentrations (x-axis). C. The mean (dots) and s.d. (error bars) of Log EC50 values (x-axis) from the 3 biological replicates in panel B plotted against the mean (dots) and s.d. of ES values across 8 biological replicates (y-axis). The r and p values for two-sided Pearson correlation (dashed line) are reported.

Source data

Extended Data Fig. 10. ES plots from TCR and library multiplexing screens.

Extended Data Fig. 10

ES plots for each multiplexed sort where the dots represent the mean ES for each epitope in replicates (n = 6) where the target TCR was not included (dropout). The red dots indicate the cognate epitope of each TCR that were previously de-convoluted using individual TCR screens against the library.

Source data

Discussion

Here, we report SABR-IIs for CD4+ T cell antigen discovery, providing a robust method for screening a large number (1,000s to 10,000s) of epitopes. SABR-IIs can identify TCRs rapidly and can semi-quantitatively read out TCR–pMHC binding strengths. We have also shown that other non-T cell types can also be used to detect cognate interactions, expanding antigen discovery to professional APC-based platforms. Notably, SABR-II libraries can easily encode for deamidation and HIP formation, which are both post-translational modifications. Through this approach, we identified several new HIPs that were targeted by islet-infiltrating T cells and demonstrate an HIP-focused cell-based library strategy.

Moreover, we demonstrate a robust pipeline for reconstructing TCRs from scRNA-seq data and identifying their epitopes. The ability to start from and reconstitute TCRα/β sequences means that precious human samples are not wasted and can be assayed using additional methods. Furthermore, starting from scRNA-seq has the built-in advantage of leveraging the transcriptional information for each clone of an identified specificity, not limited by a few phenotypic surface markers or agnostic of the T cell’s function altogether. While we have chosen to profile the top expanded T cell clones in this study, we envision that future efforts can be focused on specific phenotypes of interest, such as regulatory T cells. In this way, both the environment from which the T cells are sampled and the properties of the T cells themselves will help further shape hypothesis-driven antigen discovery in autoimmune diseases such as T1D.

The ability to amplify antigen discovery using related TCRs by leveraging existing computational methods not only validates their utility but generates a positive-feedback loop for increased repertoire profiling and validation of TCR specificity. This will lead to an overall enlargement of the known epitope-specific TCR repertoire and provide incorporation of orthogonally obtained datasets for de novo antigen discovery. Finally, SABR-IIs in conjunction with SABRs, allow parallel antigen discovery for CD4+ and CD8+ T cells within the same platform and experiments.

We do wish to highlight the current limitations of our technique. The SABR-II in its current iteration is similar to the MCR-TCR platform28,33, which encodes for a signal emanating from MHC-II. There are several design differences that confer different capabilities to SABR-IIs, namely, the ability to perform single enrichments on larger libraries, the ability to multiplex TCRs and the ability to screen for both class I and II alleles. Notably, as the signaling domains of SABR-II are modular, SABR-IIs can be expressed and deployed in professional APCs; however, there are also key differences, such as lower library sizes, especially compared to the cDNA-generated libraries. As with the current cell-based epitope discovery methods, SABR-IIs cannot match the scale of yeast display, which can reach up to 108 epitopes for profiling. Techniques such as TScan-II have shown genome-scale antigen discovery; however, they cannot be used for both class I and class II discovery in the same platform35. Therefore, while not required, certain a priori criteria such as MHC binding prediction, tissue expression patterns or known immunopeptidomic datasets greatly enhance SABR-II library design. SABR-II screens are currently performed as ‘few against many’ assays, allowing tens of TCRs to be screened in a single day. The computational prediction tools we used here also pose inherent limitations to our workflow. As shown, 10 of 16 computationally predicted TCRs did not recognize the same antigens as the parental TCRs. This may be due to the erroneous calling of clonotypes or due to the analog-binding variations of the epitopes tested here. Either way, while we were able to amplify experimental antigen discovery, caution must be taken to not presume that prediction equals actual binding.

While we showed de novo identification of the 11 top expanded TCRs out of 36, we did not identify the cognate epitopes of the remaining TCRs. This could be due to several reasons. First, we used a published MHC elution dataset, which inherently has high specificity but low sensitivity for detecting MHC-II-bound epitopes. Building new SABR-II libraries based on tissue-specific gene expression may benefit by casting a wider net in search of cognate epitopes. In addition, a hallmark of numerous autoreactive diseases is the reactivity to post-translationally modified epitopes73,74. While we were able to encode hybrid and deamidated epitopes in our SABR-II libraries, we are developing approaches to incorporate a wider range of chemical modifications. Finally, the antigen sensitivity of class I SABRs is inherently lower than those of TCRs. We expect that SABR-IIs may also have a similar limitation, where very-low-affinity antigens do not generate a strong SABR signal and remain below the limit of detection without further modification, such as the introduction of a disulfide trap to stabilize the MHC and fix weak binding registers in place.

In summary, this study demonstrates that wielding SABR-IIs for TCR-directed antigen discovery and amplifying discovery with existing computational methods is a powerful combination for understanding CD4+ T cell specificities. By increasing the ability to survey the T cell repertoire we envision a more comprehensive catalog of the T cell ‘reactome.’

Methods

Ethics statement

All animal work was performed as per Institutional Animal Care and Use Committee (IACUC) guidelines under an approved IACUC protocol (no. 20037102). All experimental work was performed according to the institutional biosafety committee protocols.

Reagents and oligonucleotide primers

Reagents and oligonucleotide primers methods can be found in Supplementary Table 7. The lists of epitopes in the SABR-II libraries can be found in Supplementary Tables 3, 4 and 6.

Cell lines and peptides

Jurkat cells (ATCC) and Daudi cells (ATCC) were cultured in R10 (RPMI 1640 medium (Corning) supplemented with 10% FBS (Gemini Bio) and 10 U ml−1 penicillin–streptomycin (Corning)). NFAT–GFP Jurkat cells were a kind gift from A. Weiss and Y. Chen and were cultured in R10 supplemented with 2 mg ml−1 Geneticin (Corning). HEK293T cells (ATCC) were cultured in D10 (DMEM (Corning) supplemented with 10% FBS (Gemini Bio) and 10 U ml−1 penicillin–streptomycin (Corning)). 5KC cells were a kind gift from M. Nakayama and were cultured in IMDM (Gibco) with 10% FBS (Gemini Bio) and penicillin–streptomycin. All cell culture was performed at 37 °C with 5% CO2 in a humid cell culture incubator. Primary CD4+ T cells were isolated from spleens for NOD mice using a STEMCELL murine CD4+ T cell-positive selection kit (STEMCELL Technologies) and cultured in R10 (RPMI 1640 medium (Corning) supplemented with 10% FBS (Gemini Bio) and 10 U ml−1 penicillin–streptomycin (Corning)) supplemented with 5 U ml−1 IL-2 (R&D Biosciences).

Mice

Mice were housed in microisolator cages with up to five mice per cage in a 14-h light–10-h dark cycle. Temperatures of 65–75 °F (~18–23 °C) with 40–60% humidity were maintained. There was constant access to water. NOD/ShiLtJ (strain 001976, The Jackson Laboratory) mice were purchased at the age of 4 weeks. The mice were fed autoclaved rodent breeder diet (T. R. Last). Female mice were used for scRNA-seq and validation assays. For scRNA-seq, 6-, 8- or 10-week-old female mice were used. All animal work was performed under IACUC protocols in the Association for Assessment and Accreditation of Laboratory Animal Care-certified animal facility at the University of Pittsburgh.

Construction of SABRs

SABRs were designed by assembling the individual component sequences in Snapgene (DNAstar). HLA allele chains were downloaded from IMGT and MHC allele chains were downloaded from UniprotKB. SignalP-5.0 (ref. 75) was used to predict the signal sequence and truncate it. The signaling domains were derived from the previously published SABR constructs27. Beta-chain-Signaling-2A-Alpha-chain fragments were assembled and codon-optimized using IDT’s codon optimization tool. BsmBI sites were replaced without affecting the amino acid sequences and EcoRI sites were added at the ends. A 2-kb stuffer fragment was also synthesized according to previously published sequences27. Open reading frames were synthesized as gBlocks (IDT) and assembled using PCR (KOD mastermix, Milipore Sigma) using the following primers: 2kb-Insert-gBlock-F; 2kb-Insert-gBlock-R; BsmBI-Insert-Fwd; and ClassII-Alpha-Rev. The assembled full-length inserts were gel purified (Takara), digested with EcoRI (NEB), ligated in EcoRI-digested pCCLc-MND-X (a kind gift from D.B. Kohn) and transformed using NEB-5α cells (NEB). Inserts were verified using MND_Input_Verify_F and MND_Input_Verify_R primers. Once full-length backbones were cloned, they were used to clone individual epitopes. To insert epitopes, SABR vectors were digested with BsmBI along with alkaline phosphatase (rSAP, NEB) to excise the 2-kb stuffer fragment. Two complementary oligonucleotides, SABR-epitope-F and SABR-epitope-R, were synthesized for each epitope. Oligonucleotides were annealed to each other, phosphorylated and ligated into the BsmBI-digested backbone (T4 Ligase, NEB) and transformed in NEB-5α cells (NEB). For cloning SABR libraries, oligonucleotide pools containing overhangs (oligonucleotide epitope primer) were synthesized via Twist Biosciences. The pool was amplified using ClassII-Oligo-Fwd and ClassII-Oligo-Rev and cloned in a BsmBI-digested backbone using Infusion HD cloning (Takara). Bacteria were plated on LB agar containing 100 μg ml−1 carbenicillin (Life Technologies), grown overnight and single colonies were selected for verification by Sanger sequencing (Azenta). Successful clones were used to inoculate liquid culture for overnight growth followed by plasmid minipreps (Zyppy miniprep kit, Zymo). Pooled libraries were subjected to maxipreps (Nucleobond Maxiprep EF kit, Takara). Library coverage was determined by comparing the number of total colonies transformed to the number of epitopes encoded in the library. For B cell receptor SABRs, the protein sequences CD79A and CD79B domains were obtained from UniprotKB and fused with full-length MHC-II chains and obtained via commercial synthesis (Twist Biosciences). Epitopes were cloned in the B cell receptor SABR backbone as described above, except that the stuffer fragment was removed using XhoI digestion (NEB).

scRNA-seq of islet-infiltrating T cells and analysis

NOD mice were killed by CO2 asphyxiation and immediately dissected for pancreas perfusion and individual islet picking as previously decsribed66. Pancreas perfusion was performed under a dissecting microscope. The pancreatic duct was clamped using surgical clamps and 3 ml 600 U ml−1 Collagenase IV (Gibco) dissolved in HBSS (Gibco) was injected using a 30G needle. Perfused pancreata were collected and incubated at 37 °C for 30 min. After the incubation, HBSS with R10 was added to quench collagenase. After washing twice with HBSS + R10, the tissue was plated on a 10-cm plate and individual islets were picked using a micropipette. Islets were then incubated in dissociation buffer (Gibco), centrifuged and resuspended in the staining mix (1:500 dilution of anti-Thy1.2-BV605 + 1:500 dilution of Live/Dead-APC-Cy7 and 1:100 dilution of cell-hashing TotalSeq antibodies (BioLegend)). After staining, the cells were resuspended in PBS + 0.04% BSA (Millipore Sigma) and sorted on a BD FACS Aria III sorter. After sorting the cells, they were counted and processed for scRNA-seq. Cells were processed using 10× 5′ single-cell gene expression kit v3 in a Chromium controller according to the manufacturer’s protocols. V(D)J enrichment was performed using the single-cell 5′ VDJ enrichment kit according to the manufacturer’s protocols. Libraries were sequenced on a HiSeq4000 (Novogene) with a 70:20:10 mix for gene expression:VDJ:hashing libraries. Sequence data were downloaded on the Joglekar laboratory server and aligned to the mouse genome (Mm10) using CellRanger v.4.0.0 (10x Genomics). TCR annotation was performed using CellRanger vdj using mouse GRCm38 assembly. All three time points were sequenced and processed separately. CellRanger and CellRanger vdj output files were used as inputs in Seurat62 for normalization, scaling and dimensionality reduction. The packaged scRepertoire was used for TCR clonotype calling and analyses. The data were normalized using NormalizeData and scaled using ScaleData functions in Seurat. The scRepertoire63 functions combineTCR and combineExpression were used to add TCR clonotypes to each cell. The HTODemux function in Seurat was used to demultiplex cell hashes and assign the correct mouse identity to each cell. At this point, all three time points were merged in Seurat using the merge function. After merging, integration was performed using FindIntegrationAnchors and IntegrateData functions. Principal-component analysis was performed using RunPCA. The top 20 principal components were used for UMAP, followed by cluster identification using FindNeighbors and FindClusters. CD4+ T cells were subsetted using FeatureScatter and CellSelector functions and reclustered. Cluster markers were defined by the FindAllMarkers function. Clonotype data were sorted according to expansion and exported as a csv file. UMAP representations with clonotypes were generated using the highlightClonotypes function in scRepertoire. Differentially expressed genes were identified using the FindMarkers function using DESeq2 statistics and represented using EnhancedVolcano function. For the related manuscript67 (Xiao, Rohimikollu and Rosengart et al.), single (1), low (2–9) and medium (≥10) clonotypes were subsetted in Seurat and exported as Seurat objects for further analyses. All scRNA-seq analyses were performed using RStudio (v.2023.12.1+402).

TCR reconstruction and synthesis

TCR Vα, Jα, Vβ and Jβ alleles along with CDR3α and CDR3β sequences were used as the input to reconstruct full-length TCR sequences using the TCRgen_mouse.opt_v2.py script (available on GitHub at https://github.com/joglekar-lab/SABR-II). Mouse reference sequences were downloaded from IMGT. Full-length TCR sequences (TCRα-2A-TCRβ) flanked by EcoRI site and truncated at the BlpI site in Cb were synthesized as gene fragments via Twist Biosciences. TCR gene fragments were amplified using TCR-gene-fwd and TCR-gene-rev primers and subcloned using a pMIG-II vector containing BDC2.5 TCR (Vignali laboratory) using EcoRI-BlpI. Successful cloning was verified using Sanger sequencing (Azenta).

TCR similarity determinations

Exported clonotypes were used as inputs for GLIPH2 (ref. 69). For CoNGA, the merged dataset was exported as a .h5ad file and used as an input along with the CellRanger vdj output file. CoNGA analysis was performed using default parameters39. Pairwise relative distances among TCRs were calculated using tcrdist3 (ref. 37). CoNGA, tcrdist3 and GLIPH2 output files were searched manually for analogs that co-cluster with experimentally de-convoluted TCRs. Analogs were synthesized and cloned as described above.

Generation and cloning of SABR libraries

To generate the I-Ag7 restricted SABR library, we combined all Immune Epitope Database epitopes with a published immunopeptidome generated by Wan et al.51. Sequences were filtered remove all post-translational modifications except deamidation and HIPs and trimmed between 9–25 amino acid lengths. For the insulin C HIP and HLA-DQ8 library, non-contiguous epitopes from Wiles et al.66 as well as all Immune Epitope Database epitopes were combined to generate the epitope list. Epitope sequences were back-translated using the backtranslate_fast.py script.

Lentiviral vector production and transduction

Lentiviral vectors to express SABRs or TCRs were packaged via previously described procedures27,76. In brief, HEK293T cells were plated in six-well plates at 1 × 106 cells per well. After 24 h, cells were transfected with a mixture of the lentiviral shuttle plasmid (1 μg per well), pMDG-VSVG (0.2 μg per well) and pCMV-RD8.9 (1 μg per well) (both kind gifts from D.B. Kohn) using TransIT-293 (Mirus Bio) and OPTI-MEM (Life Technologies) using the TransIT-293 manufacturer’s protocol. After 3 days, viral supernatant was collected and filtered through 0.45-μm syringe filters (Millipore). When possible, the freshly filtered virus was used to transduce 1 × 106 Jurkat cells per ml of the virus. Occasionally, the virus was stored at −80 °C until use. For NFAT–GFP Jurkat cells, Geneticin was added 24 h following transduction.

Retroviral vector production and transduction

Retroviral vectors (pMIG-II) to express TCRs were packaged via previously described procedures77. In brief, HEK293T cells were plated in six-well plates at 1 × 106 cells per well. After 24 h, the cells were transfected with a mixture of the retroviral shuttle (1 μg per well), pRD114 (0.8 μg per well) and pHIT60 (1 μg per well) using TransIT-293 (Mirus Bio) and OPTI-MEM (Life Technologies). The following day, viral supernatant was collected and filtered through 0.45-μm syringe filters (Millipore). Transduction of 2.5 × 105 Jurkat cells was performed using RetroNectin (Takara) binding according to the manufacturer’s protocol using the filtered virus. For primary murine CD4+ T cells and 5KC cells, Phoenix-ECO cells (ATCC) were plated in six-well plates at 1 × 106 cells per well. After 24 h, the cells were transfected with the retroviral shuttle (2.5 μg per well) using TransIT-293 (Mirus Bio) and OPTI-MEM (Life Technologies) using the TransIT-293 manufacturer’s protocol. At 48 h after transfection viral supernatant was collected and filtered through 0.45-μm syringe filters (Millipore). Transduction of 2.5 × 105 5KC or primary murine CD4+ T cells was performed using RetroNectin (Takara) binding according to the manufacturer’s protocol using the filtered virus. Before transduction, primary murine CD4+ T cells were stimulated and grown for 24 h on 24-well plates coated with RetroNectin, 2 μg ml−1 anti-CD3ε (BioLegend) and 1 μg ml−1 anti-CD28 (BioLegend).

Co-culture assays

For SABR library screens, 3 × 106 NFAT–GFP Jurkat cells expressing the SABR library were labeled with CellTrace Violet (BioLegend) according to the manufacturer’s protocol before incubation with 3 × 106 Jurkat cells expressing the TCR of interest. These mixtures were incubated in a six-well plate for 16–20 h. Cells were stained with anti-CD69-APC-Cy7 where indicated (BioLegend) and the top 1–2% of GFP+CD69+ cells were sorted for genomic DNA extraction, indexing and sequencing (see below). Multiplexed assays were scaled on a per-TCR basis (for example 3 × 106 for each of three TCRs against 9 × 106 library cells). For single SABR assays, unless otherwise defined, 5 × 105 SABR expressing NFAT–GFP Jurkat cells (or 5KC cells) were labeled with CellTrace Violet (BioLegend) according to the manufacturer’s protocol before incubation with 5 × 105 TCR-expressing Jurkat cells (or 5KC cells) in a round-bottom 96-well plate for 16–20 h. Cells were stained with anti-CD69-APC-Cy7 when indicated and acquired on the Attune NxT flow cytometer (Thermo Fisher Scientific). All flow analysis was performed using FlowJo (BD). For Daudi cell co-culture, 1 × 106 Jurkat cells expressing the TCR of interest were incubated with 1 × 106 Daudi cells expressing a SABR of interest for 3 days. On day 3, cells were labeled with anti-RT1B-PE and anti-Fas-APC-Cy7 before being acquired on the Attune NxT flow cytometer.

High-throughput sequencing and analysis

Genomic DNA was extracted from sorted cells immediately after sorting, using the PureLink genomic DNA extraction kit (Life Technologies). The integrated SABR vectors were amplified with KOD polymerase (Millipore) and two rounds of amplification. In the first round, IDT-UD-SABR-C2-F and IDT-UD-SABR-C2-R primers were used to amplify the epitope. In the second round, UDI0001-R and UDI0001-F primers (representative of index 1) were used to add Illumina unique dual indexes (UDIs) to the amplicons. A different UDI was used for each sample. The reactions were pooled and purified with the NucleoSpin gel and PCR purification kit (Takara). The purified PCR product was checked before sequencing using 2% agarose gel and subjected to sequencing on a HiSeq4000 (Fulgent Genetics). Unaligned reads generated by the sequencer were stored in FASTQ files. FASTQ files were concatenated to generate one file for read1 and read2 each. The sequences were demultiplexed into individual indexes using demultiplex_dual.py. Epitopes were extracted and scored using epitope_extract_fastq_v1.1.py and merge_counts_split_v2.1.py. The ES was calculated using Microsoft Excel (Microsoft) workbooks.

Peptide pulsing assays

Bone-marrow-derived dendritic cells (BMDCs) were generated according to Abcam’s protocol (https://www.abcam.com/protocols/bmdc-isolation-protocol) by isolating bone marrow from NOD mice and differentiating these cells in granulocyte–macrophage colony-stimulating factor (R&D Systems) for 7 days. On day 7, 2 × 104 BMDCs were resuspended in R10 and plated in a flat-bottom 96-well plate. Tenfold serial dilutions of each peptide were added to the BMDCs and left to incubate for 1 h. After 1 h, 5 × 104 5KC or primary murine CD4+ T cells were added to the peptide-pulsed BMDCs. The assay was left to incubate for 24 h, at which point cells were spun down, supernatant was collected and used for mIL-2 detection with the LEGEND MAX Mouse IL-2 ELISA kit (BioLegend). Peptides were custom ordered from GenScript.

Statistical analysis

Flow cytometry plots were analyzed with FlowJo v.10. Statistical analyses and graphical representations were generated by Microsoft Excel and GraphPad Prism v.9 and v.10 (GraphPad).

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Online content

Any methods, additional references, Nature Portfolio reporting summaries, source data, extended data, supplementary information, acknowledgements, peer review information; details of author contributions and competing interests; and statements of data and code availability are available at 10.1038/s41592-024-02255-0.

Supplementary information

Reporting Summary (2.8MB, pdf)
Supplementary Table 1 (390.5KB, xlsx)

Lists of epitopes for I-Ag7 SABR-II library.

Supplementary Table 2 (2MB, xlsx)

Lists of epitopes for DQ8 SABR-II library and ES.

Supplementary Table 3 (22.3KB, xlsx)

TCR alleles and CDR3 sequences for top clonally expanded TCRs.

Supplementary Table 4 (36MB, xlsx)

ES calculations for all TCRs.

Supplementary Table 5 (2.2MB, xlsx)

Lists of epitopes for I-Ag7 HIP library and ES.

Supplementary Table 6 (4.2MB, xlsx)

ES from dropout analysis.

Supplementary Table 7 (20.9KB, xlsx)

Reagents used in the study.

Source data

Source Data Fig. 1 (573.9KB, xlsx)

Statistical source data.

Source Data Fig. 2 (12.4KB, xlsx)

Statistical source data.

Source Data Fig. 3 (205.4KB, xlsx)

Statistical source data.

Source Data Fig. 4 (11.1KB, xlsx)

Statistical source data.

Source Data Fig. 5 (73.5KB, xlsx)

Statistical source data.

Source Data Extended Data Fig. 1 (13KB, xlsx)

Statistical source data.

Source Data Extended Data Fig. 2 (17.9KB, xlsx)

Statistical source data.

Source Data Extended Data Fig. 3 (169.7KB, xlsx)

Statistical source data.

Source Data Extended Data Fig. 4 (569.7KB, xlsx)

Statistical source data.

Source Data Extended Data Fig. 5 (14.3KB, xlsx)

Statistical source data.

Source Data Extended Data Fig. 6 (589.1KB, xlsx)

Statistical source data.

Source Data Extended Data Fig. 7 (468.9KB, xlsx)

Statistical source data.

Source Data Extended Data Fig. 9 (18.6KB, xlsx)

Statistical source data.

Source Data Extended Data Fig. 10 (557KB, xlsx)

Statistical source data.

Acknowledgements

We thank A.R. Cillo, C. Workman, J. Bridge, P. Thomas, S. Schattgen, S. Chiou, J. Das, H. Xiao and H. Singh for scientific discussions and advice on experimental and computational techniques. We thank M.T. Leonard, K. Ford, K. Rankin, S. Rathod, K. Adam and A. Parikh for technical assistance in the experimental antigen discovery pipeline. We thank T. Tabib and R. Lafyatis at the Single Cell Core, The Unified Flow Cytometry Core and the Division of Laboratory Animal Research for aiding with scRNA-seq, flow cytometry and animal husbandry. NFAT–GFP Jurkat cells were a gift from A. Weiss (University of California San Francisco) and Y. Chen (University of California Los Angeles). The pCCLc-MND-X backbone and pCMV-RD8.9 were gifts from D.B. Kohn (University of California Los Angeles). The 5KC cells were a kind gift from M. Nakayama (Barbara Davis Center for Diabetes, University of Colorado Anschutz Medical Campus). P.M.Z. was funded by an Autoimmunity and Immunopathology training grant (5T32AI089443-14). This research was funded by National Institutes of Health (NIH)/National Institute of Diabetes and Digestive and Kidney Diseases New Investigator Gateway Award (1R03DK127447-01 to A.V.J.), NIH/National Institute of Diabetes and Digestive and Kidney Diseases/dkNET New Investigator in Bioinformatics Award (to A.V.J.), Pittsburgh Autoimmunity Center for Excellence in Rheumatology Innovative Discovery Award (to A.V.J.); NIH Director’s New Innovator Award (DP2 OD033187-01 to A.V.J.) and Juvenile Diabetes Research Foundation Strategic Research Agreement (3-SRA-2023-1354-S-B to A.V.J.).

Extended data

Author contributions

P.M.Z. designed and performed experiments, analyzed and interpreted the data and wrote the paper. S.G. designed and performed the scRNA-seq experiments. N.T. designed and performed SABR screens, validation experiments and SABR B cell experiments. J.T., E.Z.M., L.H., R.R., V.K.K.M., S.M., M.M.R. and S.A., performed experiments and assisted with technical procedures in TCR cloning, SABR screens and validations. M.S.A. and D.A.A.V. provided reagents, guidance and scientific discussions. A.V.J. conceptualized the study, designed and performed experiments, analyzed and interpreted the data and wrote the paper.

Peer review

Peer review information

Nature Methods thanks Michael Birnbaum, Encarnita Mariotti-Ferrandiz and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editor: Madhura Mukhopadhyay, in collaboration with the Nature Methods team.

Data availability

Sequencing data are available on the Gene Expression Omnibus under accession ID GSE247410. SABR-II plasmids, SABR-II libraries and TCRs will be made available upon request, given the large number of them. The individual sequences of epitopes as well as sufficient information to reconstruct the TCRs are provided in supplementary files. Source data are provided with this paper.

Code availability

All necessary scripts are deposited to GitHub at https://github.com/joglekar-lab/SABR-II.

Competing interests

D.A.A.V. is a cofounder and stockholder for Novasenta, Tizona and Trishula; stockholder for Oncorus and Werewolf; has patents licensed and obtains royalties from Astellas, BMS and Novasenta; is a scientific advisory board member for Tizona, Werewolf, F-Star, Bicara, Apeximmune and T7/Imreg Bio; is a consultant for Astellas, BMS, Almirall, Incyte, G1 Therapeutics and Inzen Therapeutics; and receives research funding from BMS, Astellas and Novasenta. M.S.A. Is a consultant for Imcyse and Novartis and owns stock in Merck and Medtronic. A.V.J. is a co-inventor on a patent application concerning the described platform, has received research funding from Mitsubishi-Tanabe Pharma and has served as a consultant for Pfizer. The remaining authors declare no competing interests.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

These authors contributed equally: Nishtha Trivedi, Stephanie Grebinoski.

Extended data

is available for this paper at 10.1038/s41592-024-02255-0.

Supplementary information

The online version contains supplementary material available at 10.1038/s41592-024-02255-0.

References

  • 1.Davis MM, Bjorkman PJ. T-cell antigen receptor genes and T-cell recognition. Nature. 1988;334:395–402. doi: 10.1038/334395a0. [DOI] [PubMed] [Google Scholar]
  • 2.Robins HS, et al. Comprehensive assessment of T-cell receptor β-chain diversity in αβ T cells. Blood. 2009;114:4099–4107. doi: 10.1182/blood-2009-04-217604. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Qi Q, et al. Diversity and clonal selection in the human T-cell repertoire. Proc. Natl Acad. Sci. USA. 2014;111:13139–13144. doi: 10.1073/pnas.1409155111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.de Greef PC, et al. The naive T-cell receptor repertoire has an extremely broad distribution of clone sizes. eLife. 2020;9:e49900. doi: 10.7554/eLife.49900. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Oh DY, Fong L. Cytotoxic CD4(+) T cells in cancer: expanding the immune effector toolbox. Immunity. 2021;54:2701–2711. doi: 10.1016/j.immuni.2021.11.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Moss P. The T cell immune response against SARS-CoV-2. Nat. Immunol. 2022;23:186–193. doi: 10.1038/s41590-021-01122-w. [DOI] [PubMed] [Google Scholar]
  • 7.James EA, Pietropaolo M, Mamula MJ. Immune recognition of β-cells: neoepitopes as key players in the loss of tolerance. Diabetes. 2018;67:1035–1042. doi: 10.2337/dbi17-0030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Pugliese A. Autoreactive T cells in type 1 diabetes. J. Clin. Invest. 2017;127:2881–2891. doi: 10.1172/JCI94549. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Spence A, et al. Revealing the specificity of regulatory T cells in murine autoimmune diabetes. Proc. Natl Acad. Sci. USA. 2018;115:5265–5270. doi: 10.1073/pnas.1715590115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Joglekar AV, Li G. T cell antigen discovery. Nat. Methods. 2021;18:873–880. doi: 10.1038/s41592-020-0867-z. [DOI] [PubMed] [Google Scholar]
  • 11.Williams T, et al. Development of T cell lines sensitive to antigen stimulation. J. Immunol. Methods. 2018;462:65–73. doi: 10.1016/j.jim.2018.08.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Parish CR, Glidden MH, Quah BJ, Warren HS. Use of the intracellular fluorescent dye CFSE to monitor lymphocyte migration and proliferation. Curr. Protoc. Immunol. 2009 doi: 10.1002/0471142735.im0409s84. [DOI] [PubMed] [Google Scholar]
  • 13.Mann SE, et al. Multiplex T cell stimulation assay utilizing a T cell activation reporter-based detection system. Front. Immunol. 2020;11:633. doi: 10.3389/fimmu.2020.00633. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Bercovici N, Duffour MT, Agrawal S, Salcedo M, Abastado JP. New methods for assessing T-cell responses. Clin. Diagn. Lab Immunol. 2000;7:859–864. doi: 10.1128/CDLI.7.6.859-864.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Zhang SQ, et al. High-throughput determination of the antigen specificities of T cell receptors in single cells. Nat. Biotechnol. 2018 doi: 10.1038/nbt.4282. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Newell EW, Klein LO, Yu W, Davis MM. Simultaneous detection of many T-cell specificities using combinatorial tetramer staining. Nat. Methods. 2009;6:497–499. doi: 10.1038/nmeth.1344. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Klenerman P, Cerundolo V, Dunbar PR. Tracking T cells with tetramers: new tales from new tools. Nat. Rev. Immunol. 2002;2:263–272. doi: 10.1038/nri777. [DOI] [PubMed] [Google Scholar]
  • 18.Dolton G, et al. More tricks with tetramers: a practical guide to staining T cells with peptide-MHC multimers. Immunology. 2015;146:11–22. doi: 10.1111/imm.12499. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Novak EJ, Liu AW, Nepom GT, Kwok WW. MHC class II tetramers identify peptide-specific human CD4(+) T cells proliferating in response to influenza A antigen. J. Clin. Invest. 1999;104:R63–R67. doi: 10.1172/JCI8476. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Nepom GT. MHC class II tetramers. J. Immunol. 2012;188:2477–2482. doi: 10.4049/jimmunol.1102398. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Vollers SS, Stern LJ. Class II major histocompatibility complex tetramer staining: progress, problems, and prospects. Immunology. 2008;123:305–313. doi: 10.1111/j.1365-2567.2007.02801.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Rius C, et al. Peptide-MHC class I tetramers can fail to detect relevant functional T cell clonotypes and underestimate antigen-reactive T cell populations. J. Immunol. 2018;200:2263–2279. doi: 10.4049/jimmunol.1700242. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Boder ET, Wittrup KD. Yeast surface display for screening combinatorial polypeptide libraries. Nat. Biotechnol. 1997;15:553–557. doi: 10.1038/nbt0697-553. [DOI] [PubMed] [Google Scholar]
  • 24.Wen F, Zhao H. Construction and screening of an antigen-derived peptide library displayed on yeast cell surface for CD4+ T cell epitope identification. Methods Mol. Biol. 2013;1061:245–264. doi: 10.1007/978-1-62703-589-7_15. [DOI] [PubMed] [Google Scholar]
  • 25.Wen F, Esteban O, Zhao H. Rapid identification of CD4+ T-cell epitopes using yeast displaying pathogen-derived peptide library. J. Immunol. Methods. 2008;336:37–44. doi: 10.1016/j.jim.2008.03.008. [DOI] [PubMed] [Google Scholar]
  • 26.Birnbaum ME, et al. Deconstructing the peptide-MHC specificity of T cell recognition. Cell. 2014;157:1073–1087. doi: 10.1016/j.cell.2014.03.047. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Joglekar AV, et al. T cell antigen discovery via signaling and antigen-presenting bifunctional receptors. Nat. Methods. 2019;16:191–198. doi: 10.1038/s41592-018-0304-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Kisielow J, Obermair F-J, Kopf M. Deciphering CD4+ T cell specificity using novel MHC–TCR chimeric receptors. Nat. Immunol. 2019;20:652–662. doi: 10.1038/s41590-019-0335-z. [DOI] [PubMed] [Google Scholar]
  • 29.Kula T, et al. T-Scan: a genome-wide method for the systematic discovery of T cell epitopes. Cell. 2019;178:1016–1028. doi: 10.1016/j.cell.2019.07.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Li G, et al. T cell antigen discovery via trogocytosis. Nat. Methods. 2019;16:183–190. doi: 10.1038/s41592-018-0305-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Sharma G, Rive CM, Holt RA. Rapid selection and identification of functional CD8+ T cell epitopes from large peptide-coding libraries. Nat. Commun. 2019;10:4553. doi: 10.1038/s41467-019-12444-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Dobson CS, et al. Antigen identification and high-throughput interaction mapping by reprogramming viral entry. Nat. Methods. 2022;19:449–460. doi: 10.1038/s41592-022-01436-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Jyothi MD, Flavell RA, Geiger TL. Targeting autoantigen-specific T cells and suppression of autoimmune encephalomyelitis with receptor-modified T lymphocytes. Nat. Biotechnol. 2002;20:1215–1220. doi: 10.1038/nbt758. [DOI] [PubMed] [Google Scholar]
  • 34.Obermair FJ, et al. High-resolution profiling of MHC II peptide presentation capacity reveals SARS-CoV-2 CD4 T cell targets and mechanisms of immune escape. Sci. Adv. 2022;8:eabl5394. doi: 10.1126/sciadv.abl5394. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Dezfulian MH, et al. TScan-II: a genome-scale platform for the de novo identification of CD4(+) T cell epitopes. Cell. 2023;186:5569–5586. doi: 10.1016/j.cell.2023.10.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Yu B, et al. Engineered cell entry links receptor biology with single-cell genomics. Cell. 2022;185:4904–4920. doi: 10.1016/j.cell.2022.11.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Dash P, et al. Quantifiable predictive features define epitope-specific T cell receptor repertoires. Nature. 2017;547:89–93. doi: 10.1038/nature22383. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Glanville J, et al. Identifying specificity groups in the T cell receptor repertoire. Nature. 2017;547:94–98. doi: 10.1038/nature22976. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Schattgen SA, et al. Integrating T cell receptor sequences and transcriptional profiles by clonotype neighbor graph analysis (CoNGA) Nat. Biotechnol. 2022;40:54–63. doi: 10.1038/s41587-021-00989-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Pogorelyy MV, et al. Resolving SARS-CoV-2 CD4+ T cell specificity via reverse epitope discovery. Cell Rep. Med. 2022;3:100697. doi: 10.1016/j.xcrm.2022.100697. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Robertson JM, Jensen PE, Evavold BD. DO11.10 and OT-II T cells recognize a C-terminal ovalbumin 323-339 epitope. J. Immunol. 2000;164:4706–4712. doi: 10.4049/jimmunol.164.9.4706. [DOI] [PubMed] [Google Scholar]
  • 42.Buzas EI, et al. A proteoglycan (aggrecan)-specific T cell hybridoma induces arthritis in BALB/c mice. J. Immunol. 1995;155:2679–2687. doi: 10.4049/jimmunol.155.5.2679. [DOI] [PubMed] [Google Scholar]
  • 43.Judkowski V, et al. Identification of MHC class II-restricted peptide ligands, including a glutamic acid decarboxylase 65 sequence, that stimulate diabetogenic T cells from transgenic BDC2.5 nonobese diabetic mice. J. Immunol. 2001;166:908–917. doi: 10.4049/jimmunol.166.2.908. [DOI] [PubMed] [Google Scholar]
  • 44.Tait BD. Genetic susceptibility to type I diabetes: a review. J. Autoimmun. 1990;3:3–11. doi: 10.1016/S0896-8411(09)90003-8. [DOI] [PubMed] [Google Scholar]
  • 45.Noble JA, et al. The role of HLA class II genes in insulin-dependent diabetes mellitus: molecular analysis of 180 Caucasian, multiplex families. Am. J. Hum. Genet. 1996;59:1134–1148. [PMC free article] [PubMed] [Google Scholar]
  • 46.Michels AW, et al. Islet-derived CD4 T cells targeting proinsulin in human autoimmune diabetes. Diabetes. 2017;66:722–734. doi: 10.2337/db16-1025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Hao Z, et al. Fas receptor expression in germinal-center B cells is essential for T and B lymphocyte homeostasis. Immunity. 2008;29:615–627. doi: 10.1016/j.immuni.2008.07.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Matou-Nasri S, et al. CD95-mediated apoptosis in Burkitt’s lymphoma B-cells is associated with Pim-1 down-regulation. Biochim. Biophys. Acta Mol. Basis Dis. 2017;1863:239–252. doi: 10.1016/j.bbadis.2016.09.012. [DOI] [PubMed] [Google Scholar]
  • 49.Rathmell JC, et al. CD95 (Fas)-dependent elimination of self-reactive B cells upon interaction with CD4+ T cells. Nature. 1995;376:181–184. doi: 10.1038/376181a0. [DOI] [PubMed] [Google Scholar]
  • 50.Vita R, et al. The Immune Epitope Database (IEDB): 2018 update. Nucleic Acids Res. 2019;47:D339–D343. doi: 10.1093/nar/gky1006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Wan X, et al. The MHC-II peptidome of pancreatic islets identifies key features of autoimmune peptides. Nat. Immunol. 2020;21:455–463. doi: 10.1038/s41590-020-0623-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Baker RL, et al. CD4 T cells reactive to hybrid insulin peptides are indicators of disease activity in the NOD mouse. Diabetes. 2018;67:1836–1846. doi: 10.2337/db18-0200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Amdare N, Purcell AW, DiLorenzo TP. Noncontiguous T cell epitopes in autoimmune diabetes: From mice to men and back again. J. Biol. Chem. 2021;297:100827. doi: 10.1016/j.jbc.2021.100827. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Stadinski BD, et al. Chromogranin A is an autoantigen in type 1 diabetes. Nat. Immunol. 2010;11:225–231. doi: 10.1038/ni.1844. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Parras D, Sole P, Delong T, Santamaria P, Serra P. Recognition of multiple hybrid insulin peptides by a single highly diabetogenic T-cell receptor. Front. Immunol. 2021;12:737428. doi: 10.3389/fimmu.2021.737428. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Ramirez L, Hamad AR. Status of autoimmune diabetes 20-year after generation of BDC2.5-TCR transgenic non-obese diabetic mouse. World J. Diabetes. 2013;4:88–91. doi: 10.4239/wjd.v4.i4.88. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Lee T, Sprouse ML, Banerjee P, Bettini M, Bettini ML. Ectopic expression of self-antigen drives regulatory T cell development and not deletion of autoimmune T cells. J. Immunol. 2017;199:2270–2278. doi: 10.4049/jimmunol.1700207. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Wiles TA, et al. Identification of hybrid insulin peptides (HIPs) in mouse and human islets by mass spectrometry. J. Proteome Res. 2019;18:814–825. doi: 10.1021/acs.jproteome.8b00875. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Pearson JA, Wong FS, Wen L. The importance of the non obese diabetic (NOD) mouse model in autoimmune diabetes. J. Autoimmun. 2016;66:76–88. doi: 10.1016/j.jaut.2015.08.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Prasad S, Kohm AP, McMahon JS, Luo X, Miller SD. Pathogenesis of NOD diabetes is initiated by reactivity to the insulin B chain 9-23 epitope and involves functional epitope spreading. J. Autoimmun. 2012;39:347–353. doi: 10.1016/j.jaut.2012.04.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Zakharov PN, Hu H, Wan X, Unanue ER. Single-cell RNA sequencing of murine islets shows high cellular complexity at all stages of autoimmune diabetes. J. Exp. Med. 2020;217:e20192362. doi: 10.1084/jem.20192362. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Hao Y, et al. Integrated analysis of multimodal single-cell data. Cell. 2021;184:3573–3587. doi: 10.1016/j.cell.2021.04.048. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Borcherding N, Bormann NL, Kraus G. scRepertoire: an R-based toolkit for single-cell immune receptor analysis. F1000Res. 2020;9:47. doi: 10.12688/f1000research.22139.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Baker FJ, Lee M, Chien YH, Davis MM. Restricted islet-cell reactive T cell repertoire of early pancreatic islet infiltrates in NOD mice. Proc. Natl Acad. Sci. USA. 2002;99:9374–9379. doi: 10.1073/pnas.142284899. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Galley KA, Danska JS. Peri-islet infiltrates of young non-obese diabetic mice display restricted TCR β-chain diversity. J. Immunol. 1995;154:2969–2982. doi: 10.4049/jimmunol.154.6.2969. [DOI] [PubMed] [Google Scholar]
  • 66.Grebinoski S, et al. Autoreactive CD8(+) T cells are restrained by an exhaustion-like program that is maintained by LAG3. Nat. Immunol. 2022;23:868–877. doi: 10.1038/s41590-022-01210-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Rahimikollu J, et al. SLIDE: significant latent factor interaction discovery and exploration across biological domains. Nat. Methods. 2024 doi: 10.1038/s41592-024-02175-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Giudicelli V, Chaume D, Lefranc MP. IMGT/GENE-DB: a comprehensive database for human and mouse immunoglobulin and T cell receptor genes. Nucleic Acids Res. 2005;33:D256–D261. doi: 10.1093/nar/gki010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Chiou SH, et al. Global analysis of shared T cell specificities in human non-small cell lung cancer enables HLA inference and antigen discovery. Immunity. 2021;54:586–602. doi: 10.1016/j.immuni.2021.02.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Baker RL, Jamison BL, Haskins K. Hybrid insulin peptides are neo-epitopes for CD4 T cells in autoimmune diabetes. Curr. Opin. Endocrinol. Diabetes Obes. 2019;26:195–200. doi: 10.1097/MED.0000000000000490. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Gioia L, et al. Position beta57 of I-A(g7) controls early anti-insulin responses in NOD mice, linking an MHC susceptibility allele to type 1 diabetes onset. Sci. Immunol. 2019;4:eaaw6329. doi: 10.1126/sciimmunol.aaw6329. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Wenzlau JM, et al. Insulin B-chain hybrid peptides are agonists for T cells reactive to insulin B:9-23 in autoimmune diabetes. Front. Immunol. 2022;13:926650. doi: 10.3389/fimmu.2022.926650. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Balakrishnan S, Kumar P, Prabhakar BS. Post-translational modifications contribute to neoepitopes in Type-1 diabetes: challenges for inducing antigen-specific tolerance. Biochim Biophys. Acta Proteins Proteom. 2020;1868:140478. doi: 10.1016/j.bbapap.2020.140478. [DOI] [PubMed] [Google Scholar]
  • 74.Buitinga M, et al. Inflammation-Induced citrullinated glucose-regulated protein 78 elicits immune responses in human type 1 diabetes. Diabetes. 2018;67:2337–2348. doi: 10.2337/db18-0295. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Almagro Armenteros JJ, et al. SignalP 5.0 improves signal peptide predictions using deep neural networks. Nat. Biotechnol. 2019;37:420–423. doi: 10.1038/s41587-019-0036-z. [DOI] [PubMed] [Google Scholar]
  • 76.Cooper AR, et al. Highly efficient large-scale lentiviral vector concentration by tandem tangential flow filtration. J. Virol. Methods. 2011;177:1–9. doi: 10.1016/j.jviromet.2011.06.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Szymczak AL, et al. Correction of multi-gene deficiency in vivo using a single ‘self-cleaving’ 2A peptide-based retroviral vector. Nat. Biotechnol. 2004;22:589–594. doi: 10.1038/nbt957. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Reporting Summary (2.8MB, pdf)
Supplementary Table 1 (390.5KB, xlsx)

Lists of epitopes for I-Ag7 SABR-II library.

Supplementary Table 2 (2MB, xlsx)

Lists of epitopes for DQ8 SABR-II library and ES.

Supplementary Table 3 (22.3KB, xlsx)

TCR alleles and CDR3 sequences for top clonally expanded TCRs.

Supplementary Table 4 (36MB, xlsx)

ES calculations for all TCRs.

Supplementary Table 5 (2.2MB, xlsx)

Lists of epitopes for I-Ag7 HIP library and ES.

Supplementary Table 6 (4.2MB, xlsx)

ES from dropout analysis.

Supplementary Table 7 (20.9KB, xlsx)

Reagents used in the study.

Source Data Fig. 1 (573.9KB, xlsx)

Statistical source data.

Source Data Fig. 2 (12.4KB, xlsx)

Statistical source data.

Source Data Fig. 3 (205.4KB, xlsx)

Statistical source data.

Source Data Fig. 4 (11.1KB, xlsx)

Statistical source data.

Source Data Fig. 5 (73.5KB, xlsx)

Statistical source data.

Source Data Extended Data Fig. 1 (13KB, xlsx)

Statistical source data.

Source Data Extended Data Fig. 2 (17.9KB, xlsx)

Statistical source data.

Source Data Extended Data Fig. 3 (169.7KB, xlsx)

Statistical source data.

Source Data Extended Data Fig. 4 (569.7KB, xlsx)

Statistical source data.

Source Data Extended Data Fig. 5 (14.3KB, xlsx)

Statistical source data.

Source Data Extended Data Fig. 6 (589.1KB, xlsx)

Statistical source data.

Source Data Extended Data Fig. 7 (468.9KB, xlsx)

Statistical source data.

Source Data Extended Data Fig. 9 (18.6KB, xlsx)

Statistical source data.

Source Data Extended Data Fig. 10 (557KB, xlsx)

Statistical source data.

Data Availability Statement

Sequencing data are available on the Gene Expression Omnibus under accession ID GSE247410. SABR-II plasmids, SABR-II libraries and TCRs will be made available upon request, given the large number of them. The individual sequences of epitopes as well as sufficient information to reconstruct the TCRs are provided in supplementary files. Source data are provided with this paper.

All necessary scripts are deposited to GitHub at https://github.com/joglekar-lab/SABR-II.


Articles from Nature Methods are provided here courtesy of Nature Publishing Group

RESOURCES