Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2022 May 12.
Published in final edited form as: Sci Immunol. 2021 Nov 12;6(65):eabl5053. doi: 10.1126/sciimmunol.abl5053

Single-Cell Multiomics Defines Tolerogenic Extrathymic Aire-Expressing Populations with Unique Homology to Thymic Epithelium

Jiaxi Wang 1,, Caleb A Lareau 2,3,, Jhoanne Bautista 4, Alexander Gupta 1,4, Katalin Sandor 2, Joe Germino 1, Yajie Yin 2, Matt Arvedson 1, Gabriella C Reeder 1, Nathan T Cramer 1, Fang Xie 1,4, Vasilis Ntranos 1,5, Ansuman T Satpathy 2,3, Mark S Anderson 1,6, James M Gardner 1,4,*
PMCID: PMC8855935  NIHMSID: NIHMS1767954  PMID: 34767455

Abstract

The Autoimmune Regulator (Aire), a well-defined transcriptional regulator in the thymus, is also found in extrathymic Aire-expressing cells (eTACs) in the secondary lymphoid organs. eTACs are hematopoietic antigen-presenting cells (APCs) and inducers of immune tolerance, but their precise identity has remained unclear. Here, we use single-cell multiomics, transgenic murine models, and functional approaches to define eTACs at the transcriptional, genomic, and proteomic level. We find eTACs consist of two similar cell types: CCR7+ Aire-expressing migratory dendritic cells (AmDCs) and an Airehi population co-expressing Aire and retinoic acid receptor-related orphan receptor gamma t (RORγt) that we term Janus cells (JCs). Both JCs and AmDCs have the highest transcriptional and genomic homology to CCR7+ migratory dendritic cells. eTACs, particularly JCs, have highly accessible chromatin and broad gene expression, including a range of tissue-specific antigens, as well as remarkable homology to medullary thymic epithelium and RANK-dependent Aire expression. Importantly, transgenic self-antigen expression by eTACs is sufficient to induce negative selection and prevent autoimmune diabetes. This transcriptional, genomic, and functional symmetry between eTACs (both JCs and AmDCs) and medullary thymic epithelium – the other principal Aire-expressing population and a key regulator of central tolerance – identifies a core program that may influence self-representation and tolerance across the spectrum of immune development.

One-Sentence Summary

Extrathymic Aire-expressing cells are hematopoietic migratory DC-like populations with homology to medullary thymic epithelium.

Editor’s Summary

Maintenance of T cell tolerance in post-thymic T cells depends on cells in peripheral lymphoid tissues that express the Autoimmune Regulator (Aire) gene, a transcriptional regulator enabling expression of a diverse set of self-antigens. To better define the hematopoietic lineage affiliation of these extrathymic Aire-expressing cells (eTACs), Wang et al. used single-cell transcriptional and chromatin profiling techniques to analyze eTACs in mouse lymph nodes. The eTACs were resolved into two distinct populations of migratory dendritic cells. One eTAC subset, designated as Janus cells, expressed Aire and the RORγt transcription factor typically associated with group 3 innate lymphoid cells. These findings establish eTACs as subsets of dendritic cells specially equipped to sustain T cell tolerance outside the confines of the thymus.

INTRODUCTION

The Autoimmune Regulator (Aire) gene plays an essential role in the maintenance of central tolerance, and its function has been extensively defined in medullary thymic epithelial cells (mTECs), where, among other roles, it regulates the expression of a diverse range of otherwise tissue-specific self-antigens (1,2). Aire expression has also been observed in the secondary and tertiary lymphoid organs in both mice and humans but not in primary tissues (35), and self-antigen expression in such extrathymic Aire-expressing cells (eTACs) is sufficient to cause deletion or inactivation of cognate T cells (5,6). Recently, eTACs have also been demonstrated to be essential for the maintenance of normal maternal-fetal immune tolerance (7). Thus, the precise identity and biology of these extrathymic Aire-expressing populations is of substantial biological and clinical interest.

While Aire expression has been described in various dendritic cell populations (46,811), the precise identity of extrathymic Aire-expressing cells has remained elusive. Recent reports have suggested that the principal Aire-expressing population in the secondary lymphoid organs may be a subset of group 3 innate lymphoid cells (ILC3s), based largely on the co-expression of Aire and retinoic acid receptor-related orphan receptor gamma t (RORγt) (12), encoded by the Rorc gene and known for its essential role in T helper cell type 17 (Th17) and ILC3 differentiation. A growing body of evidence has also shown that RORγt-expressing populations characterized as ILC3s can play important roles as APCs to promote immune tolerance in a range of contexts, most notably in commensal tolerance in the gut (13,14). Furthermore, single-cell transcriptional mapping of murine DC lineages has also recently identified populations of RORγt-expressing DCs (15). Thus, the identification of an Aire- and RORγt-coexpressing migratory DC-like population may have implications for some of these divergent findings.

Recent evidence suggests that migratory dendritic cells may have important regulatory functions in the maintenance of immune tolerance (1618). Initially characterized by their participation in the generation of memory immune responses to acquired antigens (19,20), migratory DCs (migDCs) have more recently been suggested to participate in the regulation and repression of immune responses. Transcriptional characterization of migDCs by the ImmGen consortium suggested enrichment for an immunomodulatory and tolerogenic transcriptional program (21). Deletion of migDC populations, or interference with their migration, worsened autoimmunity in a range of mouse models (16,17). More recently, populations of tumor-associated migratory DCs (termed mregDCs) were described which had both tolerogenic phenotype and function, and regulated immune responses to neoplasia (18). However, the signals and transcriptional regulatory circuits driving such tolerogenic phenotypes in migDCs remain obscure.

Here, we use high-throughput single-cell multiomics and a range of functional approaches to define eTACs as migratory DC-like populations and identify among them a new mixed-phenotype Aire- and RORγt-expressing population which we term Janus cells (JCs). We show that these populations have uniquely high degrees of broad chromatin accessibility and gene expression, including a range of tissue-specific antigens and immunomodulatory transcripts. Self-antigen expression in eTACs also leads to self-tolerance among T cells that escape thymic selection. Further, we find that eTACs have remarkable transcriptional and genomic homology to medullary thymic epithelium, despite being hematopoietic in origin. This convergent transcriptional circuitry between thymic and peripheral Aire-expressing populations, despite their distinct stromal and hematopoietic origins, may provide unique insights into the fundamental mechanisms maintaining self-tolerance throughout immune development.

RESULTS

Extrathymic Aire-expressing cells consist of distinct populations of migratory DC-like cells

To define the identity and basic biology of extrathymic Aire-expressing populations in an unbiased fashion, we sought to characterize these cells utilizing high-dimensional single-cell multiomics, including single-cell RNA sequencing (scRNA-seq) and ATAC with Selected Antigen Profiling by sequencing (ASAP-seq). As eTACs are present predominantly in the secondary lymphoid organs (5), we enriched for rare populations in the lymph nodes (LNs) by magnetic-column depletion of T and B cells from the digested, pooled LNs of WT and Aire-driven IGRP-GFP (Adig) mice, a previously characterized Aire-reporter mouse strain (5). Next, enriched cells were either directly subjected to parallel multiome sequencing (scRNA-seq and ASAP-seq) from WT mice, or flow sorted for all live, GFP+ cells from Adig mice (Fig. 1A, S1A). From the scRNA-seq data (Fig. S1BC), the unsorted WT sample (n=6,973 cells) allowed us to generate a map of rare and common secondary lymphoid populations, while the GFP-sorted sample (n=2,532 cells) from Adig mice allowed us to deeply profile eTAC populations and further map them into the immune landscape to define their identity. Using our combined data resource, we identified discrete populations of eTACs (GFP+ cells from the Adig LNs) which aligned closely with GFP and Aire transcript expression (Fig. 1B, D), confirming both sort purity as well as the fidelity of the GFP reporter. In the merged data, we were able to identify a broad range of rare lymphoid, myeloid, and stromal cell types, and found eTACs represented two distinct populations, both of which shared high transcriptional similarity to migratory DCs (Fig. 1CG, S2AB). The majority of Aire expression was seen in migratory DC1/2s, and we referred to these populations as Aire-expressing migratory dendritic cells (AmDCs), although these AmDCs did not cluster separately as a distinct group from Aire-negative migDCs (Fig. 1C). In contrast, the eTAC subset with the highest Aire expression was a discrete cluster, clearly distinct in RNA space after dimensional reduction, that expressed both high levels of Aire as well as Rorc (Fig. 1C, D, G), a gene otherwise associated with innate and adaptive lymphocyte populations such as Th17 and ILC3s (22). Importantly, these same eTAC populations were also all present in reference (non-Aire-enriched) samples alone, though at lower frequency (Fig. 1B, S1D). Despite its Rorc expression, this Airehi population otherwise appeared more closely related to the migratory DC lineage in t-SNE space, and expressed comparable levels of Zbtb46, Ccr7, and MHC class II. Because of the distinct phenotype and dual expression of Airehi/Rorc+, we refer to the cell population as Janus cells (JCs), in reference to the two-faced classical deity.

Figure 1. Extrathymic Aire-expressing cells consist of distinct populations of migratory DC-like cells.

Figure 1.

(A) Schematic of cell isolation and enrichment for scRNA-seq and ASAP-seq. (B) Reduced dimensionality representation of scRNA-seq data, indicating the GFP+ population and GFP transcript. (C) Annotated cell clusters of populations in scRNA-seq data. (D) Gene expression of selected transcripts used to define populations, including Aire and Rorc. (E) Hierarchical clustering of pseudobulk scRNA-seq clusters by all ImmGen microarray data using a scaled cosine similarity metric. Rows min-max normalized. ImmGen populations cluster identities annotated in red text as indicated. Aire-expressing populations are indicated with blue (migDC1/2) and red (JC) arrows. (F) Single-cell cosine similarity scores for scRNA-seq data using indicated ImmGen populations. (G) Gene expression heatmaps for selected gene sets, z-score-normalized per row. Aire-expressing populations are indicated with blue (migDC1/2) and red (JC) arrows.

To more precisely assign identity to extrathymic Aire-expressing populations in an unbiased fashion based on their entire transcriptome rather than individual marker genes, we next performed unsupervised two-dimensional hierarchical clustering and cosine similarity scoring of each pseudobulk population against the entire Immunologic Genome Project (ImmGen) microarray database of 178 annotated reference immune populations (23) in RNA space (Fig. 1E). Our informatics approach verified clustering of eTAC populations (i.e., JCs with AmDCs), as well as high transcriptional similarity to reference ImmGen populations collectively identified as migratory DCs (23). Reciprocally, we then utilized select ImmGen reference populations to assign cosine similarity scores on a per-cell basis to each cell profiled by scRNA-seq, which again demonstrated that both JCs and AmDCs had high transcriptional similarity to migratory DCs (Fig. 1F, S2B). These unbiased, reciprocal approaches to cell identity assignment indicate that all eTAC populations appear to be most transcriptionally similar to migratory dendritic cells.

Noting that migDCs have been characterized by a common canonical transcriptional signature (21), we verified this signature was highly expressed in all Aire-expressing clusters (JCs and migDC1/2) (Fig. 1G). And despite recent claims that this Aire/Rorc-coexpressing population was ILC3-like (12), we found that JCs clustered broadly with myeloid and not lymphoid lineages. JCs share less homology with defined populations of either NKp46+ or LTi ILC3s and do not express detectable levels of most canonical ILC3-family transcripts (Il17a/f, Il22, Il23r) (24). However, some genes, including Ccr6 and Tmem176a/b, were shared between the populations. Interestingly, eTACs in general, and JCs in particular, shared some transcripts with epithelial/stromal lineages (Fig. 1G).

Single-cell-chromatin accessibility supports the genomic identity of eTACs as myeloid populations with uniquely accessible chromatin

To further characterize eTACs at both the genomic and proteomic level, we subjected the same populations of cells purified from Aire-reporter and WT mice described above (Fig. 1A) to combined single-cell ATAC and surface protein multiomics (ASAP-seq) (25). In strong concordance with the single-cell RNA sequencing data, we again found that Aire accessibility was largely confined to migDCs and JCs (Fig. 2AC). To independently validate the cellular identity of these populations based on their chromatin profiles, we next performed unsupervised two-dimensional hierarchical clustering and cosine similarity scoring of each pseudobulk population against the entire ImmGen ATAC-seq database that consisted of 89 populations (26). While the ImmGen ATAC database does not include migratory DC populations, JCs and migDCs again both co-clustered with each other and showed the highest cosine similarity to dendritic cells (Fig. 2D, red and blue arrows). Conversely, these eTAC populations showed low similarity to ILCs or other lymphoid lineages and were also similar to medullary thymic epithelial cells (Fig. 2D, green arrow). These identities were reciprocally validated using per-cell cosine similarity projections based on accessible chromatin (Fig. S3A).

Figure 2. Single-cell-chromatin accessibility supports the genomic identity of eTACs as myeloid populations with uniquely accessible chromatin.

Figure 2.

(A) Reduced dimensionality representation of ASAP-seq data, indicating the GFP and WT sample origins. (B) Annotated cell clusters of populations in ASAP-seq data. (C) Gene activity scores of Aire and Rorc across ASAP t-SNE. (D) Hierarchical clustering of pseudobulk ASAP-seq clusters by all ImmGen ATAC-seq data using a scaled cosine similarity metric. Rows min-max normalized. ImmGen populations cluster identities annotated in red as indicated. Aire-expressing populations are indicated with blue (migDC1/2) and red (JC) arrows, and mTECs from the ImmGen database with a green arrow. (E) Accessible chromatin landscape near the Aire locus, including the previously described CNS1 region (bottom). (F) Volcano plot of differential accessibility peaks, indicating the number of peaks with greater (n=12,571) or less (n=1,578) accessibility in Aire expressing populations. Aire locus peaks in large red dots; CNS1 locus as indicated. (G) Comparison of number of accessible chromatin fragments between eTACs and non-eTAC populations in the LN by box/violin overlay. Statistical test: Mann-Whitney test. (H) Per-cluster abundance of accessible chromatin fragments/cluster, box/violin overlay, noting highest accessibility in JCs. Boxplots: center line, median; box limits, first and third quartiles; whiskers, 1.5× interquartile range.

Mapping the accessibility of the cis-regulatory region defining the Aire locus in these populations showed distinct peaks in JCs and migDCs in the conserved non-coding sequence 1 (CNS1) region, known to be essential for Aire expression (27,28) (Fig. 2E). Notably, our analysis of this locus also revealed several additional peaks unique to Aire-expressing populations that had not been previously described (Fig. 2E, Fig. S3B). Interestingly, global differential accessibility analyses of eTACs against all other Aire-negative populations showed significantly increased chromatin accessibility in Aire-expressing cells relative to all other LN populations, including at the Aire locus (Fig. 2F, G). To support that this was not due to batch effects, we also evaluated the same chromatin accessibility between eTACs and other LN populations in the WT dataset alone and again observed significantly increased accessibility among eTACs (Fig. S3C). To confirm this result was not due to skewing from a particular low-accessibility population, we also examined the per-cell chromatin accessibility of each individual population and found the same trend—JCs had the highest chromatin accessibility of any cell type in the LN, followed closely by migDC1/2s (Fig. 2H). Overall, our analyses support a marked increase of chromatin accessibility in eTACs, a striking observation given Aire’s known role in chromatin regulation in the thymus (2931) and in governing the expression of a broad range of tissue-specific antigens.

Focused multiomic analysis of eTAC populations identifies broad transcriptional upregulation and antigen presentation in all eTACs with uniquely high TSA expression in JCs

To better define the biology and relationships of eTAC subsets, we next focused specifically on GFP-sorted populations. We again confirmed that eTACs consist principally of AmDC1/2s and JCs, as well as small numbers of cells from B-cell and ILC lineages (Fig. 3A, Fig. S4A). Coincident with their increased chromatin accessibility, eTACs also had high levels of global gene expression compared with all other populations (Fig. S4B), with JCs having the highest level of global gene expression among eTAC clusters (Fig 3B, Fig. S4B). Differentially upregulated genes in JCs included a wide range of transcripts including both Aire and Rorc, as well as a distinct set of unique JC-specific genes (Fig. 3B, Table S1). Among JC-specific genes, we identified a large number of transcripts previously classified as tissue-specific antigens (TSAs) (32) (Fig. 3C, Table S2). Indeed, when we surveyed the global TSA expression, we found that JCs expressed the highest numbers of TSA genes, both when compared against AmDCs and more globally against all LN populations, presumably because of their overall increased levels of gene expression (Fig. 3D, Fig. S4B, C) and greater chromatin accessibility. Interestingly, these TSAs were particularly enriched for neuronal-associated and germ cell/placental antigens (Fig. S4D), suggesting both potential disease models in which to interrogate the function of these populations, as well as potential links to the established findings that loss of eTACs leads to a breakdown in maternal-fetal tolerance (7). JCs and AmDCs expressed a largely overlapping set of TSA genes, with JCs expressing a larger number of unique TSA transcripts (Fig. S4E).

Figure 3. Focused multiomic analysis of eTAC populations identifies broad transcriptional upregulation and antigen presentation in all eTACs with uniquely high TSA expression in JCs.

Figure 3.

(A) Reduced dimensionality representation for all GFP+ scRNA-seq and ASAP-seq data. (B) Differential gene expression analyses of JCs (x-axis) and AmDCs (y-axis), with Aire (orange) and Rorc (green) highlighted. Top 20 genes are shown in the table on the right. (C) Expression of three tissue-specific antigen genes, Gal, Prg2, and Pappa2, differentially upregulated in JCs. The top represents expression in the GFP+ scRNA-seq data and the bottom bar graphs represent tissue-restricted expression as indicated from GeneAtlas tissue populations. (D) Number of tissue-specific antigen (TSA) genes for the GFP+ only (left) and all lymph node (right) populations, noting the relative abundance in JCs. (E) Dot plot of antigen processing and presentation gene set showing expression in GFP+ populations. (F) Heatmap of immunomodulatory genes across all lymph node populations from scRNA-seq data, z-score normalized per row.

Both JCs and AmDCs also expressed a diverse array of genes associated with antigen processing and presentation in both the class I and class II MHC pathways (Fig. 3E). Additionally, eTACs also expressed a range of genes associated with immune tolerance and tolerogenic dendritic cell maturation (21,33), including Socs2, Fas, Cd200, Ido1, Smad4, and Irf2bp2 (Fig. 3F). These findings suggest that these populations may be involved in both the expression and/or acquisition of self-antigens, and supports the existing evidence (5,6,12) that eTACs may promote antigen-specific immune tolerance among interacting lymphocyte populations.

Surface-marker multiomics combined with functional flow cytometry and lineage-tracing allows for identification and characterization of eTACs and their lineage relationships

Although JC and AmDC subpopulations of eTACs appear transcriptionally similar, the lineage relationship between them was unclear. As single-cell ASAP-seq allows for identification of a broad range of surface-markers associated with each population, we next sought to define the surface proteins unique to AmDCs and JCs, which would allow us to distinguish and trace these populations. Differential analysis defined several surface proteins relatively more prevalent in either AmDCs (CD11c, CD45, MHCII) or JCs (CD200; Fig. 4A, B).

Figure 4. Surface-marker multiomics combined with functional flow cytometry and lineage-tracing allows for identification and characterization of eTACs and their lineage relationships.

Figure 4.

(A) Volcano plot of surface markers measured by ASAP-seq comparing JCs to AmDCs. (B) Per-cell visualization of selected surface markers indicated in panel A. (C) Dot plot summarizing expression of WNT and Stem-like genes over-expressed in JCs compared to Aire-expressing migratory DCs. (D) RNA velocity analysis of GFP+ populations. (E) Flow cytometry from WT and Adig lymph nodes, pre-gated on single, live, and dump (TCRβ, CD19, SiglecF, F4/80, NK1.1, Ly6C)-negative cells. (F) Flow cytometry from WT and Adig lymph nodes, pre-gated on single and live cells. (G) Flow cytometry from WT, Adig, and REALTAR mice lymph nodes, pre-gated on single, live, and dump (TCRβ, CD19, SiglecF, F4/80, NK1.1, Ly6C)-negative cells. (H) Flow cytometry from REALTAR mouse lymph nodes, pre-gated on single and live cells.

Furthermore, transcriptional data from the scRNAseq analysis suggested that JCs were highly enriched for expression of a range of Wnt-signaling pathway and progenitor-associated factors previously correlated with stem-like cell fates (3438) (Fig. 4C). Interestingly, Wnt signaling through E-cadherin and β-catenin also has a role in maintaining tolerogenic function in dendritic and myeloid cells (39,40). To determine whether JCs’ progenitor-like phenotype reflected their status as a precursor to AmDCs, we utilized both bioinformatic and functional approaches. Notably, RNA velocity (41) analysis showed no evidence of developmental vectors projecting from JCs to AmDCs (Fig. 4D). To verify this inference, we sought to characterize these relationships between respective eTAC populations by genetic lineage tracing and flow cytometry. First, using the Adig mouse, we verified that the majority of eTACs consist of migratory DC1/2s in roughly equal proportions of the two populations (Fig. 4E) – consistent with proportions found in our multiome data. Surprisingly, assessed as a percentage of total migDCs, AmDCs represent over 30% of the total migDC pool at baseline (Fig. 4F).

Next, to better identify JCs by flow cytometry and understand their lineage relationship to AmDCs, we used the previously described RORγt-Cre mouse (42). RORγt-Cre mice were crossed with Adig Aire-reporter and Rosa26-LSL-TmR mice to generate RORγt-Expression And Lineage-Tracer and Aire Reporter (REALTAR) mice. The REALTAR mouse allowed us to identify cells actively transcribing Aire by GFP expression, and cells expressing or having ever expressed RORγt by TmRed expression. Using this model, we could clearly identify two populations: a larger, distinct population of GFP+/TmR cells, as well as a smaller population of GFP+/TmR+ cells (Fig. 4G). Surface-marker characteristics of these populations matched what we observed by scRNA-seq and ASAP-seq (Fig. 3B, 4AB), that JCs (GFP+/TmR+) expressed dramatically lower levels of CD11c and CCR7 than AmDCs (GFP+/TmR; Fig. 4G). This was validated by staining for a number of other differential markers between AmDCs and JCs including MHCII, CD45, CCR6, and CD200 (Fig. 1G, 4A, 4G), which are consistent with previously published results (12).

Thus, we confirmed that JCs have a unique CCR7lo/CD11clo surface-marker phenotype distinct from AmDCs, despite expression of both genes at the transcript level in JCs. This lineage-tracer system also validates that JCs express the RORγt isoform of Rorc. Further, these data suggest that the majority of AmDCs are not lineage-traced by RORγt and thus do not appear to arise from an RORγt-expressing JC precursor. Finally, it is interesting to note that among MHCIIhi RORγt lineage-traced (TmRed+) cells, over 50% of this population is in fact Aire-expressing non-ILCs (GFP+, CD90; Fig. 4H). This suggests that perhaps JCs should be considered in the interpretation of previously described tolerogenic phenotypes attributed to MHCII-expressing ILC3s using RORγt-Cre systems (13,14).

eTACs are defined by transcriptional and regulatory homology to thymic medullary epithelium

While the global transcriptional and functional profile of eTACs suggested a high similarity with migratory DCs, we noted that our cosine similarity scoring against reference ImmGen populations demonstrated significant transcriptional homology to another population – medullary thymic epithelial cells (mTECs; Fig. 5A, B). Interestingly, this was particularly pronounced among JCs. Indeed, the top differentially expressed genes defining JCs were highly enriched only in mTECs when assessed among all other annotated populations in the ImmGen RNA-seq database (Fig. 5C). To define the expression of these genes more precisely relative to Aire expression in the thymus, we aggregated an atlas of prior published scRNA-seq data on thymic epithelial cells and performed integrative analyses using scVI (43) to establish a dimensionally-reduced map of thymic epithelial populations across the full spectrum of their development. The integrated data was annotated according to the population identities defined in the source data (4447) (Fig. S5). In addition to being expressed in mTECs, many of these JC-specific genes appeared tightly associated with Aire expression during mTEC development (Fig. 5D). Importantly, the majority of these genes were not, themselves, Aire-regulated in the thymus (three of the top 50 DE genes, 5.5% overall; Table S2) (48), but rather may represent Aire-associated transcriptional pathways with relevance for the biology and function of Aire in these populations. Strikingly, differential transcription factor binding-site accessibility analysis of the scATAC data demonstrated that the chromatin landscape of JCs was most highly enriched for transcription factor binding sites known for their roles driving Aire expression in mTECs, including RelB and NF-κB, as well as less well-characterized HIVEP genes (Fig. 5E, Table S3). While some of this overlap might be explained by binding-site sequence degeneracy, a number of these gene transcripts including the noncanonical NF-κB-pathway genes Relb and Nfkb2, and Hivep1 and Hivep3, appeared to be highly expressed in JCs and/or migDCs (Fig. 5F). Surveying the ImmGen ATAC database showed that these same transcription factor binding sites are also highly enriched in the open chromatin of medullary thymic epithelium (Fig. 5G, red arrow). Thus, our analyses indicate that JCs share a canonical transcriptional and accessible chromatin program with mTECs, likely mediated by Aire expression.

Figure 5. eTACs are defined by transcriptional and genomic homology to thymic medullary epithelium.

Figure 5.

(A) ImmGen similarity scores for scRNA-seq and (B) ASAP-seq data for mTEC populations. JCs and migratory DCs clusters for each embedding are noted by arrow. (C) Heatmap of ImmGen bulk RNA-seq data for genes most overexpressed in JCs relative to all other lymph node populations. mTEChi populations indicated in red text on top. (D) UMAP of aggregated published scRNA data from thymic epithelium showing annotated TEC subsets (left) and gene-density expression for Aire and genes enriched in JCs (right). (E) Rank-ordering of transcription factor motifs enriched in JCs compared to other populations. (F) Expression of transcription factors from panel E in scRNA-seq data. (G) ImmGen bulk ATAC-seq chromVAR deviation scores for all 89 populations, highlighting the mTEC population with red arrow. Rows represent top transcription factors identified from JCs and are min-max normalized.

RANK-RANKL signaling is required for Aire expression in eTACs

Given the strong genomic and transcriptional evidence for RANK/NF-κB signaling in eTACs and the well-established role of RANK-RANKL signaling in driving Aire expression in the thymus (49,50), we next asked whether these signals are also required for Aire expression in eTACs. First, we found that RANK (Tnfrsf11a) expression was highly upregulated in eTACs. OPG (Tnfrsf11b), a soluble RANKL antagonist that often serves in a negative-feedback regulatory mechanism in cells receiving high levels of RANK signaling (51,52), was quite specifically expressed in JCs among peripheral immune populations (Fig. 6A). As previously demonstrated (46,53,54), inhibition of RANK signaling by antibody-blockade of RANKL led to a significant loss of Aire expression among mTECs in the thymus (Fig. 6B, D). The same was true in the periphery, where eTACs (all GFP+ cells) decreased in relative and absolute abundance after RANKL-blockade (Fig. 6C, D). Most notably, JCs (GFP+ CCR7 CD11c) almost entirely disappeared (or lost Aire expression), suggesting that as in the thymus, Aire-expression in JCs depends on RANK/NF-κB signals. This result again demonstrates the symmetry between Aire expression in the thymus and periphery, provides valuable insight into signals required to induce Aire expression in hematopoietic populations, and suggests that studies utilizing RANK blockade, for example in the setting of tumor immunity (54,55), should also consider the impact on extrathymic populations.

Figure 6. RANK-RANKL signaling is required for Aire expression in eTACs.

Figure 6.

(A) Gene expression of RANK (Tnfrsf11a) and OPG (Tnfrsf11b) from scRNA-seq data. (B) Flow cytometry of Percoll-enriched thymic epithelial cells from WT and Adig mice treated with isotype or αRANKL, pre-gated on single, live, CD11c, CD45, EpCAM+ cells. (C) Flow cytometry of lymph nodes from WT and Adig mice treated with isotype or αRANKL treatment, pre-gated on single, live, and dump (TCRβ, CD19, SiglecF, F4/80, NK1.1, Ly6C)-negative cells. (D) Quantitation of percentage and absolute cell numbers from B and C, unpaired t-test.

Pancreatic self-antigen expression in eTACs is sufficient to induce deletion of T cells escaping thymic selection, and to prevent autoimmune diabetes

Given these transcriptional and genomic similarities between mTECs and eTACs, despite their different origins, we asked whether self-antigen presentation by eTACs is sufficient to mediate tolerance for T cells that escape thymic negative selection. To assess this, we utilized a feature of the Adig mouse, which also expresses a pancreatic self-antigen, islet-specific glucose-6-phosphatase related protein (IGRP), under the control of the Aire promoter (5). In order to generate mice in which only eTACs and not mTECs expressed IGRP, we thymectomized WT and Adig non-obese diabetic (NOD) mice and simultaneously transplanted wildtype thymi under the kidney capsule, then sublethally irradiated and reconstituted them with bone marrow from the 8.3 mouse, a CD8+ TCR-transgenic specific for IGRP (56) (Fig. 7A). Despite concern that peripheral antigen might traffic back to the transplanted thymus and mediate thymic negative selection, we saw no evidence of this, as FACS analysis and tetramer staining of lymphocytes from these mice demonstrated no thymic negative selection and no loss of tetramer avidity (Fig. 7BD). By contrast, in the secondary lymphoid organs, we observed profound deletion of IGRP-specific CD8+ T cells and significantly decreased tetramer avidity of surviving CD8+ T cells (Fig. 7BD). Importantly, despite the large numbers of IGRP-specific T cells leaving the thymus in this permissive TCR-transgenic system, eTAC-mediated deletion in the secondary lymphoid organs was sufficient to entirely prevent these mice from developing diabetes (Fig. 7E).

Figure 7. Pancreatic self-antigen expression in eTACs is sufficient to induce deletion of T cells escaping thymic selection, and to prevent autoimmune diabetes.

Figure 7.

(A) Experimental design for thymic swap and BM chimerism in NOD and Adig NOD mice. SL XRT: sublethal irradiation. BM: bone marrow. (B) Flow cytometry showing lymphocyte populations from indicated tissues. CLN: cervical lymph node. PLN: pancreatic lymph node. (C) Flow cytometry tetramer staining of lymphocyte populations from indicated tissues; panels on right show Adig NOD NRP-V7 staining alone to show distribution of avidity in the periphery, given small number of surviving cells. (D) Quantitation of absolute cell numbers from B, unpaired t-test. (E) Kaplan-Meier curves, diabetes-free survival after bone-marrow reconstitution with 8.3+ BM.

DISCUSSION

A growing body of evidence has demonstrated that migratory DCs in the steady state may play a significant role in immune tolerance, with diverse examples in immune homeostasis, autoimmunity, and tumor immunity (1618). Further, while traditional paradigms suggested an inverse correlation between DC maturity and immunogenicity, some mature DCs, particularly migratory DCs, acquire a discrete tolerogenic profile (33). However, the transcriptional signals driving such tolerogenic maturation have remained unclear. In parallel, work defining the identity and biology of eTACs has suggested a potential migDC-like phenotype (6,8), but their precise identity and biology has remained unclear.

Here, we defined eTACs using high-dimensional multiomics as consisting of two distinct migratory DC-like populations, AmDCs and JCs, and show that self-antigen expression in eTACs is sufficient to cause deletion of autoreactive T cells escaping thymic selection and prevent autoimmune diabetes. Remarkably, we find a distinct transcriptional homology between eTACs and mTECs, and comparative analysis of the two populations identified a core transcriptional signature shared between the two populations. This suggests several promising future directions to define both the signals driving Aire expression and more broadly the transcriptional circuitry underlying self-education in the adaptive immune system. We show that eTACs share a common genomic accessibility signature with mTECs enriched in RANK/NF-κB signaling pathways, and as in the thymus, RANK/RANKL signaling is required for Aire expression in eTACs. Beyond these known signals, several other pathways appear common to both populations, including Wnt-signaling and the Hivep genes. These parallel analyses also suggest novel candidate regulatory regions around the Aire locus which may provide further insight into the regulation of Aire expression in both the thymus and periphery. Finally, the unique genomic accessibility and high transcriptional activity of eTACs generally, and JCs specifically, raises numerous fascinating questions about their biology and homeostasis, and the role of TSA expression within these populations.

While prior work had suggested that Aire/RORγt+ cells might be an ILC3-like population (12), here, we used unbiased whole-transcriptome and whole-chromatin single-cell profiling methods to strongly support the myeloid and DC-like phenotype of JCs. Their similarities to ILCs appear to be largely a product of RORγt-dependent gene expression, like Tmem176a/b (24). These Airehi/RORγt+ cells have previously been demonstrated to be capable of antigen processing and presentation (12), and we show these pathways are upregulated in both JCs and AmDCs. Combined with their highly accessible chromatin and their broad transcriptional program including a range of tissue-specific antigens, this suggests the population may play a role in immune self-education. Indeed, the representation of placental antigens among the TSAs expressed by JCs suggests a potential link to the recently reported requirement for eTACs in the maintenance of maternal-fetal tolerance in both syngeneic and allogeneic pregnancy (7). Interestingly, due to their lack of surface protein expression of either CD11c or CCR7, this population may have been overlooked in prior analyses of dendritic cell subsets. Further, a number of previous studies have described tolerogenic phenotypes and attributed peripheral regulatory T-cell (Treg) induction to MHCII-expressing ILC3s using RORγt-Cre/MHCII-flox systems (13,14). The identification of RORγt-expressing JCs suggests that these cells should be considered as contributors to these phenotypes and deserve further study.

Numerous questions remain, including the contributions of peripheral Aire to normal immune homeostasis, the precise function of Aire within eTACs, and the mechanisms of eTAC-mediated tolerance induction. Additionally, while we have demonstrated here that JCs are most likely not precursors to the majority of AmDCs, JCs do share many phenotypic characteristics with migratory DCs and thus represent an intriguing discrete myeloid population whose developmental origin will be essential to further define. Similarly, the signals driving Aire expression in these populations, and in the thymus through shared programs, will require greater investigation to understand the physiologic significance of these regulatory networks. Finally, while eTACs have recently been demonstrated to be essential for normal immune homeostasis in maternal-fetal tolerance during pregnancy (7), the precise mechanisms of that tolerance remain unclear. While the expression of a range of pregnancy-associated and placental antigens in JCs offers a tantalizing clue, it remains to be demonstrated whether the observed phenotype – immune-mediated intrauterine growth restriction after ablation of eTACs – is a direct result of this antigen reservoir loss, of antigen acquisition by these migratory populations, or of some other mechanism.

The remarkable parallelism between thymic and peripheral Aire-expressing cells does, however, suggest a uniquely conserved central and peripheral core circuitry for the maintenance of immune tolerance despite the divergent origin of these two cell types. Defining the biology of this core Aire-associated program may shed significant insight on basic Aire biology and its role in immune tolerance. Overall, work delineating the conserved transcriptional circuitry between these two tolerogenic populations could have broad relevance both for our understanding of normal immune homeostasis mechanisms, and for diverse applications in autoimmunity, maternal-fetal tolerance, tumor immunity, and transplantation.

MATERIALS AND METHODS

Study Design

The aim of this study was to define the identity and biology of eTACs using murine transgenic model systems in combination with advanced single-cell multiomics and integrative computational analyses. We used Adig Aire-reporter mice to identify eTACs and sort them, along with a broad sample of immune populations from the mouse lymph node, and subjected these populations to parallel scRNA-seq and ASAP-seq. We performed flow cytometry using the Adig mouse and an additional RORγt-lineage-tracer and Aire-reporter (REALTAR) mouse to validate and extend these results and study lineage relationships between eTAC populations. We utilized publicly accessible bulk and single-cell RNA and ATAC sequencing data on medullary thymic epithelial cells (mTECs), dendritic cells, and other immune populations to define the migratory DC-like character of eTACs and the parallel biology between eTACs and mTECs. Further, we used antibody blockade of the RANK signaling pathway to determine whether Aire expression in the periphery, like the thymus, depends on these pathways. Finally, we used model-antigen systems in combination with TCR-transgenic bone marrow chimeras and thymic “swap” experiments to determine whether self-antigen expression in eTACs was sufficient to prevent autoimmunity mediated by autoreactive T cells that escape thymic selection.

Mice

C57BL/6J (B6, H-2b), NOD, B6 RORγt-Cre, and B6 Rosa26-LSL-TmR mice were purchased from The Jackson Laboratory. B6 Aire-Driven IGRP-GFP (Adig) and NOD-Adig mice were generated by the lab and characterized as previously described (5). NOD Thy-1.1 8.3 TCR-transgenic mice, previously derived and characterized (56), were bred and utilized as bone marrow donors. All mice were housed in specific pathogen-free facilities at the University of California San Francisco, and all animal studies were approved by the Institutional Committee on Animal Use and Care at the University of California San Francisco.

Antibodies and Tetramers

Fluorochrome- or biotin-tagged antibodies against the following mouse targets were purchased from: BD Biosciences – CD11c (N418), Siglec F (E50–2440); Thermo Fisher - TCRβ (H57–597), CD3ε (145–2C11), B220 (RA3–6B2), Ter119 (TER-119), MHCII (AF6–120.1); and BioLegend - CD4 (GK1.5), CD8 (53–6.7), NK1.1 (PK136), Ly6C (HK1.4), CD90.2 (30-H12), CD19 (6D5), F4/80 (BM8), CCR7 (4B12), XCR1 (ZET), SIRPα (P84), TCRγδ (GL3), CD45 (30-F11), CD200 (OX-90), CCR6 (29–2L17), EpCAM (G8.8). Isotype control antibodies were purchased from same source as target-specific antibodies. Fixable Blue Dead Cell Stain Kit for UV excitation (Thermo Fisher) was used for viability staining for flow cytometry. DAPI (4′,6-diamidino-2-phenylindole) was used for viability staining for cell sorting.

MHC class I tetramers (I-Ag7) were provided by the NIH Tetramer Core Facility (Emory University, Atlanta, GA). I-Ag7 tetramers used for staining CD8+ T lymphocytes were the IGRP mimotope/class I MHC tetramer NRP-V7-PE (KYNKANVFL) or irrelevant mock tetramer TUM-PE (KYQAVTTTL).

Thymectomy, thymic transplantation, bone marrow chimera generation

Wildtype neonatal (2 day-old B6 pups were used as thymic donors; thymi were procured and cultured in 12-well Transwell plates in 1.35mM 2-deoxyguanosine in complete DMEM-10 at 37°C for seven days. One day prior to transplant (on d6), 2-DG media was replaced with regular complete DMEM. At day seven of thymic incubation, 6–8-week-old NOD and Adig NOD mice were anesthetized with Ketamine/Xylazine and immobilized, and a midline submandibular incision made, the salivary glands were reflected upward, and a partial sternotomy was made. The thymic lobes were suction-aspirated, confirming two lobes aspirated per mouse, and the salivary glands reflected down to cover the partial sternotomy, and the skin closed with staples. The mice were then rotated on their sides, and the left flank shaved, after which a 1cm incision was made and the kidney exteriorized, the capsule incised, and the donor thymi (both lobes) transplanted in subcapsular fashion, after which the capsular defect was cauterized, and the kidney replaced intraabdominally. The flank incision was sutured closed with 6–0 Monocryl. Mice were recovered on covered warming pads with buprenorphine/flunixin analgesia, and carefully monitored during recovery.

Two weeks after the procedure, experimental NOD thymic swap mice were sublethally irradiated with 900 rads (600/300), and donor bone marrow was prepared from NOD 8.3 Thy-1.1 mice by complement depletion of CD4+ and CD8+ T cells (anti-CD4 clone GK1.5, anti-CD8 clone YTS 164.5). Five million BM cells were transferred per recipient via tail-vein injection. Mice were analyzed by flow cytometry at eight weeks post-reconstitution or followed for diabetes incidence by weekly urine dipstick, or at more frequent intervals if they developed wasting or polyuria.

RANKL blockade

Anti-RANKL antibody (Clone IK22/5, BioXCell) or isotype control antibody (clone 2A3, BioXCell) was administered at 100μg/mouse in PBS every other day via intraperitoneal (i.p.) injections for a total of 3 injections.

Mouse lymph node processing and purification

Mouse secondary lymphoid organs (cervical, brachial, axillary, inguinal, and mesenteric lymph nodes) were procured and pooled in digestion media consisting of RPMI with 2% FBS (Sigma), with 100 μg/ml DNase (Roche) and 50 μg/ml Liberase (Roche), minced and agitated at 37°C with gentle agitation for 30 minutes, and passed through a 70-micron filter. Cells were resuspended for magnetic column enrichment (Miltenyi LD column depletion with Streptavidin or anti-biotin microbeads and biotinylated antibodies against B220, Ter119, TCRβ, +/− CD3ε). Cells were then either processed directly for 10x single-cell analysis (WT LN) or sorted by flow cytometry for all live, GFP+ cells, along with WT (non-transgenic) controls for MFI thresholding. Cells were sorted into PBS with 0.04% BSA. Cell viability and counts were evaluated with Vi-CELL XR (Beckman Coulter), and samples with viability >85% used for sequencing.

Thymus processing for flow cytometry

Mouse thymi were isolated and minced with razor blades, transferred into 15ml falcon tubes in total 10ml of DMEM (Sigma) with 2% FBS, and vortexed briefly for 15 seconds. After settling for 5 minutes, the supernatant media was removed and replaced with 4 ml of digestion media consisting of DMEM with 2% FBS (Sigma), with 100 μg/ml DNase (Roche) and 50 μg/ml Liberase (Roche). Thymi were digested in 37˚C water bath with mechanical aid by pipetting through a glass Pasteur pipette every 4 minutes. Every 8 minutes, tubes were spun briefly to pellet undigested fragments and the digested supernatant was transferred to 20 ml of 0.5% BSA (Sigma-Aldrich), 2 mM EDTA (Teknova), in PBS (MACS buffer). This procedure was repeated for three rounds with 4 ml of fresh digestion media each time until there were no remaining visible tissue fragments (total 24 minutes). The single-cell suspension was then pelleted and washed once in MACS Buffer and passed through a 70-micron filter. To enrich for stromal cells, single cell suspension of thymi were subjected to density-gradient centrifugation with a three-layer Percoll gradient (GE Healthcare) with specific gravities of 1.115, 1.065, and 1.0. Stromal cells are isolated from the Percoll-light fraction, between the 1.065 and 1.0 layers, and resuspended in PBS before counting and flow staining.

scRNA-seq

ScRNA-seq libraries were generated using the 10x Chromium Controller and the Chromium Single Cell 3′ Library Construction Kit v3 according to the manufacturer’s instructions. Briefly, the suspended cells were loaded on a Chromium controller Single-Cell Instrument to generate single-cell Gel Bead-In-Emulsions (GEMs) followed by reverse transcription and sample indexing using a C1000 Touch Thermal cycler with 96-Deep Well Reaction Module (BioRad). After breaking the GEMs, the barcoded cDNA was purified and amplified, followed by fragmenting, A-tailing and ligation with adaptors. Finally, PCR amplification was performed to enable sample indexing and enrichment of scRNA-Seq libraries. The final libraries were quantified using a Qubit dsDNA HS Assay kit (Invitrogen) and a High Sensitivity DNA chip run on a Bioanalyzer 2100 system (Agilent). All libraries were sequenced using NovaSeq S4 (Illumina) with paired-end reads (2 × 150 cycles), and per-sample sequencing libraries were demultiplexed using Cell Ranger v5.

ATAC with Selected Antigen profiling by sequencing (ASAP-seq)

Cells were stained with a 198 Total-seq A-conjugated antibody panel for murine cells (BioLegend, see Table S4 for a list of antibodies, clones and barcodes used for ASAP-seq) as previously described (57) and outlined online: https://cite-seq.com/asapseq/. Briefly, following sorting, cells were fixed in 1% formaldehyde and processed as described for the mtscATAC-seq workflow (25), with the modification that during the barcoding reaction, 0.5 μl of 1 μM bridge oligo A (BOA for TSA) was added to the barcoding mix. For GEM incubation the standard thermocycler conditions were used as described by 10x Genomics for scATAC-seq. Silane bead elution and SPRI cleanup steps were modified as described to generate the indexed protein tag library. The final libraries were quantified using a Qubit dsDNA HS Assay kit (Invitrogen) and a High Sensitivity DNA chip run on a Bioanalyzer 2100 system (Agilent). All libraries were sequenced using NovaSeq S4 (Illumina) with paired-end reads (2 × 150 cycles), and per-sample sequencing libraries were demultiplexed using Cell Ranger ATAC v1.2.

scRNA-seq analyses

For each sequencing library, per-cell, per-gene counts were generated from raw sequencing reads (.fastq files) using Cell Ranger v5 count using the mm10 reference genome. Each library (GFP and WT) yielded high quality data, including 11,697 and 10,953 cells for the GFP and WT libraries with a median number of genes detected per cell of 1,831 and 1,451, respectively. To perform integrative downstream analyses, we utilized the Seurat analytical toolkit (58). First, we filtered putative cell doublets using DoubletFinder (59) and downsampled the GFP to 3,000 cells to allow for a more balanced representation of the LN populations in the reduced dimension space. Next, we filtered for cells meeting the following criteria: greater than 500 genes detected and 1000 UMIs and no more than 10% mitochondrial RNA UMIs and no more than 15,000 total UMIs detected per cell—thresholds based on inspection of the empirical density of cells—resulting in 2,532 GFP+ cells and 6,973 WT cells. After normalization and variable gene inference, we corrected the top 30 principal components for the sequencing batch using Harmony (60). The Harmony-corrected components were then used for identifying cell neighbors, clusters, and reduced dimensionality coordinates. Visualization of populations and gene abundances were performed using the default Seurat functionality. For GFP positive analyses (Figure 3), we utilized similar thresholds but did not down sample the library to increase power to discriminate features between JCs and migDCs. RNA velocity analyses were performed using velocyto using the default parameters for this GFP+ population (41).

ASAP-seq analyses

For this multi-modal capture, scATAC-seq data was aligned and processed using the Cell Ranger ATAC v1.2 pipeline with the mm10 reference genome. For the ASAP-seq antibody tag data, per-cell, per-antibody tag counts were enumerated via the kite | kallisto | bustools framework accounting for unique bridging events as previously described (57). Cells called by the Cell Ranger ATAC knee call were further filtered based on abundance accessible chromatin (>3,000 nuclear fragments) as well as accessible chromatin enrichment (>4 TSS enrichment score) and minimal non-specific antibody binding (<2000 total antibody tag counts). We further downsampled the GFP positive population to ensure a more balanced representation of all populations in the LN as described in the scRNA-seq analysis section. Next, gene activity score calculations (via Signac (61)) were performed using the label transfer functionality originally described in Seurat v3 (58) for our well-defined cluster annotations from the scRNA-seq data. We excluded cells with a <60% confidence for the maximum population assigned. Downstream analyses, including dimensionality reduction, transcription factor binding analysis, accessible chromatin visualization was all performed using functions from the Signac package (61), including those that wrap chromVAR (62).

Cell type annotation/ImmGen similarity scores

In order to assign identities to the resulting cell subpopulations/clusters in an unbiased way, we developed an approach to query the similarity of single-cell RNA-seq profiles against all 178 publicly available reference microarray profiles in the Immunological Genome Project (ImmGen) Database (23); all ImmGen primary data were downloaded from haemosphere (https://www.haemosphere.org/datasets/show). The similarity measure we used is based on the cosine distance between appropriately normalized microarray/gene expression profiles - considering only genes (n=2,000) that were identified as highly variable in scRNA-seq. For each reference subpopulation, the corresponding ImmGen microarray profile was normalized to unit sum. The single cell gene expression profiles were first scaled to 1e4 total counts and logarithmized before being normalized to unit sum. Each expression dataset was then subsequently scaled. For each single cell profile, the resulting cosine similarities to all ImmGen profiles were further min-max scaled in a range between 0 and 1, to obtain the final ImmGen similarity scores. We applied this approach both for scoring individual cells (t-SNE visualization) and pseudobulk cluster profiles (clustered heatmap visualization). Final cell type annotation was carried out in a semi-supervised way, cross-validating the resulting ImmGen scores with subpopulation-specific gene expression of known markers and shown in Figure S2. An analogous workflow was devised for the ATAC scoring of populations where the top ~9,000 variable peaks were used as a feature set before performing similarity estimates between the pseudobulk and single-cell ATAC-seq data generated here and the bulk ImmGen ATAC-seq database.

Tissue-specific antigen assignment, annotation, and expression analysis

To examine tissue-specific antigens (TSA), we downloaded the corresponding list (Supplemental Table 2) from Sansom et al. (32), comprising of 6,610 genes, 202 of which were significantly up-regulated in JCs compared to other Aire-expressing populations. To map these TSAs to their corresponding tissue, we downloaded the GeneAtlas MOE430 dataset from BioGPS and excluded cell lines from the data resource, resulting in 89 tissues for further examination. The distribution of tissues with JC-specific TSAs represented the tissue with the maximum expression per-gene from this resource.

Merging and Analysis of Previously Published mTEC scRNAseq datasets

Raw transcript data from four previously published thymic scRNAseq datasets were collected and merged, filtering on genes that were detected across all four datasets (4447) and on cells isolated from wild type mice. Data likely to correspond to doublets were excluded by filtering out cells with total reads and number of genes in the top 1 percent. Further quality control was performed by excluding cells containing fewer than 3 genes. Batch correction and normalization of raw read counts were performed with scVI (43) using the default parameters for model training and batch keys corresponding to each sequencing run within the four datasets. An scVI model trained on the top 5,000 highly variable genes was used to generate a UMAP for the merged data, which was then subset on the 18,349 cells in mTEC clusters based on cell type labels assigned in the published analysis of the individual datasets. A new UMAP was generated for this subset of the filtered data and an scVI model trained using all 16,786 genes in the dataset was used to normalize gene expression for visualization.

Statistical Analyses and Visualization

All data points are shown in the graphs as mean value indicated and individual data points as shown. For all boxplots: center line, median; box limits, first and third quartiles; whiskers, 1.5× interquartile range. All statistical analyses were performed using GraphPad Prism 8 and R v4.0.3. Statistical significance was calculated using one-way ANOVA with Tukey multiple comparison, the unpaired two-sided t-test, or the non-parametric Mann-Whitney U test. Diabetes-free survival was analyzed using the log-rank test. *p < 0.05, **p < 0.005, *** p < 0.0005 and **** p < 0.00005 were considered significant. Experimental graphics were generated with BioRender (https://biorender.com/).

Supplementary Material

Table S4

Details of ASAP-seq surface panel (Excel spreadsheet).

Table S3

Top transcription factor activity scores for JCs (Excel spreadsheet).

Table S1

Summary statistics of differentially expressed genes between JCs and amDCs (Excel spreadsheet).

Table S2

Summary statistics of genes most over-expressed in JCs (Excel spreadsheet).

Data file S1

Raw data file (Excel spreadsheet).

Supplemental Material
MDAR reproducibility checklist

Acknowledgments:

We thank members of the Bluestone, Anderson, and Gardner labs for technical support and discussions including Alice Chan, Corey Miller, Jennifer Bridge, Jonah Phipps, Hong Sun, and Irina Proekt. We thank Chrysothemis Brown and Sasha Rudensky for discussions. We thank the members of the UCSF Flow Cytometry Core Facility for assistance with cell sorting.

Funding: This work was supported by the ARCS Fellowship (J.W.), a Stanford Science Fellowship (C.A.L.), a Parker Scholar Ward (C.A.L.), NIH T32 FAVOR (J.L.B.), the NIH P01AI118688 (M.S.A.), the ASTS Fellowship in Transplantation grant (J.M.G.), the UCSF Sandler Fellows PSSP grant (J.M.G.), NIH K08230188 and U01CA260852 (A.T.S.), a Cancer Research Institute Technology Impact Award (A.T.S.), and the Juvenile Diabetes Research Foundation and Lupus Research Alliance (A.T.S.).

Footnotes

Competing interests: A.T.S. is a founder of Immunai and Cartography Biosciences and receives research funding from Arsenal Biosciences and Allogene Therapeutics. The other authors declare no competing interests.

Data and materials availability: The sequencing data are available through the Gene Expression Omnibus under accession GSE176282. All other data needed to evaluate the conclusions in the paper are present in the paper or the Supplementary Materials.

REFERENCES AND NOTES

  • 1.Anderson MS, Venanzi ES, Klein L, Chen Z, Berzins SP, Turley SJ, et al. Projection of an immunological self shadow within the thymus by the aire protein. Science (80- ). 2002; [DOI] [PubMed] [Google Scholar]
  • 2.Anderson MS, Venanzi ES, Chen Z, Berzins SP, Benoist C, Mathis D. The cellular mechanism of Aire control of T cell tolerance. Immunity. 2005;23(2):227–39. [DOI] [PubMed] [Google Scholar]
  • 3.Ramsey C, Hässler S, Marits P, Kämpe O, Surh CD, Peltonen L, et al. Increased antigen presenting cell-mediated T cell activation in mice and patients without the autoimmune regulator. Eur J Immunol. 2006;36(2):305–17. [DOI] [PubMed] [Google Scholar]
  • 4.Poliani PL, Kisand K, Marrella V, Ravanini M, Notarangelo LD, Villa A, et al. Human peripheral lymphoid tissues contain autoimmune regulator-expressing dendritic cells. Am J Pathol. 2010;176(3):1104–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Gardner JM, DeVoss JJ, Friedman RS, Wong DJ, Tan YX, Zhou X, et al. Deletional Tolerance Mediated by Extrathymic Aire-Expressing Cells. Science (80- ) [Internet]. 2008. Aug 8;321(5890):843–7. Available from: 10.1126/science.1159407 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Gardner JM, Metzger TC, McMahon EJ, Au-Yeung BB, Krawisz AK, Lu W, et al. Extrathymic aire-expressing cells are a distinct bone marrow-derived population that induce functional inactivation of CD4+ T cells. Immunity. 2013; [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Gillis-Buck E, Miller H, Sirota M, Sanders SJ, Ntranos V, Anderson MS, et al. Extrathymic Aire -expressing cells support maternal-fetal tolerance. Sci Immunol [Internet]. 2021. Jul 16;6(61):eabf1968. Available from: 10.1126/sciimmunol.abf1968 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Fergusson JR, Morgan MD, Bruchard M, Huitema L, Heesters BA, van Unen V, et al. Maturing human CD127+ CCR7+ PDL1+ dendritic cells express AIRE in the absence of tissue restricted antigens. Front Immunol. 2019;10(JAN). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Kogawa K, Nagafuchi S, Katsuta H, Kudoh J, Tamiya S, Sakai Y, et al. Expression of AIRE gene in peripheral monocyte/dendritic cell lineage. Immunol Lett. 2002;80(3):195–8. [DOI] [PubMed] [Google Scholar]
  • 10.Lindh E, Lind SM, Lindmark E, Hässler S, Perheentupa J, Peltonen L, et al. AIRE regulates T-cell-independent B-cell responses through BAFF. Proc Natl Acad Sci U S A. 2008;105(47):18466–71. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Lindmark E, Chen Y, Georgoudaki AM, Dudziak D, Lindh E, Adams WC, et al. AIRE expressing marginal zone dendritic cells balances adaptive immunity and T-follicular helper cell recruitment. J Autoimmun. 2013; [DOI] [PubMed] [Google Scholar]
  • 12.Yamano T, Dobeš J, Vobořil M, Steinert M, Brabec T, Ziętara N, et al. Aire-expressing ILC3-like cells in the lymph node display potent APC features. J Exp Med. 2019. May 6;216(5). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Hepworth MR, Monticelli LA, Fung TC, Ziegler CGK, Grunberg S, Sinha R, et al. Innate lymphoid cells regulate CD4 + T-cell responses to intestinal commensal bacteria. Nature. 2013;498(7452):113–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Hepworth MR, Fung TC, Masur SH, Kelsen JR, McConnell FM, Dubrot J, et al. Group 3 innate lymphoid cells mediate intestinal selection of commensal bacteria-specific CD4+ T cells. Science (80- ). 2015;348(6238):1031–5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Brown CC, Gudjonson H, Pritykin Y, Deep D, Lavallée VP, Mendoza A, et al. Transcriptional Basis of Mouse and Human Dendritic Cell Heterogeneity. Cell. 2019;179(4):846–863.e24. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Anandasabapathy N, Feder R, Mollah S, Tse SW, Longhi MP, Mehandru S, et al. Classical Flt3L-dependent dendritic cells control immunity to protein vaccine. J Exp Med. 2014;211(9):1875–91. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Idoyaga J, Fiorese C, Zbytnuik L, Lubkin A, Miller J, Malissen B, et al. Specialized role of migratory dendritic cells in peripheral tolerance induction. J Clin Invest. 2013;123(2):844–54. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Maier B, Leader AM, Chen ST, Tung N, Chang C, LeBerichel J, et al. A conserved dendritic-cell regulatory program limits antitumour immunity. Nature [Internet]. 2020;580(7802):257–62. Available from: 10.1038/s41586-020-2134-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Itano AA, McSorley SJ, Reinhardt RL, Ehst BD, Ingulli E, Rudensky AY, et al. Distinct dendritic cell populations sequentially present antigen to CD4 T cells and stimulate different aspects of cell-mediated immunity. Immunity. 2003;19(1):47–57. [DOI] [PubMed] [Google Scholar]
  • 20.Allenspach EJ, Lemos MP, Porrett PM, Turka LA, Laufer TM. Migratory and Lymphoid-Resident Dendritic Cells Cooperate to Efficiently Prime Naive CD4 T cells. Immunity [Internet]. 2008;29(5):795–806. Available from: 10.1016/j.immuni.2008.08.013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Miller JC, Brown BD, Shay T, Gautier EL, Jojic V, Cohain A, et al. Deciphering the transcriptional network of the dendritic cell lineage. Nat Immunol. 2012;13(9):888–99. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Eberl G. RORγt, a multitask nuclear receptor at mucosal surfaces. Mucosal Immunol. 2017;10(1):27–34. [DOI] [PubMed] [Google Scholar]
  • 23.Heng TSP, Painter MW, Elpek K, Lukacs-Kornek V, Mauermann N, Turley SJ, et al. The Immunological Genome Project: networks of gene expression in immune cells. Nat Immunol [Internet]. 2008. Oct;9(10):1091–4. Available from: http://www.nature.com/articles/ni1008-1091 [DOI] [PubMed] [Google Scholar]
  • 24.Pokrovskii M, Hall JA, Ochayon DE, Yi R, Chaimowitz NS, Seelamneni H, et al. Characterization of Transcriptional Regulatory Networks that Promote and Restrict Identities and Functions of Intestinal Innate Lymphoid Cells. Immunity [Internet]. 2019;51(1):185–197.e6. Available from: 10.1016/j.immuni.2019.06.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Lareau CA, Ludwig LS, Muus C, Gohil SH, Zhao T, Chiang Z, et al. Massively parallel single-cell mitochondrial DNA genotyping and chromatin profiling. Nat Biotechnol [Internet]. 2021;39(4):451–61. Available from: 10.1038/s41587-020-0645-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Yoshida H, Lareau CA, Ramirez RN, Rose SA, Maier B, Wroblewska A, et al. The cis-Regulatory Atlas of the Mouse Immune System. Cell [Internet]. 2019;176(4):897–912.e20. Available from: 10.1016/j.cell.2018.12.036 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Haljasorg U, Bichele R, Saare M, Guha M, Maslovskaja J, Kõnd K, et al. A highly conserved NF-κB-responsive enhancer is critical for thymic expression of Aire in mice. Eur J Immunol. 2015;45(12):3246–56. [DOI] [PubMed] [Google Scholar]
  • 28.LaFlam TN, Seumois G, Miller CN, Lwin W, Fasano KJ, Waterfield M, et al. Identification of a novel cis-regulatory element essential for immune tolerance. J Exp Med. 2015;212(12):1993–2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Guha M, Saare M, Maslovskaja J, Kisand K, Liiv I, Haljasorg U, et al. DNA breaks and chromatin structural changes enhance the transcription of autoimmune regulator target genes. J Biol Chem. 2017;292(16):6542–54. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Koh AS, Kuo AJ, Sang YP, Cheung P, Abramson J, Bua D, et al. Aire employs a histone-binding module to mediate immunological tolerance, linking chromatin regulation with organ-specific autoimmunity. Proc Natl Acad Sci U S A. 2008;105(41):15878–83. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Koh AS, Miller EL, Buenrostro JD, Moskowitz DM, Wang J, Greenleaf WJ, et al. Rapid chromatin repression by Aire provides precise control of immune tolerance article. Nat Immunol [Internet]. 2018;19(2):162–72. Available from: 10.1038/s41590-017-0032-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Sansom SN, Shikama-Dorn N, Zhanybekova S, Nusspaumer G, Macaulay IC, Deadman ME, et al. Population and single-cell genomics reveal the Aire dependency, relief from Polycomb silencing, and distribution of self-antigen expression in thymic epithelia. Genome Res. 2014;24(12):1918–31. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Ardouin L, Luche H, Chelbi R, Carpentier S, Shawket A, Montanana Sanchis F, et al. Broad and Largely Concordant Molecular Changes Characterize Tolerogenic and Immunogenic Dendritic Cell Maturation in Thymus and Periphery. Immunity. 2016;45(2):305–18. [DOI] [PubMed] [Google Scholar]
  • 34.Wang B, Zhao L, Fish M, Logan CY, Nusse R. Self-renewing diploid Axin2 + cells fuel homeostatic renewal of the liver. Nature. 2015;524(7564):180–5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Nusse R, Clevers H. Wnt/β-Catenin Signaling, Disease, and Emerging Therapeutic Modalities. Cell [Internet]. 2017. Jun;169(6):985–99. Available from: https://linkinghub.elsevier.com/retrieve/pii/S0092867417305470 [DOI] [PubMed] [Google Scholar]
  • 36.Zhang H, Ye M, Welner RS, Tenen DG. Sox4 Is Required for the Formation and Maintenance of Multipotent Progenitors. Blood [Internet]. 2014. Dec 6;124(21):1577–1577. Available from: https://ashpublications.org/blood/article/124/21/1577/88774/Sox4-Is-Required-for-the-Formation-and-Maintenance [Google Scholar]
  • 37.Kashiwada M, Pham N-LL, Pewe LL, Harty JT, Rothman PB. NFIL3/E4BP4 is a key transcription factor for CD8α+ dendritic cell development. Blood [Internet]. 2011. Jun 9;117(23):6193–7. Available from: https://ashpublications.org/blood/article/117/23/6193/22308/NFIL3E4BP4-is-a-key-transcription-factor-for-CD8α [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Ogawa M, Matsuzaki Y, Nishikawa S, Hayashi S, Kunisada T, Sudo T, et al. Expression and function of c-kit in hemopoietic progenitor cells. J Exp Med [Internet]. 1991. Jul 1;174(1):63–71. Available from: https://rupress.org/jem/article/174/1/63/57359/Expression-and-function-of-ckit-in-hemopoietic [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Jiang A, Bloom O, Ono S, Cui W, Unternaehrer J, Jiang S, et al. Disruption of E-Cadherin-Mediated Adhesion Induces a Functionally Distinct Pathway of Dendritic Cell Maturation. Immunity. 2007;27(4):610–24. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Manicassamy S, Reizis B, Ravindran R, Nakaya H, Salazar-Gonzalez RM, Wang YC, et al. Activation of β-catenin in dendritic cells regulates immunity versus tolerance in the intestine. Science (80- ). 2010;329(5993):849–53. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.La Manno G, Soldatov R, Zeisel A, Braun E, Hochgerner H, Petukhov V, et al. RNA velocity of single cells. Nature [Internet]. 2018;560(7719):494–8. Available from: 10.1038/s41586-018-0414-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Ebertl G, Litman DR. Thymic origin of intestinal αβ T cells revealed by fate mapping of RORγt+ cells. Science (80- ). 2004;305(5681):248–51. [DOI] [PubMed] [Google Scholar]
  • 43.Lopez R, Regier J, Cole MB, Jordan MI, Yosef N. Deep generative modeling for single-cell transcriptomics. Nat Methods [Internet]. 2018;15(12):1053–8. Available from: 10.1038/s41592-018-0229-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Bornstein C, Nevo S, Giladi A, Kadouri N, Pouzolles M, Gerbe F, et al. Single-cell mapping of the thymic stroma identifies IL-25-producing tuft epithelial cells. Nature [Internet]. 2018;559(7715):622–6. Available from: 10.1038/s41586-018-0346-1 [DOI] [PubMed] [Google Scholar]
  • 45.Kernfeld EM, Genga RMJ, Neherin K, Magaletta ME, Xu P, Maehr R. A Single-Cell Transcriptomic Atlas of Thymus Organogenesis Resolves Cell Types and Developmental Maturation. Immunity [Internet]. 2018. Jun;48(6):1258–1270.e6. Available from: https://linkinghub.elsevier.com/retrieve/pii/S1074761318301845 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Wells KL, Miller CN, Gschwind AR, Wei W, Phipps JD, Anderson MS, et al. Combined transient ablation and single-cell RNA-sequencing reveals the development of medullary thymic epithelial cells. Elife [Internet]. 2020. Nov 23;9:1–27. Available from: https://elifesciences.org/articles/60188 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Dhalla F, Baran-Gale J, Maio S, Chappell L, Holländer GA, Ponting CP. Biologically indeterminate yet ordered promiscuous gene expression in single medullary thymic epithelial cells. EMBO J. 2020;39(1):1–18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Tomofuji Y, Takaba H, Suzuki HI, Benlaribi R, Martinez CDP, Abe Y, et al. Chd4 choreographs self-antigen expression for central immune tolerance. Nat Immunol [Internet]. 2020;21(8):892–901. Available from: 10.1038/s41590-020-0717-2 [DOI] [PubMed] [Google Scholar]
  • 49.Rossi SW, Kim MY, Leibbrandt A, Parnell SM, Jenkinson WE, Glanville SH, et al. RANK signals from CD4+3- inducer cells regulate development of Aire-expressing epithelial cells in the thymic medulla. J Exp Med. 2007;204(6):1267–72. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Hikosaka Y, Nitta T, Ohigashi I, Yano K, Ishimaru N, Hayashi Y, et al. The Cytokine RANKL Produced by Positively Selected Thymocytes Fosters Medullary Thymic Epithelial Cells that Express Autoimmune Regulator. Immunity. 2008;29(3):438–50. [DOI] [PubMed] [Google Scholar]
  • 51.Simonet WS, Lacey DL, Dunstan CR, Kelley M, Chang MS, Lüthy R, et al. Osteoprotegerin: A novel secreted protein involved in the regulation of bone density. Cell. 1997;89(2):309–19. [DOI] [PubMed] [Google Scholar]
  • 52.Udagawa N, Takahashi N, Yasuda H, Mizuno A, Itoh K, Ueno Y, et al. Osteoprotegerin produced by osteoblasts is an important regulator in osteoclast development and function. Endocrinology. 2000;141(9):3478–84. [DOI] [PubMed] [Google Scholar]
  • 53.Metzger TC, Khan IS, Gardner JM, Mouchess ML, Johannes KP, Krawisz AK, et al. Lineage Tracing and Cell Ablation Identify a Post-Aire-Expressing Thymic Epithelial Cell Population. Cell Rep [Internet]. 2013;5(1):166–79. Available from: 10.1016/j.celrep.2013.08.038 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Khan IS, Mouchess ML, Zhu ML, Conley B, Fasano KJ, Hou Y, et al. Enhancement of an anti-tumor immune response by transient blockade of central T cell tolerance. J Exp Med. 2014;211(5):761–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Bakhru P, Zhu M-L, Wang H-H, Hong LK, Khan I, Mouchess M, et al. Combination central tolerance and peripheral checkpoint blockade unleashes antimelanoma immunity. JCI Insight [Internet]. 2017. Sep 21;2(18). Available from: https://insight.jci.org/articles/view/93265 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Krishnamurthy B, Mariana L, Gellert SA, Colman PG, Harrison LC, Lew AM, et al. Autoimmunity to Both Proinsulin and IGRP Is Required for Diabetes in Nonobese Diabetic 8.3 TCR Transgenic Mice. J Immunol. 2008;180(7):4458–64. [DOI] [PubMed] [Google Scholar]
  • 57.Mimitou EP, Lareau CA, Chen KY, Zorzetto-Fernandes AL, Hao Y, Takeshima Y, et al. Scalable, multimodal profiling of chromatin accessibility, gene expression and protein levels in single cells [Internet]. Nature Biotechnology. Springer US; 2021. Available from: 10.1038/s41587-021-00927-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Stuart T, Butler A, Hoffman P, Hafemeister C, Papalexi E, Mauck WM, et al. Comprehensive Integration of Single-Cell Data. Cell [Internet]. 2019;177(7):1888–1902.e21. Available from: 10.1016/j.cell.2019.05.031 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.McGinnis CS, Murrow LM, Gartner ZJ. DoubletFinder: Doublet Detection in Single-Cell RNA Sequencing Data Using Artificial Nearest Neighbors. Cell Syst [Internet]. 2019;8(4):329–337.e4. Available from: 10.1016/j.cels.2019.03.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Korsunsky I, Millard N, Fan J, Slowikowski K, Zhang F, Wei K, et al. Fast, sensitive and accurate integration of single-cell data with Harmony. Nat Methods [Internet]. 2019;16(12):1289–96. Available from: 10.1038/s41592-019-0619-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Stuart T, Srivastava A, Lareau C, Satija R. Multimodal single-cell chromatin analysis with Signac. bioRxiv [Internet]. 2020. Jan 1;2020.11.09.373613. Available from: http://biorxiv.org/content/early/2020/11/10/2020.11.09.373613.abstract [Google Scholar]
  • 62.Schep AN, Wu B, Buenrostro JD, Greenleaf WJ. ChromVAR: Inferring transcription-factor-associated accessibility from single-cell epigenomic data. Nat Methods. 2017;14(10):975–8. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Table S4

Details of ASAP-seq surface panel (Excel spreadsheet).

Table S3

Top transcription factor activity scores for JCs (Excel spreadsheet).

Table S1

Summary statistics of differentially expressed genes between JCs and amDCs (Excel spreadsheet).

Table S2

Summary statistics of genes most over-expressed in JCs (Excel spreadsheet).

Data file S1

Raw data file (Excel spreadsheet).

Supplemental Material
MDAR reproducibility checklist

RESOURCES