Abstract
Establishing and maintaining tolerance to self-antigens or innocuous foreign antigens is vital for the preservation of organismal health. Within the thymus, medullary thymic epithelial cells (mTECs) expressing autoimmune regulator (AIRE) have a critical role in self-tolerance through deletion of autoreactive T cells and promotion of thymic regulatory T (Treg) cell development1–4. Within weeks of birth, a separate wave of Treg cell differentiation occurs in the periphery upon exposure to antigens derived from the diet and commensal microbiota5–8, yet the cell types responsible for the generation of peripheral Treg (pTreg) cells have not been identified. Here we describe the identification of a class of RORγt+ antigen-presenting cells called Thetis cells, with transcriptional features of both mTECs and dendritic cells, comprising four major sub-groups (TC I–TC IV). We uncover a developmental wave of Thetis cells within intestinal lymph nodes during a critical window in early life, coinciding with the wave of pTreg cell differentiation. Whereas TC I and TC III expressed the signature mTEC nuclear factor AIRE, TC IV lacked AIRE expression and was enriched for molecules required for pTreg generation, including the TGF-β-activating integrin αvβ8. Loss of either major histocompatibility complex class II (MHCII) or ITGB8 by Thetis cells led to a profound impairment in intestinal pTreg differentiation, with ensuing colitis. By contrast, MHCII expression by RORγt+ group 3 innate lymphoid cells (ILC3) and classical dendritic cells was neither sufficient nor required for pTreg generation, further implicating TC IV as the tolerogenic RORγt+ antigen-presenting cell with an essential function in early life. Our studies reveal parallel pathways for the establishment of tolerance to self and foreign antigens in the thymus and periphery, respectively, marked by the involvement of shared cellular and transcriptional programmes.
Subject terms: Mucosal immunology, Immune tolerance
Single-cell transcriptomic and epigenetic analysis has enabled the identification of Thetis cells, a class of RORγt+ antigen-presenting cells with a key role in the differentiation of commensal microbiota-induced peripheral regulatory T cells.
Main
In the thymus, a distinct lineage of epithelial cells establishes tolerance to self-antigens through deletion of autoreactive T cells and promotion of Treg cell differentiation1–4. These functions of mTECs are mediated in part through the expression of AIRE, which regulates the ectopic expression of tissue-restricted antigens1. Another major site of tolerance induction resides within intestinal lymphoid tissue, where an infant’s developing immune system is exposed to many new dietary components and colonizing microbes upon weaning. Establishing a harmonious host–microbiota relationship in this early life developmental window is critical to prevent later onset of immune-mediated disorders7,9. Central to the establishment of tolerance to intestinal microbes is the differentiation of naive T cells to peripherally generated Treg (pTreg) cells upon encounter with commensal-derived antigens5,6,10,11. Yet the identity of the antigen-presenting cell (APC) that promotes pTreg cell differentiation is not known. The narrow time window for establishing intestinal immune homeostasis suggests the presence of a developmentally restricted tolerogenic APC within the neonatal intestinal niche.
RORγt+ APCs promote pTreg differentiation
Extra-thymic pTreg cells, distinguished from their thymic counterparts by expression of the orphan nuclear receptor RORγt, arise in mesenteric lymph nodes (mLN) in response to commensal bacterial antigens and have a critical role in suppressing inflammatory immune responses against gut microbes6,11. Conversely, mice deficient in MHCII-restricted antigen presentation by RORγt+ cells (MHCIIΔRORγt), develop severe intestinal inflammation because they do not establish tolerance to commensal bacteria12, suggesting a potential connection between RORγt+ APCs and RORγt+ pTreg cell generation. To address this possibility, we analysed MHCIIΔRORγt mice at three weeks of age, when pTreg cells accumulate in the intestine5,6. We observed a marked reduction in RORγt+ pTreg cells within the mLN and colonic lamina propria, along with an expansion of CD44hi T effector (Teff) cells (Fig. 1a–d and Extended Data Fig. 1a). At eight weeks of age, these mice exhibited a sustained, severe reduction in RORγt+ pTreg cells along with expansion of colonic T helper 17 (TH17) cells (Fig. 1e and Extended Data Fig. 1b), in line with previous studies demonstrating a prominent role for pathobiont-specific RORγt+ pTreg cells in suppressing inflammatory TH17 cells11. Histological analysis demonstrated severe colitis with marked inflammatory cell infiltrate, mucosal ulceration and loss of crypts (Fig. 1f,g), confirming a critical role for RORγt+ APCs in preventing dysregulated intestinal immune responses.
Given previous reports of a developmental window for intestinal immune tolerance7,8, we addressed whether the pTreg cell deficit in adult MHCIIΔRORγt mice reflected a failure to generate pTreg cells in early life or a continuous requirement for RORγt+ APC-instructed pTreg cell differentiation. We therefore generated a RorcVenus-creERT2 allele for identification and temporal manipulation of RORγt+ cells (Extended Data Fig. 2a,b). Analysis of RorgtcreRosa26lsl-tdTomatoRorcVenus-creERT2 mice confirmed that expression of Venus protein, translated downstream of exon 11, faithfully reflected expression of the RORγt isoform within the mLN and large intestine (Extended Data Fig. 2c). Surprisingly, continuous ablation of MHCII on RORγt+ APCs in adult RorcVenus-creERT2H2-Ab1fl/fl mice treated with tamoxifen from 8–13 weeks of age resulted in only a modest decrease in pTreg cells within the mLN and LI (Extended Data Fig. 2d,e), indicating a minimal contribution of de novo pTreg cell differentiation to the pTreg cell pool of adult mice with stable microbial communities. Together, these results demonstrate an essential role for an early life RORγt+ APC in pTreg cell generation and raise the question of the nature of the tolerogenic RORγt+ APC.
Identification of a novel lineage of RORγt+ APCs
A number of candidate APC types have been suggested to regulate tolerance to the intestinal microbiota, including dendritic cells and MHCII+ ILC3s (also known as lymphoid tissue inducer (LTi)-like cells). Among these, the loss of tolerance to commensals in MHCIIΔRORγt mice has previously been attributed to ILC3s on the assumption that they represent the only RORγt+MHCII+ cell type12,13. However, recent studies have identified RORγt-expressing dendritic cells14,15 as well as RORγt+AIRE+ cells16, which were initially described as ‘ILC3-like’ cells but were subsequently shown to more closely resemble dendritic cells17. The role of these cells in immune tolerance remains unknown. Critically, the spectrum of RORγt+ APCs has not been examined within the mLN at the time when pTreg cells first arise. RORγt+ pTreg cells first appeared within the mLN between postnatal day (P)10 and P14, with rapid accumulation thereafter (Extended Data Fig. 3a). We therefore performed paired single-cell RNA sequencing (RNA-seq) and single-cell assay for transposase-accessible chromatin using sequencing (scATAC-seq) of CD45+Lin–RORγt(Venus)+MHCII+ cells isolated from mLN at two weeks of age (Fig. 2a and Extended Data Fig. 3b). After quality filtering, we retained chromatin accessibility and transcriptional profiles for 10,145 cells. Unsupervised clustering of either the scRNA-seq or scATAC-seq data revealed two major cell types (Fig. 2b–d and Extended Data Fig. 3c). The first cell type represented ILC3s, spanning their full developmental spectrum including a RAG1+ ILC3 progenitor (ILC3p), proliferating and mature NCR+ ILC3s, and CCR6+ LTi cells (Fig. 2d and Extended Data Fig. 3c,d). The second cell type did not express canonical innate lymphoid cell (ILC) genes. This population, which was distinguished by a combination of both epithelial and dendritic cell-associated transcription factors and cell-surface molecules, consisted of four subsets and a small cluster of proliferating cells (Fig. 2b–d and Extended Data Fig. 3c–e). Although these cells expressed the dendritic cell marker Zbtb46, this transcript was also highly expressed by MHCII+ ILC3s, a finding that was confirmed by analysis of Zbtb46GFPRorgtcreRosa26lsl-tdTomato mice (Extended Data Fig. 3f). To elucidate the identity of non-ILC3 RORγt+ APCs, we compared the similarity of pseudo-bulk transcriptomes across a comprehensive database of immune and stromal cells (ImmGen). As expected, ILC3 scRNA-seq clusters aligned with ILC3s, whereas the remaining clusters exhibited surprisingly high correlation with both mTECs and dendritic cells (Fig. 2e), including specific expression of AIRE, the signature mTEC transcription factor, in clusters I and III (Fig. 2f). Independent cross-referencing of these cells with immune and epithelial cell atlases using CellTypist18, a machine learning tool for precise cell-type annotation, also showed their overlapping transcriptional features with both dendritic cells and a generic epithelial cell (Extended Data Fig. 3g,h), consistent with their expression of p63, a critical regulator of epithelial cell differentiation19,20. In light of the ‘shape-shifting’ hybrid phenotype of this group of RORγt+ APCs, we refer to these cells as Thetis cells. Comparison of Thetis cell cluster identity defined by chromatin accessibility or gene expression revealed near perfect congruence (Extended Data Fig. 2f), confirming that Thetis cells comprise four distinct cell types (TC I–TC IV). Analysis of pseudo-bulk transcriptomes for Thetis cell sub-groups alongside published single-cell thymic epithelial transcriptomes21 demonstrated overlap with distinct clusters of mature mTEC subsets, in particular mature AIRE+ (mTEC II) and ‘post-AIRE’ (mTEC III) subsets (Fig. 2g). Overall, these data demonstrated the existence of a novel RORγt+ cell type present within intestinal lymph nodes during early life.
The phenotypic landscape of Thetis cells
Extra-thymic AIRE expression has previously been reported in migratory CCR7+ dendritic cells22,23. Of note, the gene expression signature that distinguishes CCR7+ dendritic cells from their CCR7– counterparts does not define the dendritic cell lineage, but rather represents a particular transcriptional programme that can be acquired by classical dendritic cell subsets cDC1 and cDC2, as well as other APC types24,25, reflecting enhanced cell migration, T cell priming capacity and expression of immune-regulatory molecules25. The shared expression of AIRE in Thetis cells and CCR7+ dendritic cells prompted us to examine the relationship between these two cell types. Analysis of RorcVenus-creERT2AireGFP mice confirmed widespread AIRE (indicated by GFP) expression by Lin–CXCR6–CD11c+MHCII+CCR7+ cells encompassing both CCR7+ DC1 and DC2 (Extended Data Fig. 4a,b); however, less than 4% of CCR7+ cells expressed RORγt. Moreover, CXCR6–RORγt+MHCII+ Thetis cells were also found among CCR7– and CD11c−MHCII+ cell populations (Extended Data Fig. 4a,b), suggesting that Thetis cells were distinct from CCR7+ dendritic cells. To gain further insight into the distinguishing features of AIRE-expressing Thetis cells, dendritic cells and mTECs, we performed orthogonal SMART-seq2 scRNA-seq analysis of Lin–RORγt+MHCII+ cells isolated from the mLN of three-week-old RorcVenus-creERT2 mice in parallel with mLN AIRE+ dendritic cells and AIRE+ mTECs from age-matched AireGFP mice (Extended Data Fig. 4c–e). Clustering analysis, combined with mapping of SMART-seq2 transcriptomes to the droplet 10X dataset, confirmed the presence of LTi-like ILC3 and TC I–TC IV clusters (Fig. 3a and Extended Data Fig. 4f,g). Within the SMART-seq2 dataset, AIRE+ mTECs clustered with TC I (Extended Data Fig. 4f,g). Nevertheless, a direct comparison revealed unique expression of epithelial genes (Foxn1, Krt17 and Krt8), the thymic marker gene Tbata, and Fezf2 by mTECs, whereas TC I expressed genes associated with dendritic cells (Ccr7, Cd83 and Dpp4) (Extended Data Fig. 4h and Supplementary Table 1). In addition, Thetis cells exclusively expressed Ptprc (CD45) and RORγt (Extended Data Fig. 4i), a finding confirmed by analysis of mTECs from RorcVenus-creERT2 mice (Extended Data Fig. 4j). Despite overlapping markers, Thetis cells clustered separately from AIRE+ dendritic cells (Extended Data Fig. 4f,g), distinguished by a number of immune-regulatory genes (Extended Data Fig. 4k and Supplementary Table 2), underscoring the distinct identity of these two cell types. Furthermore, AIRE protein expression was readily detectable in 10–15% of the Thetis cell population, but not in dendritic cells (Fig. 3b), consistent with the lower level of Aire transcript observed in dendritic cells.
Index-sorting analysis of cell-surface markers revealed that Thetis cells spanned a spectrum from CD11c–/lo (TC I) to CD11chi (TC II–TC IV) cells (Fig. 3c and Extended Data Fig. 5a). In addition, TC IV was distinguished by high levels of CD11b expression. These findings suggest that RORγt+CD11c+CD11b+MHCII+ cells, previously identified among Tbet– cDC2B15, represent TC IV. TC II–TC IV expressed higher levels of RORγt (indicated by Venus) than ILC3s (Fig. 3c), reflecting Rorgt promoter activity (Extended Data Fig. 5b). Although TC I expressed lower levels of RORγt, all Thetis cell subsets expressed sufficient levels of Rorgt to drive Rorgt-cre-mediated recombination, as evidenced by the near universal expression of tdTomato by CXCR6–Venus(RORγt)+MHCII+ Thetis cells isolated from RorcVenus-creERT2RorgtcreRosa26lsl-tdTomato mice (Extended Data Fig. 5c). Of note, ILC3s, traditionally identified as CD90+ cells, encompassed both CD90– and CD90+ cells and did not express CD11c (Fig. 3c and Extended Data Fig. 5d). To assess whether the partially overlapping transcriptional features of Thetis cells with dendritic cells and mTECs were coupled to similar or distinct morphological attributes, we analysed Thetis cells by electron microscopy. Dendritic cells, AIRE+ mTECs and MHCII+ ILC3 cells served as reference populations. Although CD11c–RORγt+AIRE+ cells have previously been described as lymphoid16, electron microscopy analysis revealed that both CD11c–AIRE+ TC I and CD11c+ Thetis cells more closely resembled myeloid cells, in contrast to MHCII+ ILC3s, which had a typical lymphoid appearance (Extended Data Fig. 5e). Thetis cells were distinct from classical CCR7+ and CCR7– dendritic cells as well as mTECs, and featured distinctive mitochondria with rounded, condensed cristae. To further probe Thetis cell localization and morphology, we examined their spatial distribution in the mLN of AireGFP mice at P18. Immunofluorescence staining confirmed the presence of AIRE+ Thetis cells (TC I and TC III) as well as AIRE–CD11b+ TC IV, with TC IV preferentially located within the paracortex (Extended Data Fig. 5f). Analysis of mLN from RorcVenus-creERT2 mice confirmed similar spatial distributions of TC IV and RORγt+ pTreg cells (Extended Data Fig. 5g,h).
Thetis cell ontogeny
To address the ontogeny of Thetis cells, we first analysed dendritic cell fate-mapping Clec9acre/wtRosa26lsl-tdTomatoRorcVenus-creERT2AireGFP reporter mice in which both cDC1 and cDC2 are labelled owing to CLEC9A expression in dendritic cell progenitors26. In contrast to both AIRE+ and AIRE– DC2s, less than 5% of Thetis cells were tdTomato+ (Extended Data Fig. 6a), probably reflecting the small proportion of Thetis cells with detectable Clec9a expression (Extended Data Fig. 3e), rather than descendancy from a CLEC9A+ pre-dendritic cell. Given reports of a lymphoid pathway for cDC differentiation27, we next used a novel Rag1RFP-creERT2Rosa26lsl-YFP mouse to fate-map progeny of lymphoid progenitor cells following neonatal administration of 4-hydroxytamoxifen (4-OHT). In contrast to labelling of T cells and ILC3s, YFP+ cells were absent among Lin–MHCIIhiCXCR6– cells encompassing Thetis cells (Extended Data Fig. 6b). These findings suggested that Thetis cells are ontogenically and transcriptionally distinct from classical dendritic cells. Thus, the overlapping phenotype of Thetis cells and dendritic cells, in particular CCR7+ dendritic cells, probably reflects shared functions related to cell migration and antigen presentation rather than shared ontogeny (Extended Data Fig. 6c). RORγt+AIRE+ cells have previously been suggested to be related to ILC3s16,28, however, the absence of Rag1 fate-mapped Thetis cells suggested that Thetis cells were not descended from the RAG1+ ILC3p identified within our scRNA-seq dataset (Extended Data Fig. 3c). Furthermore, Thetis cells did not express RORα (Fig. 3d), a critical gene for ILC development that is expressed by ILC precursors (ILCp) and mature ILC subsets29–31. The exclusive expression of RORα by MHCII+ ILC3s enabled us to determine the lineage relationship between ILCp and Thetis cells using RorcVenusRoracreRosa26lsl-tdomato fate-mapping mice. Within the mLN, around 90% of LTi cells and 70% of ILC1 and natural killer cells were tagged with tdTomato, compared with less than 2% of Thetis cells (Fig. 3e and Extended Data Fig. 6d), confirming that Thetis cells are not developmentally related to ILCs. To address the possibility of the reverse relationship—that Thetis cells are the precursor to ILC3s—we analysed single-cell transcriptional dynamics using single-cell RNA velocity32 to computationally define lineage relationships between Thetis cells and ILC3s. This analysis identified established differentiation trajectories emanating from ILC3p to NCR+ ILC3s and LTi cells (Extended Data Fig. 6e). In contrast, no ‘connectivity’ was observed between Thetis cells and ILC3s in either direction. Although IL7R is expressed by ILCs and a subset of Thetis cells (Extended Data Fig. 6f), it is also expressed by CCR7+ dendritic cells23 (Extended Data Fig. 6g), consistent with recently reported IL7R-cre fate-mapping in around 25% of cDCs33, precluding this approach for lineage tracing ILCs, Thetis cells and dendritic cells. Together these results demonstrate that Thetis cells represent a new class of cells, distinct from both ILCs and classical dendritic cells.
To determine the transcription factors that regulate Thetis cell differentiation and heterogeneity, we turned to our scATAC-seq data, integrating differential transcription factor motif activity with gene expression. This analysis identified activity of canonical ILC3 transcription factors in ILC3s, including RORα, GATA3 and TCF1, as well T-BET (encoded by Tbx21) in NCR+ ILC3s (Fig. 3f), validating our approach. By contrast, Thetis cells were distinguished by activity of a unique group of transcription factors including Spi-B, a critical regulator of mTEC differentiation, as well as core transcription factors governing myeloid cell differentiation (PU.1 (encoded by Spi1), BCL11A, IRF8 and IRF4) (Fig. 3f), in agreement with their transcriptional overlap with both mTECs and dendritic cells. Notably, several of the signature Thetis cell transcription factors have been shown to regulate AIRE expression in mTECs34, suggesting a common transcriptional network between these two cell types. Together, these findings establish the unique identity of Thetis cells, delineating their shared and distinct features with both mTECs and dendritic cells.
Transcriptional programmes of Thetis cell subsets
To gain insight into the potential functions of Thetis cell subsets, we examined their distinguishing transcriptional features (Fig. 3g). TC I expressed canonical AIRE+ mTEC genes, including Aire, Cd80, Cd86, Tnfrsf11b (which encodes OPG), and genes associated with neuronal adhesion, signalling and growth (Nrxn1, Nrn1 and Ncam1). Of note, a recent study of peripheral AIRE-expressing cells identified a population of ‘mTEC-like’ RORγt+AIRE+ cells within lymph nodes, similarly distinguished by neuronal genes17, probably representing AIRE+ TC I. TC II was distinguished by exclusive expression of several distinctive genes including Pigr and Cldn7, signature molecules for a group of mTECs with a history of AIRE expression21,35, further highlighting parallels between the mTEC and Thetis cell subsets. TC III expressed high levels of AIRE as well as Nlrc5, a critical regulator of MHC Class I genes. TC IV expressed immune-regulatory genes (Cd274) as well as genes associated with cell migration (Marcks and Cxcl16). Analysis of differential chromatin accessibility and motif enrichment across Thetis cell subsets suggested several subset-specific transcriptional regulators, further underpinning the observed heterogeneity of Thetis cells (Extended Data Fig. 7a).
To validate the observed Thetis cell phenotypes, we devised a panel of flow cytometry markers (Fig. 3h). MHCII+ ILC3s were distinguished from Thetis cells by the expression of CXCR6 (Fig. 3d,e,h). Among CXCR6–RORγt+MHCIIhi cells, we confirmed the presence of Thetis cell subsets expressing signature cell-surface markers (Fig. 3i) with the expected pattern of AIRE protein expression in TC I and TC III (Fig. 3j). Of note, CCR6 and SIGLEC-G, which have been suggested to be markers for RORγt+AIRE+ cells16,36,37, were expressed by TC I and TC II but not by TC III and TC IV (Fig. 3h,i). Thus, whereas AIRE+ TC I represent RORγt+AIRE+cells, referred to either as Janus cells or RORγt+ extra-thymic AIRE-expressing cells (eTACs) in two concurrent analyses of RORγt+ APCs36,37, TC II–TC IV extend the spectrum of non-ILC3 RORγt+MHCII+ cells beyond AIRE-expressing cells (Fig. 3k). Together, our analyses suggest that Thetis cell subsets are molecularly and functionally distinct and point to a role for TC IV in pTreg differentiation.
ILC3 and DCs are dispensable for pTreg generation
Given the overlapping phenotype of Thetis cells with professional APC types with known roles in T cell tolerance, we hypothesized that Thetis cells were the relevant RORγt+MHCII+ cell type for instructing pTreg cell differentiation. A direct comparison of Thetis cell and ILC3 transcriptomes, as well as cell-surface protein expression, confirmed that Thetis cells were enriched for molecules associated with antigen presentation, T cell activation and cell migration, in contrast to MHCII+ ILC3 cells (Fig. 4a,b and Extended Data Fig. 8a). Furthermore, in contrast to Thetis cells, we did not observe CCR7 protein expression on MHCII+ ILC3s (Fig. 4c), despite detectable Ccr7 transcript. To examine the antigen-presenting ability of Thetis cells, we analysed cell-surface I-Ab bound CLIP peptide on H2-Dma–/– or littermate wild-type Thetis cells, which confirmed efficient H-2M-mediated CLIP displacement (Extended Data Fig. 8b). Accordingly, staining with the 25-9-17s monoclonal antibody, which binds to a subset of non-CLIP peptide–I-Ab complexes38, demonstrated equivalent levels of expression by Thetis cells and classical dendritic cells (Extended Data Fig. 8c). To further examine MHC class II antigen presentation by Thetis cells, we bred RorcVenus-creERT2 with BALB/c mice and confirmed expression of an endogenously processed self-peptide Eα52-68 bound I-Ab, using the YAe monoclonal antibody (Extended Data Fig. 8d). To address the ability of Thetis cells to induce pTreg cells ex vivo, we co-cultured Thetis cell subsets (either CCR6+ TC I and TC II or CCR6– TC III and TC IV) with naive C7 TCR transgenic CD4+ T cells and their cognate peptide under suboptimal Treg-inducing conditions. Notably, Thetis cells demonstrated significantly greater ability to promote Treg differentiation relative to cDC2s with the greatest efficacy observed among TC III and TC IV (92.7% FOXP3+ for TC III and TC IV versus 39.4% for cDC2; Extended Data Fig. 8e). Together, these results suggest that Thetis cells are competent APCs.
Our earlier analysis identified exclusive expression of IL-22 by ILC3s (Fig. 3d), suggesting the potential utility of MHCIIΔIl22 mice to determine the role of antigen presentation by ILC3s in pTreg induction. However, consistent with the low frequency of fate-mapped MHCII+ILC3s in Il22creRosalsl-tdTomato mice (less than 5%; Extended Data Fig. 8f), MHCIIΔIl22 mice exhibited only minimal loss of MHCII expression by ILC3s with no effect on the pTreg cell population (Extended Data Fig. 8g). We therefore turned to Roracre mice as a means to selectively target ILC3s but not Thetis cells. Notably, ILC3s were the only MHCII+ RORα fate-mapped cell type within the mLN (Extended Data Figs. 6d and 8h), establishing MHCIIΔRORα mice as a genetic model for studying the functional role of ILC3 antigen presentation. Indeed, analysis of three-week-old MHCIIΔRORα mice confirmed a complete loss of MHCII expression on ILC3s, with no changes in Thetis cells (Fig. 4d and Extended Data Fig. 8i). Conspicuously, the intestinal T cell composition was not perturbed in MHCIIΔRORα mice, with equivalent proportions and numbers of CD4+ Teff and Treg cells, including RORγt+ pTreg cells, within the mLN and large intestine (Fig. 4e and Extended Data Fig. 8j,k). To exclude a role for ILC3s in pTreg differentiation in later life, we examined MHCIIΔRORα mice at 12 weeks of age. In contrast to MHCIIΔRORgt mice, MHCIIΔRORα adult mice had normal pTreg frequencies, with no evidence of altered T cell activation states (Fig. 4f), and lacked histological signs of colonic inflammation (Fig. 4g), further confirming that MHCII-mediated antigen presentation by ILC3s is not required for intestinal tolerance. Besides ILC3s, antigen presentation by sub-immunogenic dendritic cells is thought to favour T cell tolerance. Although Thetis cells may have been inadvertently targeted by studies using ‘dendritic cell-specific’ Cre drivers39 owing to expression of both CD11c and Zbtb46, the absence of Clec9a fate-mapped Thetis cells enabled us to revisit a role for classical dendritic cell in pTreg differentiation through analysis of Clec9acre/creH2-Ab1fl/fl (MHCIIΔDC) mice in which dendritic cells, but not Thetis cells, were rendered MHCII-deficient (Extended Data Fig. 8l). Notably, we did not observe changes in RORγt+FOXP3+ cells in these mice (Extended Data Fig. 8m). Overall, these findings demonstrate that MHCII antigen presentation by ILC3s or cDCs is dispensable for pTreg cell differentiation, leaving Thetis cells as the pTreg-inducing RORγt+ APC.
Thetis cells induce pTreg cells in early life
Given the narrow temporal window of opportunity for establishing intestinal immune tolerance, we hypothesized that the presence of Thetis cells might determine this developmental window. Our analysis of Thetis cell abundance in mice ranging in age from 7 days to 6 weeks revealed their marked enrichment between 1 and 3 weeks of age, with a rapid decline thereafter (Fig. 5a and Extended Data Fig. 9a). Notably, Thetis cells—in particular TC IV—were enriched in the mLN compared with skin-draining peripheral lymph nodes (pLN) (Fig. 5b). To determine the dynamics of neonatal Thetis cell differentiation, we used RorcVenus-creERT2Rosa26lsl-tdTomato mice to label RORγt-expressing cells and their progeny. Following treatment of mice with 4-OHT at P1, more than 60% of Thetis cells remained tdTomato+ at P7 (Extended Data Fig. 9b). This proportion fell to 15% by P14, although total numbers of both tdTomato– and tdTomato+ Thetis cells increased between P7–P14 (Fig. 5c), reflecting de novo Thetis cell differentiation or proliferation during this critical developmental window. Both the proportion of tdTomato+ Thetis cells and total cell numbers declined from P14, reflecting waning differentiation beyond this age (Fig. 5c and Extended Data Fig. 9b). By contrast, the proportion of fate-mapped MHCII+ ILC3s declined between P7 and 14 but remained stable thereafter (Extended Data Fig. 9b), consistent with the notion that ILC3s are maintained by self-renewal40. Thus, Thetis cells or putative RORγt+ Thetis cell progenitors are present at birth, and are prominently enriched within the mLN at the time of induction of intestinal tolerance during early life. Together these data suggest a critical window of opportunity for pTreg cell differentiation, determined by a wave of Thetis cells in the mLN in early life.
To determine whether Thetis cell counterparts exist in humans, we analysed a recent single-cell atlas encompassing second trimester to adult intestine and mLN41. Within a group of myeloid cells annotated as ‘lymphoid’ dendritic cells, we identified a cluster of cells distinct from but closely related to CCR7+ dendritic cells, that expressed signature Thetis cell genes (TNFRSF11B and SPIB), including high levels of AIRE (Extended Data Fig. 9c–e). Analysis of orthologous signature Thetis cell subset genes confirmed enrichment in the putative human Thetis cell cluster, most prominently for TC III- and TC IV-defining genes (Extended Data Fig. 9f). In contrast to CCR7+ dendritic cells, human Thetis cells were present almost exclusively in the mLN (Extended Data Fig. 9g) and were highly enriched in fetal samples (32% versus 3.8%; Extended Data Fig. 9h), implying a conserved role for Thetis cells in intestinal tolerance during early life.
The close temporal and spatial relationship between TC IV and pTreg cells supported a key role for this Thetis cell subset in pTreg differentiation. To determine whether the TC IV subset has distinct features supporting pTreg generation besides MHCII-dependent TCR stimulation, we used latent Dirichlet allocation—a probabilistic topic model—to capture shared and unique gene expression programmes. Notably, the TC IV subset was enriched for a ‘Treg-inducing’ module encompassing critical molecules including IL-2, the TGFβ-activating integrin ITGΒ8, CD40 and CCL22 (Fig. 5d). TGFβ signalling is known to be critical for pTreg cell differentiation. Activation of latent extracellular TGFβ requires physical interaction with integrin αvβ6 or integrin αvβ8, and loss of either TGFβ signalling on T cells, or ITGΒ8 expression by hematopoietic cells, leads to impaired pTreg differentiation and development of autoimmunity and colitis42,43. Analysis of TGFβ signalling pathway genes in Thetis cell transcriptomes confirmed high expression of both Itgav and Itgb8 in TC IV, as well as unique expression of Tgfb2 (Fig. 5e). To address the role of Thetis cells in TGFβ-mediated pTreg differentiation, we generated mice with conditional loss of the Itgb8 gene in RORγt+ cells (Itgb8ΔRORγt). Although ILC3s expressed Itgav, they did not express Itgb8, as determined by analysis of SMART-seq2 single-cell transcriptomes as well as Itgb8tdTomato reporter mice (Fig. 5f,g and Extended Data Fig. 10a,b). Furthermore, scATAC-seq analysis showed that the Itgb8 locus was inaccessible in ILC3s (Extended Data Fig. 10c), confirming that Itgb8ΔRORγt mice have a specific deficiency of ITGB8 in Thetis cells and T cells. Analysis of mLN and large intestine from three-week-old mice, revealed a substantial reduction in pTreg cell frequency and numbers (Fig. 5h,i), mirroring the loss of pTreg cells observed in MHCIIΔRORγt mice. Differentiation of pTreg cells was normal in Cd4creItgb8fl/fl mice (Extended Data Fig. 10d), indicating that TGFβ activation by Thetis cells—and not by T cells—is a critical mechanism for intestinal pTreg cell differentiation.
The unique expression of Itgb8 by Thetis cells enabled us to test the requirement for TGFβ activation and antigen presentation by the same cell. Using bone marrow chimeras generated with a mix of Itgb8ΔRORγt and MHCIIΔRORγt bone marrow cells—in which both ILC3s and Thetis cells can present antigen, but the same Thetis cell cannot present antigen and activate TGFβ—we found that pTreg differentiation was critically dependent on antigen presentation by Itgb8-expressing Thetis cells, with an equivalent deficit in pTreg cells observed in MHCIIΔRORγt and MHCIIΔRORγt/Itgb8ΔRORγt chimeras (Fig. 5j), excluding the possibility of redundant and compensatory functions between Thetis cells and ILC3s. Overall, our findings identify a new class of APCs that is prominent in the mLN during a critical early life window, and demonstrate an essential role for the TC IV subset in establishing intestinal tolerance through the generation of pTreg cells (Fig. 5k).
Discussion
Contrary to the view that ‘neonatal immune privilege’—first demonstrated by Medawar in the 1950s44,45—results from the presence of immunosuppressive or ‘immature’ dendritic cells with an inferior stimulatory capacity, our results suggest the existence of a dedicated tolerogenic APC type that is enriched in early life. The requirement for MHC class II antigen presentation by Thetis cells in early life but not in adulthood provides support for a model in which a tightly regulated wave of Thetis cell differentiation during a narrow developmental window imprints durable microbiota-specific T cell tolerance. Of note, pTreg cell abundance in adulthood is determined by cues sensed during the first week of life8, coincident with the observed wave of Thetis cell differentiation, suggesting that modulation of Thetis cell development may have lasting effects on intestinal immune tolerance.
A defining feature of Thetis cells is their expression of RORγt. Our finding of overlapping markers between Thetis cells and dendritic cells resolves previous conflicting reports on RORγt+ dendritic cells and their relationship to ILC3s15,33. Selective targeting of cDCs or ILC3s, enabled by identification of cell-type-specific genes, demonstrated that neither ILC3s nor dendritic cells, contribute to mucosal tolerance. Whereas the precise ontogeny of Thetis cells remains to be established, a notable finding was their expression of transcription factors and markers typically associated with mTECs. A recent study highlighted the existence of hybrid cell types that emerge from AIRE+ mTECs in the thymus35. Our discovery of Thetis cells further challenges the current view of boundaries between cell lineages highlighting shared transcriptional programmes between haematopoietic and non-haematopoietic cells that may support common purposes. Within the thymus, AIRE+ mTECs instruct T cell tolerance through negative selection and neonatal thymic Treg generation1,22. The essential role of Thetis cells in neonatal pTreg differentiation highlights the symmetry between thymic and peripheral tolerance pathways. The expression of AIRE by TC I and TC III suggests the possibility that Thetis cells may share additional functions with mTECs, such as tolerance to self-antigens.
Here we have identified a novel tolerogenic APC type that is enriched in the intestine during a critical early life period when host–microbiota symbiosis is first established. The finding that TC IV instructs extra-thymic Treg cell generation provides a cellular basis for the reported early life window for the establishment of intestinal immune tolerance. Future exploration of Thetis cell biology may yield key insights into mechanisms of immune tolerance and the pathogenesis of autoimmune and inflammatory disease.
Methods
Mice
RorcVenus-T2A-creERT2 mice were generated by insertion of a targeting construct into the Rorc 3-UTR by homologous recombination in embryonic stem (ES) cells on the C57Bl/6 background. The IRES-Venus-T2A-creER-frt-NeoR-frt cassette targeting construct was created by cloning. Homologous arms were retrieved from BAC clone RP24-209K20. To facilitate ES cell targeting, the CRISPR–Cas9 system was used. The guide RNA was transcribed in vitro using the MEGA shortscript T7 kit (Life Tech Corp, AM1354) using recombineering techniques. The targeting vector, Cas9 protein (Fisher Scientific, A36498 Truecut Cas9 Protein v2) and guide RNA were co-electroporated into G1 ES cells derived from an F1 hybrid blastocyst of 129S6 × C57BL/6J. The resulting chimeras were bred with FLPeR mice to excise the NEO cassette. Rag1RFP-creERT2 (C57BL/6-Tg(Rag1-RFP,-cre/ERT2)33Narl) mice, obtained from the Rodent Model Resource Center, were generated by insertion of a BAC transgene comprising the Rag1 promoter and RFP-IRES-creERT2 into ES cells from C57Bl/6 mice.
Adig(AireGFP), Clec9acre, Roracre, Itgb8fl/fl, Cd4cre, H2-Dma–/–, C7 and Itgb8tdTomato mice have been previously described26,46–52. Rorgtcre, H2-Ab1fl/fl, R26lsl-tdTomato, R26lsl-YFP, Zbtb46GFP, Il22cre, C57Bl/6 (CD45.2), CD45.1 and BALB/c mice were purchased from Jackson Laboratories. Generation and treatments of mice were performed under protocol 21-05-007 and 08-10-023, approved by the Sloan Kettering Institute (SKI) Institutional Animal Care and Use Committee. All mouse strains were maintained in the SKI animal facility in specific pathogen free conditions in accordance with institutional guidelines and ethical regulations. Both male and female mice were included in the study and we did not observe sex-dependent effects. All mice analysed were age and litter matched unless otherwise specified. All animals used in this study had no previous history of experimentation and were naive at the time of analysis.
Tamoxifen diet
RorcVenus-creERT2H2-Ab1fl/fl and littermate RorcVenus-creERT2H2-Ab1fl/fwt mice were placed on a tamoxifen citrate–containing diet (TD.130860; Envigo) at eight weeks of age for five weeks.
Tissue processing
Mice were euthanized by CO2 inhalation. Organs were collected and processed as follows. Lymphoid organs were digested in collagenase in RPMI1640 supplemented with 5% fetal calf serum, 1% l-glutamine, 1% penicillin–streptomycin, 10 mM HEPES, 1 mg ml−1 collagenase A (Sigma, 11088793001) and 1 U ml−1 DNase I (Sigma, 10104159001) for 45 min at 37 °C, 250 rpm. Large intestine was removed, flushed with PBS and incubated in PBS supplemented with 5% fetal calf serum, 1% l-glutamine, 1% penicillin–streptomycin, 10 mM HEPES, 1 mM dithiothreitol and 1 mM EDTA for 15 min to remove the epithelial layer. Samples were washed and incubated in digest solution for 30 min. Ceramic beads (0.25 inch) (MP Biomedicals, 116540034) were added to large intestine samples (3 per sample) to aid in tissue dissociation. Digested samples were filtered through 100-μm strainers and centrifuged to remove collagenase solution. Thymus samples were minced with scissors followed by enzymatic digestion in RPMI1640 supplemented with 10% fetal calf serum, 1% l-glutamine, 10 mM HEPES, 62.5 μg ml−1 Liberase and 0.4 mg ml−1 DNase I. Density-gradient centrifugation using a 3-layer Percoll gradient with specific gravities of 1.115, 1.065 and 1.0 was used to enrich for stromal cells for flow cytometric analysis. For sorting of mTECs, single-cell suspension of digested thymocytes were depleted of CD45+ cells using CD45 microbeads (Miltenyi Biotec).
Flow cytometry
For flow cytometric analysis, dead cells were excluded either by staining with LIVE/DEAD Fixable Violet or Zombie NIR in PBS for 10 min at 4 °C, prior to cell-surface staining. Cells were then incubated with anti-CD16/32 in staining buffer (2% FBS, 0.1% Na azide, in PBS) for 10 min at 4 °C to block binding to Fc receptors. Extracellular antigens were stained for 20–30 min at 4 °C or room temperature (CCR7 staining) in staining buffer. For intracellular protein analysis, cells were fixed and permeabilized with Cytofix (BD Biosciences) and/or Ebioscience FOXP3 kit, per manufacturer instructions. Intracellular antigens were stained for 30 min at 4 °C in the respective 1× perm/wash buffer or overnight for intracellular AIRE staining. Live cells were treated with DNase (0.08 U ml−1) for 10 min at room temperature and washed with staining buffer prior to acquisition on a BD LSR or Cytek Aurora. 123count eBeads were added to quantify absolute cell numbers. The antibodies used for flow cytometry and FACS are listed in Supplementary Table 3. Unless otherwise stated, we used the following gatings: Thetis cells: Lin (SIGLEC-F, TCRβ, TCRγδ, CD19, B220, NK1.1)–CD64–Ly6C–RORγt (intracellular staining or expression of Venus in RorcVenus-creERT2 mice) CXCR6–MHCII+; MHCII+ ILC3s: Lin–CD64–Ly6C–RORγt (intracellular staining or expression of Venus in RorcVenus-creERT2 mice) CXCR6+MHCII+, and DC2s: Lin–CD64–Ly6C–RORγt–CD11c+MHCII+CD11b+.
Histological analysis of intestinal inflammation
Mice were euthanized by CO2 inhalation and large intestines were collected and immediately placed into 10% formalin. Histopathological assessment for inflammation scoring in the intestine was performed on H&E-stained sections based on established scoring systems for intestinal inflammation in mouse models53. Assessment includes severity and extent of inflammatory cell infiltrates, epithelial changes and mucosal architecture changes. In brief, the severity and extent of inflammatory cell infiltrates were evaluated histologically. Other evaluations include proliferation of epithelial cells lining the mucosa villous atrophy, crypts, loss of goblet cells, crypt abscesses, erosions and ulceration.
Multiome scRNA and scATAC-sequencing
For scRNA-seq and scATAC-seq of RORγt+MHCII+ cells, mLN from 2-week-old (P14–P17) RorcVenus-creERT2 mice were pooled from 16 biological replicates and processed as described earlier. Cells were depleted of Lin+ (TCRb, TCRγδ, CD19, B220, NK1.1)+ cells by staining with biotinylated antibodies followed by magnetic bead negative selection. Cells were incubated with anti-CD16/32 in sorting buffer (2% FBS in PBS) for 10 min at 4 °C to block binding to Fc receptors. Extracellular antigens were stained for 30 min at 4 °C in sorting buffer (2% FBS, 2mM EDTA, in PBS). Cells were washed and resuspended in sorting buffer with SYTOX blue (Invitrogen) for exclusion of dead cells. Live, CD45+Lin(SIGLEC-F, TCRβ, TCRγδ, CD19)–RORγt(Venus)+ MHCII+ cells were then sort purified. Cells were sorted into cRPMI, before being pelleted and resuspended in RPMI-2% FBS. Single-cell multiome ATAC and gene expression analysis was performed with the 10X genomics system using Chromium Next GEM Single Cell Multiome Reagent Kit A (catalogue no. 1000282) and ATAC Kit A (catalogue no. 1000280) following the user guide for the Chromium Next GEM Single Cell Multiome ATAC + Gene Expression Reagent Kits and the demonstrated protocol for nuclei isolation for single-cell multiome ATAC and gene expression sequencing. In brief, >50,000 cells (viability 95%) were lysed for 4 min and resuspended in Diluted Nuclei Buffer (10x Genomics, 2000207). Lysis efficiency and nuclei concentration was evaluated on Countess II automatic cell counter by trypan blue staining. Nuclei (9,660 per transposition reaction) were loaded, targeting recovery of 6,000 nuclei after encapsulation. After the transposition reaction, nuclei were encapsulated and barcoded. Next-generation sequencing libraries were constructed following the manufacturer’s instructions, and were sequenced on an Illumina NovaSeq 6000 system.
Plate-based Smart-seq2 sequencing
RORγt+MHCII+ cells were enriched from a pool of mLN from three-week-old (P21) RorcVenus-creERT2 mice. Cells were depleted of Lineage (TCRβ, TCRγδ, CD19, B220, NK1.1)+ cells via staining with biotinylated antibodies followed by magnetic bead negative selection. Live, Lin(CD3, TCRβ, TCRγδ CD19, B220, NK1.1)–CD64–Ly6C–MHCII+RORγt(Venus)+ cells were then sorted into single wells. Cells were stained for CD90, CD11c and CD11b for acquiring index-sorting information on cell-surface expression and equal numbers of CD90–CD11c+, CD90–CD11c–, CD90int and CD90hi cells were targeted for sorting to ensure representation of all cell types. AIRE+ mTECs were enriched from a pool of thymi from three-week old mice via staining with biotinylated antibodies against CD45 followed by magnetic bead negative selection. CD45–Epcam+MHCII+AIRE(GFP)+ cells were sorted into single wells. AIRE+ dendritic cells were enriched from a pool of MLN from the same three-week old mice. Cells were depleted of Lin+ cells as described above and live, Lin(CD3, TCRβ, TCRγδ CD19, B220, NK1.1)–CD90–CD64–Ly6C–CD11c+MHCII+AIRE(GFP)+ cells were then sorted into single wells. Retrospective index-sorting analysis confirmed that AIRE(GFP)+ cells were CD11cloMHCIIhi, representing CCR7+ dendritic cells.
Single cells were sorted into Buffer RLT (Qiagen). Cell lysates were immediately sealed and spun down before transferring to dry ice and storing at −80 °C. RNA was purified using the Agencourt RNAClean XP beads (Beckman Coulter) at a 2.2× ratio. First-strand cDNA synthesis was achieved using Maxima H Minus Reverse Transcriptase (ThermoFisher) according to the manufacturer’s protocol using oligo dT primers, with the addition of a custom template-switch oligo in a 1 mM final concentration. cDNA was amplified for 24 cycles using KAPA HiFi HotStart ReadyMix (Kapa Biosystems KK2601). After PicoGreen quantification, 0.1–0.2 ng of cDNA was used to prepare libraries with the Nextera XT DNA Library Preparation Kit (Illumina) in a total volume of 6.25 µL with 12 cycles of PCR. Indexed libraries were pooled by volume and cleaned by aMPure XP beads (Beckman Coulter) at a 1× ratio. Pools were sequenced on a HiSeq 4000 in a PE50 or PE100 run using the HiSeq 3000/4000 SBS Kit (Illumina). An average of 1.8 million paired reads were generated per sample and the percent of mRNA bases per sample averaged 63%.
Mouse scRNA-seq and scATAC-seq computational analysis
Pre-processing of the 10X multiome scRNA-seq and scATAC-seq for RORγt+MHCII+ cells
scRNA-seq and scATAC-seq FASTQ files were aligned to mm10 (Cell Ranger mouse reference genome mm10-2020-A-2.0.0) and counted by Cell Ranger ARC v2.0.0 with default parameters. The barcodes were filtered based on the number of RNA-seqtranscripts (>1,000 and < 50,000), the number of detected genes (>500 and < 6,000), and the fraction of mitochondrial transcripts (<15%). Barcodes were further filtered based on the number of scATAC-seq fragments (3.5 < log10(number of fragments) < 4.5) and transcription start site enrichment score (>4). Arrow files were created from the scATAC-seq fragments using ArchR v1.0.154, and doublets were identified and removed with default parameters. Finally, any genes detected in <2 cells in the scRNA-seq data were discarded, leaving 20,779 genes. After clustering, the scRNA-seq data (described in ‘Dimensionality reduction, cell clustering, and visualization’), and based on the expression of marker genes, we identified 5 minor contaminant clusters (glial cells; cluster 17, plasmacytoid dendritic cell; cluster 18, Rorc–/lo CCR7+ dendritic or Thetis cell; cluster 19, mixed monocyte/cDC1; cluster 20, and macrophage; cluster 21) which were excluded from downstream analyses. In total, 10,145 cells remained, with a median scRNA-seq library size of 3,150 and a median of 13,885 scATAC-seq fragments.
Pre-processing of the Smart-seq2 scRNA-seq dataset
Smart-seq2 sequencing data from demultiplexed samples was aligned to the mouse reference genome using STAR v2.7.7a55 with ‘--twopassMode Basic --outFilterMultimapNmax 1 --quantMode TranscriptomeSAM’. Sequence reads were aligned and annotated using a STAR index created from GENCODE GRCm38 (mm10) release M25 primary assembly genome and gene annotations56. Alignment files were individually name-sorted using Samtools v1.1157, and then used to create a cell-by-gene count matrix using featureCounts58 (subread v2.0.1). The count matrix was filtered based on the number of transcripts (>50,000), number of detected genes (>1,300), and the fraction of mitochondrial transcripts (<8%). Finally, genes detected in <2 cells were discarded. A total of 481 cells remained, with a median library size of 924,319 from 27,195 genes.
Dimensionality reduction, cell clustering, and visualization
For each scRNA-seq dataset, the filtered count matrix was library-size-normalized, log-transformed (‘log-normalized’ expression values) and then centred and scaled (‘scaled’ expression values) using Seurat v4.0.4. Principal component analysis (PCA) was performed on the scaled data (total number of principal components = 50). PhenoGraph clustering59 was performed using the first N principal components with k-nearest neighbours (N = 30 and k = 30 for the multiome scRNA-seq data; N = 20 and k = 30 for the Smart-seq2 dataset; N = 30 and k = 20 for the human gut dendritic cells). Cell clustering was visualized using UMAP60, computed from the nearest neighbour graph built by PhenoGraph.
The multiome scATAC-seq data analysis was restricted to the cells in clusters 1–16 of the scRNA-seq results, as previously described for pre-processing. Latent Semantic Indexing (LSI) was performed on 100,000 top variable tiles (500 bp genomic bins) identified after ten iterations of ‘IterativeLSI’ by ArchR. Tiles from non-standard chromosomes, chrM, and chrY were not included in this analysis. Cells were clustered (method=Seurat, k.param = 30, resolution = 1.2) and visualized with UMAP (nNeighbors = 30) using 30 LSI components. In both the scRNA-seq and scATAC-seq data, we identified several clusters of LTi cells (scRNA clusters 9–16 and scATAC clusters 7–13). These clusters showed weak pairwise matchings between scRNA and scATAC; therefore, they were combined as one group of LTi cells for downstream analyses.
Differential gene expression tests
DEGs between groups of cells were identified with MAST61, performed using Seurat functions. MAST was run on the log-normalized expression values. In all tests, genes were only considered if they were detected in at least 10% of the cells in at least one of the two groups compared (min.pct = 0.1, logfc.threshold = 0). In one-vs-rest differential expression tests comparing multiple groups, each group was compared to all the cells from other groups. Specific differential expression comparisons are described in the results. DEGs were reported according to their fold change (>1.5) and adjusted P value (<0.01). Ribosomal and mitochondrial genes were removed from the final list of genes reported or visualized. Where stated, the top DEG markers were subsequently selected for each group, based on fold change.
Data imputation for scRNA-seq data
MAGIC imputation62 was applied to the log-normalized expression values for the multiome scRNA-seq dataset to further de-noise and recover missing values. Imputed gene expression values were only used for data visualization on UMAP overlays and heatmaps, where stated.
Cell cycle scores
Using standard Seurat functions, we computed cell cycle scores for known S-phase and G2/M-phase marker genes63 to identify proliferating cells.
Topic modelling for scRNA-seq data
Topics were identified by fitting a latent Dirichlet allocation model, also known as a Grade of Membership (GoM) model64, to the raw gene expression count matrix for Thetis cells (clusters 1–5 of the multiome scRNA-seq data) using CountClust v1.18.065. Genes that were detected in fewer than 10 Thetis cells were not included. The optimal number of topics (K = 8) was selected among values ranging from 3 to 15 with the maximum Bayes factor (BF). The role of a topic in each cell is measured by the degree to which it represents that topic, and the topic weights sum up to 1 in each cell. The importance of a gene for each topic is measured by how distinctively differentially expressed it is in that topic, by measuring the KL-divergence of its relative gene expression to other topics, assuming a Poisson distribution. One topic, defined by ribosomal and mitochondrial genes and shared across all clusters, was removed from the topic model visualizations.
Dynamical modelling of RNA velocity for the multiome scRNA-seq data
The unspliced and spliced mRNAs for the scRNA-seq profiles of the multiome data were counted by Velocyto v0.17.1732 from the position-sorted BAM file containing GEX read alignments, outputted by Cell Ranger ARC in pre-processing. As annotation files for Velocyto, we used the same mm10 gene annotations used in pre-processing, in addition to the mm10 expressed repeat annotation from the RepeatMasker track of UCSC genome browser. Next, we used the Velocyto results to learn a generalized dynamical model of RNA velocities by scVelo v0.2.466. Count matrices were filtered, normalized, and log-transformed (min_shared_counts = 10, n_top_genes = 3000), cell cycle effect was corrected by regressing out S-phase and G2/M-phase scores, using Scanpy 1.6.067. After performing PCA on the corrected data (n_pcs = 30), first- and second-order moments were computed for each cell across its nearest neighbours in the PCA space (n_neighbors = 30). Finally, the full splicing kinetics were recovered and solved for each gene by scVelo’s dynamical model.
Integrating the Smart-seq2 dataset with the multiome dataset
RORγt+MHCII+ transcriptomes (based on cell-type as sorted) from the SMART-seq2 dataset were integrated with transcriptomes from the 10X multiome scRNA-seq data, using Seurat68. Based on the variability of genes in both datasets, 5,000 top scoring genes were selected by Seurat functions to identify ‘integration anchors’ with canonical correlation analysis (CCA). Expression values for these genes were integrated, scaled, and used for PCA. A UMAP embedding was computed from the first N = 30 principal components (k = 30). Additionally, using Seurat functions, the RORγt+MHCII+ cells from the SMART-seq2 dataset (query) were mapped to multiome scRNA-seq clusters (reference) by projecting the PCA from the reference onto the query to identify ‘transfer anchors’, and then assigning a prediction score for each reference cluster to query cells. The cluster identity with the highest score was chosen as the predicted label for each cell.
Single-cell enrichment scores for gene sets
Given a set of genes, we standardized the log-normalized expression values of each gene across cells and then averaged these values for all genes in the set, assigning an enrichment score to each cell. Where stated, these scoreswere standardized across cells and reported as z-scores.
Creating pseudo-bulk samples from scRNA-seq data
Pseudo-bulk samples were created by averaging the unimputed log-normalized gene expression values for each cluster. In cases where scaled values were used for downstream analyses, these average expression values were standardized across the pseudo-bulk samples.
Similarity of multiome scRNA-seq clusters to bulk microarray ImmGen samples
The RMA-normalized and log2-transformed gene expression data of 224 bulk microarray samples from a publicly availableImmGen dataset was downloaded from https://www.haemosphere.org69. For each gene, the probeset with the highest mean expression was retained. We included all cell types isolated from naive, untreated mice. Pseudo-bulk samples were generated from the multiome scRNA-seq data for each Thetis cell subset, and non-proliferating MHCII+ ILC3s (NCR+ ILC3 and LTi cells). The gene expression vectors were scaled across bulk and pseudo-bulk samples within each dataset, and their pairwise cosine similarities were used to compare the samples. These similarity scores were computed from the expression of 2,399 DEGs (FC > 1.3, adjusted P < 0.01) comparing the scRNA-seq clusters in a one-vs-rest test, that were also expressed in the microarray data. The proliferating and progenitor clusters were excluded from the differential expression test, and the LTi clusters were grouped together. For visualization of results, only cell lineages containing a cell type with > 0.25 cosine similarity with either Thetis cell or ILC3 clusters were plotted.
Cell-type prediction
To determine similarity between Thetis cells and known cell types we used CellTypist (https://www.celltypist.org), with both low- and high-resolution models of immune cells to classify cells with coarse and fine granularities, respectively. Top predicted labels for each input cell were visualized.
Similarity of Thetis cells to thymic epithelial cells
scRNA-seq profiles of CD45– thymic epithelial cells were downloaded from a publicly available dataset (GSE103967)21. The raw counts were library-size-normalized, log-transformed, and used to create pseudo-bulk samples for each thymic epithelial cluster. Pseudo-bulk samples were also generated to represent the multiome scRNA-seq Thetis cell clusters (2–5). These pseudo-bulk gene expression vectors were scaled across samples within each dataset, and their pairwise cosine similarities were used to compare clusters from the two datasets. These similarity scores were computed from the expression of 1,740 DEGs (FC > 1.3, adjusted P < 0.01) identified in a one-vs-rest differential expression test for non-proliferating Thetis cell clusters (2–5), that were also expressed in the thymic epithelial cells. Among individual clusters of thymic epithelial cells defined in the original dataset, we identified 2 clusters of transit amplifying AIRE+ cells (clusters 25 and 26), distinguished by signature gene expression including cell cycle genes.
scRNA-seq dataset of human gut dendritic cells
Dendritic cells (annotated as cDC1, cDC2 or lymphoid DC) within the myeloid dataset from the human gut atlas41 were re-clustered. From the gene markers for each Thetis cell subset (one-vs-rest differential expression test for non-proliferating Thetis cell scRNA-seq clusters, FC > 1.5, adjusted P < 0.01), we identified orthologous human genes that were uniquely mapped by gprofiler2 and computed enrichment scores for Thetis cell subset gene signatures for each human cell.
Peak calling for the multiome scATAC-seq data
For peak calling of the scATAC-seq data, clusters for similar cell types were grouped: C1 (TC IV), C2–4 (TC I–TC III), C5–6 (NCR+ ILC3), and C7–13 (LTi). Filtered scATAC-seq fragments for each group were extracted from ArchR arrow files. We performed MACS2 v2.2.7.1 on fragments of each group with ‘--gsize mm --qval 0.01 --nomodel --ext 200 --shift −100 --call-summits’. The peak summits were extended by 100 bp in each direction. Regions extending outside of mm10 chromosomes, arising from chrY or chrM, overlapping with blacklist regions precompiled by ArchR (merged from the ENCODE mm10 v2 blacklist regions from https://github.com/Boyle-Lab/Blacklist/blob/master/lists/mm10-blacklist.v2.bed.gz and mitochondrial regions that are highly mappable to the mm10 nuclear genome from https://github.com/caleblareau/mitoblacklist/blob/master/peaks/mm10_peaks.narrowPeak), or containing ‘N’ nucleotides (>0.001 of the sequence) were filtered. Regions from all groups were compiled and overlapping regions were merged to their union, resulting in a non-overlapping set of 176,942 peaks. A peak-by-cell count matrix was created by ArchR with a ‘ceiling’ value of 4 for the counts to avoid strong biases.
Transcription factor motif enrichment with chromVAR
The peaks that were accessible in <10 cells were filtered from the peak insertion counts, created as described in the previous section, and the resulting 176898 x 10145 peak-by-cell count matrix was used for motif enrichment with chromVAR v1.14.070. Mouse motif PWMs were downloaded from the CIS-BP database71 (‘Mus_musculus_2022_01_14_6-40_pm’), and the missing PWMs were extracted from ‘mouse_pwms_v1’ in chromVARmotifs v0.2.0. The GC content of the peaks was computed with chromVAR, and motifs were matched to them by motifmatchr v1.14.0. Then, chromVAR ‘deviations’ of the motifs were computed for the peak-by-cell count matrix. The ‘top motif’ for each transcription factor was selected by correlating its log-normalized gene expression values (from multiome scRNA-seq) with the deviation z-scores of its motifs, in the same cells, and picking the motif with the highest Pearson correlation coefficient. Finally, transcription factor–motif pairs with a correlation higher than 0.1 were selected. This resulted in 56 top transcription factors, out of 739 CIS-BP transcription factors that were expressed (that is, had any transcripts detected) in the multiome scRNA-seq profiles. The same process was repeated for the 139,528 × 1,552 peak-by-cell count matrix of Thetis cells (multiome scATAC-seq clusters 1–4) and the peaks accessible in at least 10 Thetis cells. Out of 652 CIS-BP transcription factors that were expressed in Thetis cells, 68 had a transcription factor–motif correlation higher than 0.1 and were selected as top transcription factors for Thetis cells.
Neonatal 4-OH tamoxifen administration
For labelling of RORγt+ cells, RorcVenus-creERT2AireGFP mice were injected intraperitoneally on P1 with 25 μg 4-OH-tamoxifen (4-OHT) and analysed on P8, P15 and P21. For RAG1 fate mapping, Rag1RFP-creERT2R26lsl-YFP mice were injected with 25 μg 4-OHT intraperitoneally on P3, P5 and P7 and analysed on P15.
Electron microscopy
RORγt+MHCII+ cells were enriched from a pool of mLN from P18 RorcVenus-creERT2 mice (for TC IV or reference CCR7– and CCR7+ dendritic cells) and P18 AireGFP mice (for TC I). Cells were depleted of Lin+ (TCRb, TCRgd, CD19, B220, NK1.1)+ cells via staining with biotinylated antibodies followed by magnetic bead negative selection. mTECs were enriched from a pool of thymi from P18 AireGFP mice as described above. Live, Lin(TCRb, TCRgd, CD19, B220, NK1.1)–CD64–Ly6C–MHCII+RORγt(Venus)+CD11c+CD11b+cells (TC IV), Lin–RORγt(Venus)–CD11cloMHCIIhi (CCR7+ dendritic cell), Lin–RORγt(Venus)–CD11chiMHCII– (CCR7– dendritic cell), Lin–CXCR6–CD11c–/loAIRE(GFP)hi (TC I), Lin–RORγt(Venus)+CD90+MHCII+ (LTi/ILC3), or CD45–Epcam+MHCII+AIRE(GFP)+ cells were then sorted directly into 2% glutaraldehyde, 4% PFA, and 2 mM CaCl2 in 0.1 M sodium cacodylate buffer (pH 7.2), fixed for > 1 h at room temperature, postfixed in 1% osmium tetroxide, dehydrated in acetone, and processed for Epon embedding. Ultrathin sections (60–65 nm) were counterstained with uranyl acetate and lead citrate. Images were taken with a transmission electron microscope (Tecnai G2–12; FEI) equipped with a digital camera (AMT BioSprint29).
Tissue preparation for immunofluorescence, microscopy and image analysis
Mesenteric lymph nodes (mLN) were dissected from 2- to 3-week-old RorcVenus-creERT2 or AireGFP mice and trimmed of fat using a dissection scope and forceps. mLNs were fixed in 2% paraformaldehyde for 4 h in 4 °C, washed 3 times in PBS, and dehydrated in 30% sucrose in 0.1 M phosphate buffer overnight (16–20 h). mLNs were embedded in optimal cutting temperature (OCT) compound, frozen on dry ice, and stored at −80 °C. 15–20m sagittal sections were placed on Superfrost Plus microscopy slides and stored at −20C until staining. mLN sections were permeabilized using 0.2% Triton X-100 for 15 min at room temperature, washed 3 times with PBS, blocked in 5% rabbit and donkey serum for 1 h at room temperature, and washed 3 times with PBS. Next, the sections were incubated with combinations of the following primary antibodies in PBS overnight at 4 °C: CD11c BV421 (Biolegend, Clone N418, 1:50), CD11b BV480 (BD Biosciences, Clone M1/70 1:50) CD4 AF647 (Biolegend, Clone RM4-5, 1:50), FOXP3 eFluor 570 (ThermoFisher, Clone FJK-16s, 1:50), GFP AF488 (ThermoFisher A12311, polyclonal, 1:100), RORγt APC (ThermoFisher, Clone AFKJS-9, 1:50), and MHCII AF700 (Biolegend, Clone M5/114.15.2, 1:200) antibodies. The samples were washed three times in PBSthe next day and mounted in SlowFade Diamond antifade reagent (ThermoFisher). No. 1.5 coverglass was used to seal the slide and all subsequent imaging was done on Leica SP8 microscope. Analysis was performed by histo-cytometry methods using Imaris software. Image segmentation was performed in Imaris using the ‘Surface object creation’ module, which uses a seeded region growing, k-means, and watershed algorithm to define individual cells.
In vitro cell culture
Naive CD4+vβ10+CD25–CD44loCD62Lhi C7 T cells were sort purified after enrichment with a CD4+ T cell negative selection kit (Miltenyi Biotec). Dendritic and Thetis cell subsets were sort purified from RorcVenus-creERT2 mice using gating strategy above with inclusion of CCR6 to distinguish TC I and TC II from TC III and TC IV. cDC2s were distinguished from their CCR7+ counterparts by CD11chiMHCIIint expression. T cells were co-cultured in triplicate at a ratio of 300 dendritic or Thetis cells to 1 × 103 T cells in the presence of ESAT peptide (1 µg ml−1; Invivogen), with the addition of 0.5 ng ml−1 TGFβ1 (Peprotech), 100 IU ml−1 of IL-2 (NCI). FOXP3 expression was assessed after 4 days of culture.
Bone marrow chimeric mice
Bone marrow cells were isolated from indicated donor mice, depleted of CD90.2+ and TER-119+ cells using magnetic bead based depletion. Bone marrow cells were resuspended in PBS and 2–3 × 106 cells were injected into 6-week-old CD45.1 mice, irradiated with 950 rad per mouse 1 day earlier. After 6 weeks, mLN and LI LP were collected for analysis.
Statistical analysis
Analysis of all data was done with unpaired two-tailed t-test, one or two-way ANOVA with a 95% confidence interval, or Mann–Whitney U test, as specified in the text or figure legends. P < 0.05 was considered significant: *P < 0.05, **P < 0.01, ***P < 0.001, ****P < 0.0001. Details of number of replicates, sample size, significance tests and value and meaning of n for each experiment are included in the Methods or figure legends. Statistical tests were performed with Prism (GraphPad Software). scATAC and scRNA-seq experiments were carried out once. Mice were non-randomly allocated to experimental groups to ensure equal distribution of genotypes between treatments. Researchers were not blinded to genotype or treatment during the experiments. No measures were taken to estimate sample size of to determine whether the data met the assumptions of the statistical approaches used. Significance (α) was defined as <0.05 throughout, after correcting for multiple comparisons.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Online content
Any methods, additional references, Nature Research reporting summaries, source data, extended data, supplementary information, acknowledgements, peer review information; details of author contributions and competing interests; and statements of data and code availability are available at 10.1038/s41586-022-05309-5.
Supplementary information
Acknowledgements
We thank J. van der Veeken, C. Campbell and members of the Rudensky laboratory for their technical support and helpful discussions; the Single Cell Core Facility at MSK for sample processing; J. Gardner for discussions, M. Anderson for the provision of AireGFP mice; C. Reis e Sousa for Clec9acre mice; D. Sheppard for Itgb8fl/fl mice; L. Denzin for H2-Dma–/– mice; M. Glickman for C7 mice; and S. Teichmann for assistance with human single-cell datasets. We acknowledge the use of the Integrated Genomics Operation Core, funded by the NCI Cancer Center Support Grant (CCSG, P30 CA08748), Cycle for Survival, and the Marie-Josée and Henry R. Kravis Center for Molecular Oncology. Elements of cartoons in Figs. 2a and 5j and Extended Data Fig. 4c were created with BioRender.com. This work was supported by the Parker Institute for Cancer Immunotherapy (C.C.B. and A.Y.R.), Ludwig Center at Memorial Sloan Kettering, NCI Cancer Center Support Grant P30 CA08748, NCI grant U54 CA209975 (A.Y.R. and C.L.), NIAID grant R01AI034206 (A.Y.R.) and the Hilton–Ludwig Cancer Prevention Initiative (Conrad N. Hilton Foundation and Ludwig Cancer Research) (A.Y.R.). D.D. was supported by a MSTP grant from NIGMS of the NIH under award number T32GM007739 to the Tri-Institutional MD-PhD Program and by a NIAID F30 Predoctoral Fellowship (F30AI154660-01). The laboratory of T.G.P.G. is supported by the Barbara and Wilfried Mohr foundation. V.T. is supported by the French Ministry of Research. A.Y.R. is an HHMI investigator. C.C.B. was supported by a Wellcome Trust Fellowship (WT201483/Z/16/Z), a Josie Robertson Investigator Award and a Parker Institute for Cancer Immunotherapy Senior Fellowship.
Extended data figures and tables
Source data
Author contributions
C.C.B. and A.Y.R designed experiments and wrote the manuscript. Z.T. designed and performed computational analyses. B.A., G.S., Y.A.P.I., Y.F.P., L.F., V.T. and C.C.B. performed experiments and analysed data. D.D. performed immunofluorescence staining and imaging analyses. S.H., C.F. and Z.-M.W. performed experiments. R.E. performed initial analysis of human intestinal single-cell data. H.A.P. performed electron microscopy analyses. J.V. processed tissues. M.K. performed immunofluorescence staining and imaging analyses for human tissue. L.J. generated mice under the supervision of M.v.d.B. G.G. provided mice, J.C.M. provided mice and supervised experiments, T.G.P.G. supervised human immunofluorescence experiments, C.L. supervised computational analyses and C.C.B. supervised experiments. All authors read and approved the manuscript.
Peer review
Peer review information
Nature thanks Sarah Teichmann and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.
Data availability
The mouse sequencing data are available through the Gene Expression Omnibus under accession number GSE174405. This manuscript makes use of publicly available data from the Human Cell Atlas, available at https://www.gutcellatlas.org. Source data are provided with this paper.
Competing interests
M.v.d.B. has received research support and stock options from Seres Therapeutics and stock options from Notch Therapeutics and Pluto Therapeutics; he has received royalties from Wolters Kluwer; has consulted, received honorarium from or participated in advisory boards for Seres Therapeutics, WindMIL Therapeutics, Rheos Medicines, Merck & Co., Magenta Therapeutics, Frazier Healthcare Partners, Nektar Therapeutics, Notch Therapeutics, Forty Seven, Priothera, Ceramedix, Lygenesis, Pluto Therapeutics, GlaskoSmithKline, Da Volterra, Vor BioPharma, Novartis (spouse), Synthekine (spouse) and Beigene (spouse); he has intellectual property licensed to Seres Therapeutics and Juno Therapeutics; and holds a fiduciary role on the Foundation Board of DKMS (a nonprofit organization). A.Y.R. is a member of the scientific advisory board and has equity in Surface Oncology and RAPT Therapeutics, and holds intellectual property licensed to Takeda, which is not related to the content of this study.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Blossom Akagbosu, Zakieh Tayyebi
Contributor Information
Alexander Y. Rudensky, Email: rudenska@mskcc.org
Chrysothemis C. Brown, Email: brownc10@mskcc.org
Extended data
is available for this paper at 10.1038/s41586-022-05309-5.
Supplementary information
The online version contains supplementary material available at 10.1038/s41586-022-05309-5.
References
- 1.Anderson MS, et al. Projection of an immunological self shadow within the thymus by the AIRE protein. Science. 2002;298:1395–1401. doi: 10.1126/science.1075958. [DOI] [PubMed] [Google Scholar]
- 2.Yang S, Fujikado N, Kolodin D, Benoist C, Mathis D. Regulatory T cells generated early in life play a distinct role in maintaining self-tolerance. Science. 2015;348:589–594. doi: 10.1126/science.aaa7017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Perry JSA, et al. Distinct contributions of Aire and antigen-presenting-cell subsets to the generation of self-tolerance in the thymus. Immunity. 2014;41:414–426. doi: 10.1016/j.immuni.2014.08.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Malchow S, et al. Aire enforces immune tolerance by directing autoreactive T cells into the regulatory T cell lineage. Immunity. 2016;44:1102–1113. doi: 10.1016/j.immuni.2016.02.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Ohnmacht C, et al. The microbiota regulates type 2 immunity through RORγt+ T cells. Science. 2015;349:989–993. doi: 10.1126/science.aac4263. [DOI] [PubMed] [Google Scholar]
- 6.Sefik E, et al. Individual intestinal symbionts induce a distinct population of RORγt+ regulatory T cells. Science. 2015;349:993–997. doi: 10.1126/science.aaa9420. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Al Nabhani Z, et al. A weaning reaction to microbiota is required for resistance to immunopathologies in the adult. Immunity. 2019;50:1276–1288. doi: 10.1016/j.immuni.2019.02.014. [DOI] [PubMed] [Google Scholar]
- 8.Ramanan D, et al. An immunologic mode of multigenerational transmission governs a gut Treg setpoint. Cell. 2020;181:1276–1290. doi: 10.1016/j.cell.2020.04.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Olszak T, et al. Microbial exposure during early life has persistent effects on natural killer T cell function. Science. 2012;336:489–493. doi: 10.1126/science.1219328. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Kim KS, et al. Dietary antigens limit mucosal immunity by inducing regulatory T cells in the small intestine. Science. 2016;351:858–863. doi: 10.1126/science.aac5560. [DOI] [PubMed] [Google Scholar]
- 11.Xu M, et al. c-MAF-dependent regulatory T cells mediate immunological tolerance to a gut pathobiont. Nature. 2018;554:373–377. doi: 10.1038/nature25500. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Hepworth MR, et al. Innate lymphoid cells regulate CD4+ T-cell responses to intestinal commensal bacteria. Nature. 2013;498:113–117. doi: 10.1038/nature12240. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Hepworth MR, et al. Group 3 innate lymphoid cells mediate intestinal selection of commensal bacteria-specific CD4+ T cells. Science. 2015;348:1031–1035. doi: 10.1126/science.aaa4812. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Papaioannou NE, et al. Environmental signals rather than layered ontogeny imprint the function of type 2 conventional dendritic cells in young and adult mice. Nat. Commun. 2021;12:464. doi: 10.1038/s41467-020-20659-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Brown CC, et al. Transcriptional basis of mouse and human dendritic cell heterogeneity. Cell. 2019;179:846–863. doi: 10.1016/j.cell.2019.09.035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Yamano T, et al. Aire-expressing ILC3-like cells in the lymph node display potent APC features. J. Exp. Med. 2019;216:1027–1037. doi: 10.1084/jem.20181430. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Wang J, et al. Single-cell multiomics defines tolerogenic extrathymic Aire-expressing populations with unique homology to thymic epithelium. Sci. Immunol. 2021;6:5053. doi: 10.1126/sciimmunol.abl5053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Suo C, et al. Mapping the developing human immune system across organs. Science. 2022;376:6597. doi: 10.1126/science.abo0510. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Yang A, et al. p63 is essential for regenerative proliferation in limb, craniofacial and epithelial development. Nature. 1999;398:714–718. doi: 10.1038/19539. [DOI] [PubMed] [Google Scholar]
- 20.Pellegrini G, et al. p63 identifies keratinocyte stem cells. Proc. Natl Acad. Sci. USA. 2001;98:3156–3161. doi: 10.1073/pnas.061032098. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Bornstein C, et al. Single-cell mapping of the thymic stroma identifies IL-25-producing tuft epithelial cells. Nature. 2018;599:622–626. doi: 10.1038/s41586-018-0346-1. [DOI] [PubMed] [Google Scholar]
- 22.Park JE, et al. A cell atlas of human thymic development defines T cell repertoire formation. Science. 2020;367:6480. doi: 10.1126/science.aay3224. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Fergusson JR, et al. Maturing human CD127+CCR7+PDL1+ dendritic cells express AIRE in the absence of tissue restricted antigens. Front. Immunol. 2019;9:2902. doi: 10.3389/fimmu.2018.02902. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Nirschl C, et al. IFNγ-dependent tissue-immune homeostasis Is co-opted in the tumor microenvironment. Cell. 2017;170:127–141. doi: 10.1016/j.cell.2017.06.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Maier B, et al. A conserved dendritic-cell regulatory program limits anti-tumour immunity. Nature. 2020;580:257–262. doi: 10.1038/s41586-020-2134-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Schraml BU, et al. Genetic tracing via DNGR-1 expression history defines dendritic cells as a hematopoietic lineage. Cell. 2013;154:843–858. doi: 10.1016/j.cell.2013.07.014. [DOI] [PubMed] [Google Scholar]
- 27.Salvermoser J, et al. Clec9a-mediated ablation of conventional dendritic cells suggests a lymphoid path to generating dendritic cells in vivo. Front. Immunol. 2018;9:699. doi: 10.3389/fimmu.2018.00699. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Dobeš J, et al. Extrathymic expression of Aire controls the induction of effective TH17 cell-mediated immune response to Candida albicans. Nat. Immunol. 2022;23:1098–1108. doi: 10.1038/s41590-022-01247-6. [DOI] [PubMed] [Google Scholar]
- 29.Fiancette R, et al. Reciprocal transcription factor networks govern tissue-resident ILC3 subset function and identity. Nat. Immunol. 2021;22:1245–1255. doi: 10.1038/s41590-021-01024-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Stehle C, et al. T-bet and RORα control lymph node formation by regulating embryonic innate lymphoid cell differentiation. Nat. Immunol. 2021;22:1231–1244. doi: 10.1038/s41590-021-01029-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Harly C, et al. The transcription factor TCF-1 enforces commitment to the innate lymphoid cell lineage. Nat. Immunol. 2019;20:1150–1160. doi: 10.1038/s41590-019-0445-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.La Manno G, et al. RNA velocity of single cells. Nature. 2018;560:494–498. doi: 10.1038/s41586-018-0414-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Zhou W, et al. ZBTB46 defines and regulates ILC3s that protect the intestine. Nature. 2022;609:159–165. doi: 10.1038/s41586-022-04934-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Herzig Y, et al. Transcriptional programs that control expression of the autoimmune regulator gene Aire. Nat. Immunol. 2016;18:161–172. doi: 10.1038/ni.3638. [DOI] [PubMed] [Google Scholar]
- 35.Michelson DA, Hase K, Kaisho T, Benoist C, Mathis D. Thymic epithelial cells co-opt lineage-defining transcription factors to eliminate autoreactive T cells. Cell. 2022;185:2542–2558. doi: 10.1016/j.cell.2022.05.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Kedmi, R. et al. A RORγt+ cell instructs gut microbiota-specific Treg cell differentiation. Nature10.1038/s41586-022-05089-y (2022). [DOI] [PMC free article] [PubMed]
- 37.Lyu, M. et al. ILC3s select microbiota-specific regulatory T cells to establish tolerance in the gut. Nature10.1038/s41586-022-05141-x (2022). [DOI] [PMC free article] [PubMed]
- 38.Barton GM, et al. Positive selection of self-MHC-reactive T cells by individual peptide-MHC class II complexes. Proc. Natl Acad. Sci. USA. 2002;99:6937–6942. doi: 10.1073/pnas.102645699. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Loschko J, et al. Absence of MHC class II on cDCs results in microbial-dependent intestinal inflammation. J. Exp. Med. 2016;213:517–534. doi: 10.1084/jem.20160062. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Gasteiger G, et al. Tissue residency of innate lymphoid cells in lymphoid and nonlymphoid organs. Science. 2015;350:981–985. doi: 10.1126/science.aac9593. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Elmentaite R, et al. Cells of the human intestinal tract mapped across space and time. Nature. 2021;597:250–255. doi: 10.1038/s41586-021-03852-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Travis MA, et al. Loss of integrin αvβ8 on dendritic cells causes autoimmunity and colitis in mice. Nature. 2007;449:361–365. doi: 10.1038/nature06110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Lacy-Hulbert A, et al. Ulcerative colitis and autoimmunity induced by loss of myeloid αv integrins. Proc. Natl Acad. Sci. USA. 2007;104:15823–15828. doi: 10.1073/pnas.0707421104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Billingham RE, Brent L, Medawar PB. ‘Actively acquired tolerance’ of foreign cells. Nature. 1953;172:603–606. doi: 10.1038/172603a0. [DOI] [PubMed] [Google Scholar]
- 45.Billingham RE, Brent L, Medawar PB. Quantitative studies on tissue transplantation immunity. III. Actively acquired tolerance. Philos. Trans. R. Soc. Lond. B. 1956;239:357–414. [Google Scholar]
- 46.Gardner JM, et al. Deletional tolerance mediated by extrathymic aire-expressing cells. Science. 2008;321:843–847. doi: 10.1126/science.1159407. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Zembrzycki A, et al. Sensory cortex limits cortical maps and drives top-down plasticity in thalamocortical circuits. Nat. Neurosci. 2013;16:1060–1067. doi: 10.1038/nn.3454. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Lee PP, et al. A critical role for Dnmt1 and DNA methylation in T cell development, function, and survival. Immunity. 2001;15:763–774. doi: 10.1016/s1074-7613(01)00227-8. [DOI] [PubMed] [Google Scholar]
- 49.Proctor JM, et al. Vascular development of the brain requires β8 integrin expression in the neuroepithelium. J. Neurosci. 2005;25:9940–9948. doi: 10.1523/JNEUROSCI.3467-05.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Martin WD, et al. H2-M mutant mice are defective in the peptide loading of class II molecules, antigen presentation, and T cell repertoire selection. Cell. 1996;84:543–550. doi: 10.1016/s0092-8674(00)81030-2. [DOI] [PubMed] [Google Scholar]
- 51.Gallegos AM, Pamer EG, Glickman MS. Delayed protection by ESAT-6-specific effector CD4+ T cells after airborne M. tuberculosis infection. J. Exp. Med. 2008;205:2359–2368. doi: 10.1084/jem.20080353. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Nakawesi J, et al. αvβ8 integrin-expression by BATF3-dependent dendritic cells facilitates early IgA responses to Rotavirus. Mucosal Immunol. 2020;14:53–67. doi: 10.1038/s41385-020-0276-8. [DOI] [PubMed] [Google Scholar]
- 53.Erben U, et al. A guide to histomorphological evaluation of intestinal inflammation in mouse models. Int. J. Clin. Exp. Pathol. 2014;7:4557–4576. [PMC free article] [PubMed] [Google Scholar]
- 54.Granja JM, et al. ArchR is a scalable software package for integrative single-cell chromatin accessibility analysis. Nat. Genet. 2021;53:403–411. doi: 10.1038/s41588-021-00790-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Dobin A, et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29:15–21. doi: 10.1093/bioinformatics/bts635. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Frankish A, et al. GENCODE reference annotation for the human and mouse genomes. Nucleic Acids Res. 2019;47:D766–D773. doi: 10.1093/nar/gky955. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Danecek P, et al. Twelve years of SAMtools and BCFtools. Gigascience. 2021;10:giab008. doi: 10.1093/gigascience/giab008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Liao Y, Smyth GK, Shi W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics. 2014;30:923–930. doi: 10.1093/bioinformatics/btt656. [DOI] [PubMed] [Google Scholar]
- 59.Levine JH, et al. Data-driven phenotypic dissection of AML reveals progenitor-like cells that correlate with prognosis. Cell. 2015;162:184–197. doi: 10.1016/j.cell.2015.05.047. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.McInnes, L., Healy, J. & Melville, J. UMAP: uniform manifold approximation and projection for dimension reduction. Preprint at https://arxiv.org/abs/1802.03426 (2018).
- 61.Finak G, et al. MAST: a flexible statistical framework for assessing transcriptional changes and characterizing heterogeneity in single-cell RNA sequencing data. Genome Biol. 2015;16:278. doi: 10.1186/s13059-015-0844-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.van Dijk D, et al. Recovering gene interactions from single-cell data using data diffusion. Cell. 2018;174:716–729. doi: 10.1016/j.cell.2018.05.061. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Tirosh I, et al. Dissecting the multicellular ecosystem of metastatic melanoma by single-cell RNA-seq. Science. 2016;352:189–196. doi: 10.1126/science.aad0501. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Taddy MA. On estimation and selection for topic models. Proc. Mach. Learn. Res. 2012;22:1184–1193. [Google Scholar]
- 65.Dey KK, et al. Visualizing the structure of RNA-seq expression data using grade of membership models. PLoS Genet. 2017;13:e1006599. doi: 10.1371/journal.pgen.1006599. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Bergen V, Lange M, Peidli S, Wolf FA, Theis FJ. Generalizing RNA velocity to transient cell states through dynamical modeling. Nat. Biotechnol. 2020;38:1408–1414. doi: 10.1038/s41587-020-0591-3. [DOI] [PubMed] [Google Scholar]
- 67.Wolf FA, Angerer P, Theis FJ. SCANPY: large-scale single-cell gene expression data analysis. Genome Biol. 2018;19:1–5. doi: 10.1186/s13059-017-1382-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Stuart T, et al. Comprehensive integration of single-cell data. Cell. 2019;177:1888–1902. doi: 10.1016/j.cell.2019.05.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Choi J, et al. Haemopedia RNA-seq: a database of gene expression during haematopoiesis in mice and humans. Nucleic Acids Res. 2019;47:D780–D785. doi: 10.1093/nar/gky1020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Schep AN, et al. ChromVAR: Inferring transcription-factor-associated accessibility from single-cell epigenomic data. Nat. Methods. 2017;14:975–978. doi: 10.1038/nmeth.4401. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Weirauch MT, et al. Determination and inference of eukaryotic transcription factor sequence specificity. Cell. 2014;158:1431–1443. doi: 10.1016/j.cell.2014.08.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The mouse sequencing data are available through the Gene Expression Omnibus under accession number GSE174405. This manuscript makes use of publicly available data from the Human Cell Atlas, available at https://www.gutcellatlas.org. Source data are provided with this paper.