Abstract
Eosinophilic esophagitis (EoE) is an esophageal immune-mediated disease characterized by eosinophilic inflammation and epithelial remodeling, including basal cell hyperplasia (BCH). Although BCH is known to correlate with disease severity and with persistent symptoms in patients in histological remission, the molecular processes driving BCH remain poorly defined. Here, we demonstrate that BCH is predominantly characterized by an expansion of nonproliferative suprabasal cells that are still committed to early differentiation. Furthermore, we discovered that suprabasal and superficial esophageal epithelial cells retain progenitor identity programs in EoE, evidenced by increased quiescent cell identity scoring and the enrichment of signaling pathways regulating stem cell pluripotency. Enrichment and trajectory analyses identified SOX2 and KLF5 as potential drivers of the increased quiescent identity and epithelial remodeling observed in EoE. Notably, these alterations were not observed in gastroesophageal reflux disease. These findings provide additional insights into the differentiation process in EoE and highlight the distinct characteristics of suprabasal and superficial esophageal epithelial cells in the disease.
Keywords: Gastroenterology, Inflammation, Allergy, Bioinformatics, Molecular biology
Introduction
Eosinophilic esophagitis (EoE) is an esophageal disease characterized by eosinophilia that results in dysphagia, edema, esophageal stricture, and food impaction due to a type 2 immune response triggered by food allergens. Current management of EoE includes proton pump inhibitors, topical corticosteroids, diet elimination, and dupilumab (1, 2). Despite advancements in EoE treatment, a considerable number of patients face symptom relapse or inadequate response to existing therapies (3, 4), resulting in unfavorable prognosis, diminished quality of life, and substantial healthcare expenses attributed to frequent procedures and lifelong treatment requirements (5). Hence, current efforts to improve therapeutic options focus on the alleviation of symptoms and prevention of complications.
In the esophagus, primary protection against food antigens passing through the lumen is provided by the stratified squamous epithelial barrier. After arising from the stem cells in the basal compartment, esophageal epithelial cells (EEC) migrate through the suprabasal compartment and initiate an early differentiation process, before reaching the superficial compartment where they complete terminal differentiation and eventually desquamate (Figure 1). Upon damage to the epithelial barrier, a rapid restoration of epithelial homeostasis is achieved through balanced self-renewal and differentiation of stem/progenitor cells. Dysregulated inflammation, aberrant tissue repair mechanisms, or failure to restore homeostasis will ultimately have pathological consequences (6).
Adverse alterations to the esophageal epithelium are a primary driver of EoE (7) and include intraepithelial eosinophilic inflammation, basal cell hyperplasia (BCH), dilatation of intercellular space, and dysregulated terminal differentiation (8, 9). Histologically, BCH is the most prominent epithelial change in EoE and is defined by pathologists as an expansion of EEC within the basal zone (10). Despite the predominant incidence of BCH in EoE, the changes in the molecular and cellular identity occurring in BCH are largely unexplored. Underscoring the importance of understanding the role of BCH in EoE pathogenesis, BCH is linked to disease severity in EoE and directly correlates with persistent symptoms (odds ratio, 2.14; 95% CI, 1.03–4.42; P = 0.041) and endoscopic findings (odds ratio, 7.10; 95% CI, 3.12–16.18; P < 0.001) in patients in histologic remission (10). While a recent study demonstrated that a ~15% increase in cycling epibasal (PDPN–) cells contributed to BCH in EoE (8), BCH pervades approximately 65% of the epithelial surface area in patients with EoE (11). This indicates that BCH is associated with additional distinct alterations in EEC characteristics that extend beyond hyperproliferation. Thus, a better molecular characterization of BCH is needed to improve the current understanding of symptom recurrence and persistent endoscopic findings in EoE. This will ultimately guide the development of novel therapeutic approaches for EoE, particularly for cases in which reducing eosinophilic inflammation is not sufficient to restore epithelial tissue integrity or to improve clinical symptoms.
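The odds ratios, confidence intervals, and P values quoted above can be checked against one another with a short calculation. The sketch below assumes the intervals are Wald intervals, symmetric on the log-odds scale. The source does not state this, and the function name is illustrative. Under that assumption, the standard error is recovered from the width of the interval and a two-sided P value follows from the normal distribution.

```python
import math

def wald_p_from_or_ci(odds_ratio, ci_low, ci_high, z_crit=1.959964):
    """Recover SE, z, and a two-sided P value from an odds ratio and its 95% CI.

    Assumption (not stated in the source): the CI is a Wald interval,
    i.e. log(OR) +/- z_crit * SE.
    """
    se = (math.log(ci_high) - math.log(ci_low)) / (2 * z_crit)
    z = math.log(odds_ratio) / se
    p = math.erfc(abs(z) / math.sqrt(2))  # two-sided standard normal tail
    return se, z, p

# Values quoted in the text for patients in histologic remission
symptoms = wald_p_from_or_ci(2.14, 1.03, 4.42)
endoscopy = wald_p_from_or_ci(7.10, 3.12, 16.18)

print(f"persistent symptoms: z = {symptoms[1]:.2f}, P = {symptoms[2]:.3f}")
print(f"endoscopic findings: z = {endoscopy[1]:.2f}, P = {endoscopy[2]:.2g}")
```

Under the Wald assumption, the recovered P values are close to those reported (about 0.04 for persistent symptoms and well below 0.001 for endoscopic findings), so the quoted statistics are internally consistent.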
To address this gap in knowledge and investigate more extensively the molecular changes occurring in BCH, we performed single-cell RNA-Seq (scRNA-Seq) of esophageal mucosal biopsies from treatment-naive adult patients with EoE and healthy controls (HC). Our findings reveal that BCH in EoE primarily involves the expansion of nonproliferative suprabasal EEC that are committed to early differentiation while retaining a progenitor cell identity. Through our analysis, we identified the transcription factors (TFs) and regulators of stem cell renewal, SOX2 and KLF5, as the prominent predicted regulators of differentially expressed genes (DEGs) in these atypical early differentiated EEC found in EoE. We further confirmed the increased expression of SOX2 and KLF5, along with their downstream targets, in the early differentiated EEC observed in EoE. Finally, these alterations were not detected in individuals with gastroesophageal reflux disease (GERD).
Results
Characterization of esophageal mucosal cell populations in adult EoE.
To characterize the single-cell transcriptomic landscape of the esophageal mucosa in EoE, we obtained proximal and distal biopsies from 6 adults with EoE along with 6 HC (Figure 2A). Histological processing was performed on additional adjacent biopsies (Figure 2A). Immunostaining was conducted on biopsies from 22 additional EoE subjects and 16 HC to validate scRNA-Seq findings. Patient characteristics and demographics are summarized in Table 1. Fresh tissue specimens were digested to generate single-cell suspensions and sequenced using the 10X Genomics platform. After quality control filtering, integration was performed using reciprocal principal component analysis (PCA) dimensional reduction. Uniform Manifold Approximation and Projection (UMAP) embeddings were calculated using the Seurat R package (12), followed by unsupervised graph-based clustering. Clusters were annotated based on established marker genes (Figure 2, B and C) and transcriptional signatures (Supplemental Figure 1A; supplemental material available online with this article; https://doi.org/10.1172/jci.insight.171765DS1).
Table 1. Patient demographics summary.
Within the integrated data set of 151,519 cells, we identified 8 major cell populations: epithelial cells (Epi) (n = 131,822), T cells and NK cells (T/NK) (n = 11,134), mononuclear phagocytes (MNP) (n = 5,211), mast cells (Mast) (n = 1,733), B cells (B) (n = 116), endothelial cells (Endo) (n = 1,239), fibroblasts (Fib) (n = 244), and smooth muscle cells (SM) (n = 20) (Figure 2B). Representative marker genes used for cell type annotation included KRT6A and DSG3 (Epi); CD3D and NKG7 (T/NK); CD68, CD207, and CD14 (MNP); KIT and CPA3 (Mast); CD79A and IGHA1 (B); VWF and CDH5 (Endo); DCN, COL1A1, and MYL9 (Fib); and MYL9, MYH11, and CNN1 (SM) (Figure 2C). We obtained 85,745 cells from HC and 65,774 from EoE (Supplemental Figure 1B). The distribution of major cell populations was largely similar between the EoE and HC groups (Figure 2D and Supplemental Figure 1C), with EEC being the predominant cell type (Figure 2E).
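As a quick arithmetic check, the per-population counts above can be tallied and converted to pooled fractions. This is a minimal sketch using only the numbers quoted in the text; the variable names are illustrative.

```python
# Cell counts per major population, as quoted in the text
counts = {
    "Epi": 131822, "T/NK": 11134, "MNP": 5211, "Mast": 1733,
    "B": 116, "Endo": 1239, "Fib": 244, "SM": 20,
}
# Cell counts per condition, as quoted in the text
by_condition = {"HC": 85745, "EoE": 65774}

total = sum(counts.values())
fractions = {name: n / total for name, n in counts.items()}

print(f"total cells: {total:,}")
print(f"cells by condition: {sum(by_condition.values()):,}")
for name, frac in sorted(fractions.items(), key=lambda kv: -kv[1]):
    print(f"{name:>5}: {frac:.2%}")
```

Both tallies reproduce the 151,519 cells of the integrated data set, and the pooled epithelial fraction comes out at about 87%. A percentage averaged per sample, or computed after additional filtering, can differ slightly from this pooled value.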
Defining EEC clusters in HC and EoE.
The prominent representation of EEC in our data set (86.83%) (Figure 2E), a central contributor to EoE pathogenesis (13, 14), enabled high-resolution characterization of their transcriptional changes in EoE. To ensure that UMAP embeddings were assigned based on epithelial subtypes under homeostatic conditions, EEC were reintegrated using anchors identified from HC samples, enabling the representation of HC and EoE EEC within each cluster (Supplemental Figure 2, A and B). Ten epithelial clusters were identified via unsupervised graph-based clustering (Supplemental Figure 2, A–C). To distinguish slow-cycling stem cells in the basal layer from faster-cycling epibasal cells, we performed subclustering of the quiescent (clusters 1 and 2) and dividing clusters (cluster 3, S-phase; clusters 4 and 5, G2/M phase) (Supplemental Figure 3A), as previously described (8). Subclustered cell populations were annotated based on the expression of KRT13 (15), DST (16), and cell cycle markers (Supplemental Figure 3, A and B). Our assignment of cell populations aligns with previous classifications using high/low PDPN expression (8) (Supplemental Figure 3C). The basal compartment clusters, Quiescent_1 (Q1), Quiescent_2 (Q2), Basal_Dividing (BD), and Epibasal (EB), were then reassigned within the total esophageal epithelial object (Figure 3A).
To annotate the resulting 9 epithelial clusters and classify them into esophageal epithelial compartments (basal [B], suprabasal [SB], or superficial [SF]), we examined the expression of established marker genes in the HC data set (Figure 3, A–C, and Supplemental Figure 3D). Within the basal compartment, the quiescent (Q) EEC clusters Q1 and Q2 demonstrated elevated expression of quiescence markers KRT15 and DST (7), while the proliferating clusters BD and EB displayed increased expression of the S-phase marker PCNA (17, 18) and the G2/mitosis marker MKI67 (19) (Supplemental Figure 3D). Consistent with the existing literature, basal cells exhibited expression of the TFs SOX2 (20) and TP63 (21–23) (Figure 1, Figure 3C, and Supplemental Figure 3D). Suprabasal clusters were identified based on the expression of KRT13 (KRT13hi) (24), IVL (25), and SERPINB3 (8) (Figure 1, Figure 3C, and Supplemental Figure 3D). The superficial markers CNFN (8, 26), FLG (27), and KRT78 (28) were used to characterize superficial cell clusters (Figure 1, Figure 3C, and Supplemental Figure 3D). Cluster annotation was confirmed using the transcriptional profiles of each HC cell cluster (Supplemental Figure 3E).
BCH is characterized by the expansion of nonproliferative suprabasal cells committed to early differentiation.
We next examined alterations in the relative representation of EEC compartments between EoE and HC. Surprisingly, we did not observe an expansion of the basal compartment in EoE (Figure 3D). However, analysis of EEC proportions at the cluster level revealed a decrease of the quiescent reserve EEC (Q1) and an increase of the fast-cycling epibasal cells in EoE (Figure 3E). The quantification of Ki-67 staining confirmed the increased proliferation in epibasal cells above the basal layer observed by scRNA-Seq (Figure 3, F and G). This shift in cell proportion confirms the previously reported hyperproliferation in EoE (29, 30). BCH scoring in adjacent esophageal mucosal biopsies from scRNA-Seq patients using EoE-HSS criteria (Figure 3H and Supplemental Figure 4, A–C) revealed an increase greater than 3-fold in the percentage of epithelium thickness occupied by BCH in EoE compared with HC (Figure 3H). These findings suggest that the morphological changes associated with BCH extend higher up in the esophageal epithelium, beyond the detected hyperproliferation, indicating that additional changes in EEC may occur during the development of BCH. To gain further insight into the cell identity of EEC labeled as basal in the histological evaluation of BCH, we examined the changes in cell proportions within different EEC clusters in EoE. Interestingly, we observed an expansion of EEC belonging to the suprabasal clusters, and all these clusters exhibited a nonproliferative phenotype (Figure 3, D–F, and Supplemental Figure 4, D and E). This finding suggests that the EEC identified as basal in the histological assessment of BCH may actually represent nonproliferative differentiated suprabasal cells with an abnormal morphology.
Increased basal identity marker expression is observed in suprabasal and superficial EEC in EoE.
To gain deeper insights into the alterations in cell identity associated with BCH, we investigated changes in transcriptional profiles within epithelial clusters in EoE. Our analysis initially focused on differentiation markers associated with basal, suprabasal, and superficial cell identities. We observed a substantial decrease in FLG expression within the superficial cluster SF2 as well as reduced expression of CNFN and KRT78 in the superficial cluster SF1 in EoE compared with HC (Figure 4A). This loss of terminal differentiation is a well-documented characteristic in EoE (8, 13). Interestingly, we observed that the increased expression of KRT13 and IVL, which occurs during suprabasal commitment, was still present in the suprabasal clusters in EoE as compared with the basal compartment, albeit at slightly lower levels compared with HC (Figure 4A and Supplemental Figure 4D). Surprisingly, in addition to their expected expression in basal clusters, genes associated with basal cells, such as SOX2, KLF5, and TP63, were also expressed throughout the suprabasal and superficial clusters SB1 through SF1 in EoE (Figure 4A). Our analysis reveals that EoE is not solely characterized by a loss of terminal differentiation but rather demonstrates that a majority of EEC still initiate an early differentiation process. Most notably, our analyses unveiled that both early and terminally differentiated EEC retain the expression of genes associated with basal cells in EoE.
Differential gene expression analysis reveals a dysfunctional differentiation process in EoE.
To gain further insights into the molecular changes associated with EoE, we conducted differential gene expression analysis on EEC in EoE compared with HC per cluster. Our findings further support the pivotal role of suprabasal and superficial EEC in EoE pathogenesis, as we observed the highest number of DEGs in the SB2 and SB3 clusters, followed by SF1 and SF2 (Figure 4B). SB and SF clusters also exhibited the greatest number of overlapping gene changes (Figure 4B) (31). Additionally, greater DEG log2 fold-changes (logFC) were detected in SB2 through SF2 compared with other clusters (Supplemental Figure 5A). Pathway enrichment analysis was performed on hierarchical clusters of the DEGs resulting from the comparison of all EEC in EoE to HC. Given our observation of expanded EEC in SB1 through SF1 in EoE (Figure 3E) and the extensive transcriptional changes observed in the suprabasal and superficial clusters, we hypothesized that the upregulated genes in hierarchical cluster 1 within the suprabasal and superficial compartments of EoE (Figure 4C) are crucial drivers of BCH. To identify potential upstream regulators of the DEGs within hierarchical cluster 1, we utilized EnrichR, which maintains updated databases of ChIP-Seq experiments and the Gene Expression Omnibus (GEO) signature of DEGs resulting from TF perturbations (32). Through enrichment analysis against these databases, we identified SOX2, TP63, and KLF5, 3 regulators of stem cell self-renewal in various tissues (21, 22, 33), as the top predicted TFs regulating the DEGs within hierarchical cluster 1 (Figure 5A). Furthermore, the increased expression of SOX2, TP63, and KLF5, along with their downstream targets, was confirmed in the suprabasal and superficial compartments in EoE (Figure 5B).
Suprabasal and superficial EEC retain a progenitor-like identity in EoE.
To further explore our hypothesis that suprabasal and superficial EEC maintain an epithelial progenitor-like identity in EoE, we developed 2 gene signatures that capture genes preferentially expressed in either quiescent cells or superficial cells in HC. The signatures were developed to establish a quiescent/basal/differentiation axis in human EEC (Supplemental Tables 1 and 2). Violin plots and contour plots mapping the quiescent signature score (y axis) and superficial signature score (x axis) by disease condition revealed a distinct separation between the superficial compartment of HC from the basal and suprabasal compartments (Figure 5C and Supplemental Figure 5B). However, in EoE, we observed a notable shift toward decreased superficial score and increased quiescent score in the superficial compartment, which resulted in an overlap between the superficial and suprabasal compartments (Figure 5C and Supplemental Figure 5B). Upon separation of the suprabasal and superficial compartments into epithelial clusters, we observed increased quiescent identity in each suprabasal and superficial cluster in EoE beginning at SB2, with the most dramatic shift in SF1 (Figure 5C and Supplemental Figure 5B). Further supporting maintained progenitor identity in early differentiated EEC in EoE, pathway enrichment analysis of the DEGs in each cluster between EoE and HC predicted the activation of the embryonic stem cell pluripotency pathway in EEC in EoE, with the highest activation scores and pathway coverage in suprabasal and superficial clusters (Supplemental Figure 5C).
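The idea of placing each cell on a quiescent/superficial axis can be illustrated with a simple per-cell signature score. The sketch below is not the scoring method used in the study, which the text does not specify. It assumes a score defined as the mean expression of the signature genes minus the mean expression of a reference gene set. The short gene lists are placeholders built from marker genes named in the text rather than the signatures in Supplemental Tables 1 and 2, the reference genes are an arbitrary choice, and the expression values are invented for illustration.

```python
from statistics import mean

# Placeholder signatures (assumption: NOT the published supplemental signatures)
QUIESCENT_SIGNATURE = ["KRT15", "DST"]
SUPERFICIAL_SIGNATURE = ["CNFN", "FLG", "KRT78"]
REFERENCE_GENES = ["ACTB", "GAPDH"]  # arbitrary reference set for illustration

def signature_score(expr, signature, reference):
    """Mean signature expression minus mean reference expression for one cell."""
    sig = mean(expr.get(g, 0.0) for g in signature)
    ref = mean(expr.get(g, 0.0) for g in reference)
    return sig - ref

# Toy log-normalized expression profiles (hypothetical values)
cells = {
    "basal_like": {"KRT15": 3.1, "DST": 2.4, "CNFN": 0.1, "FLG": 0.0,
                   "KRT78": 0.2, "ACTB": 2.0, "GAPDH": 2.2},
    "superficial_like": {"KRT15": 0.2, "DST": 0.3, "CNFN": 2.9, "FLG": 2.5,
                         "KRT78": 3.0, "ACTB": 2.1, "GAPDH": 2.0},
}

scores = {
    name: (
        signature_score(expr, QUIESCENT_SIGNATURE, REFERENCE_GENES),
        signature_score(expr, SUPERFICIAL_SIGNATURE, REFERENCE_GENES),
    )
    for name, expr in cells.items()
}

for name, (quiescent, superficial) in scores.items():
    print(f"{name}: quiescent = {quiescent:+.2f}, superficial = {superficial:+.2f}")
```

Plotting the two scores against each other for every cell gives the kind of two-dimensional map described above, in which a shift toward a higher quiescent score and a lower superficial score moves superficial cells toward the suprabasal region.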
To validate the alterations in the quiescent/basal/differentiation axis, we performed multispectral fluorescence staining on esophageal mucosal sections from HC and EoE using established markers of basal (KRT14, p63), suprabasal (IVL), and superficial (CNFN) cell identity (Figure 6A). For comparison, marker gene expression across clusters in HC or EoE is shown (Figure 6B). We confirmed appropriate expression of the suprabasal marker IVL following exit from the basal compartment in EoE (Figure 6A). The analysis of cell proportions revealed an expanded suprabasal population, a reduced number of superficial cells, and an unchanged basal compartment (Figure 6C), consistent with our scRNA-Seq findings. Notably, 73.8% of EEC in EoE expressed the basal marker p63 in the suprabasal and superficial compartments, while HC primarily exhibited basal-restricted p63 expression (Figure 6, A and D). Thus, our findings indicate that, despite maintaining the correct spatial organization of suprabasal lineage commitment, most suprabasal and superficial EEC in EoE retain a basal identity.
Pseudotemporal analysis confirms a global differentiation shift toward basal identity in EEC in EoE.
As a complementary approach to examine changes in cell differentiation, we merged epithelial samples from HC and EoE for pseudotemporal analysis with Monocle3 to examine differences in cell fate trajectories along the course of differentiation (Figure 7A and Supplemental Figure 6A). We followed the established model for EEC ordering and designated S-phase cells that are committed to divide as root cells (26) (Figure 7B and Supplemental Figure 6B), where resulting daughter cells that return to the G0 quiescent reserve (Q1 and Q2) are captured in 1 direction of the trajectory, while daughter cells that commit to differentiation (SB1 through SF2) move in another direction of the trajectory (Figure 7C). Trajectory analysis was performed on healthy and EoE conditions combined (Figure 7B) to allow direct comparison of pseudotime values between conditions. A severe decrease in late pseudotime peaks was observed in EEC in EoE, with a concentration of cells in an intermediate range of pseudotime values instead (Figure 7D). This shift in pseudotime value distribution was consistent across patients with EoE (Supplemental Figure 6C) and was not explained by the decreased frequency of superficial cells (Figure 7E). The comparison of pseudotime densities between epithelial compartments revealed a marked reduction in pseudotime density profiles in the suprabasal and superficial compartments in EoE, compared with HC (Figure 7E). Breakdown of the suprabasal and superficial compartments into component clusters revealed a significant decrease in pseudotime values starting in SB3 in EoE compared with HC (Figure 7F). In fact, hierarchical clustering of mean pseudotime values of each differentiated cluster demonstrated an 87% accuracy in distinguishing EoE from HC using the first 2 dendrogram nodes (Supplemental Figure 6D). This further demonstrates that suprabasal and superficial EEC retain a basal-like identity in EoE, compared with suprabasal and superficial EEC in HC.
Next, we identified gene modules that displayed trajectory-dependent gene expression patterns in EoE (Figure 8A, Supplemental Figure 7A, and Supplemental Table 3). Top terms from pathway enrichment analysis are shown for each module in Supplemental Figure 7B. Modules 4, 5, 6, and 7 show different expression patterns between EoE and HC (Supplemental Figure 7C). Modules 4, 5, and 6 were linked to EEC differentiation (Supplemental Figure 7B). Module 7 was particularly interesting as it contained genes with substantially increased expression in EoE in all epithelial clusters (Supplemental Figure 7C; Figure 8, A and B; and Supplemental Table 4), with peak increase in SB2 (Supplemental Figure 7D). Pathway enrichment analysis of module 7 genes identified enriched terms associated with response to wounding, regulation of actin filament-based process, regulation of keratinocyte proliferation, and positive regulation of cell motility (Figure 8C). Mean module 7 gene signature scores show elevated expression along the differentiation trajectory in EoE as compared with HC, peaking at pseudotemporal values representing the differentiated clusters that show earlier pseudotemporal identity in EoE (Figure 8D).
Interestingly, we observed that SOX2 and KLF5 expression also increased across a similar pseudotemporal range in EoE compared with HC (Figure 8D). A higher percentage of EEC expressed overlapping SOX2, KLF5, and module 7 signature scoring in EoE, with the highest level of coexpression within the same range of pseudotemporal values (Figure 8D and Supplemental Figure 7E), suggesting the regulation of module 7 genes by SOX2 and KLF5. All module 7 genes exhibited increased expression in EoE EEC, peaking in clusters SB2 through SF1 (Supplemental Figure 7F). Notably, expression of module 7 genes in EoE peaked in suprabasal and superficial clusters showing aberrant SOX2 and KLF5 expression (Supplemental Figure 7F). Furthermore, over 49% of module 7 genes were known epithelial targets of either SOX2, KLF5, or the SOX2-KLF5 interaction (34–37) (Supplemental Figure 7F and Supplemental Tables 5 and 6) with known protein-to-protein interactions (Supplemental Figure 8). This supports our findings regarding the key involvement of SOX2, KLF5, or their interaction in governing the upregulated gene programs identified in the suprabasal and superficial compartments in EoE and suggests that they play a prominent role in disease-associated tissue remodeling.
SOX2 and KLF5 gene programs are altered in the suprabasal and superficial compartments in EoE.
In addition to our findings that SOX2 and KLF5 are coexpressed in EoE, our analysis revealed an increased expression of these TFs in a greater percentage of cells within the suprabasal and superficial EEC clusters in EoE, in comparison with HC (Figure 9A and Supplemental Figure 9, A and B). IHC confirmed increased nuclear expression of SOX2 and KLF5 in suprabasal and superficial EEC in EoE (Figure 9, B–F). Interestingly, KLF5 was recently identified as a SOX2 binding partner, and their interaction led to the acquisition of chromatin binding sites not observed with SOX2 or KLF5 alone (37). Published epithelial-specific SOX2-, KLF5-, or SOX2/KLF5-regulated gene programs (34, 35, 37) were enriched across EEC clusters in EoE, with the most dramatic increase in the suprabasal and superficial clusters (Figure 9G and Supplemental Tables 5 and 7). In total, 1,620 genes known to be regulated by SOX2 and/or KLF5 were significantly upregulated in EEC in EoE (FDR-adjusted P < 0.05 and logFC > 0.25), 76.5% of which demonstrated the highest upregulation in the suprabasal and superficial compartments (Supplemental Table 7). Furthermore, in EoE, there was significant dysregulation of 224 genes known to be coregulated by the SOX2-KLF5 interaction (FDR-adjusted P < 0.05 and |logFC| > 0.25), with 86.7% of these genes showing upregulation (Supplemental Table 8).
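The two selection rules quoted above (upregulated: FDR-adjusted P < 0.05 and logFC > 0.25; dysregulated: FDR-adjusted P < 0.05 and |logFC| > 0.25) can be sketched as follows. The text does not name the FDR procedure, so the Benjamini-Hochberg adjustment is an assumption here, and the gene names, P values, and logFC values are hypothetical.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted P values in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down, enforcing monotonicity
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted

# Hypothetical differential expression results (EoE vs. HC)
genes = ["gene_1", "gene_2", "gene_3", "gene_4", "gene_5"]
raw_p = [0.0001, 0.004, 0.02, 0.3, 0.9]
logfc = [1.2, 0.1, -0.8, 2.0, 0.5]

fdr = benjamini_hochberg(raw_p)

# Thresholds quoted in the text
upregulated = [g for g, q, fc in zip(genes, fdr, logfc) if q < 0.05 and fc > 0.25]
dysregulated = [g for g, q, fc in zip(genes, fdr, logfc) if q < 0.05 and abs(fc) > 0.25]

print("upregulated:", upregulated)
print("dysregulated:", dysregulated)
```

The signed threshold keeps only genes that increase in EoE, whereas the absolute-value threshold also retains downregulated genes, which is why the text reports the upregulated share of the coregulated set separately.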
To further investigate the gene targets coregulated by SOX2 and KLF5 displaying elevated expression in EoE, we conducted unsupervised clustering analysis of their expression between HC and EoE compartments (Supplemental Figure 9C). Cluster 1 genes exhibited progressively reduced expression throughout the suprabasal and superficial compartments in HC but showed increased expression in EoE (Supplemental Figure 9C). Enrichment analysis revealed changes related to actin-filament based processes and cell morphogenesis associated with differentiation (Supplemental Figure 9D). Increased cluster 2 gene expression was seen in the suprabasal and superficial compartments in EoE, with relatively low expression in HC (Supplemental Figure 9C). Cluster 2 genes were linked to pathways associated with cell-to-cell junction and actin cytoskeleton organization (Supplemental Figure 9D). These findings demonstrate that the disrupted expression of SOX2 and KLF5, along with their coregulated downstream targets, contribute to epithelial remodeling specifically within the suprabasal and superficial compartments in EoE.
Dysregulated differentiation and aberrant signaling of progenitor-regulating TFs in EEC are specific to EoE and not observed in GERD.
Given that patients with EoE and GERD present with overlapping symptoms, such as heartburn and dysphagia (38), and that both undergo BCH (39), we next assessed whether the transcriptomic changes observed in EoE are disease specific or influenced by acid reflux. To investigate this, we performed scRNA-Seq on 4 patients with GERD, and we imputed cell identities established from HC and EoE data sets onto GERD EEC. Differential gene expression was calculated between EoE and HC for each epithelial compartment, and logFC from patients with GERD compared with HC was determined for genes significantly altered in EoE (|logFC| > 0.5 and FDR-adjusted P < 0.05) within each compartment. As shown in Figure 10A, EEC from GERD and EoE shared only a few genes changing in the same direction. Notably, most EoE DEGs in the basal and suprabasal compartments showed minimal change in GERD (Figure 10A). However, in the superficial compartment, 48% of DEGs displayed opposite changes in GERD (|logFC| > 0.5) (Figure 10A). We next compared known epithelial markers between HC, EoE, and GERD. In contrast to the loss of terminal differentiation observed in EoE, GERD EEC showed the correct expression patterns of early (KRT13 and IVL) and late differentiation markers (CNFN, SPRR2D, FLG, and KRT78) (Figure 10B).
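The comparison of EoE DEGs against their behavior in GERD can be illustrated by sorting each gene into one of three groups based on its GERD logFC. This is a minimal sketch: the |logFC| > 0.5 cutoff for an opposite change comes from the text, while applying the same cutoff to same-direction changes and labeling everything else as minimal is an assumption. Gene names and values are hypothetical.

```python
from collections import Counter

def classify_gerd_change(eoe_logfc, gerd_logfc, threshold=0.5):
    """Classify how an EoE DEG behaves in GERD relative to HC."""
    if abs(gerd_logfc) <= threshold:
        return "minimal"
    same_sign = (eoe_logfc > 0) == (gerd_logfc > 0)
    return "same" if same_sign else "opposite"

# Hypothetical logFC values versus HC for genes already significant in EoE
eoe_vs_hc = {"gene_a": 1.4, "gene_b": -0.9, "gene_c": 0.8, "gene_d": -1.2}
gerd_vs_hc = {"gene_a": 0.1, "gene_b": 0.7, "gene_c": 0.9, "gene_d": -0.2}

classes = {
    gene: classify_gerd_change(eoe_vs_hc[gene], gerd_vs_hc[gene])
    for gene in eoe_vs_hc
}
tally = Counter(classes.values())

for label in ("same", "minimal", "opposite"):
    print(f"{label}: {tally[label] / len(classes):.0%}")
```

Running the same tally per compartment would yield percentages of the kind reported above, such as the share of superficial DEGs that change in the opposite direction in GERD.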
To comprehensively compare the suprabasal and superficial compartments in EoE and GERD, we calculated differential gene expression between EoE versus HC in these compartments. We then conducted hierarchical clustering of the obtained DEGs and performed pathway enrichment analysis on genes within each hierarchical cluster (Supplemental Figure 10A). Finally, gene signatures were generated from each hierarchical cluster, and scoring was calculated across all HC, EoE, and GERD EEC (Figure 10C). DEGs in clusters 1 and 3 showed increased expression in the suprabasal and superficial compartments in EoE compared with HC but remained unchanged in GERD (Figure 10C and Supplemental Figure 10A). Enriched terms for these DEGs were associated with type II interferon signaling, chromatin remodeling, pluripotency of stem cells, cell junction organization, and cytoskeleton organization (Figure 10C and Supplemental Figure 10A). Similarly, cluster 4 genes related to keratinocyte differentiation showed decreased expression in the superficial compartment in EoE but were not decreased in GERD (Figure 10C and Supplemental Figure 10A).
To assess changes along the quiescent/basal/differentiation axis between GERD, EoE, and HC, we scored EEC using quiescent and superficial gene signatures. In GERD, the superficial compartment demonstrated proper adoption of superficial cell identity and inhibition of basal cell identity, unlike in EoE (Figure 10D and Supplemental Figure 10B). Moreover, the changes observed in the quiescent and superficial cell identity in the clusters SB3 through SF2 in EoE were absent in GERD (Supplemental Figure 10B); this was consistent across patients with GERD (Supplemental Figure 10C). Furthermore, we observed no aberrant expression of SOX2, KLF5, TP63, or KLF4 in the suprabasal and superficial compartments in GERD, unlike in EoE (Figure 10, B and E, and Supplemental Figure 10D). Notably, by performing hierarchical clustering of the main features identified in the suprabasal and superficial compartments of patients with EoE, we were able to accurately distinguish healthy individuals and patients with GERD from those with EoE, achieving a 93.3% accuracy at the top-level partition in the dendrogram (Supplemental Figure 11). These findings highlight that the loss of terminal differentiation, the shift toward basal cell identity in the suprabasal and superficial compartments, and the abnormal expression of SOX2 and/or KLF5 are exclusive to EEC in EoE and not attributable to gastric reflux in these patients.
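To make the notion of accuracy at the top-level partition of a dendrogram concrete, the sketch below clusters a handful of subjects and checks how well the first split separates EoE from non-EoE. The linkage method and distance metric used in the study are not stated, so average linkage with Euclidean distance is an assumption, and the two-dimensional feature vectors are invented toy values.

```python
import math
from itertools import combinations

def top_level_partition(points):
    """Average-linkage agglomerative clustering, stopped at two clusters.

    The two clusters that remain are the two branches of the dendrogram root.
    """
    clusters = [[i] for i in range(len(points))]
    while len(clusters) > 2:
        best = None
        for a, b in combinations(range(len(clusters)), 2):
            pair_sum = sum(math.dist(points[i], points[j])
                           for i in clusters[a] for j in clusters[b])
            avg = pair_sum / (len(clusters[a]) * len(clusters[b]))
            if best is None or avg < best[0]:
                best = (avg, a, b)
        _, a, b = best
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]  # b > a, so index a is unaffected
    return clusters

def partition_accuracy(clusters, labels, positive="EoE"):
    """Best agreement between the two branches and the positive/other split."""
    n = len(labels)
    hits = sum(labels[i] == positive for i in clusters[0])
    hits += sum(labels[i] != positive for i in clusters[1])
    return max(hits, n - hits) / n

# Toy per-subject features (hypothetical), e.g. two summary scores per subject
labels = ["EoE", "EoE", "EoE", "EoE", "HC", "HC", "GERD", "GERD"]
points = [(2.0, 1.8), (2.2, 2.1), (1.9, 2.0), (0.4, 0.2),
          (0.1, 0.2), (0.0, 0.1), (0.2, 0.0), (0.3, 0.3)]

clusters = top_level_partition(points)
accuracy = partition_accuracy(clusters, labels)
print(f"top-level partition accuracy: {accuracy:.1%}")
```

In this toy example, one EoE subject falls on the non-EoE branch, so the accuracy is below 100%. The percentage reported in the text is the analogous quantity computed on the study's own features and subjects.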
Discussion
Esophageal homeostasis relies on a careful balance between proliferation, differentiation, and cell death, which is critical for the maintenance of epithelial barrier function. Unfortunately, this process is disrupted in EoE, leading to Th2-mediated eosinophilic inflammation and epithelial remodeling, including loss of differentiation and BCH (9). Understanding the role of BCH in EoE disease progression is essential for improving clinical management and treatment strategies. Previous studies have highlighted the association between BCH and disease severity in patients with EoE (10) and demonstrated that BCH affects > 66% of the esophageal epithelial surface area in these patients (11). Even with treatment, BCH persists in approximately half of patients with EoE and correlates with persistent symptoms and endoscopic findings in histologically inactive patients (10, 11). To investigate the cellular identities and transcriptional processes underlying BCH and altered epithelial differentiation in EoE, we performed scRNA-Seq on esophageal biopsies obtained from adult patients with EoE and HC.
BCH in EoE has been suggested to result from the proliferation of basal cells (29, 40). While we confirmed the expansion of proliferating epibasal cells in EoE, our observations indicate that the morphological changes associated with BCH extend beyond the region of hyperproliferation. Interestingly, our study uncovered that BCH in EoE is marked by the expansion of nonproliferative suprabasal cells. In addition to BCH, EoE is characterized by a more widespread loss of differentiation (8, 13). While we confirmed the reduced expression of terminal differentiation markers in the superficial clusters in EoE, our study revealed a more intricate differentiation dynamic in EoE. Importantly, we demonstrate that suprabasal EEC undergo proper commitment to early differentiation after exiting the basal compartment. Furthermore, we found that suprabasal and superficial EEC in EoE retain a basal-like identity, as supported by earlier pseudotemporal identities and elevated expression of quiescence-associated genes in these epithelial clusters. In a recent study on intestinal injury repair mechanisms, a transient cell population derived from transit amplifying cells exhibited a regenerative stem cell–like transcriptional profile but lacked stem cell capacity (41). The authors coined the term adaptive differentiation to describe this atypical differentiation process that occurred in response to tissue damage to facilitate tissue repair (41). We hypothesize that the enhanced stem-like characteristics observed in suprabasal and superficial compartments in EoE are indicative of an adaptive differentiation process. This process may be triggered by the chronic inflammation in the EoE microenvironment, resembling a tissue-wide wound-healing response. While adaptive differentiation is potentially beneficial for tissue repair in the intestine, further investigation is needed to understand its implications in EoE and its potential contribution to pathology in the presence of chronic inflammation.
Enrichment analysis of upregulated genes in the suprabasal and superficial compartments in EoE revealed the potential regulatory role of SOX2 and KLF5 in BCH, epithelial remodeling, and the maintenance of stem cell identity in differentiated EEC. SOX2 is known for its involvement in stem cell maintenance and self-renewal by suppressing differentiation genes (21, 42, 43). Similarly, KLF5 regulates cell proliferation, migration, differentiation, and stemness (44–46). We confirmed elevated expression of SOX2 and KLF5, which overlapped in suprabasal and superficial EEC in EoE, and observed the upregulation of their target genes. Previous research demonstrates the interaction between SOX2 and KLF5 in the progression from normal tissue to esophageal squamous cell cancer (ESCC), suggesting a potential role in response to tissue injury (37, 47). Pathway analysis of the SOX2/KLF5 targets with increased expression in suprabasal and superficial EEC in EoE revealed terms related to epithelial remodeling. This suggests that SOX2 and KLF5 individually confer a basal identity to suprabasal and superficial EEC in EoE, while their combined signaling regulates gene programs involved in chronic epithelial wound repair. The mechanistic regulation of the injury response, the development of BCH/adaptive differentiation in EoE, and the upstream factors influencing SOX2 and KLF5 expression in suprabasal and superficial EEC are complex areas that require further investigation. Additionally, further studies are needed to explore how available therapeutic interventions can modulate the expression levels of SOX2 and KLF5.
Although our study focused on the interaction between SOX2 and KLF5, we cannot exclude the possibility that SOX2 also interacts with other factors in EoE. For instance, in esophageal and lung squamous cell cancer cell lines, SOX2 and p63 (another TF upregulated in differentiated EEC in EoE) were shown to jointly occupy multiple genomic loci (48). Furthermore, the joint binding of p63, SOX2, and KLF5 was demonstrated to regulate chromatin accessibility, epigenetic modifications, and gene expression in ESCC (40). In addition, SOX2 and KLF4 operate as a functional core in pluripotency induction across cells of different origins (49). Thus, additional investigations are needed to explore the interaction of SOX2 with other TFs predicted by our computational analyses in EoE.
Finally, this study also explored the transcriptomic changes at the single-cell level in GERD. Given the overlap in symptoms and histological presentation, particularly the presence of BCH (38), it was crucial to determine whether our findings were exclusive to EoE or applicable to GERD as well. Our results clearly demonstrate that the observed increased basal identity, aberrant SOX2 and KLF5 expression, and abnormal expression of other progenitor-regulating TFs in the suprabasal and superficial compartments are specific to EoE and not present in patients with GERD. Therefore, these changes in EoE cannot be solely attributed to gastric reflux. While our analysis primarily focused on comparing EoE and GERD to HC, the differences in cellular identities and transcriptomics between patients with GERD and HC will be subject to further investigation. It is noteworthy that, while GERD is a risk factor for esophageal cancer development (50), epidemiological studies have not found an association between EoE and esophageal cancer, despite the presence of chronic inflammation (51). In contrast, reflux has been shown to influence other esophageal conditions such as achalasia and scleroderma, which are both associated with a higher susceptibility to esophageal cancer progression (52, 53). Therefore, further exploration of the cellular landscape of EEC in GERD, with a larger cohort of patients, may provide valuable insights into the distinctions among GERD, EoE, achalasia, and scleroderma esophageal diseases and their varying susceptibilities to esophageal cancer progression.
Overall, we believe that our findings will provide future guidance on the development of novel therapeutic approaches for EoE. The development of targeted therapies aiming at promoting proper differentiation of suprabasal cells in EoE could help to restore normal epithelial homeostasis for cases in which the reduction of eosinophilic inflammation is not sufficient to completely restore epithelial tissue integrity or to improve clinical symptoms.
In conclusion, our study uncovered that BCH in EoE is characterized by nonproliferative EEC with a combination of differentiation and stem-like transcriptional features. The involvement of SOX2 and KLF5 as potential key regulators sheds light on the underlying molecular mechanisms driving BCH and adaptive differentiation in EoE. Further exploration of epithelial remodeling and adaptive differentiation holds great promise for advancing our understanding of disease progression and may pave the way for novel therapeutic strategies, particularly for patients who do not respond to conventional antiinflammatory treatments.
Methods
Human specimen collection.
HC met asymptomatic criteria including the lack of esophageal symptoms (heartburn, dysphagia, chest pain), history of tobacco use or alcohol dependency, BMI greater than 30 kg/m², or previous treatment with antacids or proton pump inhibitors. Patients with EoE were recruited at the primary visit contingent upon confirmed diagnosis and no history of steroid treatment. Patients with GERD were recruited at the primary visit contingent upon positive Bravo pH testing. Exclusion criteria for EoE and GERD included active severe esophagitis (Los Angeles esophagitis Grade C and above) (54), evidence of mechanical obstruction due to peptic stricture (GERD), long-segment Barrett’s metaplasia, unstable medical illness with ongoing diagnostic workup and treatment, current drug or alcohol abuse or dependency, current neurologic or cognitive impairment that would make the patient an unsuitable candidate for a research trial, severe mental illness, pregnancy and bleeding diathesis, or need for anticoagulation that cannot be stopped for endoscopy. Biospecimen Reporting for improved study quality data including age, sex, and race is detailed in Table 1.
scRNA-Seq sample preparation, library preparation, and sequencing.
Esophageal mucosal biopsies from the proximal and distal esophagus were processed immediately following collection and treated separately. Tissue was digested in Dispase (Corning) diluted in HBSS containing 10 μM HEPES and 10 μg/mL DNase I at 37°C for 15 minutes with 1,500 rpm agitation, followed by digestion in 0.25% trypsin containing 10 μM HEPES and 10 μg/mL DNase I for 20 minutes at 37°C with agitation. The cell suspension was filtered through a 40 μm strainer followed by 12-minute and 6-minute centrifugation at 500g at 4°C. Resuspended pellets were filtered through a 40 μm Flowmi filter (SP Bel-Art) and measured for cell count and viability using the Cellometer Auto2000 (Nexcelom Bioscience). All cell suspensions met an 85% minimum viability. In total, 16,000 cells were loaded into the Chromium iX Controller (10X Genomics) on a Chromium Next GEM Chip G (10X Genomics) to capture ~10,000 cells per sample and were processed for encapsulation according to the manufacturer’s protocol. The cDNA and library were generated using the Chromium Next GEM Single Cell 3′ Reagent Kits v3.1 (10X Genomics) and Dual Index Kit TT Set A (10X Genomics) according to the manufacturer’s manual. Quality control was performed by Agilent Bioanalyzer High Sensitivity DNA kit (Agilent Technologies) and Qubit DNA HS assay kit (Invitrogen) for qualitative and quantitative analysis, respectively. The multiplexed libraries were pooled and sequenced on an Illumina NovaSeq 6000 sequencer (Illumina) with 100 cycle kits using the following read lengths: 28 bp Read1 for cell barcode and UMI, and 90 bp Read2 for transcript. Library preparation and sequencing were performed at the Northwestern University NUSeq Core Facility. The GRCh38 transcriptome was used as a reference for alignment and feature counting using Cell Ranger (V4.0.0/6.0.0/6.1.0, 10X Genomics).
Data filtering, integration, and clustering.
Filtered matrix files were processed as Seurat objects in the Seurat R package 4.2.0 (55) with a minimum threshold of expression in ≥ 5 cells per gene. Each data set was filtered to exclude cells with total gene counts < 400 and total unique gene counts < 100. Data sets were individually normalized, scaled, and processed to calculate variable features using Seurat’s SCTransform workflow. Stricter quality control filtering was performed across all samples to remove cell populations with low total counts of unique genes or cell populations with high mitochondrial gene percentage (mean > 25%) following integration. Individual filtered samples were then integrated using reverse PCA dimensional reduction. Dimensionality reduction was performed followed by calculation of UMAP embeddings, nearest neighbors, and graph-based clustering. Clusters were annotated according to the expression of known cell-specific gene markers and were confirmed against the transcriptional profiles identified by Seurat’s function FindAllMarkers.
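The thresholds above can be illustrated with a minimal Python sketch. It is not the authors' Seurat code: the function names and the dictionary-based data layout are hypothetical, and it assumes that a cell is excluded when it falls below either count threshold and that the mitochondrial cutoff is applied to the per-cluster mean.

```python
from collections import Counter
from statistics import mean


def genes_expressed_in_min_cells(cells, min_cells=5):
    """Return genes detected (count > 0) in at least `min_cells` cells.

    `cells` maps a cell barcode to a {gene: count} dictionary (hypothetical layout).
    """
    n_cells = Counter()
    for counts in cells.values():
        for gene, count in counts.items():
            if count > 0:
                n_cells[gene] += 1
    return {gene for gene, n in n_cells.items() if n >= min_cells}


def keep_cell(counts, min_total=400, min_genes=100):
    """Keep a cell only if it reaches both the total-count and unique-gene thresholds.

    Assumption: failing either threshold excludes the cell.
    """
    total = sum(counts.values())
    n_genes = sum(1 for c in counts.values() if c > 0)
    return total >= min_total and n_genes >= min_genes


def clusters_to_remove(mito_pct_by_cluster, max_mean_mito=25.0):
    """Flag clusters whose mean mitochondrial gene percentage exceeds the cutoff."""
    return {
        cluster
        for cluster, pcts in mito_pct_by_cluster.items()
        if mean(pcts) > max_mean_mito
    }
```

In this sketch, the gene-level and cell-level filters run per data set, whereas the cluster-level mitochondrial filter runs on the integrated object.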
Epithelial cluster and compartment identification.
Epi were subsetted and reintegrated on a per-sample basis using the Seurat integration pipeline described above. Integration anchors were calculated against HC samples as reference. PCA was performed, and the first 30 PCs were included for downstream analysis. Optimal clustering resolution of 0.5 was determined using Clustree. Quiescent (clusters 1 and 2) and cycling clusters (clusters 3–5) were subclustered to distinguish cycling basal cells (DST+, MKI67+) from cycling epibasal cells (DST–, KRT13lo, MKI67+). Epithelial clusters were annotated according to expression of known genes in HC as previously described (26) and confirmed against the transcriptional profiles identified by FindAllMarkers, performed on HC cells. Clusters were combined into parental epithelial compartments (B, SB, SF) based on the expression of established markers (8, 15, 16, 26).
Cell cycle and proliferation analysis.
Seurat’s function CellCycleScoring (56) was used to assign the cell cycle phase of each cell. Cells exhibiting a weak predicted score for S and G2/M were classified as G0/G1 phase. Expression of the markers KRT15 and DST identified Q1 and Q2 epithelial clusters as quiescent and distinguished the G0 from the G1 phase. During the SCTransform workflow, cell cycle was not regressed, allowing EEC to cluster based on quiescence, S-phase, G2/M-phase, and progressive stages of differentiation, confirmed using the expression of marker genes and cell cycle scoring for each cluster. Cell proportion in each cluster was used to assess proliferation rates.
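The phase assignment and the proportion-based readout can be sketched in Python as follows. The threshold of 0 for a "weak" score and the tie-handling rule are assumptions made for illustration, and the function names are hypothetical rather than part of Seurat.

```python
from collections import Counter


def assign_phase(s_score, g2m_score, threshold=0.0):
    """Assign a cell cycle phase from S and G2/M scores.

    Assumption: a score below `threshold` counts as weak; cells weak for both
    scores are labeled G0/G1. Ties between non-weak scores fall to G2/M here.
    """
    if s_score < threshold and g2m_score < threshold:
        return "G0/G1"
    return "S" if s_score > g2m_score else "G2/M"


def cluster_proportions(cluster_labels):
    """Return the fraction of cells in each cluster, given one label per cell."""
    counts = Counter(cluster_labels)
    n_cells = len(cluster_labels)
    return {cluster: n / n_cells for cluster, n in counts.items()}
```

Separating G0 from G1 within the G0/G1 group relied on marker expression (KRT15 and DST) and is not captured by the score-based rule in this sketch.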
Detection of DEGs, gene expression analysis, and gene set enrichment analysis.
Identification of DEGs between cell clusters was performed using FindAllMarkers, with filtering for significantly upregulated genes with |logFC| > 0.25. For differential expression analysis comparing expression profiles between like cell identities across disease conditions, the per-sample population mean gene expression was calculated from the normalized RNA assay. Tested genes were filtered by a lower minimum percentage (min.pct) threshold of 5%–10%, which is the percentage of cells expressing a given gene per cell group. The R package edgeR 3.36.0 (57–59) was utilized to create a DGEList object, followed by calculation of normalization factors and counts per million. The logFC was computed, and significance was determined using the Wilcoxon rank-sum test, with FDR P value adjustment to correct for multiple comparisons. DEGs were filtered based on an FDR-adjusted P < 0.05 and |log2FC| > 0.25, unless a more stringent threshold was specified. To visualize the percentage of cells expressing a gene across clusters, the percentage expression in each cluster was calculated using a minimum expression threshold to filter cells with negligible expression of the gene. Pathway enrichment analyses were performed on DEGs filtered for logFC and significance based on FDR-adjusted P value, as mentioned above. The analysis of positively and negatively regulated DEGs was completed using the Ingenuity Pathway Analysis (IPA, Qiagen) software. For the analysis of DEGs changing in only 1 direction, pathway enrichment was performed with either Metascape or ClusterProfiler (60–62).
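The DEG filtering step can be sketched in Python as shown here. The sketch assumes the FDR adjustment is the Benjamini-Hochberg procedure, which the text does not name explicitly, and it takes the Wilcoxon P values as given. The function names are hypothetical.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted P values in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def filter_degs(genes, log2fc, pvalues, fc_cutoff=0.25, fdr_cutoff=0.05):
    """Keep genes with adjusted P < fdr_cutoff and |log2FC| > fc_cutoff."""
    fdr = benjamini_hochberg(pvalues)
    return [
        gene
        for gene, lfc, q in zip(genes, log2fc, fdr)
        if q < fdr_cutoff and abs(lfc) > fc_cutoff
    ]
```

For example, `filter_degs(["A", "B"], [1.0, 0.1], [0.001, 0.001])` keeps only gene A, because gene B does not pass the fold-change cutoff.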
TF analysis.
TF analysis was performed using the R package EnrichR 3.1.0 (32, 63, 64). To identify upstream TFs that regulate EoE-specific gene programs, DEGs were calculated across all EEC between disease conditions and filtered for |logFC| > 1 and FDR-adjusted P < 0.05. Hierarchical clustering was performed on population Z scores of DEGs across healthy and EoE epithelial compartments. Relevant hierarchical clusters were selected and used as input for EnrichR analysis with either the ChEA3 2022 ChIP-Seq database or the TF Perturbations followed by Expression GEO Signature database (https://maayanlab.cloud/Enrichr/#libraries).
Heatmap visualization, population Z score calculation, and hierarchical clustering.
Gene sets displayed in heatmaps, including gene sets incorporated from external sources, were confirmed as changed in EoE with differential expression testing filtered based on FDR-adjusted P < 0.05 and minimum logFC threshold. To calculate population Z scores, average population expression values were derived from the normalized RNA assay and scaled by the mean and SD calculated across all populations. All heatmaps show population Z scores unless otherwise indicated. Hierarchical clustering using the hclust function from the R stats package 3.6.2 was performed on population Z scores, using the ward.D2 clustering method (65, 66) and the Pearson distance method (67, 68). Heatmaps were generated using the R package ComplexHeatmap 2.10.0 (67, 68).
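The population Z score and the correlation-based distance can be written out as a short Python sketch. It assumes the sample SD (n − 1 denominator) and a Pearson distance defined as 1 − r. Neither convention is stated in the text, and the function names are hypothetical.

```python
import math
from statistics import mean, stdev


def population_z_scores(avg_expr):
    """Scale one gene's per-population average expression to Z scores.

    Assumption: the sample SD (n - 1 denominator) is used.
    """
    mu = mean(avg_expr)
    sd = stdev(avg_expr)
    if sd == 0:
        return [0.0] * len(avg_expr)
    return [(x - mu) / sd for x in avg_expr]


def pearson_distance(a, b):
    """Return 1 - Pearson correlation between two equal-length profiles."""
    ma, mb = mean(a), mean(b)
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    var_a = sum((x - ma) ** 2 for x in a)
    var_b = sum((y - mb) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        raise ValueError("Pearson distance is undefined for a constant profile")
    return 1.0 - cov / math.sqrt(var_a * var_b)
```

Under this definition, two genes with perfectly correlated population profiles have a distance near 0, and perfectly anticorrelated profiles have a distance near 2.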
Gene signature score analysis and functional analysis.
Gene signatures were generated using Seurat’s function AddModuleScore. Quiescent and superficial gene signatures were defined using HC cells from our scRNA-Seq data set. Differential expression analysis was performed comparing either quiescent epithelial clusters (Q1 and Q2) or superficial clusters (SF1-SF2) to the remaining epithelium. DEGs were filtered for FDR-adjusted P < 0.05 and ranked by logFC, with the top 100 selected. Quiescent and superficial signature scores were plotted using the ggplot2 R package’s geom_density_2d function (69), with consistent binning applied across all compared conditions. TF-regulated gene signatures were identified via enrichment analysis (EnrichR) (32, 63, 64) or sourced from external experiments (Supplemental Table 5). The data sets included in this analysis were previously published (34, 35, 37). Gene signatures were also calculated from coexpressed gene modules identified using Monocle3; these modules were additionally used as input to the STRINGdb R package (70) to infer protein-to-protein interactions.
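The signature definition and scoring can be illustrated with a simplified Python sketch. It is not a reimplementation of AddModuleScore, which selects control genes from expression-matched bins. Here the control set is passed in directly, and all function names and the row layout are hypothetical.

```python
from statistics import mean


def top_signature(deg_table, n_top=100, fdr_cutoff=0.05):
    """Select the top `n_top` genes by logFC among DEGs passing the FDR cutoff.

    `deg_table` is a list of {"gene", "logfc", "fdr"} rows (hypothetical layout).
    """
    kept = [row for row in deg_table if row["fdr"] < fdr_cutoff]
    kept.sort(key=lambda row: row["logfc"], reverse=True)
    return [row["gene"] for row in kept[:n_top]]


def simple_module_score(cell_expr, signature, control):
    """Score one cell as mean signature expression minus mean control expression.

    Simplification: the caller supplies the control genes; missing genes count as 0.
    """
    sig_mean = mean([cell_expr.get(g, 0.0) for g in signature])
    ctrl_mean = mean([cell_expr.get(g, 0.0) for g in control])
    return sig_mean - ctrl_mean
```

A positive score indicates that the signature genes are expressed above the control background in that cell.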
Pseudotime analysis.
Pseudotime analysis was performed using the R package Monocle3 1.0.0 (71–73). Individual samples were log2 normalized, scaled, merged using Seurat’s merge function, dimensionality reduced, and batch corrected using the fast mutual nearest neighbors (FMNN) method by individual sample. UMAP embeddings were calculated. A CellDataSet object was created with normalized and scaled counts for 2,000 variable genes and reduction feature loadings calculated by FMNN. Monocle3’s function learn_graph was used to infer a trajectory graph from the UMAP embeddings, with a Euclidean distance ratio of 1, a geodesic distance ratio of 0.5, and a minimum branch length of 10. Cells within the S-phase epithelial cluster were assigned a root state of pseudotime 0. Increasing pseudotime values of cells committed to becoming quiescent are depicted to the left on pseudotime axes, and pseudotime values of cells committed to differentiation are depicted to the right on pseudotime axes.
For EoE samples only, Monocle3’s function graph_test was utilized to identify genes with differential expression along the trajectory. Identified genes were clustered into modules of coexpressed genes with corresponding gene signatures calculated. To determine the most represented module in each cell, each module gene signature was scaled and centered between –2 and 2 across all cells. Each cell was assigned to the module exhibiting the highest scaled scoring. To visualize the expression of genes or signatures across pseudotime-ordered cells, we plotted the gene expression or gene signature score for each cell and calculated local mean expression values using local weighted regression fitting of the data by the locally estimated scatterplot smoothing (LOESS) method. For the calculation of coexpression, cells were assessed on a binary basis for expression of all examined genes or gene signatures (value = 1) or expression of less than all or none of the examined genes or gene signatures (value = 0). The values were plotted for each cell ordered in pseudotime, and local mean values were calculated using the LOESS method.
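The module assignment and the binary coexpression call can be sketched in Python. The sketch interprets "scaled and centered between –2 and 2" as a Z score across cells clipped to that range, and it treats any value above 0 as expression. Both are assumptions, and the function names are hypothetical. LOESS smoothing is not shown.

```python
from statistics import mean, stdev


def scale_and_clip(scores, lo=-2.0, hi=2.0):
    """Z-score one module's scores across cells, then clip to [lo, hi]."""
    mu = mean(scores)
    sd = stdev(scores)
    return [min(hi, max(lo, (s - mu) / sd)) for s in scores]


def assign_modules(module_scores):
    """Assign each cell to the module with the highest scaled score.

    `module_scores` maps a module name to a list with one score per cell.
    """
    scaled = {name: scale_and_clip(vals) for name, vals in module_scores.items()}
    n_cells = len(next(iter(scaled.values())))
    return [max(scaled, key=lambda name: scaled[name][i]) for i in range(n_cells)]


def coexpression_flags(cells, genes, threshold=0.0):
    """Return 1 for cells expressing all `genes` above `threshold`, else 0."""
    return [
        1 if all(cell.get(g, 0.0) > threshold for g in genes) else 0
        for cell in cells
    ]
```

The 0/1 flags from `coexpression_flags` are the per-cell values that would then be ordered by pseudotime and smoothed.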
Imputation of cell populations in the GERD scRNA-Seq data from the EoE and HC scRNA-Seq data set.
An imputation was performed on each cell in the processed GERD epithelial data set to determine the analogous cell population in the integrated HC and EoE epithelial data set using Seurat’s MapQuery function. Cluster labels were assigned based on the maximum prediction score for each of the query cells.
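The label assignment rule reduces to taking the highest-scoring reference population for each query cell, as in this minimal Python sketch. The prediction scores are taken as given, and the function name and data layout are hypothetical rather than the MapQuery output format.

```python
def assign_labels(prediction_scores):
    """Assign each query cell the reference label with the maximum prediction score.

    `prediction_scores` maps a cell barcode to a {label: score} dictionary.
    On ties, the label listed first is returned.
    """
    return {
        cell: max(scores, key=scores.get)
        for cell, scores in prediction_scores.items()
    }
```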
IHC and scoring.
Immunostaining was performed on formalin-fixed, paraffin-embedded (FFPE) esophageal mucosal biopsies as previously described (45). Briefly, heat-induced antigen retrieval was performed for 30 minutes in Buffer A (Electron Microscopy Sciences, pH 6). Tissue sections were blocked using 0.3% H2O2, streptavidin/biotin incubation, and Starting Block blocking buffer (Thermo Fisher Scientific). Primary and secondary specific antibodies were added (Supplemental Table 9), and detection was performed as previously described (45). Images were acquired on a Nikon Eclipse Ci microscope with a Nikon DS-Ri2 camera and NIS Elements software. H&E staining was performed by the Robert H. Lurie Comprehensive Cancer Center Pathology Core. Image analysis was performed using Fiji software (74). H&E-stained slides were evaluated for BCH according to EoE-HSS (11). For staining quantification, positive cell fraction was calculated as the percentage of positively stained cells compared with the total cell count. For intensity quantification, nuclei were identified by thresholding, mask conversion, watershed segmentation, and particle analysis, followed by measurement of average inverted intensity (grayscale units) after background subtraction.
Multispectral fluorescence staining and imaging.
Multispectral fluorescence staining was performed using the Opal 6-Plex Detection kit (Akoya Biosciences) using FFPE tissue sections. Slides were baked at 60°C for 15 minutes and deparaffinized with the Leica Bond Dewax solution (Leica Biosystems), followed by heat-based antigen retrieval using Bond Epitope Retrieval Solution 1 (Leica Biosystems) for 30 minutes. Using the Leica Bond Rx Automated Stainer (Leica Biosystems), slides were incubated with primary antibodies followed by the appropriate secondary horseradish peroxidase–conjugated polymer. Incubation was next performed with a unique Opal dye permitting fluorophore covalent bonding to the horseradish polymer. Heat-based retrieval with Bond Epitope Retrieval 1 (Leica Biosystems) was finally performed for 20 minutes. Slides were subjected to sequential rounds of staining. Primary antibodies, concentrations, and associated fluorophores are detailed in Supplemental Table 9. Sections were counterstained with Spectral DAPI and mounted with ProLong Diamond Antifade Mountant (Thermo Fisher Scientific). Images were acquired using the Vectra3 microscope (Akoya Biosciences) and Phenochart Whole Slide Viewer (Akoya Biosciences). Postacquisition image adjustments were performed using InForm Automated Image Analysis Software (Akoya Biosciences) and Fiji (74).
Statistics.
Statistical analyses were performed using R version 4.1.1. Descriptive statistics are displayed as mean ± SEM for continuous variables unless otherwise described and as frequency counts for categorical variables. For nonnormally distributed continuous data, Wilcoxon rank-sum test was used. When testing multiple conditions, multiple-comparison adjustment was employed. P < 0.05 was considered statistically significant.
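For reference, the mean ± SEM summary can be computed as in the following Python sketch, which assumes the SEM is the sample SD divided by the square root of the sample size. The function name is hypothetical.

```python
import math
from statistics import mean, stdev


def mean_sem(values):
    """Return (mean, SEM), with SEM = sample SD / sqrt(n). Requires n >= 2."""
    n = len(values)
    return mean(values), stdev(values) / math.sqrt(n)
```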
Study approval.
Procedures using human tissue were performed by the Digestive Health Foundation Biorepository with approval from the Northwestern IRB (study STU00208111). Written informed consent was received prior to participation.
Data availability.
All raw sequencing files and processed barcode and feature matrices used within the article are deposited in NCBI’s GEO database under accession code GSE218607. All supporting analytic code is available at the “scRNA-Human_EoE_Esophagus” repository hosted by the Tetreault Lab on GitHub (https://github.com/Tetreault-Lab/Tetreault-scRNA-Human_EoE_Esophagus-2023; commit ID f51f957). All other supporting data are available within the article, supplement, or Supporting Data Values or from the corresponding author upon reasonable request.
Author contributions
MHC was involved in designing research studies, conducting experiments, acquiring data, analyzing data, and writing the manuscript. ALK was involved in designing research studies and analyzing data. PJK, DAC, NG, and JEP were involved in designing research studies and writing the manuscript. DRW and KAW were involved in designing research studies, analyzing data, and writing the manuscript. MPT was involved in designing research studies, analyzing data, providing reagents, and writing the manuscript.
Acknowledgments
This work was supported by NIH NIDDK P01DK117824 and Digestive Health Foundation to MPT, PJK, and JEP; NIH NHLBI F31HL147413 to MHC; NIH NIDDK R01DK121159 to KAW; the Robert H. Lurie Comprehensive Cancer Center (NIH NCI CCSG P30 CA060553) through the Northwestern University Pathology Core Facility; and the NUSeq Core Facility (1S10OD025120). This research was supported in part through the computational resources and staff contributions provided for the Quest high-performance computing facility at Northwestern University, which is jointly supported by the Office of the Provost, the Office for Research, and Northwestern University Information Technology. This research was also supported in part through the computational resources and staff contributions provided by the Genomics Compute Cluster, which is jointly supported by the Feinberg School of Medicine, the Center for Genetic Medicine, Feinberg’s Department of Biochemistry and Molecular Genetics, the Office of the Provost, the Office for Research, and Northwestern Information Technology. The Genomics Compute Cluster is part of Quest, Northwestern University’s high-performance computing facility.
Version 1. 09/06/2023: In-Press Preview
Version 2. 10/09/2023: Electronic publication
Footnotes
Conflict of interest: JEP, PJK, and Northwestern University jointly possess intellectual property rights and ownership pertaining to FLIP Panometry systems, methods, and apparatus, in collaboration with Medtronic Inc. DAC serves as a speaker and consultant, maintaining licensing agreements with Medtronic, and holds a consultancy role with Phantom Pharmaceuticals. PJK offers consultancy to Ironwood and Reckitt. JEP serves as an advisory board member, consultant, and speaker for Endogastric Solutions; has received a grant from Ironwood; serves as a speaker and advisory board member for Takeda and AstraZeneca; and maintains roles, including speaker, consultant, advisory board member, and grant recipient, with Medtronic. JEP also holds intellectual property patents with licensing agreements with Medtronic; is involved as a speaker, consultant, and advisory board member at Torax/Ethicon; is connected to Diversatek as a grant recipient, consultant, and advisory board member; and holds an advisory board position at Phantom Pharmaceuticals and Neurogastrx. NG provides consultancy to entities, including Allakos, AstraZeneca, Bristol Myers Squibb, Nutricia, and Knopp; is a consultant and speaker for Sanofi-Regeneron; and receives publication royalties from UpToDate. DRW acts as a speaker and consultant for Pfizer.
Copyright: © 2023, Clevenger et al. This is an open access article published under the terms of the Creative Commons Attribution 4.0 International License.
Reference information: JCI Insight. 2023;8(19):e171765. https://doi.org/10.1172/jci.insight.171765.
Contributor Information
Margarette H. Clevenger, Email: margaretteclevenger2021@u.northwestern.edu.
Adam L. Karami, Email: adam.karami@temple.edu.
Dustin A. Carlson, Email: dustin-carlson@northwestern.edu.
Peter J. Kahrilas, Email: p-kahrilas@northwestern.edu.
Nirmala Gonsalves, Email: n-gonsalves@northwestern.edu.
John E. Pandolfino, Email: j-pandolfino@northwestern.edu.
Deborah R. Winter, Email: deborah.winter@northwestern.edu.
Kelly A. Whelan, Email: kelly.whelan@temple.edu.
Marie-Pier Tétreault, Email: marie-pier.tetreault@northwestern.edu.
References
- 1.Chehade M, Aceves SS. Treatment of eosinophilic esophagitis: diet or medication? J Allergy Clin Immunol Pract. 2021;9(9):3249–3256. doi: 10.1016/j.jaip.2021.07.029. [DOI] [PubMed] [Google Scholar]
- 2.Dellon ES, et al. Dupilumab in adults and adolescents with eosinophilic esophagitis. N Engl J Med. 2022;387(25):2317–2330. doi: 10.1056/NEJMoa2205982. [DOI] [PubMed] [Google Scholar]
- 3.Dellon ES, et al. Rapid recurrence of eosinophilic esophagitis activity after successful treatment in the observation phase of a randomized, double-blind, double-dummy trial. Clin Gastroenterol Hepatol. 2020;18(7):1483–1492. doi: 10.1016/j.cgh.2019.08.050. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Greuter T, et al. Long-term treatment of eosinophilic esophagitis with swallowed topical corticosteroids: development and evaluation of a therapeutic concept. Am J Gastroenterol. 2017;112(10):1527–1535. doi: 10.1038/ajg.2017.202. [DOI] [PubMed] [Google Scholar]
- 5.Taft TH, et al. Qualitative assessment of patient-reported outcomes in adults with eosinophilic esophagitis. J Clin Gastroenterol. 2011;45(9):769–774. doi: 10.1097/MCG.0b013e3182166a5a. [DOI] [PubMed] [Google Scholar]
- 6.Okin D, Medzhitov R. Evolution of inflammatory diseases. Curr Biol. 2012;22(17):R733–R740. doi: 10.1016/j.cub.2012.07.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Rochman M, et al. Epithelial origin of eosinophilic esophagitis. J Allergy Clin Immunol. 2018;142(1):10–23. doi: 10.1016/j.jaci.2018.05.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Rochman M, et al. Single-cell RNA-Seq of human esophageal epithelium in homeostasis and allergic inflammation. JCI Insight. 2022;7(11):159093. doi: 10.1172/jci.insight.159093. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Underwood B, et al. Breaking down the complex pathophysiology of eosinophilic esophagitis. Ann Allergy Asthma Immunol. 2023;130(1):28–39. doi: 10.1016/j.anai.2022.10.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Whelan KA, et al. Persistent basal cell hyperplasia is associated with clinical and endoscopic findings in patients with histologically inactive eosinophilic esophagitis. Clin Gastroenterol Hepatol. 2020;18(7):1475–1482. doi: 10.1016/j.cgh.2019.08.055. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Collins MH, et al. Newly developed and validated eosinophilic esophagitis histology scoring system and evidence that it outperforms peak eosinophil count for disease diagnosis and monitoring. Dis Esophagus. 2017;30(3):1–8. doi: 10.1093/dote/dow025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Stuart T, et al. Comprehensive integration of single-cell data. Cell. 2019;177(7):1888–1902. doi: 10.1016/j.cell.2019.05.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Rochman M, et al. Profound loss of esophageal tissue differentiation in patients with eosinophilic esophagitis. J Allergy Clin Immunol. 2017;140(3):738–749. doi: 10.1016/j.jaci.2016.11.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Simon D, et al. Active eosinophilic esophagitis is characterized by epithelial barrier defects and eosinophil extracellular trap formation. Allergy. 2015;70(4):443–452. doi: 10.1111/all.12570. [DOI] [PubMed] [Google Scholar]
- 15.Okumura T, et al. Neurotrophin receptor p75(NTR) characterizes human esophageal keratinocyte stem cells in vitro. Oncogene. 2003;22(26):4017–4026. doi: 10.1038/sj.onc.1206525. [DOI] [PubMed] [Google Scholar]
- 16.Busslinger GA, et al. Human gastrointestinal epithelia of the esophagus, stomach, and duodenum resolved at single-cell resolution. Cell Rep. 2021;34(10):108819. doi: 10.1016/j.celrep.2021.108819. [DOI] [PubMed] [Google Scholar]
- 17.Bravo R, Macdonald-Bravo H. Changes in the nuclear distribution of cyclin (PCNA) but not its synthesis depend on DNA replication. EMBO J. 1985;4(3):655–661. doi: 10.1002/j.1460-2075.1985.tb03679.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Celis JE, Celis A. Cell cycle-dependent variations in the distribution of the nuclear protein cyclin proliferating cell nuclear antigen in cultured cells: subdivision of S phase. Proc Natl Acad Sci U S A. 1985;82(10):3262–3266. doi: 10.1073/pnas.82.10.3262. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Miller I, et al. Ki67 is a graded rather than a binary marker of proliferation versus quiescence. Cell Rep. 2018;24(5):1105–1112. doi: 10.1016/j.celrep.2018.06.110.
- 20. Que J, et al. Multiple dose-dependent roles for Sox2 in the patterning and differentiation of anterior foregut endoderm. Development. 2007;134(13):2521–2531. doi: 10.1242/dev.003855.
- 21. Boyer LA, et al. Core transcriptional regulatory circuitry in human embryonic stem cells. Cell. 2005;122(6):947–956. doi: 10.1016/j.cell.2005.08.020.
- 22. Pellegrini G, et al. p63 identifies keratinocyte stem cells. Proc Natl Acad Sci U S A. 2001;98(6):3156–3161. doi: 10.1073/pnas.061032098.
- 23. Whyte WA, et al. Master transcription factors and mediator establish super-enhancers at key cell identity genes. Cell. 2013;153(2):307–319. doi: 10.1016/j.cell.2013.03.035.
- 24. Banks-Schlegel S, Green H. Involucrin synthesis and tissue assembly by keratinocytes in natural and cultured human epithelia. J Cell Biol. 1981;90(3):732–737. doi: 10.1083/jcb.90.3.732.
- 25. Viaene AI, Baert JH. Expression of cytokeratin-mRNAs in squamous-cell carcinoma and balloon-cell formation of human oesophageal epithelium. Histochem J. 1995;27(1):69–78. doi: 10.1007/BF00164174.
- 26. Kabir MF, et al. Single cell transcriptomic analysis reveals cellular diversity of murine esophageal epithelium. Nat Commun. 2022;13(1):2167. doi: 10.1038/s41467-022-29747-x.
- 27. Dale BA, et al. Identification of filaggrin in cultured mouse keratinocytes and its regulation by calcium. J Invest Dermatol. 1983;81(1 suppl):90s–95s. doi: 10.1111/1523-1747.ep12540769.
- 28. Kc K, et al. In vitro model for studying esophageal epithelial differentiation and allergic inflammatory responses identifies keratin involvement in eosinophilic esophagitis. PLoS One. 2015;10(6):e0127755. doi: 10.1371/journal.pone.0127755.
- 29. Denning KL, et al. Immunoreactivity of p53 and Ki-67 for dysplastic changes in children with eosinophilic esophagitis. Pediatr Dev Pathol. 2013;16(5):331–336. doi: 10.2350/13-03-1306-OA.1.
- 30. Jiang M, et al. BMP-driven NRF2 activation in esophageal basal cell differentiation and eosinophilic esophagitis. J Clin Invest. 2015;125(4):1557–1568. doi: 10.1172/JCI78850.
- 31. Lex A, et al. UpSet: visualization of intersecting sets. IEEE Trans Vis Comput Graph. 2014;20(12):1983–1992. doi: 10.1109/TVCG.2014.2346248.
- 32. Chen EY, et al. EnrichR: interactive and collaborative HTML5 gene list enrichment analysis tool. BMC Bioinformatics. 2013;14:128. doi: 10.1186/1471-2105-14-128.
- 33. Bourillot PY, Savatier P. Krüppel-like transcription factors and control of pluripotency. BMC Biol. 2010;8:125. doi: 10.1186/1741-7007-8-125.
- 34. Fang X, et al. ChIP-seq and functional analysis of the SOX2 gene in colorectal cancers. OMICS. 2010;14(4):369–384. doi: 10.1089/omi.2010.0053.
- 35. Paranjapye A, et al. Krüppel-like factor 5 regulates wound repair and the innate immune response in human airway epithelial cells. J Biol Chem. 2021;297(2):100932. doi: 10.1016/j.jbc.2021.100932.
- 36. Uchiyama A, et al. SOX2 epidermal overexpression promotes cutaneous wound healing via activation of EGFR/MEK/ERK signaling mediated by EGFR ligands. J Invest Dermatol. 2019;139(8):1809–1820. doi: 10.1016/j.jid.2019.02.004.
- 37. Wu Z, et al. Reprogramming of the esophageal squamous carcinoma epigenome by SOX2 promotes ADAR1 dependence. Nat Genet. 2021;53(6):881–894. doi: 10.1038/s41588-021-00859-2.
- 38. Cheng E, et al. Eosinophilic esophagitis: interactions with gastroesophageal reflux disease. Gastroenterol Clin North Am. 2014;43(2):243–256. doi: 10.1016/j.gtc.2014.02.004.
- 39. Dunbar KB, et al. Association of acute gastroesophageal reflux disease with esophageal histologic changes. JAMA. 2016;315(19):2104–2112. doi: 10.1001/jama.2016.5657.
- 40. Jiang YY, et al. TP63, SOX2, and KLF5 establish a core regulatory circuitry that controls epigenetic and transcription patterns in esophageal squamous cell carcinoma cell lines. Gastroenterology. 2020;159(4):1311–1327. doi: 10.1053/j.gastro.2020.06.050.
- 41. Ohara TE, et al. Adaptive differentiation promotes intestinal villus recovery. Dev Cell. 2022;57(2):166–179. doi: 10.1016/j.devcel.2021.12.012.
- 42. Liu K, et al. The multiple roles for Sox2 in stem cell maintenance and tumorigenesis. Cell Signal. 2013;25(5):1264–1271. doi: 10.1016/j.cellsig.2013.02.013.
- 43. Masui S, et al. Pluripotency governed by Sox2 via regulation of Oct3/4 expression in mouse embryonic stem cells. Nat Cell Biol. 2007;9(6):625–635. doi: 10.1038/ncb1589.
- 44. Kim CK, et al. Krüppel-like factor 5 regulates stemness, lineage specification, and regeneration of intestinal epithelial stem cells. Cell Mol Gastroenterol Hepatol. 2020;9(4):587–609. doi: 10.1016/j.jcmgh.2019.11.009.
- 45. Yang Y, et al. Krüppel-like factor 5 activates MEK/ERK signaling via EGFR in primary squamous epithelial cells. FASEB J. 2007;21(2):543–550. doi: 10.1096/fj.06-6694com.
- 46. Yang Y, et al. Krüppel-like factor 5 controls keratinocyte migration via the integrin-linked kinase. J Biol Chem. 2008;283(27):18812–18820. doi: 10.1074/jbc.M801384200.
- 47. Ge Y, et al. Stem cell lineage infidelity drives wound repair and cancer. Cell. 2017;169(4):636–650. doi: 10.1016/j.cell.2017.03.042.
- 48. Watanabe H, et al. SOX2 and p63 colocalize at genetic loci in squamous cell carcinomas. J Clin Invest. 2014;124(4):1636–1645. doi: 10.1172/JCI71545.
- 49. An Z, et al. Sox2 and Klf4 as the functional core in pluripotency induction without exogenous Oct4. Cell Rep. 2019;29(7):1986–2000. doi: 10.1016/j.celrep.2019.10.026.
- 50. Maret-Ouda J, et al. Gastroesophageal reflux disease: a review. JAMA. 2020;324(24):2536–2547. doi: 10.1001/jama.2020.21360.
- 51. Syed A, et al. The relationship between eosinophilic esophagitis and esophageal cancer. Dis Esophagus. 2017;30(7):1–5. doi: 10.1093/dote/dox050.
- 52. Savarino E, et al. Achalasia. Nat Rev Dis Primers. 2022;8(1):28. doi: 10.1038/s41572-022-00356-8.
- 53. Zhang Y, et al. One stomach, two subtypes of carcinoma-the differences between distal and proximal gastric cancer. Gastroenterol Rep (Oxf). 2021;9(6):489–504. doi: 10.1093/gastro/goab050.
- 54. Lundell LR, et al. Endoscopic assessment of oesophagitis: clinical and functional correlates and further validation of the Los Angeles classification. Gut. 1999;45(2):172–180. doi: 10.1136/gut.45.2.172.
- 55. Hao Y, et al. Integrated analysis of multimodal single-cell data. Cell. 2021;184(13):3573–3587. doi: 10.1016/j.cell.2021.04.048.
- 56. Tirosh I, et al. Dissecting the multicellular ecosystem of metastatic melanoma by single-cell RNA-Seq. Science. 2016;352(6282):189–196. doi: 10.1126/science.aad0501.
- 57. Chen Y, et al. From reads to genes to pathways: differential expression analysis of RNA-Seq experiments using Rsubread and the edgeR quasi-likelihood pipeline. F1000Res. 2016;5:1438. doi: 10.12688/f1000research.8987.1.
- 58. McCarthy DJ, et al. Differential expression analysis of multifactor RNA-Seq experiments with respect to biological variation. Nucleic Acids Res. 2012;40(10):4288–4297. doi: 10.1093/nar/gks042.
- 59. Robinson MD, et al. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26(1):139–140. doi: 10.1093/bioinformatics/btp616.
- 60. Wu T, et al. clusterProfiler 4.0: a universal enrichment tool for interpreting omics data. Innovation (Camb). 2021;2(3):100141. doi: 10.1016/j.xinn.2021.100141.
- 61. Yu G, et al. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS. 2012;16(5):284–287. doi: 10.1089/omi.2011.0118.
- 62. Zhou Y, et al. Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Nat Commun. 2019;10(1):1523. doi: 10.1038/s41467-019-09234-6.
- 63. Kuleshov MV, et al. EnrichR: a comprehensive gene set enrichment analysis web server 2016 update. Nucleic Acids Res. 2016;44(W1):W90–W97. doi: 10.1093/nar/gkw377.
- 64. Xie Z, et al. Gene set knowledge discovery with EnrichR. Curr Protoc. 2021;1(3):e90. doi: 10.1002/cpz1.90.
- 65. Murtagh F, Legendre P. Ward’s hierarchical agglomerative clustering method: which algorithms implement Ward’s criterion? J Classif. 2014;31(3):274–295. doi: 10.1007/s00357-014-9161-z.
- 66. Ward JH. Hierarchical grouping to optimize an objective function. J Am Stat Assoc. 1963;58(301):236–244. doi: 10.1080/01621459.1963.10500845.
- 67. Gu Z, et al. Complex heatmaps reveal patterns and correlations in multidimensional genomic data. Bioinformatics. 2016;32(18):2847–2849. doi: 10.1093/bioinformatics/btw313.
- 68. Gu Z, Hübschmann D. Make interactive complex heatmaps in R. Bioinformatics. 2022;38(5):1460–1462. doi: 10.1093/bioinformatics/btab806.
- 69. Wickham H, ed. ggplot2. Springer-Verlag; 2016.
- 70. Szklarczyk D, et al. The STRING database in 2021: customizable protein-protein networks, and functional characterization of user-uploaded gene/measurement sets. Nucleic Acids Res. 2021;49(D1):D605–D612. doi: 10.1093/nar/gkaa1074.
- 71. Qiu X, et al. Single-cell mRNA quantification and differential analysis with Census. Nat Methods. 2017;14(3):309–315. doi: 10.1038/nmeth.4150.
- 72. Qiu X, et al. Reversed graph embedding resolves complex single-cell trajectories. Nat Methods. 2017;14(10):979–982. doi: 10.1038/nmeth.4402.
- 73. Trapnell C, et al. The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells. Nat Biotechnol. 2014;32(4):381–386. doi: 10.1038/nbt.2859.
- 74. Schindelin J, et al. Fiji: an open-source platform for biological-image analysis. Nat Methods. 2012;9(7):676–682. doi: 10.1038/nmeth.2019.
Associated Data
Data Availability Statement
All raw sequencing files and processed barcode and feature matrices used within the article are deposited in NCBI’s GEO database under accession code GSE218607. All supporting analytic code is available at the “scRNA-Human_EoE_Esophagus” repository hosted by the Tetreault Lab on GitHub (https://github.com/Tetreault-Lab/Tetreault-scRNA-Human_EoE_Esophagus-2023; commit ID f51f957). All other supporting data are available within the article, supplement, or Supporting Data Values or from the corresponding author upon reasonable request.