Abstract
Colorectal cancer (CRC) is responsible for over 900,000 annual deaths worldwide. Emerging evidence supports pro‐carcinogenic bacteria in the colonic microbiome are at least promotional in CRC development and may be causal. We previously showed toxigenic C. difficile from human CRC‐associated bacterial biofilms accelerates tumorigenesis in Apc Min/+ mice, both in specific pathogen‐free mice and in gnotobiotic mice colonized with a defined consortium of bacteria. To further understand host–microbe interactions during colonic tumorigenesis, we combined single‐cell RNA‐sequencing (scRNA‐seq), spatial transcriptomics, and immunofluorescence to define the molecular spatial organization of colonic dysplasia in our consortium model with or without C. difficile. Our data show a striking bipartite regulation of Deleted in Malignant Brain Tumors 1 (DMBT1) in the inflamed versus dysplastic colon. From scRNA‐seq, differential gene expression analysis of normal absorptive colonocytes at 2 weeks postinoculation showed DMBT1 upregulated by C. difficile compared to colonocytes from mice without C. difficile exposure. In contrast, our spatial transcriptomic analysis showed DMBT1 dramatically downregulated in dysplastic foci compared with normal‐adjacent tissue. We further integrated our datasets to generate custom colonic dysplasia scores and ligand‐receptor mapping. Validation with immunofluorescence showed DMBT1 protein downregulated in dysplastic foci from three mouse models of colonic tumorigenesis and in adenomatous dysplasia from human samples. Finally, we used mouse and human organoids to implicate WNT signaling in the downregulation of DMBT1 mRNA and protein. Together, our data reveal cell type‐specific regulation of DMBT1, a potential mechanistic link between bacteria and colonic tumorigenesis. Published 2025. This article is a U.S. Government work and is in the public domain in the USA. The Journal of Pathology published by John Wiley & Sons Ltd on behalf of The Pathological Society of Great Britain and Ireland.
Keywords: colorectal, colorectal adenocarcinoma, microbiome, tumorigenesis
Introduction
Among cancer‐related deaths worldwide, colorectal cancer (CRC) is the second most common cause [1]. The survival rate for early‐stage CRC is 90% compared to 14% for stage IV [2, 3]. Modifiable risk factors for CRC are well defined, but the effect of bacteria on colorectal carcinogenesis is not well understood. The list of bacteria associated with CRC, including Fusobacterium nucleatum and pks+ Escherichia coli, is growing [4, 5, 6, 7]. Suggested mechanisms for bacteria‐mediated tumorigenesis include damaging DNA, activating Wingless (WNT) signaling, blunting antitumor immunity, and promoting biofilm formation [8]. Bacterial colonization of mouse models for interrogating mechanisms of CRC tumorigenesis has included enterotoxigenic Bacteroides fragilis, pks+ E. coli, F. nucleatum, and bacterial biofilms [4, 6, 9, 10, 11]. However, not all bacterial inflammation is sufficient to induce tumorigenesis [12].
Previously, we showed that C. difficile from human CRC accelerates tumorigenesis in the distal colon of genetically susceptible Apc Min/+ mice [13]. These mice contain a truncation mutation in Apc leading to multiple intestinal neoplasia (Min). They develop 40–50 small intestinal adenomas, but few if any colonic adenomas. The mechanisms by which C. difficile increases tumor formation in the distal colon appear to involve WNT signaling, reactive oxygen species, and IL‐17 [13]. Here, we identified an epithelium‐specific factor, the host protein Deleted in Malignant Brain Tumors 1 (DMBT1). DMBT1 is a glycoprotein involved in mucosal immune defense and epithelial differentiation. It represents a potential molecular link between bacteria and cancer because of its dual functions in innate immunity and regulation of epithelial maturation [14]. DMBT1 is a 340 kDa glycoprotein with 13 homologous scavenger receptor cysteine‐rich domains that bind host immune factors, including immunoglobulin and complement [15, 16, 17, 18, 19, 20, 21, 22]. DMBT1 binds lipopolysaccharide, enabling it to agglutinate and exclude bacteria from the epithelium [17, 23]. From bulk tissue experiments, DMBT1 mRNA and protein have variable expression in gastric and colon carcinomas [19, 24, 25, 26, 27]. It remains unknown how DMBT1 affects CRC tumorigenesis.
Here, we created a multiomic atlas to characterize C. difficile‐associated colorectal tumorigenesis. We identified the cell type‐specific differential regulation of mouse Dmbt1 in colonic dysplasia and demonstrated the generalizability of this phenotype in human CRC.
Materials and methods
Mice
Animal experiments were approved by animal care at Johns Hopkins University, protocol MO23M87. C57Bl/6 mice of Apc Min/+ (Δ716 Apc) genotype were bred germ‐free as previously described [13]. At age 8 weeks, males were inoculated by single gavage of the 29‐member bacterial consortium described previously along with 104 spores of C. difficile strain Cd_3728T [13]. The control group was given a single gavage of the same consortium without C. difficile.
scRNA‐seq
Mouse colons were harvested and washed in cold phosphate‐buffered saline (PBS). The distal two‐thirds were agitated in chelation buffer at 4 °C followed by mechanical and enzymatic dissociation using cold‐activated protease/DNase. Single cells were encapsulated into aqueous droplets using lysis buffer and polyT‐oligo beads and inDrop [28]. Purification, cDNA library polymerization, and indexing were performed as described previously [13, 29]. Sequencing was performed using a NovaSeq6000 (Illumina, San Diego, CA, USA). Raw sequences were aligned to genome GRCm38.85 to generate count matrices using the dropEst pipeline [30]. Counts matrices were filtered using Dropkick [31]. Preprocessing, dimensionality reduction, clustering, and correlation visualization were performed in Python (v. 3.9) using Scanpy [32]. Differential gene expression and gene set enrichment analysis were performed in R (R v. 4.4.2, Vienna, Austria) using Scanpy and GSEApy, respectively [33].
Spatial transcriptomics
Mouse colons were harvested, fixed, and embedded using standard techniques. Tissue sections (5‐μm) were processed using the GeoMx RNA protocol. Standard markers provided in the kit included: CD45‐594, PanCK‐532, and SYTO‐13 (NanoString Technologies, Seattle, WA, USA; ThermoFisher Scientific, Waltham, MA, USA). Regions of interest were based on normal versus dysplastic morphology. Segments were created with an epithelium mask based on pan‐cytokeratin‐positive staining and a lamina propria mask based on pan‐cytokeratin‐negative/CD45‐positive areas. Segments were separate samples for sequencing. GeoMx Digital Spatial Profiling was performed using the manufacturer's protocols and Mouse Whole Transcriptome Atlas [34] (NanoString Technologies). Barcodes were sequenced using a NovaSeq6000, and sequencing files were compiled with bcl2fastq (Illumina), demultiplexed, and converted to Digital Count Conversion files by NanoString (GeoMx DnD pipeline v2.3.4) [35]. For each no‐template polymerase chain reaction (PCR) control, a minimum of 1,000 raw reads was required to pass filtering, and each probe was required to pass the Grubbs outlier test in at least 20% of segments (α = 0.01), which left 16,471 targets. We removed segments containing less than 5% of targets above thresholds, which left 172 segments. Expression was normalized using the 3rd‐quartile method. Differential expression was analyzed with linear mixed‐effect models, accounting for inter‐mouse heterogeneity.
Immunofluorescence
Mouse colons from the azoxymethane/dextran sodium sulfate‐induced cancer model and the transgenic Lrig1 CreER/+ ;Apc fl/+ adenoma model were obtained from previously published work [36, 37]. The human colon tissue microarray was obtained from the University of Virginia Pathology Department Cooperative Human Tissue Network, approved through IRB‐exemption #070166. We used anti‐β‐catenin (ref. 12F7D1, 1:400; Vanderbilt Antibody Protein Resource, Nashville, TN, USA), anti‐Dmbt1 (ref. HPA040778, 1:400; Sigma‐Aldrich, Saint Louis, MO, USA), goat anti‐mouse, and donkey anti‐rabbit secondary antibodies (ThermoFisher Scientific).
Murine organoids
Lrig1 CreER/+ ;Apc fl/fl colonic organoids were treated with 1 μm (Z)‐4‐Hydroxytamoxifen (Sigma‐Aldrich) for 72 h in Mouse IntestiCult (STEMCELL Technologies, Vancouver, BC, Canada). Three days posttreatment, organoids were spilt using TrypLE (Gibco, ThermoFisher Scientific, Waltham, MA, USA) for 3 min at 37 °C and plated at 300 cells/μl in 30 μl Matrigel (Corning, Corning, NY, USA) domes [38]. After 7 days, organoids were harvested for RT‐qPCR, viability, and immunoblotting.
RT‐qPCR
Total RNA was extracted using 1 ml TRIzol (Invitrogen, ThermoFisher Scientific, Waltham, MA, USA) for 10 min. RNA clean‐up was performed with Direct‐zol MiniPrep (Zymo Research, Irvine, CA, USA). cDNA was generated with Verso cDNA (ThermoFisher Scientific). TaqMan Fast advanced master mix and probes (ThermoFisher Scientific) were used in a QuantStudio thermocycler (Applied Biosystems, Foster City, CA, USA) following the manufacturer's protocol. Primer probe sets used were: ACTB (Hs01060665_g1), NKD1 (Hs01548773_m1), AXIN2 (Hs00610344_m1), and DMBT1 (Hs01069306_m1).
Immunoblotting
Five 30‐μl domes per condition were combined in cell dissociation solution (2 mm EDTA in PBS) then washed in PBS. RIPA with 0.1% SDS was added to organoids. Samples were lysed for 30 min before sonication. BCA protein assays (ThermoFisher Scientific) were performed following the manufacturer's protocol. SDS‐PAGE using NuPage LDS sample buffer was run in 3%–8% Tris‐acetate gels (Invitrogen). Proteins were transferred to nitrocellulose membranes overnight at 4 °C. Membranes were blocked in 5% milk in TBS‐T (0.1% Tween‐20) and probed with anti‐DMBT1 primary antibody (ref. SAB2108362, 1:600; Sigma‐Aldrich) and anti‐rabbit HRP antibody (Invitrogen). Membranes were developed using Pierce ECL (ThermoFisher Scientific).
Human organoids
Normal colonic organoids were derived under approved IRB #182165. Organoids were spilt into single‐cell suspension using TrypLE (Gibco) for 3 min at 37 °C and plated at 700 cells/μl in 30 μl Matrigel domes. Organoids were cultured in Human IntestiCult (STEMCELL Technologies) medium until domes were full. Media was replaced with 80% DMEM/F12: 20% IntestiCult, including CHIR99021 (Tocri, Bristol, UK) diluted in DMSO (Fisher, ThermoFisher Scientific, Waltham, MA, USA) at 10 μm for 72 h. Organoids were washed in PBS, and RNA was extracted.
Statistical analysis and data availability
All statistical analyses were performed using Python (v. 3.9). For scRNA‐seq differential expression analysis, we used a Wilcoxon test with Benjamini–Hochberg correction in Scanpy. Gene set enrichment was performed with GSEApy using random permutation (n = 1,000) and Bonferroni correction. Ligand‐receptor inferences were based on Wilcoxon tests using CellPhoneDB [39]. Statistical significance thresholds were p ≤ 0.05 and false discovery rate (FDR) ≤0.05 for all tests. Visualizations were created with listed packages, seaborn for violin plots, and ggplot2 for volcano plots [40, 41]. Institution‐supported artificial intelligence based on GPT‐4 (OpenAI) was used to troubleshoot errors (aichat.app.vumc.org). Scripts were verified by experienced users. Codes are available upon request. Sequence files and datasets are available: Geo accession GSE270726 and GSE270745.
Results
C. difficile‐colonized ApcMin /+ mice had expansion of unique cell types and states
Our prior work demonstrated that C. difficile from human CRC‐associated biofilms exacerbates adenoma formation in the distal colon of Apc Min/+ mice [13]. To identify the mechanism(s) driving tumorigenesis, we performed two transcriptomic experiments on colonic tissue from gnotobiotic Apc Min/+ mice. Germ‐free Apc Min/+ mice were given a single oral gavage of a 29‐bacteria consortium plus toxigenic C. difficile as described previously (Figure 1A) [13]. Control mice were given the same consortium but without C. difficile. Two weeks after inoculation, colonic tissue was collected for scRNA‐seq or formalin‐fixed and paraffin‐embedded for spatial transcriptomics (Figure 1B). Dimensionality reduction illustrated the relative similarities of clusters and cell type assignments using Uniform Manifold Approximation and Projection (UMAP) (Figure 1C). Unsupervised clustering was also performed (supplementary material, Figure S1A). We applied Cellular Trajectory Reconstruction Analysis using gene Counts and Expression (CytoTRACE) to predict the stem‐like potential of clusters based on the number of different genes expressed per cell (supplementary material, Figure S1B) [42]. The score shows, in addition to the stem and transit amplifying compartment (TAC) cells, the absorptive precursor 2 (Abs_Pre2) cluster contained cells predicted to have stem‐like capabilities. Similarly, differentiated cell types, like goblet cell types and mature absorptive cells (Abs1), were predicted to have low stem‐like potential with lower CytoTRACE scores (supplementary material, Figure S1B and Table S1). To enable direct comparisons between C. difficile‐inoculated mice and controls, we merged similar clusters irrespective of inoculation. Progenitor, goblet, deep crypt secretory, and stem clusters comprised a greater proportion of cells from C. difficile‐inoculated mice compared to controls (supplementary material, Figure S1C–E). Agr2‐high goblet cells were present mostly in controls, whereas Clca1‐high goblet cells were present mostly in C. difficile‐inoculated mice. Canonical marker genes were used for making cell type assignments (supplementary material, Figure 2A). Pearson correlations between clusters further supported the similarities from dimensionality reduction (supplementary material, Figure 2B). The distribution of cells from each condition and mouse demonstrated the relative contribution of each condition and validated that no individual mouse skewed the clustering (Figure 1D,E). Based on cell cycle‐specific genes (supplementary material, Table S2), the UMAP was colored to show the proliferating cells (Figure 1F).
Figure 1.

Cell type identification and analysis of scRNA‐seq from colonic epithelial cells in C. difficile‐colonised Apc Min/+ mice demonstrated unique cell states. (A) Schematic of experimental design: 8‐week‐old germ‐free Apc Min/+ mice were gavaged with C. difficile‐containing bacterial consortium or control consortium as previously published [13], and mice were euthanized at 10 weeks. (B) Epithelial crypts were dissociated from the colonic tissue for scRNA‐seq; n = 3 mice/group. Spatial transcriptomics was performed on formalin‐fixed, paraffin‐embedded whole colon; separate experiment with n = 3 mice/group. (C) UMAP plot with cell type assignments for clusters based on canonical mouse gene markers (supplementary material, Figure S2). (D–F) UMAP plots colored by inoculation (C. difficile or control), mouse, or cell cycle phase. (G–K) UMAP plots colored by scaled gene expression (a.u. = arbitrary units) of canonical gene markers for different cell states: fetal reversion (Marcksl1), autophagy (Sqstm1), senescence (Cdkn1), intestinal plasticity (Prom1), and regenerative stemness (Anxa1).
Figure 2.

Differential gene expression analysis of absorptive colonocytes from scRNA‐seq indicated increased immune activity and upregulation of Dmbt1 expression in C. difficile‐inoculated mice. (A) Violin plot of Dmbt1 expression from C. difficile‐inoculated mice versus control within each merged cell type, scaled expression, arbitrary units (a.u.). (B) Volcano plot of differentially expressed genes from absorptive cells based on inoculation. Labeled genes were among the most differentially expressed. (C) We used computational gene set enrichment analysis to determine whether the predefined set of genes in Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways showed statistically significant concordance in absorptive cells. Listed pathways contain genes that were differentially upregulated in C. difficile‐inoculated versus control mice; FDR = false discovery rate. (D) Gene enrichment plot of absorptive cells for the KEGG cell adhesion molecules pathway showed the degree to which upregulated genes correlated with C. difficile‐inoculation (Pos) compared to control (Neg) and the ranked metric quantification of the signal‐to‐noise ratio. NES = normalized enrichment score.
To characterize the cell states present in these clusters, we colored the UMAP plot to show expression for Marcksl1, Sqstm1, Cdkn1a, Prom1, and Anxa1, which represent fetal/embryonic reversion, autophagy, senescence, dysplastic priming, and regenerative stemness, respectively (Figure 1G–K) [43, 44, 45, 46, 47]. Interestingly, a subset of absorptive cells (Abs3, cluster 21 in Figure 1C) from C. difficile‐inoculated mice was enriched with these marker genes. This pattern suggests, perhaps, that this cluster simultaneously is responding to injury and is susceptible to dysplastic transformation [48]. We investigated whether these cells exhibited signs of gastric metaplasia based on the findings of Chen et al, who identified a gastric metaplasia signature when characterizing sessile serrated lesions associated with bacteria‐induced injury [49]. The Abs3 cluster scored highest compared with other clusters (supplementary material, Figure S2C), but was driven by expression of Axna1 and Gsdmd without a strong influence from other genes (supplementary material, Figure S2D).
Differential gene expression analysis of absorptive colonocytes
To identify genes contributing to cell type and cell state differences, we performed differential gene expression analysis comparing the inoculation groups within each cell type (Figure 2A, and supplementary material, Figure S3A–H; Table S3). Dmbt1 was among the most upregulated genes in the C. difficile‐inoculated mice. We focused on Dmbt1 as a potential factor in bacteria‐induced dysplasia because of its dual role as a pathogen‐associated molecular pattern receptor and an inducer of epithelial differentiation [14]. Dmbt1 expression was increased in C. difficile‐inoculated mice compared to controls in all cell types but was most highly expressed in absorptive and progenitor cells (Figure 2A,B and supplementary material, Tables S3 and S4). Gene set analysis of absorptive cells showed significant enrichment of cell adhesion and immune response (Figure 2C,D and supplementary material, Table S5). These pathways are implicated in colonic dysplasia [49]. The gene set enrichment analysis for absorptive precursors and deep crypt secretory cells comparing C. difficile‐inoculated mice versus controls showed similarly enriched pathways (supplementary material, Figure S3A–D). For EEC, goblet, immune, progenitor, stem, and tuft cell types, there were no significantly enriched pathways (FDR ≤0.05) in C. difficile‐inoculated mice. Despite little change in Dmbt1 between C. difficile‐inoculated mice versus controls among the goblet cell subtypes, we wanted to understand the potential contribution of goblet cell types to early tumorigenic events. We compared Goblet Clca1‐Hi to Goblet Agr2‐Hi cells and identified similar immune response pathways, but also cellular senescence and viral carcinogenesis (supplementary material, Figure S3E–G). Clca1‐high goblet cells have significantly elevated levels of canonical goblet cell genes (Mxd1, Zg16, and Tff3) compared with Agr2‐high goblet cells (supplementary material, Figure S3H and supplementary material, Table S6) [50]. The high expression of Duox2 in the Clca1‐high goblet cells suggests their possible role as sentinel goblet cells [51].
DMBT1 was significantly downregulated in colonic dysplasia
The earliest observation of dysplastic foci in our prior studies led us to the specific timepoint of 2‐weeks postinoculation for this work [13]. Dysplastic cells were not identifiable from scRNA‐seq alone, because they were rare compared to the stem and proliferative cells. To identify transcriptional changes in C. difficile‐associated dysplasia, we performed spatial transcriptomics, with particular attention to dysplastic foci and normal adjacent tissue (supplementary material, Figure S4A,B). For this experiment, we repeated the mouse experiment in Figure 1 and compared C. difficile‐inoculated mice versus controls. Only mice who received the 29‐bacteria consortium plus C. difficile developed dysplastic foci, and the incidence of dysplasia was the same as we reported previously [13]. The epithelia and lamina propria were collected as separate segments within each region of interest (supplementary material, Figure S4A–C). Dimensionality reduction was performed on all segments and displayed by UMAP plots (Figure 3A,B). These plots further demonstrated the similarities between segments in an unbiased manner (Figure 3A,B and supplementary material, Table S7). Gene expression analysis comparing dysplasia to normal tissue revealed upregulation of tumor‐associated genes in lamina propria: Cxcl14, Igf1, and Mmp13 (Figure 3C and supplementary material, Table S8) [52, 53, 54]. Increased WNT signaling was evident by Nkd1 and Axin2 upregulation in dysplastic foci compared to normal epithelial tissue (Figure 3D). Dmbt1 was one of the most differentially expressed genes, but was downregulated in dysplastic foci compared with normal colonocytes (Figure 3D), which surprised us because of its upregulation in scRNA‐seq (Figure 2). Differential expression between C. difficile‐inoculated versus control mice showed upregulation of inflammatory genes, like Lcn2 from lamina propria segments (Figure 3E and supplementary material, Table S8), and reactive‐oxygen species‐producing genes, like Nos2 from epithelia (Figure 3F). The spatial transcriptomics data supported our scRNA‐seq data, but included a set of genes specifically dysregulated in dysplastic foci.
Figure 3.

Spatial transcriptomic analysis of C. difficile‐inoculated mice versus controls. (A and B) Dimensionality reduction with UMAP demonstrated segregation based on gene expression for all segments. Each point represents an individual segment, and the shape of each point represents the tissue in which the segment was chosen. The color of the point signifies the mouse of origin (A) or the inoculation received (B). (C and D) Volcano plots of differential gene expression between normal and dysplastic tissue for (C) lamina propria and (D) epithelium; top genes with high absolute log2 fold‐changes are labeled. (E and F) Volcano plots of differential gene expression compared segments from (E) lamina propria and (F) epithelium of control mice versus C. difficile‐colonized mice.
Novel integration of spatial transcriptomics and scRNA‐seq revealed potential pathways to C. difficile‐associated dysplasia
To integrate our two transcriptomic datasets, we leveraged their individual strengths: the ability to identify dysplastic foci with spatial context versus higher resolution expression data in the scRNA‐seq dataset. We began with significantly upregulated genes (log2 fold‐change >1, adj. p < 0.05) in dysplastic epithelia versus normal from spatial transcriptomics (Figure 3D and supplementary material, Figure S5A). We derived a dysplasia score from this gene set and applied it to the scRNA‐seq data and observed the highest scoring cells in the stem cell cluster (supplementary material, Figure 5B,C). We refined the list to include only 12 genes with increased expression in the stem cell cluster: Sox4, Axin2, Ifitm3, Hes6, Prox1, Tubb5, Phlda1, Jun, Smoc2, Slc12a2, Ascl2, and Hopx. This revised dysplasia score enhanced the signal‐noise ratio and was highest in cells from C. difficile‐inoculated mice (Figure 4A,B). Interestingly, enteroendocrine and tuft cells also had high dysplasia scores (supplementary material, Table S9).
Figure 5.

Dmbt1 expression is downregulated in multiple mouse models of colonic tumorigenesis. (A) Volcano plot showing Dmbt1 among the most differentially regulated genes expressed by differentiated colonocytes from mice given protumorigenic slurry (magenta) versus nontumorigenic slurry (blue). Genes with the highest absolute log2 fold‐changes are labeled (supplementary material, Table S11). (B) Dmbt1 normalized gene expression from spatial transcriptomics demonstrated upregulation in epithelial cells from C. difficile‐inoculated mice but downregulation in dysplastic foci. Points are segments with bar and whiskers as mean and standard error of the mean. Asterisks (*) indicate a significant difference (p < 0.05 from Wilcoxon rank sum test with continuity correction) (supplementary material, Table S11). (C) Immunofluorescence microscopy shows Dmbt1 protein expression is reduced in dysplastic foci, as indicated by altered β‐catenin distribution (decreased membranous, increased cytoplasmic and nuclear localization) compared to adjacent normal tissue in three different mouse models of colonic neoplasia. Dmbt1 was reduced in 100% of dysplastic foci (n = 57) from 11 mice. (D) Organoid formation assay using Lrig1 CreER/+ ;Apc fl/fl organoids treated with 1 μm 4‐hydroxytamoxifen (4OH‐Tam) versus vehicle control at 7 days posttreatment, shown using phase contrast microscopy. (E) Cell viability based on luminescence (RLU = relative light units). (F) RT‐qPCR analysis of Dmbt1 expression in 4OH‐Tam‐treated Lrig1 CreER/+ ;Apc fl/fl organoids compared to vehicle control where relative quantification (RQ) is 2−ΔΔCt, and Ct is the cycle threshold. Expression is quantified relative to the reference transcript Actb. These plots represent averages from 43 independent wells of organoids from two different animals. (G) Immunoblotting analysis of vehicle control versus 4OH‐Tam‐treated Lrig1 CreER/+ ;Apc fl/fl organoids.
Figure 4.

Integration of spatial and single‐cell transcriptomics revealed potential pathways to C. difficile‐associated dysplasia. (A) UMAP plot of scRNA‐seq dataset representing the 12‐gene dysplasia score (a.u. = arbitrary units) generated from the spatial transcriptomics dataset. (B) Violin plot showing dysplasia score distribution for each inoculation and cell type. (C) Scatterplot and Spearman correlation between scaled Dmbt1 gene expression and the dysplasia score for all epithelial cells from scRNA‐seq data. Shading around the trendlines indicates 95% confidence intervals. C. difficile Spearman = −0.3415, p‐value = 1.958e‐113; Control Spearman = −0.3003, p‐value = 3.155e‐90. (D) Heatmap showing scRNA‐seq expression for the 12 genes used to create the dysplasia score. Gastric precancer genes are shown with relatively no expression. (E and F) Heatmaps displaying the total number of significant ligand‐receptor interactions inferred by CellPhoneDB. (G) Venn diagrams illustrating the intersection of C. difficile‐specific interaction partners from scRNA‐seq and spatial transcriptomics datasets. (H) Chord diagrams illustrating the predicted interactions with Sema7a by cell type in C. difficile‐colonized mice and control mice. No significant Sema7a interactions were predicted in control mice.
In the scRNA‐seq dataset, we found higher dysplasia scores correlated with lower Dmbt1 expression (Figure 4C). The differential expression of these 12 genes is shown in Figure 4D with Sox4, Ifitm3, and Smoc2 among the most upregulated genes in stem cells from C. difficile‐inoculated mice. These genes are implicated in oncogenic signaling in colorectal cancer [55, 56, 57]. DMBT1 is a marker for gastric precancerous lesions and is upregulated during spasmolytic polypeptide‐expressing metaplasia (SPEM) [19, 58]. However, we did not observe expression of markers for SPEM and gastric dysplasia: Tacstd2, Aqp5, and Ceacam5 (Figure 4D). Dmbt1 appears to be upregulated by C. difficile but downregulated in dysplasia, representing an interesting confluence of our transcriptomic datasets.
To determine if inferred ligand‐receptor pairs from scRNA‐seq data could be validated with spatial information to identify novel mechanisms with higher confidence, we leveraged the high‐resolution scRNA‐seq data. Using CellPhoneDB, we compared the sum of interactions between C. difficile‐inoculated mice versus controls (Figure 4E,F) [39]. From these interactions, we identified 84 genes unique to the C. difficile‐inoculated mice that were not significant in controls (Figure 4G red circle and supplementary material, Table S10). For validation, we searched for these interaction partners in our spatial transcriptomics epithelial segments (supplementary material, Table S8). We identified Semaphorin 7a (Sema7a) and Tumor Necrosis Factor Receptor Superfamily, member 1b (Tnfrsf1b) as significant genes in C. difficile‐inoculated mice from both scRNA‐seq inferred ligand‐receptor pairs and spatial datasets (Figure 4G blue circle). The only interacting partners for Sema7a predicted from CellPhoneDB were integrin α1β1 and plexin C1 (Plxnc1). These interactions were present in C. difficile‐inoculated mice predictions but not controls (Figure 4H). This validation approach suggested Sema7a is important for the host response to C. difficile during tumorigenesis. Although we did not find significant differential expression of Sema7a‐binding partners Itgb1, Itga1, or Plxnc1 in the spatial transcriptomics dataset, our integration method has wide applicability for future work. The interaction of Tnfrsf1b with its ligand Tumor Necrosis Factor alpha is likely to involve the cellular immune response, and our ongoing studies investigate mechanistic contributions of the immune compartment.
DMBT1 expression was downregulated in multiple mouse models of colonic tumorigenesis
To understand and validate the dysplasia‐specific Dmbt1 downregulation phenotype, we performed post hoc analysis of scRNA‐seq data from our previously published work using protumorigenic slurry‐inoculated mice versus nontumorigenic slurry‐inoculated mice [13]. The protumorigenic slurry contained C. difficile and led to significantly increased Dmbt1 gene expression compared with nontumorigenic slurry (not containing C. difficile) in differentiated colonocytes (Figure 5A and supplementary material, Table S11). This upregulation of Dmbt1 from our C. difficile biofilm slurry model supports our findings from the 29‐bacteria consortium with C. difficile (Figure 2A). Looking at the spatial transcriptomics data in the consortium model to examine Dmbt1 expression, we found that dysplastic segments from C. difficile‐inoculated mice expressed less Dmbt1 than normal tissue segments from either group (Figure 5B). Interestingly, the expression of Dmbt1 was higher in normal segments from C. difficile‐inoculated mice compared with normal segments from control mice (Figure 5B and supplementary material, Figure S6A). Dmbt1 upregulation in nondysplastic tissue from C. difficile‐inoculated mice compared with controls is consistent with other gastrointestinal infections [23, 58].
We next investigated Dmbt1 protein expression using immunofluorescence in two additional mouse models of colonic tumorigenesis: the azoxymethane/dextran sodium sulfate‐induced colitis‐associated cancer model and transgenic Lrig1 CreER/+ ;Apc fl/+ mice [36, 37]. In each model, Dmbt1 was markedly reduced and inversely correlated with β‐catenin, used here to mark dysplasia with increased WNT signaling (Figure 5C). These data illustrate the downregulation of Dmbt1 as a common phenotype in murine colonic dysplasia, suggesting that reduced Dmbt1 expression may be a generalizable characteristic of colonic dysplasia, not unique to C. difficile‐associated tumorigenesis.
To ascertain the mechanism of Dmbt1 downregulation in dysplastic foci, we derived colonic organoids from Lrig1 CreER/+ ;Apc fl/fl mice. Apc loss of heterozygosity occurs concomitantly with dysplasia in Lrig1 CreER/+ ;Apc fl/+ mouse adenomas [37], so we hypothesized that biallelic truncation of Apc and subsequent WNT signaling activation would lead to Dmbt1 downregulation. We induced Apc recombination in colonic organoids using 4‐hydroxytamoxifen (4OH‐Tam) and showed increased organoid formation and viability compared to untreated controls (Figure 5D,E). Next, we showed 4OH‐Tam lowered both Dmbt1 expression using RT‐qPCR and Dmbt1 protein using immunoblotting (Figure 5F,G).
DMBT1 protein expression is lost in human colonic dysplasia
To determine if DMBT1 loss occurs in human colonic tumorigenesis, we examined adenomas from a published multiomic atlas of precancerous colon polyps [49]. These data show DMBT1 gene expression is upregulated in differentiated crypt‐top cells and downregulated in adenoma‐specific cells (Figure 6A–C and supplementary material, Table S12). While some of the adenoma‐specific cells have increased DMBT1 expression in cells from tubular adenomas (TA), the tubulo‐villous adenomas (TVA) with high‐grade dysplasia had the lowest expression of DMBT1 (supplementary material, Figure S6B–D and Table S12). When comparing the expression of DMBT1 in crypt‐top (CT) versus adenoma‐specific cells to canonical WNT signaling target genes (Figure 6C), we observed an inverse correlation similar to the mouse colonic dysplastic foci with increased β‐catenin staining (Figure 5C).
Figure 6.

DMBT1 expression was decreased in human colonic dysplasia. (A) UMAP plot of scRNA‐seq data from human colonic precancerous polyps with coloring indicating cell types, EEC = enteroendocrine. (B) Violin plot showing DMBT1 scaled expression (a.u. = arbitrary units) in different cell types. Asterisks (*) indicate significant difference from all other cell types (p < 0.05 Kruskal–Wallis with Dunn's test, supplementary material, Table S12). (C) Heatmap of DMBT1 mean expression alongside canonical WNT signaling target genes indicated an inverse relationship between WNT activity and DMBT1 expression. (D) DMBT1 immunostaining in well‐differentiated, low‐grade dysplastic crypts (arrowhead) adjacent to high‐grade dysplasia (arrow) within a human colon adenocarcinoma. (E) Quantification of DMBT1 immunofluorescence staining in a human tissue microarray with specimens spanning the progression from normal adjacent tissue to metastatic colorectal cancer. The average fluorescence (raw integrated density) for 3 or 4 tissue cores per patient sample is shown as a single point; n = 3–15 patients/group (*p < 0.05, **p < 0.01 from Kruskal–Wallis with Dunn's test). Blue lines are means, and whiskers are standard error of the mean. (F) RT‐qPCR analysis performed on normal human colonic organoids. Each point represents the mean of three technical replicates from seven independent wells pooled together. NKD1 and AXIN2 are canonical WNT signaling target genes. Relative quantification is equivalent to 2−ΔΔCt, and Ct is the cycle threshold. Expression is quantified relative to the reference transcript ACTB, normalized to vehicle control, and scaled to min–max of 0 to 1 (*p < 0.05 from Wilcoxon Rank Sum Test). CHIR = CHIR99021.
To investigate DMBT1 protein expression, we performed immunofluorescence microscopy on a human tissue microarray containing diverticulosis, ulcerative colitis, normal‐adjacent to adenocarcinomas, tubular adenomas, tubulo‐villous adenomas, adenocarcinoma, and metastatic carcinoma (Figure 6D,E and supplementary material, Table S12). These data revealed that average DMBT1 protein levels were highest in tubular and tubulo‐villous adenomas, but lowest in metastatic adenocarcinomas (Figure 6E and supplementary material, Figure S7). To determine whether WNT signaling downregulated DMBT1, we added CHIR99021 to normal human colonic organoids (Figure 6F). CHIR99021 is a specific inhibitor of GSK3β and potent activator of WNT target genes, such as NKD1 and AXIN2. The results showed CHIR99021‐mediated activation of WNT is associated with decreased DMBT1. The downregulation of DMBT1 in human colonic dysplasia appears to be regulated, at least in part, by WNT signaling in mouse and human colon.
Discussion
Comprehensive profiling of unique tumor models has elucidated generalizable phenotypes and valuable insights into CRC biology [49, 59]. This approach is especially revealing in the context of bacteria‐associated tumorigenesis [10, 60, 61]. Building on our discovery that C. difficile exacerbates colonic adenoma formation in Apc Min/+ mice, we integrated scRNA‐seq and spatial transcriptomics to understand the molecular underpinnings of this phenotype. We reveal how the intersection of single‐cell and spatial datasets identified the dysplasia‐specific downregulation of DMBT1 in colorectal precancerous adenomas and adenocarcinomas in mouse models and, importantly, human tissues.
Meticulous mapping of cell types and states in the gastrointestinal tract has uncovered fundamental insights into inflammation and cancer [62, 63, 64, 65, 66, 67, 68]. We observe the unique Abs3 cluster representing a subset of Krt20‐positive absorptive colonocytes from C. difficile‐inoculated mice with high expression of Prom1 and Anxa1 (Figure 1J,K). We interpret these cells to be differentiated and reactive while coincidentally expressing markers of dysplastic and predysplastic colonocytes. We initially thought this Abs3 cluster might represent the origin of sessile serrated lesions that develop through a gastric metaplasia state [49]. However, we did not see a significant overlap with that gene signature (supplementary material, Figure S2C). To be clear, the gastric metaplasia state in colonic precancerous sessile serrated lesions is different from the intestinal metaplasia, or SPEM, occurring in the stomach [69]. Our mouse model hinges on the germline mutation of one Apc allele, which already initiates increased WNT signaling. This baseline WNT activation favors the development of conventional adenoma cells over sessile serrated cells during dysplastic transformation [49, 70].
DMBT1 functions in immune exclusion of pathogens and regulation of the cell proliferation‐differentiation axis. We observed it specifically downregulated in colonic dysplasia, suggesting the loss of DMBT1 could be a link between infection, inflammation, and tumorigenesis [14]. DMBT1 is transcriptionally regulated via NF‐κB signaling downstream of Toll‐Like Receptor 4 in infection and STAT3 signaling in Crohn's colitis [23, 71]. Other genes, such as BVES and SATB2, are downregulated in colitis‐associated dysplasia and recently have been reported as potential biomarkers [72, 73]. Although loss of DMBT1 in other cancers is caused by deletion at the 10q25.3–26.1 locus [74], DMBT1 is deleted in only 5%–17% of CRC (Genomic Data Commons, National Cancer Institute, https://portal.gdc.cancer.gov/genes/ENSG00000187908, accessed 29 November 2023). DMBT1 has not been shown to function as a traditional tumor suppressor via DNA repair, interference with secondary messenger signaling, or modification of oncogenes/oncoproteins. Studies with Dmbt1 −/− mice have led to mixed results with respect to colonic inflammation [75, 76], but Dmbt1‐deficient mice have not been used in conventional colonic adenoma/cancer models. DMBT1 slows proliferation of mouse embryonic stem cells and initiates their differentiation into single‐layered epithelia [77]. In renal tubules, DMBT1 promotes the columnarization and terminal differentiation of intercalated cells [78]. DMBT1 overexpression reduces the onset of ovarian cancer and decreases metastasis of gallbladder cancer through promoting terminal differentiation [79, 80]. These data suggest DMBT1 may inhibit tumorigenesis by driving differentiation. In human CRC, loss of DMBT1 may be associated with decreased cancer‐related survival [24, 25]. The increased DMBT1 in adenomas compared with controls (Figure 6) introduces the possibility that DMBT1 is indirectly regulated by WNT target genes. Our data identify decreased Dmbt1 in dysplastic crypts of three different mouse models of tumorigenesis. Therefore, we suspect DMBT1 downregulation might provide a selective advantage for dysplastic transformation or progression to adenocarcinoma. Downregulation of DMBT1 might support colonic tumorigenesis by allowing dysplastic cells to maintain a more undifferentiated state, retaining their proliferative capacity. It is also possible that loss of DMBT1 is a bystander effect of other mechanisms driving tumorigenesis. Future studies are needed to determine how this mucin‐like glycoprotein might function to limit CRC tumorigenesis.
Author contributions statement
EHG, CNH, DBL, MJS, KSL, CLS, RJC, JLD and NOM contributed to project conceptualization. EHG, SRK, MER, HML, CNH, AJS, XL, DBL, JLD and NOM contributed to methodology and experimental design. EHG, SRK, MER, HML, HK, CNH, HD, AJS, XL, JLD and NOM performed experiments and data collection. EHG, SRK, MER, AJS, MKW, QL, JLD and NOM performed data validation. EHG, SRK, MER, HK, CNH, JLD and NOM contributed to data visualization. EHG, SRK, MER, HK, CNH, AJS, XL, QL, KSL, CLS, RJC, JLD and NOM performed data analysis and interpretation of results. MJS, QL, KSL, CLS, RJC, JLD and NOM acquired funding. DBL, MJS, QL, KSL, CLS, RJC, JLD and NOM provided supervision and coordination of this work. EHG, JLD, and NOM drafted the article. EHG, DBL, MJS, KSL, CLS, RJC, JLD and NOM contributed to article editing. All authors approved the final version of the article.
Supporting information
Figure S1. Additional analysis for the scRNA‐seq dataset illustrated unsupervised clustering, CytoTRACE scores, and merged clusters by similar type
Figure S2. Cell type assignments, correlations, and metaplastic scoring for scRNA‐seq dataset
Figure S3. Gene set enrichment and differential gene expression analysis for absorptive precursors, deep crypt secretory cells, and goblet cell subtypes
Figure S4. Spatial transcriptomics of C. difficile‐colonized Apc Min/+ mice using the GeoMx Digital Spatial Profiler (DSP)
Figure S5. Dysplasia score genes from spatial transcriptomics
Figure S6. Additional transcriptomics analysis
Figure S7. Immunofluorescence and H&E microscopy images from the human tissue microarray
Table S1. Differential gene expression for scRNA‐seq Leiden clusters as cell types
Table S2. Mouse cell cycle genes
Table S3. Statistics for DMBT1 expression by cell type
Table S4. Differential gene expression for scRNA‐seq comparing C. difficile to control
Table S5. Gene set enrichment analysis for absorptive cells
Table S6. Differential gene expression analysis for Clca1 versus Agr2 goblet cells
Table S7. Source data and metadata for UMAP plots of spatial transcriptomics
Table S8. Differential gene expression analysis for spatial transcriptomics
Table S9. Dysplasia score statistics by cell type
Table S10. Ligand‐receptor pair inference source data
Table S11. Differential gene expression analysis and Dmbt1 spatial transcriptomics expression
Table S12. DMBT1 human scRNA‐seq statistics
Acknowledgments
Core Services were performed through the Vanderbilt University Medical Center's Digestive Disease Research Center (NIH grant P30DK058404) and the Vanderbilt Ingram Cancer Center (NIH grant P30CA068485). These cores included the Tissue Pathology Shared Resource, Vanderbilt Technologies for Advanced Genomics (VANTAGE), and Digital Histology Shared Resource. Funding to NOM and DBL was provided by the U.S. Department of Veterans Affairs via grants BX005699 and BX002943, respectively. This work was additionally supported by NIH grant R00CA230192 to JLD and P50CA236733 (GI SPORE) to RJC.
Conflict of interest statement: CLS reports grant support provided to her institution by Janssen and Bristol Myers Squibb. She receives royalties for reviews from ‘Up‐to‐Date’ unrelated to the current work. No other conflicts of interest were declared.
Data availability statement
All data supporting the findings of this study are publicly available. These data are included in the published article, supplementary information files, and/or can be obtained at NCBI Gene Expression Omnibus (GEO) database (https://ncbi.nlm.nih.gov/geo/) under Accession Numbers GSE270726 and GSE270745.
References
- 1. Bray F, Laversanne M, Sung H, et al. Global cancer statistics 2022: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin 2024; 74: 229–263. [DOI] [PubMed] [Google Scholar]
- 2. Levin TR, Corley DA, Jensen CD, et al. Effects of organized colorectal cancer screening on cancer incidence and mortality in a large community‐based population. Gastroenterology 2018; 155: 1383–1391.e5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. AlZaabi A, AlHarrasi A, AlMusalami A, et al. Early onset colorectal cancer: challenges across the cancer care continuum. Ann Med Surg (Lond) 2022; 82: 104453. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Arthur JC, Perez‐Chanona E, Mühlbauer M, et al. Intestinal inflammation targets cancer‐inducing activity of the microbiota. Science 2012; 338: 120–123. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Kostic AD, Chun E, Robertson L, et al. Fusobacterium nucleatum potentiates intestinal tumorigenesis and modulates the tumor‐immune microenvironment. Cell Host Microbe 2013; 14: 207–215. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Dejea CM, Fathi P, Craig JM, et al. Patients with familial adenomatous polyposis harbor colonic biofilms containing tumorigenic bacteria. Science 2018; 359: 592–597. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Pleguezuelos‐Manzano C, Puschhof J, Huber AR, et al. Mutational signature in colorectal cancer caused by genotoxic pks+ E. coli . Nature 2020; 580: 269–273. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Tekle GE, Garrett WS. Bacteria in cancer initiation, promotion and progression. Nat Rev Cancer 2023; 23: 600–618. [DOI] [PubMed] [Google Scholar]
- 9. Wu S, Rhee K‐J, Albesiano E, et al. A human colonic commensal promotes colon tumorigenesis via activation of T helper type 17 T cell responses. Nat Med 2009; 15: 1016–1022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Bullman S, Pedamallu CS, Sicinska E, et al. Analysis of fusobacterium persistence and antibiotic response in colorectal cancer. Science 2017; 358: 1443–1448. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Tomkovich S, Dejea CM, Winglee K, et al. Human colon mucosal biofilms from healthy or colon cancer hosts are carcinogenic. J Clin Invest 2019; 129: 1699–1712. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Shin AE, Tesfagiorgis Y, Larsen F, et al. F4/80+Ly6Chigh macrophages lead to cell plasticity and cancer initiation in colitis. Gastroenterology 2023; 164: 593–609.e13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Drewes JL, Chen J, Markham NO, et al. Human colon cancer‐derived Clostridioides difficile strains drive colonic tumorigenesis in mice. Cancer Discov 2022; 12: 1873–1885. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Mollenhauer J, End C, Renner M, et al. DMBT1 as an archetypal link between infection, inflammation, and cancer. Inmunología 2007; 26: 193–209. [Google Scholar]
- 15. Mollenhauer J, Holmskov U, Wiemann S, et al. The genomic structure of the DMBT1 gene: evidence for a region with susceptibility to genomic instability. Oncogene 1999; 18: 6233–6240. [DOI] [PubMed] [Google Scholar]
- 16. Bikker FJ, Ligtenberg AJM, End C, et al. Bacteria binding by DMBT1/SAG/gp‐340 is confined to the VEVLXXXXW motif in its scavenger receptor cysteine‐rich domains. J Biol Chem 2004; 279: 47699–47703. [DOI] [PubMed] [Google Scholar]
- 17. End C, Bikker F, Renner M, et al. DMBT1 functions as pattern‐recognition molecule for poly‐sulfated and poly‐phosphorylated ligands. Eur J Immunol 2009; 39: 833–842. [DOI] [PubMed] [Google Scholar]
- 18. Rundegren J, Arnold RR. Differentiation and interaction of secretory immunoglobulin a and a calcium‐dependent parotid agglutinin for several bacterial strains. Infect Immun 1987; 55: 288–292. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Garay J, Piazuelo MB, Lopez‐Carrillo L, et al. Increased expression of deleted in malignant brain tumors (DMBT1) gene in precancerous gastric lesions: findings from human and animal studies. Oncotarget 2017; 8: 47076–47089. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Mollenhauer J, Müller H, Kollender G, et al. The SRCR/SID region of DMBT1 defines a complex multi‐allele system representing the major basis for its variability in cancer. Genes Chromosomes Cancer 2002; 35: 242–255. [DOI] [PubMed] [Google Scholar]
- 21. Holmskov U, Mollenhauer J, Madsen J, et al. Cloning of gp‐340, a putative opsonin receptor for lung surfactant protein D. Proc Natl Acad Sci U S A 1999; 96: 10794–10799. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Tino MJ, Wright JR. Glycoprotein‐340 binds surfactant protein‐a (SP‐A) and stimulates alveolar macrophage migration in an SP‐A–independent manner. Am J Respir Cell Mol Biol 1999; 20: 759–768. [DOI] [PubMed] [Google Scholar]
- 23. Rosenstiel P, Sina C, End C, et al. Regulation of DMBT1 via NOD2 and TLR4 in intestinal epithelial cells modulates bacterial recognition and invasion. J Immunol 2007; 178: 8203–8211. [DOI] [PubMed] [Google Scholar]
- 24. Park HS, Kim BC, Yeo HY, et al. Deleted in malignant brain tumor 1 is a novel prognostic marker in colorectal cancer. Oncol Rep 2018; 39: 2279–2287. [DOI] [PubMed] [Google Scholar]
- 25. Mori M, Shiraishi T, Tanaka S, et al. Lack of DMBT1 expression in oesophageal, gastric and colon cancers. Br J Cancer 1999; 79: 211–213. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Robbe C, Paraskeva C, Mollenhauer J, et al. DMBT1 expression and glycosylation during the adenoma–carcinoma sequence in colorectal cancer. Biochem Soc Trans 2005; 33: 730–732. [DOI] [PubMed] [Google Scholar]
- 27. Kang W, Nielsen O, Fenger C, et al. Induction of DMBT1 expression by reduced ERK activity during a gastric mucosa differentiation‐like process and its association with human gastric cancer. Carcinogenesis 2005; 26: 1129–1137. [DOI] [PubMed] [Google Scholar]
- 28. Banerjee A, Herring CA, Chen B, et al. Succinate produced by intestinal microbes promotes specification of tuft cells to suppress ileal inflammation. Gastroenterology 2020; 159: 2101–2115.e5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Southard‐Smith AN, Simmons AJ, Chen B, et al. Dual indexed library design enables compatibility of in‐drop single‐cell RNA‐sequencing with exAMP chemistry sequencing platforms. BMC Genomics 2020; 21: 456. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Petukhov V, Guo J, Baryawno N, et al. dropEst: pipeline for accurate estimation of molecular counts in droplet‐based single‐cell RNA‐seq experiments. Genome Biol 2018; 19: 78. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Heiser CN, Wang VM, Chen B, et al. Automated quality control and cell identification of droplet‐based single‐cell data using dropkick. Genome Res 2021; 31: 1742–1752. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Wolf FA, Angerer P, Theis FJ. SCANPY: large‐scale single‐cell gene expression data analysis. Genome Biol 2018; 19: 15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Fang Z, Liu X, Peltz G. GSEApy: a comprehensive package for performing gene set enrichment analysis in python. Bioinformatics 2022; 39: btac757. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Merritt CR, Ong GT, Church SE, et al. Multiplex digital spatial profiling of proteins and RNA in fixed tissue. Nat Biotechnol 2020; 38: 586–599. [DOI] [PubMed] [Google Scholar]
- 35. Park J, Foox J, Hether T, et al. System‐wide transcriptome damage and tissue identity loss in COVID‐19 patients. Cell Rep Med 2022; 3: 100522. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Vega PN, Nilsson A, Kumar MP, et al. Cancer‐associated fibroblasts and squamous epithelial cells constitute a unique microenvironment in a mouse model of inflammation‐induced colon cancer. Front Oncol 2022; 12: 878920. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Powell AE, Wang Y, Li Y, et al. The pan‐ErbB negative regulator Lrig1 is an intestinal stem cell marker that functions as a tumor suppressor. Cell 2012; 149: 146–158. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Miyoshi H, Stappenbeck TS. In vitro expansion and genetic modification of gastrointestinal stem cells in spheroid culture. Nat Protoc 2013; 8: 2471–2482. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Garcia‐Alonso L, Handfield L‐F, Roberts K, et al. Mapping the temporal and spatial dynamics of the human endometrium in vivo and in vitro. Nat Genet 2021; 53: 1698–1711. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Waskom M. seaborn: statistical data visualization. J Open Source Softw 2021; 6: 3021. [Google Scholar]
- 41. Wickham H. Ggplot2: Elegant Graphics for Data Analysis. Springer‐Verlag: New York, 2016. [Google Scholar]
- 42. Gulati GS, Sikandar SS, Wesche DJ, et al. Single‐cell transcriptional diversity is a hallmark of developmental potential. Science 2020; 367: 405–411. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Fazilaty H, Brügger MD, Valenta T, et al. Tracing colonic embryonic transcriptional profiles and their reactivation upon intestinal damage. Cell Rep 2021; 36: 109484. [DOI] [PubMed] [Google Scholar]
- 44. Kumar AV, Mills J, Lapierre LR. Selective autophagy receptor p62/SQSTM1, a pivotal player in stress and aging. Front Cell Dev Biol 2022; 10: 793328. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Basu N, Saha S, Khan I, et al. Intestinal cell proliferation and senescence are regulated by receptor guanylyl cyclase C and p21. J Biol Chem 2014; 289: 581–593. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46. Liang X, Duronio GN, Yang Y, et al. An enhancer‐driven stem cell–like program mediated by SOX9 blocks intestinal differentiation in colorectal cancer. Gastroenterology 2022; 162: 209–222. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. Malla SB, Byrne RM, Lafarge MW, et al. Pathway level subtyping identifies a slow‐cycling biological phenotype associated with poor clinical outcomes in colorectal cancer. Nat Genet 2024; 56: 458–472. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48. Zhu L, Gibson P, Currle DS, et al. Prominin 1 marks intestinal stem cells that are susceptible to neoplastic transformation. Nature 2009; 457: 603–607. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49. Chen B, Scurrah CR, McKinley ET, et al. Differential pre‐malignant programs and microenvironment chart distinct paths to malignancy in human colorectal polyps. Cell 2021; 184: 6262–6280.e26. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. Nyström EEL, Martinez‐Abad B, Arike L, et al. An intercrypt subpopulation of goblet cells is essential for colonic mucus barrier function. Science 2021; 372: eabb1590. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51. Birchenough GMH, Nyström EEL, Johansson MEV, et al. A sentinel goblet cell guards the colonic crypt by triggering Nlrp6‐dependent Muc2 secretion. Science 2016; 352: 1535–1542. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52. Lin K, Zou R, Lin F, et al. Expression and effect of CXCL14 in colorectal carcinoma. Mol Med Rep 2014; 10: 1561–1568. [DOI] [PubMed] [Google Scholar]
- 53. Qiao C, Huang W, Chen J, et al. IGF1‐mediated HOXA13 overexpression promotes colorectal cancer metastasis through upregulating ACLY and IGF1R. Cell Death Dis 2021; 12: 564. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54. Huang M‐Y, Chang H‐J, Chung F‐Y, et al. MMP13 is a potential prognostic marker for colorectal cancer. Oncol Rep 2010; 24: 1241–1247. [PubMed] [Google Scholar]
- 55. Chen X, Xu M, Xu X, et al. METTL14‐mediated N6‐methyladenosine modification of SOX4 mRNA inhibits tumor metastasis in colorectal cancer. Mol Cancer 2020; 19: 106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56. Li D, Peng Z, Tang H, et al. KLF4‐mediated negative regulation of IFITM3 expression plays a critical role in colon cancer pathogenesis. Clin Cancer Res 2011; 17: 3558–3568. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57. Shvab A, Haase G, Ben‐Shmuel A, et al. Induction of the intestinal stem cell signature gene SMOC‐2 is required for L1‐mediated colon cancer progression. Oncogene 2016; 35: 549–557. [DOI] [PubMed] [Google Scholar]
- 58. Sousa JF, Ham A‐JL, Whitwell C, et al. Proteomic profiling of paraffin‐embedded samples identifies metaplasia‐specific and early‐stage gastric cancer biomarkers. Am J Pathology 2012; 181: 1560–1572. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59. Heiser CN, Simmons AJ, Revetta F, et al. Molecular cartography uncovers evolutionary and microenvironmental dynamics in sporadic colorectal tumors. Cell 2023; 186: 5620–5637.e16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60. Niño JLG, Wu H, LaCourse KD, et al. Effect of the intratumoral microbiota on spatial and cellular heterogeneity in cancer. Nature 2022; 611: 810–817. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61. Li X, Wu D, Li Q, et al. Host‐microbiota interactions contributing to the heterogeneous tumor microenvironment in colorectal cancer. Physiol Genomics 2024; 56: 221–234. [DOI] [PubMed] [Google Scholar]
- 62. Burclaff J, Bliton RJ, Breau KA, et al. A proximal‐to‐distal survey of healthy adult human small intestine and colon epithelium by single‐cell transcriptomics. Cell Mol Gastroenterol Hepatol 2022; 13: 1554–1589. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63. Elmentaite R, Kumasaka N, Roberts K, et al. Cells of the human intestinal tract mapped across space and time. Nature 2021; 597: 250–255. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64. Fawkner‐Corbett D, Antanaviciute A, Parikh K, et al. Spatiotemporal analysis of human intestinal development at single‐cell resolution. Cell 2021; 184: 810–826.e23. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65. Parikh K, Antanaviciute A, Fawkner‐Corbett D, et al. Colonic epithelial cell diversity in health and inflammatory bowel disease. Nature 2019; 567: 49–55. [DOI] [PubMed] [Google Scholar]
- 66. Haber AL, Biton M, Rogel N, et al. A single‐cell survey of the small intestinal epithelium. Nature 2017; 551: 333–339. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67. Moor AE, Harnik Y, Ben‐Moshe S, et al. Spatial reconstruction of single enterocytes uncovers broad zonation along the intestinal villus axis. Cell 2018; 175: 1156–1167.e15. [DOI] [PubMed] [Google Scholar]
- 68. Li J, Simmons AJ, Chiron S, et al. A specialized epithelial cell type regulating mucosal immunity and driving human Crohn's disease. bioRxiv 2023; 2023.09.30.560293. [Not peer reviewed].
- 69. Lee S‐H, Jang B, Min J, et al. Up‐regulation of aquaporin 5 defines spasmolytic polypeptide‐expressing metaplasia and progression to incomplete intestinal metaplasia. Cell Mol Gastroenterol Hepatol 2022; 13: 199–217. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70. Sakamoto N, Feng Y, Stolfi C, et al. BRAFV600E cooperates with CDX2 inactivation to promote serrated colorectal tumorigenesis. Elife 2017; 6: e20331. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71. Diegelmann J, Czamara D, Bras EL, et al. Intestinal DMBT1 expression is modulated by Crohn's disease‐associated IL23R variants and by a DMBT1 variant which influences binding of the transcription factors CREB1 and ATF‐2. PLoS One 2013; 8: e77773. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72. Parang B, Kaz AM, Barrett CW, et al. BVES regulates c‐Myc stability via PP2A and suppresses colitis‐induced tumourigenesis. Gut 2017; 66: 852. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73. Ma C, Henn P, Miller C, et al. Loss of SATB2 expression is a biomarker of inflammatory bowel disease‐associated colorectal dysplasia and adenocarcinoma. Am J Surg Pathol 2019; 43: 1314–1322. [DOI] [PubMed] [Google Scholar]
- 74. Mollenhauer J, Wiemann S, Scheurlen W, et al. DMBT1, a new member of the SRCR superfamily, on chromosome 10q25.3–26.1 is deleted in malignant brain tumours. Nat Genet 1997; 17: 32–39. [DOI] [PubMed] [Google Scholar]
- 75. Lisle RCD, Xu W, Roe BA, et al. Effects of Muclin (Dmbt1) deficiency on the gastrointestinal system. Am J Physiol Gastrointest Liver Physiol 2008; 294: G717–G727. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76. Renner M, Bergmann G, Krebs I, et al. DMBT1 confers mucosal protection in vivo and a deletion variant is associated with Crohn's disease. Gastroenterology 2007; 133: 1499–1509. [DOI] [PubMed] [Google Scholar]
- 77. Takito J, Al‐Awqati Q. Conversion of ES cells to columnar epithelia by hensin and to squamous epithelia by laminin. J Cell Biol 2004; 166: 1093–1102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78. Vijayakumar S, Takito J, Hikita C, et al. Hensin remodels the apical cytoskeleton and induces columnarization of intercalated epithelial cells: processes that resemble terminal differentiation. J Cell Biol 1999; 144: 1057–1067. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79. Ma N, Zhao Y. DMBT1 suppresses cell proliferation, migration and invasion in ovarian cancer and enhances sensitivity to cisplatin through galectin‐3/PI3k/Akt pathway. Cell Biochem Funct 2020; 38: 801–809. [DOI] [PubMed] [Google Scholar]
- 80. Sheng S, Jiwen W, Dexiang Z, et al. DMBT1 suppresses progression of gallbladder carcinoma through PI3K/AKT signaling pathway by targeting PTEN. Biosci Biotechnol Biochem 2019; 83: 2257–2264. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Figure S1. Additional analysis for the scRNA‐seq dataset illustrated unsupervised clustering, CytoTRACE scores, and merged clusters by similar type
Figure S2. Cell type assignments, correlations, and metaplastic scoring for scRNA‐seq dataset
Figure S3. Gene set enrichment and differential gene expression analysis for absorptive precursors, deep crypt secretory cells, and goblet cell subtypes
Figure S4. Spatial transcriptomics of C. difficile‐colonized Apc Min/+ mice using the GeoMx Digital Spatial Profiler (DSP)
Figure S5. Dysplasia score genes from spatial transcriptomics
Figure S6. Additional transcriptomics analysis
Figure S7. Immunofluorescence and H&E microscopy images from the human tissue microarray
Table S1. Differential gene expression for scRNA‐seq Leiden clusters as cell types
Table S2. Mouse cell cycle genes
Table S3. Statistics for DMBT1 expression by cell type
Table S4. Differential gene expression for scRNA‐seq comparing C. difficile to control
Table S5. Gene set enrichment analysis for absorptive cells
Table S6. Differential gene expression analysis for Clca1 versus Agr2 goblet cells
Table S7. Source data and metadata for UMAP plots of spatial transcriptomics
Table S8. Differential gene expression analysis for spatial transcriptomics
Table S9. Dysplasia score statistics by cell type
Table S10. Ligand‐receptor pair inference source data
Table S11. Differential gene expression analysis and Dmbt1 spatial transcriptomics expression
Table S12. DMBT1 human scRNA‐seq statistics
Data Availability Statement
All data supporting the findings of this study are publicly available. These data are included in the published article, supplementary information files, and/or can be obtained at NCBI Gene Expression Omnibus (GEO) database (https://ncbi.nlm.nih.gov/geo/) under Accession Numbers GSE270726 and GSE270745.
