Skip to main content
mBio logoLink to mBio
. 2024 Feb 8;15(3):e03300-23. doi: 10.1128/mbio.03300-23

Genomic surveillance of Clostridioides difficile transmission and virulence in a healthcare setting

Erin P Newcomer 1,2,#, Skye R S Fishbein 1,3,#, Kailun Zhang 1,3, Tiffany Hink 4, Kimberly A Reske 4, Candice Cass 4, Zainab H Iqbal 4, Emily L Struttmann 4, Carey-Ann D Burnham 3,5,6, Erik R Dubberke 4,, Gautam Dantas 1,2,3,5,6,
Editor: Julian Parkhill7
PMCID: PMC10936198  PMID: 38329369

ABSTRACT

Clostridioides difficile infection (CDI) is a major cause of healthcare-associated diarrhea, despite the widespread implementation of contact precautions for patients with CDI. Here, we investigate strain contamination in a hospital setting and the genomic determinants of disease outcomes. Across two wards over 6 months, we selectively cultured C. difficile from patients (n = 384) and their environments. Whole-genome sequencing (WGS) of 146 isolates revealed that most C. difficile isolates were from clade 1 (131/146, 89.7%), while only one isolate of the hypervirulent ST1 was recovered. Of culture-positive admissions (n = 79), 19 (24%) patients were colonized with toxigenic C. difficile on admission to the hospital. We defined 25 strain networks at ≤2 core gene single nucleotide polymorphisms; two of these networks contain strains from different patients. Strain networks were temporally linked (P < 0.0001). To understand the genomic correlates of the disease, we conducted WGS on an additional cohort of C. difficile (n = 102 isolates) from the same hospital and confirmed that clade 1 isolates are responsible for most CDI cases. We found that while toxigenic C. difficile isolates are associated with the presence of cdtR, nontoxigenic isolates have an increased abundance of prophages. Our pangenomic analysis of clade 1 isolates suggests that while toxin genes (tcdABER and cdtR) were associated with CDI symptoms, they are dispensable for patient colonization. These data indicate that toxigenic and nontoxigenic C. difficile contamination persist in a hospital setting and highlight further investigation into how accessory genomic repertoires contribute to C. difficile colonization and disease.

IMPORTANCE

Clostridioides difficile infection remains a leading cause of hospital-associated diarrhea, despite increased antibiotic stewardship and transmission prevention strategies. This suggests a changing genomic landscape of C. difficile. Our study provides insight into the nature of prevalent C. difficile strains in a hospital setting and transmission patterns among carriers. Longitudinal sampling of surfaces and patient stool revealed that both toxigenic and nontoxigenic strains of C. difficile clade 1 dominate these two wards. Moreover, quantification of transmission in carriers of these clade 1 isolates underscores the need to revisit infection prevention measures in this patient group. We identified unique genetic signatures associated with virulence in this clade. Our data highlight the complexities of preventing transmission of this pathogen in a hospital setting and the need to investigate the mechanisms of in vivo persistence and virulence of prevalent lineages in the host gut microbiome.

KEYWORDS: C. difficile infection, transmission, whole-genome sequencing, virulence factors, pathogens, antibiotic resistance

INTRODUCTION

Clostridioides difficile infection (CDI) is one of the most common healthcare-associated infections in the US and is the leading cause of healthcare-associated infectious diarrhea (1, 2). Since the early 2000s, C. difficile research has focused largely on hypervirulent strains, such as PCR ribotype 027 (1, 36), which were responsible for hospital-associated CDI outbreaks. Strains of ribotype 027 were responsible for 51% and 84% of CDI cases in the United States and Canada in 2005, respectively (1, 4, 5). Since then, other circulating strains have emerged as the prevalent strains causative of CDI, such as 078 and 014/020 (79). One report indicated that the prevalence of PCR ribotype 027 decreased from 26.2% in 2012 to 16.9% in 2016 (9). As the landscape of C. difficile epidemiology continues to evolve, we must update our understanding of how various strains of this pathogen evolve, spread, and cause disease.

In addition to the changing prevalence of CDI-causing C. difficile strains, their transmission dynamics also appear to be evolving. In the late 1980s, it became clear that patients with active CDI shed spores onto their surroundings, leading to C. difficile transmission and future CDI events in the healthcare setting (1). Because of this, patients with active CDI are placed on contact precautions to prevent transmission to susceptible patients, which has been successful in reducing the rates of CDI (2, 10). Nevertheless, while epidemiological estimates indicate that 20%–42% of infections may be connected to a previous infection, multiple genomic studies fail to associate a CDI case to a previous case (1113). This suggests other potential sources of disease development in the hospital environment. Indeed, while asymptomatic carriers of C. difficile have not been a significant focus of infection prevention efforts, studies have shown that these carriers do shed toxigenic C. difficile spores to their surroundings, which could cause disease (14). Though carriers have not been consistently identified as major transmitters of C. difficile that causes CDI, recent work has suggested that patients carrying C. difficile asymptomatically may be at elevated risk for the development of CDI (15). Correspondingly, it is critical to both confirm this finding in another setting and understand the genomic factors that may influence the transition from carrier to CDI manifestation in hospitalized patient populations. Correspondingly, it is critical to understand if C. difficile carriers are major contributors to new C. difficile acquisition or CDI manifestation in hospitalized patient populations.

C. difficile strains are categorized into five major clades and three additional cryptic clades. These clades encompass immense pangenomic diversity with many mobilizable chromosomal elements (16, 17), including numerous temperate phages that have potential influences over C. difficile toxin expression, sporulation, and metabolism (18). Two major toxin loci, not required for viability, encode large multi-unit toxins that independently augment the virulence of C. difficile. Epithelial destruction and CDI have largely been attributed to the presence of pathogenicity locus (PaLoc) encoding toxins TcdA and TcdB. In addition, an accessory set of toxins (CdtA and CdtB) encoded at the binary toxin locus may worsen disease symptoms (19). Yet, many nontoxigenic strains of C. difficile have been documented and are adept colonizers of the GI tract, even without the PaLoc (20). As there has been continued debate about strain-specific virulence attributes (2123), it is important to investigate the extent of strain-level pangenomic diversity and the consequences of such diversity on host disease (24, 25).

The purpose of this study was to evaluate the role of C. difficile strain diversity in colonization outcomes and hospital epidemiology. By sampling patients (n = 384) and their environments for 6 months in two leukemia and hematopoietic stem cell (HCT) transplant wards at Barnes-Jewish Hospital in St. Louis, USA, we used isolate genomics to identify environmental contamination of both toxigenic and nontoxigenic C. difficile by carriers and CDI patients and corresponding transmission between both patient groups. Integration of isolate genomic data and CDI information from this prospective study with isolate genomic data from a complementary retrospective study of asymptomatic vs symptomatic C. difficile colonization in the same hospital (26, 27) indicated that the clade 1 lineage, containing both toxigenic strains and nontoxigenic strains, dominates circulating populations of C. difficile in this hospital. Furthermore, this lineage revealed novel clade-specific genetic factors that are associated with CDI symptoms in patients.

MATERIALS AND METHODS

Study design

This prospective observational study took place in the leukemia and hematopoietic stem cell transplant wards at Barnes-Jewish Hospital (BJH) in St. Louis, MO, USA. Each ward consisted of two wings with 16 beds; on the acute leukemia ward, we enrolled from both wings (32 beds), and on the HCT ward, we enrolled on one wing (16 beds). The wards were sampled for 6 months from January to July 2019 (acute leukemia) and 4 months from March to July 2019 (HCT). These units are located two floors apart in the same building. Colonized on admission was defined as (i) having a C. difficile culture-positive specimen collected before calendar day 3 of admission, or (ii) having a history of an earlier C. difficile culture-positive specimen of the same strain collected during a previous hospitalization. Acquisition was considered indeterminate when the earliest culture-positive specimen was collected on calendar day 4 or later during admission. For enzyme immunoassay (EIA)-positive admissions, new acquisitions were defined as having toxigenic culture-negative specimen that preceded the EIA-positive clinical stool collection during the same admission. EIA-positive admissions were classified as indeterminate if the patient did not have any stool or rectal swab culture results available prior to the collection of the EIA-positive stool sample. EIA-positive admissions were defined as colonized on admission or pre-existing colonization if the patient had a toxigenic culture-positive specimen that preceded the EIA-positive clinical stool during the same admission.

Sample collection, selective culture, and isolate identification

Patients and their environments were sampled upon admission to a study ward and then weekly until discharge. Per hospital standards, bleach is used for daily and terminal discharge cleaning. From each patient, a stool specimen and/or rectal swab was collected as available. Remnant fecal samples from the BJH microbiology laboratory that were obtained during routine clinical care for C. difficile testing were also collected. Stool samples and rectal swabs collected on enrollment were refrigerated for up to 3 hours before processing. Specimens from all other time points were stored at −80°C in tryptic soy broth (TSB)/glycerol before processing. Environmental samples were collected from bedrails, keyboards, and sink surfaces using three E-swabs (Copan). If a surface was unable to be sampled, a swab was taken from the IV pump or nurse call button as an alternative. Swab eluates were stored at −80°C until processing.

Broth enrichment culture for C. difficile in Cycloserine Cefoxitin Mannitol Broth with Taurocholate and Lysozyme (Anaerobe Systems, Morgan Hill, CA, USA) was performed on all admission specimens and checked for growth at 24 hours, 48 hours, and 7 days after inoculation. If C. difficile was isolated, all other specimens collected from that patient and their surroundings were also cultured on Cycloserine-Cefoxitin Fructose Agar with Horse Blood and Taurocholate agar (Anaerobe Systems). Colonies resembling C. difficile (large, spreading, gray, ground glass appearance) were picked by a trained microbiologist and sub-cultured onto a blood agar plate (BAP). Growth from the subculture plate was identified using matrix-assisted laser desorption/ionization-time of flight mass spectrometry (MALDI-TOF MS) (bioMerieux, Durham, NC, USA). Upon identification, sweeps of C. difficile BAPs were collected in TSB and stored at −80°C for sequencing. If both the rectal swab sample and stool sample produced a C. difficile isolate, the stool isolate was preferentially used for analysis over the rectal swab isolate. The discharge/last specimen collected for admission was also cultured for C. difficile if C. difficile was not isolated from the admission specimen. If C. difficile was isolated from the discharge/last specimen collected, then all specimens from that admission were also cultured for C. difficile.

C. difficile toxin enzyme immunoassay was conducted as part of routine clinical care based on clinical suspicion of CDI. To be diagnosed with C. difficile infection, a patient must have been EIA+ for C. difficile toxin (Alere TOX A/B II); those who were not tested (due to no clinically significant diarrhea) or tested EIA− and were culture-positive for C. difficile were considered C. difficile carriers. Episodes of carriage or CDI are defined as the time from the first culture-positive specimen from a patient to the last culture-positive specimen during a given hospital admission.

Short read sequencing and de novo genome assembly

Parameters used for computational tools are provided parenthetically. Total genomic DNA from C. difficile isolates was extracted from frozen plate scrapes using the QIAamp BiOstic Bacteremia DNA Kit (Qiagen) and quantified with the PicoGreen dsDNA assay (Thermo Fisher Scientific). DNA from each isolate was diluted to a concentration of 0.5 ng/µL for library preparation using a modified Nextera kit (Illumina) protocol (28). Sequencing libraries were pooled and sequenced on the NovaSeq 6000 platform (Illumina) to obtain 2 × 150 bp reads. Raw reads were demultiplexed by index pair and adapter sequencing trimmed and quality filtered using Trimmomatic (v0.38, SLIDINGWINDOW:4:20, LEADING:10, TRAILING:10, MINLEN:60) (29). Cleaned reads were assembled into draft genomes using Unicycler (v0.4.7) (30). Draft genome quality was assessed using Quast (31), BBMap (32), and CheckM (33), and genomes were accepted if they met the following quality standards: completeness greater than 90%, contamination less than 5%, N50 greater than 10,000 bp, and less than 500 contigs > 1,000 bp. Isolates and metadata can be found in Table S1.

Isolate characterization and typing

A Mash Screen was used to identify likely related genomes from all NCBI reference genomes (34). Average nucleotide identity (ANI) between the top three hits and the draft assembly was calculated using dnadiff (35). Species were determined if an isolate had >75% alignment and >96% ANI (36) to a type strain and were otherwise classified as genomospecies of the genus-level taxonomy call.

In silico multilocus sequence typing (MLST) was determined for all C. difficile and genomospecies isolates using mlst (37, 38). Isolate contigs were annotated using Prokka (39) (v1.14.5, -mincontiglen 500, -force, -rnammer, -proteins GCF_000210435.1_ASM21043v1_protein.faa [40]). cdtAB was determined to be a pseudogene if there were three hits to cdtB, indicating the damaged structure of the pseudogene (41). C. difficile clade was determined using predefined clade-MLST relationships described by Knight et al. (16).

Phylogenetic analyses

The .gff files output by Prokka (39) were used as input for Panaroo (v1.2.10) (42) to construct a core genome alignment. The Panaroo alignment was used as input to construct a maximum-likelihood phylogenetic tree using Fasttree (43). The output .newick file was visualized using the ggtree (v3.4.0) (44) package in R. Cryptic clade isolates were determined as such based on phylogenetic clustering with cryptic clade reference isolates.

SNP analyses and network formation

We identified pairwise single nucleotide polymorphism (SNP) distances between isolates identified as the same MLST. The isolate assembly with the fewest number of contigs in an MLST group was chosen as a reference for that MLST group. Cleaned reads were aligned to their respective reference, and SNP distances were calculated with snippy (45). Pairwise SNP distances between isolates were calculated by merging VCF files with bcftools (46) and a custom script. Only SNPs within the core genome of each MLST group were considered, thus core MLST SNPs were used for strain network determination. A cutoff of ≤2 core MLST SNPs was used to define strain networks, as has been used previously to account for strain variation (15, 47).

Phage identification and clustering

Isolate genomes were analyzed with Cenote-Taker 2 (48) to identify contigs with end features such as direct terminal repeats indicating circularity and inverted linear repeats (ITRs) or no features for linear sequences. Identified contigs were filtered by length and completeness to remove false positives. Length limits were 1,000 nucleotides (nt) for the detection of circularity, 4,000 nt for ITRs, and 5,000 nt for other linear sequences. The completeness was computed as a ratio between the length of our phage sequence and the length of matched reference genomes by CheckV (49), and the threshold was set to 10.0%. Phage contigs passing these two filters were then run through VIBRANT (50) with the “virome” flag to further remove obvious non-viral sequences (50). Based on MIUViG recommended parameters (51), phages were grouped into “populations” if they shared ≥95% nucleotide identity across ≥85% of the genome using BLASTN and CheckV.

Analysis of genotypic associations with disease severity

Two previously sequenced retrospective cohorts from the same hospital were included to increase power (26, 52). In the analyses of toxigenic vs nontoxigenic isolates from clade 1, pyseer (53) was run using an SNP distance matrix (using snp-dist as above), binary genotypes (presence or absence of tcdB), and Panaroo-derived gene presence/absence data. In the analysis of CDI suspicion, all isolates from clade 1 were used, which represented one isolate per patient episode. Isolates recovered from environmental surfaces were excluded. Using these assemblies, a core genome alignment was generated using Prokka (39) and Panaroo (42) as above. SNP distances were inferred from the core-gene alignment using snp-dists (54). Binary phenotypes were coded for the variable CDI suspicion, whereby isolates associated with a clinically tested stool were associated with symptomatic colonization (TRUE). Isolates that were associated with a surveillance stool and had no clinical testing associated with that patient timepoint were coded as non-symptomatic colonization (FALSE). Gene candidates were filtered based on “high-bse” and annotated using HMMER on RefSeq databases and using a bacteriophage-specific tool, VIBRANT (50). Selected outputs were visualized in R using the beta coefficient as the x-axis and the −log10(likelihood ratio test P-value) as the y-axis.

Reference assembly collection

We chose 23 reference assemblies from Knight et al. (16) for Fig. 2 because of their MLST-clade associations (Table S2). References span clades 1–5 and cryptic clades C-1, C-2, and C-3, with one reference from each of the three most frequent MLSTs in each clade. Cryptic clade C-3 only had two reference assemblies available. References were annotated and included in phylogenetic tree construction as above.

All Clostridioides difficile genomes available at the National Institutes of Health National Library of Medicine were acquired for Fig. 4c construction. References from NCBI (Table S3) were included if they had less than 200 contigs. Assemblies that met these quality requirements were annotated and phylogenetically clustered as above.

RESULTS

Surveillance of C. difficile reservoirs in hospital wards reveals patient colonization and environmental contamination

We prospectively collected patient and environmental samples to investigate genomic determinants of C. difficile carriage, transmission, and CDI (Fig. 1). Across the study period, we enrolled 384 patients from 647 unique hospital admissions and collected patient specimens upon admission and weekly thereafter (Fig. S1). We collected at least one specimen (clinical stool collected as part of routine care, study-collected stool, or study-collected rectal swab) from 364 admissions for a total of 1,290 patient specimens (Table 1). We selectively cultured C. difficile from 151 stool specimens or rectal swabs if stool was unavailable or culture negative. We also collected weekly swabs from the bedrails, sink surfaces, and in-room keyboards, for a total of 3,045 swabs from each site. We cultured all environmental swabs collected from rooms in which patients that ever produced culturable C. difficile were housed, for a total of 398 swab sets plus one and two additional keyboard and sink handle swabs, respectively. In total, 22/398 (5.5%) bedrail swabs cultured and 4/399 (1.0%) keyboard swabs cultured were culture positive for C. difficile (Fig. 2a). C. difficile was never recovered from sink surfaces (all sinks on these units are hand-less activated) or other sampled sites. Collapsing multiple positive samples from the same patient admission results in 20 positive bedrails (20/79, 25.3% of all admissions with positive patient specimens) and 4 positive keyboards (4/79, 5.06% of all admissions with positive patient specimens) (Fig. 2b).

Fig 1.

Fig 1

Study sampling and testing overview. (a) We sampled leukemia and hematopoietic stem cell transplant wards at Barnes-Jewish Hospital in St. Louis, USA for 6 and 4 months, respectively. Patients were enrolled and sampled upon admission, and then weekly for their time in the study wards. Surfaces were sampled weekly across the duration of the study. All samples and stool collected as part of routine clinical care were subjected to selective culture and MALDI-TOF MS identification, and isolates were whole-genome sequenced. Results of EIA testing as part of routine care were obtained.

TABLE 1.

C. difficile epidemiology on the admission level (N = 647 admissions)a,b

Variable N (%)
Admissions with ≥1 stool and/or rectal swab specimen collected 364/647 (56)
 Culture positive any time during admission 79/364 (22)
 Toxigenic culture positive at any time during admission 57/364 (16)
  Toxigenic culture positive on admission (colonized on admission) 19/364 (4)
  Toxigenic culture negative on admission, toxigenic culture positive later in admission (acquisition) 6/364 (2)
  Unknown colonization status on admission, toxigenic culture positive later in admission (indeterminate acquisition) 32/364 (9)
 Non-toxigenic culture positive at any time during admission 26/364 (7)
  Non-toxigenic culture positive on admission (non-toxigenic colonized on admission) 6/364 (2)
  Non-toxigenic culture negative on admission, toxigenic culture positive later in admission (non-toxigenic acquisition)c 2/364 (1)
  Unknown colonization status on admission, non-toxigenic culture positive later in admission (indeterminate acquisition)d 18/364 (5)
EIA positive during admission 25/647 (4)
 EIA positive during admission with specimen(s) collected 17/364 (5)
  Indeterminate toxigenic C. difficile acquisition 6/17 (35)
  New acquisition 3/17 (18)
  False-positive EIAe 3/17 (18)
  Colonized on admission or pre-existing colonization 3/17 (18)
  Recurrent CDI 2/17 (12)
a

Colonized on admission was defined as (i) having a C. difficile culture-positive specimen collected before calendar day 3 of admission, or (ii) having a history of an earlier C. difficile culture-positive specimen of the same strain collected during a previous hospitalization. Acquisition was considered indeterminate when the earliest culture-positive specimen was collected on calendar day 4 or later during admission.

b

For EIA-positive admissions, new acquisitions were defined as having toxigenic culture-negative specimens that preceded the EIA-positive clinical stool collection during the same admission. EIA-positive admission was classified as indeterminate if the patient did not have any stool or rectal swab culture results available prior to the collection of the EIA-positive stool sample. EIA-positive admissions were defined as colonized on admission or pre-existing colonization if the patient had a toxigenic culture-positive specimen that preceded the EIA-positive clinical stool during the same admission.

c

Includes one patient admission with non-toxigenic C. difficile acquisition, who was co-colonized with toxigenic C. difficile on admission (This patient is also counted in the toxigenic culture positive colonized on admission row.).

d

Includes two patient admissions co-colonized with toxigenic C. difficile; both toxigenic and non-toxigenic were indeterminate acquisition (These patients are also counted in the toxigenic culture indeterminate acquisition row.).

e

These results were considered false-positive EIAs because both the clinical stool specimen and additional surveillance stool or rectal swab specimens were toxigenic culture negative. One participant had a culture-negative clinical specimen and culture-negative stool and rectal swab specimens from the subsequent day. The second participant had a culture-negative clinical specimen and six additional culture-negative specimens (five stool and one rectal swab) collected during admission, ranging from 3 days prior to clinical stool collection through 26 days after clinical stool collection. The third participant had four non-toxigenic culture-positive specimens during the EIA+ admission (including the clinical stool; dates ranged from 8 days prior to EIA+ through 6 days post-EIA+ collection) and three non-toxigenic culture-positive specimens during a subsequent admission; all seven non-toxigenic culture-positive specimens were the same strain. None of the three participants had a toxigenic culture-positive specimen during the EIA+ admission.

Fig 2.

Fig 2

Total samples collected and phylogenetic relationships reveal carriers outnumber CDI patients, and bedrails are the most commonly contaminated surface. Total (a) isolates collected and (b) culture-positive episodes from each source. We found more carriers than CDI patients, and bedrails yielded the most C. difficile isolates. (c) Cladogram of all isolates collected during this study plus references.

Results from selective culture indicated that 21.7% of unique admissions (79/364 admissions with available specimens) were culture positive for C. difficile at some point during their admission (Fig. 2b; Table 1). Of these, 57 were toxigenic culture positive. Nineteen (4% of all admissions) patient admissions were considered “colonized on admission” (i.e., toxigenic culture positive within the first three calendar days of admission), and toxigenic C. difficile was acquired in six (2%) admissions. For most toxigenic culture-positive admissions (32; 9% of all admissions), C. difficile acquisition was considered indeterminate, meaning the earliest toxigenic culture-positive specimen was collected on calendar day 4 or later during admission. Full admission-level culture results can be found in Table 1.

C. difficile carriers outnumbered patients with CDI

Patients with CDI were identified through routine clinical care, with CDI defined as patients who had stool submitted for C. difficile testing, as ordered by the clinical team when suspicious for CDI, and who tested positive for C. difficile toxins by enzyme immunoassay (EIA+). Otherwise, if they were culture positive and EIA− or culture positive and not EIA tested, they were considered carriers. Overall, 25 positive EIAs occurred during the study period; of these, 17 occurred during admissions with study specimens available for culture. Among these 17 admissions, 3 (18%) were considered new C. difficile acquisition, 6 (35%) had indeterminate timing of C. difficile acquisition, 3 (18%) were false-positive EIAs, 3 (18%) were colonized on admission/pre-existing colonization, and 2 (12%) were recurrent CDI (Table 1). The substantial detection of longitudinal patient C. difficile colonization prompted us to investigate the genomic correlates of C. difficile-associated disease and transmission in these two patient populations.

Phylogenetic clustering reveals lack of hypervirulent strains and presence of cryptic clades

We conducted whole-genome sequencing to ascertain phylogenetic distances among isolates and to identify closely related strains of C. difficile. We identified 141 isolate genomes as C. difficile (using a 75% alignment and 96% average nucleotide identity threshold). One isolate was identified as Clostridium innocuum and five isolates were classified as C. difficile genomospecies (92%–93% ANI). To contextualize population structure, we applied a previously established MLST-derived clade definition to our isolate cohort (16). The majority of C. difficile isolates were from clade 1 (131/146, 89.7% of C. difficile and genomospecies, Fig. 2c). Four patient-derived isolates were identified from clade 2 but only one was of the hypervirulent strain ST1 (PCR ribotype 027) (6). We found that the distribution of STs associated with carriers was significantly different from that of STs associated with CDI patients (P < 0.001, Fisher’s exact test, Fig. 2b), suggesting some strain specificity to disease outcome.

Interestingly, the five genomospecies’ isolates clustered with other isolates belonging to a recently discovered C. difficile cryptic clade C-1 (Fig. S2). While cryptic clades are genomically divergent from C. difficile, these isolates can produce homologs to TcdA/B and cause CDI-like disease in humans (16, 55). In a clinical setting, they are frequently identified by MALDI-TOF MS as C. difficile and diagnosed as causative of CDI (55). These data highlight the novel distribution of circulating C. difficile strains in the two study wards. While many patients with multiple isolates had homogeneous signatures of colonization (with closely related isolates), four patients (4/72 patients with positive cultures, 6%) produced isolates from distinct ST types.

Carriers and CDI patients contribute to transmission networks and environmental contamination

Given the predominance of clade 1 isolates, we sought to identify clonal populations of C. difficile strains, indicative of direct C. difficile contamination (patient-environment) or transmission (patient-patient). We compared pairwise, core genome SNP distances within MLST groups to identify networks of transmission connecting isolates less than or equal to two SNPs apart (Fig. S3). We identified a total of 25 strain networks, 2 of which contain patient isolates from different patients (networks 17 and 31, Fig. 3a and d). These strain networks were temporally linked, as there were significantly fewer days between same-network isolates than isolates from different networks (P < 2.2e-16, Wilcoxon, Fig. 3b). We also sought to understand if CDI patients were more likely to contaminate bedrails than carriers. While we found slightly higher numbers of total bedrail isolates collected and unique bedrails contaminated by networks with CDI patients, neither comparison reached statistical significance (ns, Student’s t-test, Fig. S4a and b).

Fig 3.

Fig 3

Surfaces are a site of environmental contamination and potential for transmission from colonized and CDI patients. (a) Strain networks were defined by ≤2 MLST core gene SNP cutoff. Network 10 includes the non-toxigenic isolates from patient 2245, which are likely not responsible for the CDI. (b) Absolute value of days between isolates within strains and between strains. Isolates within the same strain were significantly temporally linked (P < 2.2e−16, Wilcoxon test). (c) Number of comparisons in each group that fall within strain cutoff. Patient: between two isolates collected from the same patient; bedrail: between a patient isolate and an isolate taken from their bedrail; keyboard: between a patient isolate and an isolate taken from their keyboard. Fisher’s exact test, BH corrected. (d) Strain tracking diagram of transmission networks associated with more than one patient. Colors indicate MLST of network and horizontal lines indicate stay in a room. Patient 2330 shed C. difficile onto the bedrail, and patient 2336 later was identified as a carrier of the same strain.

We compared strain connections among a single patient’s isolates from stool or rectal swab (“patient”) and between these isolates and environmental isolates from their immediate surroundings (“bedrail” or “keyboard,” Fig. 3c). While the majority of bedrail isolates fell within the same network as patient isolates from that room (33/44 comparisons, 75%), 25% (11/44 comparisons) were genomically distinct, suggesting contamination from alternate sources. Keyboards were mostly colonized with distinct strains from the patient (22%, 2/9 comparisons), indicating other routes of contamination (P < 0.05, Fisher’s exact test, BH corrected, Fig. 3c). Among the networks that contain multiple patients, we found no instances of potential transmission from the inhabitant of one room to the subsequent inhabitant. However, in both instances, each potential transmission is associated with a temporal overlap in patient stay in the same ward, providing epidemiological support for putative transmission (Fig. 3d). Importantly, we found no networks connecting patients with CDI to C. difficile carriers, suggesting successful containment through contact precaution protocols. Two patients (patients 2026 and 2056) carried a strain of C. difficile and later developed CDI with that same strain. These data suggest that direct transmission from CDI patients may no longer be the driving force behind patient CDI in this setting on contact precautions and prompted us to investigate the relationship between isolate genetic diversity and patient symptomology.

Accessory genomic elements are associated with host CDI symptoms

Despite evidence of transmission in this prospective study, a minority of patients were diagnosed with CDI relative to those asymptomatically colonized with C. difficile in part due to the presence of nontoxigenic C. difficile isolates (Fig. 2b). To power our investigation of virulence determinants across patient-colonizing C. difficile strains, we performed whole-genome sequencing on 102 additional patient-derived C. difficile isolates from a previously described C. difficile-colonized/CDI cohort from the same hospital (26), where all patients had clinical suspicion of CDI (CDI suspicion), defined by a clinician ordering an EIA test during patient admission. Using an MLST-based clade definition as above, we identified that most CDI cases result from isolates within clade 1, though clade 2 isolates were more likely to be associated with CDI status (Fig. 4a). The latter finding supports previous data indicating that clade 2 isolates are hypervirulent, often attributed to the presence of the binary toxin operon or increased expression from the PaLoc (19, 56, 57). Meanwhile, some clade 1 isolates contain no toxin genes, indicating a diversity of colonization strategies in this lineage. Pangenomic comparison of nontoxigenic vs toxigenic isolates revealed that in addition to the PaLoc, the majority of our toxigenic isolates from clade 1 (95/131 of our cohort) possess remnants of the binary toxin operon (Fig. 4b, cdtR and cdtA/B pseudogenes). Interestingly, we found that nontoxigenic isolates had a higher diversity of phage populations relative to toxigenic isolates (Fig. S5, P = 5.7e-8, Wilcoxon). Given the previous report that full-length cdtAB was identified only within clades 2, 3, and 5 (16), we investigated the conservation of cdtR (the transcriptional regulator of the binary toxin locus) across C. difficile strains (containing five lineages). We additionally examined >1,400 C. difficile genome assemblies from NCBI (Table S3; Fig. 4c). cdtR (unlike cdtAB) was dispersed across clade 1 and significantly associated with tcdB (Fig. 4d, Fisher’s exact test, BH corrected), suggesting a selective pressure to maintain some element of both toxin loci in these isolates. Notably, these operons are not syntenic, further underlining the significance of the association. From this association, we sought to understand why some toxigenic clade 1 isolates cause CDI and some colonize without symptoms. Using 148 toxigenic clade 1 isolates collected from this study and two previous studies from the same hospital (26, 52), we utilized a bacterial GWAS approach, pyseer (53), that identifies genetic traits associated with a binary or continuous phenotype, in this case, patients with or without CDI symptoms. Using CDI suspicion (see Materials and Methods) as an outcome variable, we found that multiple amidases (including cwlD), putative transcriptional regulators, and many genes of unknown function were enriched in isolates associated with CDI symptoms (Fig. 4e). These data indicate that the most prevalent circulating Cd strains that cause CDI are not the hypervirulent clade 2 strains but highlight the possibility that remnant genomic features from epidemic strains and other features may contribute to virulence in this hospital-associated clade of C. difficile.

Fig 4.

Fig 4

Clade 1 is responsible for the majority of CDI cases and carries unique correlates to symptom severity. (a) EIA status by clade across this and a previous study (26). Fisher’s exact test, P < 0.01. (b) Differentially abundant genes between toxigenic and nontoxigenic isolates in clade 1 from this study. Genes with a population structure adjusted P-value (LRT P-value) of <0.001 as produced by pyseer. (c) Phylogenetic tree of >1,400 C. difficile isolates from NCBI (Table S3) depicting the presence of binary toxin and PaLoc operons. (d) Presence of full-length cdtR and association with tcdB presence. (e) Filtered results (P-values < 0.01), pyseer analysis evaluating gene association with CDI suspicion in clade 1 isolates using the LRT P-value. Purple color indicates P < 0.001. Positive beta coefficient indicates gene association with CDI suspicion, while negative beta indicates asymptomatic colonization.

DISCUSSION

Through our prospective genomics study of two hospital wards, we were able to identify the connections between contamination of different surfaces and the strains carried by hospitalized patients and quantify some spread between carriers. Our estimates of the prevalence of patients with CDI (3.8%) agree with other estimates of 2%–4% CDI in patients with cancer (5860). While many studies have quantified surface contamination, few have had the genomic resolution to identify clonality between isolates indicating transmission or patient shedding (6163).

We observed distinct contamination between a patient’s bedrail that differed from the strain the patient carried, indicating that the bedrail may be a point of transmission. Furthermore, we did not identify any instances of CDI that could be genomically linked to an earlier CDI case or C. difficile carrier. We identified two possible instances of transmission between carriers, though neither of these occurrences resulted in CDI. As this finding is in the context of contact precautions for CDI patients, it indicates that these strategies are successful at limiting transmission of C. difficile that causes CDI, and there is a limited risk of CDI due to transmission from carriers. These findings confirm previous suggestions that carriers are not a significant risk for transmission leading to CDI (64, 65).

Our data suggest the need to investigate diverse lineages of C. difficile beyond previous epidemic strains to clarify the mechanisms of the disease. Among 79 culture-positive admissions, we only isolated the epidemic PCR ribotype 027 strain once, causing just one case of CDI within our cohort. Because the overall burden of clade 1 isolates was so high, we hypothesize that understanding the mechanisms and genomic factors by which these isolates cause disease may become more important as the burden of PCR ribotype 027 decreases (66). While clade 1 isolates associated with CDI symptoms are expectedly toxigenic (containing the toxin genes in the PaLoc), we also found an enrichment in two different amidase genes, which could contribute to differences in cell cycle progression during germination, growth or autolysis (67, 68). How the function of such a gene contributes to an increase in symptomology remains to be understood. Furthermore, we confirmed a genetic relationship between cdtR and tcdB across C. difficile lineages that indicates some evolutionary pressure for maintaining the regulatory gene of the less prevalent toxin operon (cdtR). This phylogenomic analysis supports recent functional data from clade 2 isolates that the presence of cdtR increases the expression of tcdB disease severity in an animal model of CDI (57). While this was previously suggested in vitro, it is unclear how generalizable this relationship is across lineages (56). In fact, we predict that clade 1 isolates containing only cdtR and the PaLoc may produce more toxin in vivo than those without cdtR. Future studies are warranted to investigate the role of both classes of genes implicated in this phenotype.

Our study has a number of important limitations. As this study focused on C. difficile colonization, disease, and transmission in two wards in the same hospital system, studies with increased sample size or meta-analysis studies are necessary to understand generalizable epidemiological measurements of C. difficile-patient dynamics (15). For example, we were unable to fully quantify in-unit transmission, as not all patients were able to provide stool specimens and/or consent to rectal swabs within 3 days of admission. Additionally, since we did not culture all environmental swabs or specimens, we likely missed some instances of surface contamination or more transient patient carriage, and thus expect that we underestimated the frequencies of contamination and carriage in these wards. Furthermore, the patients housed in the leukemia and HCT ICUs are unique due to their long hospital stays and high antibiotic exposure (69). While this population was selected specifically to allow us to increase our sample sizes, these patient characteristics could contribute to extended C. difficile colonization time relative to other hospital patient cohorts. Finally, we note the evidence for multi-strain colonization within a single patient (Patient 2330). This patient was diagnosed with CDI, but only nontoxigenic C. difficile was isolated (network 10). This could be due to co-colonization, or a false-positive toxin EIA. Given our approach of only culturing and sequencing single isolates per patient time point, future studies are needed to investigate the extent of within-patient C. difficile strain diversity by interrogating additional cultured isolates per sample or via metagenomic methods (70).

Despite these limitations, this work highlights new investigative directions for our understanding of CDI virulence. This work and others find risk for patients carrying C. difficile long term in the development of CDI, and we hypothesize that the mechanisms of virulence may be more complex than previous epidemic strains. We also hypothesize that non-CDI carriers contribute to the expansion of C. difficile transmission networks. Indeed, though much human and animal research has focused on epidemic strains that are two decades old, we and others have identified disease and colonization largely from clade 1 lineages. We also investigate gene flux of phage-like elements, which may play an important role in colonization, particularly in nontoxigenic isolates. Moreover, within this lineage, we found a mosaic representation of genes associated with the PaLoc that highlights the possibility of different mechanisms of colonization by this population of C. difficile. Future studies utilizing other human cohorts or animal models are warranted to investigate disease and pathogenicity caused by clade 1 C. difficile strains.

Conclusions

Our study provides new insight into the nature of prevalent C. difficile strains in a hospital setting, transmission between carriers, and virulence during circulation. Longitudinal sampling of surfaces and patient stool revealed that both toxigenic and nontoxigenic strains of C. difficile clade 1 are prevalent in these two wards. Moreover, our estimation of carriage patterns emphasizes the need for further investigation into the longitudinal carriage of C. difficile and its increased risk for CDI. We also note distinct differences in phage carriage between toxigenic and nontoxigenic C. difficile. We identified novel associations of accessory genes with CDI symptomology and toxigenicity (cdtR and cwlD). Our data highlight the complexities of understanding disease from this pathogen in a hospital setting and the need to investigate the mechanisms of in vivo persistence and virulence of prevalent lineages in the host gut microbiome.

ACKNOWLEDGMENTS

The authors are grateful to members of the Dantas lab for their helpful feedback on the data analysis and preparation of the manuscript. The authors would also like to thank the Edison Family Center for Genome Sciences and Systems Biology staff, Eric Martin, Brian Koebbe, MariaLynn Crosby, and Jessica Hoisington-López for their expertise and support in sequencing/data analysis.

This work was supported in part by an award to E.R.D. and G.D. through the Foundation for Barnes-Jewish Hospital and Institute of Clinical and Translational Sciences. This publication was supported by the NIH/National Center for Advancing Translational Sciences (NCATS), grant UL1 TR002345 (PI: B. Evanoff). This work was also supported by funding through the CDC BAA #200-2018-02926 under PI Erik Dubberke. S.R.S.F. is supported by the National Institute of Child Health and Human Development (NICHD; https://www.nichd.nih.gov/) of the NIH under award number T32 HD004010 (PI: P. Tarr).

The conclusions from this study represent those of the authors and do not represent the positions of the funding agencies.

S.R.S.F., K.A.R., C.-A.D.B., E.R.D., and G.D. participated in idea formulation and funding for this project. T.H., K.A.R., C.C., Z.H.I., E.L.S., and E.R.D. conducted participant enrollment, sample collection, and microbiological isolation. E.P.N., S.R.S.F., K.Z., and G.D. conducted all sequencing analysis and figure generation. E.P.N. and S.R.S.F. completed the writing of the manuscript. All authors read and approved the final manuscript.

Contributor Information

Erik R. Dubberke, Email: edubberk@wustl.edu.

Gautam Dantas, Email: dantas@wustl.edu.

Julian Parkhill, Department of Veterinary Medicine, University of Cambridge, Cambridge, United Kingdom.

ETHICS APPROVAL

The study protocol was approved by the Washington University Human Research Protection Office (IRB #201810103). All participants provided written informed consent.

DATA AVAILABILITY

The data sets generated and analyzed during the current study are available in NCBI GenBank under BioProject accession no. PRJNA980715.

SUPPLEMENTAL MATERIAL

The following material is available online at https://doi.org/10.1128/mbio.03300-23.

Supplemental figures and tables. mbio.03300-23-s0001.pdf.

Fig. S1-S5; Tables S1-S3.

DOI: 10.1128/mbio.03300-23.SuF1

ASM does not own the copyrights to Supplemental Material that may be linked to, or accessed through, an article. The authors have granted ASM a non-exclusive, world-wide license to publish the Supplemental Material files. Please contact the corresponding author directly for reuse.

REFERENCES

  • 1. Czepiel J, Dróżdż M, Pituch H, Kuijper EJ, Perucki W, Mielimonka A, Goldman S, Wultańska D, Garlicki A, Biesiada G. 2019. Clostridium difficile infection: review. Eur J Clin Microbiol Infect Dis 38:1211–1221. doi: 10.1007/s10096-019-03539-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. McDonald LC, Gerding DN, Johnson S, Bakken JS, Carroll KC, Coffin SE, Dubberke ER, Garey KW, Gould CV, Kelly C, Loo V, Shaklee Sammons J, Sandora TJ, Wilcox MH. 2018. Clinical practice guidelines for Clostridium difficile infection in adults and children: 2017 update by the infectious diseases society of America (IDSA) and society for healthcare epidemiology of America (SHEA). Clin Infect Dis 66:e1–e48. doi: 10.1093/cid/cix1085 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Clements ACA, Magalhães RJS, Tatem AJ, Paterson DL, Riley TV. 2010. Clostridium difficile PCR ribotype 027: assessing the risks of further worldwide spread. Lancet Infect Dis 10:395–404. doi: 10.1016/S1473-3099(10)70080-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. McDonald LC, Killgore GE, Thompson A, Owens RC, Kazakova SV, Sambol SP, Johnson S, Gerding DN. 2005. An epidemic, toxin gene-variant strain of Clostridium difficile. N Engl J Med 353:2433–2441. doi: 10.1056/NEJMoa051590 [DOI] [PubMed] [Google Scholar]
  • 5. Loo VG, Poirier L, Miller MA, Oughton M, Libman MD, Michaud S, Bourgault A-M, Nguyen T, Frenette C, Kelly M, Vibien A, Brassard P, Fenn S, Dewar K, Hudson TJ, Horn R, René P, Monczak Y, Dascal A. 2005. A predominantly clonal multi-institutional outbreak of Clostridium difficile-associated diarrhea with high morbidity and mortality. N Engl J Med 353:2442–2449. doi: 10.1056/NEJMoa051639 [DOI] [PubMed] [Google Scholar]
  • 6. Fatima R, Aziz M. 2019. The hypervirulent strain of Clostridium difficile: NAP1/B1/027 - a brief overview. Cureus 11:e3977. doi: 10.7759/cureus.3977 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Goorhuis A, Bakker D, Corver J, Debast SB, Harmanus C, Notermans DW, Bergwerff AA, Dekker FW, Kuijper EJ. 2008. Emergence of Clostridium difficile infection due to a new hypervirulent strain, polymerase chain reaction ribotype 078. Clin Infect Dis 47:1162–1170. doi: 10.1086/592257 [DOI] [PubMed] [Google Scholar]
  • 8. Bauer MP, Notermans DW, van Benthem BHB, Brazier JS, Wilcox MH, Rupnik M, Monnet DL, van Dissel JT, Kuijper EJ, ECDIS Study Group . 2011. Clostridium difficile infection in Europe: a hospital-based survey. Lancet 377:63–73. doi: 10.1016/S0140-6736(10)61266-4 [DOI] [PubMed] [Google Scholar]
  • 9. Giancola SE, Williams RJ, Gentry CA. 2018. Prevalence of the Clostridium difficile BI/NAP1/027 strain across the United States veterans health administration. Clin Microbiol Infect 24:877–881. doi: 10.1016/j.cmi.2017.11.011 [DOI] [PubMed] [Google Scholar]
  • 10. Balsells E, Filipescu T, Kyaw MH, Wiuff C, Campbell H, Nair H. 2016. Infection prevention and control of Clostridium difficile: a global review of guidelines, strategies, and recommendations. J Glob Health 6:020410. doi: 10.7189/jogh.06.020410 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Kong LY, Eyre DW, Corbeil J, Raymond F, Walker AS, Wilcox MH, Crook DW, Michaud S, Toye B, Frost E, Dendukuri N, Schiller I, Bourgault A-M, Dascal A, Oughton M, Longtin Y, Poirier L, Brassard P, Turgeon N, Gilca R, Loo VG. 2019. Clostridium difficile: investigating transmission patterns between infected and colonized patients using whole genome sequencing. Clin Infect Dis 68:204–209. doi: 10.1093/cid/ciy457 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Svenungsson B, Burman LG, Jalakas-Pörnull K, Lagergren A, Struwe J, Akerlund T. 2003. Epidemiology and molecular characterization of Clostridium difficile strains from patients with diarrhea: low disease incidence and evidence of limited cross-infection in a Swedish teaching hospital. J Clin Microbiol 41:4031–4037. doi: 10.1128/JCM.41.9.4031-4037.2003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Durham DP, Olsen MA, Dubberke ER, Galvani AP, Townsend JP. 2016. Quantifying transmission of Clostridium difficile within and outside healthcare settings. Emerg Infect Dis 22:608–616. doi: 10.3201/eid2204.150455 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Warren BG, Turner NA, Addison R, Nelson A, Barrett A, Addison B, Graves A, Smith B, Lewis SS, Weber DJ, Sickbert-Bennett EE, Anderson DJ. 2022. The impact of infection versus colonization on Clostridioides difficile environmental contamination in hospitalized patients with diarrhea. Open Forum Infect Dis 9:ofac069. doi: 10.1093/ofid/ofac069 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Miles-Jay A, Snitkin ES, Lin MY, Shimasaki T, Schoeny M, Fukuda C, Dangana T, Moore N, Sansom SE, Yelin RD, Bell P, Rao K, Keidan M, Standke A, Bassis C, Hayden MK, Young VB. 2023. Longitudinal genomic surveillance of carriage and transmission of Clostridioides difficile in an intensive care unit. Nat Med 29:2526–2534. doi: 10.1038/s41591-023-02549-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Knight DR, Imwattana K, Kullin B, Guerrero-Araya E, Paredes-Sabja D, Didelot X, Dingle KE, Eyre DW, Rodríguez C, Riley TV. 2021. Major genetic discontinuity and novel toxigenic species in Clostridioides difficile taxonomy. Elife 10:e64325. doi: 10.7554/eLife.64325 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Mullany P, Allan E, Roberts AP. 2015. Mobile genetic elements in Clostridium difficile and their role in genome function. Res Microbiol 166:361–367. doi: 10.1016/j.resmic.2014.12.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Fortier LC. 2018. Bacteriophages contribute to shaping Clostridioides (Clostridium) difficile species. Front Microbiol 9:2033. doi: 10.3389/fmicb.2018.02033 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Gerding DN, Johnson S, Rupnik M, Aktories K. 2014. Clostridium difficile binary toxin CDT: mechanism, epidemiology, and potential clinical importance. Gut Microbes 5:15–27. doi: 10.4161/gmic.26854 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Gerding DN, Meyer T, Lee C, Cohen SH, Murthy UK, Poirier A, Van Schooneveld TC, Pardi DS, Ramos A, Barron MA, Chen H, Villano S. 2015. Administration of spores of nontoxigenic Clostridium difficile strain M3 for prevention of recurrent C. difficile infection: a randomized clinical trial. JAMA 313:1719–1727. doi: 10.1001/jama.2015.3725 [DOI] [PubMed] [Google Scholar]
  • 21. Carlson PE, Walk ST, Bourgis AET, Liu MW, Kopliku F, Lo E, Young VB, Aronoff DM, Hanna PC. 2013. The relationship between phenotype, ribotype, and clinical disease in human Clostridium difficile isolates. Anaerobe 24:109–116. doi: 10.1016/j.anaerobe.2013.04.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Walk ST, Micic D, Jain R, Lo ES, Trivedi I, Liu EW, Almassalha LM, Ewing SA, Ring C, Galecki AT, Rogers MAM, Washer L, Newton DW, Malani PN, Young VB, Aronoff DM. 2012. Clostridium difficile ribotype does not predict severe infection. Clin Infect Dis 55:1661–1668. doi: 10.1093/cid/cis786 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Aitken SL, Alam MJ, Khaleduzzaman M, Walk ST, Musick WL, Pham VP, Christensen JL, Atmar RL, Xie Y, Garey KW. 2015. In the endemic setting, Clostridium difficile ribotype 027 is virulent but not hypervirulent. Infect Control Hosp Epidemiol 36:1318–1323. doi: 10.1017/ice.2015.187 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Pettit LJ, Browne HP, Yu L, Smits WK, Fagan RP, Barquist L, Martin MJ, Goulding D, Duncan SH, Flint HJ, Dougan G, Choudhary JS, Lawley TD. 2014. Functional genomics reveals that Clostridium difficile Spo0A coordinates sporulation, virulence and metabolism. BMC Genomics 15:160. doi: 10.1186/1471-2164-15-160 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Awad MM, Johanesen PA, Carter GP, Rose E, Lyras D. 2014. Clostridium difficile virulence factors: insights into an anaerobic spore-forming pathogen. Gut Microbes 5:579–593. doi: 10.4161/19490976.2014.969632 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Fishbein SR, Robinson JI, Hink T, Reske KA, Newcomer EP, Burnham C-A, Henderson JP, Dubberke ER, Dantas G. 2022. Multi-omics investigation of Clostridioides difficile-colonized patients reveals pathogen and commensal correlates of C. difficile pathogenesis. Elife 11:e72801. doi: 10.7554/eLife.72801 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Dubberke ER, Reske KA, Hink T, Kwon JH, Cass C, Bongu J, Burnham C-A, Henderson JP. 2018. Clostridium difficile colonization among patients with clinically significant diarrhea and no identifiable cause of diarrhea. Infect Control Hosp Epidemiol 39:1330–1333. doi: 10.1017/ice.2018.225 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Baym M, Kryazhimskiy S, Lieberman TD, Chung H, Desai MM, Kishony R. 2015. Inexpensive multiplexed library preparation for megabase-sized genomes. PLoS One 10:e0128036. doi: 10.1371/journal.pone.0128036 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Bolger AM, Lohse M, Usadel B. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30:2114–2120. doi: 10.1093/bioinformatics/btu170 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Wick RR, Judd LM, Gorrie CL, Holt KE. 2017. Unicycler: resolving bacterial genome assemblies from short and long sequencing reads. PLoS Comput Biol 13:e1005595. doi: 10.1371/journal.pcbi.1005595 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Gurevich A, Saveliev V, Vyahhi N, Tesler G. 2013. QUAST: quality assessment tool for genome assemblies. Bioinformatics 29:1072–1075. doi: 10.1093/bioinformatics/btt086 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Bushnell B. 2014. BBMap: a fast, accurate, splice-aware aligner. Available from: https://www.osti.gov/servlets/purl/1241166
  • 33. Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. 2015. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res 25:1043–1055. doi: 10.1101/gr.186072.114 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Ondov BD, Treangen TJ, Melsted P, Mallonee AB, Bergman NH, Koren S, Phillippy AM. 2016. Mash: fast genome and metagenome distance estimation using MinHash. Genome Biol 17:132. doi: 10.1186/s13059-016-0997-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Kurtz S, Phillippy A, Delcher AL, Smoot M, Shumway M, Antonescu C, Salzberg SL. 2004. Versatile and open software for comparing large genomes. Genome Biol 5:R12. doi: 10.1186/gb-2004-5-2-r12 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Richter M, Rosselló-Móra R. 2009. Shifting the genomic gold standard for the prokaryotic species definition. Proc Natl Acad Sci U S A 106:19126–19131. doi: 10.1073/pnas.0906412106 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Seemann T. 2010. Mlst. Available from: https://github.com/tseemann/mlst
  • 38. Jolley KA, Maiden MCJ. 2010. BIGSdb: scalable analysis of bacterial genome variation at the population level. BMC Bioinformatics 11:595. doi: 10.1186/1471-2105-11-595 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. Seemann T. 2014. Prokka: rapid prokaryotic genome annotation. Bioinformatics 30:2068–2069. doi: 10.1093/bioinformatics/btu153 [DOI] [PubMed] [Google Scholar]
  • 40. He M, Sebaihia M, Lawley TD, Stabler RA, Dawson LF, Martin MJ, Holt KE, Seth-Smith HMB, Quail MA, Rance R, Brooks K, Churcher C, Harris D, Bentley SD, Burrows C, Clark L, Corton C, Murray V, Rose G, Thurston S, van Tonder A, Walker D, Wren BW, Dougan G, Parkhill J. 2010. Evolutionary dynamics of Clostridium difficile over short and long time scales. Proc Natl Acad Sci U S A 107:7527–7532. doi: 10.1073/pnas.0914322107 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Carter GP, Lyras D, Allen DL, Mackin KE, Howarth PM, O’Connor JR, Rood JI. 2007. Binary toxin production in Clostridium difficile is regulated by CdtR, a LytTR family response regulator. J Bacteriol 189:7290–7301. doi: 10.1128/JB.00731-07 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Tonkin-Hill G, MacAlasdair N, Ruis C, Weimann A, Horesh G, Lees JA, Gladstone RA, Lo S, Beaudoin C, Floto RA, Frost SDW, Corander J, Bentley SD, Parkhill J. 2020. Producing polished prokaryotic pangenomes with the panaroo pipeline. Genome Biol 21:180. doi: 10.1186/s13059-020-02090-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Price MN, Dehal PS, Arkin AP. 2009. FastTree: computing large minimum evolution trees with profiles instead of a distance matrix. Mol Biol Evol 26:1641–1650. doi: 10.1093/molbev/msp077 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44. Yu G. 2020. Using ggtree to visualize data on tree-like structures. Curr Protoc Bioinformatics 69:e96. doi: 10.1002/cpbi.96 [DOI] [PubMed] [Google Scholar]
  • 45. Seemann T. 2015. Snippy: fast bacterial variant calling from NGS reads. Available from: https://github.com/tseemann/snippy
  • 46. Danecek P, Bonfield JK, Liddle J, Marshall J, Ohan V, Pollard MO, Whitwham A, Keane T, McCarthy SA, Davies RM, Li H. 2021. Twelve years of SAMtools and BCFtools. Gigascience 10:giab008. doi: 10.1093/gigascience/giab008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47. Alves F, Castro R, Pinto M, Nunes A, Pomba C, Oliveira M, Silveira L, Gomes JP, Oleastro M. 2022. Molecular epidemiology of Clostridioides difficile in companion animals: genetic overlap with human strains and public health concerns. Front Public Health 10:1070258. doi: 10.3389/fpubh.2022.1070258 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Tisza MJ, Belford AK, Domínguez-Huerta G, Bolduc B, Buck CB. 2021. Cenote-taker 2 democratizes virus discovery and sequence annotation. Virus Evol 7:veaa100. doi: 10.1093/ve/veaa100 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Nayfach S, Camargo AP, Schulz F, Eloe-Fadrosh E, Roux S, Kyrpides NC. 2021. CheckV assesses the quality and completeness of metagenome-assembled viral genomes. Nat Biotechnol 39:578–585. doi: 10.1038/s41587-020-00774-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50. Kieft K, Zhou Z, Anantharaman K. 2020. VIBRANT: automated recovery, annotation and curation of microbial viruses, and evaluation of viral community function from genomic sequences. Microbiome 8:90. doi: 10.1186/s40168-020-00867-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51. Roux S, Adriaenssens EM, Dutilh BE, Koonin EV, Kropinski AM, Krupovic M, Kuhn JH, Lavigne R, Brister JR, Varsani A, et al. 2019. Minimum information about an uncultivated virus genome (MIUViG). Nat Biotechnol 37:29–37. doi: 10.1038/nbt.4306 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52. Fishbein SRS, Hink T, Reske KA, Cass C, Struttmann E, Iqbal ZH, Seiler S, Kwon JH, Burnham CA, Dantas G, Dubberke ER. 2021. Randomized controlled trial of oral vancomycin treatment in Clostridioides difficile-colonized patients. mSphere 6:e00936-20. doi: 10.1128/mSphere.00936-20 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53. Lees JA, Galardini M, Bentley SD, Weiser JN, Corander J. 2018. Pyseer: a comprehensive tool for microbial pangenome-wide association studies. Bioinformatics 34:4310–4312. doi: 10.1093/bioinformatics/bty539 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54. Seemann T. 2021. snp-dists. Available from: https://github.com/tseemann/snp-dists
  • 55. Williamson CHD, Stone NE, Nunnally AE, Roe CC, Vazquez AJ, Lucero SA, Hornstra H, Wagner DM, Keim P, Rupnik M, Janezic S, Sahl JW. 2022. Identification of novel, cryptic Clostridioides species isolates from environmental samples collected from diverse geographical locations. Microb Genom 8:000742. doi: 10.1099/mgen.0.000742 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56. Lyon SA, Hutton ML, Rood JI, Cheung JK, Lyras D. 2016. CdtR regulates TcdA and TcdB production in Clostridium difficile. PLoS Pathog 12:e1005758. doi: 10.1371/journal.ppat.1005758 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57. Dong Q, Lin H, Allen M-M, Garneau JR, Sia JK, Smith RC, Haro F, McMillen T, Pope RL, Metcalfe C, Burgo V, Woodson C, Dylla N, Kohout C, Sundararajan A, Snitkin ES, Young VB, Fortier L-C, Kamboj M, Pamer EG. 2023. Virulence and genomic diversity among clinical isolates of ST1 (BI/NAP1/027) Clostridioides difficile. bioRxiv:2023.01.12.523823. doi: 10.1101/2023.01.12.523823 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58. Zheng Y, Luo Y, Lv Y, Huang C, Sheng Q, Zhao P, Ye J, Jiang W, Liu L, Song X, Tong Z, Chen W, Lin J, Tang Y-W, Jin D, Fang W. 2017. Clostridium difficile colonization in preoperative colorectal cancer patients. Oncotarget 8:11877–11886. doi: 10.18632/oncotarget.14424 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59. Jain T, Croswell C, Urday-Cornejo V, Awali R, Cutright J, Salimnia H, Reddy Banavasi HV, Liubakka A, Lephart P, Chopra T, Revankar SG, Chandrasekar P, Alangaden G. 2016. Clostridium difficile colonization in hematopoietic stem cell transplant recipients: a prospective study of the epidemiology and outcomes involving toxigenic and nontoxigenic strains. Biol Blood Marrow Transplant 22:157–163. doi: 10.1016/j.bbmt.2015.07.020 [DOI] [PubMed] [Google Scholar]
  • 60. Kamboj M, Gennarelli RL, Brite J, Sepkowitz K, Lipitz-Snyderman A. 2019. Risk for Clostridiodes difficile infection among older adults with cancer. Emerg Infect Dis 25:1683–1689. doi: 10.3201/eid2509.181142 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61. Claro T, Daniels S, Humphreys H. 2014. Detecting Clostridium difficile spores from inanimate surfaces of the hospital environment: which method is best?. J Clin Microbiol 52:3426–3428. doi: 10.1128/JCM.01011-14 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62. Kumar N, Miyajima F, He M, Roberts P, Swale A, Ellison L, Pickard D, Smith G, Molyneux R, Dougan G, Parkhill J, Wren BW, Parry CM, Pirmohamed M, Lawley TD. 2016. Genome-based infection tracking reveals dynamics of Clostridium difficile transmission and disease recurrence. Clin Infect Dis 62:746–752. doi: 10.1093/cid/civ1031 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63. Kiersnowska ZM, Lemiech-Mirowska E, Michałkiewicz M, Sierocka A, Marczak M. 2022. Detection and analysis of Clostridioides difficile spores in a hospital environment. Int J Environ Res Public Health 19:15670. doi: 10.3390/ijerph192315670 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64. Morgan DJ, Dubberke ER, Hink T, Paszkiewicz G, Burnham C-A, Pineles L, Magder L, Johnson JK, Leekha S, Harris AD. 2023. The impact of universal glove and gown use on Clostridioides difficile acquisition: a cluster-randomized trial. Clin Infect Dis 76:e1202–e1207. doi: 10.1093/cid/ciac519 [DOI] [PubMed] [Google Scholar]
  • 65. Miles-Jay A, Young VB, Pamer EG, Savidge TC, Kamboj M, Garey KW, Snitkin ES. 2021. A multisite genomic epidemiology study of Clostridioides difficile infections in the USA supports differential roles of healthcare versus community spread for two common strains. Microb Genom 7:000590. doi: 10.1099/mgen.0.000590 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66. Snydman DR, McDermott LA, Jenkins SG, Goldstein EJC, Patel R, Forbes BA, Johnson S, Gerding DN, Thorpe CM, Walk ST. 2020. Epidemiologic trends in Clostridioides difficile isolate ribotypes in United States from 2011 to 2016. Anaerobe 63:102185. doi: 10.1016/j.anaerobe.2020.102185 [DOI] [PubMed] [Google Scholar]
  • 67. Diaz OR, Sayer CV, Popham DL, Shen A. 2018. Clostridium difficile lipoprotein GerS is required for cortex modification and thus spore germination. mSphere 3:e00205-18. doi: 10.1128/mSphere.00205-18 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68. Wydau-Dematteis S, El Meouche I, Courtin P, Hamiot A, Lai-Kuen R, Saubaméa B, Fenaille F, Butel M-J, Pons J-L, Dupuy B, Chapot-Chartier M-P, Peltier J. 2018. Cwp19 is a novel lytic transglycosylase involved in stationary-phase autolysis resulting in toxin release in Clostridium difficile. mBio 9:e00648-18. doi: 10.1128/mBio.00648-18 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69. Balsells E, Shi T, Leese C, Lyell I, Burrows J, Wiuff C, Campbell H, Kyaw MH, Nair H. 2019. Global burden of Clostridium difficile infections: a systematic review and meta-analysis. J Glob Health 9:010407. doi: 10.7189/jogh.09.010407 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70. Seekatz AM, Wolfrum E, DeWald CM, Putler RKB, Vendrov KC, Rao K, Young VB. 2018. Presence of multiple Clostridium difficile strains at primary infection is associated with development of recurrent disease. Anaerobe 53:74–81. doi: 10.1016/j.anaerobe.2018.05.017 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental figures and tables. mbio.03300-23-s0001.pdf.

Fig. S1-S5; Tables S1-S3.

DOI: 10.1128/mbio.03300-23.SuF1

Data Availability Statement

The data sets generated and analyzed during the current study are available in NCBI GenBank under BioProject accession no. PRJNA980715.


Articles from mBio are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES