Abstract
The mucosa-associated microbiota is widely recognized as a potential trigger for Crohn’s disease pathophysiology but remains largely uncharacterised beyond its taxonomic composition. Unlike stool microbiota, the functional characterisation of these communities using current DNA/RNA sequencing approaches remains constrained by the relatively small microbial density on tissue, and the overwhelming amount of human DNA recovered during sample preparation. Here, we have used a novel ex vivo approach that combines microbe culture from anaerobically preserved tissue with metagenome sequencing (MC-MGS) to reveal patient-specific and strain-level differences among these communities in post-operative Crohn’s disease patients. The 16 S rRNA gene amplicon profiles showed these cultures provide a representative and holistic representation of the mucosa-associated microbiota, and MC-MGS produced both high quality metagenome-assembled genomes of recovered novel bacterial lineages. The MC-MGS approach also produced a strain-level resolution of key Enterobacteriacea and their associated virulence factors and revealed that urease activity underpins a key and diverse metabolic guild in these communities, which was confirmed by culture-based studies with axenic cultures. Collectively, these findings using MC-MGS show that the Crohn’s disease mucosa-associated microbiota possesses taxonomic and functional attributes that are highly individualistic, borne at least in part by novel bacterial lineages not readily isolated or characterised from stool samples using current sequencing approaches.
Subject terms: Biomarkers, Microbiome, Next-generation sequencing, Clinical microbiology, Metagenomics
Introduction
A step advance in human microbiome research was catalysed by 16 S rRNA gene amplicon sequencing. For the last two decades, this approach has produced a characterisation of the taxonomic profiles of stool and more recently, the mucosa-associated microbiota (MAM) in health and disease [1]. This is exemplified by the large number of case-control and observational studies of the stool and MAM from new-onset and chronic Crohn’s disease (CD) patients, which invariably show a decrease in bacterial diversity coincident with disease and in comparison to non-CD ‘healthy’ subjects. In contrast, longitudinal studies of disease course and the gut microbiome of CD patients are relatively uncommon. In that context, the Post-Operative Crohn’s Endoscopic Recurrence (POCER) study [2] incorporated a longitudinal taxonomic assessment of the MAM from the time of surgery and post-operatively, which identified both ‘protective’ (Faecalibacterium prausnitzii) and ‘pathogenic’ (Proteus spp.) bacteria in CD remission and recurrence, respectively [3, 4]. Although recent ‘multi-omics’ approaches have provided further insights beyond 16 S rRNA gene-based profiling [5, 6], much still needs to be learned about the microbial biology and functional implications of the MAM and in particular, its involvement with respect to disease location, severity, therapeutic modality (e.g., exclusive enteral nutrition or biologic drugs) and patient immune status [7].
In recent years, metagenomic sequencing (MGS) has greatly empowered gut microbiome studies, by enabling the identification of key microbial functional attributes that are associated with health and disease, including the reconstruction of microbial genomes from these datasets, and species/strain level of resolution of community members (e.g., [8, 9]). However, while MGS has emerged as the preferred approach with samples that are rich in microbial biomass (e.g., stool), constraints remain to the effective use of MGS with DNA from samples with limited microbial density and/or rich in non-microbial (host) DNA. For instance, the total DNA extracted from gut mucosal biopsy samples is often ≥99% host-derived [10], and methods designed to subtractively enrich microbial DNA from these types of samples have limited efficacy and/or still require very large sequencing depths, which is cost prohibitive for many studies [11–15].
Here, we use an ex vivo combination of microbe culture from cryopreserved tissue with shotgun metagenomic sequencing (MC-MGS) to provide a more holistic and functional characterisation of the MAM in post-operative Crohn’s disease patients. The biological veracity of the method was first established via the comparison of the 16 S rRNA gene amplicon profiles produced from DNA extracted directly from biopsy tissues, with those produced from the microbial cultures produced from patient-matched tissue samples. Next, we used these same DNA to produce MGS datasets and, despite some improvement in recovery of microbial data following the subtraction of human DNA, the MC-MGS datasets were substantially more comprehensive, revealing that urea metabolism represents a key and diverse metabolic guild in CD-MAM, and the identification of ‘novel’ lineages of bacteria inherent to these communities.
Materials and methods
Samples
Two adjacent pinch biopsy samples were collected from five post-operative CD patients at three sites: at (ANA) and below (BA) the anastomosis site, and from the rectum (REC). One biopsy from each site was immediately transferred into a screw-capped cryotube containing RNAlater. The second biopsy was transferred into a 7 ml glass serum bottle containing 3 ml of an anaerobically prepared, sterilized cryopreservative solution containing (per 100 ml): 0.023 g K2HPO4, 0.023 g KH2PO4, 0.023 g (NH4)2SO4, 0.046 g NaCl, 0.01 g MgSO4.7H2O, 0.007 g CaCl2, 0.1 g NaHCO3, 0.1 g L-cysteine HCl, 0.1 mg resazurin and 30% (v/v) glycerol. The tissue samples within their respective preservative solutions were stored at -80 °C until later use.
Microbial culture from biopsy tissue
Modified M2 medium [16] was used for microbial cultivation, which contained (per 100 ml): 1.0 g NZ-Amine, 0.25 g yeast extract, 0.1 g L-cysteine HCl, 0.1 mg resazurin, 0.045 g K2HPO4, 0.045 g KH2PO4, 0.045 g (NH4)2SO4, 0.09 g NaCl, 0.01 g MgSO4.7H2O, 0.012 g CaCl2 and 10 ml of clarified rumen fluid. In addition, the medium was supplemented with either a ‘diet-based’ carbohydrate mixture (M2Diet) consisting of D-glucose, soluble starch and D-(+)-cellobiose (0.2% w/v each) or ‘host-derived’ carbohydrates (M2Host) consisting of mucin (porcine stomach), L-fucose, N-acetyl-D-galactosamine, N-acetyl-D-glucosamine (0.2% w/v each) and N-acetylneuraminic acid (0.1% w/v). All carbohydrates were sourced as powders from Sigma Aldrich (MO, USA). For primary microbial cultures using matched biopsies collected from each patient at the site of anastomosis, or the rectum, the cryopreserved tissue was aseptically inoculated into Hungate tubes containing 10 ml of M2Diet medium, with an anaerobic atmosphere (N2:CO2:H2, 85:10:5) and maintained at 37oC in a water bath for 24 h. These primary cultures were preserved by adding 3 ml aliquots to 3 ml of the cryopreservative solution as described above, and stored at −80oC. In a later series of experiments, a biopsy collected from each patient below the site of anastomosis was transferred to 10 ml M2Host medium in Hungate tubes, also prepared with an anaerobic atmosphere (N2:CO2:H2, 85:10:5) and maintained at 37oC in a water bath for 24 hours. At the end of 24 h, 0.1 ml of these cultures were transferred into 10 ml of the M2Diet medium, to generate a secondary microbial-culture and 3 ml added to an equal volume of the cryopreservative solution and stored at −80oC. Concurrently, 0.1 ml of the cryopreserved M2Diet cultures described above were used to provide secondary cultures using the M2Host medium. At the end of each culture step, 0.5 ml aliquots were recovered, centrifuged and stored at −30oC before DNA extraction.
DNA extraction and sequencing approaches
DNA was extracted from biopsy tissues and microbial biomass using the repeated bead beating protocol combined with an automated column-based purification system (Maxwell®16) [17] and quantified using the Promega QuantiFluor® dsDNA System (Promega, WI, USA). A 200 ng subsample of the biopsy DNA was then processed with the NEBNext® Microbiome DNA Enrichment Kit (New England Biolabs, MA, USA) according to manufacturers’ instructions. The extracted DNA samples were then sent to the University of Queensland’s Australian Centre for Ecogenomics (www.ecogenomic.org). Here, dual-indexed 16 S rRNA gene amplicon libraries spanning the V4 hypervariable region were constructed with the iTag_515F (GTGYCAGCMGCCGCGGTAA) and iTag_806R (GACTACHVGGGTATCTAATCC) primers. PCR products of ~466 bp were amplified according to the specified workflow with an alteration in polymerase used to substitute NEBNext® Ultra™ II Q5® Mastermix (New England Biolabs #M0544) in standard PCR conditions for 25 cycles. Resulting PCR amplicons were purified using Agencourt AMPure XP beads (Beckman Coulter). Purified DNA was indexed with unique 8 bp barcodes using the Illumina Nextera XT 384 sample Index Kit A-D (Illumina FC-131–1002) in standard PCR conditions for 8 cycles with NEBNext® Ultra™ II Q5® Mastermix. Indexed amplicons were pooled together in equimolar concentrations and sequenced on MiSeq Sequencing System (Illumina) using paired end sequencing with V3 300 bp chemistry in the Australian Centre for Ecogenomics according to manufacturer’s protocol. Positive controls were from a known mock community to monitor for bias in the amplicon library construction, while negative amplification controls were from a mock processed reagent control to monitor for contamination in DNA extraction and library construction. For shotgun metagenomic sequencing (MGS), libraries were constructed using the Illumina Nextera XT DNA Library Preparation Kit (CA, USA), and with MGS sequencing performed with the Illumina NextSeq500 platform as 150 bp paired-end reads, to a sequencing depth of 3 Gbp per sample. Protocols for library preparation, sequencing and data quality assessments follow protocols developed and used by the Australian Centre for Ecogenomics (www.ecogenomic.org).
16 S rRNA gene amplicon sequencing data processing and analyses
The 16 S rRNA gene amplicon sequences were processed using the Quantitative Insights into Microbial Ecology (QIIME) pipeline (version 1.9.1) [18], where forward and reverse sequence reads were joined, demultiplexed and quality filtered using a Phred quality score cut-off of ≥20. The Greengenes database (version 13.8) [19] was then used to remove chimeric sequences through USEARCH (version 6.1.544) [20] and subsequently for assigning Operational Taxonomic Units (OTUs) via open reference picking at 97% sequence identity. The resultant OTU table was then filtered to exclude OTUs below 0.1% relative abundance, and OTUs present in the negative controls. Four samples (CD10_REC_Biopsy, CD10_REC_Enriched, CD14_REC_Enriched and CD33_REC_Enriched) produced <1000 reads each, and were excluded from the subsequent analyses. The filtered OTU table was then normalized using cumulative sum scaling (CSS). The normalized OTU table was used to generate Bray-Curtis, weighted and unweighted UniFrac Principal Coordinates Analysis (PCoA) plots and Sparse Partial Least Squares Discriminant Analysis (sPLS-DA) plots using the Calypso (version 8.72) [21] visualization tool and presence/absence plots were generated using the pheatmaps package (https://CRAN.R-project.org/package=pheatmap) available on the R Studio platform (www.rstudio.com).
Assessment of bacterial DNA enrichment from biopsy DNA
To measure the efficacy of the microbial enrichment step, three independent methods were used. First, we used a qPCR-based quantification of the bacterial 16 S rRNA gene, relative to the copy number of the human β-actin gene present in biopsy DNA before and after microbial enrichment, as described by Shanahan et al. [22]. The qPCR was performed using the QuantStudio™ 7 Flex Real-Time PCR System platform (Applied Biosystems, CA, USA), with the Power SYBR Green Master Mix (Life Technologies, UK). Next, we used two different bioinformatics approaches to quantify enrichment. An initial assessment of the raw, unfiltered sequences was performed using GraftM [23] to provide an absolute count on the number of reads assigned to bacterial/archaeal 16 S rRNA genes. Subsequently, we also used MetaPhlAn2 [24] with the quality-filtered MGS data (as described below) to again provide an absolute count of those reads assigned to specific microbial marker genes present in the different MGS datasets.
MGS data processing
Paired-end raw shotgun sequences generated from ACE were put through Trimmomatic (version 0.36) [25] to remove low quality reads and adaptor sequences. An average of 24 million quality filtered paired-end reads per sample were produced, ranging between 12.7 and 47.4 million reads across all samples, while 2 samples (CD14_REC_Enriched and CD33_REC_Enriched) were excluded due to poor sequencing depth. The quality-filtered paired-end reads were concatenated into a single file and then subjected to analyses via the HUMAaN2 pipeline [26], with taxonomic identification and classification using the MetaPhlAn2 analysis module [24], and with functional characterisation via ChocoPhlAn as the reference pangenome database and UniRef90 [27] as the protein reference database. The identified KEGG Orthologs were mapped to BRITE Hierarchy [28] using custom R scripts, while the between-class analysis (BCA) based on BRITE Hierarchy profiles and Monte Carlo permutation tests was performed using the ‘Ade4’ package available on R. The representative taxonomy and functional plots and heatmaps were generated using the ‘pheatmaps’ package available on R.
Construction of Metagenome-assembled genomes (MAGs)
Quality-filtered paired-end shotgun sequence reads from the MC-MGS datasets (n = 6 per patient) were used to create patient-specific metagenome assemblies using MEGAHIT (version 1.1.1) [29]. Contiguous sequences (contigs) were assembled using BamM (version 1.7.3, available at: http://ecogenomics.github.io/BamM/) and metagenomic binning was performed using MetaBAT (version 0.32.4) [30]. Estimation of the completeness and contamination of the MAGs was performed using CheckM (version 1.0.7) [31]. High-quality MAGs (≥90% completeness and ≤5% contamination scores) and mid-quality MAGs (50–89% completeness and ≤ 10% contamination scores) were then uploaded to Pathosystems Resource Integration Center (PATRIC) [32] to determine whole-genome nucleotide similarity estimation using MinHash (Mash) [33] based on hierarchical clustering to public genomes available on PATRIC (Mash distance < 0.05). Gene annotations were performed using Prokka [34].
Comparative genomics analyses of MAGs
The Genome Taxonomy Database Toolkit (GTDB-Tk) (version 1.1.1) [35] was used to predict whole genome phylogeny and taxonomic classification of the 47 high-quality MAGs based on the concatenation of 120 bacterial marker genes. The resultant output was then used to build a phylogenetic tree using FastTree (version 2.1.10) [36]. The JSpeciesWS webserver [37] was used for the pairwise genome comparison of Average Nucleotide Identity (ANI) percentages between the Escherichia coli MAGs. The E. coli MAGs were then functionally annotated based on KEGG Orthology [28] and were subsequently split into CD remission (Rutgeerts score i0/i1) or CD recurrence (Rutgeerts score i2) groups to determine differential KOs via the EnrichM comparative genomics pipeline (version 0.4.9) [38].
Isolation of urease-positive bacteria
Glycerol stocks of M2Diet cultures from the anastomotic biopsies of patients CD10, CD33 and CD34 were subjected to a ten-fold serial dilution and plated onto MacConkey (Oxoid Ltd, UK) and/or CLED (Cysteine-, Lactose-, Electrolyte deficient) agar (Oxoid Ltd, UK) under aerobic conditions. Single colonies were picked and replated to test for urease-positivity with Urea Agar Base plates with added Phenol Red; and positivity was confirmed by the coordinate pH shift and colour change surrounding these individual colonies. Positive candidate urease-producing colonies were replated onto habitat-simulating M2Diet agar to produce axenic cultures and taxonomically identified using Sanger sequencing based on the 16 S rRNA gene.
Results
In this study, anastomotic and rectal biopsies from five CD patients were collected 6 months post-surgery during colonoscopy as part of the POCER study [3] and transferred to either RNAlater (for DNA extraction), or a sterile, anaerobically prepared glycerol buffer as a cryopreservative for MC-MGS and stored at −80oC. A subsample of the DNA extracted directly from biopsy tissue samples was also subjected to microbial DNA enrichment. This additional step resulted in a twofold to eightfold increase in microbial DNA relative to total biopsy DNA in four out of the five patient samples, as measured by the qPCR-based quantification and two independent bioinformatics approaches as outlined in the Materials and Methods section (Table 1). However, the impact of the DNA enrichment step on the capture of the functional context of the MAM was small (Supplementary Table 2). The mean number of gene families identified by UniRef90 increased from 18,959 to 23,315 for the MGS datasets produced from biopsy DNA pre- and post-enrichment, respectively. In contrast though, the MC-MGS datasets produced on average 78,704 gene families. Interestingly, while the MC-MGS data did not contain human DNA sequences, approximately one-third of the dataset remained unassigned after both taxonomic and functional assignment. These findings suggest our microbial cultures are comprised not only of strains closely related to the reference genomes used in the HUMAaN2 pipeline, but a relatively large proportion of strains possessing novel microbial functionalities and gene diversity.
Table 1.
Patient ID | Fold-increase in microbial reads following NEBNext treatment | ||
---|---|---|---|
qPCR | GraftM | MetaPhlAn2 | |
CD10 | 2.2 | 5.7 | 7.3 |
CD12 | 8.4 | 5.9 | 8.8 |
CD14 | 2.6 | 0.9 | 0.2 |
CD33 | 5.2 | 6.8 | 6.9 |
CD34 | 5.2 | 5.7 | 5.2 |
All the cultures propagated from the matched biopsy tissue samples produced vigorous growth on both types of media used in this study. After 24 h incubation at 37oC, a subsample of these cultures were either stored in the same cryopreservative solution, or microbial biomass was harvested and total DNA extracted and subjected to 16 S rRNA gene amplicon profiling, as described above. The taxonomic profiles produced from these three different DNA samples are shown in Fig. 1, and were comprised of bacterial taxa typically reported in similar studies [1–4]. Furthermore, the community profiles arising from the M2Diet or M2Host cultures showed a high degree of similarity to those produced directly from total and microbe-enriched biopsy DNA. These findings are further illustrated by the PCoA plots based on OTU-level Bray–Curtis dissimilarity (Fig. 2) as well as the unweighted and weighted UniFrac distances (Supplementary Fig. 1), where the patient-based rather than method-based clustering of the MAM profiles generated from three types of DNA preparations are clearly evident. Importantly, the profiles confirmed the cultures possess a diverse collection of anaerobic bacteria including Lachnospiraceae, Ruminococcaceae, Oscillospira, Unc_Veillonellaceae, as well as Clostridium, Bacteroides and Fusobacterium. We also compared the relative abundances of the bacterial taxa present in microbial cultures to those produced from total biopsy DNA. The significant differences are shown in Supplementary Table 3, and show the microbial cultures produced from the 5 subjects do possess small reductions in the relative abundances of some taxa (e.g., Dorea spp. from subject CD10 and CD34, and Blautia spp. from subjects CD14, CD33 and CD34), whereas others show an increase in their relative abundance (e.g., Streptococcus spp. in CD14 and Enterococcus spp. and Unclassified_Enterobacteriaceae in CD33). However, the changes observed do not suggest there is a broad scale loss, or overgrowth, of bacterial taxa from the initial consortia generated from biopsy tissue. In summation, these results show our approach provides an excellent recovery and retention of the MAM from each CD patient at least upon their initial culture, and thereby, these cultures are amenable to MGS to provide a deep functional understanding of the MAM in these patients.
To that end, all the DNA samples were then sequenced to a depth of 3 Gbp, and the MetaPhlAn2 analysis of these datasets showed that the subtractive enrichment of microbial DNA prior to sequencing did increase the number of microbial reads for 4/5 samples (Table 1). However, the relative abundance of reads produced from host DNA in both types of samples remained very high (~99%, Supplementary Table 2). As such, the comprehensiveness and resolution of the MGS data of microbial origin produced at this sequencing depth remain a major constraint to community characterisation. In that context, and unlike the 16 S rRNA gene amplicon profiles, the MetaPhlAn2 analyses shows the microbial diversity recovered in the MGS data from each type of DNA preparation are dramatically different in terms of their richness (MC-MGS > microbe-enriched MGS > total biopsy MGS, Fig. 3). Only the MC-MGS data retained the inter-subject variations evident from the 16 S rRNA gene amplicon profiles, and this was also evident in the profiles of the HUMAaN2-identified functional pathways at the KEGG BRITE Hierarchy level. The microbial communities recovered from the 5 subjects appear functionally quite similar for many categories (Fig. 4), and although we have not subjected to a thorough statistical analysis due to limited sample numbers, the Principle Component Analysis (PCA) plot generated using Between-Class Analysis (BCA) of the identified KEGG BRITE hierarchy functional pathways (Fig. 5) clearly shows the functional profiles are not only patient-specific but also significantly different to each other (Monte Carlo test, p = 0.014).
The 3 Gbp sequencing depth also supported the recovery of 47 unique, high-quality Metagenome Assembled Genomes (MAGs, Table 2) and another 11 mid-quality MAGs (Supplementary Table 4) from the MC-MGS data. The phylogenetic tree of these 47 MAGs generated using GTDB bacterial marker genes revealed 3 MAGs are phylogenetically distinct (Fig. 6). Further analysis using GTDB-Tk showed these 3 MAGs were either unable to be taxonomically identified by FastANI, or have ANI scores of <95% relative to their closest reference genome. One of these MAGs represents a novel lineage within the order Bacillales, which by GTDB annotation is proposed to be Massilibacillaceae fam. nov. The other two are novel lineages currently affiliated with the Absiella and Fusobacterium genera (Supplementary Table 5).
Table 2.
CD10 | CD12 | CD14 | CD33 | CD34 |
---|---|---|---|---|
Bacteroides fragilisa | Anaerostipes hadrusa | Bacteroides vulgatus | Acidaminococcus intestini | Bacteroides uniformisa |
Coprobacillus sp. | Bacteroides ovatusa | Clostridium nexile | Bacteroides ovatus | Clostridium nexile |
Enterococcus faecium | Bacteroides vulgatus | Erysipelatoclostridium ramosum | Enterocloster clostridioformis | Coprobacillus sp. |
Escherichia coli | Blautia wexlerae | Clostridium sp. | Coprococcus sp.a | Enterococcus faecalis |
Morganella morganii | Clostridium innocuum | Escherichia coli | Enterococcus faecalis | Enterococcus faecium |
Parabacteroides distasonisa | Clostridium nexile | Novel Erysipelotrichaceae bacterium | Enterococcus faecium | Escherichia colia |
Streptococcus anginosus | Erysipelatoclostridium ramosum | Novel Fusobacterium sp. | Escherichia coli | Eubacterium rectale |
Clostridium sp. (1) | Ruminococcus gnavusa | Parabacteroides distasonisa | Novel Unclassified MAG | |
Clostridium sp. (2) | Streptococcus agalactiae | Sutterella wadsworthensisa | Streptococcus lutetiensis | |
Escherichia colia | Streptococcus parasanguinis | |||
Peptostreptococcus anaerobius | ||||
Streptococcus anginosus |
High quality MAGs are considered to be those assemblies predicted by CheckM to possess ≥90% completeness and <5% contamination. The MAGs annotated in bold are confirmed to possess urease genes; those annotated with apossess genes encoding accessory functions involved with urea metabolism. Three MAGs were determined to be of novel lineages using the GTDB-Tk genomic analysis tool (Supplementary Table 5).
We also constructed a reference dataset using the 47 high quality MAGs in Table 2 and mapped the filtered individual reads of the MC-MGS cultures from each subject to this dataset using the default settings in Bowtie2 [39]. The number of reads that show some homology with the MAGs ranged from 69 to 99% (Supplementary Table 6) indicative of the MAGs being truly representative of the microbial diversity present in the patient biopsies. In summary, when considered along with our results using the MGS data for taxonomic (Fig. 3) and functional (Figs. 4 and 5) analyses, both the MC-MGS data and the MAGs show the consortia now in culture represent a diverse range of gut bacteria that are not only consistent with the predictions generated by the 16 S rRNA gene amplicon sequencing, but also include previously uncultured bacteria.
The high-quality Escherichia coli MAGs recovered from all 5 CD patient biopsy samples provided the opportunity to more closely examine the genetic heterogeneity represented in this patient group. Importantly, the pairwise ANI comparisons among these MAGs ranged between 96 and 98% suggesting none of the E. coli MAGs are clonal in origin and represented strain-based variations between each other (Supplementary Table 7). On that basis, we then used EnrichM in an attempt to compare the E. coli MAG recovered from subject CD33 (Rutgeerts score of i2) with the E. coli MAGs from the other patients (Rutgeerts scores of i0/i1). Although this grouping is somewhat arbitrary, it is interesting to note this analysis revealed the E. coli MAG from subject CD33 differs from the others, in particular by its possession of 2 copies of KEGG ortholog K20444 (encoding rfbC, an O-antigen biosynthesis protein) and the sitABC manganese/iron transport system (Supplementary Table 8).
In view of these findings, aliquots (0.1 ml) of the glycerol stocks of MC-MGS cultures from the anastomotic biopsies of patients CD10, CD33 and CD34 were subjected to a ten-fold serial dilution and plated onto MacConkey and/or CLED (Cysteine-, Lactose-, Electrolyte deficient) agar under aerobic conditions. Single colonies were picked and replated to test for urease-positivity with Urea Agar Base plates with added Phenol Red; and positivity was confirmed by the coordinate pH shift and colour change surrounding these individual colonies. From these initial efforts, more than 30 urease-positive isolates were recovered from these selective cultures, and by 16 S rRNA gene amplicon sequencing include strains of Klebsiella pneumoniae, Escherichia fergusonii, Morganella morganii and Enterococcus faecium. Isolates confirmed to be Proteus mirabilis were also produced from the single patient (CD34) previously found to possess this bacterium.
Discussion
As expected, the yields of microbial DNA recovered from mucosal tissue biopsies using the direct tissue-based extraction and enrichment methods, and MC-MGS, are very different. However and importantly, these differences had limited effects on the 16 S rRNA gene amplicon profiles produced from the DNA preparations of matched tissue samples, showing that the patient-specific community structure was at least retained, and even augmented by the MC-MGS approach. In their study of treatment naïve pediatric CD subjects, Gevers et al. [1] reported that the 16 S rRNA gene amplicon profiles from the mucosal biopsies was not greatly different between collection sites from the same individual, which is consistent with our findings here, using either DNA extracted directly from biopsy tissue or via microbial culture using tissue from 3 different sites.
The 16 S rRNA gene amplicon profiles (Fig. 1) also show that the MAM of all patients possess bacterial taxa assigned to unclassified lineages of Erysipelotrichaceae, Lachnospiraceae, Ruminococcaceae and Enterobacteriaceae; as well as Bacteroides and Ruminococcus spp. In contrast, the large amounts of human-derived DNA present in both total and microbe-enriched DNA preparations from biopsy tissue (>90%) restricted both the amount and resolution of the MGS data to assess the MAM. With limited exceptions (Ruminococcus torques and Ruminococcus gnavus for CD10, and some Lachnospiraceae for CD33) the MC-MGS approach supported the improved resolution and recovery of viable bacteria representing all these lineages (Fig. 3). Subject CD33 appears to have a specific expansion in Bacteroides spp. diversity (e.g., B. dorei and B. massiliensis) and the MGS data shows all the biopsy samples from this patient contain a strain of Acidaminococcus intestini, suggesting that it is a notable member of the MAM, as is E. clostridioformis, which have all recently been associated with the IBD-MAM and/or specific IBD phenotypes [40–42]. Given that CD33 is the only subject diagnosed with mild post-operative recurrence at the time of tissue collection, we note these differences as evidence to further investigate the potential role(s) of these bacterial taxa in CD recurrence.
Importantly, the HUMAaN2 data statistics as well as the de novo MAG assemblies from the MC-MGS datasets show the CD mucosa-associated microbiota contains heretofore unclassified bacteria, present at varying levels of abundance. For instance, all the MC-MGS datasets identify, albeit at low levels (~0.1%) an unclassified strain of the Subdoligranulum lineage, which was recently shown to be reduced in IBD and a key node of the multi-omic IBD functional networks presented by Lloyd-Price et al. [5]. We were also able to recover two (one high-quality and one medium-quality) MAGs from two separate patient samples that were taxonomically affiliated with the family Erysipelotrichaceae. The involvement of the Erysipelotrichaceae in gastrointestinal health and disease remains contradictory [43], which largely arises from the dearth of cultured isolates providing greater resolution of this expansive bacterial family. For instance, taxa assigned to the Erysipelotrichaceae were found to be increased in the intestinal lumen of colorectal cancer patients versus healthy controls [44], and in fecal samples from a mouse model of Crohn’s disease [45]. Furthermore, Clostridium innocuum are affiliated with the Erysipelotrichacea and has been recently implicated in the development of ‘creeping fat’ as a complication associated with CD strictures and fibrosis [46]. In contrast, reductions in the relative abundance of Erysipelotrichaceae were reported from patients with both new-onset and recurrent CD [1]. Such disparities will remain and may be due to the lack of cultured representatives, leading to the poor taxonomic characterisation of taxa belonging to the Erysipelotrichaceae family. This further highlights the capacity of MC-MGS in not only providing a comprehensive assessment of the gut microbiota community, but also produces valuable resources that facilitate their initial enrichment and opportunities for the axenic cultivation of novel bacteria.
With specific reference to the Enterobacteriaceae, the MC-MGS approach verified the increased richness of this lineage in the MAM communities of these patients predicted from the 16 S rRNA gene amplicon profiles, and includes Morganella morganii, Proteus mirabilis, E. coli and K. pneumoniae. Recent multi-omics assessment of IBD shows both E. coli and K. pneumoniae are governors of key functional networks that are increased in IBD [5], the latter of which is a recognized producer of urease activity (e.g., [47]. We have previously reported as part of the POCER study that detectable levels of Proteus spp. are associated with an elevated risk of CD recurrence [4] and more recently, demonstrated a key role for P. mirabilis in CD inflammation [48]. Notably, Proteus spp. are now phylogenetically assigned to the Morganellaceae fam. nov. of which M. morganii is a type species [49]. A unifying metabolic trait across members of these bacterial taxa is urea metabolism, which has now been identified as a key functional change coincident with CD dysbiosis; and the inoculation of mice with recombinant, urease-positive strains of E. coli MP1 can promote gut dysbiosis through a bloom in Proteobacteria abundance [50]. Using this metabolic guild as a selective strategy as described in the Methods, we recovered from the biopsy outgrowth cultures a variety of urease-positive Proteobacteria using including Morganella morganii (subject CD10), Proteus mirabilis (subject CD34) and Klebsiella pneumoniae (CD34). The Prokka-based annotations of the MAGs further showed that 14/47 possess genes encoding either for urease, and/or genes encoding accessory functions putatively involved with urea metabolism (e.g., urea transporters, hydrogenase-urease nickel incorporation protein[s], and transcriptional activators). Based on these collective findings, the MC-MGS approach resulted in the production of deep sequence data and MAGs, but also axenic bacterial isolates representing key metabolic guilds, and thereby provides a variety of resources to study the role of the MAM in digestive health and disease.
In conclusion, we have shown here that an ex vivo combination of microbial culture with metagenomic sequencing enables a cost- and time-effective strategy to produce a deep, functional understanding of the mucosa-associated microbiota, where tissue samples contain a relatively small microbial load and/or large quantities of non-microbial (host) DNA. These communities are not only an excellent representation of the communities predicted by 16 S rRNA gene amplicon sequencing, but support the identification and detailed characterisation of tissue-specific ‘microbial dark matter’, as well as strain-level resolution of bacterial taxa present in these communities. For these reasons, MC-MGS can advance our multi-omics understanding of microbial dysbiosis beyond ecological principles applied to taxonomy-based datasets, and also bring genomes to life in the form of defined consortia and/or axenic isolates to enable greater clarity of the role of mucosa-associated microbiota in host-response, inflammation and inflammatory bowel disease.
Supplementary information
Acknowledgements
The work presented in this manuscript was supported by The Leona M. and Harry B. Helmsley Charitable Trust via the Australasian Gastrointestinal Research Foundation, as part of the Eastern IBD Gut Microbiota (ENIGMA) project. The samples analysed were obtained from the POCER study (ClinicalTrials.gov identifier NCT00989560) led by MAK, and was supported by the Gutsy Group, Gandel Philanthropy, Angior Foundation and Crohns Colitis Australia. The research by JJT, EMB, SK, ECH and MM was carried out at the Translational Research Institute, Woolloongabba, QLD 4102, Australia. The Translational Research Institute is supported by a grant from the Australian Government.
Author contributions
JJT, EMB, SK, ECH, MAK and MM designed the research. JJT, EMB, SK and ECH performed the research. MAK, AH and AWOB contributed materials and their governance, and critical revision of the manuscript for important intellectual content. JJT, EMB and ECH generated and analysed the data. JZ, FZ and QL contributed to the microbe isolation protocols and critical revision of the manuscript for important intellectual content. JC, JJYS, JY, SCN, MAK and MM provided project oversight and critical revision of the manuscript for important intellectual content. JJT, EMB, ECH and MM drafted the paper, and all authors have approved the final version of this article.
Data availability
The raw sequence data and MAGs presented here have been deposited in the NCBI database under BioProject ID PRJNA715634.
Compliance with ethical standards
Conflict of interest
The authors declare no competing interest.
Ethics approval
The samples used in this study were collected with patient consent as part of the Post-Operative Crohn’s Disease Recurrence (POCER) study [2] following the protocols and amendments approved by the Research Governance Unit of the St Vincent’s Hospital Melbourne (HREC-A 077/09) and the University of Queensland’s Human Research Ethics Committee (#2015001954). Detailed clinical metadata of the 5 CD patients can be found in Supplementary Table 1.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: J. J. Teh, E. M. Berendsen
These authors jointly supervised this work: J. Yu, S. C. Ng, M. A. Kamm, M. Morrison
Supplementary information
The online version contains supplementary material available at 10.1038/s41396-021-00991-1.
References
- 1.Gevers D, Kugathasan S, Denson LA, Vazquez-Baeza Y, Van Treuren W, Ren B, et al. The treatment-naive microbiome in new-onset Crohn’s disease. Cell Host Microbe. 2014;15:382–92.. doi: 10.1016/j.chom.2014.02.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.De Cruz P, Kamm MA, Hamilton AL, Ritchie KJ, Krejany EO, Gorelik A, et al. Crohn’s disease management after intestinal resection: a randomised trial. Lancet. 2015;385:1406–17. doi: 10.1016/S0140-6736(14)61908-5. [DOI] [PubMed] [Google Scholar]
- 3.De Cruz P, Kang S, Wagner J, Buckley M, Sim WH, Prideaux L, et al. Association between specific mucosa-associated microbiota in Crohn’s disease at the time of resection and subsequent disease recurrence: a pilot study. J Gastroenterol Hepatol. 2015;30:268–78. doi: 10.1111/jgh.12694. [DOI] [PubMed] [Google Scholar]
- 4.Wright EK, Kamm MA, Wagner J, Teo SM, Cruz P, Hamilton AL, et al. Microbial factors associated with postoperative Crohn’s disease recurrence. J Crohns Colitis. 2017;11:191–203. doi: 10.1093/ecco-jcc/jjw136. [DOI] [PubMed] [Google Scholar]
- 5.Lloyd-Price J, Arze C, Ananthakrishnan AN, Schirmer M, Avila-Pacheco J, Poon TW, et al. Multi-omics of the gut microbial ecosystem in inflammatory bowel diseases. Nature. 2019;569:655–62.. doi: 10.1038/s41586-019-1237-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Franzosa EA, Sirota-Madi A, Avila-Pacheco J, Fornelos N, Haiser HJ, Reinker S, et al. Gut microbiome structure and metabolic activity in inflammatory bowel disease. Nat Microbiol. 2019;4:293–305. doi: 10.1038/s41564-018-0306-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Pizarro TT, Stappenbeck TS, Rieder F, Rosen MJ, Colombel JF, Donowitz M et al. Challenges in IBD Research: Preclinical Human IBD Mechanisms. Inflamm Bowel Dis 2019;25:S5–12. [DOI] [PubMed]
- 8.Patil KR, Haider P, Pope PB, Turnbaugh PJ, Morrison M, Scheffer T, et al. Taxonomic metagenome sequence assignment with structured output models. Nat Methods. 2011;8:191–2. doi: 10.1038/nmeth0311-191. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Pasolli E, Asnicar F, Manara S, Zolfo M, Karcher N, Armanini F, et al. Extensive unexplored human microbiome diversity revealed by over 150,000 genomes from metagenomes spanning age, geography, and lifestyle. Cell. 2019;176:649–62 e20. doi: 10.1016/j.cell.2019.01.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Quince C, Walker AW, Simpson JT, Loman NJ, Segata N. Shotgun metagenomics, from sampling to analysis. Nat Biotechnol. 2017;35:833–44.. doi: 10.1038/nbt.3935. [DOI] [PubMed] [Google Scholar]
- 11.Douglas GM, Hansen R, Jones CMA, Dunn KA, Comeau AM, Bielawski JP, et al. Multi-omics differentially classify disease state and treatment outcome in pediatric Crohn’s disease. Microbiome. 2018;6:13. doi: 10.1186/s40168-018-0398-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Nelson MT, Pope CE, Marsh RL, Wolter DJ, Weiss EJ, Hager KR, et al. Human and extracellular DNA depletion for metagenomic analysis of complex clinical infection samples yields optimized viable microbiome profiles. Cell Rep. 2019;26:2227–40 e5. doi: 10.1016/j.celrep.2019.01.091. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Oechslin CP, Lenz N, Liechti N, Ryter S, Agyeman P, Bruggmann R, et al. Limited correlation of shotgun metagenomics following host depletion and routine diagnostics for viruses and bacteria in low concentrated surrogate and clinical samples. Front Cell Infect Microbiol. 2018;8:375. doi: 10.3389/fcimb.2018.00375. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Marotz CA, Sanders JG, Zuniga C, Zaramela LS, Knight R, Zengler K. Improving saliva shotgun metagenomics by chemical host DNA depletion. Microbiome. 2018;6:42. doi: 10.1186/s40168-018-0426-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Hillmann B, Al-Ghalith GA, Shields-Cutler RR, Zhu Q, Gohl DM, Beckman KB, et al. Evaluating the information content of shallow shotgun metagenomics. mSystems. 2018;3:e00069–18.. doi: 10.1128/mSystems.00069-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Miyazaki K, Martin JC, Marinsek-Logar R, Flint HJ. Degradation and utilization of xylans by the rumen anaerobe Prevotella bryantii (formerly P. ruminicola subsp. brevis) B(1)4. Anaerobe. 1997;3:373–81. doi: 10.1006/anae.1997.0125. [DOI] [PubMed] [Google Scholar]
- 17.Shanahan ER, Zhong L, Talley NJ, Morrison M, Holtmann G. Characterisation of the gastrointestinal mucosa-associated microbiota: a novel technique to prevent cross-contamination during endoscopic procedures. Aliment Pharm Ther. 2016;43:1186–96. doi: 10.1111/apt.13622. [DOI] [PubMed] [Google Scholar]
- 18.Caporaso JG, Kuczynski J, Stombaugh J, Bittinger K, Bushman FD, Costello EK, et al. QIIME allows analysis of high-throughput community sequencing data. Nat Methods. 2010;7:335–6. doi: 10.1038/nmeth.f.303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.DeSantis TZ, Hugenholtz P, Larsen N, Rojas M, Brodie EL, Keller K, et al. Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB. Appl Environ Microbiol. 2006;72:5069–72. doi: 10.1128/AEM.03006-05. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Edgar RC. Search and clustering orders of magnitude faster than BLAST. Bioinformatics. 2010;26:2460–1. doi: 10.1093/bioinformatics/btq461. [DOI] [PubMed] [Google Scholar]
- 21.Zakrzewski M, Proietti C, Ellis JJ, Hasan S, Brion MJ, Berger B, et al. Calypso: a user-friendly web-server for mining and visualizing microbiome-environment interactions. Bioinformatics. 2017;33:782–3. doi: 10.1093/bioinformatics/btw725. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Shanahan ER, Shah A, Koloski N, Walker MM, Talley NJ, Morrison M, et al. Influence of cigarette smoking on the human duodenal mucosa-associated microbiota. Microbiome. 2018;6:150. doi: 10.1186/s40168-018-0531-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Boyd JA, Woodcroft BJ, Tyson GW. GraftM: a tool for scalable, phylogenetically informed classification of genes within metagenomes. Nucleic Acids Res. 2018;46:e59. doi: 10.1093/nar/gky174. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Truong DT, Franzosa EA, Tickle TL, Scholz M, Weingart G, Pasolli E, et al. MetaPhlAn2 for enhanced metagenomic taxonomic profiling. Nat Methods. 2015;12:902–3. doi: 10.1038/nmeth.3589. [DOI] [PubMed] [Google Scholar]
- 25.Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–20. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Franzosa EA, McIver LJ, Rahnavard G, Thompson LR, Schirmer M, Weingart G, et al. Species-level functional profiling of metagenomes and metatranscriptomes. Nat Methods. 2018;15:962–8. doi: 10.1038/s41592-018-0176-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Suzek BE, Wang Y, Huang H, McGarvey PB, Wu CH, UniProt C. UniRef clusters: a comprehensive and scalable alternative for improving sequence similarity searches. Bioinformatics. 2015;31:926–32. doi: 10.1093/bioinformatics/btu739. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Kanehisa M, Sato Y, Kawashima M, Furumichi M, Tanabe M. KEGG as a reference resource for gene and protein annotation. Nucleic Acids Res. 2016;44:D457–62. doi: 10.1093/nar/gkv1070. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Li D, Liu CM, Luo R, Sadakane K, Lam TW. MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph. Bioinformatics. 2015;31:1674–6. doi: 10.1093/bioinformatics/btv033. [DOI] [PubMed] [Google Scholar]
- 30.Kang DD, Froula J, Egan R, Wang Z. MetaBAT, an efficient tool for accurately reconstructing single genomes from complex microbial communities. PeerJ. 2015;3:e1165. doi: 10.7717/peerj.1165. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 2015;25:1043–55. doi: 10.1101/gr.186072.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Wattam AR, Abraham D, Dalay O, Disz TL, Driscoll T, Gabbard JL, et al. PATRIC, the bacterial bioinformatics database and analysis resource. Nucleic Acids Res. 2014;42:D581–91. doi: 10.1093/nar/gkt1099. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Ondov BD, Treangen TJ, Melsted P, Mallonee AB, Bergman NH, Koren S, et al. Mash: fast genome and metagenome distance estimation using MinHash. Genome Biol. 2016;17:132. doi: 10.1186/s13059-016-0997-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Seemann T. Prokka: rapid prokaryotic genome annotation. Bioinformatics. 2014;30:2068–9. doi: 10.1093/bioinformatics/btu153. [DOI] [PubMed] [Google Scholar]
- 35.Chaumeil PA, Mussig AJ, Hugenholtz P, Parks DH. GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database. Bioinformatics 2019;36:1925–27. [DOI] [PMC free article] [PubMed]
- 36.Price MN, Dehal PS, Arkin AP. FastTree 2-approximately maximum-likelihood trees for large alignments. PLoS One. 2010;5:e9490. doi: 10.1371/journal.pone.0009490. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Richter M, Rossello-Mora R, Oliver Glockner F, Peplies J. JSpeciesWS: a web server for prokaryotic species circumscription based on pairwise genome comparison. Bioinformatics. 2016;32:929–31. doi: 10.1093/bioinformatics/btv681. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Boyd JA. Development of meta-omic tools to explore microbial carbon cycling. Doctoral dissertation, The University of Queensland, Brisbane. 2019.
- 39.Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9:357–9. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Altomare A, Putignani L, Del Chierico F, Cocca S, Angeletti S, Ciccozzi M, et al. Gut mucosal-associated microbiota better discloses inflammatory bowel disease differential patterns than faecal microbiota. Dig Liver Dis. 2019;51:648–56.. doi: 10.1016/j.dld.2018.11.021. [DOI] [PubMed] [Google Scholar]
- 41.Butera A, Di Paola M, Vitali F, De Nitto D, Covotta F, Borrini F, et al. IL-13 mRNA tissue content identifies two subsets of adult ulcerative colitis patients with different clinical and mucosa-associated microbiota profiles. J Crohns Colitis. 2020;14:369–80.. doi: 10.1093/ecco-jcc/jjz154. [DOI] [PubMed] [Google Scholar]
- 42.Weng YJ, Gan HY, Li X, Huang Y, Li ZC, Deng HM, et al. Correlation of diet, microbiota and metabolite networks in inflammatory bowel disease. J Dig Dis. 2019;20:447–59.. doi: 10.1111/1751-2980.12795. [DOI] [PubMed] [Google Scholar]
- 43.Kaakoush NO. Insights into the Role of Erysipelotrichaceae in the Human Host. Front Cell Infect Microbiol. 2015;5:84. doi: 10.3389/fcimb.2015.00084. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Chen W, Liu F, Ling Z, Tong X, Xiang C. Human intestinal lumen and mucosa-associated microbiota in patients with colorectal cancer. PLoS One. 2012;7:e39743. doi: 10.1371/journal.pone.0039743. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Schaubeck M, Clavel T, Calasan J, Lagkouvardos I, Haange SB, Jehmlich N, et al. Dysbiotic gut microbiota causes transmissible Crohn’s disease-like ileitis independent of failure in antimicrobial defence. Gut. 2016;65:225–37. doi: 10.1136/gutjnl-2015-309333. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Ha CWY, Martin A, Sepich-Poore GD, Shi B, Wang Y, Gouin K, et al. Translocation of viable gut microbiota to mesenteric adipose drives formation of creeping fat in humans. Cell. 2020;183:666–83 e17. doi: 10.1016/j.cell.2020.09.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Maroncle N, Rich C, Forestier C. The role of Klebsiella pneumoniae urease in intestinal colonization and resistance to gastrointestinal stress. Res Microbiol. 2006;157:184–93. doi: 10.1016/j.resmic.2005.06.006. [DOI] [PubMed] [Google Scholar]
- 48.Zhang J, Hoedt EC, Liu Q, Berendsen E, Teh JJ, Hamilton A, et al. Elucidation of proteus mirabilis as a key bacterium in crohn’s disease inflammation. Gastroenterology. 2020. [DOI] [PubMed]
- 49.Adeolu M, Alnajar S, Naushad S. Genome-based phylogeny and taxonomy of the ‘Enterobacteriales’: proposal for Enterobacterales ord. nov. divided into the families Enterobacteriaceae, Erwiniaceae fam. nov., Pectobacteriaceae fam. nov., Yersiniaceae fam. nov., Hafniaceae fam. nov., Morganellaceae fam. nov., and Budviciaceae fam. nov. Int J Syst Evol Microbiol. 2016;66:5575–99. doi: 10.1099/ijsem.0.001485. [DOI] [PubMed] [Google Scholar]
- 50.Ni J, Shen TD, Chen EZ, Bittinger K, Bailey A, Roggiani M, et al. A role for bacterial urease in gut dysbiosis and Crohn’s disease. Sci Transl Med. 2017;9:eaah6888. [DOI] [PMC free article] [PubMed]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The raw sequence data and MAGs presented here have been deposited in the NCBI database under BioProject ID PRJNA715634.