Skip to main content
mSystems logoLink to mSystems
. 2023 Oct 19;8(6):e00281-23. doi: 10.1128/msystems.00281-23

Targeted rRNA depletion enables efficient mRNA sequencing in diverse bacterial species and complex co-cultures

Kellie A Heom 1,2,#, Chatarin Wangsanuwat 1,2,4,#, Lazarina V Butkovich 1, Scott C Tam 1, Annette R Rowe 3, Michelle A O'Malley 1,2,, Siddharth S Dey 1,2,4,
Editor: Danielle Tullman-Ercek5
Reviewed by: Christina Homberger6
PMCID: PMC10734481  PMID: 37855606

ABSTRACT

Bacterial mRNA sequencing is inefficient due to the abundance of ribosomal RNA that is challenging to deplete. While commercial kits target rRNA from common bacterial species, they are frequently inefficient when applied to divergent species, including those from environmental isolates. Similarly, other methods typically employ large probe sets that tile the entire length of rRNAs; however, such approaches are infeasible when applied to many species. Therefore, we present EMBR-seq+, which requires fewer than 10 oligonucleotides per rRNA by combining rRNA blocking primers with RNase H-mediated depletion to achieve rRNA removal efficiencies of up to 99% in diverse bacterial species. Furthermore, in more complex microbial co-cultures between Fibrobacter succinogenes strain UWB7 and anaerobic fungi, EMBR-seq+ depleted both bacterial and fungal rRNA, with a fourfold improvement in bacterial rRNA depletion compared with a commercial kit, thereby demonstrating that the method can be applied to non-model microbial mixtures. Notably, for microbes with unknown rRNA sequences, EMBR-seq+ enables rapid iterations in probe design without requiring to start experiments from total RNA. Finally, efficient depletion of rRNA enabled systematic quantification of the reprogramming of the bacterial transcriptome when cultured in the presence of the anaerobic fungi Anaeromyces robustus or Caecomyces churrovis. We observed that F. succinogenes strain UWB7 downregulated several lignocellulose-degrading carbohydrate-active enzymes in the presence of anaerobic gut fungi, suggesting close interactions between two cellulolytic species that specialize in different aspects of biomass breakdown. Thus, EMBR-seq+ enables efficient, cost-effective, and rapid quantification of the transcriptome to gain insights into non-model microbial systems.

IMPORTANCE

Microbes present one of the most diverse sources of biochemistry in nature, and mRNA sequencing provides a comprehensive view of this biological activity by quantitatively measuring microbial transcriptomes. However, efficient mRNA capture for sequencing presents significant challenges in prokaryotes as mRNAs are not poly-adenylated and typically make up less than 5% of total RNA compared with rRNAs that exceed 80%. Recently developed methods for sequencing bacterial mRNA typically rely on depleting rRNA by tiling large probe sets against rRNAs; however, such approaches are expensive, time-consuming, and challenging to scale to varied bacterial species and complex microbial communities. Therefore, we developed EMBR-seq+, a method that requires fewer than 10 short oligonucleotides per rRNA to achieve up to 99% rRNA depletion in diverse bacterial species. Finally, EMBR-seq+ resulted in a deeper view of the transcriptome, enabling systematic quantification of how microbial interactions result in altering the transcriptional state of bacteria within co-cultures.

KEYWORDS: bacterial mRNA sequencing, rRNA depletion, non-model microbial sequencing, fungal and bacterial co-cultures, lignocellulose deconstruction

INTRODUCTION

Microbes remain one of nature’s largest and most diverse sources of biochemical transformations. Many bacterial processes, such as their ability to grow on unprocessed feedstocks, are interesting as their mechanisms for releasing sugars or other metabolites could lead to advances in bio-based fuels or other value-added chemicals (14). mRNA sequencing (mRNA-seq) provides an opportunity to gain insights into these biochemical reactions as it enables a quantitative and comprehensive view of the transcriptome, which underlies all biological activity. For example, mRNA-seq has been used to quantify the expression of proteins, transporters, and carbohydrate-active enzymes (CAZymes) that are upregulated by Fibrobacter succinogenes strain S85 during cellulose utilization compared with when grown on glucose (5). Furthermore, F. succinogenes and other cellulolytic strains have been isolated in nature from complex communities, such as the rumen microbiome, and therefore, it is also critical to characterize how interactions between different microbial species mutually tune their expression (68). An important aspect of these microbial communities is understanding how different community members utilize different classes of CAZymes to efficiently deconstruct biomass. CAZymes are classified based on biochemical function or sequence homology into six broad families [glycoside hydrolase (GH), glycosyltransferase (GT), polysaccharide lyase (PL), carbohydrate esterase (CE), auxiliary activities (AA), and carbohydrate binding module (CBM)], and the comparative CAZyme expression profiles of community members can help inform the design of engineered communities for bio-based energy solutions (9, 10). Thus, more broadly, efficient sequencing of mRNA from non-model bacteria in microbial consortia will enable a deeper understanding of the role of each member within complex communities and will provide opportunities to advance microbiome engineering for a variety of agricultural, environmental, or energy-related applications.

However, it is more challenging to perform mRNA-seq on bacteria and especially those that exist in co-cultures. This is because mRNA-seq is inherently inefficient in bacteria, due to limitations associated with removing ribosomal RNA. In eukaryotic species, the mRNA contains a poly-A tail at the 3′-end enabling poly-T primers to selectively reverse transcribe the mRNA and not the rRNA (11, 12). However, prokaryotic mRNA does not have a 3′-end poly-A tail, making it more challenging to uniquely capture mRNA compared with rRNA or other non-coding RNA molecules (13, 14). If left undepleted, rRNA can consume up to 90% of the sequencing data with uninformative reads, resulting in highly inefficient sequencing of mRNA (15, 16). Therefore, efficient, high-throughput and cost-effective rRNA depletion is key to obtaining high-resolution maps of gene expression programs in bacterial species.

To accomplish this, several commercial kits for prokaryotic rRNA depletion have been developed, including the Ribo-Zero Plus Microbiome rRNA Depletion Kit (Illumina), RiboMinus Bacteria 2.0 Transcriptome Isolation Kit (Thermo Fisher), and the NEBNext rRNA Depletion Kit for Bacteria (NEB). These kits leverage sequence-specific rRNA removal, which is highly efficient at depleting the target rRNA (Table S1). However, these kits are limited by the breadth of their probe sets, which restricts their application in diverse microbial systems. Furthermore, the cost of these kits typically ranges from approximately $30 to $80 per sample. In response to the lack of specificity to varied bacterial species and the high cost, many research groups have also developed methods for rRNA depletion. A majority of these methods employ either biotinylated oligonucleotides that hybridize with rRNA that are then removed using streptavidin-coated paramagnetic beads (17, 18) or oligonucleotides that hybridize with rRNA and act as a guide for RNase H-based enzymatic digestion (1921). Similarly, CRISPR-Cas9-mediated degradation of rRNA has also been used to sequence bacterial mRNA (22). However, all these methods use large probe sets, typically up to hundreds of probes per bacterial species, to achieve depletion along the full length of each rRNA subunit. An alternative approach to large oligonucleotide or guide RNA arrays is to use the terminator 5′-phosphate-dependent exonuclease, which selectively degrades rRNA containing 5′-monophosphate ends while having minimal activity on mRNA molecules with 5′-triphosphate ends (23). However, this method has previously been shown to be less efficient at depleting rRNA compared with the other techniques discussed above (24, 25) (Table S1).

To overcome challenges associated with the high cost and the time-intensive activity of developing large probe sets for every new bacterial species profiled, we previously published a method, EMBR-seq (Enriched mRNA by Blocked rRNA) (25), that only requires three to four 30-nt primers per rRNA species (approximately nine primers per bacterial species) for rRNA depletion. EMBR-seq used a combination of 3′-end polyadenylation of prokaryotic mRNA together with specifically designed blocking primers to prevent the downstream amplification of rRNA molecules. In this study, we have developed a significantly improved method, EMBR-seq+, that integrates our previous method with RNase H-based digestion to deplete rRNA at high efficiencies while still employing fewer than 10 oligonucleotides for each rRNA species. In addition, we show that EMBR-seq+ can be easily applied to diverse bacterial species, and compared with EMBR-seq, EMBR-seq+ enables rapid iterations in probe design without having to restart experiments from total RNA to deplete rRNA from microbial species with poorly annotated genomes and unknown rRNA sequences. We demonstrate this in a co-culture system of anaerobic bacteria and fungi native to the rumen microbiome and show that EMBR-seq+ is instrumental in providing deeper coverage of the bacterial transcriptome. In particular, for these bacterial-fungal co-cultures, we found that the RNase H step enabled EMBR-seq+ to outperform both EMBR-seq and RNase H-based depletion alone in terms of rRNA depletion efficiency. Finally, we also investigate transcriptional changes in the bacteria grown under monoculture or co-culture conditions to gain a deeper understanding of how cell-cell interactions between microbial species affect the CAZyme expression profile in these bacteria. In summary, EMBR-seq+ is a highly efficient, high-throughput, and cost-effective platform for sequencing bacterial mRNA that also allows for rapid improvements in probe design to deplete rRNA from poorly annotated and non-model microbial species.

RESULTS

EMBR-seq+ combines blocking primers and RNase H digestion to efficiently deplete rRNA

EMBR-seq+ builds off the EMBR-seq method we have previously developed, in which rRNA depletion is achieved using a small set of short blocking primers designed to hybridize at the 3′-end of 16S, 23S, and 5S rRNA in the bacterial species of interest (Fig. 1A). Briefly, in EMBR-seq, total RNA is polyadenylated in the presence of the rRNA blocking primers to preferentially polyadenylate mRNA. The blocking primers not only target the 3′-end of rRNA but also a few “hotspot” locations that are enriched with reads within the rRNA sequence (Table S2). For E. coli, the hotspots present in the “No depletion” control library are targeted and efficiently depleted in the other libraries using hotspot-specific blocking primers (see inset in Fig. 1B). Next, a poly-T primer containing a sample barcode, Illumina RA5 sequence, and T7 promoter in its overhang is used to reverse transcribe the polyadenylated molecules. Subsequently, the reverse transcription product is converted to cDNA and then amplified by in vitro transcription (IVT) using the T7 polymerase. As a majority of the rRNA molecules are reverse transcribed by the blocking primers instead of the poly-T primers, rRNA-derived cDNA molecules lack a T7 promoter and are therefore excluded from amplification during IVT. The resulting amplified RNA (aRNA) is converted to the final Illumina library by incorporating the Illumina RA3 sequence during a subsequent reverse transcription step followed by addition of the Illumina P5 and P7 adapters during PCR (Fig. 1A).

Fig 1.

Fig 1

EMBR-seq+ depletes rRNA from E. coli total RNA at high efficiency without introducing technical biases using a small number of rRNA-specific probes. (A) Schematic for EMBR-seq+ shows that a combination of blocking primers and RNase H-based targeting is used to deplete rRNA. This panel was generated using Biorender.com. (B) Cumulative percentage of reads mapping along E. coli 16S and 23S rRNA for different depletion methods. Bold lines indicate mean values, and shaded regions indicate the minimum and maximum over three independent experiments. Inverted triangles indicate location of hotspot regions targeted by blocking primers. Insets highlight the depletion of reads from hotspot locations along the 16S and 23S rRNA sequence. (C) Percent of sequencing reads mapping to rRNA and mRNA in E. coli. Bars indicate mean values and error bars indicate standard deviations over three independent experiments. rRNA depletion in the “RNase H,” “EMBR-seq,” and “EMBR-seq+” conditions is all statistically significant from the “No depletion” condition (permutation test, P < 10−5). (D) Number of genes detected above different gene expression thresholds for E. coli. Points indicate mean values, and error bars indicate standard deviations over three independent experiments. (E) Correlation of gene expression between the “No depletion” and “EMBR-seq+” conditions (Pearson’s r = 0.95). Reads per million (RPM) is computed after removal of rRNA reads. The x- and y-coordinates of each point indicate mean values over three independent experiments in the two conditions. For consistent comparison across methods and to control for the variability in sequencing depth across samples, panels B and D show data that have been downsampled to 1 million sequencing reads.

In EMBR-seq+, we introduced an additional RNase H treatment step to specifically target the remaining rRNA-derived molecules that are present in the aRNA after IVT. In this step, 50-nt ssDNA probes are hybridized near the junction of the Illumina RA5 adapter with the rRNA sequence, with additional probes targeting three to four hotspot locations along the rRNA-derived aRNA molecules. The probes form aRNA-DNA duplexes on the rRNA-derived aRNA molecules which are selectively degraded with RNase H, resulting in either the removal of the Illumina RA5 adapter at the 5′ end of these molecules or the generation of very short fragments that are excluded during the downstream PCR amplification or size selection steps, respectively, thereby preventing their incorporation into the final Illumina library. As the RNase H treatment is performed on aRNA molecules to cleave the partial RA5 Illumina adapter, EMBR-seq+ requires only three to four ssDNA probes per rRNA species (approximately nine per bacterial species) compared with most other RNase H-based depletion methods that typically need to tile the entire rRNA molecules with up to hundreds of ssDNA probes per bacterial species (1722). After RNase H treatment, the rRNA-depleted aRNA is reverse transcribed and then amplified with PCR to obtain the final Illumina libraries (Fig. 1A). We tested two enzymes and reaction temperatures: (i) Invitrogen RNase H at 16°C and (ii) Hybridase Thermostable RNase H at 45°C. We found that for both the “RNase H” only and EMBR-seq+ libraries, the rRNA depletion efficiency was very similar with the two RNase H enzymes and decided to proceed with the Invitrogen RNase H for all subsequent experiments (Fig. S1).

As proof-of-principle, we first applied these methods to E. coli where rRNA was depleted to 12.1% of the sequenced reads when treated with RNase H alone and to 3.7% with EMBR-seq and 3% with EMBR-seq+, suggesting that the combined use of blocking primers and RNase H depletion in EMBR-seq+ can remove most of the rRNA-derived molecules from the sequencing libraries (Fig. 1C). This efficiency of rRNA depletion in EMBR-seq+ is similar to that achieved by other methods and commercial kits (Table S1). In the libraries in which only RNase H was used for rRNA depletion, no blocking primers were added during the poly-A tail synthesis and initial reverse transcription step. Similarly, in the “No depletion” control, neither blocking primers nor RNase H depletion was performed. Furthermore, as expected, for the same sequencing depth, increased depletion of rRNA in EMBR-seq+ resulted in the detection of the most unique genes above every expression threshold over three orders of magnitude (Fig. 1D). Finally, gene expression levels quantified by EMBR-seq+ were highly correlated to the “No depletion” libraries (Pearson r = 0.95), suggesting that the use of blocking primers and RNase H in EMBR-seq+ does not introduce technical biases (Fig. 1E; Table S3). Similarly, the transcriptomes obtained from EMBR-seq and EMBR-seq+ were highly correlated, suggesting that RNase H-based depletion in EMBR-seq+ does not introduce technical biases (Table S3). Collectively, these experiments show that EMBR-seq+ can efficiently deplete rRNA from bacterial total RNA without introducing technical artifacts to quantify the transcriptome at high depth.

EMBR-seq+ can be easily applied in distant prokaryotic species

EMBR-seq+ requires fewer than 20 ssDNA oligonucleotides per bacterial species (fewer than 10 blocking primers and fewer than 10 probes for RNase H depletion) to deplete 16S, 23S, and 5S rRNA, and therefore, it is easy and inexpensive to customize the method for any species of interest (Table S2). We tested EMBR-seq+ on Fibrobacter succinogenes strain UWB7, a strain of cellulolytic bacteria isolated from cow rumen, and Geobacter metallireducens, a strain that reduces Fe(III) oxide and can exhibit syntropy with methanogens via direct interspecies electron transfer (2628). When employing EMBR-seq, rRNA was depleted to 10.3% in F. succinogenes strain UWB7 and 3.1% in G. metallireducens while EMBR-seq+ depleted rRNA to 7.6% in F. succinogenes strain UWB7 and 1.1% in G. metallireducens, highlighting the improved efficiency of rRNA depletion when combining blocking primers and RNase H digestion in EMBR-seq+ (Fig. 2A). Furthermore, as with E. coli, custom-designed blocking primers and RNase H probes were key to efficiently depleting rRNA reads from hotspot loci observed in the “No depletion” controls for both F. succinogenes strain UWB7 and G. metallireducens (Fig. 2B; Fig. S2A). Consistent with the improved rRNA depletion, an increased number of unique genes was detected in EMBR-seq+ above every expression threshold for both F. succinogenes strain UWB7 and G. metallireducens (Fig. 2C; Fig. S2B). Finally, when compared with the “No depletion” controls, the blocking primers and RNase H probes did not introduce technical bias in the quantification of gene expression in EMBR-seq+ (Pearson r = 0.94 and r = 0.97 for F. succinogenes strain UWB7 and G. metallireducens, respectively) (Fig. 2D; Fig. S2C; Table S3). These results show that the design of a small number of short probes in EMBR-seq+ can be used to easily extend the method with high efficiency to varied bacterial species.

Fig 2.

Fig 2

EMBR-seq+ can be easily extended to diverse bacterial species. (A) Percent of sequencing reads mapping to rRNA in F. succinogenes strain UWB7 and G. metallireducens for different rRNA depletion methods. Bars indicate mean values and error bars indicate standard deviations over three independent experiments. rRNA depletion in the “RNase H,” “EMBR-seq,” and “EMBR-seq+” conditions is statistically significant from the “No depletion” condition (permutation test, P < 10−5). (B) Cumulative percentage of reads mapping to 16S and 23S rRNA of F. succinogenes strain UWB7 for different rRNA depletion methods. Bold lines indicate mean values, and shaded regions indicate the minimum and maximum over three independent experiments. Inverted triangles indicate the location of hotspots targeted by blocking primers. (C) Number of genes detected above different gene expression thresholds for F. succinogenes strain UWB7. Points indicate mean values, and error bars indicate standard deviations over three independent experiments. (D) Correlation of gene expression between the “No depletion” and “EMBR-seq+” conditions for F. succinogenes strain UWB7 (Pearson’s r = 0.94). RPM is computed after removal of rRNA reads. The x- and y-coordinates of each point indicate mean values over three independent experiments in the two conditions. For consistent comparison across methods and to control for the variability in sequencing depth across samples, panels B and C show data that have been downsampled to 1 million sequencing reads.

EMBR-seq+ enables efficient prokaryotic mRNA sequencing in a fungal-bacterial co-culture system

Next, we aimed to apply EMBR-seq+ to more complex systems that mimic microbial communities in nature, such as co-cultures of bacteria with different fungal species. As an example, anaerobic organisms isolated from the rumen microbiome are particularly interesting because of their ability to grow on raw, unprocessed substrates such as grass and depolymerize lignocellulose, a feature attributed to carbohydrate-active enzymes found in their genomes. F. succinogenes is an extensively studied cellulolytic bacterial species, because it is frequently identified in rumen samples and found to encode a large number of CAZymes in its genome (5, 6, 26, 2932). More recently, anaerobic gut fungal genomes have also been discovered to be exceptionally abundant with CAZymes (3335). As these species exist in consortia in nature rather than in isolation, understanding interactions between different species is key to gaining insights into their function in native environments and for engineering stable microbial systems in vitro (3638). A previous study examined transcriptomic changes to the anaerobic gut fungi Anaeromyces robustus or Caecomyces churrovis when cultured together with F. succinogenes strain UWB7 compared with monoculture conditions and identified several novel components of the fungal secondary metabolism that were upregulated in co-culture (37). This study focused primarily on the remodeling of the fungal transcriptome induced by the presence of bacteria; however, this work did not characterize transcriptional changes in the bacteria F. succinogenes strain UWB7 in co-culture with A. robustus due to two technological limitations associated with (i) the low abundance of bacterial RNA compared with the fungi and 2(ii) the lack of efficient bacterial rRNA depletion.

To overcome these limitations, we focused on capturing the bacterial transcriptome by applying EMBR-seq+ to two independent co-cultures of F. succinogenes strain UWB7 with different anaerobic gut fungal partners, A. robustus or C. churrovis (34, 39, 40). In our initial co-culture experiments with F. succinogenes strain UWB7 and A. robustus, we applied EMBR-seq to total RNA extracted from co-culture pellets [denoted as “Unprocessed total RNA (EMBR-seq)”] to find only 11.8% of the mapped reads derived from the bacteria with the remaining 88.2% of reads derived from the fungi (Fig. 3A and B). We hypothesized that this large excess of fungal reads in our data set is derived from an initial overabundance of fungal RNA in the co-culture total RNA pellets and a lack of targeting to deplete fungal RNA in EMBR-seq+. As our goal was to characterize the transcriptome of F. succinogenes strain UWB7 within the co-culture at high resolution, we aimed to deplete the fungal mRNA and rRNA. We first used poly-T magnetic beads to deplete polyadenylated fungal mRNA prior to applying EMBR-seq, resulting in a library [denoted as “Poly-A depleted (EMBR-seq)”] with 24.9% bacteria reads.

Fig 3.

Fig 3

EMBR-seq+ efficiently depletes rRNA of F. succinogenes strain UWB7 grown in co-culture with A. robustus. (A) Schematic of fungal mRNA and rRNA removal from co-culture-derived total RNA samples for more efficient bacterial mRNA sequencing. This panel was generated using Biorender.com. (B) Percentage of reads mapping to the bacterial and fungal genomes using the three strategies described in panel A: total RNA isolated from co-culture pellets treated with EMBR-seq to remove bacterial rRNA [“Unprocessed total RNA (EMBR-seq)”], fungal poly-adenylated mRNA-depleted total RNA treated with EMBR-seq to remove bacterial rRNA [“Poly-A depleted (EMBR-seq)”], and “Poly-A depleted (EMBR-seq)” library additionally treated with RNase H to remove fungal and bacterial rRNA [“Poly-A depleted & RNase H treated (EMBR-seq+)”]. For the “Unprocessed total RNA (EMBR-seq)” library, n = 1. For “Poly-A depleted (EMBR-seq)” and “Poly-A depleted & RNase H treated (EMBR-seq+)” libraries, n = 3. Bars indicate mean values, and error bars indicate standard deviation from the mean. (C) Cumulative percentage of sequencing reads from the “Poly-A depleted (EMBR-seq)” library that align to the C. churrovis genome scaffold 672. The three rRNA hotspots were inferred to be the 3′-ends of 18S, 28S, and 5.8S rRNA for C. churrovis. Black inverted triangles and italicized text indicate target locations of the five RNase H probes used for fungal rRNA depletion in the EMBR-seq+ libraries. (D) Percent of sequencing reads mapping to the rRNA of F. succinogenes strain UWB7 for different depletion methods when grown in co-culture with A. robustus or C. churrovis. All four samples are poly-A depleted to remove fungal mRNA. Bars indicate mean values, and error bars indicate standard deviation over three independent experiments. For co-cultures of F. succinogenes strain UWB7 and A. robustus, rRNA depletion in the “RNase H,” “EMBR-seq,” and “EMBR-seq+” conditions is statistically significant from the “No depletion” condition (permutation test, P < 0.05). For co-cultures of F. succinogenes strain UWB7 and C. churrovis, rRNA depletion in the “RNase H,” and “EMBR-seq+” conditions is statistically significant from the “No depletion” condition (permutation test, P < 0.05). (E) Volcano plot of differentially expressed genes of F. succinogenes strain UWB7 grown in monoculture vs. co-culture with A. robustus, for the “No depletion” condition and after rRNA depletion using EMBR-seq+. n = 3 for both culture conditions and for both the library preparation conditions. Colored points indicate differentially expressed genes, determined by thresholds of |log2 (fold change)| > 0.8, Padj < 0.05, and RPM > 2. (F) Heatmap of differentially expressed CAZymes in F. succinogenes strain UWB7 grown in monoculture vs. co-culture with A. robustus. Genes are colored on the left side of the heatmap based on the CAZyme family they are associated with.

To increase the fraction of bacterial reads beyond the 2.1-fold achieved over “Unprocessed total RNA (EMBR-seq),” we next attempted to deplete the fungal rRNA. However, as with many non-model microbes, a major challenge was that the rRNA sequences were not annotated for either of the fungal genomes. Therefore, we first mapped the sequencing data from the “Poly-A depleted (EMBR-seq)” sample against the reference genome scaffolds and identified a series of adjacent pileups of mapped reads on the same scaffold that were spaced apart according to the expected sizes of the 18S, 28S, and 5.8S eukaryotic rRNA subunits (Fig. 3C). Thus, we inferred that the three genomic loci on this scaffold correspond to the fungal rRNA subunits and designed five RNase H probes to specifically deplete the RNA transcribed from these locations (Table S2). We also aligned our probe sequences against the rRNA annotations of another closely related fungal species, Orpinomyces sp. OUS1, and confirmed that they align to the 3′-ends of 18S, 28S, and 5.8S rRNA (41). By combining the upstream poly-A bead-based fungal mRNA depletion prior to EMBR-seq+ with the RNase H-based fungal rRNA depletion during EMBR-seq+ [denoted as “Poly-A depleted & RNase H treated (EMBR-seq+)”], we were able to successfully increase the percentage of bacterial reads in the final sequencing libraries to 56.8%, a 1.75-fold improvement over previous work (Fig. S3A) (37). We hypothesize that we still obtained reads mapping to the fungal genome due to the following reasons. First, the RNase H probes to deplete fungal rRNA were designed based on putative rRNA sequences resulting from our scaffold mapping-based approach. The lack of experimentally validated rRNA annotation may have led to suboptimal RNase H probe designs and less efficient rRNA depletion. Second, our poly-T pull down may not have captured a small fraction of fungal mRNA, resulting in these molecules being sequenced. Third, we mapped our sequencing reads to the entire fungal genome, rather than the coding transcriptome as these genomes are not well annotated resulting in transcriptome references that are likely incomplete. However, a consequence of this, together with the poly-adenylation step in EMBR-seq+, is that we potentially also sequenced and mapped non-polyadenylated non-coding RNA. Nevertheless, our combined fungal RNA depletion approach enabled us to effectively enrich bacterial RNA from our co-cultures. Furthermore, we also applied these two fungal depletion treatments to the co-culture of F. succinogenes strain UWB7 with C. churrovis to observe a similar enrichment of bacteria-derived reads to 41.3% in EMBR-seq+, suggesting that this is a robust approach to quantify the bacterial transcriptome in complex microbial communities (Fig. S4 and S5). More generally, the fungal rRNA depletion strategy demonstrates that the RNase H digestion component of EMBR-seq+ enables rapid iterations of probe designs; this step is implemented on a small amount of amplified aRNA, an intermediate product in EMBR-seq+, thereby offering a convenient approach to circumvent the need to prepare new libraries from total RNA or re-grow the co-cultures.

After successfully enriching for bacterial RNA from bacteria-fungal co-cultures, we next quantified the depletion of bacterial rRNA within these co-cultures. As before, we observed that EMBR-seq+ resulted in the highest depletion of F. succinogenes strain UWB7 rRNA compared with “RNase H” or “EMBR-seq” for both co-culture systems, demonstrating that the combination of blocking primers and RNase H digestion is highly effective in depleting bacterial rRNA (Fig. 3D). Furthermore, the rRNA depletion efficiency of EMBR-seq+ was significantly improved over a previous transcriptomic study using the same co-culture pairings (5.1- and 3.9-fold increased rRNA depletion for co-cultures with A. robustus and C. churrovis, respectively) (Fig. S3). Similarly, compared with the other methods, we observed that EMBR-seq+ detected the most genes above all expression thresholds and did not introduce technical bias in quantifying gene expression levels (Fig. S6). Overall, these results show that EMBR-seq+ can be easily adapted to efficiently enrich for bacterial mRNA from complex microbial communities.

Differential gene expression between monocultures and co-cultures

After establishing that EMBR-seq+ can provide a detailed view of bacterial gene expression programs within co-cultures, we next wanted to characterize the transcriptional response of F. succinogenes strain UWB7 in the presence of the two different strains of anaerobic fungi. To accomplish this, we used DESeq2 to perform differential gene expression analysis for F. succinogenes strain UWB7 grown in monoculture vs. co-culture. While 140 genes were upregulated for F. succinogenes strain UWB7 in monoculture and 128 genes were upregulated in co-culture with A. robustus for the “No depletion” data sets, we identified 224 upregulated genes in monoculture and 374 upregulated genes in co-culture for the “EMBR-seq+” data sets (Fig. 3E, Fig. S7 and S8). We observed similar robust results for experiments with C. churrovis, with 113 genes upregulated in monoculture and 158 genes upregulated in co-culture for the “No depletion” data sets, while bacterial RNA enriched and rRNA depleted data sets obtained from EMBR-seq+ displayed 247 upregulated genes in monoculture and 149 upregulated genes in co-culture (Fig. S7 and S9a; Table S4). While the relative trends between monoculture and co-culture conditions were consistent, we observed that the biological replicates for the co-cultures showed some variation, possibly due to the increased complexity of the interactions between species that might depend on their relative abundances. Therefore, to ensure consistency, all the monoculture and co-culture replicates were prepared from the same seed cultures of A. robustus, C. churrovis, and F. succinogenes strain UWB7, and we followed the same experimental timeline for all the experiments.

To better understand the deconstruction of carbon substrates by F. succinogenes strain UWB7, we used dbCAN2 (42, 43) to predict CAZyme annotations in the bacterial transcriptome. Within dbCAN2, we performed three predictive searches: (i) HMMER against the dbCAN database, (ii) DIAMOND against the CAZy database, and (iii) HMMR against the dbCAN_sub database. One hundred ninety-one genes were predicted to be CAZymes in at least two out of the three tools, and we found that 189 of these genes were expressed with an RPM of at least 2 in all the monocultures and co-culture replicates. Notably, in co-cultures with both A. robustus and C. churrovis, CAZymes represented a substantial fraction of the differentially regulated genes. For example, of the 191 CAZyme genes, 32 (17%) were differentially expressed, with 17 upregulated in monoculture and 15 upregulated in co-culture with A. robustus (Fig. 3F). In addition to CAZymes, F. succinogenes strain UWB7 also upregulated several molecular chaperones and components of toxin-antitoxin systems when grown in co-culture with either fungal species (Table S5). Finally, many of the transporters found by Swift et al. to be upregulated in co-culture were also upregulated in our EMBR-seq+ data sets (Table S6) (37). Overall, this expression profile highlights elements of stress response in the bacteria in the presence of the anaerobic gut fungi.

Interestingly, in monoculture and co-culture, the bacteria upregulated different subclasses of CAZymes. In monoculture, F. succinogenes strain UWB7 upregulated a number of lignin-active CAZymes in the glycoside hydrolase, polysaccharide lyase, carbohydrate esterase, and carbohydrate binding module subclasses (Fig. 3F; Table S7). Conversely, in co-cultures, the bacteria did not upregulate as many CAZymes in these classes and instead upregulated CAZymes in the glycosyl transferase subclass. While the GH, PL, and CE classes represent enzymes that hydrolyze or cleave different types of bonds found in lignocellulose, the functions of GT enzymes include forming new glycosidic bonds to modify or polymerize sugars (9). The GT enzymes found to be upregulated in the bacteria in co-culture have predicted functions for cell wall biosynthesis, suggesting that the cells could be incorporating sugars in their peptidoglycan layers. Thus, the dramatic transcriptomic shift away from lignocellulose breakdown observed in F. succinogenes strain UWB7 in the presence of anaerobic fungi highlights that the interaction between these two species, which are both known to be efficient lignocellulose degraders, substantially alters the bacterial response. Furthermore, out of the thousands of microbial members identified in the cattle rumen, members of the Fibrobacteres phylum have been found to devote the largest proportion of their proteome towards carbohydrate metabolism; therefore, the downregulation of several lignocellulose-modifying CAZymes in the presence of fungi potentially suggests that A. robustus releases sugars in such great excess that the F. succinogenes strain UWB7 redirects their efforts away from sugar hydrolysis (6, 44). Overall, these findings suggest that F. succinogenes strain UWB7 may not need to reach their full lignocellulose-degrading potential in this co-culture system, and from an engineering perspective, a more efficient bioreactor might pair either the F. succinogenes strain UWB7 or the A. robustus with another organism that has complementary behavior, such as the ability to ferment sugars or produce fatty acid products (10, 45, 46).

In the co-culture experiments with C. churrovis, a similar but less pronounced shift in CAZyme expression profile was observed for F. succinogenes strain UWB7 (Table S7; Fig. S9B). For example, compared with co-culture experiments with A. robustus, fewer GH CAZymes were upregulated in monoculture and a few GH genes were also upregulated in co-culture. These observations potentially reflect that co-cultures with different fungal strains result in altered substrate availability for F. succinogenes strain UWB7. This is consistent with recent work showing that C. churrovis does not release glucose and xylose in concentrations nearly as high as A. robustus, and the lower levels of excess reducing sugars could result in distinct signaling and transcriptome remodeling in the bacteria (46). This is likely due to the fact that C. churrovis lacks the rhizoid system found in most other anaerobic fungi, which limits its ability to degrade fibrous biomass (39, 40). Additionally, F. succinogenes strain UWB7 in co-culture upregulated a few xylan esterase genes CE6 and CBM+CE6 fusion genes. As these genes were not differentially expressed in co-culture with A. robustus, this result agrees with previous findings that C. churrovis is unique from other anaerobic fungi because it is a xylan-degrading specialist (39, 47, 48). Fibrobacteres do not have genes for xylan transport or metabolism, and previous work has suggested that their xylan machinery is used to gain access to cellulose (26); therefore, our result suggests a relationship wherein when grown on switchgrass with C. churrovis, there is either a direct or indirect benefit to the bacteria for expressing xylan esterase genes. These transcriptional responses in the bacteria highlight how high-quality mRNA-seq from EMBR-seq+ provides a detailed perspective of complex co-culture systems and contributes to our understanding of how these species interact both with each other and also with the substrate present.

DISCUSSION

In this study, we have developed EMBR-seq+, a method that depletes rRNA in prokaryotic species at high efficiencies to enable cost-effective and improved mRNA sequencing. To accomplish this, EMBR-seq+ uses a combination of selective polyadenylation on mRNA molecules, blocking primers that prevent downstream amplification of rRNA molecules, and RNase H digestion targeted at rRNA-derived amplified RNA. Importantly, the method is designed to only require a single blocking primer per rRNA sequence at the 3′-end, together with three to four additional blocking primers at “hotspots” and three to four probes to direct RNase H digestion of rRNA. By requiring blocking primers and RNase H probe targets at only a few strategic locations, EMBR-seq+ is significantly easier to implement and less expensive and can be rapidly adapted to any bacterial species, compared with alternate methods that require large primer sets to tile the entire length of rRNA molecules (1922).

A major change that we introduced in EMBR-seq+ compared with EMBR-seq is the inclusion of RNase H digestion downstream of IVT amplification. A key advantage of the RNase H treatment in EMBR-seq+ over other RNase H-based methods is that it is performed on IVT-amplified RNA rather than the starting total RNA; therefore, if the sequencing results from a previous run indicate the continued presence of one or more highly abundant rRNA fragments, additional RNase H probes can be designed and applied to the same IVT-amplified RNA without having to start again from total RNA, thereby significantly reducing experimental time. We demonstrate this approach in the depletion of fungal rRNA in the co-culture experiments (Fig. 3). Additionally, while many early bacterial single-cell RNA-seq methods lacked an rRNA depletion step, more recent reports employ CRISPR-Cas9-mediated rRNA depletion or the use of probes to only capture mRNA transcripts of interest (4953). However, as the RNase H-based digestion strategy described in this work requires only a few probes per rRNA species, in the future, it could readily be integrated with single-cell RNA-seq methods that involve IVT-based amplification (5457). In addition, the use of blocking primers to deplete rRNA can potentially also be scaled down to single cells as we previously showed that EMBR-seq efficiently depletes rRNA from as little as 0.02 ng input total RNA (25). This corresponds to a pool of ~200 single bacteria (assuming ~0.1 pg of RNA per bacterial cell) and is in line with the sensitivity requirements of the existing methods that employ combinatorial barcoding strategies (49, 51, 56, 57). Therefore, we anticipate that in the future EMBR-seq+ could potentially be applied to single cells, thereby adding to the growing toolkit for profiling individual bacterial cells.

In this study, we also showed that EMBR-seq+ can be applied to complex microbial communities, such as a co-culture system of anaerobic bacteria and fungi native to the rumen microbiome to deplete rRNA efficiently from both species. In a previous study, the Ribo-Zero Gold rRNA Removal Kit (Epidemiology) was used for rRNA depletion from the same bacterial and fungal co-cultures; however, rRNA reads contributed to more than 90% of the sequencing data, thereby requiring significantly higher sequencing depths to obtain sufficient coverage of the fungal and bacterial transcriptomes (37). This reduced depletion efficiency potentially arises from the kit’s lack of specificity towards non-model microbes due to sequence divergence, highlighting the importance of methods such as EMBR-seq+ that can be rapidly adapted to any species. Furthermore, in the aforementioned study, co-cultures of A. robustus and F. succinogenes strain UWB7 were sequenced to a depth of ~15 million reads to obtain only 20,000 bacterial mRNA reads, making it impractical to perform differential expression analysis on the bacterial RNA. In contrast, EMBR-seq+ libraries were sequenced to a depth of ~12 million reads to obtain ~1.9 million F. succinogenes strain UWB7 mRNA reads, representing a dramatic improvement in quantifying the bacterial transcriptome (Fig. S3). Overall, this demonstrates that the small number of probes required in EMBR-seq+ enables rapid application of this method to varied microbial species while retaining high rRNA depletion efficiency.

Finally, to gain new insights into how F. succinogenes strain UWB7 is affected in co-culture with anaerobic gut fungi, we used EMBR-seq+ to quantify transcriptional changes in the bacteria when transitioned from monoculture to co-culture conditions. In previous work, Swift et al. designed this co-culture system for the bacteria to exert pressure on the fungi and induce the production of fungal secondary metabolites; however, we found that the bacteria also exhibit a substantial stress response and changes to its gene expression program (37). We observed a transcriptional shift in the bacterial CAZyme profile, marked by a decrease in the expression of lignocellulose-active CAZymes in the presence of the anaerobic fungi. In agreement with this, a previous study showed that microbiome samples enriched in anaerobic fungi (including A. robustus) release excess reducing sugars in the growth supernatant that potentially acts as a nutrient source for other organisms in its environment (44, 46). Similarly, this is consistent with another previous finding where in fungal-dominated microbiome cultures (achieved by antibiotic treatment), antibiotic-resistant bacteria were found to survive on reed canary grass, despite the absence of CAZymes in their genomes (47). Thus, in general, efficient rRNA depletion using EMBR-seq+ enables a deeper view of the bacterial transcriptome to gain functional insights into the response of bacteria in complex cultures. Given the low cost, ease of design and efficiency of our method, we anticipate that high-throughput screens employing EMBR-seq+ will offer a strategy for rapidly understanding microbial interactions in diverse environmental systems.

MATERIALS AND METHODS

E. coli cell culture

Overnight cultures of Escherichia coli MG1655 (ATCC: 700926) were inoculated into fresh LB medium at 1:1,000 and grown at 37°C with shaking (225 rpm). After 4 h, the cells reached the exponential growth phase (verified by OD600 measurement) and the culture was centrifuged at 3,000 × g and 4°C for 10 min. The media were removed, and the pellet was resuspended in PBS to a concentration of 107 cells/µL. For RNA extraction, 5 × 107 cells were added to 750 µL TRIzol, mixed, and stored at −20°C.

G. metallireducens culture

Cultures of Geobacter metallireducens GS-15 (DSM 7210) were grown at 30°C in an anaerobic basal salt media (1.3 mM KCl, 4.6 mM NH4Cl, 4.4 mM NaH2PO4, and 30 mM NaHCO3) amended with 56 mM iron citrate and 20 mM acetate. Wolf’s vitamin mix (from DSMZ protocol 141) and metal reducer mineral mix (described by Bretschger et al.) were added from 100× stock to 1× final concentrations (58). All cultures were grown in 20 mL volumes in anaerobic serum vials from 1% culture transfers. Cultures were maintained in DMSO at −80°C when not in use. For harvesting the cells, 1 mL of culture was centrifuged at 8,000 × g and 4°C for 5 min to obtain a pellet. The media was removed and the pellet was resuspended in 750 µL of TRIzol, mixed, and stored at −20°C.

F. succinogenes strain UWB7 monoculture and co-culture with anaerobic gut fungi

Fibrobacter succinogenes strain UWB7 monoculture and co-culture with anaerobic gut fungi were prepared based on the experimental protocol described by Swift et al. (37). F. succinogenes strain UWB7 was previously gifted by Garret Suen at the University of Wisconsin-Madison, with isolation of the strain described by Neumann and Suen (59). The anaerobic gut fungi Anaeromyces robustus and Caecomyces churrovis were previously isolated via reed canary grass enrichment from fecal pellets of a Churro sheep at the Santa Barbara Zoo (33, 34, 39).

F. succinogenes strain UWB7, A. robustus, and C. churrovis were cultivated at 39°C using standard anaerobic techniques (60). Cultures were prepared with a CO2 headspace in Hungate tubes containing 9.0 mL of a modified formulation (MC-) of complex medium C (61), containing 0.25 g/L yeast extract, 0.5 g/L Bacto Casitone, and 7.5 vol% clarified rumen fluid, along with 0.1 g/L of substrate: Avicel PH-101 (Sigma-Aldrich, St. Louis, MO) or milled switchgrass (gift from the U.S. Department of Agriculture). Avicel PH-101 substrate was used for bacterial passaging, whereas switchgrass substrate was used for fungal passaging. After autoclaving, the MC- medium was supplemented with 0.1% (vol/vol) vitamin solution (ATCC, catalogue no. MD-VS) per 10 mL of final culture volume, as described by Teunissen (62). Each passaging Hungate tube was inoculated with 1.0 mL of cryostock or live inoculum culture, with cultures passaged anaerobically using the sterile syringe needle technique every 2–4 days.

In order to inoculate the F. succinogenes strain UWB7 monocultures and co-cultures, seed cultures of F. succinogenes strain UWB7, A. robustus, and C. churrovis were prepared. The F. succinogenes strain UWB7 seed culture was prepared by adding 1.0 mL of passaging tube inoculum to a serum bottle containing 40 mL of MC- and 0.4 g of Avicel PH-101. The seed culture of F. succinogenes strain UWB7 was incubated at 39°C for 24 h for co-culture inoculum and incubated for a total of 48 h for UWB7 monoculture inoculum. A. robustus and C. churrovis seed cultures were prepared by adding 1.0 mL of appropriate passaging tube inoculum to serum bottles containing 40 mL of MC- and 0.4 g of switchgrass. The fungal seed cultures were incubated at 39°C for 24 h before 1.0 mL inocula was used to inoculate co-culture Hungate tubes with 8.0 mL MC- and 0.1 g switchgrass (t = 0 h) in triplicate. The gut fungi were allowed to grow at 39°C for 24 h (t = 24 h) before 1.0 mL of F. succinogenes strain UWB7 seed inoculum was introduced. The A. robustus-F. succinogenes strain UWB7 co-cultures and C. churrovis-F. succinogenes strain UWB7 co-cultures were incubated for an additional 48 h (t = 72 h) before samples were harvested for RNA-seq. These experiments were performed in triplicate. F. succinogenes strain UWB7 monocultures were prepared by adding 1.0 mL seed culture inocula to Hungate tubes with 9.0 mL MC- and 0.1 g switchgrass. F. succinogenes strain UWB7 monocultures were then incubated at 39°C for 24 h before samples were harvested for RNA-seq. This experiment was performed in triplicate. The final contents of each co-culture and monoculture Hungate tube were transferred to 15 mL Falcon tubes (Fisher Scientific, Waltham, MA) and centrifuged at 3,200 × g and 4°C using a swinging-bucket rotor (Eppendorf A-4-81) for 5 min. Supernatants were decanted, and 1.0 mL of RNA preservative was added (63, 64). Cell pellets were flash frozen and stored at −80°C until RNA extraction.

TRIzol RNA extraction for E. coli and G. metallireducens cultures

TRIzol (Thermo Fisher Scientific, Cat. # 15596018) RNA extractions were performed following the manufacturer’s protocol. Briefly, 750 µL TRIzol containing cell lysates were thawed and mixed with 150 µL chloroform. After centrifugation, the clear aqueous layer was recovered and precipitated with 375 µL of isopropanol and 0.67 µL of GlycoBlue (Thermo Fisher Scientific, Cat. # AM9515). The pellet was washed twice with 75% ethanol, and after the final centrifugation, the resulting pellet was air dried and resuspended in nuclease-free water.

Liquid N2 grinding and RNeasy RNA extraction for F. succinogenes strain UWB7 monoculture and co-culture

For RNA extraction, cell pellets were thawed on ice and centrifuged at 3,200 × g and 4°C using a swinging-bucket rotor for 10 min. RNA was extracted from F. succinogenes strain UWB7 monoculture and co-culture pellets using the RNeasy Mini Kit (Qiagen, Germantown, MD) following the protocol for purification of total RNA from plant cells and tissues and filamentous fungi, using the liquid N2 method of cell lysis, QIAshredder homogenization, on-column DNase I digestion, and elution in 30 µL of RNase-free water.

EMBR-seq+

Overview

EMBR-seq libraries were prepared according to the protocol described in Wangsanuwat et al. (25). As input material for all libraries, 100 ng of total RNA was used. For the libraries denoted “EMBR-seq” and “EMBR-seq+,” blocking primers were added during the polyadenylation step and reverse transcription step. For the libraries denoted “No depletion” and “RNase H,” water was added instead of blocking primers during these two steps. In the libraries denoted “RNase H” and “EMBR-seq+,” the RNase H treatment was performed as described below. In the libraries denoted “No depletion” and “EMBR-seq,” the RNase H treatment was skipped; the amplified RNA product of in vitro transcription was used directly for the library reverse transcription step as described in Wangsanuwat et al. (25).

Blocking primer design for G. metallireducens and F. succinogenes strain UWB7

All the primers used in this study are listed in Table S2. The 16S, 23S, and 5S rRNA sequences were downloaded for G. metallireducens GS-15 (NCBI database ASM1292v1) and F. succinogenes strain UWB7 (JGI IMG database Taxon ID 2703719236). Primers were designed according to the following specifications: (i) reverse complement to the reference sequences at the 3′-end of each rRNA subunit, (ii) initial primer length of approximately 30 nucleotides, and (iii) initial primer length modified to ensure a Tm greater than 65°C.

Initial EMBR-seq libraries were prepared with a 50 µM blocking primer mix consisting only of the three 3′-end primers (16S, 23S, and 5S). Based on the preliminary sequencing results from these libraries, the following approach was used to determine the three to four most frequently sequenced internal rRNA genomic locations for which hotspot primers were designed. First, the reads mapping to rRNA were visualized as a cumulative distribution plot and the top 50 genomic mapping coordinates with the largest step increase in the cumulative distribution were identified. Next, these 50 locations were sorted from 5′- to 3′-end and clustered into groups based on natural breaks in the genomic coordinates. Finally, the sum of mapped reads per cluster was computed and the clusters were sorted based on this sum from highest to lowest. The top clusters with the greatest contribution to rRNA sequencing reads were identified, and the coordinates farthest downstream (towards the 3′-end) within each cluster were selected as hotspot targets. Typically, three to four hotspot targets were selected per rRNA subunit, as this offered a balance between efficient rRNA depletion and ease of primer design.

Hotspot primers were designed to bind the selected locations for additional rRNA depletion. Hotspot primers were designed according to the following specifications: (i) reverse complement to the reference sequence for each rRNA subunit, (ii) located approximately 100 nucleotides downstream (towards the 3′-end) from the start of the hotspot sequence, (iii) initial primer length of approximately 30 nucleotides, and (iv) initial primer length modified and/or initial primer position shifted upstream or downstream by a few bases to reach a Tm greater than 65°C.

To ensure that the rRNA depletion probes do not result in non-specific binding, the probe sequences were aligned against the transcriptome. BlastN was used to align EMBR-seq blocking primers to each gene of the transcriptome. To minimize non-specific binding, primers were generally designed to have a BlastN E-value less than 1 and a sequence similarity less than 15 nucleotides of consecutive matches, following the results of a previous study (21).

RNase H probe design

RNase H probes bind to aRNA, the IVT product, which has a sequence complementary to the original template transcriptome. Therefore, RNase H probes are designed to have the same sequence as the rRNA references. The locations of the RNase H probes were selected in the neighborhood of the 3′-end and hotspot blocking primers with a length of 50 nucleotides. The RNase H probe design can also be easily modified to target any additional abundant fragments found in prior sequencing libraries.

Preparation of blocking primer mix

Blocking primers for the species of interest (Table S2) were individually resuspended to 100 µM. The primers were mixed such that the three 3′-end primers together were 50 µM in the final primer mix, and all the hotspot primers together were 50 µM in the final primer mix, for a total primer mix concentration of 100 µM.

Polyadenylation

Two microliters of total RNA (50 ng/µL) was mixed with 1 µL of 5× first-strand buffer [250 mM Tris-HCl (pH 8.3), 375 mM KCl, 15 mM MgCl2, comes with Superscript II Reverse Transcriptase, Invitrogen Cat. # 18064014], 1 μL of blocking primer mix, 0.1 µL 10 mM ATP, and 0.1 µL E. coli poly-A polymerase (New England Biolabs, Cat. # M0276S). The samples were incubated at 37°C for 10 min. In the “No depletion” and “RNase H” libraries, 1 µL of nuclease-free water was added instead of the blocking primer mix.

Reverse transcription

The polyadenylation product was mixed with 0.5 µL 10 mM dNTPs (New England Biolabs, Cat. # N0447L), 1 µL reverse transcription primers (25 ng/µL; Table S2), and 1.3 µL blocking primer mix and heated to 65°C for 5 min and 58°C for 1 min and then quenched on ice. In the “No depletion” and “RNase H” libraries, 1.3 µL of nuclease-free water was added instead of the blocking primer mix. This product was then mixed with 1.2 µL 5× first strand buffer, 1 µL 0.1 M DTT, 0.5 µL RNaseOUT (Thermo Fisher Scientific, Cat. # 10777019), and 0.5 µL Superscript II Reverse Transcriptase and then incubated at 42°C for 1 h. Immediately afterward, the samples were heat inactivated at 70°C for 10 min.

Second-strand synthesis

The reverse transcription product was mixed with 33.5 µL nuclease-free water, 12 µL 5× second-strand buffer [100 mM Tris-HCl (pH 6.9), 23 mM MgCl2, 450 mM KCl, 0.75 mM β-NAD, 50 mM (NH4)2SO4; Invitrogen, Cat. # 10812014], 1.2 µL 10 mM dNTPs, 0.4 µL E. coli ligase (Invitrogen, Cat. # 18052019), 1.5 µL DNA polymerase I (Invitrogen, Cat. # 18010025), and 0.4 µL RNase H (Invitrogen, Cat. # 18021071) and incubated at 16°C for 2 h. The cDNA was purified with 1× AMPure XP DNA beads (Beckman Coulter, Cat. # A63881) and eluted in 24 µL nuclease-free water that was subsequently concentrated to 6.4 µL.

In vitro transcription

The product from the previous step was mixed with 9.6 µL of in vitro transcription mix (1.6 µL of each ribonucleotide, 1.6 µL 10× T7 reaction buffer, 1.6 µL T7 enzyme mix; MEGAscript T7 Transcription Kit, Thermo Fisher Scientific, Cat. # AMB13345) and incubated at 37°C for 13 h. The IVT product was mixed with 6 µL ExoSAP-IT PCR Product Cleanup Reagent (Thermo Fisher Scientific, Cat. # 78200.200.UL) and incubated at 37°C for 15 min. Next, it was treated with 5.5 µL fragmentation buffer [200 mM Tris-acetate (pH 8.1), 500 mM KOAc, 150 mM MgOAc] at 94°C for 3 min and immediately quenched with 2.75 µL stop buffer (0.5 M EDTA) on ice. The fragmented aRNA was size selected with 0.8× AMPure RNA beads (RNAClean XP Kit, Beckman Coulter, Cat. # A63987) and eluted in 18 µL nuclease-free water. In the libraries denoted “No depletion” and “EMBR-seq,” the fragmented and cleaned-up in vitro transcription product was used directly for Illumina library preparation by reverse transcription and PCR, as described previously (25, 55). In the “RNase H” and “EMBR-seq+” libraries, the RNase H treatment was performed before proceeding to Illumina library preparation.

RNase H probe mix and buffer preparation

RNase H probes (Table S2) were individually resuspended to 100 µM and mixed equimolarly, and the mixture was diluted to 10 µM (~155 ng/µL). RNase H buffer mix was prepared by mixing 10 µL of 5× first-strand buffer with 2.7 µL of 5× second strand buffer or scaling up as needed.

RNase H treatment for monocultures

After ExoSAP treatment, RNA fragmentation, and bead-based size selection, the aRNA concentration was measured on a NanoDrop One spectrophotometer. One thousand nanograms of aRNA was mixed with 500 ng (3.2 µL) of RNase H probe mix, 3.2 µL of RNase H buffer mix, and nuclease-free water to reach a total volume of 15 µL for each reaction. This mixture was preheated at 65°C for 5 min and quenched on ice. One microliter of RNase H (Invitrogen, Cat. # 18021071) was added, and the reaction was incubated at 16°C for 30 min. For tests with the thermostable RNase H, 1 µL of Hybridase Thermostable RNase H at 45°C [Lucigen (now Biosearch Technologies), Cat. # H39500] was added and the reaction was incubated at 45°C for 30 min. Finally, the RNase H product was size selected with 1× AMPure RNA beads and eluted in 15 µL nuclease-free water and then concentrated to 5 µL volume for Illumina library preparation.

Enrichment of F. succinogenes strain UWB7 RNA from co-cultures

Fungal mRNA removal

Total RNA extracted from co-culture pellets was concentrated to 30 µL (40–60 ng/μL) and treated with Dynabeads Oligo(dT)25 (Invitrogen Cat # 61002), according to the manufacturer’s protocol. Briefly, each sample was mixed with 30 µL of “Binding Buffer,” incubated at 65°C for 2 min, chilled on ice, mixed with 30 µL of washed oligo(dT)25 beads, and mixed at 300 rpm on a thermomixer at room temperature for 15 min. To prevent the beads from settling in the thermomixer, the samples were mixed by pipetting every 5 min. The reaction tubes were placed on a magnetic stand, and the supernatants (90–95 μL) were transferred to new tubes. The supernatants were purified with 1.8× AMPure RNA beads and eluted in 30 µL of nuclease-free water. This remaining RNA was used for preparing the “Poly-A depleted (EMBR-seq)” libraries.

Determination of fungal rRNA sequences for RNase H depletion

To determine potential rRNA sequences for C. churrovis, FastQC was used to identify the most overrepresented sequences in the “Poly-A depleted (EMBR-seq)” sample, and the top hits were aligned with BLAST against the full C. churrovis genome. The BLAST results suggested that the most overrepresented reads were aligned on scaffold 672.

Next, all the RNA-seq reads from the “Poly-A depleted (EMBR-seq)” libraries were mapped against scaffold 672 of the C. churrovis genome with the Burrows-Wheeler Aligner software package (BWA) (65). This scaffold contained four repeats of three hotspots which appeared to correspond to rRNA subunits, based on the spacing between hotspots (Fig. S5). The four repeats were observed because non-unique mapping was allowed. To produce Fig. 3C, the mapping was repeated on a version of scaffold 672 that is truncated to only the first 10,000 bases (one repeat of the predicted rRNA pattern). RNase H probes were designed according to the locations of the predicted hotspots for the 3′-ends of the 28S, 18S, and 5.8S rRNA subunits in C. churrovis. For RNase H treatment in co-culture EMBR-seq+ libraries, the C. churrovis probes were applied to co-cultures with A. robustus as well. Although they are different fungal species, we verified that the same RNase H target sequences are highly abundant in both A. robustus and C. churrovis genomes, and the 3′-ends of their rRNA are similar enough such that fungal rRNA depletion was efficient in both co-cultures (Fig. S4 and S5).

RNase H reaction for co-cultures

For co-culture samples, the RNase H probe mix was altered to include probes for depleting fungal rRNA along with bacterial rRNA. One thousand nanograms of aRNA derived from co-culture samples was mixed with 850 ng (5.5 µL) of co-culture RNase H probe mix, which consisted of the following: 100 ng each of the two fungal 28S probes with 50 ng each of the two fungal 18S, fungal 5.8S, and 10 bacterial RNase H probes. The aRNA and probes were mixed with 3.2 µL RNase H buffer mix and nuclease-free water to reach a total volume of 15 µL for each reaction. The reactions were preheated at 65°C for 5 min and quenched on ice. One microliter of RNase H was added, and the reaction was incubated at 16°C for 30 min. The RNase H product was size selected with 1× AMPure RNA beads and eluted in 15 µL nuclease-free water and then concentrated to 5 µL for Illumina library preparation.

RNA sequencing and bioinformatics analysis

Paired-end sequencing of all libraries was performed on an Illumina NextSeq 500 or HiSeq 4000 platform. All sequencing data have been deposited to Gene Expression Omnibus under the accession number GSE223404. In the sequencing libraries, read 1 contains the sample barcode (Table S2). Read 2 is mapped to the bacterial or fungal transcriptome. Prior to mapping, read 1 was trimmed to 25 bases, read 2 was trimmed to 51 bases, and only reads containing valid sample barcodes on read 1 were retained. Next, the reads were mapped to the appropriate reference transcriptome using Burrows-Wheeler Aligner (BWA) version 0.7.15 with default parameters. The “CDS from genomic FASTA” reference files for Escherichia coli K12 substr. MG1655 (ASM584v2 cds) and Geobacter metallireducens GS-15 (ASM1292v1 cds) were obtained from the NCBI database. The reference transcriptome for Fibrobacter succinogenes strain UWB7 (Taxon ID 2703719236) was obtained from the JGI IMG database (66). The reference genomes for Anaeromyces robustus and Caecomyces churrovis were obtained from the JGI MycoCosm portal (67). Each bacterial reference transcriptome was modified by appending the 16S, 23S, and 5S rRNA sequences. To determine rRNA content for each sample, the number of reads mapped to the rRNA was taken as a fraction of total reads mapped to the bacteria.

To compare the results of this work to those of Swift et al., raw sequencing reads (fastq files) were retrieved from the Sequencing Read Archive (SRA) for NCBI BioProject PRJNA666900 as specified in Swift et al. (37). Reads 1 and 2 were trimmed to 51 bases and mapped to the reference transcriptomes for F. succinogenes strain UWB7, A. robustus, and C. churrovis using the same BWA pipeline described above.

CAZyme annotations were predicted for the F. succinogenes strain UWB7 transcriptome using dbCAN2 (43). Three tools were run: (i) HMMER: dbCAN, (ii) DIAMOND: CAZy, and (iii) HMMR: dbCAN_sub, and CAZyme annotations were accepted if they were predicted with two out of the three tools.

Differential gene expression analysis was performed using DEseq2 version 1.36.0 (68), with a minimum absolute log2(fold change) of 0.8, statistical significance threshold for an adjusted P value of less than 0.05, and a minimum gene expression level of two reads per million.

To account for biases introduced by EMBR-seq+ in the F. succinogenes strain UWB7 monocultures (Fig. 2D), genes determined to be differentially expressed between the “No depletion” and “EMBR-seq” data set were filtered out and excluded from all analyses between monoculture and co-culture (Fig. 3E and F; Fig. S7 to S9; Tables S4 to S7). DE-Seq2 thresholds for the bias determination were |log2 (fold change)| > 1.5, Padj < 0.05, and RPM > 2. Filtering results are included in Table S4.

Supplementary Material

Reviewer comments
reviewer-comments.pdf (1.1MB, pdf)

ACKNOWLEDGMENTS

We would like to thank members of the Dey and O’Malley groups and Dr. Elizabeth Wilbanks for helpful discussions. The authors thank Dr. Jennifer Smith for the assistance with sequencing Illumina libraries. Sequencing was performed at the Biological Nanostructures Laboratory within the California NanoSystems Institute (CNSI), supported by the University of California, Santa Barbara (UCSB), and the University of California, Office of the President (UCOP). Additional sequencing was performed at Novogene. We further acknowledge the Center of Scientific Computing at CNSI for computational facilities that are funded by NSF MRSEC (DMR 2308708) and NSF CNS-1725797.

This work was supported by the Institute for Collaborative Biotechnologies grants W911NF-09–D-0001 and W911NF-19-2-0026 (M.A.O.), US Army Research Office contract W911NF-19-1-0010 (M.A.O.), the National Science Foundation grant MCB-1553721 (M.A.O.), NIH grants R01HD099517 and R01HG011013 (S.S.D.), the UCSB Academic Senate Faculty Research Grant (S.S.D.), and the CNSI Challenge Grant Program, supported by UCSB and UCOP (S.S.D. and M.A.O.).

K.A.H., C.W., and S.S.D. conceived the method and designed the experiments. K.A.H., C.W., and S.C.T. participated in data collection for EMBR-seq+. L.V.B. cultivated and performed RNA extractions for all monocultures and co-cultures of F. succinogenes strain UWB7, A. robustus, and C. churrovis. A.R.R. cultivated G. metallireducens. K.A.H. and C.W. analyzed the data. K.A.H., L.V.B., A.R.R., M.A.O., and S.S.D. wrote the manuscript. S.S.D. guided experimental design and data analysis. All authors have read and approved the final version of the manuscript.

Contributor Information

Michelle A. O'Malley, Email: momalley@ucsb.edu.

Siddharth S. Dey, Email: sdey@ucsb.edu.

Danielle Tullman-Ercek, Northwestern University, Evanston, Illinois, USA.

Christina Homberger, University Würzburg, Würzburg, Germany.

DATA AVAILABILITY

RNA-seq data are available in Gene Expression Omnibus under accession no. GSE223404.

SUPPLEMENTAL MATERIAL

The following material is available online at https://doi.org/10.1128/msystems.00281-23.

Supplemental Information. msystems.00281-23-s0001.pdf.

Figures S1–S9; Tables S1–S3, S6, and S7; captions to Tables S4 and S5.

DOI: 10.1128/msystems.00281-23.SuF1
Table S4. msystems.00281-23-s0002.xlsx.

Results of differential gene expression analysis.

DOI: 10.1128/msystems.00281-23.SuF2
Table S5. msystems.00281-23-s0003.xlsx.

Select bacterial stress response genes upregulated in co-culture.

DOI: 10.1128/msystems.00281-23.SuF3
OPEN PEER REVIEW. reviewer-comments.pdf.

An accounting of the reviewer comments and feedback.

DOI: 10.1128/msystems.00281-23.SuF4

ASM does not own the copyrights to Supplemental Material that may be linked to, or accessed through, an article. The authors have granted ASM a non-exclusive, world-wide license to publish the Supplemental Material files. Please contact the corresponding author directly for reuse.

REFERENCES

  • 1. Hess M, Sczyrba A, Egan R, Kim TW, Chokhawala H, Schroth G, Luo S, Clark DS, Chen F, Zhang T, Mackie RI, Pennacchio LA, Tringe SG, Visel A, Woyke T, Wang Z, Rubin EM. 2011. Metagenomic discovery of biomass-degrading genes and genomes from cow rumen. Science 331:463–467. doi: 10.1126/science.1200387 [DOI] [PubMed] [Google Scholar]
  • 2. Seshadri R, Leahy SC, Attwood GT, Teh KH, Lambie SC, Cookson AL, Eloe-Fadrosh EA, Pavlopoulos GA, Hadjithomas M, Varghese NJ, Paez-Espino D, Hungate1000 project collaborators, Perry R, Henderson G, Creevey CJ, Terrapon N, Lapebie P, Drula E, Lombard V, Rubin E, Kyrpides NC, Henrissat B, Woyke T, Ivanova NN, Kelly WJ. 2018. Cultivation and sequencing of rumen microbiome members from the Hungate1000 collection. Nat Biotechnol 36:359–367. doi: 10.1038/nbt.4110 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Peng X “Nick, Gilmore SP, O’Malley MA. 2016. Microbial communities for bioprocessing: lessons learned from nature. Curr Opin Chem Eng 14:103–109. doi: 10.1016/j.coche.2016.09.003 [DOI] [Google Scholar]
  • 4. Lillington SP, Leggieri PA, Heom KA, O’Malley MA. 2020. Nature's recyclers: anaerobic microbial communities drive crude biomass deconstruction. Curr Opin Biotechnol 62:38–47. doi: 10.1016/j.copbio.2019.08.015 [DOI] [PubMed] [Google Scholar]
  • 5. Burnet MC, Dohnalkova AC, Neumann AP, Lipton MS, Smith RD, Suen G, Callister SJ. 2015. Evaluating models of cellulose degradation by Fibrobacter succinogenes S85. PLoS One 10:e0143809. doi: 10.1371/journal.pone.0143809 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Stewart RD, Auffret MD, Warr A, Walker AW, Roehe R, Watson M. 2019. Compendium of 4,941 rumen metagenome-assembled genomes for rumen microbiome biology and enzyme discovery. Nat Biotechnol 37:953–961. doi: 10.1038/s41587-019-0202-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Venturelli OS, Carr AC, Fisher G, Hsu RH, Lau R, Bowen BP, Hromada S, Northen T, Arkin AP. 2018. Deciphering microbial interactions in synthetic human gut microbiome communities. Mol Syst Biol 14:e8157. doi: 10.15252/msb.20178157 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Schink B. 1997. Energetics of syntrophic cooperation in methanogenic degradation. Microbiol Mol Biol Rev 61:262–280. doi: 10.1128/mmbr.61.2.262-280.1997 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Drula E, Garron ML, Dogan S, Lombard V, Henrissat B, Terrapon N. 2022. The carbohydrate-active enzyme database: functions and literature. Nucleic Acids Res 50:D571–D577. doi: 10.1093/nar/gkab1045 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Lawson CE, Harcombe WR, Hatzenpichler R, Lindemann SR, Löffler FE, O’Malley MA, García Martín H, Pfleger BF, Raskin L, Venturelli OS, Weissbrodt DG, Noguera DR, McMahon KD. 2019. Common principles and best practices for engineering microbiomes. Nat Rev Microbiol 17:725–741. doi: 10.1038/s41579-019-0255-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Proudfoot NJ. 2011. Ending the message: poly(A) signals then and now. Genes Dev 25:1770–1782. doi: 10.1101/gad.17268411 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B. 2008. Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Methods 5:621–628. doi: 10.1038/nmeth.1226 [DOI] [PubMed] [Google Scholar]
  • 13. Perry RP, Kelley DE, LaTorre J. 1972. Lack of polyadenylic acid sequences in the messenger RNA of E. coli. Biochem Biophys Res Commun 48:1593–1600. doi: 10.1016/0006-291x(72)90896-0 [DOI] [PubMed] [Google Scholar]
  • 14. Rosenow C, Saxena RM, Durst M, Gingeras TR. 2001. Prokaryotic RNA preparation methods useful for high density array analysis: comparison of two approaches. Nucleic Acids Res 29:e112. doi: 10.1093/nar/29.22.e112 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Giannoukos G, Ciulla DM, Huang K, Haas BJ, Izard J, Levin JZ, Livny J, Earl AM, Gevers D, Ward DV, Nusbaum C, Birren BW, Gnirke A. 2012. Efficient and robust RNA-seq process for cultured bacteria and complex community transcriptomes. Genome Biol 13:r23. doi: 10.1186/gb-2012-13-3-r23 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Rosset R, Julien J, Monier R. 1966. Ribonucleic acid composition of bacteria as a function of growth rate. J Mol Biol 18:308–320. doi: 10.1016/s0022-2836(66)80248-6 [DOI] [PubMed] [Google Scholar]
  • 17. Kraus AJ, Brink BG, Siegel TN. 2019. Efficient and specific oligo-based depletion of rRNA. Sci Rep 9:12281. doi: 10.1038/s41598-019-48692-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Culviner PH, Guegler CK, Laub MT. 2020. A simple, cost-effective, and robust method for rRNA depletion in RNA-sequencing studies. mBio 11:e00010-20. doi: 10.1128/mBio.00010-20 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Huang Y, Sheth RU, Kaufman A, Wang HH. 2020. Scalable and cost-effective ribonuclease-based rRNA depletion for transcriptomics. Nucleic Acids Res 48:e20. doi: 10.1093/nar/gkz1169 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Engelhardt F, Tomasch J, Häussler S. 2020. Organism-specific depletion of highly abundant RNA species from bacterial total RNA. Access Microbiol 2:acmi000159. doi: 10.1099/acmi.0.000159 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Choe D, Szubin R, Poudel S, Sastry A, Song Y, Lee Y, Cho S, Palsson B, Cho BK. 2021. Riborid: a low cost, advanced, and ultra-efficient method to remove ribosomal RNA for bacterial transcriptomics. PLoS Genet 17:e1009821. doi: 10.1371/journal.pgen.1009821 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Prezza G, Heckel T, Dietrich S, Homberger C, Westermann AJ, Vogel J. 2020. Improved bacterial RNA-seq by Cas9-based depletion of ribosomal RNA reads. RNA 26:1069–1078. doi: 10.1261/rna.075945.120 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Kang Y, Norris MH, Zarzycki-Siek J, Nierman WC, Donachie SP, Hoang TT. 2011. Transcript amplification from single bacterium for transcriptome analysis. Genome Res 21:925–935. doi: 10.1101/gr.116103.110 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. He S, Wurtzel O, Singh K, Froula JL, Yilmaz S, Tringe SG, Wang Z, Chen F, Lindquist EA, Sorek R, Hugenholtz P. 2010. Validation of two ribosomal RNA removal methods for microbial metatranscriptomics. Nat Methods 7:807–812. doi: 10.1038/nmeth.1507 [DOI] [PubMed] [Google Scholar]
  • 25. Wangsanuwat C, Heom KA, Liu E, O’Malley MA, Dey SS. 2020. Efficient and cost-effective bacterial mRNA sequencing from low input samples through ribosomal RNA depletion. BMC Genomics 21:717. doi: 10.1186/s12864-020-07134-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Suen G, Weimer PJ, Stevenson DM, Aylward FO, Boyum J, Deneke J, Drinkwater C, Ivanova NN, Mikhailova N, Chertkov O, Goodwin LA, Currie CR, Mead D, Brumm PJ. 2011. The complete genome sequence of Fibrobacter succinogenes S85 reveals a cellulolytic and metabolic specialist. PLoS One 6:e18814. doi: 10.1371/journal.pone.0018814 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Childers SE, Ciufo S, Lovley DR. 2002. Geobacter metallireducens accesses insoluble Fe(III) oxide by chemotaxis. Nature 416:767–769. doi: 10.1038/416767a [DOI] [PubMed] [Google Scholar]
  • 28. Rotaru AE, Shrestha PM, Liu F, Markovaite B, Chen S, Nevin KP, Lovley DR. 2014. Direct interspecies electron transfer between Geobacter metallireducens and Methanosarcina barkeri. Appl Environ Microbiol 80:4599–4605. doi: 10.1128/AEM.00895-14 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Tajima K, Aminov RI, Nagamine T, Matsui H, Nakamura M, Benno Y. 2001. Diet-dependent shifts in the bacterial population of the rumen revealed with real-time PCR. Appl Environ Microbiol 67:2766–2774. doi: 10.1128/AEM.67.6.2766-2774.2001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Krause DO, Denman SE, Mackie RI, Morrison M, Rae AL, Attwood GT, McSweeney CS. 2003. Opportunities to improve fiber degradation in the rumen: microbiology, ecology, and genomics. FEMS Microbiol Rev 27:663–693. doi: 10.1016/S0168-6445(03)00072-X [DOI] [PubMed] [Google Scholar]
  • 31. Shinkai T, Kobayashi Y. 2007. Localization of ruminal cellulolytic bacteria on plant fibrous materials as determined by fluorescence in situ hybridization and real-time PCR. Appl Environ Microbiol 73:1646–1652. doi: 10.1128/AEM.01896-06 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Raut MP, Couto N, Karunakaran E, Biggs CA, Wright PC. 2019. Deciphering the unique cellulose degradation mechanism of the ruminal bacterium Fibrobacter succinogenes S85. Sci Rep 9:20281. doi: 10.1038/s41598-019-55971-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Solomon KV, Haitjema CH, Henske JK, Gilmore SP, Borges-Rivera D, Lipzen A, Brewer HM, Purvine SO, Wright AT, Theodorou MK, Grigoriev IV, Regev A, Thompson DA, O’Malley MA. 2016. Early-branching gut fungi possess a large, comprehensive array of biomass-degrading enzymes. Science 351:1192–1195. doi: 10.1126/science.aad1431 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Haitjema CH, Gilmore SP, Henske JK, Solomon KV, de Groot R, Kuo A, Mondo SJ, Salamov AA, LaButti K, Zhao Z, Chiniquy J, Barry K, Brewer HM, Purvine SO, Wright AT, Hainaut M, Boxma B, van Alen T, Hackstein JHP, Henrissat B, Baker SE, Grigoriev IV, O’Malley MA. 2017. A parts list for fungal cellulosomes revealed by comparative genomics. Nat Microbiol 2:17087. doi: 10.1038/nmicrobiol.2017.87 [DOI] [PubMed] [Google Scholar]
  • 35. Seppälä S, Wilken SE, Knop D, Solomon KV, O’Malley MA. 2017. The importance of sourcing enzymes from non-conventional fungi for metabolic engineering and biomass breakdown. Metab Eng 44:45–59. doi: 10.1016/j.ymben.2017.09.008 [DOI] [PubMed] [Google Scholar]
  • 36. Gilmore SP, Lankiewicz TS, Wilken SE, Brown JL, Sexton JA, Henske JK, Theodorou MK, Valentine DL, O’Malley MA. 2019. Top-down enrichment guides in formation of synthetic microbial consortia for biomass degradation. ACS Synth Biol 8:2174–2185. doi: 10.1021/acssynbio.9b00271 [DOI] [PubMed] [Google Scholar]
  • 37. Swift CL, Louie KB, Bowen BP, Hooker CA, Solomon KV, Singan V, Daum C, Pennacchio CP, Barry K, Shutthanandan V, Evans JE, Grigoriev IV, Northen TR, O’Malley MA. 2021. Cocultivation of anaerobic fungi with rumen bacteria establishes an antagonistic relationship. mBio 12:e0144221. doi: 10.1128/mBio.01442-21 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Swift CL, Brown JL, Seppälä S, O’Malley MA. 2019. Co-cultivation of the anaerobic fungus Anaeromyces robustus with Methanobacterium bryantii enhances transcription of carbohydrate active enzymes. J Ind Microbiol Biotechnol 46:1427–1433. doi: 10.1007/s10295-019-02188-0 [DOI] [PubMed] [Google Scholar]
  • 39. Henske JK, Gilmore SP, Knop D, Cunningham FJ, Sexton JA, Smallwood CR, Shutthanandan V, Evans JE, Theodorou MK, O’Malley MA. 2017. Transcriptomic characterization of Caecomyces churrovis: a novel, non-rhizoid-forming lignocellulolytic anaerobic fungus. Biotechnol Biofuels 10:305. doi: 10.1186/s13068-017-0997-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Brown JL, Swift CL, Mondo SJ, Seppala S, Salamov A, Singan V, Henrissat B, Drula E, Henske JK, Lee S, LaButti K, He G, Yan M, Barry K, Grigoriev IV, O’Malley MA. 2021. Co-cultivation of the anaerobic fungus Caecomyces churrovis with Methanobacterium bryantii enhances transcription of carbohydrate binding modules, dockerins, and pyruvate formate lyases on specific substrates. Biotechnol Biofuels 14:234. doi: 10.1186/s13068-021-02083-w [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Nicholson MJ, Theodorou MK, Brookman JL. 2005. Molecular analysis of the anaerobic rumen fungus Orpinomyces - insights into an AT-rich genome. Microbiology (Reading) 151:121–133. doi: 10.1099/mic.0.27353-0 [DOI] [PubMed] [Google Scholar]
  • 42. Yin Y, Mao X, Yang J, Chen X, Mao F, Xu Y. 2012. dbCAN: a web resource for automated carbohydrate-active enzyme annotation. Nucleic Acids Res 40:W445–W451. doi: 10.1093/nar/gks479 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Zhang H, Yohe T, Huang L, Entwistle S, Wu P, Yang Z, Busk PK, Xu Y, Yin Y. 2018. dbCAN2: a meta server for automated carbohydrate-active enzyme annotation. Nucleic Acids Res 46:W95–W101. doi: 10.1093/nar/gky418 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44. Henske JK, Wilken SE, Solomon KV, Smallwood CR, Shutthanandan V, Evans JE, Theodorou MK, O’Malley MA. 2018. Metabolic characterization of anaerobic fungi provides a path forward for bioprocessing of crude lignocellulose. Biotechnol Bioeng 115:874–884. doi: 10.1002/bit.26515 [DOI] [PubMed] [Google Scholar]
  • 45. Scarborough MJ, Lawson CE, Hamilton JJ, Donohue TJ, Noguera DR. 2018. Metatranscriptomic and thermodynamic insights into medium-chain fatty acid production using an anaerobic microbiome. mSystems 3:e00221-18. doi: 10.1128/mSystems.00221-18 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Brown JL, Perisin MA, Swift CL, Benyamin M, Liu S, Singan V, Zhang Y, Savage E, Pennacchio C, Grigoriev IV, O’Malley MA. 2023. Co-cultivation of anaerobic fungi with Clostridium acetobutylicum bolsters butyrate and butanol production from cellulose and lignocellulose. J Ind Microbiol Biotechnol 49:kuac024. doi: 10.1093/jimb/kuac024 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47. Peng X, Wilken SE, Lankiewicz TS, Gilmore SP, Brown JL, Henske JK, Swift CL, Salamov A, Barry K, Grigoriev IV, Theodorou MK, Valentine DL, O’Malley MA. 2021. Genomic and functional analyses of fungal and bacterial consortia that enable lignocellulose breakdown in goat gut microbiomes. Nat Microbiol 6:499–511. doi: 10.1038/s41564-020-00861-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Leggieri PA, Valentine MT, O’Malley MA. 2022. Biofilm disruption enhances growth rate and carbohydrate-active enzyme production in anaerobic fungi. Bioresour Technol 358:127361. doi: 10.1016/j.biortech.2022.127361 [DOI] [PubMed] [Google Scholar]
  • 49. Blattman SB, Jiang W, Oikonomou P, Tavazoie S. 2020. Prokaryotic single-cell RNA sequencing by in situ combinatorial indexing. Nat Microbiol 5:1192–1201. doi: 10.1038/s41564-020-0729-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50. Imdahl F, Vafadarnejad E, Homberger C, Saliba AE, Vogel J. 2020. Single-cell RNA-sequencing reports growth-condition-specific global transcriptomes of individual bacteria. Nat Microbiol 5:1202–1206. doi: 10.1038/s41564-020-0774-1 [DOI] [PubMed] [Google Scholar]
  • 51. Kuchina A, Brettner LM, Paleologu L, Roco CM, Rosenberg AB, Carignano A, Kibler R, Hirano M, DePaolo RW, Seelig G. 2020. Microbial single-cell RNA sequencing by split-pool barcoding. Science 371:eaba5257. doi: 10.1126/science.aba5257 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52. Homberger C, Hayward RJ, Barquist L, Vogel J. 2023. Improved bacterial single-cell RNA-seq through automated MATQ-seq and Cas9-based removal of rRNA reads. mBio 14:e0355722. doi: 10.1128/mbio.03557-22 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53. McNulty R, Sritharan D, Pahng SH, Meisch JP, Liu S, Brennan MA, Saxer G, Hormoz S, Rosenthal AZ. 2023. Probe-based bacterial single-cell RNA sequencing predicts toxin regulation. Nat Microbiol 8:934–945. doi: 10.1038/s41564-023-01348-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54. Hashimshony T, Wagner F, Sher N, Yanai I. 2012. CEL-Seq: single-cell RNA-Seq by multiplexed linear amplification. Cell Rep 2:666–673. doi: 10.1016/j.celrep.2012.08.003 [DOI] [PubMed] [Google Scholar]
  • 55. Hashimshony T, Senderovich N, Avital G, Klochendler A, de Leeuw Y, Anavy L, Gennert D, Li S, Livak KJ, Rozenblatt-Rosen O, Dor Y, Regev A, Yanai I. 2016. CEL-Seq2: sensitive highly-multiplexed single-cell RNA-seq. Genome Biol 17:77. doi: 10.1186/s13059-016-0938-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56. Wang B, Lin AE, Yuan J, Koch MD, Adamson B, Wingreen NS, Gitai Z. 2022. Massively-parallel microbial mRNA sequencing (M3-Seq) reveals heterogeneous behaviors in bacteria at single-cell resolution. Microbiology. doi: 10.1101/2022.09.21.508688 [DOI] [PMC free article] [PubMed]
  • 57. Ma P, Amemiya HM, He LL, Gandhi SJ, Nicol R, Bhattacharyya RP, Smillie CS, Hung DT. 2023. Bacterial droplet-based single-cell RNA-seq reveals antibiotic-associated heterogeneous cellular states. Cell 186:877–891. doi: 10.1016/j.cell.2023.01.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58. Bretschger O, Obraztsova A, Sturm CA, Chang IS, Gorby YA, Reed SB, Culley DE, Reardon CL, Barua S, Romine MF, Zhou J, Beliaev AS, Bouhenni R, Saffarini D, Mansfeld F, Kim B-H, Fredrickson JK, Nealson KH. 2007. Current production and metal oxide reduction by Shewanella oneidensis MR-1 wild type and mutants. Appl Environ Microbiol 73:7003–7012. doi: 10.1128/AEM.01087-07 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59. Neumann AP, Suen G. 2018. The phylogenomic diversity of herbivore-associated Fibrobacter spp. is correlated to lignocellulose-degrading potential. mSphere 3:e00593-18. doi: 10.1128/mSphere.00593-18 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60. Haitjema CH, Solomon KV, Henske JK, Theodorou MK, O’Malley MA. 2014. Anaerobic gut fungi: advances in isolation, culture, and cellulolytic enzyme discovery for biofuel production. Biotechnol Bioeng 111:1471–1482. doi: 10.1002/bit.25264 [DOI] [PubMed] [Google Scholar]
  • 61. Davies DR, Theodorou MK, Lawrence MI, Trinci AP. 1993. Distribution of anaerobic fungi in the digestive tract of cattle and their survival in faeces. J Gen Microbiol 139:1395–1400. doi: 10.1099/00221287-139-6-1395 [DOI] [PubMed] [Google Scholar]
  • 62. Teunissen MJ, Op den Camp HJ, Orpin CG, Huis in ’t Veld JH, Vogels GD. 1991. Comparison of growth characteristics of anaerobic fungi isolated from ruminant and non-ruminant herbivores during cultivation in a defined medium. J Gen Microbiol 137:1401–1408. doi: 10.1099/00221287-137-6-1401 [DOI] [PubMed] [Google Scholar]
  • 63. Erster O, Shkedi O, Benedek G, Zilber E, Varkovitzky I, Shirazi R, Oriya Shorka D, Cohen Y, Bar T, Yechieli R, Tepperberg Oikawa M, Venkert D, Linial M, Oiknine-Djian E, Mandelboim M, Livneh Z, Shenhav-Saltzman G, Mendelson E, Wolf D, Szwarcwort-Cohen M, Mor O, Lewis Y, Zeevi D. 2021. Improved sensitivity, safety, and rapidity of COVID-19 tests by replacing viral storage solution with lysis buffer. PLoS ONE 16:e0249149. doi: 10.1371/journal.pone.0249149 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64. Malmstrom R. 2015. Rnalater recipe V1. doi: 10.17504/protocols.io.c56y9d [DOI]
  • 65. Li H, Durbin R. 2009. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25:1754–1760. doi: 10.1093/bioinformatics/btp324 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66. Chen IM, Chu K, Palaniappan K, Pillay M, Ratner A, Huang J, Huntemann M, Varghese N, White JR, Seshadri R, Smirnova T, Kirton E, Jungbluth SP, Woyke T, Eloe-Fadrosh EA, Ivanova NN, Kyrpides NC. 2019. IMG/M V.5.0: an integrated data management and comparative analysis system for microbial genomes and microbiomes. Nucleic Acids Res 47:D666–D677. doi: 10.1093/nar/gky901 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67. Grigoriev IV, Nikitin R, Haridas S, Kuo A, Ohm R, Otillar R, Riley R, Salamov A, Zhao X, Korzeniewski F, Smirnova T, Nordberg H, Dubchak I, Shabalov I. 2014. Mycocosm portal: gearing up for 1000 fungal genomes. Nucleic Acids Res 42:D699–D704. doi: 10.1093/nar/gkt1183 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68. Love MI, Huber W, Anders S. 2014. Moderated estimation of fold change and dispersion for RNA-seq data with DeSeq2. Genome Biol 15:550. doi: 10.1186/s13059-014-0550-8 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Reviewer comments
reviewer-comments.pdf (1.1MB, pdf)
Supplemental Information. msystems.00281-23-s0001.pdf.

Figures S1–S9; Tables S1–S3, S6, and S7; captions to Tables S4 and S5.

DOI: 10.1128/msystems.00281-23.SuF1
Table S4. msystems.00281-23-s0002.xlsx.

Results of differential gene expression analysis.

DOI: 10.1128/msystems.00281-23.SuF2
Table S5. msystems.00281-23-s0003.xlsx.

Select bacterial stress response genes upregulated in co-culture.

DOI: 10.1128/msystems.00281-23.SuF3
OPEN PEER REVIEW. reviewer-comments.pdf.

An accounting of the reviewer comments and feedback.

DOI: 10.1128/msystems.00281-23.SuF4

Data Availability Statement

RNA-seq data are available in Gene Expression Omnibus under accession no. GSE223404.


Articles from mSystems are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES