Skip to main content
Nature Communications logoLink to Nature Communications
. 2022 Jul 5;13:3870. doi: 10.1038/s41467-022-31433-x

Metaproteomics reveals enzymatic strategies deployed by anaerobic microbiomes to maintain lignocellulose deconstruction at high solids

Payal Chirania 1,2,3,#, Evert K Holwerda 3,4,#, Richard J Giannone 1,3, Xiaoyu Liang 4, Suresh Poudel 1, Joseph C Ellis 1,3, Yannick J Bomble 3,5, Robert L Hettich 1,3,, Lee R Lynd 3,4,
PMCID: PMC9256739  PMID: 35790765

Abstract

Economically viable production of cellulosic biofuels requires operation at high solids loadings—on the order of 15 wt%. To this end we characterize Nature’s ability to deconstruct and utilize mid-season switchgrass at increasing solid loadings using an anaerobic methanogenic microbiome. This community exhibits undiminished fractional carbohydrate solubilization at loadings ranging from 30 g/L to 150 g/L. Metaproteomic interrogation reveals marked increases in the abundance of specific carbohydrate-active enzyme classes. Significant enrichment of auxiliary activity family 6 enzymes at higher solids suggests a role for Fenton chemistry. Stress-response proteins accompanying these reactions are similarly upregulated at higher solids, as are β-glucosidases, xylosidases, carbohydrate-debranching, and pectin-acting enzymes—all of which indicate that removal of deconstruction inhibitors is important for observed undiminished solubilization. Our work provides insights into the mechanisms by which natural microbiomes effectively deconstruct and utilize lignocellulose at high solids loadings, informing the future development of defined cultures for efficient bioconversion.

Subject terms: Microbiome, Proteomics, Applied microbiology, Biofuels


Efficient solubilization of plant cell wall carbohydrates is required for microbial production of biofuels from lignocellulosic biomass. Here, the authors employ metaproteomics to interrogate enzymatic strategies of a methanogenic microbiome deconstructing switchgrass at increasing solids loading.

Introduction

Biological production of liquid fuels from lignocellulosic feedstocks is of high interest as society navigates a transition away from fossil resources1. However, the recalcitrant character of such feedstocks impedes biological conversion and represents a major cost barrier2,3. One-step consolidated bioprocessing (CBP) without added enzymes, mediated by defined cultures of anaerobic lignocellulose-fermenting bacteria, is a promising strategy for converting lignocellulose to liquid fuels at low cost4. Because substantial titers of liquid fuel products are required to avoid high costs for product recovery and fermentation, biological processes for conversion of lignocellulose need to operate at high solids loadings—typically on the order of 15 wt% or more57. Around two-thirds of the mass content of lignocellulose is carbohydrate. An efficient sugar-to-liquid-biofuel microbial metabolism can achieve an end-product at 50% yield. Not considering titer restrictions and solids handling issues, 150 g/L solids loading would result in a maximum biofuel titer for ethanol of ~50 g/L.

For both enzymatic hydrolysis mediated by fungal cellulase810 and for anaerobic digestion mediated by undefined microbial consortia1113, fractional carbohydrate solubilization is relatively constant with increasing solids until little to no free water remains, above which rates decline14,15. The solids loading at which diminishing solubilization is observed varies from system to system but is generally in the range of 150–180 g/L (15 to 18 wt%)1619. However, for defined culture systems—e.g., those that might be used for the production of compounds other than methane, H2, and mixed organic acids from cellulosic biomass via CBP—carbohydrate solubilization has been observed to decline with increasing solids at much lower loadings, e.g., <80 g/L20,21. The basis for this decline is unknown, although controlled experiments indicate that it is not explained by inhibition by fermentation products or inadequate growth media22,23. Hence, the diagnosis and remediation of declining lignocellulose solubilization is important to advance CBP toward commercial application.

Deconstruction of lignocellulose typically involves dozens of proteins with diverse functions and structures2426. Undefined microbial consortia such as those occurring in anaerobic digestion systems represent a wealth of diversity2729, both in terms of organisms as well as the carbohydrate active enzymes (CAZymes)30,31 they express. The composition and functional characterization of lignocellulose-fermenting microbiomes has been explored over the last decade using emergent omic technologies, including metagenomics, metaproteomics, and metatranscriptomics32,33. These techniques have been used to document up to a million protein-encoding sequences3436, tens of thousands of genes and transcripts encoding CAZymes3638 from over 200 gene families36, and over 1000 identified microbial species39,40. Detailed inventories of CAZymes have been characterized from diverse environments including the rumen37, anaerobic digesters41, termites42,43, and the moose38 among others. Although most studies have not examined the spatial location of microbes and enzymes, Kougias et al.34 differentiated enzymes and organisms present in the planktonic and substrate-adhered phases. Liang et al.44 inventoried changes in CAZyme and microbiome composition for thermophilic, anaerobic digestion of switchgrass at various residence times. Abbassi-Guendouz et al.45 performed metagenomic and phylogenetic analysis of mesophilic anaerobic digestion of shredded cardboard with solids concentration varying from 10 to 30 wt%. Most studies to date have focused on the metabolic potential of these communities (metagenomics), with only a handful of expression-based studies (metatranscriptomics) that have been mostly confined to single batch conditions41,46,47. Metaproteomics is uniquely suited to measure the physical presence, abundance, and location of both intracellular and extracellular enzymatic machinery and how they differ across conditions. Thus, metaproteomics provides complementary information to metagenomic or metatranscriptomics, but has not been reported for anaerobic lignocellulose-fermenting microbiomes as a function of solids loading.

In this study, we employ LC-MS/MS-based metaproteomic measurements in order to gain insight into mechanisms of lignocellulose deconstruction at solids loadings representative of those anticipated in an industrial process. An anaerobic, thermophilic, semi-continuously fed, methanogenic microbial enrichment cultivated over an extended period (550 days), referred to here as a lignocellulose-fermenting microbiome, is sampled at various solids loadings at steady-state and fractionated to identify key microbes and/or enzymes. We document solubilization performance with increasing solids, changes in the abundance of CAZymes across fractions, and the details of the methanogenesis pathways. The data and results described in this paper are an extension of the previous work described in Liang et al.44, where different residence times were examined (20 to 3.3 days) at one fixed solids loading of 30 g/L. The resulting metagenomes from that work are used as a basis for the new metaproteomics analysis described in this paper, where the residence time is fixed at 10 days, with increasing solids loading from 30 to 75, 120, and finally 150 g/L of the same feedstock.

Results

Microbiome maintains undiminished solubilization at higher solids

An anaerobic lignocellulosic thermophilic (55 °C) methanogenic microbiome was established by enriching an anaerobic digester inoculum on 30 g/L unpretreated June-harvested non-senescent switchgrass and incubating the culture for over 120 days before entering the data collection phase. The microbiome with a 1 L working volume was operated semi-continuously at a 10-day residence time by daily withdrawal of 100 mL of well-mixed bioreactor contents followed by addition of 100 mL total volume of switchgrass and growth media44. The bioreactor was operated at solids loadings of 30, 75, 120, and 150 g/L, with the cultivation at each loading lasting for at least 50 days, leading to an overall microbiome cultivation time of 550 days (Fig. 1A, Supplementary Table 1, and Methods).

Fig. 1. An anaerobic thermophilic microbiome exhibits undiminished fractional carbohydrate solubilization with increasing solids loadings between 30 and 150 g/L.

Fig. 1

A Experimental overview. A lignocellulose-fermenting microbiome was fed semi-continuously with increasing amounts of switchgrass. Samples were analyzed for carbohydrate solubilization and metaproteomics. Metaproteomic analysis was carried out two ways; whole sample analysis (1D LC-MS/MS) and sample fractionation followed by multidimensional (2D LC-MS/MS) analysis. Prior to multidimensional measurements, each sample was separated by centrifugation into supernatant (SNT), planktonic cells (PC), and substrate bound (SB) fractions. B Stable carbohydrate solubilization (blue bars) at ~66.8% was obtained, with the rate of substrate solubilization (red circles) increasing linearly with solids loading. Data for 30 g/L was partially published prior in44. The data represents average values for different time-points during steady-state conditions for one microbiome (see Supplementary Table 2), error bars represent ± one standard deviation. The individual data points for total fractional carbohydrate solubilization (blue bars) are shown as dots (black circles) for each solids loading. The carbohydrate solubilization rate linear fit line (red dashed line) is based on carbohydrate solubilization rate data (red circles), which is calculated from fractional solubilization data (blue bars) adjusted for residence time in hours, the total carbohydrate content of switchgrass and the switchgrass loading concentration (30, 75, 120, and 150 g/L respectively). C The methane content of the off-gas (dark green bars) was constant, with the rate of off-gas (light green plus-shaped markers) and methane production (light green crosses) increasing in proportion to solids loading. Data for 30 g/L was partially published prior. Data points for methane content and off-gas production rate are averages for different time-points during steady-state conditions for one microbiome (see Supplementary Table 2), all error bars represent ± one standard deviation. The individual data points for methane concentration in the off-gas (dark green bars) are shown as dots (black circles) for each solids loading. Methane off-gas rate data is based on methane concentration in the off-gas and the off-gas production rate. Fit lines for the off-gas production rate (light green dashed line) and the methane off-gas production rate (light green dotted line) are linearly fitted to the data. Source data are provided as a Source Data file.

The fraction of carbohydrate solubilized at steady state—quantified by the loss of glucose, xylose, and arabinose moieties in the solids—remained relatively constant at ~66.8% regardless of solids loading over the range considered. The rate of carbohydrate solubilization increased in proportion to solids loading (Fig. 1B). The main products observed were methane (CH4) and carbon dioxide (CO2), whereas volatile fatty acids (VFA) and hydrogen (H2) were not detected at steady state (Supplementary Fig. 1, Supplementary Table 2). Methane and CO2 measured in effluent gas ranged from 48.7 to 49.9%, and 47.2 to 49.0% respectively. Nitrogen was also present at < 5%, likely introduced in the feeding and sampling process. Volumetric gas production and the methane production rate increased proportionally with increasing solids loading (Fig. 1C, Supplementary Table 2), indicating that product formation by the microbiome was not differently affected up to 150 g/L solids.

Fractionation reveals spatial distribution of enzymatic categories

A metaproteomic analysis was undertaken at each solids loading to provide a detailed molecular view of the composition and function of the lignocellulose-fermenting microbiome. The overall amount of microbial biomass, as assessed by an initial screen of unfractionated microbiome via 1D LC-MS/MS-based metaproteomic analysis, did not increase proportionally (p-value > 0.05, two-tailed Welch’s t-test) with the 5-fold increase in substrate concentration; although significant abundance changes across a myriad of functional categories were observed with increasing solids (Fig. 1A, Supplementary Note 1, Supplementary Data 1). In order to further examine the changes in the community’s functional structure across solids loadings and deepen the overall analysis, a multidimensional metaproteomic measurement was conducted on fractionated microbiome samples. As follows, each sample was split into three fractions: supernatant (SNT), planktonic cells (PC), and substrate-bound (SB). Sample fractionation to segregate extracellular proteins from planktonic- and substrate-bound microbes and enzymes has been demonstrated previously, enabling the exploration of metabolic variability due to spatial localization of enzymes and microbes34,48.

A total of 16,644 microbial protein groups were quantified based on unique peptide evidence across the three fractions at each substrate loading (Supplementary Data 24). There were both complexity and proteomic differences between the fractions, as only a third of the proteins were common among all three (Supplementary Fig. 2A). However, within each fraction, a comparable number of proteins were quantified across all solids loadings (Supplementary Fig. 2B), suggesting complexity differences were driven by the type of fraction rather than the solids loading. When considering proteomic changes within each fraction, replicates of each solids loading clustered together, while broader differences were observed between low (30 g/L) and high loadings (120, 150 g/L) (Supplementary Fig. 2C).

Since the methane production rate increased proportionally with increasing solids, we first examined methanogenesis pathway-related proteomic signatures across the three fractions and assessed whether the observed increase was corroborated by metaproteomic analysis (Supplementary Fig. 36, Supplementary Data 5, Supplementary Note 2). The PC and SB fractions, which both include whole cells, had a greater representation of enzymes mapping to the different methanogenic pathways than the SNT fraction (Supplementary Fig. 3A). All methanogenesis pathway enzymes exhibited a clear increasing trend in the PC fraction that correlated with increasing solids loadings. This fraction also had the greatest representation of enzymes involved in the methanogenesis process (Supplementary Figs. 3A, B and 5). Closer inspection revealed that the PC fraction contained the highest total abundance of proteins from Euryarchaeota such that this phylum contributed substantially to the PC metaproteome relative to the other fractions (Supplementary Fig. 3C). Among the archaeal proteins, both the aggregate abundance and diversity (protein count) of those mapping the hydrogenotrophic pathway markedly exceeded that of the acetoclastic pathway (Supplementary Figs. 3A and 7), suggesting this to be the major route for methanogenesis.

A diverse Firmicutes-dominated CAZyme repertoire is expressed

CAZymes are directly responsible for catalyzing the conversion of carbohydrates harbored in lignocellulose into a soluble form. To fully characterize this central process and detail how CAZymes from different organisms and/or phyla synergize to efficiently deconstruct lignocellulose at increased solids, the fractionated metaproteomes were analyzed in terms of CAZyme type, abundance, and location. Prior to this analysis, a survey of the entire metagenome was conducted using the dbCAN2 meta server49 to annotate important CAZyme families and their functional activities. Figure 2 shows the overview of CAZymes identified in each fraction and their general trends with solids loading (Supplementary Data 6). In total, 551 CAZyme protein groups were quantified (representing 1,246 proteins or ~14% of the total CAZymes annotated in the metagenome), with differences evident across the fractions (Fig. 2A). Notably, only 75 CAZymes (~14%) were present in all three fractions, whereas the PC and SB fractions harbored the greatest number of unique CAZymes (Fig. 2A). These differences demonstrate the compartmentalization of distinct CAZymes in the microbiome system and the importance of measuring spatially resolved fractions. About twenty percent (110 of 551) of the CAZymes harbored a carbohydrate binding domain (CBM) or a cellulosomal domain (cohesin or dockerin) (Supplementary Fig. 8, Supplementary Data 6). The proportion of these affinity-conferring CAZymes declined in the SNT fraction (from ~60% to ~20%) with increasing solids, while minimal changes were observed in the PC (~30%) and SB (~50%) fractions (Supplementary Fig. 8, 9), suggesting that free enzymes are somewhat excluded from binding substrate at lower solids loadings and remain in the SNT until surfaces become available.

Fig. 2. Overview of the measured CAZymes’ repertoire in the three fractions.

Fig. 2

A Venn diagram showing the overlap of CAZymes quantified in the three fractions. B For each fraction, the distribution of quantified CAZymes across the CAZyme classes (GH, CE, PL, GT, AA) with linkages showing the contributing taxonomic phyla (top half). Phyla producing <10 CAZymes were grouped as ‘Others’. C Aggregate abundance trend of CAZymes from different CAZyme classes with increasing solids loadings per fraction. A two-tailed Welch’s t-test was performed with Benjamini Hochberg FDR correction from each solids loading versus the 30 g/L loading. + means adjusted (adj.) p-value ≤ 0.05 and * means adj. p-value ≤ 0.05 with absolute fold change ≥2× vs. respective 30 g/L condition. Exact p-values for each comparison are listed in Supplementary Data 6. D The relative distribution of each deconstruction sub-category within GHs. Data are presented as mean values ± SD (n = 4 for 30 and 120 g/L, n = 3 for 75 g/L, and n = 5 for 150 g/L). SNT supernatant, PC planktonic cells, SB substrate bound, GH glycoside hydrolase, CE carbohydrate esterase, PL polysaccharide lyase, GT glycosyl transferase, AA auxiliary activity. Source data are provided as a Source Data file.

Taxonomically, members of the phyla Firmicutes (Clostridia followed by Bacillus), known for their cellulolytic capabilities in thermophilic environments, were the dominant contributors of these measured CAZymes. However, fraction-specific differences in the taxonomic lineages of the CAZymes were observed (Fig. 2B). For example, CAZymes from the phylum Chloroflexi were more prevalent in the PC fraction while enzymes from the phylum Thermotogae were more represented in the SNT and SB fractions (Fig. 2B), suggesting the presence of functional niches. From a functional perspective, Glycoside Hydrolases (GHs) formed the most numerous and abundant CAZyme class irrespective of the fractions (Fig. 2B), in line with previous studies50. Other CAZyme classes—polysaccharide lyases (PLs), carbohydrate esterases (CEs), and glycosyl transferases (GTs)—appeared to be dominant in the substrate- or cell-associated fractions while auxiliary activities (AAs) were more represented in the extracellular fraction (Fig. 2B), further indicating the enrichment of enzymatic activities based on the cellular location.

Quantitative assessment of CAZyme classes revealed trends in their abundances from low to high solids loadings in all three fractions (Fig. 2C, Supplementary Fig. 9C). AAs formed a small component of the CAZyme repertoire; however, they exhibited a significant increase in abundance in all fractions with increasing solids loading. Meanwhile, the abundances of PLs and CEs both decreased significantly in the SNT fraction at higher solids (120 and 150 g/L) but exhibited opposing trends in PC and SB fractions, especially in the case of PLs. The abundance of PLs is roughly similar across both SNT and SB fractions at low solids. However, as solids increase, the PLs decrease in the SNT while proportionally increasing in the SB, indicating that these enzymes may be overexpressed at low solids but only become associated with substrate fractions at higher solids when presumably more binding area is available (Fig. 2C).

GHs – considered to be the workhorses of carbohydrate depolymerization, were the most abundant enzyme class observed across all fractions but exhibited only a <1.5-fold increase in their total abundance while solids increased by 5-fold (Fig. 2C). Proteins from 53 GH families were identified and had varying trends with solids loadings across the three fractions (Supplementary Fig. 10, Supplementary Data 7). To tease apart the overall GH trends observed in Fig. 2C, the identified GH proteins were grouped into broad deconstruction categories based on their most common activity, and quantitatively assessed (Fig. 2D, Supplementary Data 6). The contributions from the main chain polysaccharide-hydrolyzing GHs (longer polymer; colored brown in (Fig. 2D) decreased with increased solids in all three fractions. This trend was countered by a noticeable increase in either the small oligosaccharide acting GHs (SNT; orange) or the hemicellulose debranching GHs (PC and SB; purple) (Fig. 2D). These observations indicate that the microbiome adjusts its GH machinery to continue solubilization at high solids with minimal impact to aggregate GH abundance and prompted us to further evaluate functional CAZyme changes across the major component polymers of switchgrass.

Oligosaccharide acting β-glucosidases increase at higher solids

Multiple GH proteins across the three main cellulolytic categories (endoglucanases, exoglucanases, and β-glucosidases) were present in the cultures at varying levels. Both endoglucanases (mostly GH9 and GH5, in accordance with other studies46,47) and β-glucosidases (primarily GH3) were highly represented, indicating the importance of these cellulolytic activities in the system (Fig. 3, Supplementary Data 7). Unexpectedly, at 150 g/L solids—representing a 5-fold increase in switchgrass—the collective abundance of endoglucanases, a main constituent of many cellulose degrading microbes, did not change in the SB fraction, decreased significantly in the SNT fraction, and only increased moderately (1.4-fold in 120 g/L) in the PC fraction relative to 30 g/L (Fig. 3, Supplementary Fig. 11, 12D). Also, exoglucanases- considered the main enzymes responsible for cellulose chain depolymerization, exhibited a similar abundance trend with increasing solids, specifically with regard to SB and SNT fractions. In contrast, a significant increase in the cumulative abundance of β-glucosidases was observed especially in the SNT fraction (~12-fold increase) where small oligosaccharides such as cellobiose might occur (Figs. 2D, 3, Supplementary Fig. 11, 12A). β-glucosidases are known to relieve end-product inhibition of cellulases, improve conversion yields and cellulose saccharification rates, and lower total enzyme requirements51,52. These observations are consistent with a synergistic role of β-glucosidases in maintaining cellulose hydrolysis at higher solids loadings. Taxa-level dynamics of β-glucosidase expression were also observed (Supplementary Fig. 12). While members of multiple phyla produced β-glucosidases, the increase in abundance at high solids was driven by the phylum Dictyoglomi (Supplementary Fig. 12A). Distantly related to Caldicellulosiruptor spp., members of this phylum are known to be extremely thermophilic, possess rigid cell membranes, and produce thermostable enzymes5355.

Fig. 3. Analysis of CAZyme functional categories at higher solids loadings.

Fig. 3

(left side) Enumeration of quantified unique CAZyme proteins across different CAZyme families (GH, CE, PL) after organizing them based on functional annotation as described in Supplementary Data 3. (right side) Heatmap depicting the change in aggregate abundance of CAZymes in each functional category across solids loadings for each fraction. Log2 differences from 30 g/L solids in the respective fraction are shown. A two-tailed Welch’s t-test was performed with Benjamini Hochberg FDR correction for each solids loading versus the 30 g/L loading. + means adjusted (adj.) p-value ≤ 0.05 and * means adj. p-value ≤ 0.05 with absolute fold change ≥2× vs. respective 30 g/L condition. Exact p-values for each comparison are listed in Supplementary Data 6. SNT: supernatant, PC: planktonic cells, and SB: substrate bound fraction. Source data are provided as a Source Data file.

Hemicellulose debranching enzymes increase at higher solids

Multiple proteins across different GH and CE families were grouped into a variety of hemicellulolytic activities and their trends with increasing solids loading explored. These activities included: xylanases, xylosidases, arabinosidases, glucuronidases, fucosidases, galactosidases, mannosidases, and acetyl-xylan esterases (Fig. 3, Supplementary Fig. 11, Supplementary Data 7). Both the major xylan-hydrolyzing enzymatic classes, xylanases and xylosidases, were represented by numerous proteins and were most prevalent in the SB fraction (Fig. 3, Supplementary Fig. 12b, Supplementary Data 7). The total abundance of xylanases at 150 g/L solids loading decreased significantly in the SNT and PC fractions (~10-fold and ~2-fold, respectively) compared to 30 g/L, but remained relatively unchanged in the SB fraction (Fig. 3, Supplementary Fig. 12B). However, xylosidases exhibited a consistent increase at higher solids in the SB fraction (~3-fold in 120 g/L and 1.9-fold in 150 g/L) (Fig. 3, Supplementary Fig. 12B). Since xylan breakdown was undiminished at high solids loading, the observations here suggest xylosidases may be needed for maintaining overall xylan hydrolysis—similar to what was observed with β-glucosidases. Likewise, the importance of xylosidases at higher solids was further substantiated by community dynamics, where the increase was driven by a protein group from Bacteroidetes (Supplementary Fig. 12B), members of whom are key xylanolytic organisms in the rumen and human gut ecosystems56.

Along with xylan-acting enzymes, numerous hemicellulose-acting and debranching enzymes were expressed by the microbial community (Fig. 3, Supplementary Data 7). In particular, proteins with arabinosidase activity were highly represented (61 proteins) and demonstrated increased abundance with increasing solids loadings in all fractions (Fig. 3, Supplementary Fig. 12E). Even though arabinose makes up only ~4% of switchgrass carbohydrate content44,57, the disproportionately high number of proteins dedicated to cleaving branching arabinose residues or arabinan, coupled with their uniform increase in abundance, suggests a need for arabinosidases at higher solids. Similarly, despite galactose constituting <2% of the sugar content in switchgrass57, 47 proteins with galactosidase activity were detected. This group is composed primarily of family GH2 and contains CAZymes with both α- and β-galactosidase activities that cleave the branching galactose residues in mannan and xylan. Their abundance significantly increased in the PC fraction at higher solids (Fig. 3), a trend driven by the phylum Chloroflexi along with Firmicutes-Clostridia (Supplementary Fig 12F). The filamentous Chloroflexi are enriched in the PC and SB fractions and are known carbohydrate fermenters. They are likely involved in the degradation of polymeric organic compounds facilitating the growth of other bacteria58 to maintain overall solubilization. Conversely, the abundance of other enzymes considered important for solubilization, such as acetyl-xylan esterases (34 protein groups) and C6 phosphorylases (44 protein groups including cellobiose phosphorylases), decreased in abundance or remained relatively unchanged at higher solids loadings although they were well-represented (Fig. 3, Supplementary Data 7).

Pectin-acting enzymes increase in substrate-associated fractions

Pectin represents a small fraction of switchgrass on a mass basis (~2%), but it provides structural tenacity and contributes to its recalcitrance59,60. Numerous pectin-acting proteins representing different CAZyme families (PL, GH, and CE) were expressed by the microbiome, which can be roughly grouped into four functional categories: galacturonan lyase, rhamnogalacturonan lyase, rhamnosidase, and pectinesterase (Fig. 3, Supplementary Fig. 11, Supplementary Data 7). While pectinesterases and rhamnosidases had decreasing abundances at higher solids, significant increases in the two pectin lyases, driven by the phyla Firmicutes, were observed in PC and/or SB fractions at high solids. These pectin lyases can be particularly important because they cleave the alpha-1,4- linkages in esterified pectin (main chain) without requiring accessory enzymes or water molecules, as is the case with PLs60.

Bacterial AA6 proteins increase dramatically at higher solids

Auxiliary activities (AAs) are a recently constituted class of CAZymes that act on lignin or are involved in modification of polysaccharides such as cellulose in the plant cell wall61. A significant increase in the abundance of AA enzymes was observed across all three fractions as solids loadings increased (Fig. 2C). In the current study, 3 AA families were identified (Fig. 4A). AA2 (peroxidase) and AA3 (glucose-methanol-choline (GMC) oxidoreductase) were identified only in the PC fraction at relatively low abundance. The cumulative abundance of AA2 proteins had a decreasing trend with increasing solids loadings (Supplementary Fig. 13A). AA3 was an order of magnitude less abundant than AA2 and present only at higher solids but was identified by only one peptide (Supplementary Fig. 13B). In contrast, AA6 proteins were 1-2 orders of magnitude more abundant than AA2 and represented by multiple proteins (11/14 AA proteins mapping to family AA6) across the three fractions. AA6s exhibited significantly increased abundance at higher solids (~23-fold in SNT at 150 g/L); a trend that was consistently observed across all three sample fractions (Fig. 4A, B). Members of the phyla Firmicutes-Clostridia and Firmicutes-Bacilli were the primary contributors to the AA6 protein group, but the increase in abundance at higher solids was driven by the phylum Firmicutes-Bacilli (Fig. 4B). AA6 family enzymes are characterized as 1,4-benzoquinone reductases and catalyze the NADPH-mediated conversion p-benzoquinone to hydroquinone61 (Fig. 4C). Although the switchgrass was not pretreated in the current study, p-benzoquinone is a pretreatment- and lignin-derived growth/enzyme inhibitor that is difficult to remove, suggesting that AA6-like enzymes could be involved in biomass hydrolysate detoxification62,63. While recent studies have identified an oxidoreductase gene, ZMO1116, that can convert benzoquinone to non-toxic hydroquinone62,63, multiple sequence alignment analysis of the AA6 proteins identified here only revealed local sequence similarities to ZMO1116 (Supplementary Fig. 14), suggesting a potentially different mode of action in this system.

Fig. 4. The abundance of bacterial AA6 proteins as a function of higher solids loadings.

Fig. 4

A The number of proteins expressed from the different auxiliary activity (AA) families in each fraction. B The change in total abundance of AA6 proteins across solids loadings per fraction (right Y-axis, line plot) with relative abundance contributions by phyla (left Y-axis, bar plot). Error bars are ± standard deviation and * means two-tailed Welch’s t-test corrected p-value < 0.05 and absolute fold change ≥2× versus respective 30 g/L condition. C A proposed mechanistic hypothesis based on observed increases in protein abundances and known functions depicting the suggested role of AA6 proteins in enabling solubilization at higher solids loadings via ROS and Fenton chemistry. D The mean trends of proteins that are known to respond to oxidative stress across solids loadings per fraction. Bacterioferritin (Bfr; 18, 23, 20 proteins in SNT, PC, SB respectively), superoxide dismutase (SOD2;15,11,7) and superoxide reductase (SOR; 9, 10, 13). Mean Z-scores are shown with 50% confidence interval. + means two-tailed Welch’s t-test p-value < 0.05 and * means p-value < 0.05 and absolute fold change ≥2× versus respective 30 g/L condition. Exact p-values for each comparison are listed in Supplementary Data 9. SNT: supernatant, PC: planktonic cells, and SB: substrate bound fraction. Source data are provided as a Source Data file.

Similar to biochemical mechanisms observed in brown-rot fungi, AA6 in the bacterial community may also be involved in the production of extracellular oxyradicals for lignin modification. AA6, or quinone reductase in brown-rot fungi, is involved in the generation of extracellular Fenton reagents via redox cycling61,64. Unlike white-rot fungi, which degrade and utilize lignin, cellulolytic brown-rot fungi modify lignin using non-enzymatic, energetically inexpensive Fenton chemistry to breach the lignin barrier and enable enzyme access to the underlying polysaccharides65,66. Fenton reactions make use of peroxide (H2O2) and iron (Fe2+ to Fe3+) to produce reactive oxygen species (ROS), such as highly reactive hydroxyl radicals, which can non-specifically cleave lignin66. Since AA6 in the CAZy database (all annotated AA6 are from fungi) are suspected to produce Fenton reagents61, the bacterial community may follow a route similar to that of brown-rot fungi to effectively solubilize increasing concentrations of switchgrass (Fig. 4C). Although proteins annotated for direct production of H2O2 were not identified in the samples, multiple oxidases were observed (Supplementary Data 8). Additionally, hallmarks of oxidative stress such as superoxide dismutases67,68 (SOD2) and superoxide reductases68 (SOR) were also present in the samples, both of which increased significantly in abundance with increasing solids in one (SOR, in SB fraction) or all (SOD2) fractions (Fig. 4C, D, Supplementary Fig. 13C, D, Supplementary Data 9). Furthermore, bacterioferritins from multiple taxa were also detected and significantly increased in all three fractions (Fig. 4d, Supplementary Fig. 13C). Bacterioferritins are enzymes with ferroxidase activity; ferroxidases are upregulated in brown-rot fungi during Fenton reactions69. Additionally, as is the case in brown-rot fungi, limited proteins for lignin metabolism/assimilation were observed in the samples, indicating lignin cleavage may enhance enzyme accessibility rather than be used for energy (Supplementary Fig. 1520). Notably and in contrast to fungal AA6s which are known to act aerobically, these bacterial AA6s appear to have an anoxic mode of operation, which has also been observed in a recent study70. Specifically, these results suggest that in bacterial communities, AA6 enzymes (here produced by Firmicutes) could be involved in the generation of Fenton reagents and subsequent cleavage of the lignin barrier to increase the accessibility of lignocellulose to attack by CAZymes.

Discussion

The thermophilic, lignocellulose-fermenting, methanogenic anaerobic microbiome reported here exhibits a key feature desired for an industrial process: undiminished fractional carbohydrate solubilization with increasing substrate loading (Fig. 1). In search of clues as to how this is achieved, this study reports the first metaproteomic characterization of a microbiome as a function of substrate loading across different fractions, focusing on salient CAZyme and methanogenesis features. Comparing relative protein abundances at 150 g/L and 30 g/L substrate loadings in Fig. 5A, a greater than 2-fold increase was observed in one or more fractions for β-glucosidases, hemicellulose-debranching and xylosidase enzymes, pectinases, and AA6 enzymes, as well as all hydrogenotrophic methanogenesis pathway enzymes. The latter corroborates the observed increase in methane production as solids increased (Fig. 1C), and as described before in anaerobic thermophilic microbiomes71, is the preferred methanogenesis route under thermophilic temperatures72,73. Coupled with syntrophic acetate oxidation for acetate degradation, this metabolic process is performed by members of Euryarchaeota, who may not need to be proximal to lignocellulosic substrate. As can be seen from Fig. 5B, this process primarily occurred in the cell containing fractions, PC and SB, with the bulk of the activity in the PC fraction for 150 g/L.

Fig. 5. Microbiome functionally and spatially orchestrates enzyme expression to continue solubilization at high solids.

Fig. 5

A Average fold changes (ratios) in aggregate protein abundance of the major enzymatic categories at 150 g/L solids loading relative to 30 g/L solids loading compared across the three fractions. B The relative proportion of the respective fraction metaproteome that each of these enzymatic categories make up is shown here. This qualitative representation depicts the spatial distribution of these enzymatic categories among the three fractions and changes in their distribution from 30 g/L to 150 g/L. SNT: supernatant, PC: planktonic cells, and SB: substrate bound fraction. Source data are provided as a Source Data file.

Considering enzymatic deconstruction of lignocellulose, the largest increases were seen for β-glucosidases in the SNT, pectinases in the PC and SB fractions, and for AA6 family, debranching, and xylosidase enzymes in all fractions. The observation that the microbiome produces these particular enzymes instead of increasing amounts of backbone depolymerizing enzymes (such as endoglucanases and xylanases) suggests their importance for undiminished hydrolysis at high solids—potentially through removal of accumulated inhibitive (hemi)cellulose solubilization products (β-glucosidases and xylosidases) and enabling enzymatic access to the preferred carbohydrate polymers. For example, debranching of xylan “decorations” alters its interaction with other cell-wall polymers (cellulose and lignin) effectively decreasing the recalcitrance of feedstock74. Pectinases/polygalacturonases synergistically improve the hydrolytic efficiency of cellulases by removing pectin, and thereby improve access to cellulose7577. Finally, the marked increase in AA6 enzymes—purported to utilize Fenton chemistry for deconstruction and permeation of the lignin barrier—fits this paradigm. These AA6 family enzymes are also particularly notable since oxidative reactions are not generally associated with anaerobic lignocellulose fermentation. However, Schalk et al.43 recently documented a role for fungal driven AA6-mediated Fenton chemistry in the termite gut. Although further investigation is needed in the mechanism proposed here, the observations suggest that AA6 plays an important synergistic role with other CAZyme categories for high solids deconstruction under anaerobic thermophilic conditions. Our results also provide insight into the spatial location of bacterial enzymes, their phylogenetic origin(s), and how these are impacted by substrate loading (Fig. 5B). At the substrate loading of 30 g/L, β-glucosidases and AA6 enzymes are relatively more prominent in the substrate-bound fraction, but at 150 g/L become most prominent in the supernatant. In contrast, hemicellulose-debranching enzymes and pectinases are most prominent in the supernatant at 30 g/L, but in the substrate-bound fraction at 150 g/L. The clear difference in functional distribution across fractions highlights the importance of this approach to provide both relevant spatial location information as well as enhancements to the depth of the metaproteomic measurement.

With increasing solids, the collective protein synthesis effort of the microbiome essentially shifts toward enzymes that act on compounds known to be inhibitors of deconstruction - cellobiose7880, hemicellulose81,82, pectin83,84, and lignin8587, with roughly constant effort devoted to the CAZymes that mediate mainline (hemi)cellulolytic deconstruction. These observations provided by metaproteomics provide an unprecedented level of molecular detail which will be critical to inform development of defined cultures for conversion of cellulosic biomass to products other than methane, which thus far exhibit decreasing fractional carbohydrate solubilization with increasing substrate loading20,21,88.

Methods

Feedstocks and microbiome origin

Mid-season switchgrass (Panicum virgatum L., Cave in Rock) harvested in June 2012 at Rock Springs Research Farm (Spring Mills, PA, 40.71040290971864°N, −77.94974506560891 °W) was used as the substrate in this study. Characterization, preparation and storage information is described in detail in Liang et al.44. An existing microbial community was used in this study, as described in Liang et al.44. The inoculum for the initial enrichment was obtained by sampling the anaerobic digester from Vermont Technical College (Randolph, Vermont). The enrichment was matured as a microbiome for over 120 days before experiments with different residence times were initiated, the results of which are described in Liang et al.44. Reactor HR1 (formerly named R2 in Liang et al.44) ran at various residence times (RT) and was cultivated for a total 214 days at 30 g/L solids loading before it was transitioned to 75 g/L to generate the additional higher solids data for the study described here. A detailed characterization of the microbiome can be found in Liang et al.44. Potential microbial contributions via the addition of unsterilized feedstock to the microbiome were investigated by Liang et al.44, and found to be small.

Cultivation medium

Mixed enrichment (ME) medium was used in this study44, except that the amount of Wolfe’s vitamin solution was doubled for 75 g/L and up compared to 30 g/L. The following stock solutions were used: Wolfe’s modified elixir stock solution89 (50×), Wolfe’s vitamin solution89 (50×), Ammonium & Phosphate stock solution (50×) and Ferrous iron stock solution (1000×) are described in Liang et al.44. The solutions were either autoclaved (Ferrous iron stock solution) or filter-sterilized (all other stock solutions). Switchgrass was added (without sterilization) to the reactor to a concentration of 30, 75, 120, or 150 g/L (as-is basis with 6.05% moisture) according to the solids loading per feeding period. The pH of the medium was adjusted with 1 N Sodium Hydroxide solution after feeding to maintain a value above 6.0 as needed.

Cultivation system

The bioreactor set-up used for this study was a Qplus multi-bioreactor system (Sartorius Stedim, Bohemia NY) which includes Module Operator Service Program (MFCS) data collection software (3.0, level 43, 2008 Sartorius Stedim Systems) recording primary fermentation data (pH, temperature, stirrer rpm, pH controlled base addition). The scientific plotting package Veusz was used for visualization of primary fermentation and analyzed data (v3.4.01, https://veusz.github.io/). Additional details for the bioreactor set-up are described in Liang et al.44. The 1-liter working volume bioreactor was stirred at 280 rpm with the temperature controlled at 55 °C. The set up was operated semi-continuously at a residence time of 10 days; every day 10% of working volume (100 mL) of the broth was removed and replaced by fresh switchgrass and media components totaling 100 mL in volume. The removed broth was used for analysis. When the solids loading in the feed was equal to 150 g/L, the stirring speed for HR1 was increased to 500 rpm for one-minute right after the sampling and feeding event to ensure efficient mixing. Feeding aliquots of 100 mL of sterilized growth medium + switchgrass with pH of 8.0 at room temperature was sufficient to keep the pH in the bioreactor above 6.0 for all residence times.

Cultivation start-up and operation

As described in Liang et al.44, prior to this study the bioreactor was operated semi-continuously for 214 days, fed with medium containing 30 g/L switchgrass at various residence times (20–3.3 days). For an RT of 10 days and 30 g/L solids loading the total cultivation time was 50 days of operation as described in Liang et al.44 and fundamental data and samples for subsequent analysis for that solids loading were incorporated into the results described here. For the current study the reactors were fed with medium containing increasing loadings of switchgrass starting at 75 g/L, 120 g/L and 150 g/L. At the start of this study the solids loading was increased from 30 to 75 g/L and the RT fixed at 10 days (from 3.3 days). The frequency of slurry removal and feeding was maintained at 10 times per RT resulting in 1 feeding-sampling event each day for the 10-day residence time and the length of the overall cultivation time.

The microbiome was operated for 506 days after transition to 75 g/L and RT = 10 days. It was operated for 204 days at 75 g/L switchgrass loading in the feed and was then fed 120 g/L switchgrass for 85 days. Finally, the solids loading was increased to 150 g/L and maintained at this level for 217 days. The reactor went through medium component optimization during the first 150 days at 75 g/L solids loading and RT = 10 days and was fed with the medium described earlier from 151th day on. Steady state at this condition was considered to start from 161th day. For all the other conditions, the reactor was considered to have reached steady state after 3 RTs. Total reactor operating length and steady state length at each condition is summarized in Supplementary Table 1 and in Supplementary Fig. 1. Slurry was withdrawn via a 50 mL pipette by removing a plug from the head-plate as described in Liang et al.44. When the solids loading was 150 g/L, the opening of the 50 mL pipette was cut wider to enable sample removal, the pipetted-volume indication was adjusted accordingly (total volume equaling 50 mL).

Biogas measurement

Measurement of biogas production rate was conducted using a wet tip gas meter (www.wettipgasmeter.com) filled with acidified water (pH < 2). At each feeding, data was recorded either manually or via a data logger (HOBO Pendant Event, 64 K, Onset Computer Corporation, Pocasset, MA). The concentrations of CH4, CO2, H2 and N2 were measured by a Model 310 Education Gas Chromatograph (SRI Instruments, Torrance, CA and a 1.8 m x 3 mm stainless steel HayeSep D packed column at 50 °C) with a thermal conductivity detector (150 °C) using helium (20 mL/min) for CH4, CO2 and N2 and nitrogen (13.5 mL/min) for H2 as carrier gases. For data see Supplementary Table 2 and Supplementary Fig. 1).

Fractional carbohydrate solubilization

Measurement of fractional carbohydrate solubilization, FCS, is based on the amount of glucose, xylose and arabinose in the solids before and after fermentation. Sugar content is determined by acid hydrolysis via the quantitative saccharification protocol90,91. Whole broth samples were centrifuged for 10–11 min at 2,800 × g (30–120 g/L) or 12,000 × g (150 g/L). Pelleted solids were dried overnight in a 60 °C oven (30–120 g/L) or by lyophilization (150 g/L), weighed and then subjected to quantitative saccharification. The hydrolysis products monomeric glucose, xylose and arabinose were then quantified by HPLC (Waters, Milford, MA) with an Aminex HPX-87H column (Bio-Rad, Hercules, CA) at 60 °C and detected by refractive index. HPLC eluent was 5 mM sulfuric acid with a flow rate of 0.6 mL/min. FCS equals to the mass of the initial carbohydrate minus the mass of the final carbohydrate divided by the initial carbohydrate44. Steady state FCS data as shown in Fig. 1b is based on averages of two or more samples after at least 3 residence times following a change in solids loading90. For data see Supplementary Table 2 and Supplementary Fig. 1).

Quantification of volatile fatty acids

Measurement of volatile fatty acids (VFA) was done by analyzing filtered liquid broth samples for formic acid, acetic acid, propionic acid, butyric acid, iso-butyric acid and valeric acid. All measurements were performed in duplicate against a known standard (Volatile-free acid mix standard, 46975-U SUPELCO, Sigma-Aldrich). Analysis was by HPLC with an Aminex HPX-87H column as described for Fractional Carbohydrate Solubilization. For data see Supplementary Table 2 and Supplementary Fig. 1).

Assembly of the bioreactor gene catalog for metaproteomics analysis

The metagenomic data generated in Liang et al.44, which described the development of a stable switchgrass-fermenting microbiome at various residence times and temperatures, was examined for functional gene profile diversity. Metagenomic sequence reads of 15 samples were downloaded from the US Department of Energy’s Joint Genome Institute website under proposal ID 502908 (https://genome.jgi.doe.gov/portal/ChaofEnCultures/ChaofEnCultures.info.html). All metagenomic sequence files were combined into a single file in a conglomerative format to capture as many prokaryote coding regions as possible. Quality filtering and trimming was performed using Atropos with a minimum sequence length of 50 nucleotides and a minimum Phred score of 30 to ensure only the highest quality sequences remained. The high-quality sequences were then assembled into 506,732 contigs (a total of 777,641,587 bp) using MEGAHIT and prokaryote genes identified using Prodigal. The minimum contig size was 200 bp and the maximum contig was 606,950 bp with an average of 1,535 bp in length and the N50 of 6,420 bp. In all, 1,076,153 prokaryote genes were identified from these contigs. The genes were then assembled into a bioreactor gene catalog in FASTA format for use in metaproteomics analysis. The bioreactor gene set comprised of 485,820 full length prokaryote genes and 590,333 fragmented prokaryote genes. For metaproteomic analysis, only the full-length genes were kept for inclusion in the bioreactor gene atlas to ensure high quality results.

Sample fractionation, preparation, and measurements for 2D-LC-MS/MS-based metaproteomics

Switchgrass fermentation whole-broth samples for each of the four solids loadings (30 g/L, n = 4; 75 g/L, n = 3; 120 g/L, n = 4; and 150 g/L, n = 5) at steady-state conditions were collected and stored at −80 °C for analysis. For metaproteomic measurements, samples were thawed under cold running water to keep the temperature at 4 °C. Each sample was then fractionated into three phases by centrifugation to enrich for the microbes and enzymes adhered to the lignocellulosic substrate (or substrate bound fraction- “SB”), the planktonic microbes (or planktonic cells fraction- “PC”), and proteins which were secreted (supernatant fraction- “SNT”) as described. For centrifugation-mediated enrichment, each sample was centrifuged at 200 × g for 10 min to pellet the residual solid substrates and bound microbial matter (SB fraction), and the resulting pre-supernatant primarily containing the free cells and enzymes was transferred into a new tube. This pre-supernatant was further centrifuged at 1000 × g for 20 min. The resulting pellet was the PC fraction, enriched for unbound microbial cells and the resulting supernatant was the SNT fraction, enriched for free proteins. Each segregated fraction was then analyzed separately for metaproteomics.

Each sample combination of loading and fraction was lysed by bead beating with 0.15 mm zirconium oxide beads in Tris-HCl (100 mM at pH 8.0) containing 4% SDS (sodium dodecyl sulfate, Sigma) and 10 mM DL-Dithiothreitol (Sigma). Resulting lysates were precleared by centrifugation at 21,000 × g for 10 min, incubated at 90 °C for 10 min to denature proteins, and adjusted to 30 mM IAA (iodoacetamide, Sigma) followed by a 20-minute incubation in darkness at room temperature to alkylate/block cysteine residues. Crude protein was isolated and cleaned up by chloroform-methanol-extraction. Resulting protein pellets were washed with methanol, air dried, and re-solubilized in freshly prepared ABC (ammonium bicarbonate, Sigma) buffer (100 mM, pH 8.0) containing 4% SDC (sodium deoxycholate, Sigma). Protein concentrations were determined using the BCA (Bicinchoninic Acid) protein assay (Pierce). Fixed amount of protein sample (300 μg) was concentrated on a 10 kDa MWCO centrifugal concentrator (Vivaspin500 PES; Sartorius), rinsed with ABC buffer, and digested in situ with MS grade trypsin protease (1:75 w/w; Pierce-Thermo Scientific) overnight at room temperature, and again for 3 h after fresh trypsin addition. After digestion, samples were filtered through the concentrator membrane to remove under-digested proteins and collect tryptic peptides. The resulting peptide solution was acidified with formic acid (FA; LC/MS grade) to a final concentration of 1% to precipitate remaining SDC, followed by additional removal of the precipitate using water-saturated ethyl-acetate. Peptides were then concentrated to dryness using a SpeedVac, resuspended in 0.5% formic acid, quantified by BCA protein assay, and analyzed by 2D-LC-MS/MS using a Vanquish UHPLC system (Thermo Scientific) with autosampler coupled to an Orbitrap Q Exactive Plus mass spectrometer (Thermo Scientific).

Fixed amounts of peptides depending on the complexity of the fraction: 10 μg peptides for SB, 14 μg for PC, and 6 μg for SNT fractions, respectively, were loaded into an in-house built triphasic MudPIT back column (100 μm ID packed with RP-SCX-RP; RP- reversed-phase C18 resin, 5 μm Kinetex, Phenomenex; SCX- strong-cation exchange, 5 μm Luna) coupled to an in-house pulled nanospray emitter (75 μm ID) packed with 30 cm of 5 μm RP C18 resin for online 2D HPLC separation. Peptides from each sample type were then trapped, desalted, separated, and analyzed over successive salt cuts of ammonium acetate (35, 50, 100, and 500 mM), each followed by an organic gradient to elute peptides. The eluting peptides were measured and sequenced by the mass spectrometer (operated via Xcalibur v.4.2.47, Thermo Scientific) in data-dependent mode.

Metaproteomics data analysis

The acquired peptide fragmentation spectra were searched against the metaproteome database generated from the 30 g/L samples’ metagenomes (as described above) appended with common contaminant proteins, and the switchgrass (Panicum virgatum) proteome employing a target decoy approach using the MS Amanda algorithm (v2.0) integrated in Proteome Discoverer software (version 2.3.0.523, Thermo Scientific). The resulting peptide spectrum matches (PSMs) were required to be at least 5 amino acids long, fully tryptic with a maximum 2 missed cleavages, contain a static modification of 57.0214 Da on cysteine (carbamidomethylated) residues, and a dynamic modification of 15.9949 Da on methionine (oxidized) residues. False-discovery rates (FDRs) were controlled at 1% (IMP-Elutator node in Proteome Discoverer) at both the PSM and peptide levels. Peptides were quantified by extracting the chromatographic area-under-the-curve (AUC) and match between runs was conducted by performing a grouped consensus step in Proteome Discoverer per fraction. To bypass ambiguities related to shared peptides among very similar to identical proteins, the proteins with peptide evidence were grouped based on sequence homology and peptide evidence by enabling protein grouping in Proteome Discoverer. The peptides were then assigned to protein groups (represented by master seed proteins) utilizing the inbuilt parsimony principle. The AUCs of peptides uniquely mapping to a protein group were summed to obtain protein (group) abundances and protein groups with at least 1 unique peptide were considered for further analysis. The resulting protein lists were filtered to select highly confident protein groups (FDR of ≤1%) which had at least 2 MS/MS spectra captured, were detected in >2 samples across the fraction experimental set. The resulting protein abundances were bioinformatically processed as described previously92. Briefly, protein (group) abundances in each fraction were log2 transformed and distributions were normalized (LOESS normalization followed by median centering) across samples using InfernoRDN. Missing values were imputed to simulate the mass spectrometer’s limit of detection using mean minus 2.2 times the standard deviation with a width of 0.3 times the standard deviation in Perseus. For each fraction, significant differences in protein abundances between the 30 g/L condition and the higher solids loadings were assessed by two-tailed t-test at a Benjamini-Hochberg FDR corrected p-value of ≤0.05.

For functional analysis, the metaproteome database was functionally annotated with Kyoto Encyclopedia of Genes and Genomes (KEGG) orthologous (KO) terms, E.C. numbers, and definitions as obtained from GhostKOALA93, eggNOG-mapper94, and KEGG GHOSTX searches. Protein-level phylum level taxonomical assignments were assigned to each protein group using GhostKOALA after implementing a cutoff of ≥100 score. Additional taxonomic information as needed for select proteins was inferred by tblastn analysis against the metagenomic bins described in the previous study for the metagenomes. Additional taxonomic annotations for the metagenomic bins were obtained using CheckM and GTDB-Tk analyses performed in KBase as needed. For identification for CAZymes in the metaproteome, dbCAN2 meta server49 was run and proteins were qualified as CAZymes based on software recommendations and after manual curation. The dbCAN2 searches were performed using the HMMER, DIAMOND, and Hotpep tools and proteins annotated by ≥2 of these tools were qualified as CAZymes. Sequences qualified as CAZyme by ˂2 tools were manually inspected. HMMER annotations took priority over DIAMOND and Hotpep tools. Additionally, assignment of dockerin domains was done using InterProScan for all the proteins in the metaproteome database. For assessment of proteins related to cellulosomes, the presence of CAZyme domain with a dockerin domain was needed. KEGG and eggNOG assignments for each identified protein group were used to infer the potential function (enzymatic activity) of the proteins. Proteins related to methanogenesis were identified using KEGG and MetaCyc annotations. To examine statistical variance of categories of proteins in each fraction, pairwise Welch’s t-tests were performed between the respective solids loadings and 30 g/L followed by a Benjamini-Hochberg FDR correction. While comparisons were considered statistically significant if resulting adjusted p-value was ≤0.05, in most cases a cut off of 2× fold change was applied to assess significance. Statistics were calculated using Python, in Excel, or in Perseus. All figures were rendered using Python, R, Excel, or BioRender. Python scripting was done using the following libraries: Pandas, NumPy, Seaborn (https://seaborn.pydata.org/), Matplotlib. Venn diagrams were created using BioVenn. Metabolic pathway information was obtained from MetaCyc and KEGG mapper. For sequence alignment of ZMO1116 sequence to the identified AA6 sequences, different multiple sequence alignment tools- MUSCLE, MAFFT, and ClustalOmega were utilized by choosing the protein alignment option with the default parameters.

1D-LC-MS/MS metaproteomics measurement and data analysis for estimation of microbial cell density

For estimation of cell density, a volume-based metaproteomics analysis was conducted for steady state aspirates at each of the four solids loading conditions described above. Samples were thawed at 4 °C, mixed, and equal volumes from each sample were processed. Three methodological replicates for each biological replicate were processed. Samples were lysed by bead-beating in Tris-HCl (100 mM at pH 8.0) using 0.15 mm zirconium oxide beads, followed by adjustment to 4% SDS, heat-treatment (95 °C for 10 min), and centrifugation (21,000 g for 10 min). Samples were adjusted to 10 mM DL-Dithiothreitol (10 min at 90 °C) to reduce proteins, cysteines alkylated by 30 mM iodoacetamide (20 min incubation in darkness) and cleaned up via protein aggregation capture95. Crude protein was estimated by a Nanodrop OneC spectrophotometer (Thermo Scientific) using 205 nm absorbance. Aggregated protein (on magnetic Sera-Mag (GE Healthcare) beads) was then digested with MS-grade trypsin (fixed amount; Pierce) in 100 mM Tris-HCl, pH 8.0 overnight at 37 °C, and again for 3 h at 37 °C the following day. Tryptic peptides were acidified to 0.5% formic acid, filtered through a 10 kDa MWCO spin filter (Vivaspin 500; Sartorius), and quantified by Nanodrop OneC. Samples were loaded by volume, i.e., 3 µL of peptides from each sample, and analyzed by 1D LC-MS/MS using a Vanquish uHPLC coupled directly to an Orbitrap Q Exactive mass spectrometer (Thermo Scientific), as previously described92. Peptides were separated by a 180 min organic gradient across an in-house pulled nanospray emitter (inner diameter, 75 µm) packed with 15 cm of 1.7-micron Kinetex C18 reversed-phase resin (Phenomenex). Eluting peptides were measured and sequenced by data-dependent acquisition. Peptide fragmentation spectra were searched against the concatenated databases and quantified using Proteome Discoverer software as described above.

A total of 14,386 peptides from 26,127 detected peptides were considered quantifiable after removal of low confidence hits and contaminants and were used for further analyses (Supplementary Data 1). Functional and taxonomic annotations of the peptides were derived by mapping back to the corresponding proteins. Peptide abundances for peptides mapping to a taxonomic or functional category were summed to identify trends of the category with solids loadings. In-house scripts were used to ascertain the taxonomic source of a peptide if it mapped to >1 protein with different taxonomic origins based on GhostKOALA assignment. Peptides were qualified as microbial (Bacteria or Archaea), plant (Switchgrass or other plant sequences), or mixed origin (indistinguishable). Peptides from cellulolytic organisms were determined if they mapped to proteins from CAZymes as determined by dbCAN2. Peptides from dockerin were determined from InterProScan. Peptides from methanogenic organisms were determined if they belonged to Euryarchaeota proteins. Pairwise comparisons for each solids loadings against the 30 g/L condition were conducted using Welch’s t-test and a cutoff of p-value ≤ 0.05 was used. Python and R libraries were used to generate figures.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Supplementary information

Peer Review File (390.6KB, pdf)
41467_2022_31433_MOESM3_ESM.pdf (26.6KB, pdf)

Description Additional Supplementary Files

Supplementary Data 1 (9.2MB, xlsx)
Supplementary Data 2 (1.4MB, xlsx)
Supplementary Data 3 (8.5MB, xlsx)
Supplementary Data 4 (6MB, xlsx)
Supplementary Data 5 (153.9KB, xlsx)
Supplementary Data 6 (789.5KB, xlsx)
Supplementary Data 7 (15.8KB, xlsx)
Supplementary Data 8 (10.5KB, xlsx)
Supplementary Data 9 (47.4KB, xlsx)
Reporting Summary (456.6KB, pdf)

Acknowledgements

Oak Ridge National Laboratory: Jason Witham (bioinformatics), Dawn M. Klingeman (sample preparation), Steven D. Brown (coordination) and James Elkins (microbial community insights). The Pennsylvania State University: Tom Richard (microbial enrichment insights). Dartmouth College: Xiongjun Shao (assistance in experimental and method design), Sean Murphy, Jules Wheaton, Lion Herfort, Liang Tian and Anela Arifi (assistance with sample analysis and microbiome operation). Funding was provided by the BioEnergy Science Center and the Center for Bioenergy Innovation, both at the U.S. Department of Energy (DOE) Research Center supported by the Office of Biological and Environmental Research in the DOE Office of Science.

Source data

Source Data (802.3KB, xlsx)

Author contributions

E.K.H., L.R.L., P.C. R.J.G., and R.L.H. designed the study. X.L. and E.K.H. performed and analyzed the fermentation experiments. P.C., S.P., and R.J.G. conducted the metaproteomics measurements. J.C.E. co-assembled the metagenomes and generated the metaproteome database. P.C., R.J.G, and R.L.H performed all the metaproteomics data analyses. E.K.H. and Y.J.B. helped in the interpretation of CAZyme results, Y.J.B. co-wrote the paper. P.C., E.K.H., R.J.G., X.L., R.L.H. and L.R.L. wrote the paper. All authors edited and reviewed the paper.

Peer review

Peer review information

Nature Communications thanks Jagroop Pandhal and Fabio Squina for their contribution to the peer review of this work. Peer reviewer reports are available.

Data availability

The data on solubilization and microbiome performance generated in this study is available in the supplementary information files. All proteomics raw mass spectra used for protein quantification in this study are available at the ProteomeXchange Consortium via the MassIVE repository (MassIVE accession: MSV000088319 [https://massive.ucsd.edu/ProteoSAFe/dataset.jsp?task=05b15f47bc0145759b12f5da310d3a6a]; ProteomeXchange accession: PXD029582). All proteome abundance data along with the mapped annotations are available as Supplementary Data files. Source data are provided with this paper.

Competing interests

L.R.L. is a shareholder in a startup company focusing on cellulosic biofuel production. All other authors declare no competing interests.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

These authors contributed equally: Payal Chirania, Evert K. Holwerda.

Contributor Information

Robert L. Hettich, Email: hettichrl@ornl.gov

Lee R. Lynd, Email: lee.r.lynd@dartmouth.edu

Supplementary information

The online version contains supplementary material available at 10.1038/s41467-022-31433-x.

References

  • 1.Lynd LR. The grand challenge of cellulosic biofuels. Nat. Biotechnol. 2017;35:912–915. doi: 10.1038/nbt.3976. [DOI] [PubMed] [Google Scholar]
  • 2.Lynd LR, Wyman CE, Gerngross TU. Biocommodity engineering. Biotechnol. Prog. 1999;15:777–793. doi: 10.1021/bp990109e. [DOI] [PubMed] [Google Scholar]
  • 3.Himmel ME, et al. Biomass recalcitrance: engineering plants and enzymes for biofuels production. Science. 2007;315:804–807. doi: 10.1126/science.1137016. [DOI] [PubMed] [Google Scholar]
  • 4.Lynd LR, Weimer PJ, van Zyl WH, Pretorius IS. Microbial cellulose utilization: Fundamentals and biotechnology (vol 66, pg 506, 2002) Microbiol Mol. Biol. R. 2002;66:739–739. doi: 10.1128/MMBR.66.4.739.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Modenbach AA, Nokes SE. Enzymatic hydrolysis of biomass at high-solids loadings: a review. Biomass-. Bioenerg. 2013;56:526–544. doi: 10.1016/j.biombioe.2013.05.031. [DOI] [Google Scholar]
  • 6.Chen XW, et al. DMR (deacetylation and mechanical refining) processing of corn stover achieves high monomeric sugar concentrations (230 g L−1) during enzymatic hydrolysis and high ethanol concentrations (>10% v/v) during fermentation without hydrolysate purification or concentration. Energ. Environ. Sci. 2016;9:1237–1245. doi: 10.1039/C5EE03718B. [DOI] [Google Scholar]
  • 7.Lynd LR, et al. Toward low-cost biological and hybrid biological/catalytic conversion of cellulosic biomass to fuels. Energ. Environ. Sci. 2022;15:938–990. doi: 10.1039/D1EE02540F. [DOI] [Google Scholar]
  • 8.Jorgensen H, Vibe-Pedersen J, Larsen J, Felby C. Liquefaction of lignocellulose at high-solids concentrations. Biotechnol. Bioeng. 2007;96:862–870. doi: 10.1002/bit.21115. [DOI] [PubMed] [Google Scholar]
  • 9.Hodge DB, Karim MN, Schell DJ, McMillan JD. Model-based fed-batch for high-solids enzymatic cellulose hydrolysis. Appl. Biochem. Biotechnol. 2009;152:88–107. doi: 10.1007/s12010-008-8217-0. [DOI] [PubMed] [Google Scholar]
  • 10.Du J, et al. Enzymatic liquefaction and saccharification of pretreated corn stover at high-solids concentrations in a horizontal rotating bioreactor. Bioprocess Biosyst. Eng. 2014;37:173–181. doi: 10.1007/s00449-013-0983-6. [DOI] [PubMed] [Google Scholar]
  • 11.Liotta F, et al. Effect of total solids content on methane and volatile fatty acid production in anaerobic digestion of food waste. Waste Manag Res. 2014;32:947–953. doi: 10.1177/0734242X14550740. [DOI] [PubMed] [Google Scholar]
  • 12.Liotta F, et al. Modified Anaerobic Digestion Model No.1 for dry and semi-dry anaerobic digestion of solid organic waste. Environ. Technol. 2015;36:870–880. doi: 10.1080/09593330.2014.965226. [DOI] [PubMed] [Google Scholar]
  • 13.Sawatdeenarunat C, Surendra KC, Takara D, Oechsner H, Khanal SK. Anaerobic digestion of lignocellulosic biomass: challenges and opportunities. Bioresour. Technol. 2015;178:178–186. doi: 10.1016/j.biortech.2014.09.103. [DOI] [PubMed] [Google Scholar]
  • 14.Abbassi-Guendouz A, et al. Total solids content drives high solid anaerobic digestion via mass transfer limitation. Bioresour. Technol. 2012;111:55–61. doi: 10.1016/j.biortech.2012.01.174. [DOI] [PubMed] [Google Scholar]
  • 15.Wang H, et al. Establishing practical strategies to run high loading corn stover anaerobic digestion: methane production performance and microbial responses. Bioresour. Technol. 2020;310:123364. doi: 10.1016/j.biortech.2020.123364. [DOI] [PubMed] [Google Scholar]
  • 16.Motte, J. C. et al. Total solids content: a key parameter of metabolic pathways in dry anaerobic digestion. Biotechnol. Biofuels6, Artn 164 10.1186/1754-6834-6-164 (2013). [DOI] [PMC free article] [PubMed]
  • 17.Du J, et al. Identifying and overcoming the effect of mass transfer limitation on decreased yield in enzymatic hydrolysis of lignocellulose at high solid concentrations. Bioresour. Technol. 2017;229:88–95. doi: 10.1016/j.biortech.2017.01.011. [DOI] [PubMed] [Google Scholar]
  • 18.Kristensen JB, Felby C, Jorgensen H. Yield-determining factors in high-solids enzymatic hydrolysis of lignocellulose. Biotechnol. Biofuels. 2009;2:11. doi: 10.1186/1754-6834-2-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Li C, Mortelmaier C, Winter J, Gallert C. Effect of moisture of municipal biowaste on start-up and efficiency of mesophilic and thermophilic dry anaerobic digestion. Bioresour. Technol. 2014;168:23–32. doi: 10.1016/j.biortech.2014.02.118. [DOI] [PubMed] [Google Scholar]
  • 20.Verbeke TJ, Garcia GM, Elkins JG. The effect of switchgrass loadings on feedstock solubilization and biofuel production by Clostridium thermocellum. Biotechnol. Biofuels. 2017;10:1–9. doi: 10.1186/s13068-017-0917-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Shao, X. J., Murphy, S. J. & Lynd, L. R. Characterization of reduced carbohydrate solubilization during Clostridium thermocellum fermentation with high switchgrass concentrations. Biomass Bioenerg.139, ARTN 105623, 10.1016/j.biombioe.2020.105623 (2020).
  • 22.Holwerda EK, et al. Metabolic and evolutionary responses of Clostridium thermocellum to genetic interventions aimed at improving ethanol production. Biotechnol. Biofuels. 2020;13:40. doi: 10.1186/s13068-020-01680-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Holwerda EK, et al. The exometabolome of Clostridium thermocellum reveals overflow metabolism at high cellulose loading. Biotechnol. Biofuels. 2014;7:155. doi: 10.1186/s13068-014-0155-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Bayer EA, Lamed R, Himmel ME. The potential of cellulases and cellulosomes for cellulosic waste management. Curr. Opin. Biotechnol. 2007;18:237–245. doi: 10.1016/j.copbio.2007.04.004. [DOI] [PubMed] [Google Scholar]
  • 25.Cantarel BL, et al. The Carbohydrate-Active EnZymes database (CAZy): an expert resource for Glycogenomics. Nucleic Acids Res. 2009;37:D233–D238. doi: 10.1093/nar/gkn663. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Lombard V, Golaconda Ramulu H, Drula E, Coutinho PM, Henrissat B. The carbohydrate-active enzymes database (CAZy) in 2013. Nucleic Acids Res. 2014;42:D490–D495. doi: 10.1093/nar/gkt1178. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Nelson MC, Morrison M, Yu Z. A meta-analysis of the microbial diversity observed in anaerobic digesters. Bioresour. Technol. 2011;102:3730–3739. doi: 10.1016/j.biortech.2010.11.119. [DOI] [PubMed] [Google Scholar]
  • 28.Sundberg C, et al. 454 pyrosequencing analyses of bacterial and archaeal richness in 21 full-scale biogas digesters. FEMS Microbiol Ecol. 2013;85:612–626. doi: 10.1111/1574-6941.12148. [DOI] [PubMed] [Google Scholar]
  • 29.Ma, S. et al. A microbial gene catalog of anaerobic digestion from full-scale biogas plants. Gigascience10, 10.1093/gigascience/giaa164 (2021). [DOI] [PMC free article] [PubMed]
  • 30.Allgaier M, et al. Targeted discovery of glycoside hydrolases from a switchgrass-adapted compost community. PLoS ONE. 2010;5:e8812. doi: 10.1371/journal.pone.0008812. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.D’Haeseleer P, et al. Proteogenomic analysis of a thermophilic bacterial consortium adapted to deconstruct switchgrass. PLoS ONE. 2013;8:e68465. doi: 10.1371/journal.pone.0068465. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Lillington SP, Leggieri PA, Heom KA, O’Malley MA. Nature’s recyclers: anaerobic microbial communities drive crude biomass deconstruction. Curr. Opin. Biotech. 2020;62:38–47. doi: 10.1016/j.copbio.2019.08.015. [DOI] [PubMed] [Google Scholar]
  • 33.Lim JW, Park T, Tong YW, Yu Z. The microbiome driving anaerobic digestion and microbial analysis. Adv. Bioenergy. 2020;5:1–61. doi: 10.1016/bs.aibe.2020.04.001. [DOI] [Google Scholar]
  • 34.Kougias, P. G. et al. Spatial distribution and diverse metabolic functions of lignocellulose-degrading uncultured bacteria as revealed by genome-centric metagenomics. Appl. Environ. Microb.84, 10.1128/aem.01244-18 (2018). [DOI] [PMC free article] [PubMed]
  • 35.Campanaro S, et al. Metagenomic analysis and functional characterization of the biogas microbiome using high throughput shotgun sequencing and a novel binning strategy. Biotechnol. Biofuels. 2016;9:26. doi: 10.1186/s13068-016-0441-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.van der Lelie D, et al. The metagenome of an anaerobic microbial community decomposing poplar wood chips. PLoS ONE. 2012;7:e36740. doi: 10.1371/journal.pone.0036740. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Comtet-Marre S, et al. Metatranscriptomics reveals the active bacterial and eukaryotic fibrolytic communities in the rumen of dairy cow fed a mixed diet. Front Microbiol. 2017;8:67. doi: 10.3389/fmicb.2017.00067. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Svartström O, et al. Ninety-nine de novo assembled genomes from the moose (Alces alces) rumen microbiome provide new insights into microbial plant biomass degradation. ISME J. 2017;11:2538–2551. doi: 10.1038/ismej.2017.108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Campanaro S, et al. New insights from the biogas microbiome by comprehensive genome-resolved metagenomics of nearly 1600 species originating from multiple anaerobic digesters. Biotechnol. Biofuels. 2020;13:25. doi: 10.1186/s13068-020-01679-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Jiang C, et al. Characterizing the growing microorganisms at species level in 46 anaerobic digesters at Danish wastewater treatment plants: a six-year survey on microbial community structure and key drivers. Water Res. 2021;193:116871. doi: 10.1016/j.watres.2021.116871. [DOI] [PubMed] [Google Scholar]
  • 41.Tomazetto, G., Pimentel, A. C., Wibberg, D., Dixon, N. & Squina, F. M. Multi-omic Directed Discovery of Cellulosomes, Polysaccharide Utilization Loci, and Lignocellulases from an Enriched Rumen Anaerobic Consortium. Appl Environ Microbiol86, 10.1128/AEM.00199-20 (2020). [DOI] [PMC free article] [PubMed]
  • 42.Liu N, et al. Functional metagenomics reveals abundant polysaccharide-degrading gene clusters and cellobiose utilization pathways within gut microbiota of a wood-feeding higher termite. ISME J. 2019;13:104–117. doi: 10.1038/s41396-018-0255-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Schalk F, et al. The termite fungal cultivar termitomyces combines diverse enzymes and oxidative reactions for plant biomass conversion. mBio. 2021;12:e0355120. doi: 10.1128/mBio.03551-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Liang X, et al. Development and characterization of stable anaerobic thermophilic methanogenic microbiomes fermenting switchgrass at decreasing residence times. Biotechnol. Biofuels. 2018;11:1–18. doi: 10.1186/s13068-018-1238-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Abbassi-Guendouz A, et al. Microbial community signature of high-solid content methanogenic ecosystems. Bioresour. Technol. 2013;133:256–262. doi: 10.1016/j.biortech.2013.01.121. [DOI] [PubMed] [Google Scholar]
  • 46.Zhu N, et al. Metagenomic and metaproteomic analyses of a corn stover-adapted microbial consortium EMSD5 reveal its taxonomic and enzymatic basis for degrading lignocellulose. Biotechnol. Biofuels. 2016;9:1–23. doi: 10.1186/s13068-015-0423-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Gharechahi J, Salekdeh GH. A metagenomic analysis of the camel rumen’s microbiome identifies the major microbes responsible for lignocellulose degradation and fermentation. Biotechnol. Biofuels. 2018;11:216. doi: 10.1186/s13068-018-1214-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Dumitrache, A. et al. Specialized activities and expression differences for Clostridium thermocellum biofilm and planktonic cells. Sci. Rep-Uk7, ARTN 43583 10.1038/srep43583 (2017). [DOI] [PMC free article] [PubMed]
  • 49.Zhang, H. et al. dbCAN2: a meta server for automated carbohydrate-active enzyme annotation. Nucleic Acids Res.46, 10.1093/nar/gky418 (2019). [DOI] [PMC free article] [PubMed]
  • 50.Reddy AP, et al. Discovery of microorganisms and enzymes involved in high-solids decomposition of rice straw using metagenomic analyses. PLoS One. 2013;8:e77985. doi: 10.1371/journal.pone.0077985. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Ostby, H., Hansen, L. D., Horn, S. J., Eijsink, V. G. H. & Varnai, A. Enzymatic processing of lignocellulosic biomass: principles, recent advances and perspectives. J. Ind. Microbiol. Biot., 10.1007/s10295-020-02301-8 (2020). [DOI] [PMC free article] [PubMed]
  • 52.Blumer-Schuette SE, et al. Thermophilic lignocellulose deconstruction. Fems Microbiol Rev. 2014;38:393–448. doi: 10.1111/1574-6976.12044. [DOI] [PubMed] [Google Scholar]
  • 53.Brumm, P. J., Gowda, K., Robb, F. T. & Mead, D. A. The complete genome sequence of Hyperthermophile Dictyoglomus turgidum DSM 6724 (TM) reveals a specialized carbohydrate fermentor. Front. Microbiol.7, ARTN 1979, 10.3339/fmicb.2016.01979 (2016). [DOI] [PMC free article] [PubMed]
  • 54.Nishida H, Beppu T, Ueda K. Whole-genome comparison clarifies close phylogenetic relationships between the phyla Dictyoglomi and Thermotogae. Genomics. 2011;98:370–375. doi: 10.1016/j.ygeno.2011.08.001. [DOI] [PubMed] [Google Scholar]
  • 55.Zou ZZ, et al. A new thermostable beta-glucosidase mined from Dictyoglomus thermophilum: properties and performance in octyl glucoside synthesis at high temperatures. Bioresour. Technol. 2012;118:425–430. doi: 10.1016/j.biortech.2012.04.040. [DOI] [PubMed] [Google Scholar]
  • 56.Dodd D, Mackie RI, Cann IK. Xylan degradation, a metabolic property shared by rumen and human colonic Bacteroidetes. Mol. Microbiol. 2011;79:292–304. doi: 10.1111/j.1365-2958.2010.07473.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Hu ZJ, Sykes R, Davis MF, Brummer EC, Ragauskas AJ. Chemical profiles of switchgrass. Bioresour. Technol. 2010;101:3253–3257. doi: 10.1016/j.biortech.2009.12.033. [DOI] [PubMed] [Google Scholar]
  • 58.Speirs LBM, Rice DTF, Petrovski S, Seviour RJ. The phylogeny, biodiversity, and ecology of the chloroflexi in activated sludge. Front Microbiol. 2019;10:2015. doi: 10.3389/fmicb.2019.02015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Biswal AK, et al. Sugar release and growth of biofuel crops are improved by downregulation of pectin biosynthesis. Nat. Biotechnol. 2018;36:249–257. doi: 10.1038/nbt.4067. [DOI] [PubMed] [Google Scholar]
  • 60.Li, F., Foucat, L. & Bonnin, E. Effect of solid loading on the behaviour of pectin-degrading enzymes. Biotechnology for Biofuels14, ARTN 107 10.1186/s13068-021-01957-3 (2021). [DOI] [PMC free article] [PubMed]
  • 61.Levasseur, A., Drula, E., Lombard, V., Coutinho, P. M. & Henrissat, B. Expansion of the enzymatic repertoire of the CAZy database to integrate auxiliary redox enzymes. Biotechnology for Biofuels6, Artn 41 10.1186/1754-6834-6-41 (2013). [DOI] [PMC free article] [PubMed]
  • 62.Qiu ZY, Fang C, He NL, Bao J. An oxidoreductase gene ZMO1116 enhances the p-benzoquinone biodegradation and chiral lactic acid fermentability of Pediococcus acidilactici. J. Biotechnol. 2020;323:231–237. doi: 10.1016/j.jbiotec.2020.08.015. [DOI] [PubMed] [Google Scholar]
  • 63.Yan, Z., Gao, X. C., Gao, Q. Q. & Bao, J. Mechanism of tolerance to the lignin-derived inhibitor p-benzoquinone and metabolic modification of biorefinery fermentation strains. Appl Environ Microb85, ARTN e01443-19 10.1128/AEM.01443-19 (2019). [DOI] [PMC free article] [PubMed]
  • 64.Jensen KA, Houtman CJ, Ryan ZC, Hammel KE. Pathways for extracellular fenton chemistry in the brown rot basidiomycete Gloeophyllum trabeum. Appl Environ. Micro. 2001;67:2705–2711. doi: 10.1128/AEM.67.6.2705-2711.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Bugg TDH, Ahmad M, Hardiman EM, Rahmanpour R. Pathways for degradation of lignin in bacteria and fungi. Nat. Prod. Rep. 2011;28:1883–1896. doi: 10.1039/c1np00042j. [DOI] [PubMed] [Google Scholar]
  • 66.Arantes V, Jellison J, Goodell B. Peculiarities of brown-rot fungi and biochemical Fenton reaction with regard to their potential as a model for bioprocessing biomass. Appl Microbiol Biot. 2012;94:323–338. doi: 10.1007/s00253-012-3954-y. [DOI] [PubMed] [Google Scholar]
  • 67.Cairo, J. P. L. F. et al. Expanding the knowledge on lignocellulolytic and redox enzymes of worker and soldier castes from the lower termite coptotermes gestroi. Front. Microbiol.7, ARTN 1518 10.3389/fmicb.2016.01518 (2016). [DOI] [PMC free article] [PubMed]
  • 68.Slesak I, Slesak H, Kruk J. Oxygen and hydrogen peroxide in the early evolution of life on earth: in silico comparative analysis of biochemical pathways. Astrobiology. 2012;12:775–784. doi: 10.1089/ast.2011.0704. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Kersten P, Cullen D. Extracellular oxidative systems of the lignin-degrading Basidiomycete Phanerochaete chrysosporium. Fungal Genet Biol. 2007;44:77–87. doi: 10.1016/j.fgb.2006.07.007. [DOI] [PubMed] [Google Scholar]
  • 70.McGivern BB, et al. Decrypting bacterial polyphenol metabolism in an anoxic wetland soil. Nat. Commun. 2021;12:2466. doi: 10.1038/s41467-021-22765-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Hagen, L. H. et al. Quantitative Metaproteomics Highlight the Metabolic Contributions of Uncultured Phylotypes in a Thermophilic Anaerobic Digester. Appl. Environ. Microbiol.83, 10.1128/AEM.01955-16 (2017). [DOI] [PMC free article] [PubMed]
  • 72.Dyksma S, Jansen L, Gallert C. Syntrophic acetate oxidation replaces acetoclastic methanogenesis during thermophilic digestion of biowaste. Microbiome. 2020;8:105. doi: 10.1186/s40168-020-00862-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Timmers PHA, et al. Metabolism and occurrence of methanogenic and sulfate-reducing syntrophic acetate oxidizing communities in haloalkaline environments. Front Microbiol. 2018;9:3039. doi: 10.3389/fmicb.2018.03039. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Lopes AM, Ferreira EX, Moreira LRS. An update on enzymatic cocktails for lignocellulose breakdown. J. Appl. Microbiol. 2018;125:632–645. doi: 10.1111/jam.13923. [DOI] [PubMed] [Google Scholar]
  • 75.Pakarinen A, Zhang J, Brock T, Maijala P, Viikari L. Enzymatic accessibility of fiber hemp is enhanced by enzymatic or chemical removal of pectin. Bioresour. Technol. 2012;107:275–281. doi: 10.1016/j.biortech.2011.12.101. [DOI] [PubMed] [Google Scholar]
  • 76.Zheng YX, et al. Semi-continuous production of high-activity pectinases by immobilized Rhizopus oryzae using tobacco wastewater as substrate and their utilization in the hydrolysis of pectin-containing lignocellulosic biomass at high solid content. Bioresour. Technol. 2017;241:1138–1144. doi: 10.1016/j.biortech.2017.06.066. [DOI] [PubMed] [Google Scholar]
  • 77.Wang JH, et al. Efficient saccharification of agave biomass using Aspergillus niger produced low-cost enzyme cocktail with hyperactive pectinase activity. Bioresour. Technol. 2019;272:26–33. doi: 10.1016/j.biortech.2018.09.069. [DOI] [PubMed] [Google Scholar]
  • 78.Gruno M, Valjamae P, Pettersson G, Johansson G. Inhibition of the Trichoderma reesei cellulases by cellobiose is strongly dependent on the nature of the substrate. Biotechnol. Bioeng. 2004;86:503–511. doi: 10.1002/bit.10838. [DOI] [PubMed] [Google Scholar]
  • 79.Halliwell G, Griffin M. Nature and mode of action of cellulolytic component C1 of Trichoderma-Koningii on native cellulose. Biochem J. 1973;135:587–594. doi: 10.1042/bj1350587. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Chen M, et al. Strategies to reduce end-product inhibition in family 48 glycoside hydrolases. Proteins. 2016;84:295–304. doi: 10.1002/prot.24965. [DOI] [PubMed] [Google Scholar]
  • 81.Kumar R, Wyman CE. Strong cellulase inhibition by Mannan polysaccharides in cellulose conversion to sugars. Biotechnol. Bioeng. 2014;111:1341–1353. doi: 10.1002/bit.25218. [DOI] [PubMed] [Google Scholar]
  • 82.Qing Q, Yang B, Wyman CE. Xylooligomers are strong inhibitors of cellulose hydrolysis by enzymes. Bioresour. Technol. 2010;101:9624–9630. doi: 10.1016/j.biortech.2010.06.137. [DOI] [PubMed] [Google Scholar]
  • 83.Chung, D. et al. Deletion of a gene cluster encoding pectin degrading enzymes in Caldicellulosiruptor bescii reveals an important role for pectin in plant biomass recalcitrance. Biotechnology for Biofuels7, ARTN 147 10.1186/s13068-014-0147-1 (2014). [DOI] [PMC free article] [PubMed]
  • 84.Xiao, C. W. & Anderson, C. T. Roles of pectin in biomass yield and processing for biofuels. Front Plant Sci4, ARTN 67 10.3389/fpls.2013.00067 (2013). [DOI] [PMC free article] [PubMed]
  • 85.Qin, L. et al. Inhibition of lignin-derived phenolic compounds to cellulase. Biotechnology for Biofuels9, ARTN 70 10.1186/s13068-016-0485-2 (2016). [DOI] [PMC free article] [PubMed]
  • 86.Li X, et al. Inhibitory effects of lignin on enzymatic hydrolysis: the role of lignin chemistry and molecular weight. Renew. Energ. 2018;123:664–674. doi: 10.1016/j.renene.2018.02.079. [DOI] [Google Scholar]
  • 87.Rahikainen JL, et al. Inhibitory effect of lignin during cellulose bioconversion: The effect of lignin chemistry on non-productive enzyme adsorption. Bioresour. Technol. 2013;133:270–278. doi: 10.1016/j.biortech.2013.01.075. [DOI] [PubMed] [Google Scholar]
  • 88.Kubis, M. R., Holwerda, E. K. & Lynd, L. R. Declining carbohydrate solubilization with increasing solids loading during fermentation of cellulosic feedstocks by Clostridium thermocellum: documentation and diagnostic tests. Biotechnol Biof Biop15, ARTN 12 10.1186/s13068-022-02110-4 (2022). [DOI] [PMC free article] [PubMed]
  • 89.Lovley DR, Greening RC, Ferry JG. Rapidly growing rumen methanogenic organism that synthesizes coenzyme M and has a high affinity for formate. Appl Environ. Microbiol. 1984;48:81–87. doi: 10.1128/aem.48.1.81-87.1984. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Saeman JF, Bubl JL, Harris EE. Quantitative Saccharification of Wood and Cellulose. Ind. Eng. Chem. Anal. Ed. 1945;17:35–37. doi: 10.1021/i560137a008. [DOI] [Google Scholar]
  • 91.Sluiter, A. & National Renewable Energy Laboratory (U.S.). Determination of structural carbohydrates and lignin in biomass: laboratory analytical procedure (LAP): issue date, 4/25/2008, https://purl.fdlp.gov/GPO/LPS94089.
  • 92.Walker, C., Ryu, S., Giannone, R. J., Garcia, S. & Trinh, C. T. Understanding and Eliminating the Detrimental Effect of Thiamine Deficiency on the Oleaginous Yeast Yarrowia lipolytica. Appl. Environ. Microbiol.86, 10.1128/AEM.02299-19 (2020). [DOI] [PMC free article] [PubMed]
  • 93.Kanehisa M, Sato Y, Morishima K. BlastKOALA and GhostKOALA: KEGG Tools for Functional Characterization of Genome and Metagenome Sequences. J. Mol. Biol. 2016;428:726–731. doi: 10.1016/j.jmb.2015.11.006. [DOI] [PubMed] [Google Scholar]
  • 94.Huerta-Cepas J, et al. eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. Nucleic Acids Res. 2019;47:D309–D314. doi: 10.1093/nar/gky1085. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95.Batth TS, et al. Protein aggregation capture on microparticles enables multipurpose proteomics sample preparation. Mol. Cell Proteom. 2019;18:1027–1035. doi: 10.1074/mcp.TIR118.001270. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Peer Review File (390.6KB, pdf)
41467_2022_31433_MOESM3_ESM.pdf (26.6KB, pdf)

Description Additional Supplementary Files

Supplementary Data 1 (9.2MB, xlsx)
Supplementary Data 2 (1.4MB, xlsx)
Supplementary Data 3 (8.5MB, xlsx)
Supplementary Data 4 (6MB, xlsx)
Supplementary Data 5 (153.9KB, xlsx)
Supplementary Data 6 (789.5KB, xlsx)
Supplementary Data 7 (15.8KB, xlsx)
Supplementary Data 8 (10.5KB, xlsx)
Supplementary Data 9 (47.4KB, xlsx)
Reporting Summary (456.6KB, pdf)

Data Availability Statement

The data on solubilization and microbiome performance generated in this study is available in the supplementary information files. All proteomics raw mass spectra used for protein quantification in this study are available at the ProteomeXchange Consortium via the MassIVE repository (MassIVE accession: MSV000088319 [https://massive.ucsd.edu/ProteoSAFe/dataset.jsp?task=05b15f47bc0145759b12f5da310d3a6a]; ProteomeXchange accession: PXD029582). All proteome abundance data along with the mapped annotations are available as Supplementary Data files. Source data are provided with this paper.


Articles from Nature Communications are provided here courtesy of Nature Publishing Group

RESOURCES