Abstract
The rumen is a complex ecosystem. It is the primary site for microbial fermentation of ingested feed allowing conversion of a low nutritional feed source into high quality meat and milk products. However, digestive inefficiencies lead to production of high amounts of environmental pollutants; methane and nitrogenous waste. These inefficiencies could be overcome by development of forages which better match the requirements of the rumen microbial population. Although challenging, the application of meta-proteomics has potential for a more complete understanding of the rumen ecosystem than sequencing approaches alone. Here, we have implemented a meta-proteomic approach to determine the association between taxonomies of microbial sources of the most abundant proteins in the rumens of forage-fed dairy cows, with taxonomic abundances typical of those previously described by metagenomics. Reproducible proteome profiles were generated from rumen samples. The most highly abundant taxonomic phyla in the proteome were Bacteriodetes, Firmicutes and Proteobacteria, which corresponded with the most abundant taxonomic phyla determined from 16S rRNA studies. Meta-proteome data indicated differentiation between metabolic pathways of the most abundant phyla, which is in agreement with the concept of diversified niches within the rumen microbiota.
Introduction
The rumen plays host to a complex microbiome consisting of over seven thousand species from protozoa, archaea, bacteria and fungi and is the primary site for microbial fermentation of ingested feed in ruminants1. This symbiotic relationship allows the host to utilise the nutrients in otherwise indigestible lignocellulose plant material2. However, inefficiencies in this system result in lost energy and negative impacts including production of greenhouse gases and leaching of nitrogenous pollutants into the environment3. Opportunities exist to mitigate these impacts and increase efficiency by modification of rumen function via improved feeds, however this requires a greater understanding of the rumen microbiome at a systems-level4.
To date culturing techniques and more recently ‘omics technologies have been directed towards enhancing knowledge of the diversity, structure and function of the rumen microbial community4–10. Metagenomic sequencing data is frequently used to assess the functional potential of the rumen microbiome and recently meta-transcriptomics has increasingly been used to predict function11. However, translation does not always directly follow gene expression patterns and so functionality needs to be confirmed at the protein level and through understanding the effect of post-translational modifications. Proteome evaluation is not a trivial task in complex environments such as the rumen and thus the information from the rumen meta-proteome is much poorer than that generated from meta-genomic and meta-transcriptomic analyses.
Meta-proteomics has been described as the entire protein complement of the microbiota in an environment at a given point in time12. Aqueous, terrestrial and eukaryotic (humans, mice, termites and plants) ecosystems have been the focus of most meta-proteomic analyses to date13. In complex samples, such as the rumen, proteomic analyses are incredibly challenging14, not least because of the presence of interfering compounds such as humic substances, which are difficult to remove from the sample and can bind to proteins and cause structural modification15–17. If these issues could be solved, analysis of the meta-proteome of the rumen microbial community would reveal details about microbial community activity, structure, function and metabolic pathway transformations that are presently lacking.
The aims of this study were (1) To use a meta-proteomic approach to analyse the rumen system to determine if the most abundant proteins present matched those predicted by meta-transcriptomic data. (2) Use this information to identify the most translationally active organisms in the rumen microbiome. This information will aid in the further understanding of the active microbial metabolic pathways in the rumen of cows fed fresh forage. Previously, Snelling and Wallace14 reported inconsistent proteomic gel images from rumen samples, which limited interpretation. The methodology described here produced reproducible proteome profiles of a quality suitable for more detailed investigations. By comparison of our data with that derived from 16S rRNA profiling from comparable data sets10, it was possible to compare the taxonomic abundances derived from protein family identification to rumen microbe transcription profiles18, to enable the exploration of the abundance/functional relationship underpinning rumen resilience19.
Results
Reproducibility of the rumen metaproteome
Preliminary investigations showed that previously published methodologies for extraction of protein for proteomic analysis such as TCA/acetone precipitation20,21, the direct use of detergents22–24, phenol/methanol extraction25,26 or a combination of methods27 did not generate the purity required for proteomic analysis of protein extracts from rumen samples (data not shown). In contrast, an extraction methodology involving serial washing using a 0.9% NaCl solution (described by Wilmes and Bond12,28 as being effective in other systems in removing large quantities of interfering substances from the sample), a separation step using a 40% Percoll solution (Supplementary Fig. S1) and subsequent urea/detergent based protein extraction (as described in Materials and Methods) enabled multiple, distinct bands to be distinguished in rumen samples from cows by one dimensional SDS-PAGE (Fig. 1).
The SDS-PAGE protein profiles (Fig. 1) produced for each cow replicate (n = 3) were subjected to MS/MS analysis after the entire lane was divided into sections as indicated in Fig. 1 so that the proteins contained in the entire lane were sequenced. These sections were uneven to account for differential band density and so maximise detection of low abundance proteins from poorly staining regions of the gel. Nevertheless it was noted that most information was derived from those gel sections containing recognisable bands. Database interrogation and peptide sequence filtering was undertaken as per parameters outlined in Materials and Methods. A total of 167,506 putative peptides were generated across all the samples of which 93, 957 (56%) had a match to the non-redundant NCBI database (all replicates combined). Individually, this was comprised of 32,845 (55% total peptides), 28,632 (56% total peptides) 32,480 (55% total peptides) peptides from cows 1, 2 and 3 respectively. Protein identifications and taxonomic classifications were similar between cow replicates, showing reproducibility of the methodology for sample preparation. Over 80% of protein families detected were common to all three cow replicates with only 10% unique to each cow. Less than 10% of protein families were present in only two out of the three cow replicates (Fig. 2). Differences in protein diversity between cow replicates were very low at phylum, order and family level as determined from Shannon and Simpson diversity comparisons (Fig. 2).
Distribution of abundant organisms
No significant differences were observed in the overall abundance and distribution of taxonomic profiles between replicates (Fig. 2). Within the rumen metaproteome the abundance of bacterial proteins was found to be significantly greater than that of fungal, plant or protozoal proteins in the samples assessed using this methodology. Of the taxa determined by Unipept analysis29, bacteria represented 77% of all taxa with the phylum Bacteroidetes as the most abundant (49% abundance) followed by Firmicutes (12% abundance) and Proteobacteria (4% abundance; Fig. 3). To determine whether these differences were robust to the database analysis performed, analyses were also performed using Metaproteome analyser30. This analysis confirmed these three bacterial phyla as the most abundant, although there were some slight differences in relative percentage abundances as compared with analysis by UniPept with Bacteroidetes, Firmicutes and Proteobacteria comprising 67%, 16% and 5% of taxonomic abundances respectively.
To determine if the data obtained from this metaproteomic study could be identified in organisms from the rumen, a comparison of our rumen metaproteome was made against the Hungate 1000 genome database (of which 406 genomes were available)10. We identified hits within 346 of the Hungate genomes. To identify if the taxonomic profiles identified from the metaproteome were representative of those known from the rumen, we also carried out a comparison with previously published 16S rRNA datasets10. This demonstrated 88% (cows 1 and 3) and 75% (cow 2) of the phyla identified from the metaproteome were known to exist in the 16S rRNA datasets. When comparing further taxonomic rankings of the metaproteome data to the 16S rRNA datasets10, 76%, 52% and 61% (cows 1, 2 and 3 respectively) matched at the order level and at family level matches were 67% for cow 1, 48% for cow 2 and 61% for cow 3 showing a tendency towards a slight decrease in the taxonomic correlations as taxonomy becomes more specific.
Functional characterisation of the rumen meta-proteome
Protein families were categorised into functional components per biological processes, molecular functions and cellular location. Gene ontology analysis showed that most of the highly abundant proteins were categorised into ten biological processes which varied little amongst cow replicates. Glycolysis, electron transport and carbohydrate metabolism were predominant pathways with major molecular functions relating to oxidoreductase, isomerase, kinase and binding activities identified (Table 1). The subcellular locations of the top 25 protein families were determined to be present in the cytoplasm for 60% of the highly abundant proteins with the remainder determined at 16% for membrane proteins, 20% and 4% for cytosolic and nucleus bound proteins respectively (Supplementary Data Table 1). At the phylum level the distribution of the top 25 protein family summary functions revealed that Bacteriodetes were most highly represented for nearly all the major protein functions determined, apart from protein folding (Fig. 4). Mapping the 25 most abundant protein families from each cow replicate (Supplementary Data Table 1 and Fig. 5) using iPATH (interactive pathways explorer 231,32) from KEGG identifiers33 revealed that many pathways from all 3 cows were related to carbohydrate metabolism, nucleotide metabolism, pyruvate metabolism, amino acid metabolism, glycine, serine and threonine metabolism, oxidative phosphorylation and in the biosynthesis of other secondary metabolites (Fig. 5). Analysis of KEGG pathways information showed over 40% of pathways identified were shared between all cows (Supplementary Data Fig. 2), with less than 30% of pathways unique to each cow. Pathway mapping was also carried out on the top most abundant phyla (65%) observed from this study (Fig. 4), showing mapping differences between Bacteriodetes, Proteobacteria, Firmicutes (Fig. 6). The main pathways highlighted for Bacteriodetes were involved in those designated as nucleotide metabolism, carbon metabolism, carbohydrate and lipid metabolism and lipoic acid metabolism, which could be further broken down into fatty acid metabolism, valine, leucine and isoleucine degradation, glycolysis, gluconeogenesis, TCA cycle, and pyruvate metabolism. Pathways identified in Firmicutes were nucleotide, carbon metabolism and carbohydrate and lipid metabolism, which were further divided into purine metabolism, pyrimidine metabolism, glycolysis, gluconeogenesis, pentose phosphate pathways, fructose and mannose metabolism, terpenoid synthesis and nitrogen metabolism. Proteobacteria were involved in nucleotide metabolism, carbon metabolism and carbohydrate and lipid metabolism, which could be further subdivided into pyrimidine metabolism, glycine, serine and threonine metabolism, methane metabolism and nitrogen metabolism.
Table 1.
Cow 1 | Cow 2 | Cow 3 | ||||||
---|---|---|---|---|---|---|---|---|
Top 25 Protein Families | Summary Function | Percentage abundance | Top 25 Protein Families | Summary Function | Percentage abundance | Top 25 Protein Families | Summary Function | Percentage abundance |
elongation factor Tu | Biosynthesis | 44.70 | Serum albumin | Transport | 25.75 | elongation factor Tu | Biosynthesis | 31.82 |
Serum albumin | Transport | 10.76 | glyceraldehyde-3-phosphate dehydrogenase | Glycolysis | 18.01 | glyceraldehyde-3-phosphate dehydrogenase | Glycolysis | 11.61 |
pyruvate, phosphate dikinase | Transport | 7.96 | pyruvate, phosphate dikinase | Transport | 9.33 | 50S ribosomal protein | Translation | 11.35 |
glutamate dehydrogenase | Biosynthesis | 7.42 | elongation factor Tu | Biosynthesis | 7.05 | 30S ribosomal protein S1 | Biosynthesis | 8.53 |
phosphoglycerate kinase | Glycolysis | 4.47 | 50S ribosomal protein | Translation | 6.88 | glutamate dehydrogenase | Biosynthesis | 7.29 |
50S ribosomal protein | Translation | 4.44 | 30S ribosomal protein S1 | Biosynthesis | 3.14 | hypothetical protein | Metabolism | 4.09 |
glyceraldehyde-3-phosphate dehydrogenase | Glycolysis | 3.57 | triose-phosphate isomerase | Gluconeogenesis | 2.71 | triose-phosphate isomerase | Gluconeogenesis | 2.35 |
phosphoenolpyruvate synthase | Metabolism | 2.90 | thioredoxin | Protein Folding | 2.06 | thioredoxin | Protein Folding | 2.09 |
30S ribosomal protein S1 | Biosynthesis | 2.62 | succinate dehydrogenase | Metabolism | 1.93 | phosphoglycerate kinase | Glycolysis | 1.83 |
ATP synthase subunit beta | Transport | 1.72 | nitrogen-fixing protein NifU | Nitrogen fixation | 1.72 | succinate dehydrogenase | Metablolism | 1.77 |
thioredoxin | Protein Folding | 1.23 | hypothetical protein | Metabolism | 1.63 | nitrogen-fixing protein NifU | Nitrogen fixation | 1.47 |
hypothetical protein | Metabolism | 1.10 | transketolase | Metabolism | 1.55 | pyruvate, phosphate dikinase | Transport | 1.41 |
molecular chaperone DnaK | Binding | 1.00 | fructose-bisphosphate aldolase | Glycolysis | 1.33 | transketolase | Metabolism | 1.18 |
fucose isomerase | Carbohydrate degradation | 0.77 | DNA-directed RNA polymerase, alpha subunit | Binding | 1.25 | molecular chaperone DnaK | Binding | 1.05 |
rubrerythrin | Binding | 0.69 | phosphoenolpyruvate carboxykinase (ATP) | Gluconeogenesis | 1.25 | phosphoenolpyruvate carboxykinase (ATP) | Gluconeogenesis | 1.05 |
succinate dehydrogenase | Metabolism | 0.69 | pyruvate ferredoxin (flavodoxin) oxidoreductase | Transport | 1.20 | ribosomal proteins | Biosynthesis | 0.98 |
DNA-directed RNA polymerase subunit beta | Binding | 0.56 | racemase | Metabolism | 1.20 | pyruvate ferredoxin (flavodoxin) oxidoreductase | Transport | 0.95 |
ribosomal proteins | Biosynthesis | 0.44 | molecular chaperone DnaK | Binding | 1.03 | DNA-directed RNA polymerase, alpha subunit | Binding | 0.69 |
nitrogen-fixing protein NifU | Nitrogen fixation | 0.28 | ATP synthase subunit beta | Transport | 0.95 | fucose isomerase | Carbohydrate degradation | 0.69 |
diphosphate–fructose-6-phosphate 1-phosphotransferase | Glycolysis | 0.26 | pyruvate synthase | Transport | 0.90 | rubrerythrin | Binding | 0.65 |
phosphoglucomutase | Metabolism | 0.26 | phosphoglucomutase | Glycolysis | 0.77 | methylmalonyl-CoA mutase | Metabolism | 0.49 |
racemase | Metabolism | 0.23 | ribosomal proteins | Biosynthesis | 0.77 | energy transducer TonB | Transport | 0.46 |
transketolase | Metabolism | 0.23 | rubrerythrin | Binding | 0.77 | phosphoglucomutase | Glycolysis | 0.43 |
pyruvate ferredoxin (flavodoxin) oxidoreductase | Transport | 0.21 | fucose isomerase | Carbohydrate degradation | 0.60 | fructose-bisphosphate aldolase | Glycolysis | 0.39 |
transcription termination factor NusA | Binding | 0.21 | serine hydroxymethyltransferase | Metabolism | 0.52 | pyruvate synthase | Transport | 0.39 |
Comparison of meta-proteome analyses to meta-transcript abundance
A comparison was conducted to determine whether the most highly abundant peptides expressed from proteomic analyses correlated with the most highly expressed transcripts from meta-transcriptomic datasets from the rumen using an extremely large rumen meta-transcriptome data set18. Although 80% of the transcriptome from Shi et al.18 aligned with the metagenome assembly10, the number of genes expressed was small compared to the entirety of the genes represented in the metagenome (0.89% of the metagenome). Of those expressed genes, there were matches with 71% of the metaproteome. The most abundant protein families present in the metaproteomic data and their percentage distribution amongst matched contigs in the transcriptome data set are shown in Fig. 7, where elongation factor Tu, glyceraldehyde-3-phosphate dehydrogenase and phosphoglycerate kinase were the most abundant matches to the transcriptomic data.
Discussion
The rumen microbial consortium is composed of thousands of species of bacteria, methanogenic archaea, protozoa, fungi and viruses with bacteria being the most abundant micro-organisms in the rumen reaching up to 1011 cells/ml rumen fluid34. The extensive microbial diversity of the rumen9,35 confers resilience to the host and the ability to adapt to different dietary materials36. However, animal-to-animal variation in rumen bacterial communities can impact the response of individuals to changes in diet37. Indeed, it is thought that diet is one of the major drivers of taxonomic composition in the rumen microbiome37–39. Understanding how changes in microbial diversity affect the function of the rumen or enable sustained feed utilisation is a current challenge for ruminant science.
Predictions based on gene sequences from culture-independent and metagenomic techniques have shown that there is considerable redundancy in the rumen ecosystem40. In particular, the potential for expression of key enzymes by multiple species makes determining the relationship between microbial abundance and functionality far from straightforward. In this work, we used a meta-proteomic approach to assess the relationship between the abundance of rumen proteins, their source organisms and typical patterns of gene expression from meta-transcriptomic data.
Brulc et al.37 determined that the distribution of rumen phylotypes falls predominantly into three main categories; Bacteriodetes, Firmicutes and Proteobacteria, regardless of the microbiome analysed. Within this, Henderson et al.39, considered there to be a “core bacterial microbiome”, in which Bacteriodetes and Firmicutes predominate. Supporting this, the average composition of the rumen bacterial community from metagenomic data was found to be 50% Bacteriodetes, 43% Firmicutes and 5.4% Proteobacteria41. By reference to published metagenomes, it was possible to similarly characterise the trend of protein diversity of the rumen by their predicted species of origin. We found that the metaproteome was in agreement with the taxonomic profiles from metagenomic and 16S rRNA profiles10,41 with Bacteriodetes, Firmicutes and Proteobacteria representing most highly abundant phyla. The protein extraction protocol developed here is effective and reproducible for extraction of bacterial proteins from rumen samples. Finally, we showed high replicability between the cows sampled both taxonomically and in protein sequence abundance.
The rumen microbiota gain energy from glucose units derived from the degradation of fibre which is used to drive microbial protein synthesis and proliferation. Fibre degradation is considered to be due to the activity of a consortium of fibrolytic bacteria42. Once thought to be abundant in the rumen, Fibrobacter succinogens, typically only represents 1–3% of total rumen bacteria9,42,43 and has been found to be missing entirely in up to half of samples analysed in some studies41. Similarly, relatively few peptides in our data could be identified as originating from F. succinogenes and the functions of those detected were primarily involved in protein biosynthesis. The lack of fibrolytic activity in these samples is surprising. Firstly this could be a result of lack of translation. For example glycohydrolase activity has been shown to be an inducible part of colonisation44 and for these studies rumen fluid was collected from animals which had been fasting overnight. Greater presence of glycohydrolases would be predicted from samples taken during feeding. Secondly, we have described here improvements to the protocol for extraction of proteins from rumen bacteria. It is recognised that detection of cell-free extracellular proteins of bacterial origin (which could include fibrolytic activity) will require further modification to separate proteins from those contaminants (eg humic acids) which are abundant in the rumen liquor.
While functional analysis of the rumen meta-proteome identified proteins involved in diverse metabolic processes, unsurprisingly they originated from only a few bacterial phyla representing those most abundant in the rumen. The most abundant protein functions observed were associated with ontologies representing core metabolic pathways such as, glycolysis, gluconeogenesis, protein biosynthesis and electron transport (Table 1). While these activities were identified in more than one phylum (Fig. 4), the largest proportion were from Bacteroidetes suggesting a central role of this phylum in microbial metabolism and reflecting phylogenetic dominance. Snelling and Wallace14 also found that more proteins belonged to Bacteroidetes than Firmicutes or Proteobacteria.
The most highly abundant protein family found belonged to elongation factor Tu, which accounts for 5 to 10% of total cell protein45. These proteins, along with 30S and 50S ribosomal proteins are important in protein biosynthesis, where the GTP-dependent binding of aminoacyl-tRNA to the A-site of ribosomes is promoted. Glycolysis and the citric acid cycle provide sources of energy within the rumen46. In this study, multiple components of the glycolytic pathway (glucose to pyruvate conversion) were detected within the rumen metaproteome with glyceraldehyde-3-phosphate dehydrogenase found to be the most abundant protein contributing to this biological process in the rumen all cows. This enzyme is involved in the first step in the glycolytic pathway catalysing the reversible oxidative phosphorylation of D-glyceraldehyde-3-phosphate to 1,3-bisphospho-D-glycerate in the presence of NAD+ and phosphate. The most common phyla contributing to this process was the Firmicutes, with the families Ruminococcaeae and Lachnospiracaea dominating. However, other contributing phyla included Bacteriodetes, Proteobacteria, Spirochetes and Fibrobacteres. Given its core role, proteins associated with glycolysis are likely to be present in all phyla. Therefore we consider that the differential detection most likely relates to differential abundance of the phyla within the rumen microbiome or to a lack of annotated sequence data. In the second step of the glycolysis pathway dihydroxyacetone phosphate (DHAP) is converted to D-glyceraldehyde-3-phosphate and phosphoglycerate kinase by the triosphosphate isomerase, which was found to be highly abundant in the metaproteome. The same was observed with fructose bisphosphate aldolase involved in step 4 of the glycolytic pathway, showing an abundance of proteins relating to glycolysis distributed within the phylum Bacteriodetes, Firmicutes and Proteobacteria. The detection of multiple proteins from the same pathway increases the confidence that a particular pathway is represented in vivo.
Electron transport featured highly in the top 25 most highly abundant protein families from the metaproteome. This included ATP synthase, thioredoxin, rubrythrin and pyruvate synthase, the latter of which is required for the transfer of electrons from pyruvate to ferredoxin for the generation of VFA by the rumen microbiome. Pyruvate can be the central intermediary metabolite in rumen metabolism and is the branch point where pathways diverge to form various fermentation products including the major VFAs namely acetate, propionate and butyrate1. Not unsurprisingly pyruvate: ferredoxin (flavodoxin) oxidoreductase was also amongst the most highly abundant proteins identified. This is linked to the pyruvate pathway and required by intestinal anaerobes for the oxidation of NADH and the production of VFA that are absorbed into the animal’s bloodstream and used for energy generation.
Phosphoenolpyruvate carboxykinase was detected in the rumen metaproteome of two of the three cows. These genes have been reported to have a relatively high expression in cow rumen datasets47 and is thought to be one of the enzymes involved in the fermentation of cellulose to succinate. Phosphoenolpyruvate carboxykinase catalyses the conversion of oxaloacetate (OAA) to phosphoenolpyruvate (PEP) through direct phosphoryl transfer between the nucleoside triphosphate and OAA, as part of the gluconeogenesis pathway. However, this reaction is reversible and may operate in the direction of pyruvate formation and so may contribute to availability of substrates for fermentation. The catalysis of the phosphorylation of pyruvate is driven by pyruvate phosphate dikinase. In this study, the peptides associated with this central function were determined to belong to the phylum Firmicutes, particularly from the families Lachnospiracaea and Ruminococcaea. As discussed above for glycolysis, the lack of ubiquitous detection most likely relates to differential abundance of the phyla within the rumen microbiome or to a lack of sufficient annotated sequence data. Succinate dehydrogenase was found present in all three cows. This enzyme catalyses the synthesis of fumarate from succinate, which is required for both aerobic and anaerobic growth. While the succinate pathway, acrylate pathway and the propanediol pathway all produce propionate, the succinate pathway is the one most commonly used by bacteria48. Bacteriodetes was found to be the most prevalent phylum representative of this pathway in this study, supporting previous observations of this group utilising the succinate pathway via methylmalonyl CoA49.
Glutamate dehydrogenase was amongst the most abundant protein functions identified from our dataset, dominated by representatives from the Bacteriodetes family Prevotellacea. Glutamate dehydrogenase is important for the catalysis of α-ketogluterate to glutamate and in enabling assimilation of ammonia within the rumen and microbial protein synthesis50. The efficiency by which nitrogen is captured is important for the creation of readily available and utilisable sources of energy for microbial protein synthesis51. Nitrogen fixing species within the rumen have previously been described by Hungate2 using traditional culture techniques. Although the reduction of gaseous nitrogen to ammonia occurs in lower abundances than some other processes, our meta-proteomic analysis showed an abundance of nitrogen fixing proteins in the rumen.
In conclusion, the most abundant proteins identified from this meta-proteomics based study were determined as those involved in known major metabolic pathways within the rumen. While these were distributed across several major phyla, the activity of the Bacteriodetes dominated the metaproteome, within which the family Prevotellaecae was most abundant. This is unsurprising as Prevotella are typically associated with microbial proteolytic activity in the rumen52 and recent analyses indicate a role in early colonisation associated with fibre degradation11,44,53. The functional analysis of the roles of bacterial phyla within the rumen have demonstrated the potential for diversified niches and although some overlap has been determined, differentiation between metabolic pathways can be observed. This correlates with the investigation by Rubino et al.11 where significant differences were observed in key metabolic processes, maintaining niche specialisation.
This study has highlighted that while in general the most abundant proteins of the rumen appear to be associated with the most highly expressed genes from published rumen microbiome data, the relationship between individual proteins and relative transcript abundance is not straightforward. Although the transcriptomic data is informative there is still a potential mismatch with translated data. Therefore, the continued development of methods which provide greater sensitivity, detailed annotation and a more in depth analysis from the meta-proteome have the potential to reveal wider distribution of functionality amongst the rumen meta-proteome, enabling greater understanding of the resilience: redundancy balance within this ecosystem.
Materials and Methods
Collection of rumen fluid
Rumen fluid was obtained from three fistulated non-lactating Holstein-Friesian dairy cows from the same herd. All experimentation using animals was conducted under the Animals in Experimentation Act 1986 (UK Home Office Licence number 40-3579) under the authority of EU directive 2010/63/EU. The cows were under the same feeding regime where all had free access to a perennial ryegrass pasture diet prior to commencement of the experiment. Samples were collected after overnight fasting. A rumen sample was taken from each animal and up to one litre of rumen fluid was squeezed through gauze to filter out fibrous material. Each sample was maintained at 39 °C until sample preparation and for no longer than 2 hours.
Protein extraction and visualisation
Processed rumen fluid from each animal was split into 2 × 500 ml aliquots and centrifuged at 4 °C for 30 minutes at 8000 × g to remove fibrous matter. The supernatant was aspirated from the pelleted material before the supernatant was centrifuged at 100,000 × g for 30 minutes at 4 °C. The resulting pellet was washed×3 with 0.9% NaCl solution12 and then further separated from potentially interfering contaminants such as humic substances using 40% Percoll solution (Sigma) layered underneath the pellet and centrifuged at 10, 000 × g (with slow acceleration and deceleration) for 30 min at 4 °C. The pellet was subsequently re-suspended in 0.2% PBS solution to wash the pellet and then centrifuged for 10 min at 12 000 × g at 4 °C. The supernatant was removed and the resulting pellet was resuspended in 5 ml of lysis buffer (7 M urea, 2 M thiourea, 4% (w/v) CHAPS, 10 mM Tris, 1 mM EDTA, 50 mM dithiothreitol (DTT), protease inhibitors (1 tablet/10mls, Roche, UK) and sonicated on ice for 8 × 30 second busts. The samples were then precipitated with 20% TCA, 1% phosphotungstic acid, 0.2% DTT in acetone, overnight at −20 °C followed by x3 30 minute washes at −20 °C in acetone containing 0.2% DTT. The resulting pellets were air dried and re-suspended in rehydration buffer (8 M urea, 2 M thiourea, 4% CHAPS, 13 mM DTT, for 2 hours at room temperature. Samples were then centrifuged for 45 min at 13, 000 × g. Quantification of protein content in prepared samples was conducted by the method of Bradford54. Proteins were separated on a one-dimensional SDS-PAGE gel (12.5%T, 3.3%C) and bands were visualised using Coomassie blue staining.
Protein identification and database searching
Bands were manually excised and subjected to an in-gel tryptic digest according to the method of Shevchenko et al.55, with slight modification56. In brief, sequencing grade modified trypsin (Promega, UK) was diluted to 6.25 ng/µl in 25 mM NH4HCO3 and incubated at 37 °C overnight. Digested protein samples were resuspended in 20 µl 0.1% formic acid for LC MS/MS analysis on an Agilent 6550 iFunnel Q-TOF mass spectrometer with a Dual AJS ESI source coupled to an 1290 series HPLC system (Agilent, Cheshire, UK). A 2.1 × 50 mm 1.8 micron Zorbax Eclipse Plus C18 column was used. 10 µL of sample was injected for analysis. Liquid chromatography was performed at a flow of 0.1 mL/minute with a piece-linear gradient using water with 0.1% v/v formic acid (A) and acetonitrile with 0.1% v/v formic acid (B)(0 to 3% B over 2 minutes, 3 to 40% B over 7 minutes, 40 to 100% B over 1 minutes, hold at 100% B for 1 minute). Ions were generated using a Dual AJS ESI source. Tandem mass spectrometry was performed in AutoMS/MS mode in the 300 to 1700 range, at a rate of 0.6 spectra per second, performing MS2 on the 5 most intense ions in the precursor scan. Masses were excluded for 0.1 min after MS2 was performed. Reference mass locking was used for internal calibration using the mass of 922.009798 Da. Resulting ESI MS/MS spectra was analysed using Mass Hunter Qualitative Analysis software (Agilent, U.K), which was then processed using an in-house MASCOT server for peptide identification with the significance threshold set at 0.05. Data base searching was conducted against the NCBI database with a limit of 2 missed cleavages with the variable modification set as the oxidation of methionine and the fixed modification as carbamidomethylation of cysteine.
Comparison of rumen meta-proteome with published genomic datasets of the rumen microbiome
The peptides generated from the rumen meta-proteome were compared to the meta-analysis and comparative study of several rumen 16S rRNA gene based surveys, which examined the culturable fraction of the rumen microbiome10. Here we compared the taxonomic diversity observed from the rumen meta-proteome to taxonomic counts obtained from the 16S data. In addition, meta-proteome data was also searched against highly expressed genes according to ranking within a large recently published gene expression rumen microbiome dataset18, which consists of 20 rumen samples obtained from members of the same herd. Each sample was sequenced to produce 945 Gb of meta-genomic data and 120 Gb of meta-transcriptomic data. The meta-genomic assembly was produced using Spherical (http://github.com/thh32/Spherical) with Velvet (1.2.10) as the assembler, kmer of 51, subset size of 50 Gb and 6 iterations and aligned to the meta-genomic assembly with Bowtie2 using default settings. For each aligned sample, Samtools idxstats was used, with default settings, to identify the number of reads aligning to each contig in the metagenome assembly and an FPKM value was then calculated per contig for each sample. Abundances from the meta-proteome dataset were then compared to those matching from the meta-transcriptome dataset.
To determine meta-proteome matches to the Hungate 1000 genomic database, the Hungate 1000 genomes (which at point of analysis in 2017 contained the genomes of 406 rumen bacterial accessions) were used as a database against which the predicted peptides from the metaproteome were aligned using DIAMOND (v0.7.9). A minimum bit score of 40 was used as a cut-off for homologous matches. The output was then filtered by overlap using MGKIT to prevent multiple proteins hitting the same genomic region.
Data availability
The datasets analysed during the study are available from the corresponding author on reasonable request.
Electronic supplementary material
Acknowledgements
This work was supported by the BBSRC (BBS/E/W/10964A01).
Author Contributions
E.H.H. performed proteomic experiments and analysed the meta-proteomic data. C.J.C. compared meta-proteomic and meta-genomic data. T.H. performed the comparative analysis of meta-proteomic data to the Shi et al. (2014) data set. A.K.-S. conceived the experiment. E.H.H. and A.K.-S. drafted the manuscript. All authors were involved in the review and finalisation of the manuscript.
Competing Interests
The authors declare no competing interests.
Footnotes
Electronic supplementary material
Supplementary information accompanies this paper at 10.1038/s41598-018-28827-7.
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Hobson, P. S. & Stewart, C. S. (eds) The rumen microbial ecosystem. Second edition. (Chapman and Hall, 1997).
- 2.Hungate, R. E. The rumen and its microbes ISBN: 978-1-4832-3308-6 (Academic press inc, New York, London, 1966).
- 3.Theodorou MK, Kingston Smith A, Abberton M. Improvement of forages to increase the efficiency of nutrient and energy use in temperate pastoral livestock systems. Archiva Zootechnica. 2008;11(4):5–20. [Google Scholar]
- 4.Kingston-Smith AH, Davies TE, Stevens PR, Mur LAJ. Comparative metabolite fingerprinting of the rumen system during colonization of three forage grass (Lolium perenne. L) varieties. Plos One. 2013;8:e82801. doi: 10.1371/journal.pone.0082801. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Edwards JE, McEwan NJ, Travis AJ, Wallace JR. 16S rDNA library-based analysis of ruminal bacteria diversity. Antonie van Leeuwenhoek. 2004;86(3):263–81. doi: 10.1023/B:ANTO.0000047942.69033.24. [DOI] [PubMed] [Google Scholar]
- 6.Kim M, Morrison M, Yu Z. Status of the phylogenetic diversity census of ruminal microbiomes. FEMS Microbiol Ecol. 2011;76:49–63. doi: 10.1111/j.1574-6941.2010.01029.x. [DOI] [PubMed] [Google Scholar]
- 7.Fouts DE, et al. Generation Sequencing to Define Prokaryotic and Fungal Diversity in the Bovine Rumen. Plos One. 2012;7(11):e48289. doi: 10.1371/journal.pone.0048289. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Ross EM, et al. High throughput whole rumen metagenome profiling using untargeted massively parallel sequencing. BMC Genetics. 2012;13:53. doi: 10.1186/1471-2156-13-53. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Morgavi DP, Kelly WJ, Janssen PH, Attwood GT. Rumen microbial (meta)genomics and its application to ruminant production. Animal. 2013;7(Suppl 1):184–201. doi: 10.1017/S1751731112000419. [DOI] [PubMed] [Google Scholar]
- 10.Creevey CJ, Kelly WJ, Henderson G, Leahy SC. Determining the culturability of the rumen bacterial microbiome. Microbial Biotechnology. 2014;7(5):467–79. doi: 10.1111/1751-7915.12141. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Rubino F, et al. Divergent functional isoforms drive niche specialisation for nutrient acquisition and use in rumen microbiome. The ISME Journal. 2017;11(4):932–944. doi: 10.1038/ismej.2016.172. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Wilmes P, Bond PL. The application of two‐dimensional polyacrylamide gel electrophoresis and downstream analyses to a mixed community of prokaryotic microorganisms. Environ. Microbiol. 2004;6:911–920. doi: 10.1111/j.1462-2920.2004.00687.x. [DOI] [PubMed] [Google Scholar]
- 13.Hettich RL, Pan C, Chourey K, Giannone RJ. Metaproteomics: Harnessing the power of high performance mass spectrometry to identify the suite of proteins that control metabolic activities in microbial communities. Anal Chem. 2013;85(9):4203–4214. doi: 10.1021/ac303053e. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Snelling TJ, Wallace RJ. The rumen microbial metaproteome as revealed by SDS-PAGE. BMC Microbiol. 2017;17:9. doi: 10.1186/s12866-016-0917-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Benndorf D, Balcke GU, Harms H, von Bergen M. Functional metaproteome analysis of protein extracts from contaminated soil and ground water. The ISME Journal. 2007;1:224–234. doi: 10.1038/ismej.2007.39. [DOI] [PubMed] [Google Scholar]
- 16.Chourey K, et al. Direct cellular lysis/protein extraction protocol for soil metaproteomics. J. Proteome Research. 2010;9(12):6615–22. doi: 10.1021/pr100787q. [DOI] [PubMed] [Google Scholar]
- 17.Wu X, Gong F, Wang. W. Protein extraction from plant tissues for 2DE and its application in proteomic analysis. Proteomics. 2014;14:645–658. doi: 10.1002/pmic.201300239. [DOI] [PubMed] [Google Scholar]
- 18.Shi W, et al. Methane yield phenotypes linked to differential gene expression in the sheep rumen microbiome. Genome Res. 2014;24:1517–1525. doi: 10.1101/gr.168245.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Weimer PJ. Redundancy, resilience, and host specificity of the ruminal microbiota: implications for engineering improved ruminal fermentations. Frontiers in Microbiology. 2015;6:296. doi: 10.3389/fmicb.2015.00296. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Damerval C, De Vienne D, Zivy M, Thiellement H. Technical improvements in two-dimensional electrophoresis increase the level of genetic variation detected in wheat seedling proteins. Electrophoresis. 1986;7:52–54. doi: 10.1002/elps.1150070108. [DOI] [Google Scholar]
- 21.Hart E, Onime LA, Davies T, Morphew R, Kingston-Smith A. The effects of PPO activity on the proteome of ingested red clover and implications for improving the nutrition of grazing cattle. Journal of Proteomics. 2016;141:67–76. doi: 10.1016/j.jprot.2016.04.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Vâlcu CM, Schlink K. Efficient extraction of proteins from woody plant samples for two-dimensional electrophoresis. Proteomics. 2006;6:4166–4175. doi: 10.1002/pmic.200500660. [DOI] [PubMed] [Google Scholar]
- 23.Toyoda A, Iio W, Mitsumori M, Minato H. Isolation and identification of cellulose-binding proteins from sheep rumen contents. Applied and Environmental Microbiology. 2009;75(6):1667–1673. doi: 10.1128/AEM.01838-08. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Schneider T, et al. Who is who in litter decomposition? Metaproteomics reveals major microbial players and their biogeochemical functions. The ISME Journal. 2012;6:1749–1762. doi: 10.1038/ismej.2012.11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Carpentier SC, et al. Preparation of protein extracts from recalcitrant plant tissues: an evaluation of different methods for twodimensional gel electrophoresis analysis. Proteomics. 2005;5:2497–2507. doi: 10.1002/pmic.200401222. [DOI] [PubMed] [Google Scholar]
- 26.Faurobert M, Pelpoir E, Chaϊb J. Phenol extraction of proteins for proteomic studies of recalcitrant plant tissues. Methods Mol. Biol. 2007;355:9–14. doi: 10.1385/1-59745-227-0:9. [DOI] [PubMed] [Google Scholar]
- 27.Wang W, Vignani R, Scali M, Cresti M. A universal and rapid protocol extraction from recalcitrant plant tissues for proteomic analysis. Electrophoresis. 2006;27(13):2782–2786. doi: 10.1002/elps.200500722. [DOI] [PubMed] [Google Scholar]
- 28.Wilmes P, Wexler M, Bond PL. Metaproteomics provides functional insight into activated sludge wastewater treatment. Plos One. 2008;3:e1778. doi: 10.1371/journal.pone.0001778. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Meusure B, et al. The Unipept metaproteomics analysis pipeline. Proteomics. 2015;15(8):1437–1442. doi: 10.1002/pmic.201400361. [DOI] [PubMed] [Google Scholar]
- 30.Muth T, Behne A, Heyer R, Kohrs F, Benndorf D. The MetaProteomeAnalyzer: A Powerful Open-Source Software Suite for Metaproteomics Data Analysis and Interpretation. Journal of Proteome Research. 2015;14(3):1557–65. doi: 10.1021/pr501246w. [DOI] [PubMed] [Google Scholar]
- 31.Letunic I, Yamada T, Kanehisa M, Bork P. iPath: interactive exploration of biochemical pathways and networks. Trends Biochem Sci. 2008;33(3):101–3. doi: 10.1016/j.tibs.2008.01.001. [DOI] [PubMed] [Google Scholar]
- 32.Yamada T, Letunic I, Okuda S, Kanehisa M, Bork P. iPath2.0: interactive pathway explorer. Nucleic Acids Research. 2011;39(Suppl 2):W412–W415. doi: 10.1093/nar/gkr313. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Kanehisa M, Sato Y, Kawashima M, Furumichi M, Tanabe M. KEGG as a reference resource for gene and protein annotation. Nucleic Acids Res. 2016;44:D457–D462. doi: 10.1093/nar/gkv1070. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Wright AD, Klieve. AD. Does the complexity of the rumen microbial ecology preclude methane mitigation. Animal Feed Science and Technology. 2011;166–167:248–253. doi: 10.1016/j.anifeedsci.2011.04.015. [DOI] [Google Scholar]
- 35.De Menezes AB, et al. Microbiome analysis of dairy cows fed pasture or total mixed ration diets. FEMS Microbiol Ecol. 2011;78(2):256–65. doi: 10.1111/j.1574-6941.2011.01151.x. [DOI] [PubMed] [Google Scholar]
- 36.Hart KJ, Yanez-Ruiz DR, Duval SM, McEwan NR, Newbold CJ. Plant extracts to manipulate rumen fermentation. Animal feed and science technology. 2008;147(1–3):8–35. doi: 10.1016/j.anifeedsci.2007.09.007. [DOI] [Google Scholar]
- 37.Brulc JM, et al. Gene-centric metagenomics of the fiber-adherent bovine rumen microbiome reveals forage specific glycoside hydrolases. Proc Natl Acad Sci USA. 2009;106(6):1948–53. doi: 10.1073/pnas.0806191105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Journay, J. P. Rumen microbial metabolism and ruminant digestion. Paris. Institut national de la recherche agronomique, ISBN. 978-2-7592-0522-6 (1991).
- 39.Henderson G, et al. Rumen microbial community composition varies with diet and host, but a core microbiome is found across a wide geographical range. Scientific reports. 2015;5:14567. doi: 10.1038/srep14567. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Hess M, et al. Metagenomic discovery of biomass-degrading genes and genomes from cow rumen. Science. 2011;331:463–7. doi: 10.1126/science.1200387. [DOI] [PubMed] [Google Scholar]
- 41.Jami E, Mizrahi I. Composition and Similarity of Bovine Rumen Microbiota across Individual Animals. Plos One. 2012;7(3):e33306. doi: 10.1371/journal.pone.0033306. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Koike S, Kobayashi Y. Fibrolytic Rumen Bacteria: Their Ecology and Functions Asian-Aust. J. Anim. Sci. 2009;22(1):131–138. [Google Scholar]
- 43.Shinkai, T., Ohji, R., Matsumoto, N. & Kobayashi, Y. FEMS Microbiol Letts, 10.1111/j.1574-6968.2009.01565.x (2009). [DOI] [PubMed]
- 44.Mayorga O, et al. Temporal metagenomic and metabolomic chracterization of fresh perennial ryegrass degradation by rumen bacteria. Frontiers in Microbiology. 2016;7:1854. doi: 10.3389/fmicb.2016.01854. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Weijland A, Harmark K, Cool RH, Anborgh PH, Parmeggiani A. Elongation factor Tu: a molecular switch in protein biosynthesis. Mol Microbiol. 1992;6(6):683–8. doi: 10.1111/j.1365-2958.1992.tb01516.x. [DOI] [PubMed] [Google Scholar]
- 46.Czerkawski, J. W. An introduction to rumen studies. Pergamon Press, Great Britain, A. Wheaton and Co Ltd, Exeter, ISBN 0-08-025487-X (1986).
- 47.Jiang, Y., Xiong, X., Danska, J. & Parkinson, J. Metatranscriptomic analysis of diverse microbial communities reveals core metabolic pathways and microbiome-specific functionality. Microbiome4.2, 10.1186/s40168-015-0146-x (2016). [DOI] [PMC free article] [PubMed]
- 48.Reichardt N, et al. Phylogenetic distribution of three pathways for propionate production within the human gut microbiota. The ISME Journal. 2014;8:1323–1335. doi: 10.1038/ismej.2014.14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Macy JM, Probst I. The biology of gastrointestinal bacteriodes. Annual Review of Microbiology. 1979;33:561–594. doi: 10.1146/annurev.mi.33.100179.003021. [DOI] [PubMed] [Google Scholar]
- 50.Wallace, R. J., Onodera, R. & Cotta, M. A. Metabolism of nitrogen-containing compounds. In: The rumen microbial ecosystem (Hobson, P. N. & Stewart, C. S., Eds) 283–328 (Blackie Academic and Professional London, 1997).
- 51.McDonald, P. et al. Animal Nutrition. Seventh edition. ISBN: 978-1-4082-0423-8 (Pearson Education, 2011).
- 52.Wallace RJ. The proteolytic systems of ruminal microorganisms. Annales de Zootechnie. 1996;45(Suppl. 1):301–308. doi: 10.1051/animres:19960653. [DOI] [Google Scholar]
- 53.Huws, S. et al. Temporal dynamics of the metabolically active rumen bacteria colonising fresh perennial ryegrass. FEMS Microbiology Ecology 92(1), fiv 137, 10.1093/femsec/fiv137 (2016). [DOI] [PubMed]
- 54.Bradford MM. A rapid and sensitive method for the quantification of microgram quantities of protein utilizing the principle of protein-dye binding. Analytical biochemistry. 1976;72:248–254. doi: 10.1016/0003-2697(76)90527-3. [DOI] [PubMed] [Google Scholar]
- 55.Shevencho A, Wilme M, Vorm O, Mann M. Mass Spectromic sequencing of proteins silve-stained polyacrylamide gels. Analytical Chemistry. 1996;68:850–858. doi: 10.1021/ac950914h. [DOI] [PubMed] [Google Scholar]
- 56.Hart EH, et al. A new enabling proteomics methodology to investigate membrane associated proteins from parasitic nematodes: case study using ivermectin resistant and ivermectin susceptible isolates of Caenorhabditis elegans and Haemonchus contortus. Veterinary Parasitology. 2015;207:266–275. doi: 10.1016/j.vetpar.2014.12.003. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The datasets analysed during the study are available from the corresponding author on reasonable request.