Abstract
Next generation sequencing (NGS) technology is a widely accepted tool used by microbial ecologists to explore complex microbial communities in different ecosystems. As new NGS platforms continue to become available, it becomes imperative to compare data obtained from different platforms and analyze their effect on microbial community structure. In the present study, we compared sequencing data from both the 454 and Ion Torrent (PGM) platforms on the same DNA samples obtained from the rumen of dairy cows during their transition period. Despite the substantial difference in the number of reads, error rate and length of reads among both platforms, we identified similar community composition between the two data sets. Procrustes analysis revealed similar correlations (M2 = 0.319; P = 0.001) in the microbial community composition between the two platforms. Both platforms revealed the abundance of the same bacterial phyla which were Bacteroidetes and Firmicutes; however, PGM recovered an additional four phyla. Comparisons made at the genus level by each platforms revealed differences in only a few genera such as Prevotella, Ruminococcus, Succiniclasticum and Treponema (p < 0.05; chi square test). Collectively, we conclude that the output generated from PGM and 454 yielded concurrent results, provided stringent bioinformatics pipelines are employed.
Keywords: Next generation sequencing, Rumen microbiome, Transition period, Dairy cows
Introduction
Microbes are integral components of a diverse group of ecosystems and have co-evolved with their host/habitat in a mutually symbiotic relationship (Hobson & Stewart, 1997). The foregut (rumen) of ruminants is comprised of a complex microbial genetic web (rumen microbiome) that plays a pivotal role in the host nutrition and ultimate wellbeing of the animals (Hobson & Stewart, 1997). Bacteria are predominant in the rumen microbiome and are responsible for the conversion of indigestible plant biomass to energy and also aid in the formation of microbial protein; both processes drive the production efficiency of ruminants (Firkins, 2010). Interactions between different microbial domains in the rumen play a significant role in determining the ruminal microbial ecology and their functional contribution to host metabolism (Kumar et al., 2015). Rumen microbial dynamics are influenced by a number of factors including host specificity, diet, age and the environment (Edwards et al., 2004). Elucidation of the interactions among microbial domains particularly in dairy cows has the potential to improve production such as feed efficiency and milk fat synthesis (Weimer, 2015). The transition period in dairy cows refers to a critical phase in the lactation cycle, lasting from three weeks before calving to three weeks post-calving, where the dairy cow experiences stress due to changes in diet, metabolism and physiological status. Although the dynamics of rumen bacteria during the transition period has received attention in the recent past (Lima et al., 2015; Pitta et al., 2014; Wang et al., 2012), further studies are required to understand the rumen microbial dynamics during the different phases of lactation across a large group of dairy cows.
Cultivation-independent approaches have greatly enhanced our knowledge on microbial diversity and also enabled us to assess their functional contribution to the host metabolism (Stahl et al., 1988). Particularly, next-generation sequencing (NGS) technology has enabled the sequencing of human and microbial genomes in a relatively short period of time (Caporaso et al., 2011). The most widely used high-throughput sequencing platforms available in the market include Roche 454 pyrosequencing, Ion Torrent Personal Genome Machine (PGM), and Illumina HiSeq (Liu et al., 2012). Although these platforms were originally tailored for large-scale operations such as whole genome sequencing, their bench-top versions (454 Jr, PGM, and MiSeq, respectively) have evolved since 2011 and have been extensively applied to bacterial genome sequencing (Loman et al., 2012). Since 2011, both MiSeq and PGM platforms have undergone improvements, including longer read lengths, more reads per unit cost, faster turn-around time, and a reduction in error rates (Salipante et al., 2014). Although a general comparison between these platforms has been reported (Lam et al., 2012; Quail et al., 2012), studies comparing the efficacy of these platforms on the same samples are limited (Salipante et al., 2014; Scott & Ely, 2015). As the Roche 454 platform phases out (Fordyce et al., 2015), there is a need for comparative studies that can aid in the transition from Roche 454 to other platforms.
The use of next generation platforms has greatly enhanced our knowledge of rumen microbes, their genes and enzymes (Brulc et al., 2009; Hess et al., 2011; Jami, White & Mizrahi, 2014). To date, there are nearly 55 research articles (based on Pubmed, 28 Jan, 2015) related to bacterial diversity from the rumen environment using Roche 454 while only 2 from MiSeq and 3 were reported based on Ion Torrent platforms. In an attempt to find a suitable alternative to the Roche 454 platform in relation to our microbial genomic work, we evaluated the use of Ion Torrent as an alternative to the 454 platform for the study of rumen microbial composition via 16S tag sequencing.
Materials and Methods
Sample collection
Dairy cows that were donors of rumen fluid were maintained at Marshak farm and were maintained according to the ethics committee and IACUC standards for the University of Pennsylvania (approval #804302). Four primiparous and four multiparous cows were sampled at three weeks prior to the anticipated calving date (S1), and again at 1–3 days post-calving (S2). Details of the animal experiment design, sampling protocol, and type of diet are described in a previous study (Pitta et al., 2014). Two samples were removed from the analysis due to a low number of reads in the Roche 454 sequencing run: one each from the primiparous and multiparous group in the pre-calving period. Thus, we analyzed a total of 14 samples, with six samples from pre-calving period and eight samples from post-calving period (Table S1).
DNA extraction, PCR amplification, and 16S rRNA sequencing
The genomic DNA was extracted from all the rumen samples employing PSP Spin Stool DNA Plus Kit (Invitek, Berlin, Germany) using the protocol of Dollive et al. (2012). The genomic DNA was amplified using the specific primers (27F) and BSR357, targeting the V1–V2 region of the 16S rRNA bacterial gene. The primer sequences and PCR conditions for Roche 454 are described in Pitta et al. (2014). Though the primer sequences for Ion Torrent were similar to Roche 454, the forward primer carried the Ion Torrent trP1 (5′-CCTCTCTATGGGCAGTCGGTGAT-3′) and the reverse primer carried the A adapter (5′-CCATCTCATCCCTGCGTGTCTCCGACTCAG-3′), followed by a 10–12 nucleotide sample-specific barcode sequence and a GAT barcode adapter. The PCR mix was prepared using the Platinum PCR SuperMix High Fidelity kit (Invitrogen, Carlsbad, CA, USA). PCR conditions were the same for Roche 454 and Ion Torrent, as given by Pitta et al. (2014). Amplicons of 16S rDNA were purified using 1:1 volume of Agentcourt AmPure XP beads (Beckman-Coulter, Brea, CA, USA). The purified PCR products from the rumen samples were pooled in equal concentration prior to sequencing in Roche 454 (Roche 454 Life Sciences, Branford, CT, USA) and Ion Torrent platforms.
Bioinformatics and statistical analysis
To evaluate the similarities and dissimilarities between Roche 454 and Ion Torrent, we analyzed the 16S pyrosequence reads using the QIIME pipeline (version 1.8.0) (Caporaso et al., 2010a) and a small number of custom python scripts, followed by statistical analysis in R (R Core Team, 2013). Reads from both platforms were discarded if they did not match the expected sample-specific barcode and 16S primer sequences (forward and reverse primers), or if they were shorter than 50 bp or longer than 480 bp, or if they contained one or more ambiguous base calls. Reads were also discarded if a long homopolymer sequence was present; the threshold used was 5 bp for both platforms. Operational taxonomic units (OTUs) were formed at 97% similarity using UCLUST (Edgar, 2010). Taxonomic assignments within the GreenGenes taxonomy (12/10 release, McDonald et al., 2012) were generated using the RDP Classifier version 2.2 (Wang et al., 2007). We randomly sub-sampled (rarified) the resulting OTUs to 1,212 sequence per sample for both Roche 454 and Ion Torrent. Representative sequences for each were aligned to 16S reference sequences with PyNAST (Caporaso et al., 2010b). The resultant multiple sequence alignment was used to infer a phylogenetic tree with FastTree Price et al., 2010. To find shared OTUs between Roche 454 and Ion Torrent, we used a “closed reference” OTU picking approach. To accomplish this, we identified Roche 454 OTUs that were shared between platforms, representative sequences from Roche 454 were compared to a reference database consisting of the Ion Torrent representative sequences. The same approach was adopted to identify shared OTUs in the Ion Torrent data. In this step, a sequence was considered a “hit” if it matched a sequence in the reference database at greater than 97% sequence identity.
Two measures of alpha diversity were calculated: Shannon entropy, an indicator of evenness in community structure, and richness, the number of OTUs observed. Analyses of community similarity (β-diversity) were performed for both the platforms separately by calculating pairwise distances using the phylogenetic metric UniFrac (Lozupone & Knight, 2005) on platform specific OTUs. We ran a Procrustes analysis of weighted UniFrac distance, comparing the principal coordinate matrices from Roche 454 and Ion Torrent. The goodness of fit (M2 value) was measured by summing over the residuals, and significance was assessed by the Monte Carlo label permutation method (Gower, 1975).
Representative sequences from each OTU were chosen and taxonomy was assigned using the default methods in QIIME. To test for differences in taxon abundance, we normalized the abundances to the total number of reads in each sample (relative abundance). We considered the phyla appearing in at least 75% of samples. A generalized linear mixed-effects model was constructed with the lme4 package for R (Bates et al., 2013). The model used a binomial link function and included a random effect term for each animal. Study day was modeled as a continuous longitudinal variable with the following values: S1= − 3 weeks, S2=0.285 weeks.
Results
Analysis of 16S sequence clusters
A total of 39,592 and 280,284 raw sequences were obtained from Roche 454 and Ion Torrent respectively, across a total of 14 samples on each platform. To minimize differences between the two platforms, we employed similar quality control protocols, including quality filtering, primer detection, and read demultiplexing. The number of reads recovered after quality filtering the reads of the 14 samples was 29,057 (73.39%) from Roche 454 and 203,910 (72.75%) from Ion Torrent (Table S1). Clustering with ULCUST generated 6,471 and 30,322 OTUs in Roche 454 and Ion Torrent, respectively. We randomly subsampled (rarified) the OTUs to 1,212 sequences per sample. Following the random subsampling step, we identified 2,824 shared OTUs from a total of 7,142 (40%) in the Roche 454 data. In the Ion Torrent data, we identified 3,118 shared OTUs from 9,813 OTUs (32%).
Taxonomic comparisons
Taxonomic assignment of the OTUs identified 15 (Roche 454) and 18 (Ion Torrent) phyla in the rumen of cows used in this study. The three bacterial phyla that were recovered by Ion Torrent platform alone were Acidobacteria, GN02, and Verrucomicrobia (Table S2), which accounted for a very low abundance (0.01%) and were detected only in a few samples. The most abundant phyla in both the platforms were Bacteroidetes followed by Firmicutes, which together constituted over 90% of each sample in Roche 454 and over 86% in Ion Torrent (Fig. 1 and Table S2) The influence of study day (before and after calving) and study group (primiparous and multiparous cows) on the abundance of Bacteroidetes and Firmicutes appears to be similar for both Roche 454 and Ion Torrent data sets (Table 1). However, the influences of study group and study day in lower abundant phyla were different, for example the influence of study group and study day on Fibrobacteres is significant in Roche 454 whereas it is insignificant in Ion Torrent datasets.
Table 1. Mean sample proportion of bacterial phyla by study group (Pp, primiparous and Mp, multiparous) and study day (S1, 3 weeks prior to calving; S2, 1–3 days after calving) in Roche 454 and Ion Torrent samples.
Phylum | Platform | Study group (SG) | Study day (SD) | P-value | ||||
---|---|---|---|---|---|---|---|---|
Pp | Mp | S1 | S2 | PSG | PSD | PSG∗SD | ||
Bacteroidetes | Roche 454 | 0.8450 | 0.7700 | 0.7640 | 0.8400 | ** | *** | *** |
Ion Torrent | 0.7613 | 0.7032 | 0.6940 | 0.7610 | * | *** | *** | |
Cyanobacteria | Roche 454 | 0.0040 | 0.0030 | 0.0060 | 0.0020 | *** | ** | |
Ion Torrent | 0.0069 | 0.0050 | 0.0077 | 0.0047 | ||||
Fibrobacteres | Roche 454 | 0.0110 | 0.0150 | 0.0140 | 0.0110 | ** | *** | *** |
Ion Torrent | 0.0082 | 0.0115 | 0.0086 | 0.0108 | *** | |||
Firmicutes | Roche 454 | 0.0800 | 0.1060 | 0.1200 | 0.0730 | *** | *** | |
Ion Torrent | 0.1164 | 0.1420 | 0.1492 | 0.1142 | *** | |||
Proteobacteria | Roche 454 | 0.0080 | 0.0200 | 0.0150 | 0.0140 | *** | ** | ** |
Ion Torrent | 0.0096 | 0.0097 | 0.0105 | 0.0090 | ||||
Spirochaetes | Roche 454 | 0.0010 | 0.0040 | 0.0030 | 0.0020 | ** | ||
Ion Torrent | 0.0035 | 0.0073 | 0.0067 | 0.0044 | ||||
Tenericutes | Roche 454 | 0.0050 | 0.0110 | 0.0120 | 0.0050 | * | ||
Ion Torrent | 0.0094 | 0.0109 | 0.0136 | 0.0076 | ** | |||
TM7 | Roche 454 | 0.0030 | 0.0050 | 0.0050 | 0.0040 | ** | * | * |
Ion Torrent | 0.0050 | 0.0089 | 0.0093 | 0.0052 | * | ** |
Notes.
The magnitude of the p-values for the effect of study group (PSG), study day (PSD), and the interaction term (PSG∗SD) are shown on the right.
P < 0.001.
P < 0.01.
P < 0.05.
We further compared the Roche 454 and Ion Torrent datasets at the genus level for the two major phyla observed (Bacteroidetes and Firmicutes). Genera with a proportion exceeding 1% in at least one sample were included in the analysis. In Roche 454, three genera from the Bacteroidetes group and one genus among Firmicutes showed differences by study group, whereas three genera from Bacteroidetes and four genera from Firmicutes showed differences by study day. In contrast, a similar analysis of Ion Torrent data revealed no differences within Bacteroidetes group and revealed differences among three genera in the Firmicutes with study group. With regard to study day, two genera in Bacteroidetes group and one genus in Firmicutes showed differences (Table 2). Notably, we found that BF311 (an uncultured genus of Bacteroidetes), Mogibacteriaceae (Firmicutes), and Selenomonas (Firmicutes) were detected only in the Ion Torrent platform.
Table 2. Mean sample proportion of bacterial genera by study group (Pp, primiparous and Mp, pluriparous) and study day (S1, 3 weeks prior to calving; S2, 1–3 days after calving) in 454 and Ion torrent samples.
Taxon | Platform | Study group (SG) | Study day (SD) | P-value | ||||
---|---|---|---|---|---|---|---|---|
Pp | Mp | S1 | S2 | PSG | PSD | PSG∗SD | ||
Bacteroidetes | ||||||||
Paraprevotellaceae | Roche 454 | 0.0040 | 0.0080 | 0.0040 | 0.0070 | |||
Ion Torrent | 0.0070 | 0.0070 | 0.0070 | 0.0070 | ||||
Bacteroidales | Roche 454 | 0.0420 | 0.0720 | 0.0620 | 0.0540 | ** | *** | *** |
Ion Torrent | 0.0430 | 0.0630 | 0.0560 | 0.0500 | ||||
BF311 | Roche 454 | ND | ND | ND | ND | |||
Ion Torrent | 0.0020 | 0.0040 | 0.0040 | 0.0020 | ||||
CF231 | Roche 454 | 0.0130 | 0.0180 | 0.0150 | 0.0150 | |||
Ion Torrent | 0.0160 | 0.0170 | 0.0150 | 0.0170 | ||||
Prevotella | Roche 454 | 0.7200 | 0.5680 | 0.5820 | 0.6900 | ** | *** | *** |
Ion Torrent | 0.6280 | 0.5270 | 0.5250 | 0.6170 | *** | *** | ||
Prevotellaceae | Roche 454 | 0.0040 | 0.0060 | 0.0030 | 0.0070 | * | * | |
Ion Torrent | 0.0040 | 0.0050 | 0.0040 | 0.0040 | ||||
RF16 | Roche 454 | 0.0350 | 0.0430 | 0.0610 | 0.0230 | *** | *** | |
Ion Torrent | 0.0360 | 0.0420 | 0.0520 | 0.0290 | *** | |||
S24-7 | Roche 454 | 0.0150 | 0.0360 | 0.0140 | 0.0340 | * | ||
Ion Torrent | 0.0120 | 0.0190 | 0.0090 | 0.0200 | ||||
YRC22 | Roche 454 | 0.0100 | 0.0100 | 0.0130 | 0.0080 | |||
Ion Torrent | 0.0110 | 0.0110 | 0.0110 | 0.0110 | ||||
Firmicutes | ||||||||
Mogibacteriaceae | Roche 454 | ND | ND | ND | ND | |||
Ion Torrent | 0.0020 | 0.0030 | 0.0030 | 0.0020 | ||||
Butyrivibrio | Roche 454 | 0.0040 | 0.0020 | 0.0030 | 0.0030 | |||
Ion Torrent | 0.0070 | 0.0050 | 0.0060 | 0.0050 | ||||
Clostridiales | Roche 454 | 0.0240 | 0.0290 | 0.0370 | 0.0180 | *** | *** | |
Ion Torrent | 0.0330 | 0.0380 | 0.0460 | 0.0270 | * | *** | * | |
Coprococcus | Roche 454 | 0.0020 | 0.0020 | 0.0030 | 0.0020 | |||
Ion Torrent | 0.0060 | 0.0030 | 0.0040 | 0.0050 | ||||
Lachnospiraceae | Roche 454 | 0.0130 | 0.0140 | 0.0150 | 0.0120 | |||
Ion Torrent | 0.0210 | 0.0190 | 0.0230 | 0.0180 | ||||
RFN20 | Roche 454 | 0.0040 | 0.0150 | 0.0130 | 0.0060 | ** | ** | |
Ion Torrent | 0.0040 | 0.0150 | 0.0110 | 0.0090 | * | |||
Ruminococcaceae | Roche 454 | 0.0060 | 0.0120 | 0.0140 | 0.0060 | ** | ||
Ion Torrent | 0.0080 | 0.0210 | 0.0170 | 0.0120 | *** | |||
Ruminococcus | Roche 454 | 0.0040 | 0.0080 | 0.0070 | 0.0050 | |||
Ion Torrent | 0.0100 | 0.0110 | 0.0110 | 0.0100 | ||||
Selenomonas | Roche 454 | ND | ND | ND | ND | |||
Ion Torrent | 0.0030 | 0.0020 | 0.0010 | 0.0040 | ||||
Succiniclasticum | Roche 454 | 0.0110 | 0.0070 | 0.0130 | 0.0060 | *** | ||
Ion Torrent | 0.0100 | 0.0070 | 0.0110 | 0.0070 | ||||
Veillonellaceae | Roche 454 | 0.0020 | 0.0020 | 0.0010 | 0.0030 | |||
Ion Torrent | 0.0030 | 0.0040 | 0.0030 | 0.0040 |
Notes.
ND, Not Detected; The magnitude of the p-values for the effect of study group (PSG), study day (PSD), and the interaction term (PSG∗SD) are shown on the right.
P < 0.001.
P < 0.01.
P < 0.05.
Among Bacteroidetes members, the single most abundant genus in both the platforms was Prevotella (Roche 454: 81% and Ion Torrent: 75%; Fig. 4 and Table S2). The influence of study day on microbial composition of Prevotella was similar for both Roche 454 and Ion Torrent data sets, whereas, the influence of study group was observed only in Roche 454. In both platforms, most of the remaining sequences in the phylum Bacteroidetes were assigned either at the family or order level to unidentified unnamed species of the Prevotellaceae family or Bacteroidales order (Figs. S1 and S2). Genera in the Firmicutes phylum were also similar between the two platforms. The OTUs assigned to Firmicutes represented a substantial number of genera from Clostridiales, along with the Lachnospiraceae, Ruminococcaceae and Veillonellaceae families (Table 2; Figs. S3 and S4).
Comparison between Roche 454 and Ion Torrent platforms
Alpha diversity
The number of observed species per sample was higher for the Ion Torrent method, compared to Roche 454 (Fig. 2). The Shannon index value was similar between the two platforms at various sequencing depths, indicating a similar number of highly abundant species in both platforms.
Beta diversity
We then quantified the resemblance between bacterial communities measured by Roche 454 and Ion Torrent. Weighted UniFrac distances for both communities were plotted using principal coordinate analysis (PCoA), and then the two PCoA plots were aligned using generalized Procrustes analysis (Gower, 1975). The aligned PCoA plot is visualized in Fig. 3. A comparison of pairwise sample distances by Procrustes analysis revealed moderate but statistically significant agreement in the microbial community composition between the two platforms (M2 = 0.36; P = 0.001).
Taxonomic comparison at genus level
A sample-to-sample to comparison at the genus level between the two platforms was performed using chi square test (Table S3). For this analysis, twenty nine genera were compared by sample for both platforms (Table S3). Of these, only Prevotella from Bacteroidetes phylum, and one unclassified bacterial lineage were found to be different between the two platforms in more that 50% of samples (p < 0.05; chi square test). A majority of the remaining genera were found to be different between only one or two samples. Thus, the composition of bacterial communities was generally consistent between platforms, with two notable exceptions.
Discussion
The introduction of NGS technology has had a dramatic effect on researchers’ ability to study bacterial communities via DNA sequencing. The throughput has increased by 500,000-fold and the number of reads per genome is increasing 100-fold every year, yet the cost of sequencing is reducing by half every 5 months (Baker, 2010). The rumen microbiome of herbivores is a classic example that has been explored in detail using different NGS approaches (De la Fuente Oliver et al., 2014; Jami, White & Mizrahi, 2014; Peng et al., 2015; Pitta et al., 2014; Pitta et al., 2010), but several inconsistencies among these reports prevail, some of which may be attributed to differences in the approach employed. In this study, we attempted to account for biases that could be introduced due to different NGS platforms while making comparisons between 16S rDNA bacterial profiles in the rumen of dairy cows. We concluded that, although the percentage of common sequences was low between the platforms, the microbial fingerprints and phylogenetic composition retrieved by both Roche 454 and Ion Torrent platforms are comparable, with minor exceptions.
Ion Torrent vs. Roche 454 Roche platforms
Considering the limitations such as low quality reads, variations in read length, and higher number of homopolymer sequences associated with Ion Torrent (Loman et al., 2012), we leveraged to extract more number of sequences per sample in Ion Torrent platform with the result that throughput was several fold higher in magnitude as compared to Roche 454 datasets. High quality reads with greater read lengths despite lower throughout from Roche 454 platform is the most preferred and desirable feature for microbial diversity studies (Brulc et al., 2009). Both the platforms delivered long reads, but the read length of Ion Torrent had a wider range compared to Roche 454 (Roche 454; 53–492, Ion Torrent; 8–559). In our study, we observed that a greater proportion of sequences were eliminated at 3 and 4 homopolymer length while a greater proportion of sequences were retained at 5 homopolymer length. For downstream analysis, we recovered 29,057 (73.39%) from a total of 39,592 raw reads in Roche 454 platform and 203,910 (72.75%) from a total of 280,284 raw reads in Ion Torrent platform at 5 homopolymer length. To account for unequal distribution of reads per sample between platforms, (1,212–3,036 in Roche 454 vs. 5,653–27,336 in Ion Torrent), we adopted normalization through subsampling of reads at the minimum sequencing depth for both platforms. This additional step was performed to avoid the influence of this differential distribution of reads on the bacterial composition.
Ruminal Bacterial diversity dynamics
The effect of diet and age on the community composition, evident in our previous study using Roche 454 platforms (Pitta et al., 2014), was reproduced when the same set of DNA samples were sequenced on Ion Torrent. This increases confidence that the two platforms will yield similar results in future studies of bacterial communities in the rumen. The finding is reinforced by Procrustes analysis, which showed that overall community composition was similar between both platforms. Although the overall level of similarity was not great enough to combine data from two platforms in a single analysis of UniFrac distance, we determined that independent distance analyses were reproducible between platforms. Further, we were able to infer that increasing the sequence depth did not unduly influence community profiles when analyzed by UniFrac distance, reinforcing the findings of Caporaso et al. (2011).
The taxonomic composition of bacterial communities at the phylum level recovered by Ion Torrent is congruent with the Roche 454 data. The most abundant phyla were Bacteroidetes followed by Firmicutes, which together constituted over 90% of each sample in Roche 454 and over 86% in Ion Torrent. The phylum-level abundance observed here agrees with other reports that have employed Roche 454 (Pitta et al., 2014; Pitta et al., 2010) and also Ion Torrent (De la Fuente Oliver et al., 2014). In addition to these phyla, Fibrobacteres, Proteobacteria and Tenericutes, Actinobacteria and Spirochaetes are commonly reported in rumen samples from dairy cows, but contribute to a very low abundance. This observation was also recapitulated in our findings for both platforms. Additionally, the 16S rDNA-based taxonomic composition of bacterial communities retrieved here is in accordance with other shotgun metagenomic datasets sequenced on the Ion Torrent platform (Patel et al., 2014; Pitta et al., 2015; Singh et al., 2014).
Differences between datasets from the two platforms were evident in phyla that contributed to less than 5% of the populations. A sample-to-sample comparison revealed differences in phylogenetic composition from family and beyond. For example, at the genus level, although Prevotella was abundant in both platforms, the lower abundance of Prevotella observed in Ion Torrent compared to Roche 454 may be due to an increase in number (14) of several genera that were not detected in the Roche 454 data. Despite these differences at the genus level, the effect of age and diet on different bacterial genera in the rumen of dairy cows was consistent between the two platforms.
Conclusions
It has become evident in the recent past that the rumen microbiome plays a significant role in improving the production efficiencies of dairy cows. As next-generation sequencing platforms and chemistries continue to expand and improve, we expect that major advancements in sequencing may contribute to significant improvements in dairy production through improving nutrition and management that support a production efficient microbiome. While different sequencing platforms are applied to explore the rumen microbiome, it is imperative that studies should also compare and contrast findings from different platforms to avoid discrepancies across rumen microbiome studies. The results presented here show that major conclusions based on Roche 454 data are also reproduced on the Ion Torrent platform. Thus, we found that the Ion Torrent platform is a suitable option for future rumen microbiology studies, provided that researchers are consistent in DNA extraction methods, PCR protocols, and bioinformatics pipelines.
Supplemental Information
Acknowledgments
We are thankful to the Biomedical Research Core Facilities, University of Pennsylvania, for sequencing services.
Funding Statement
Part of the funding for this study came from internal grants from Center for Host-Microbe interactions, University Research Foundation from the University of Pennsylvania. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Additional Information and Declarations
Competing Interests
The authors declare there are no competing interests.
Author Contributions
Nagaraju Indugu analyzed the data, contributed reagents/materials/analysis tools, wrote the paper, prepared figures and/or tables, reviewed drafts of the paper.
Kyle Bittinger analyzed the data, wrote the paper, prepared figures and/or tables, reviewed drafts of the paper.
Sanjay Kumar wrote the paper.
Bonnie Vecchiarelli performed the experiments, contributed reagents/materials/analysis tools, wrote the paper.
Dipti Pitta conceived and designed the experiments, wrote the paper, reviewed drafts of the paper.
Animal Ethics
The following information was supplied relating to ethical approvals (i.e., approving body and any reference numbers):
Dairy cows that were donors of rumen fluid were maintained at Marshak farm and were maintained according to the ethics committee and IACUC standards for the University of Pennsylvania (approval #804302).
Data Availability
The following information was supplied regarding data availability:
The raw sequence data and corresponding mapping files are available in figshare.com (http://figshare.com/articles/comparison˙of˙454˙and˙PGM˙platforms/1536771).
References
- Baker (2010).Baker M. Next-generation sequencing: adjusting to data overload. Nature Methods. 2010;7:495–499. doi: 10.1038/nmeth0710-495. [DOI] [Google Scholar]
- Bates et al. (2013).Bates D, Maechler M, Bolker B, Walker S. lme4: linear mixed-effects models using Eigen and S4. (R package version 1) 2013 Available at https://cran.r-project.org/web/packages/lme4/index.html .
- Brulc et al. (2009).Brulc JM, Antonopoulos DA, Miller MEB, Wilson MK, Yannarell AC, Dinsdale EA, Edwards RE, Frank ED, Emerson JB, Wacklin P. Gene-centric metagenomics of the fiber-adherent bovine rumen microbiome reveals forage specific glycoside hydrolases. Proceedings of the National Academy of Sciences of the United States of America. 2009;106:1948–1953. doi: 10.1073/pnas.0806191105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Caporaso et al. (2010b).Caporaso JG, Bittinger K, Bushman FD, DeSantis TZ, Andersen GL, Knight R. PyNAST: a flexible tool for aligning sequences to a template alignment. Bioinformatics. 2010b;26:266–267. doi: 10.1093/bioinformatics/btp636. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Caporaso et al. (2010a).Caporaso JG, Kuczynski J, Stombaugh J, Bittinger K, Bushman FD, Costello EK, Fierer N, Pena AG, Goodrich JK, Gordon JI. QIIME allows analysis of high-throughput community sequencing data. Nature Methods. 2010a;7:335–336. doi: 10.1038/nmeth.f.303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Caporaso et al. (2011).Caporaso JG, Lauber CL, Walters WA, Berg-Lyons D, Lozupone CA, Turnbaugh PJ, Fierer N, Knight R. Global patterns of 16S rRNA diversity at a depth of millions of sequences per sample. Proceedings of the National Academy of Sciences of the United States of America. 2011;108:4516–4522. doi: 10.1073/pnas.1000080107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- De la Fuente Oliver et al. (2014).De la Fuente Oliver G, Belanche A, Girwood SE, Pinloche E, Wilkinson T, Newbold CJ. Pros and cons of ion-torrent next generation sequencing versus terminal restriction fragment length polymorphism T-RFLP for studying the rumen bacterial community. PLoS ONE. 2014;9(7) doi: 10.1371/journal.pone.0101435. e101435. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dollive et al. (2012).Dollive S, Peterfreund GL, Sherrill-Mix S, Bittinger K, Sinha R, Hoffmann C, Bachman MA. A tool kit for quantifying eukaryotic rRNA gene sequences from human microbiome samples. Genome Biology. 2012;13:R60. doi: 10.1186/gb-2012-13-7-r60. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Edgar (2010).Edgar RC. Search and clustering orders of magnitude faster than BLAST. Bioinformatics. 2010;26:2460–2461. doi: 10.1093/bioinformatics/btq461. [DOI] [PubMed] [Google Scholar]
- Edwards et al. (2004).Edwards JE, McEwan NR, Travis AJ, Wallace RJ. 16S rDNA library-based analysis of ruminal bacterial diversity. Antonie Van Leeuwenhoek. 2004;86:263–281. doi: 10.1023/B:ANTO.0000047942.69033.24. [DOI] [PubMed] [Google Scholar]
- Firkins (2010).Firkins JL. Reconsidering rumen microbial consortia to enhance feed efficiency and reduce environmental impact of ruminant livestock production systems. Revista Brasileira de Zootecnia. 2010;39:445–457. doi: 10.1590/S1516-35982010001300049. [DOI] [Google Scholar]
- Fordyce et al. (2015).Fordyce SL, Mogensen HS, Børsting C, Lagacé RE, Chang C-W, Rajagopalan N, Morling N. Second-generation sequencing of forensic STRs using the Ion Torrent™ HID STR 10-plex and the Ion PGM™. Forensic Science International: Genetics. 2015;14:132–140. doi: 10.1016/j.fsigen.2014.09.020. [DOI] [PubMed] [Google Scholar]
- Gower (1975).Gower JC. Generalized procrustes analysis. Psychometrika. 1975;40:33–51. doi: 10.1007/BF02291478. [DOI] [Google Scholar]
- Hess et al. (2011).Hess M, Sczyrba A, Egan R, Kim T-W, Chokhawala H, Schroth G, Luo S, Clark DS, Chen F, Zhang T. Metagenomic discovery of biomass-degrading genes and genomes from cow rumen. Science. 2011;331:463–467. doi: 10.1126/science.1200387. [DOI] [PubMed] [Google Scholar]
- Hobson & Stewart (1997).Hobson PN, Stewart CS. The rumen microbial ecosystem. 2nd edition. Berlin, Heidelberg: Springer Science & Business Media; 1997. [Google Scholar]
- Jami, White & Mizrahi (2014).Jami E, White BA, Mizrahi I. Potential role of the bovine rumen microbiome in modulating milk composition and feed efficiency. PLoS ONE. 2014;9(1):e1599. doi: 10.1371/journal.pone.0085423. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kumar et al. (2015).Kumar S, Indugu N, Vecchiarelli B, Pitta DW. Associative patterns among anaerobic fungi, methanogenic archaea, and bacterial communities in response to changes in diet and age in the rumen of dairy cows. Frontiers in Microbiology. 2015;6:781. doi: 10.3389/fmicb.2015.00781. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lam et al. (2012).Lam HY, Clark MJ, Chen R, Chen R, Natsoulis G, O’Huallachain M, Dewey FE, Habegger L, Ashley EA, Gerstein MB. Performance comparison of whole-genome sequencing platforms. Nature Biotechnology. 2012;30:78–82. doi: 10.1038/nbt.2065. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lima et al. (2015).Lima FS, Oikonomou G, Lima SF, Bicalho ML, Ganda EK, Oliveira Filho JC, Lorenzo G, Trojacanec P, Bicalho RC. Prepartum and postpartum rumen fluid microbiomes: characterization and correlation with production traits in dairy cows. Applied and Environmental Microbiology. 2015;81(4):1327–1337. doi: 10.1128/AEM.03138-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu et al. (2012).Liu L, Li Y, Li S, Hu N, He Y, Pong R, Lin D, Lu L, Law M. Comparison of next-generation sequencing systems. Journal of Biomedicine and Biotechnology. 2012;2012 doi: 10.1155/2012/251364. Article 251364. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Loman et al. (2012).Loman NJ, Misra RV, Dallman TJ, Constantinidou C, Gharbia SE, Wain J, Pallen MJ. Performance comparison of benchtop high-throughput sequencing platforms. Nature Biotechnology. 2012;30:434–439. doi: 10.1038/nbt.2198. [DOI] [PubMed] [Google Scholar]
- Lozupone & Knight (2005).Lozupone C, Knight R. UniFrac: a new phylogenetic method for comparing microbial communities. Applied and Environmental Microbiology. 2005;71:8228–8235. doi: 10.1128/AEM.71.12.8228-8235.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McDonald et al. (2012).McDonald D, Price MN, Goodrich J, Nawrocki EP, DeSantis TZ, Probst A, Andersen GL, Knight R, Hugenholtz P. An improved Greengenes taxonomy with explicit ranks for ecological and evolutionary analyses of bacteria and archaea. The ISME Journal. 2012;6(3):610–618. doi: 10.1038/ismej.2011.139. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McKenna et al. (2008).McKenna P, Hoffmann C, Minkah N, Aye PP, Lackner A, Liu Z, Lozupone CA, Hamady M, Knight R, Bushman FD. The macaque gut microbiome in health, lentiviral infection, and chronic enterocolitis. PLoS Pathog. 2008;4:e1599. doi: 10.1371/journal.ppat.0040020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Patel et al. (2014).Patel DD, Patel AK, Parmar NR, Shah TM, Patel JB, Pandya PR, Joshi CG. Microbial and Carbohydrate Active Enzyme profile of buffalo rumen metagenome and their alteration in response to variation in the diet. Gene. 2014;545:88–94. doi: 10.1016/j.gene.2014.05.003. [DOI] [PubMed] [Google Scholar]
- Peng et al. (2015).Peng S, Yin J, Liu X, Jia B, Chang Z, Lu H, Jiang N, Chen Q. First insights into the microbial diversity in the omasum and reticulum of bovine using Illumina sequencing. Journal of Applied Genetics. 2015;56(3):393–401. doi: 10.1007/s13353-014-0258-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pitta et al. (2015).Pitta D, Indugu N, Kumar S, Vecchiarelli B, Sinha R, Baker L, Bhukya B, Ferguson J. Metagenomic assessment of the functional potential of the rumen microbiome in Holstein dairy cows. Anaerobe. 2015;38:50–60. doi: 10.1016/j.anaerobe.2015.12.003. [DOI] [PubMed] [Google Scholar]
- Pitta et al. (2014).Pitta D, Kumar S, Vecchiarelli B, Shirley D, Bittinger K, Baker L, Ferguson J, Thomsen N. Temporal dynamics in the ruminal microbiome of dairy cows during the transition period. Journal of Animal Science. 2014;92:4014–4022. doi: 10.2527/jas.2014-7621. [DOI] [PubMed] [Google Scholar]
- Pitta et al. (2010).Pitta DW, Pinchak WE, Dowd SE, Osterstock J, Gontcharova V, Youn E, Dorton K, Yoon I, Min BR, Fulford J. Rumen bacterial diversity dynamics associated with changing from bermudagrass hay to grazed winter wheat diets. Microbial Ecology. 2010;59:511–522. doi: 10.1007/s00248-009-9609-6. [DOI] [PubMed] [Google Scholar]
- Quail et al. (2012).Quail MA, Smith M, Coupland P, Otto TD, Harris SR, Connor TR, Bertoni A, Swerdlow HP, Gu Y. A tale of three next generation sequencing platforms: comparison of Ion Torrent, Pacific Biosciences and Illumina MiSeq sequencers. BMC Genomics. 2012;13:341. doi: 10.1186/1471-2164-13-341. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Salipante et al. (2014).Salipante SJ, Kawashima T, Rosenthal C, Hoogestraat DR, Cummings LA, Sengupta DJ, Harkins TT, Cookson BT, Hoffman NG. Performance comparison of Illumina and ion torrent next-generation sequencing platforms for 16S rRNA-based bacterial community profiling. Applied and Environmental Microbiology. 2014;80:7583–7591. doi: 10.1128/AEM.02206-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Scott & Ely (2015).Scott D, Ely B. Comparison of genome sequencing technology and assembly methods for the analysis of a GC-Rich bacterial genome. Current Microbiology. 2015;70:338–344. doi: 10.1007/s00284-014-0721-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Singh et al. (2014).Singh K, Reddy B, Patel A, Panchasara H, Parmar N, Patel A, Shah T, Bhatt V, Joshi C. Metagenomic analysis of buffalo rumen microbiome: effect of roughage diet on Dormancy and Sporulation genes. Meta Gene. 2014;2:252–268. doi: 10.1016/j.mgene.2014.01.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stahl et al. (1988).Stahl DA, Flesher B, Mansfield HR, Montgomery L. Use of phylogenetically based hybridization probes for studies of ruminal microbial ecology. Applied and Environmental Microbiology. 1988;54:1079–1084. doi: 10.1128/aem.54.5.1079-1084.1988. [DOI] [PMC free article] [PubMed] [Google Scholar]
- R Core Team (2013).R Core Team . R, a language and environment for statistical computing. Version 2.15. 0. Vienna: R Foundation for Statistical Computing; 2013. [Google Scholar]
- Wang et al. (2007).Wang Q, Garrity GM, Tiedje JM, Cole JR. Naive Bayesian classifier for rapid assignment of rRNA sequences into the new bacterial taxonomy. Applied and Environmental Microbiology. 2007;73:5261–5267. doi: 10.1128/AEM.00062-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang et al. (2012).Wang X, Li X, Zhao C, Hu P, Chen H, Liu Z, Liu G, Wang Z. Correlation between composition of the bacterial community and concentration of volatile fatty acids in the rumen during the transition period and ketosis in dairy cows. Applied and Environmental Microbiology. 2012;78:2386–2392. doi: 10.1128/AEM.07545-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Weimer (2015).Weimer PJ. Redundancy, resilience, and host specificity of the ruminal microbiota: implications for engineering improved ruminal fermentations. Frontiers in Microbiology. 2015;6:296. doi: 10.3389/fmicb.2015.00296. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The following information was supplied regarding data availability:
The raw sequence data and corresponding mapping files are available in figshare.com (http://figshare.com/articles/comparison˙of˙454˙and˙PGM˙platforms/1536771).