Sample statistics determined following high-throughput shotgun sequencing on the Illumina HiSeq platform, sequence processing, and taxonomic and functional assignment. The total number of sequences is shown before and after host sequence removal, and again following quality checking through MG-RAST (v4.0.3). The number of sequences assigned to the domain Bacteria through RefSeq is also shown. High-quality sequences that did not receive a taxonomic identity are listed as “Other Seqs”. Alpha diversity was determined for each sample using the Shannon and Simpson metrics as implemented in QIIME (v1.9.1) at the species level of taxonomic resolution. The number of sequences assigned to a functional gene through KEGG Orthology (KO) is also listed.
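
For readers unfamiliar with the two alpha diversity metrics, the following is a minimal illustrative sketch of how Shannon and Simpson indices can be computed from a vector of species-level counts for one sample. It is not the QIIME implementation. The function names and the example counts are hypothetical, and the sketch assumes a base-2 logarithm for Shannon and the 1 − Σpᵢ² form for Simpson; the conventions actually applied by QIIME (v1.9.1) should be confirmed against its documentation.

```python
import math


def shannon(counts, base=2.0):
    """Shannon index H = -sum(p_i * log(p_i)) over taxa with nonzero counts.

    The log base is an assumption of this sketch (base 2 here).
    """
    total = sum(counts)
    proportions = [c / total for c in counts if c > 0]
    return -sum(p * math.log(p, base) for p in proportions)


def simpson(counts):
    """Simpson index in the form 1 - sum(p_i^2) (assumed convention)."""
    total = sum(counts)
    return 1.0 - sum((c / total) ** 2 for c in counts)


# Hypothetical species-level counts for a single sample (not data from this study).
example_counts = [10, 10, 10, 10]
print("Shannon:", shannon(example_counts))
print("Simpson:", simpson(example_counts))
```

Both functions take raw counts rather than proportions, so they can be applied directly to one column of a species-level abundance table.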