Abstract
Over the past decade, it has become exceedingly clear that the microbiome is a critical factor in human health and disease, and thus should be investigated to develop innovative treatment strategies. The field of metagenomics has come a long way in leveraging the advances of next generation sequencing technologies resulting in the capability to identify and quantify all microorganisms present in human specimens. However, the field of metagenomics if still in its infancy, specifically in regards to the limitations in computational analysis, statistical assessments, standardization, and validation due to vast variability in the cohorts themselves, experimental design, and bioinformatic workflows. This review summarizes the methods, technologies, computational tools and model systems for characterizing and studying the microbiome. We also discuss important considerations investigators must make when interrogating the involvement of the microbiome in health and disease in order to establish robust results and mechanistic insights before moving into therapeutic design and intervention.
Keywords: microbiome, data analysis, visualization, metagenomics
Introduction
What is the microbiome and why is it important?
As modern microbiology and the development of next generation sequencing technologies have matured, research has increasingly focused on the complex microbial communities that interact with the host to influence disease processes [1]. Recent studies have focused on the composition of the communities, or the “microbiota”, which is a group of microscopic organisms in any given region, whether it be a specific body site, habitat, or ecosystem [2]. Other investigators have taken this a step further and expanded their exploration to the “microbiome” which is the collective genomes and gene products of the microbiota residing within a host or environment [3]. Classically, however, these terms have been used interchangeably. These investigations have facilitated the creation of the field of metagenomics, which is the examination of all genetic material recovered directly from environmental or living samples [4]. Metagenomic based experimentation has enabled the ability to bypass in vitro studies or cultivation in order to characterize the composition and functionality of entire microbial communities [5]. The field of metagenomics and microbiome analyses have received growing scientific attention due to large research consortiums such as the Human Microbiome Project (HMP) [3], the European MetaHIT [6], and the Integrative Human Microbiome Project (iHMP) [7].
Classic infectious disease and pathology have historically focused on one pathogen equating to one disease [8]. However, we now understand that imbalances in the microbiota are also associated with many different disease states. Often, reduction in microbial diversity and outgrowth of specific species can induce negative effects like inflammation or infection [9]. Reports have shown evidence for the potential involvement of the microbiome in almost all walks of health-related complications to include obesity [10], diabetes [11], cardiovascular disease [12], oncogenesis [13], cancer therapy response [14], Alzheimer’s [15], and other neurodegenerative conditions [16].
The focus of this review is to describe what “tools” are at our disposal if one wants to begin to characterize the impact of the microbiome on evolution of a disease, or the effect of a disease or intervention on the microbiome. One must consider, however, that we are in the infancy of metagenomics. Microbiome studies produce vast amounts of data which necessitate sophisticated computational tools and the technologies utilized are rapidly evolving. Additionally, many of the mathematical tools available provide assessments of association, but not causation. Thus, investigators need to exercise caution that microbiome characterization, data analyses, and modeling is only a small piece of the discovery process and should be used to complement classic in vitro techniques and in vivo model experimentation in order to establish cause and effect.
Sequencing Methods, Technological Advances and Bioinformatic Tools for Studying the Microbiome
There are a number of different technologies available to study the microbiome. For this overview, we will first discuss the fundamental methods, and then the primary technologies available for use with each method (Figure 1).
Figure 1.

Technologies for studying the microbiome
Marker Gene Analysis
Targeted sequencing methods are the most common methods used and encompass 16S ribosomal RNA gene sequencing for bacteria, and internal transcribed spacer (ITS) region sequencing for fungi, among other, less common target genes. While highly conserved, these genes have diverged over time and provide a unique barcode that can be assigned to specific taxonomies and can also be counted to identify the frequency of each member within the microbial community. One challenge that arises in the analyses of these marker genes is defining and identifying a unique sequence. Both the 16S gene and ITS genes contain hypervariable regions that can change over a short period of time and can even differ within a single cell. Additionally, individual taxa can share identical gene sequences (e.g. Escherichia and Shigella) complicating the definition of a unique sequence and differentiation of individual taxa [17]. Moreover, each individual gene is amplified with PCR and each PCR product is sequenced a single time, leading to multiple opportunities for errors that can impact the identification of a unique sequence [18]. This has led to the adoption of operational taxonomic units (OTUs), a method for binning sequences using a threshold of divergence (most often 97% or 99%) [19–21]. The choice of this threshold is often arbitrary as these cutoffs do not match biologically relevant cutoffs and can change based upon the choice of variable regions sequenced [22]. This OTU binning allows one to control for biological and technical variations and each OTU can be treated as a distinct category that can be analyzed with or without taxonomic information [23]. Typically, however, data is analyzed using taxonomic information. Taxonomy can be assigned via machine learning methods such as the RDP classifier [24], a naive Bayesian classifier, or mapping to popular reference databases, such as Greengenes [25] and SILVA [26]. Standard microbiome analysis packages, such as Mothur [27], QIIME [28], and DADA2 [29] provide an interface for taxonomic assignment.
Current 16S and ITS sequencing relies most frequently on the sequencing length allowed by the Illumina MiSeq (Illumina, San Diego, CA), frequently leveraging the 2×300 sequencing length to cover as many variable regions as possible or gain the highest accuracy possible. This sequencing method utilizes region specific primers, such as V1-V3 or V4, to sequence from both the forward primer and the reverse primer, generating a full amplicon for each taxa within the microbial community [30, 31]. There is a current push to move to full length 16S and ITS sequencing methods with either PacBio or Oxford Nanopore sequencing technologies, but the databases available for each marker gene lag behind the technological advances and limit the utilities of these methods [32].
Shotgun Metagenomics
Marker gene sequencing methodologies have revealed incredible insights into the role of the microbiome in health and disease, but only focus on a small portion of microbial genomes. Shotgun metagenomics approaches consist of methods that utilize untargeted sequencing methods to capture all microbial genomes present within a sample [33, 34]. The capture of the full repertoire of genetic information from a microbiome sample allows the study of bacteria, fungi, DNA viruses, and other microbes but is dependent on reference genomes and scientific knowledge [35–40]. This information can be used similarly to amplicon sequence-based methods to identify which taxa are present and the relative abundance of each [41, 42], or to identify gene coding sequences and mine the functional potential of the microbial community [43].
Thus far there is no consensus regarding the best assembly methods for shotgun metagenomic data. Metagenomic shotgun assemblies are either performed de novo, based on reference genomes, or a hybrid of both. Much like whole genome sequencing, short read data can be joined via overlapping reads and used to construct contigs (Overlap, Layout, Consensus assembly), however, the computational demands of this method are impractical [44]. Thus, many de novo assembly methods instead use the de Bruijn graph approach to assemble sequencing reads [45]. Commonly used software tools for de novo assembly include MetaVelvet [46], IDBAUD [47], metaSPAdes [48], and MEGAHIT [49]. In reference-guided metagenomic assembly, such as MetaCompass, sequencing reads are mapped against reference databases in order to reconstruct contigs [44]. As stated above, reference-based assemblers are restricted by the availability of reference genomes and quality of the database.
Often researchers perform read-based profiling of representative or discriminative genes (or markers) from unassembled shotgun metagenomics reads and compare those to reference databases in order to assign taxonomy or annotate genes. Taxonomic binning can utilize similar DNA compositions or nucleotide patterns like k-mer lengths, GC content, or gene homology [50]. Kraken, for example, utilizes unique k-mer distributions across sequences in order to assign taxonomy [51]. In contrast, MetaPhlAn2 uses unique clade-specific marker genes to differentiate microbial taxa and estimate their relative abundance [52].
Metatranscriptomics, Metabolomics, and Metaproteomics
Metatranscriptomic methods leverage similar analytical ideas to shotgun metagenomics but specifically capture the RNA transcribed from microbial cells, allowing assessments of the expression activities of these organisms [53]. Shotgun metagenomics and metatranscriptomic methods primarily rely upon Illumina sequencing methodologies, most often the HiSeq or NovaSeq families of instruments due to the high throughput and low cost per base. However, there has also been a movement toward PacBio and Oxford Nanopore sequencing technologies to leverage longer read lengths that aid in gene calling and genetic mapping to reference genomes. A standard workflow involves the isolation of the total RNA from the microbiome sample, RNA enrichment, fragmentation, cDNA synthesis, and preparation of transcriptome libraries for sequencing [54]. Typically, RNA sequence reads are mapped to different genomes and pathways (KEGG) [55] to identify the taxonomy of transcriptionally active organisms as well as the function of their expressed genes. Bioinformatics programs like SOAPdenovo [56], have been used for alignment and assembly of metatranscriptomic data from microbiome samples. Comparisons are made between different groups or variables to determine which pathways are up and down regulated or during different health and disease conditions [57].
Metabolomics analyses focus on profiling the metabolites microbiota produce and how these products interact with both microbiota and host metabolism [58, 59]. These methods often quantify small molecules such as antibiotics, antibiotic biproducts, and host and/or bacterial metabolism intermediates. Metabolomics often utilize mass spectrometry to identify known metabolites [59]. Metaproteomics also use mass spectrometry, but focus instead on identifying and quantifying the proteins present within a microbiome [60, 61]. Metaproteomics and metabolomics are both rapidly developing technologies for studying the microbiome.
Characterizing the Microbiome
Microbial Diversity Measurements
In general, microbiome differences are typically evaluated by comparing either (or both) alpha and beta diversity metrics. Alpha diversity metrics quantify within sample diversity and can be compared across groups [62]. For example, it is commonplace to compare the mean species diversity between samples from a diseased versus a control group. Regularly used alpha diversity metrics are species richness estimators such as observed OTUs and Chao1 index [63], as well as Shannon and Inverse Simpson indices, which measure both species richness and evenness [64]. Another less frequent way to measure diversity are phylogenetic richness estimators, such as Faith’s phylogenetic diversity [65]. Estimators of both richness and evenness, such as Shannon and Inverse Simpson, are more considered more robust as they are less sensitive to the number of sequences per sample [65]. The Shannon index is more influenced by rare OTUs, while Inverse Simpson is more influenced by the dominant/abundant OTUs [66].
Beta diversity compares between sample diversity, and is often calculated by comparing feature dissimilarity, resulting in a distance matrix between all pairs of samples [67]. One conventional calculation for beta diversity is the Bray–Curtis dissimilarity, which is a quantitative measure that accounts for taxa abundance when comparing two communities [68]. In addition to taxa abundances, Weighted Unifrac distance also takes into consideration phylogenetic relatedness when measuring the differences in two communities [69]. Unweighted Unifrac distance, on the other hand, is a qualitative measure which only considers the presence and absence of taxa. Both Unifrac measurements require a phylogenetic tree as the scores are derived by calculating total branch distances between shared and unshared bacteria on a phylogenetic tree [69]. The Jaccard index, or similarity coefficient, is another qualitative measurement that does not consider relative abundances, but feature presence/absence [65]. By evaluating taxa presence, absence, and abundances one can investigate the extent of the difference between the compositions of communities between samples. If possible, the addition of relatedness allows for the potential to assess evolutionary divergence. It is important to consider that although taxa can be distinct, related organisms are likely to perform similar functions. Software for calculating alpha and beta diversity are included in commonly used bioinformatics pipelines such as QIIME [28], Mothur [27], as well as VEGAN and phloseq [70] packages in R.
Functional analysis
In addition to the taxonomic composition of a microbiome, researchers can identify differences in metabolic function between different microbial populations. Using 16S sequencing data, one can predict a functional profile by using programs such as PICRUSt [71] or Tax4Fun [72]. Using the relative abundance of taxa within the community, these programs can predict the gene content potential functionality based on the reference genome for each taxa present. However, these methods are a rough approximation, as they do not take into consideration actual protein expression and are highly dependent on the reference genomes and their annotations.
Shotgun and metatranscriptome approaches can also allow for functional analysis. Once a metagenome is assembled, gene predictions are made using tools such as MetaGeneMark and Glimmer-MG. Following coding gene identification, functional annotation is carried out using computationally demanding protein sequence homology-based searches (typically UBLAST and USEARCH based [73]) against databases of orthologues (e.g. EggNOG [74] or COG [75]), enzymes (e.g. KEGG [55]), or protein domains and families (e.g. Pfam [76], TIGRFAMs [77], or InterPro [78]). Pathway enrichment analysis, clustering, and scoring can be performed using programs such as pathfinder [79], or a metabolic network can be derived as well using programs such as KEGGscape [80].
Due to the vast number of tools and computational demands, a number of publicly available automated pipelines have been created which include quality-filtering, gene calling, functional annotation, and basic statistics and visualization all on one platform, such as MG-RAST [81] and MEGAN-CE [82]. Additionally, programs like HUMAnN2 can be used to determine the presence/absence and abundance of microbial pathways and gene families in a community using short read data directly [83]. The results are generally represented in the form of a table, heatmap, bar plot, or butterfly plot containing functional pathway abundance levels relative to a reference group in order to visualize variation between different communities.
As discussed above, more precise examinations of microbiota functionality are possible using metabolomics, metatranscriptomics, and metaproteomics. Direct measurements of fecal (or other sample type) metabolites can be measured via liquid chromatography and high resolution/tandem mass spectrometry (MS/MS) [84]. Many metabolites, however, are highly volatile, thus collection and proper storage of samples are critical for reproducibility and concordance. Although the metabolite content of the sample can be measured, this method cannot differentiate human vs. microbial derived products, nor can it distinguish which microorganism the metabolite is derived from [85]. Metatranscriptomic sequencing offers characterization of which RNAs are transcribed, but does not reflect a perfect representation of functionality as protein expression also depends on translation and post-translational modifications [53]. Metaproteomics also employs a chemistry-based approach to characterize proteins being produced by the microbiota, with the advantage being these measurements can be used to assess the “expressed” proteins of microbial community members [85].
Data Visualization and Statistical Methods for Differential Group Comparisons
In most microbiome studies, the approach to analysis is to look for differential microbial diversity, taxa abundance, or functional components (e.g., genes or pathways) between the comparison groups (i.e., treatment versus group control). Statistical methods for microbiome analysis are an ever-evolving field due to the inherent complexity of microbiome datasets. The employment of common statistical methods are often difficult because microbiome data sets are high-dimensional as they can potentially have thousands of taxonomic units, zero-inflated due to the majority of taxa being rare or differences in sequencing depth, and most data output are compositional [86]. Moreover, the main limitations of microbiome studies in general are a lack of statistical power and the inability to control for confounding factors that are intrinsic to clinical cohort studies.
Most investigators start by visualizing the data in order to discover potential associations or markers within their dataset which can then be further tested via more rigorous statistical methodologies. Due to the complexity of microbiome data, visualization methods often use dimension-reduction-based ordination methods, such as principal coordinate analysis (PCoA) or principal component analysis (PCA) [87]. These methods reduce the distance matrices into two- or three-dimensional visual representations of sample distances. These samples then can be easily labelled (e.g, using color, shape, etc.) by different categories in order to overlay important clinical metadata. This allows the investigator to visualize potential clustering by clinical variables in an unsupervised way.
One can then apply more sophisticated statistical evaluation to determine if the clustering is biologically significant. When looking at differences in the community composition as a whole, ANOSIM has been used to assess significant clustering differences by comparing within- and between-group similarity using distance metrics. ANOSIM confirms or rejects the null hypothesis that the average similarity between samples of one group is the same as the average similarity between samples of a different group [88]. PERMANOVA is a non-parametric permutation test which performs a multivariate analysis of variance based on distance matrices to test the overall difference in microbiome community structure between different clusters or groups [89, 90]. ANCOM is a similar method which detects differentially abundant taxa between microbial populations based on log-ratios [91].
Heat maps are another popular method of visualizing microbiome data to detect potential clusters or differences between groups [87]. The most generic heat maps compare taxa abundance between samples or the presence/absence of specific gene families. Often, hierarchical clustering can be combined with the heatmaps in order to further group samples with similar bacterial profiles into branches of a dendogram. Further, clinical metadata can also be overlaid onto the heatmap in order to discover potential clinical cofactors associated with specific bacterial profiles.
Conventional statistical approaches, such as the t-test, Wilcoxon rank-sum test, ANOVA, or the Kruskal-Wallis test are often used to compare simpler features between groups such as to determine differences in alpha diversity or abundances of single known specific taxa associated with disease phenotypes [88, 92]. However when comparing lower level taxonomic differences, such as genus, species, or OTUs, these traditional tests are prone to false positives due to the large number of variables if not corrected for false discovery [92]. One option is to use linear discriminant analysis of effect sizes (LEfSe), which is a widely used method developed specifically for microbiome data [93]. This method first calculates the Kruskal-Wallis rank sum p-value to detect significant differentially abundant features between groups, then performs linear discriminant analysis to determine the effect size of those specific attributes. Ideally, all associations should be experimentally tested directly either in vitro or in vivo to confirm the detected associations.
Graphical Networks and Machine Learning
Network analysis a popular approach for visualizing microbiome community interactions. Correlation networks are not only used to illustrate perturbations to the community structure in different scenarios, but can also be used to see how environmental factors, metabolites, clinical traits, or other microbes (i.e. interkingdom interactions) interact within the microbial community. Graphical networks are often used to compare interactions in different states (e.g. diseased vs. healthy state) or to show which organisms co-occur or mutually exclude each other.
Generally, networks can be inferred using pairwise relationships where networks are based on similarity or correlation coefficients between pairwise variables. Commonly used programs to infer correlation networks for microbiome data are SparCC (Sparse Correlations for Compositional data) [94], CCLasso (Correlation inference for Compositional data through Lasso) [95], and SPEIC-EASI (Sparse and Compositionally Robust Inference of Microbial Ecological Networks) [96]. At the current, SPEIC-EASI is the most widely used model as it appears to be the most robust estimating interactions using algorithms for both sparse neighborhood and inverse covariance selection after first CLR transforming the count data. Regression-related methods can also be used to predict the abundance of one species from the abundance of combinations of other sets species, such as sparse regression [97], Dirichlet-multinomial regression [98], and generalized boosted linear models (GBLMs) [97]. Another tactic is contingent on the presence–absence patterns of taxa in association with different phenotypes/outcomes, also known as association rule mining [99].
Machine learning algorithms are emerging as a widely accepted tool in the analyses of microbiome datasets, as they can be used on many different types of high-dimensional data and are especially useful for feature selection in multi-feature datasets [65]. Random Forest [100], for example, is an ensemble machine learning technique for classification and regression that is often applied to identify important taxa and clinical covariates that can distinguish different phenotypes/ categories or predict specific outcomes. Although less accurate and stable, methods such as CART analyses (Classification and Regression Trees) can more interpretable and thus clinically actionable as decision trees allow the investigator to have insights into which variables are important at what cut off, and in what order [101]. One must consider however that these models must always be cross-validated either via sample and replacement, or by enrolling separate cohorts for training, testing, and validation.
Model systems for studying the microbiome
Animal models
As previously discussed, the aforementioned analyses only derive association or correlation of microbiome features with disease phenotypes, but to not determine causation. Thus, interactions between the microbiome and the host must be tested and extrapolated from model systems in order to gain mechanistic insights. Animal models are extensively utilized to study the human microbiome as animals allow for ease of experimental manipulation and control of specific variables, scalability, and reproducibility that is mostly unattainable in human studies. Gnotobiotic, or germ-free mice, are frequently used to test either the effects of the absence of the microbiome or specific communities or strains of interest can be administered to the animal and evaluated for effects on the host [102]. For example, “humanized” murine models are produced when a human fecal source is used to colonize the mouse [103]. Other less expensive animal models include zebrafish, fruit flies, and Caenorhabditis elegans which can also be used to test whether the microbiome composition and function correlate with such host variables (i.e. age, genotype, phenotype, diet etc.), or experimental exposures [104]. Of course, important caveats for animal models include evolutionary distance which results in relevant differences in anatomy, physiology, and the microbiome composition itself in relation to the human host [105].
In vitro and ex vivo models
Rather than using model organisms, an investigator may choose to study microbiome interactions in vitro or ex vivo [106]. The main experimental systems currently used to study microbial interaction with the host include co-culturing microorganisms with primary host tissues or cell lines; microfluidic culture with engineered tissue; and intestinal enteroids or organoids [107]. Additionally, in order to understand specific microbial changes, one may choose to use an in vitro bioreactor device [108], such as a simulator of the human intestinal microbial ecosystem (SHIME) [109]. These simulator devices, are continuous culture systems that mimic the human digestive tract microbiome and allow the control of nutrient availability, pH, and other environmental conditions [110]. More advanced systems even control for compartmentalization. Although the use of in vitro primary cells and established cell lines allows for simplicity and reproducibility, these models are limited in their translatability in that they are a single cell type and only a few organisms can we tested at once [106]. Ex vivo systems allow for the ability to model cross-talk between cell types not generally found in in vitro systems. Moreover, systems like InTESTine™ system by TNO [111] and enteroids/organoids [112] allow for modeling of bacterial-host interactions in live compartmentalized and physiologically relevant tissue. However many ex vivo systems have not been validated, and remain costly.
Important Future Considerations for Microbiome Investigation
The current tools available to study the microbiome are maturing rapidly and the future is bright. As more researchers begin to investigate the role of the microbiome in human health and disease, more focus is brought to technological advancements as well as developing robust statistical methods and analytical tools.
One current limitation gaining significant attention is how to estimate the sample size necessary to address a specific research question. Current methods exist to estimate the sample size needed for PERMANOVA based analyses [89], as well as for comparisons leveraging the Dirichlet-Multinomial distribution [98], but methods to identify the number of samples needed to test hypotheses with many of the methods listed above remain undeveloped. This is partially due to the lack of established metrics for determining effect size in terms of changes in the microbiome. We can consider an effect size in two ways for microbiome studies: how much change must be observed to be biologically relevant, and how much difference in the microbial consortium must exist to differentiate two or more groups? Work is ongoing to establish the source of both technical and biological variation in the microbiome, and the more studies we have assessing topics such as the diet’s impact on the gut, the more insight we have into the consistency and scale of differences observed [113, 114]. Both of these are needed to inform effect size estimates used within sample size calculations.
In addition to the need for better sample size estimation, there is a current need for advanced statistical modeling tools, particularly ones to analyze data from longitudinal studies. One of the hallmarks of the human microbiome is its dynamic nature that can be transformed and altered over time [115, 116], but this may potentially be in direct contrast to the observation of a core microbiota that resists perturbation [117–119]. Better longitudinal methods are required to truly interrogate this variability and elucidate the complex relationships between taxa, the host, and the environment over time. Methods have recently been released to visualize longitudinal data and generate estimates adapted from microbial ecology [120], and to generate dynamic Bayesian networks [121]. Additional advances in longitudinal modeling are forthcoming and will continue to expand the repertoire of analytical tools available to scientists.
While a general consensus in bioinformatics and biostatistics methods has been reached for 16S targeted sequencing-based methods, there has been a recent batch of publications and methods attempting to control and adjust for the technological variability inherent within amplicon based methods [22, 29]. These methods focus on statistical modeling of the errors of Illumina based sequencing to correct individual reads, purporting to generate species or strain specific groupings, similar to OTUs. Early assessments of these methods using known mock communities show the true complexity of microbial communities is overestimated by these methods, but the potential for error correction is promising [29]. While the utility and impact of such methods is to be determined, it is important to consider such potential advancements as we continue to explore the impact of the microbiome in health and disease.
Contrary to 16S targeted sequencing methods, there is still a need for universal and comprehensive analytical pipelines for shotgun metagenomics and metatranscriptomics. In particular, while there are available tools that leverage reference genomes for alignment [122], utilize custom databases of marker genes [52], perform metagenomic assemblies [48], and utilize unique k-mer patterns to identify taxa [51], a consensus has not been reached within the microbiome community as to which method(s) is superior and which pipeline(s) best implements these methods. Until this consensus is reached, the barrier for interrogating the metagenome may be too high for many researchers to adopt the tools in their own studies.
As we continue to remove the technological barriers, and reduce the cost of generating metagenomic, metatranscriptomic, metabolomic, and metaproteomic data, there will continue to be a push into the next phase of analytical approaches dealing with integrated ‘omics data. Current methods for integrating these data types rely upon linking them through a common database, such as UniProt [123] for metagenomics data and metaproteomic data, or ordinating them in multidimensional space [124, 125]. This integrated omics approach will also be integral in further attempts to mine the microbiome for antimicrobial resistance determinants [126], virulence genes that impact the outcomes of infection and facilitate bacterial recalcitrance to treatment [127], and assessments of the roles of accessory genes and genes of unknown function within the microbiome [128, 129].
Conclusion
In summary, we stand on the precipice of a rapidly developing research area that holds incredible promise in understanding and defining microbial interactions at the human interface. However, under a “bedside, to bench, and back to bedside” theory of investigation where studies often start with observational retrospective or prospective cohort studies, researchers must take special care in their experimental design to systematically include all steps in the examination process in order to make proper advancements in the field (Figure 2). This includes not only microbiome characterization and associations with clinical metadata using patient populations, but proper statistical analysis and modeling methods which take into consideration the eccentricities of metagenomics, validation of findings using other cohorts, mechanistic discovery in vitro, ex vivo, or in small animal models. Only then can we come full circle back to the patient using interventional strategies with microbiome-based therapeutics
Figure 2.

A “bedside, to bench, and back to bedside” approach to microbiome investigation
Future Unmet Needs and Implications for the Clinician.
Universal analytical pipelines for shotgun metagenomics and metatranscriptomics still remain elusive among the research community, thus researchers from multiple disciplines and institutions must work together to come to a consensus.
Comprehensive methods for multi-omic integration continue to be ambiguous, yet investigations combining metagenomics, metabolomics, transcriptomics, and host genomics are crucial to moving the field forward.
In order to bring microbiome research into the therapeutic realm, investigators need to move beyond clinical association studies by validating their models in other clinical cohorts and understanding the mechanisms of causation in vitro, ex vivo, and animal model systems.
Footnotes
Publisher's Disclaimer: This Author Accepted Manuscript is a PDF file of an unedited peer-reviewed manuscript that has been accepted for publication but has not been copyedited or corrected. The official version of record that is published in the journal is kept up to date and so may therefore differ from this version.
References
- 1.Song EJ, Lee ES, and Nam YD, Progress of analytical tools and techniques for human gut microbiome research. J Microbiol, 2018. 56(10): p. 693–705. [DOI] [PubMed] [Google Scholar]
- 2.Thursby E and Juge N, Introduction to the human gut microbiota. Biochem J, 2017. 474(11): p. 1823–1836. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Turnbaugh PJ, et al. , The human microbiome project. Nature, 2007. 449(7164): p. 804–10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Tringe SG and Rubin EM, Metagenomics: DNA sequencing of environmental samples. Nat Rev Genet, 2005. 6(11): p. 805–14. [DOI] [PubMed] [Google Scholar]
- 5.Riesenfeld CS, Schloss PD, and Handelsman J, Metagenomics: genomic analysis of microbial communities. Annu Rev Genet, 2004. 38: p. 525–52. [DOI] [PubMed] [Google Scholar]
- 6.Qin J, et al. , A human gut microbial gene catalogue established by metagenomic sequencing. Nature, 2010. 464(7285): p. 59–65. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Integrative HMPRNC, The Integrative Human Microbiome Project: dynamic analysis of microbiome-host omics profiles during periods of human health and disease. Cell Host Microbe, 2014. 16(3): p. 276–89. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Bantock GG, The Modern Doctrine of Bacteriology, or the Germ Theory of Disease. Br Med J, 1899. 1(1997): p. 846–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Belkaid Y and Hand TW, Role of the microbiota in immunity and inflammation. Cell, 2014. 157(1): p. 121–41. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Maruvada P, et al. , The Human Microbiome and Obesity: Moving beyond Associations. Cell Host Microbe, 2017. 22(5): p. 589–599. [DOI] [PubMed] [Google Scholar]
- 11.Jamshidi P, et al. , Is there any association between gut microbiota and type 1 diabetes? A systematic review. Gut Pathog, 2019. 11: p. 49. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Ahmadmehrabi S and Tang WHW, Gut microbiome and its role in cardiovascular diseases. Curr Opin Cardiol, 2017. 32(6): p. 761–766. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Scott AJ, et al. , International Cancer Microbiome Consortium consensus statement on the role of the human microbiome in carcinogenesis. Gut, 2019. 68(9): p. 1624–1632. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Gopalakrishnan V, et al. , The Influence of the Gut Microbiome on Cancer, Immunity, and Cancer Immunotherapy. Cancer Cell, 2018. 33(4): p. 570–580. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Shen L and Ji HF, Associations Between Gut Microbiota and Alzheimer’s Disease: Current Evidences and Future Therapeutic and Diagnostic Perspectives. J Alzheimers Dis, 2019. 68(1): p. 25–31. [DOI] [PubMed] [Google Scholar]
- 16.Rieder R, et al. , Microbes and mental health: A review. Brain Behav Immun, 2017. 66: p. 9–17. [DOI] [PubMed] [Google Scholar]
- 17.Vetrovsky T and Baldrian P, The variability of the 16S rRNA gene in bacterial genomes and its consequences for bacterial community analyses. PLoS One, 2013. 8(2): p. e57923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Xue Z, Kable ME, and Marco ML, Impact of DNA Sequencing and Analysis Methods on 16S rRNA Gene Bacterial Community Analysis of Dairy Products. mSphere, 2018. 3(5). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Stackebrandt E and GOEBEL BM, Taxonomic note: a place for DNA-DNA reassociation and 16S rRNA sequence analysis in the present species definition in bacteriology. International journal of systematic and evolutionary microbiology, 1994. 44(4): p. 846–849. [Google Scholar]
- 20.Yarza P, et al. , Uniting the classification of cultured and uncultured bacteria and archaea using 16S rRNA gene sequences. Nat Rev Microbiol, 2014. 12(9): p. 635–45. [DOI] [PubMed] [Google Scholar]
- 21.Johnson JS, et al. , Evaluation of 16S rRNA gene sequencing for species and strain-level microbiome analysis. Nat Commun, 2019. 10(1): p. 5029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Edgar RC, Updating the 97% identity threshold for 16S ribosomal RNA OTUs. Bioinformatics, 2018. 34(14): p. 2371–2375. [DOI] [PubMed] [Google Scholar]
- 23.Hamady M and Knight R, Microbial community profiling for human microbiome projects: Tools, techniques, and challenges. Genome Res, 2009. 19(7): p. 1141–52. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Wang Q, et al. , Naive Bayesian classifier for rapid assignment of rRNA sequences into the new bacterial taxonomy. Appl Environ Microbiol, 2007. 73(16): p. 5261–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.McDonald D, et al. , An improved Greengenes taxonomy with explicit ranks for ecological and evolutionary analyses of bacteria and archaea. ISME J, 2012. 6(3): p. 610–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Yilmaz P, et al. , The SILVA and “All-species Living Tree Project (LTP)” taxonomic frameworks. Nucleic Acids Res, 2014. 42(Database issue): p. D643–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Schloss PD, et al. , Introducing mothur: open-source, platform-independent, community-supported software for describing and comparing microbial communities. Appl Environ Microbiol, 2009. 75(23): p. 7537–41. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Caporaso JG, et al. , QIIME allows analysis of high-throughput community sequencing data. Nat Methods, 2010. 7(5): p. 335–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Callahan BJ, et al. , DADA2: High-resolution sample inference from Illumina amplicon data. Nat Methods, 2016. 13(7): p. 581–3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Walker JN, et al. , Insights into the Microbiome of Breast Implants and Periprosthetic Tissue in Breast Implant-Associated Anaplastic Large Cell Lymphoma. Sci Rep, 2019. 9(1): p. 10393. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Zhou W, et al. , Longitudinal multi-omics of host-microbe dynamics in prediabetes. Nature, 2019. 569(7758): p. 663–671. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Callahan BJ, et al. , High-throughput amplicon sequencing of the full-length 16S rRNA gene with single-nucleotide resolution. Nucleic Acids Res, 2019. 47(18): p. e103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Quince C, et al. , Shotgun metagenomics, from sampling to analysis. Nat Biotechnol, 2017. 35(9): p. 833–844. [DOI] [PubMed] [Google Scholar]
- 34.Sharpton TJ, An introduction to the analysis of shotgun metagenomic data. Front Plant Sci, 2014. 5: p. 209. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Huffnagle GB and Noverr MC, The emerging world of the fungal microbiome. Trends Microbiol, 2013. 21(7): p. 334–41. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Sam QH, Chang MW, and Chai LY, The Fungal Mycobiome and Its Interaction with Gut Bacteria in the Host. Int J Mol Sci, 2017. 18(2). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Nash AK, et al. , The gut mycobiome of the Human Microbiome Project healthy cohort. Microbiome, 2017. 5(1): p. 153. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Stern J, et al. , Virome and bacteriome: two sides of the same coin. Curr Opin Virol, 2019. 37: p. 37–43. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Mukhopadhya I, et al. , The gut virome: the ‘missing link’ between gut bacteria and host immunity? Therap Adv Gastroenterol, 2019. 12: p. 1756284819836620. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Moreno-Gallego JL, et al. , Virome Diversity Correlates with Intestinal Microbiome Diversity in Adult Monozygotic Twins. Cell Host Microbe, 2019. 25(2): p. 261–272.e5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Xia LC, et al. , Accurate genome relative abundance estimation based on shotgun metagenomic reads. PLoS One, 2011. 6(12): p. e27992. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Nayfach S and Pollard KS, Toward Accurate and Quantitative Comparative Metagenomics. Cell, 2016. 166(5): p. 1103–1116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Qin J, et al. , A human gut microbial gene catalogue established by metagenomic sequencing. Nature, 2010. 464(7285): p. 59–65. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Ayling M, Clark MD, and Leggett RM, New approaches for metagenome assembly with short reads. Brief Bioinform, 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Compeau PE, Pevzner PA, and Tesler G, How to apply de Bruijn graphs to genome assembly. Nat Biotechnol, 2011. 29(11): p. 987–91. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Namiki T, et al. , MetaVelvet: an extension of Velvet assembler to de novo metagenome assembly from short sequence reads. Nucleic Acids Res, 2012. 40(20): p. e155. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Peng Y, et al. , IDBA-UD: a de novo assembler for single-cell and metagenomic sequencing data with highly uneven depth. Bioinformatics, 2012. 28(11): p. 1420–8. [DOI] [PubMed] [Google Scholar]
- 48.Nurk S, et al. , metaSPAdes: a new versatile metagenomic assembler. Genome Res, 2017. 27(5): p. 824–834. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Li D, et al. , MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph. Bioinformatics, 2015. 31(10): p. 1674–6. [DOI] [PubMed] [Google Scholar]
- 50.Claesson MJ, Clooney AG, and O’Toole PW, A clinician’s guide to microbiome analysis. Nat Rev Gastroenterol Hepatol, 2017. 14(10): p. 585–595. [DOI] [PubMed] [Google Scholar]
- 51.Wood DE and Salzberg SL, Kraken: ultrafast metagenomic sequence classification using exact alignments. Genome Biol, 2014. 15(3): p. R46. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Truong DT, et al. , MetaPhlAn2 for enhanced metagenomic taxonomic profiling. Nat Methods, 2015. 12(10): p. 902–3. [DOI] [PubMed] [Google Scholar]
- 53.Bashiardes S, Zilberman-Schapira G, and Elinav E, Use of Metatranscriptomics in Microbiome Research. Bioinform Biol Insights, 2016. 10: p. 19–25. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Bikel S, et al. , Combining metagenomics, metatranscriptomics and viromics to explore novel microbial interactions: towards a systems-level understanding of human microbiome. Comput Struct Biotechnol J, 2015. 13: p. 390–401. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Kanehisa M and Goto S, KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res, 2000. 28(1): p. 27–30. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Xie Y, et al. , SOAPdenovo-Trans: de novo transcriptome assembly with short RNA-Seq reads. Bioinformatics, 2014. 30(12): p. 1660–6. [DOI] [PubMed] [Google Scholar]
- 57.Jorth P, et al. , Metatranscriptomics of the human oral microbiome during health and disease. MBio, 2014. 5(2): p. e01012–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Lamichhane S, et al. , Gut metabolome meets microbiome: A methodological perspective to understand the relationship between host and microbe. Methods, 2018. 149: p. 3–12. [DOI] [PubMed] [Google Scholar]
- 59.Zierer J, et al. , The fecal metabolome as a functional readout of the gut microbiome. Nat Genet, 2018. 50(6): p. 790–795. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Lai LA, et al. , Metaproteomics Study of the Gut Microbiome. Methods Mol Biol, 2019. 1871: p. 123–132. [DOI] [PubMed] [Google Scholar]
- 61.Blakeley-Ruiz JA, et al. , Metaproteomics reveals persistent and phylum-redundant metabolic functional stability in adult human gut microbiomes of Crohn’s remission patients despite temporal variations in microbial taxa, genomes, and proteomes. Microbiome, 2019. 7(1): p. 18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Kuczynski J, et al. , Experimental and analytical tools for studying the human microbiome. Nat Rev Genet, 2011. 13(1): p. 47–58. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Chao A, Estimating the population size for capture-recapture data with unequal catchability. Biometrics, 1987. 43(4): p. 783–91. [PubMed] [Google Scholar]
- 64.Kim BR, et al. , Deciphering Diversity Indices for a Better Understanding of Microbial Communities. J Microbiol Biotechnol, 2017. 27(12): p. 2089–2093. [DOI] [PubMed] [Google Scholar]
- 65.Knight R, et al. , Best practices for analysing microbiomes. Nat Rev Microbiol, 2018. 16(7): p. 410–422. [DOI] [PubMed] [Google Scholar]
- 66.Bent SJ and Forney LJ, The tragedy of the uncommon: understanding limitations in the analysis of microbial diversity. ISME J, 2008. 2(7): p. 689–95. [DOI] [PubMed] [Google Scholar]
- 67.Barwell LJ, Isaac NJ, and Kunin WE, Measuring beta-diversity with species abundance data. J Anim Ecol, 2015. 84(4): p. 1112–22. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Staley C, Kaiser T, and Khoruts A, Clinician Guide to Microbiome Testing. Dig Dis Sci, 2018. 63(12): p. 3167–3177. [DOI] [PubMed] [Google Scholar]
- 69.Lozupone C and Knight R, UniFrac: a new phylogenetic method for comparing microbial communities. Appl Environ Microbiol, 2005. 71(12): p. 8228–35. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.McMurdie PJ and Holmes S, phyloseq: an R package for reproducible interactive analysis and graphics of microbiome census data. PLoS One, 2013. 8(4): p. e61217. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Langille MG, et al. , Predictive functional profiling of microbial communities using 16S rRNA marker gene sequences. Nat Biotechnol, 2013. 31(9): p. 814–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Asshauer KP, et al. , Tax4Fun: predicting functional profiles from metagenomic 16S rRNA data. Bioinformatics, 2015. 31(17): p. 2882–4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Edgar RC, Search and clustering orders of magnitude faster than BLAST. Bioinformatics, 2010. 26(19): p. 2460–1. [DOI] [PubMed] [Google Scholar]
- 74.Huerta-Cepas J, et al. , eggNOG 4.5: a hierarchical orthology framework with improved functional annotations for eukaryotic, prokaryotic and viral sequences. Nucleic Acids Res, 2016. 44(D1): p. D286–93. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Tatusov RL, et al. , The COG database: an updated version includes eukaryotes. BMC Bioinformatics, 2003. 4: p. 41. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Finn RD, et al. , Pfam: the protein families database. Nucleic Acids Res, 2014. 42(Database issue): p. D222–30. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Selengut JD, et al. , TIGRFAMs and Genome Properties: tools for the assignment of molecular function and biological process in prokaryotic genomes. Nucleic Acids Res, 2007. 35(Database issue): p. D260–4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Hunter S, et al. , InterPro: the integrative protein signature database. Nucleic Acids Res, 2009. 37(Database issue): p. D211–5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Ulgen E, Ozisik O, and Sezerman OU, pathfindR: An R Package for Comprehensive Identification of Enriched Pathways in Omics Data Through Active Subnetworks. Front Genet, 2019. 10: p. 858. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Nishida K, et al. , KEGGscape: a Cytoscape app for pathway data integration. F1000Res, 2014. 3: p. 144. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Keegan KP, Glass EM, and Meyer F, MG-RAST, a Metagenomics Service for Analysis of Microbial Community Structure and Function. Methods Mol Biol, 2016. 1399: p. 207–33. [DOI] [PubMed] [Google Scholar]
- 82.Huson DH, et al. , MEGAN Community Edition - Interactive Exploration and Analysis of Large-Scale Microbiome Sequencing Data. PLoS Comput Biol, 2016. 12(6): p. e1004957. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Abubucker S, et al. , Metabolic reconstruction for metagenomic data and its application to the human microbiome. PLoS Comput Biol, 2012. 8(6): p. e1002358. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Peters DL, et al. , Metaproteomic and Metabolomic Approaches for Characterizing the Gut Microbiome. Proteomics, 2019. 19(16): p. e1800363. [DOI] [PubMed] [Google Scholar]
- 85.Verberkmoes NC, et al. , Shotgun metaproteomics of the human distal gut microbiota. ISME J, 2009. 3(2): p. 179–89. [DOI] [PubMed] [Google Scholar]
- 86.Galloway-Pena J and Guindani M, Editorial: Novel Approaches in Microbiome Analyses and Data Visualization. Front Microbiol, 2018. 9: p. 2274. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Sudarikov K, Tyakht A, and Alexeev D, Methods for The Metagenomic Data Visualization and Analysis. Curr Issues Mol Biol, 2017. 24: p. 37–58. [DOI] [PubMed] [Google Scholar]
- 88.Xia Y and Sun J, Hypothesis Testing and Statistical Analysis of Microbiome. Genes Dis, 2017. 4(3): p. 138–148. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Kelly BJ, et al. , Power and sample-size estimation for microbiome studies using pairwise distances and PERMANOVA. Bioinformatics, 2015. 31(15): p. 2461–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Tang ZZ, Chen G, and Alekseyenko AV, PERMANOVA-S: association test for microbial community composition that accommodates confounders and multiple distances. Bioinformatics, 2016. 32(17): p. 2618–25. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Mandal S, et al. , Analysis of composition of microbiomes: a novel method for studying microbial composition. Microb Ecol Health Dis, 2015. 26: p. 27663. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Staley C and Sadowsky MJ, Practical considerations for sampling and data analysis in contemporary metagenomics-based environmental studies. J Microbiol Methods, 2018. 154: p. 14–18. [DOI] [PubMed] [Google Scholar]
- 93.Segata N, et al. , Metagenomic biomarker discovery and explanation. Genome Biol, 2011. 12(6): p. R60. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Friedman J and Alm EJ, Inferring correlation networks from genomic survey data. PLoS Comput Biol, 2012. 8(9): p. e1002687. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95.Fang H, et al. , CCLasso: correlation inference for compositional data through Lasso. Bioinformatics, 2015. 31(19): p. 3172–80. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 96.Kurtz ZD, et al. , Sparse and compositionally robust inference of microbial ecological networks. PLoS Comput Biol, 2015. 11(5): p. e1004226. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97.Faust K, et al. , Microbial co-occurrence relationships in the human microbiome. PLoS Comput Biol, 2012. 8(7): p. e1002606. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98.La Rosa PS, et al. , Hypothesis testing and power calculations for taxonomic-based human microbiome data. PLoS One, 2012. 7(12): p. e52078. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99.Karpinets TV, Park BH, and Uberbacher EC, Analyzing large biological datasets with association networks. Nucleic Acids Res, 2012. 40(17): p. e131. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 100.Knights D, Costello EK, and Knight R, Supervised classification of human microbiota. FEMS Microbiol Rev, 2011. 35(2): p. 343–59. [DOI] [PubMed] [Google Scholar]
- 101.Zhang Q, et al. , Selection of models for the analysis of risk-factor trees: leveraging biological knowledge to mine large sets of risk factors with application to microbiome data. Bioinformatics, 2015. 31(10): p. 1607–13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 102.Kennedy EA, King KY, and Baldridge MT, Mouse Microbiota Models: Comparing Germ-Free Mice and Antibiotics Treatment as Tools for Modifying Gut Bacteria. Front Physiol, 2018. 9: p. 1534. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 103.Gootenberg DB and Turnbaugh PJ, Companion animals symposium: humanized animal models of the microbiome. J Anim Sci, 2011. 89(5): p. 1531–7. [DOI] [PubMed] [Google Scholar]
- 104.Douglas AE, Simple animal models for microbiome research. Nat Rev Microbiol, 2019. [DOI] [PubMed] [Google Scholar]
- 105.Hacquard S, et al. , Microbiota and Host Nutrition across Plant and Animal Kingdoms. Cell Host Microbe, 2015. 17(5): p. 603–16. [DOI] [PubMed] [Google Scholar]
- 106.Pearce SC, et al. , Intestinal in vitro and ex vivo Models to Study Host-Microbiome Interactions and Acute Stressors. Front Physiol, 2018. 9: p. 1584. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 107.Dutton JS, et al. , Primary Cell-Derived Intestinal Models: Recapitulating Physiology. Trends Biotechnol, 2019. 37(7): p. 744–760. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 108.McDonald JA, et al. , Simulating distal gut mucosal and luminal communities using packed-column biofilm reactors and an in vitro chemostat model. J Microbiol Methods, 2015. 108: p. 36–44. [DOI] [PubMed] [Google Scholar]
- 109.Van den Abbeele P, et al. , Incorporating a mucosal environment in a dynamic gut model results in a more representative colonization by lactobacilli. Microb Biotechnol, 2012. 5(1): p. 106–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 110.Auchtung JM, Robinson CD, and Britton RA, Cultivation of stable, reproducible microbial communities from different fecal donors using minibioreactor arrays (MBRAs). Microbiome, 2015. 3: p. 42. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 111.Stevens LJ, et al. , A higher throughput and physiologically relevant two-compartmental human ex vivo intestinal tissue system for studying gastrointestinal processes. Eur J Pharm Sci, 2019. 137: p. 104989. [DOI] [PubMed] [Google Scholar]
- 112.Nigro G, et al. , Intestinal Organoids as a Novel Tool to Study Microbes-Epithelium Interactions. Methods Mol Biol, 2019. 1576: p. 183–194. [DOI] [PubMed] [Google Scholar]
- 113.Debelius J, et al. , Tiny microbes, enormous impacts: what matters in gut microbiome studies? Genome Biol, 2016. 17(1): p. 217. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 114.Poussin C, et al. , Interrogating the microbiome: experimental and computational considerations in support of study reproducibility. Drug Discov Today, 2018. 23(9): p. 1644–1657. [DOI] [PubMed] [Google Scholar]
- 115.David LA, et al. , Host lifestyle affects human microbiota on daily timescales. Genome Biol, 2014. 15(7): p. R89. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 116.Caporaso JG, et al. , Moving pictures of the human microbiome. Genome Biol, 2011. 12(5): p. R50. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 117.Mehta RS, et al. , Stability of the human faecal microbiome in a cohort of adult men. Nat Microbiol, 2018. 3(3): p. 347–355. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 118.Faith JJ, et al. , The long-term stability of the human gut microbiota. Science, 2013. 341(6141): p. 1237439. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 119.Martinez I, Muller CE, and Walter J, Long-term temporal analysis of the human fecal microbiota revealed a stable core of dominant bacterial species. PLoS One, 2013. 8(7): p. e69621. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 120.Baksi KD, Kuntal BK, and Mande SS, ‘TIME’: A Web Application for Obtaining Insights into Microbial Ecology Using Longitudinal Microbiome Data. Front Microbiol, 2018. 9: p. 36. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 121.Lugo-Martinez J, et al. , Dynamic interaction network inference from longitudinal microbiome data. Microbiome, 2019. 7(1): p. 54. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 122.Cleary JG, et al. , Quantitative Analysis of Shotgun Metagenomic Data with the Real Time Genomics Platform. Journal of Biomolecular Techniques : JBT, 2013. 24(Suppl): p. S33–S33. [Google Scholar]
- 123.UniProt: a hub for protein information. Nucleic Acids Res, 2015. 43(Database issue): p. D204–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 124.Misra BB, et al. , Integrated Omics: Tools, Advances, and Future Approaches. J Mol Endocrinol, 2018. [DOI] [PubMed] [Google Scholar]
- 125.Noor E, Cherkaoui S, and Sauer U, Biological insights through omics data integration. Current Opinion in Systems Biology, 2019. 15: p. 39–47. [Google Scholar]
- 126.Baron SA, Diene SM, and Rolain J-M, Human microbiomes and antibiotic resistance. Human Microbiome Journal, 2018. 10: p. 43–52. [Google Scholar]
- 127.Escudeiro P, et al. , Antibiotic Resistance Gene Diversity and Virulence Gene Diversity Are Correlated in Human Gut and Environmental Microbiomes. mSphere, 2019. 4(3). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 128.Thomas AM and Segata N, Multiple levels of the unknown in microbiome research. BMC Biol, 2019. 17(1): p. 48. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 129.Bernard G, et al. , Microbial Dark Matter Investigations: How Microbial Studies Transform Biological Knowledge and Empirically Sketch a Logic of Scientific Discovery. Genome Biol Evol, 2018. 10(3): p. 707–715. [DOI] [PMC free article] [PubMed] [Google Scholar]
