Skip to main content
Philosophical Transactions of the Royal Society B: Biological Sciences logoLink to Philosophical Transactions of the Royal Society B: Biological Sciences
. 2020 Oct 5;375(1812):20190577. doi: 10.1098/rstb.2019.0577

The soil in our microbial DNA informs about environmental interfaces across host and subsistence modalities

Stephanie L Schnorr 1,2,
PMCID: PMC7702791  PMID: 33012224

Abstract

In this study, I use microbiome datasets from global soil samples and diverse hosts to learn whether soil microbial taxa are found in host microbiomes, and whether these observations fit the narrative that environmental interaction influences human microbiomes. A major motivation for conducting host-associated microbiome research is to contribute towards understanding how the environment may influence host physiology. The microbial molecular network is considered a key vector by which environmental traits may be transmitted to the host. Research on human evolution seeks evidence that can inform about the living experiences of human ancestors. This objective is substantially enhanced by recent work on ancient biomolecules from preserved microbial tissues, such as dental calculus, faecal sediments and whole coprolites. A challenge yet is to distinguish authentic biomolecules from environmental contaminants deposited contemporaneously, primarily from soil. However, we do not have sound expectations about the soil microbial elements arriving to host-associated microbiomes in a modern context. One assumption in human microbiome research is that proximity to the natural environment should affect biodiversity or impart genetic elements. I present evidence supporting the assumption that environmental soil taxa are found among host-associated gut taxa, which can recapitulate the surrounding host habitat ecotype. Soil taxa found in gut microbiomes relate to a set of universal ‘core’ taxa for all soil ecotypes, demonstrating that widespread host organisms may experience a consistent pattern of external environmental cues, perhaps critical for development. Observed differentiation of soil feature diversity, abundance and composition among human communities, great apes and invertebrate hosts also indicates that lifestyle patterns are inferable from an environmental signal that is retrievable from gut microbiome amplicon data.

This article is part of the theme issue ‘Insights into health and disease from ancient biomolecules’.

Keywords: soil microbiome, gut microbiome, human evolution, coprolites, human environment, biodiversity

1. Introduction

Soil encapsulates some of the highest biodiversity per unit volume of material as compared with other environmental mediums [1,2]. It houses microorganisms such as bacteria, microeukaryotes, fungi and invertebrate microfauna, while also being an anchor point for plants and a multifaceted resource for larger animal organisms, who derive food or shelter from its crevices. One can think of soil much like water, as a medium that organisms live not only on but throughout, and like occupants of aquatic environments, those that inhabit soil ecosystems take on components of their environmental [3,4]. The microbiome, that is, the microscopic organisms including viruses, bacteria, archaea and single-celled fungi or eukaryotes, is a living component of habitable ecosystems that permeates the structural web of organic matter. Although organisms are defined by the epithelial boundaries that protect internal tissues, defending self from other, infiltration from external elements still occurs. Soil microbiomes are distinctive communities among diverse soil ecologies, and their compositions are the fingerprints of broader ecosystems, such as forests, grasslands, shrublands or deserts [5]. Naturally, the soil microbiome can be found in organisms that consume soil such as termites and earthworms, but soil microbes also infiltrate other arenas that are simply proximate to soil spaces, such as air currents, plant leaf surfaces, animal skin and human domiciles [68]. Where these microbes rest on, they also reside in, and host-associated microbiomes sustain a certain small percentage of persistent inoculation from the environment. Whether or not contaminants remain transient or become endemic is still under investigation [9], but they do participate in gene transfers [1013], if not directly in wholescale community cell replacement. Through faecal microbiome data from human populations spanning a range of living-condition modalities, it might be possible to understand the variation in soil transmission—in this case of soil taxa—to the gut ecosystem.

(a). Anthropological perspective on human microbiome data can inform widely

The question of environmental influence on human host microbiomes can be studied from the robust data generated from dozens of human societies around the world, many of whom are small-scale Indigenous populations occupying the global ‘south’ with varying limited affects from built environments, urbanization, environmental toxins or integration into market economies [14]. Of particular interest to researchers in anthropology is how over time human ancestors have shaped their environment to suit their needs. Studies on human biology in evolution look at the interface between humans and the environment, but the focus is usually on the impact of human activity on the external world [15]. However, humans adapt to environmental challenges such as climate, resource availability, pathogens or xenobiotics, often by modifying their behaviours, which can impact auxiliary genomes found internally in human microbiomes. Owing to a strong predilection for constructed spaces, or built environments, the human-environmental connection depends on a multitude of factors tethered to geography, subsistence and culture. The intensity of association (physical contact, duration, permeation) may determine the prevalence of an environmental signal (environmental microbial molecules) within a host-associated ecosystem. Factors relevant to exposure circumstances include whether environmental interaction comes from a wild, cultivated or urban setting, whether exposure vectors can enter the airways, oral cavity and gastrointestinal tract, or whether exposure is only superficial, such as on washed surfaces. Variations in these modalities would likely impart more distinguishing features defining microbial community structure or diversity across host communities. Humans have dramatically altered how they occupy their environments primarily through subsistence and economic transitions, which has both positive and negative consequences on our own health [16], but overwhelmingly negatively impacts the environment through the loss of biodiversity [17]. When we learned how to kill infectious microbial agents, we were still blind to the widespread benefits their counterparts might perform, and how integrated our own physiologies are with trillions of microscopic organisms that colonize every surface of our bodies and our environments. Renewed efforts to catalogue the (micro)biological world around us has shown us that human activity is costing us the variability inherent to nature [18], and that these costs undercut the stabilizing dynamics of natural ecosystems.

In the wake of massive globalization of hyper-industrialized societies, it is of paramount importance to take action against the collapse of macro- and micro-ecosystems due to the loss of biodiversity. This means that we should look to ancient biomolecules as an archive, offering a glimpse of the ecological status prior to widespread anthropogenic disturbance to our internal and external ecosystems. However, ancient specimens are rare, and those that exist may be too degraded for sufficient recovery of ancient DNA to reconstruct informative ecologies. One major concern when interrogating ancient microbiomes is the issue of contamination, since both authentic-endogenous and contaminant-exogenous molecules are predominately of bacterial origin [19]. Yet we should expect some transient or even opportunistically colonizing bacteria naturally to be found in host-associated microbiome ‘tissues’, such as in dental plaque and calculus, and in faeces, through daily exposure to environmental (air, water, soil) microbiota [2]. In order to know whether ancient microbiomes retain authentic signals of the host environment, we need to know to what extent the environment intrudes upon present-day host-associated microbiomes. Investigating environmental inputs on host-associated microbiomes is also important for understanding the intrinsic connection between organisms and their environment, and can be used as an indicator of disturbance hotspots that are likely to impact host health.

(b). The nature of entanglement between soils in human environments and microbiomes

Soil is ubiquitous in natural landscapes, and while it may be often hidden, it is still the foundation of the built environment in urban settings, and is among the implied actors of ‘environmental interactions'. How humans occupy their landscapes surely impacts their physiology, since the organism–environment interface determines the type of molecular information exchanges that can occur, where those molecules come from, and how they are conveyed. The microbiome is a potent reservoir of rapidly adapting microorganisms that accumulate traits through discrete chemical interactions, most notably horizontal gene transfer [20]. For host-associated microbiomes, these can be regarded as an impressionable symbiotic organ, being at once an integral part of host physiology as well as a signal of the environment, and therefore be an indicator of the environmental interactions that humans experience. More than a decade of research into human host-associated microbiomes has shown that as environments and behaviours with health-mediating effects change, so too are associated microbiome communities, to varying degrees, perturbed [21].

Host-associated microbiomes congregate at epithelial surfaces or in fluids, most notably in breast milk and along the mouth, airway, skin, urogenital tract and intestines [22]. Visualization of the community is accomplished most directly with metagenomics, a kind of horizontal interrogation of multiple species, and represents a transpose of the vertically oriented information field one obtains from single organism genomics. The ability to recover ancient molecules such as proteins [23,24], DNA [25] and lipids [26,27] opens the door to deep time and to date has informed most notably how organisms have evolved, what they have produced, and to whom they related. With recovery of ancient host-associated microbiome molecules [23,28,29], the reconstruction expands to include also features of the environmental, insofar as they pertain to host activities. The combination of genomic and microbiomic information is highly complementary towards understanding who human ancestors were (genomics: population history), the immediate circumstances of their physiological condition (proteomics: immune factors), and their environmental experiences (microbiomics: genetic and taxonomic arrangements).

This analysis probes the hominid (humans and great apes) gut microbiome for similarities to a set of soil microbiome core taxa from diverse ecosystems. The presence, diversity and composition of shared taxa may be evidence of soil microbial infiltration into the host-associated microbial ecosystems. The aim is to apply these findings to expectations about whether ancient DNA of human microbiomes could tell us about the human–environmental interface in evolution. Great ape data from bonobos, chimpanzees and gorillas, as well as termite data from a multitude of taxa (including cockroaches, crickets and one scarab beetle), were used to inform about the presence of soil taxa due to phylogeny, feeding habits and host habitat (see table 1 for sample data summary and electronic supplementary material, table S1, for full sample metadata). In particular, through this work, I aim to expand the ecological and evolutionary plausibility for the existence of soil taxa in the gut microbiome of humans and their ancestors by including data from African great apes, and to benchmark expectations about soil taxa incorporation into host-associated microbiomes in an extreme example by using datasets from many diverse termite species and other invertebrates.

Table 1.

Summary of host-associated datasets included in this study.

host N (samples with soil core taxa) study platform 16S region community (as specified in original study)
beetle 1 Dietrich [30] 454 v3v4 n.a.
bonobo 70 Moeller [31] Illumina v4 LA, ML
chimp 44 Moeller [31] Illumina v4 Gombe
cockroach 7 Dietrich [30] 454 v3v4 n.a.
cricket 2 Dietrich [30] 454 v3v4 n.a.
fungal comb 2 Li [32] 454 v3v4 n.a.
gorilla 181 Moeller [31] Illumina v4 Campo, Belgique, Djoum, Bipindi
human 27 Ayeni [33] Illumina v3v4 Bassa, urban Nigerian
23 Duboise [34] Illumina v4 Montreal, Nunavut
6 Li [35] Illumina v4v5 Tibetan, Han
78 Obregon-Tito [36] Illumina v4 Matses, Norman, Tunapuco
43 Pires [37] Illumina v4 Puruzinho, Buiucu, Rio
1 Schnorr [38] 454 v4 Hadza
107 Smits [39] Illumina v4 Hadza
1 Stagaman [21] Illumina v4 Shuar
529 Yatsunenko [34] Illumina v4 Malawi, USA, Venezuela
termite 77 Bourguignon [40] Illumina v3v4 n.a.
3 Chew [41] Illumina v3v4 n.a.
15 Dietrich [30] 454 v3v4 n.a.
14 Kohler [42] 454 v3v4 n.a.
153 Li [32] 454 v3v4 n.a.
43 Schnorr [43] Illumina v4 n.a.

2. Methods

A methodological work-flow diagram of the following procedures is provided in figure 1.

Figure 1.

Figure 1.

Methods workflow of bioinformatics process using 16S rRNA amplicons containing the V4 hypervariable region. The schematic shows the process undertaken to derive the soil-core OTUs and the subsequent abundance tables of host-associated sample data after searching for the soil-core OTUs among their sequenced communities. The first step was to obtain the full set of OTUs at 97% identity using the USEARCH UPARSE algorithm (totalling 16 936 OTUs), and then these OTUs were filtered for only those that achieved a presence in at least 80% of the soil samples from the Global Soil Dataset (a total of 112 OTUs). Then, both the Global Soil Dataset and the Host-Associated Data were searched for the soil-core OTUs, deriving the abundance tables that were used for analysis in R. The taxonomy summaries derived from VSEARCH SINTAX call against the SILVA v132 16S database were used for sourcetracker2 analysis, and the taxonomy derived from the BLAST search against the NCBI 16S rRNA database was used otherwise for annotating the soil-core OTUs. Colour boxes indicate a microbiome dataset (green), a process or software (gold), or a reference database (blue).

(a). Deriving soil-core taxa

The 16S V3-V4 global soil dataset [44] was downloaded from the National Center for Biotechnology Information Short Read Archive (NCBI SRA) and curated following the original study methods, which involved clipping 20 bases from both ends of each read. Unique sequences with a minimum of 5 reads (less than 5 were considered spurious) were de novo clustered into operational taxonomic units (OTUs) at 97% similarity using USEARCH UPARSE [45] resulting in 16 936 OTUs, and the OTU table was built for all soil samples using the VSEARCH ‘usearch_global’ call [46] at 97% identity to the OTU representative sequences. A set of 112 core OTUs were found among the soil samples, defined as OTUs that were present in at least 80% of samples (183 out of 229 samples). Taxonomy of the soil-core OTUs was obtained by a BLAST search against the NCBI prokaryotic 16S rRNA database, taking top hits only, with a per cent ID score greater than 84% (electronic supplementary material, table S2).

(b). Finding soil-core taxa among host-associated datasets

Host-associated 16S rRNA datasets (electronic supplementary material, table S1) that sequenced at least a portion of the V4 region were selected in order to search for presence and abundance of the derived ‘soil-core’ OTU dataset [21,30,3234,3639,4143,4749]. The soil-core OTUs were used as the reference database with VSEARCH ‘usearch_global’ call at 97% identity (terminal gaps are excluded from the identity score), resulting in dataset-specific OTU tables of the counts of soil-core OTU sequences per sample. The resulting soil-core OTU count tables were merged and used for downstream analysis in R.

(c). Verifying soil-core taxa

To validate that the soil-core OTUs are not environmental contaminates, sourcetracker2 [50] was used to identify the soil-core components of the original soil samples from other host-associated and environmental microbiomes. Data used for the ‘sinks’ included the original global soil dataset samples [44]; datasets used as the ‘sources’ included human hunter–gatherer (HG) [36,38,51], rural, urban and pastoralist gut microbiome data [37,5153]; human skin microbiome data [54]; great ape microbiome data [48]; and two soil microbiome datasets (including the original global soil dataset) [5,44,55]. All datasets were run through de novo OTU picking at 97% using VSEARCH implementation of USEARCH UPARSE, and taxonomy mapped against the SILVA v132 16S database [56] using VSEARCH SINTAX with confidence cut-off specified at 0.8. All resulting relative abundance taxa tables were summarized at the genus level (L5) and then merged using QIIME merge OTU tables [57]. The unassigned bin was removed prior to running sourcetracker2. The sourcetracker2 gibbs command was run with default parameters except that source rarefaction was set to 0 because of prior normalization.

(d). Tracing compositional similarity of soil-core from host sample sets

To understand whether soil-core taxa found among the host-associated microbiomes could be identified in composition to soil profiles, sourcetracker2 was again used, this time on the soil-core taxa tables from the V4 region 16S rRNA datasets where some samples could be rarefied to 100 sequences [30,32,33,39,4042,47,48]. The taxonomy for the soil-core OTUs was found using the VSEARCH SINTAX call against the SILVA v132 16S database (done previously for verifying the soil-core taxa), and then the taxonomy annotations were added to the soil-core OTU tables, rarefied to 100, summarized at the genus (L5) level, and then merged. Sources included three soil microbiome datasets [5,44,55], a human skin dataset [54] and human gut microbiome for three subsistence specifications (foraging, rural farming, industrial agriculture) [36]. Source-tracked proportion tables were sorted by the proportion of ‘unknown’ composition and samples evaluated if at least 25% of the community profile could be traced to a source dataset.

(e). Analyses

Analyses conducted on the merged soil-core count table and the alpha-diversity calculations were handled in R v. 3.4.1 using the {vegan}, {made4}, {propr} and {reshape2} packages [5863]. Visualization of results were achieved using the base R graphics alongside the {vioplot}, {scales}, {colorspace} and {ggplot2} packages [6466]. The input core count table was normalized relative to the number of total reads for each sample. In the human samples, where age was provided with the associated metadata, samples were binned into three lifestages: ‘child’ (aged 0–10 years), ‘adolescent’ (aged 11–15) and ‘adult’ (aged greater than 15). Statistical significance for separation between groups was calculated using permutational multivariate analysis of variance (PERMANOVA) with adonis in the {vegan} package.

(f). Alpha-diversity analysis

Alpha diversity for observed OTUs and Shannon diversity were calculated using QIIME on the merged soil-core count table from multiple rarefactions to a depth of 1000 observations, and resulting matrices were imported to R for aggregation and visualization. Rarefaction curves were created using the rarecurve function in {vegan} on the normalized non-rarefied count table (electronic supplementary material, figure S1A). The observed OTUs and Shannon diversity matrices were transformed using {reshape2} to long form. First, by-sequence alpha-diversity values were obtained by aggregating the mean values across all samples at each sequence depth and plotted to visualize the effect of sampling, indicating that diversity value is highly impacted up to a depth of approximately 300 sequences (electronic supplementary material, figure S1B), given that few samples among all queried datasets had greater than 1% of soil-core sequences (electronic supplementary material, figure S1C). Alpha-diversity values at a depth of 300 observations were visualized in boxplots, but this strategy omitted 712 out of 1015 samples. Instead, the maximum diversity values were found for each individual sample and used for boxplot visualization, but as a result, these calculations should be taken with caution (electronic supplementary material, figure S1D). Given these limitations, a projection of the diversity values was accomplished using the {vegan} richness estimator function, specaccum, which shows species accumulation curves based on the ‘exact’ method of expected/extrapolated accumulation from a moment-based estimation of the mean. Richness estimations projects the number of expected species by site (as opposed to by sequence in rarefaction curves), which was accomplished on subsets of the original count table (by-host, by-subsistence and by-age).

(g). Beta-diversity analysis: ordination, heatplots, composition and prevalence

Principal components analysis (PCA) was conducted on the centre log-ratio (clr)-transformed data, first using the {propr} package and propr command with [ivar = ’clr’] to obtain the log-ratio data frame, and then using the capscale function from the {vegan} package, specifying the LHS as the log-ratio data frame and RHS as ‘approximately 1’ with [distance = ’euclidean’] to obtain PCA with no constraints. Since there was a high degree of sparsity and 0 counts, the table was also filtered for samples that have at least 0.1% of total reads assigned to a soil taxon prior to ordination and clustering of the whole dataset, which reduced the number of samples from 1278 to 324, but gave a more accurate representation of sample clustering by composition. Feature gradients represent the number of unique soil-core OTUs (features), which were fitted onto PCA plots using ordisurf function of {vegan}. Heatmaps were generated from the clr-transformed data using the {made4} package with the ‘ward.D2’ clustering method. Prevalence heatmaps were generated by taking the top-half most-abundant OTUs (56 total out of 112) for the host-associated samples. Host groups with less than 10 individuals were removed (beetle, cockroach, cricket and fungal comb), leaving large host groups for prevalence mapping. The heatmaps show the per cent of prevalence (presence/absence) of soil-core OTUs for each host group and each human community, respectively, with the soil-core prevalence in the original soil samples as comparative.

3. Results

In the earlier work by Delgado-Baquerizo et al. [44], which looked at the dominant bacterial communities from a global atlas of soil ecosystems, roughly 2% (approx. 500) of all bacterial OTUs found among soil microbiomes represented nearly half of the actual bacterial abundance in these communities. The ‘core’ OTUs were thus defined as the top 10% most abundant OTUs that were found in greater than 55% of soil samples. Here, in the present analysis, a highly abundant set of ‘soil-core’ OTUs were found using a more stringent selective criteria, aimed at capturing truly ubiquitous soil taxa that may be traceable to host-associated microbiomes. The soil-core OTUs were determined here to be OTUs found in at least 80% (183 out of 229) of the global soil microbiomes from the same soil samples previously reported in Delgado-Baquerizo et al. [44], encompassing a wide array of ecosystems from dry grasslands to tropical forests. These soil-core OTUs represented 0.66% of the total number of OTUs (112 out of 16 936 OTUs), but on average, 27.5% (7.4%) of total OTU abundance of total identified OTUs among all soil samples (minimum 6.5%, maximum 42.5%) (electronic supplementary material, figure S2A). Sourcetracker2 analysis was used to trace soil-core profiles of each global soil sample in order to understand whether soil-core taxa were actually reflective of soil ecosystems, or whether they were contaminants, perhaps from reagents or airborne particles. The soil-core OTU abundances for global soil sample types do in fact map on to soil microbial ecosystems rather than host-associated microbiomes (approx. 99% soil identity), and recapitulate the soil ecosystem from which the samples came (electronic supplementary material, figure S2B). The soil-core OTUs served as the representative OTU database against which diverse human- and animal-host gut microbiome communities were mapped at 97% identity (see figure 1 for workflow depiction). The resulting abundance tables demonstrate that there is some component of soil-core taxa that are prevalent in host-associated gut microbiome communities (figure 2), potentially deriving from direct contact with soils (a proxy for the intensity of interaction with the natural environment).

Figure 2.

Figure 2.

Heatmaps of prevalence of 56 top-most abundant OTUs. The prevalence (the per cent of samples with the OTU present) of the top-half most abundant soil-core OTUs as measured among host samples (a) and then within humans (b) shows the differential distribution of soil-core bacteria based on the host category and community, respectively. Gorillas and then chimpanzees appear to have the most numerous consistently present taxa, while the bonobo, human and termite host samples have lower group-based prevalence, indicating a more dispersed, soil-core representation. The soil-core OTU prevalence in the soil samples is expectedly near to 100% across all OTUs, reinforcing the notion that these soil-core OTUs are indeed ubiquitous in global soil ecosystems. Taxonomic annotation of OTUs is based on top BLAST hit from the full NCBI 16S rRNA database.

Overall, diverse hosts, which included humans, African great apes, termites, cockroaches, crickets, and one species of beetle, significantly differed in the amount, diversity and distribution of soil-core OTUs (PERMANOVA, R2 = 0.223, p < 0.001). An overall picture of the effect of sequencing depth on the diversity of soil-core taxa indicates that the evenness of soil-core taxonomic abundance decreases as more new taxa are found, which can be expected from sparse or rare data. Since Shannon diversity decreases concordant with an increase in OTU count for soil-core taxa (electronic supplementary material, figure S1B), then indeed the event of soil-core taxonomic presence in host-associated gut microbiomes is rare, and the distribution is sparse. To drill down into more specific host effects, it was necessary to use diversity projection curves. Richness estimations, or species accumulation curves, project the number of expected species with the addition of each new site (individual host). The non-human great apes exhibit the highest estimation of species richness (but with varying accumulation rates) while humans (aggregated) and the termite species (distinguished by whether or not they farm fungus) have more gradual accumulation (figure 3a). Humans and non-fungus-farming termites appear to have a slower rate of accumulation but less severe plateau, while the non-human great apes and the fungus-farming termites indicate plateau phases following varying rates of early accumulation. These differences likely relate to the fact that the human and non-fungus-farming termites have more heterogeneous intraspecific subsistence practices within their grouping [67,68]. The mean relative abundance of the soil-core taxa summarized by host also show distinctive variations in the distributed proportions of present soil-core taxa (figure 3b; see electronic supplementary material, table S2 for by-host soil-core OTU proportions) with some hosts dominated by just one or two taxa (beetle, cricket, fungal comb, gorilla and human) and others having higher representation by many taxa (bonobo, chimp, cockroach and termite). Additionally, the proportion of sample sequences that mapped to soil-core OTUs ranged between less than 1% to 20%, and were highest surprisingly among the chimpanzee, gorilla and human hosts, rather than among termites and other invertebrates (figure 3c), which was unexpected and may indicate any number of phenomena such as feeding habit, host foraging selectivity, environmental disturbance from host activity, or potentially, contamination during sample collections. The soil-core taxonomic compositions are clearly distinguished between environmental (soil) and host microbiomes, and also separated by host species (figure 3d; PERMANOVA, R2 = 0.192, p < 0.001), but these profiles are actually best explained by the more granular categorization of ‘community’ (i.e. country, region or ethnic group) (PERMANOVA, R2 = 0.32, p < 0.001). When richness estimates are viewed for human data alone, stratification is apparent based upon subsistence type, but also for lifestage, defined as ‘child’, ‘adolescent’ and ‘adult’ (see Methods for binning criteria; figure 4a,b). Rural-farming and hunter–gatherer (HG) groups have greater accumulation of taxa per site than do urban-living humans, which conforms to the hypothesis that direct interaction with earth soil is a necessary condition for soil bacteria to infiltrate the gastrointestinal tract. Curiously, samples assigned to children also accumulate soil-core taxa richness faster than those assigned to adolescents and adults. Children tend to interact with their environment more, often orally, at very young ages, but also through play and exploration phenotypes that characterize this lifestage, and which may continue into adolescence. It is expected that children and adolescents would have greater soil-core taxonomic richness than adults; however, adolescents are observed to have a slightly lower richness accumulation rate than adults, but still fall entirely within the 95% confidence area. This effect may simply be a matter of a small age range for adolescents, therefore encompassing a smaller span of (and fewer) individuals than the other binned lifestage categories. Viewed by age, the number of soil-core taxa observed appears to have a general negative association with increasing age in humans (Pearson correlation, ρ = − 0.09, p = 0.018). When this relationship is looked at specifically within individuals from three different countries (USA, Malawi, Venezuela; from Yatsunenko et al. [34]), the pattern is upheld mainly due to observations on individuals from the USA (Wilcoxon, p = 0.019; figure 4c). This result is good evidence for the notion that while an urban lifestyle may reduce soil-bacteria exposure overall, children may still be exposed to soil from natural environmental exploration, and the contrast between children and adults in urban settings is most apparent. There are also more sampled individuals from the USA, which may bias this analysis. Human community microbiomes from populations living as hunter–gatherers, subsistence farmers and foragers, pastoralists, and in urban-industrialized settings also showed marked variation in the abundance and distribution of soil-core taxa, which is also best explained by community-level assignments (PERMANOVA, R2 = 0.37, p < 0.001; figure 4d), yet taxonomic distribution did not have any apparent relation to age or lifestage (PERMANOVA, R2 = 0.039, p = 0.129).

Figure 3.

Figure 3.

Soil-core OTU distribution among host samples. The projected richness from the number of soil-core taxa found by site in host-microbiome communities (a) shows African great apes to harbour the highest richness per site, but human and termite sample projections indicate greater variation in by-site richness. The soil-core relative abundance averaged by host (b) shows distribution of abundance across the 112 soil-core OTUs. The proportion of soil-core OTUs found in all sequences is shown averaged by host (c). Clustering based on soil-core abundance tables shows that samples cluster by host, but stochastically, and are highly interspersed, depending on whether taxonomic abundance is concentrated in a few or many OTUs (d).

Figure 4.

Figure 4.

Human soil-core taxonomic profiles by subsistence, community and age. Rural farming communities and hunter–gatherer communities have much higher richness projections than urban communities (a), while rural arctic and pastoral communities are under sampled but appear to have a lower curve trajectory. Children have higher richness than adolescent or adult individuals (b), which corresponds to a negative linear trend in the number of soil-core OTUs by age and is particularly well-represented in lifestage groupings among samples from the USA (c). Distribution of samples by clr-transformed abundance table of soil-core OTUs shows clusters by subsistence (top colour bar, left-hand colour key) and by community (bottom colour bar, right-hand colour key), primarily differentiating hunter–gatherer and some rural farming samples from urban and other rural groups (d).

Ordinations were used to visually explore the link between different hosts and their soil-core microbial distribution to understand whether patterns might suggest an effect of lifestyle and environment, or whether the soil-core presence is an extended phylogenetic trait, as has been suggested of gut microbiomes in past works [30,47,69,70]. Continuing this exploration with the addition of the great ape and termite datasets helps to discern the ecological (termite) and evolutionary (great apes) plausibility for soil taxa to arrive in the human gut microbiome as a result of proximity to the environment. Principal component analysis of Euclidean distance of the clr-transformed soil-core abundance table, filtered for individuals achieving at least 0.1% total soil-core sequence abundance, indicates host-dependent clustering to some degree. The full dataset shows clustering contingent on ecology (soil versus host-associated microbiomes) and host species (figure 5a). Of interest, there is interspersion between some human samples among great-ape and termite samples, but extreme separation between some human and gorilla samples obliterates finer differentiation among other hosts (figure 5a). When the dataset is reduced to only those studies containing individuals identified as having a hunter–gatherer subsistence [36,38,39], it is possible to see some meaningful compositional-based stratification of samples, which in part relates to the community and subsistence practice (figure 4d). These human samples derive entirely from datasets contributing hunter–gatherer, mixed subsistence, or rural farming group sampling, alongside samples from westernized urban communities for comparative work. Isolating the human data for ordination again depicts clustering by community and country of origin for the individuals (figure 5b), which of course relates to subsistence, lifestyle and geography.

Figure 5.

Figure 5.

Ordination of clr-transformed soil-core OTU abundance table. Soil-core clustering showing soil and host-associated soil-core OTU distribution indicates major separation between the environment and the host-associated ecologies, as well as between host organisms (a). Human community datasets were evaluated independently and also depict some community-based clustering (b and c). Comparative analysis looking at human sample stratification within the same study (that depicted both small-scale and westernized communities) clearly differentiates the hunter–gatherer groups (Hadza and Matses) from rural farmers and westernized urban-living individuals (Tunapuco and Norman; b). Comparing all human samples from across all studies (filtered for samples achieving at least 0.1% of soil-core OTUs in the total microbiome community) shows HG and some rural farming groups separate from a tight cluster of mostly individuals from the USA and some from small-scale Venezuelan communities (c). The deviations from clear subsistence patterned clustering here may be due to the inclusion of young children. The gradient (green topographical lines) indicates the OTU features (richness) distribution across the plot space, and is not instructive for explaining the distributions.

Finally, the soil-core taxonomic composition of interrogated sample datasets was analyzed using sourcetracker2 software in order to understand whether these profiles bore any relation to non-host-associated soil microbiome communities versus host-associated human gut communities, and human skin communities, or if instead they appear as only random accumulations of environmental contaminants. Datasets submitted for source tracking included only those whose sample soil-core OTU tables could be rarefied to a minimum of 100 sequences, since extremely sparse tables would not be traceable to a source community profile. Even with such a low threshold criteria, only two human datasets consisting of individuals from rural and urban Nigeria (Ayeni et al. [33]) and individuals from Malawi, Venezuela and the USA (Yatsunenko et al. [34]) could be included on account of the overall high-frequency of a low per cent of soil-core OTU prevalence in accessed datasets (figure 3c; electronic supplementary material, figure S1C). In addition, the African great ape and termite datasets [30,32,41,46] were included for comparison and to better understand the environmental predictive power of the recovered soil-core community in each host. Of the 388 ‘sink’ samples, a total of 34 contained a greater than 25% identifiable taxonomic profile, all of which map to soil environments rather than human-associated microbiome communities (figure 6). The highest proportion environmental match also closely matched the actual environment from which the sample originates: human communities from rural Nigeria, Venezuela and Malawi were traced to dry forest soil communities; bonobo and chimp samples also matched to dry forests, while gorillas largely matched temperate forest communities; termite samples matched dry forest, dry grassland and temperate forest communities. These results are encouraging evidence towards the hypothesis that the environment can be a source of endogenous molecular diversity found among gut microbiome genetic data, in modern and in ancient samples. Furthermore, universally shared soil microbial traits are apparently spread throughout environments and animal host-associated microbial ecosystems, implying that we should expect the environment to be a reservoir of conserved microbial information (e.g. taxa, genes, proteins, metabolites). According to these results, supported through source-tracking analysis, microbial molecular information does arrive in the host-associated gut microbiome, and in patterns suggestive of a fairly uninhibited pathway through the animal digestive tract, dependent upon patterns of exposure that relate to community-level lifestyle properties. Therefore, I answer the question posed by Blum and colleagues, ‘Does soil contribute to the human gut microbiome?’ in the affirmative [2].

Figure 6.

Figure 6.

Sourcetracker2 identification of soil-core profiles from host-associated microbiomes. For samples that achieved greater than 75% identification from sourcetracker2 analysis, the soil-core profiles are maximally representative of soil communities (shown in the cool to warm colour gradient) and do not map on to any host-associated microbiome sources provided in the analysis (discrete categorical colours: sky blue, human skin; tan, Tunapuco rural farmers; green, Matses HG; red, westernized urban-living individuals in Norman, Oklahoma, USA).

4. Discussion and conclusion

The concept of a core microbiome community is widely applied to host microbiomes, but it also remains valid for environmental microbiome communities where certain structural or metabolic parameters are shared. This assumption has been previously confirmed through identification of a core soil microbiome community that is shared among globally distributed soil microbiomes from diverse ecosystems [44].

The results of this initial study on the presence and pattern of a soil-core microbial component in host-associated microbiomes suggest that a fingerprint of an individual's environment in the form of soil-associated bacterial taxa may be found among gut microbiome sequence data. This has important implications for studying materials for traces of ancient molecules, particularly from sources that may contain host-associated microbial elements such as latrine sediment, coprolites, soil sediments and dental calculus. Identification of soil bacteria in endogenous sample data may be troubling from the standpoint that soil microbial elements integrated within a microbiome are not readily distinguishable from environmental contamination (in fact, the definition of contamination is hazy in this regard), but this work serves as a proof of concept that more sensitive analyses to detect soil-origin microbiota would be a rewarding venture for a variety of reasons, including ancient environmental reconstruction, understanding factors in microbiome assimilation, and clarifying the medium of environmental molecular transmission to host-associated ecosystems.

Soil and other ecological features such as water and air are important reservoirs for microbes that impact host physiology [6,7,10,18] or seeding obligate symbioses [71,72]. A study of edible fungus-farming termites found one novel termite Treponema strain that clustered separately with Treponema strains found in human hunter–gatherer and rural farming groups, lending support to the idea of a shared universal reservoir for rare or novel bacterial strains in host-associated microbiomes [43]. Such a finding hints at the possibility that environmental taxa play important roles in structuring host-associated microbiomes, and that ‘infiltrators’ are regularly transiting the gastro-intestinal tract and contributing to the molecular cross-talk, or even directly participating in metabolic activities within the lumen. One potential issue of note, however, is the uncertain possibility of post-depositional environmental contamination, since sample collection methods across studies are not consistent and are rarely specified in the original publications. Human and African great ape data came from faecal samples, and invertebrate data came from homogenized whole-gut isolates. The African great ape samples were collected from deposits on the forest floor, and so it is plausible that these could be contaminated with soil bacteria after evacuation. However, if that were the case, a more consistent base-line level of soil-core bacterial presence in the African great ape specimens would be expected, which would be set apart from the rest of the samples. Instead, not only do bonobo samples fail to show a comparable proportion of soil-core sequences (less than 2%), there is a species-specific distribution of soil-core OTUs among the great apes with some overlap (figure 3b–d), similar to what is seen in the original paper results on the full taxonomic distribution of the microbiome [31]. The omnivorous and fungus-farming invertebrates also intersperse with a cluster of the great ape data (figure 5a), suggesting a separate mode of integration of soil-core taxa, since invertebrate samples were removed whole from intact specimens. Still, if soil taxa inoculation to host-associated microbiomes is to be studied further, more careful documentation on the manner of sample collection, including the intermittent context of the sample, between the deposit and the stabilized collection vessel, is critical.

Of the main findings in the present analysis, a number of observations stand out that warrant further exploration. First, it is notable that using a fairly constrained sample set (only datasets for which the V4 region of the 16S rRNA gene is available), it was still possible to witness host-species, subsistence and community-based clustering of the soil-core abundance tables. This means that a number of factors influence the infiltration of soil taxa to the host gut microbiome that can be informative for attempting to reconstruct the ecology of an unknown microbiome sample from ancient remains. Individuals from rural human societies that farm or forage for subsistence contain a higher abundance of total reads mapping to soil-core taxa, and also greater richness of taxonomic units, than do urban-living individuals. However, samples deriving from individuals living in a rural arctic community or who live in a pastoralist society had very few reads assigned to soil-core OTUs and could not be used in subsequent analyses due to the sparsity of data. Technical factors could certainly influence this finding, but so far this observation is consistent with the overall picture from previous works of gut microbiome biodiversity found among individuals from these communities. The gut microbiome biodiversity from some high-altitude pastoralist groups as well as traditional populations in arctic environments is comparable with individuals from market-integrated communities [34,73]. In addition, environmental factors (such as sourced drinking water) can swamp variation due to diet or subsistence practice [52]. Rarefaction curves shown by host are congruent with a scenario in which close-association or feeding on soil (such as in the case for termites) permits high amounts of soil taxa to mingle with host-associated microbiomes.

A second important finding is that the ability to harbour high abundance and diversity of soil taxa in a hominid gut is phylogenetically feasible, given that chimpanzees had the greatest number of sequences assigned to soil-core taxa and had the greatest richness by sequence and by site. This finding is interesting but puzzling, since both bonobos and chimpanzees typically feed from ripe canopy fruits and share activity times occupying terrestrial and arboreal spaces, while gorillas are ground floor occupants and feeders [74]. Further exploration of great ape data for similar soil taxa should be highly illuminating towards understanding what aspects of hominid host ecologies stimulate environmental inoculations. Also expanding this exploration to include cercopithecine monkeys would be important for understanding the role of diet versus activity habits on patterns of soil taxa presence [75,76].

Third, the soil-core taxa profiles in human samples varied depending on human community (a combination of subsistence, lifestyle and geography) and depending on lifestage (child, adolescent or adult) in patterns that are predicted from a hypothesis of environmental exposure-mediated bacterial incorporation. The samples from rural farmers contain the greatest abundance and richness of soil-core taxa, even more than the two HG communities included here. Again, though, there are technical limitations—fewer HG samples and fewer total reads from those samples. Still, more soil-core taxa found among rural farmer samples may make sense if these people must work with soil to cultivate plants for their livelihood. Predictably, samples from urban-living individuals are well below richness estimates of the other subsistence categories. Ordination clustering indicates that the composition of the soil-similar OTUs found in humans is driven by their social community category, which can be a rough proxy for subsistence. Regarding differences seen by lifestage, children (aged 0–10) had the highest richness by site as is expected because they tend to explore their environment more, and this proclivity transcends whether they were from rural or urban or hunter–gatherer groups. This universal pattern is likely a critical stimulus towards the development of microbiome diversity and host immune system training. Curiously though, the composition of soil-core taxa did not relate to lifestage, only the quantity and richness of taxa were significantly influenced by age.

Finally, sourcetracker analysis on the soil-core abundance tables derived from host gut microbiome data demonstrates that the soil-core component of host gut microbiomes are not aberrant environmental contamination. The identifiable components of soil-core profiles maximally represent soil ecosystems and negligibly resemble other host gut or skin microbiomes. When the soil-core profiles of the original global soil samples themselves were analysed with sourcetracker, the soil-core profiles were overwhelmingly identified as soil ecosystems and recapitulated their origin environments. If the sourcetracker analysis on the soil-core profiles from the original soil datasets had resulted in unmapped or unknown compositional identities, then the soil-core taxa may have been volatile environmental contaminants rather than actual conserved ubiquitous soil taxa.

On the basis of these findings, two exciting prospects come to light: (i) soil taxonomic information recovered from unknown host microbiomes may help identify an environmental setting in which the host may have lived, aiding the endeavours to learn about ancient host environment, physiology and microbial ecologies; and (ii) universal soil bacteria (or other global soil molecular traits) may have had a consistent role in guiding the development of the human microbiome and immune system throughout evolution despite human global dispersals.

Supplementary Material

Fig. S1
rstb20190577supp1.pdf (984.4KB, pdf)

Supplementary Material

Fig. S2
rstb20190577supp2.pdf (288.8KB, pdf)

Supplementary Material

Supplementary tables
rstb20190577supp3.xlsx (178.9KB, xlsx)

Acknowledgements

The author thanks Dwayne Geller for providing technical resources, Kai Staats for providing technical support and Andres Gomez for providing helpful feedback on early drafts of the results.

Data accessibility

Data accessibility is provided from accessions in the original manuscripts.

Competing interests

The author declares no competing interests.

Funding

This work was supported by the National Science Foundation SBE Postdoctoral Research Fellowship (award no. 1810060).

References

  • 1.Fierer N. 2017. Embracing the unknown: disentangling the complexities of the soil microbiome. Nat. Rev. Microbiol. 15, 579–590. ( 10.1038/nrmicro.2017.87) [DOI] [PubMed] [Google Scholar]
  • 2.Blum WE, Zechmeister-Boltenstern S, Keiblinger KM. 2019. Does soil contribute to the human gut microbiome? Microorganisms 7, 1–16. ( 10.3390/microorganisms7090287) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Sampedro L, Jeannotte R, Whalen J. 2006. Trophic transfer of fatty acids from gut microbiota to the earthworm Lumbricus terrestris. Soil Biol. Biochem. 38, 2188–2198. ( 10.1016/j.soilbio.2006.02.001) [DOI] [Google Scholar]
  • 4.Radek R. 1999. Flagellates, bacteria, and fungi associated with termites: diversity and function in nutrition—a review. Ecotropica 5, 183–196. [Google Scholar]
  • 5.Fierer N, et al. 2012. Cross-biome metagenomic analyses of soil microbial communities and their functional attributes. Proc. Natl Acad. Sci. USA 109, 21 390–21 395. ( 10.1073/pnas.1215210110) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Caliz J, Triado-Margarit X, Camarero L, Casamayor EO. 2018. A long-term survey unveils strong seasonal patterns in the airborne microbiome coupled to general and regional atmospheric circulations. Proc. Natl Acad. Sci. USA 115, 12 229–12 234. ( 10.1073/pnas.1812826115) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Copeland JK, Yuan L, Layeghifard M, Wang PW, Guttman DS. 2015. Seasonal community succession of the phyllosphere microbiome. Mol. Plant-Microbe Interact. 28, 274–285. ( 10.1094/MPMI-10-14-0331-FI) [DOI] [PubMed] [Google Scholar]
  • 8.Leung MHY, Lee PKH. 2016. The roles of the outdoors and occupants in contributing to a potential pan-microbiome of the built environment: a review. Microbiome 4, 21 ( 10.1186/s40168-016-0165-2) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Bokulich NA, et al. 2016. Antibiotics, birth mode, and diet shape microbiome maturation during early life. Sci. Transl. Med. 8, 343ra82 ( 10.1126/scitranslmed.aad7121) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Hehemann J-H, Correc G, Barbeyron T, Helbert W, Czjzek M, Michel G. 2010. Transfer of carbohydrate-active enzymes from marine bacteria to Japanese gut microbiota. Nature 464, 908–912. ( 10.1038/nature08937) [DOI] [PubMed] [Google Scholar]
  • 11.Pehrsson EC, et al. 2016. Interconnected microbiomes and resistomes in low-income human habitats. Nature 533, 212–216. ( 10.1038/nature17672) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Kurokawa K, et al. 2007. Comparative metagenomics revealed commonly enriched gene sets in human gut microbiomes. DNA Res. 14, 169–181. ( 10.1093/dnares/dsm018) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Liu L, Chen X, Skogerbø G, Zhang P, Chen R, He S, Huang D-W. 2012. The human microbiome: a hot spot of microbial horizontal gene transfer. Genomics 100, 265–270. ( 10.1016/j.ygeno.2012.07.012) [DOI] [PubMed] [Google Scholar]
  • 14.Schnorr SL. 2018. Meanings, measurements, and musings on the significance of patterns in human microbiome variation. Curr. Opin. Genet. Dev. 53, 43–52. ( 10.1016/j.gde.2018.06.014) [DOI] [PubMed] [Google Scholar]
  • 15.Sullivan AP, Bird DW, Perry GH. 2017. Human behaviour as a long-term ecological driver of non-human evolution. Nat. Ecol. Evol. 1, 65 ( 10.1038/s41559-016-0065) [DOI] [PubMed] [Google Scholar]
  • 16.Wall DH, Nielsen UN, Six J. 2015. Soil biodiversity and human health. Nature 528, 69–76. ( 10.1038/nature15744) [DOI] [PubMed] [Google Scholar]
  • 17.Mills JG, et al. 2019. Relating urban biodiversity to human health with the ‘holobiont’ concept. Front. Microbiol. 10, 550 ( 10.3389/fmicb.2019.00550) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Barberan A, Ladau J, Leff JW, Pollard KS, Menninger HL, Dunn RR, Fierer N. 2015. Continental-scale distributions of dust-associated bacteria and fungi. Proc. Natl Acad. Sci. USA 112, 5756–5761. ( 10.1073/pnas.1420815112) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Handt O, Höss M, Krings M, Pääbo S. 1994. Ancient DNA: methodological challenges. Experientia 50, 524–529. ( 10.1007/BF01921720) [DOI] [PubMed] [Google Scholar]
  • 20.Soucy SM, Huang J, Gogarten JP. 2015. Horizontal gene transfer: building the web of life. Nat. Rev. Genet. 16, 472–482. ( 10.1038/nrg3962) [DOI] [PubMed] [Google Scholar]
  • 21.Stagaman K, et al. 2018. Market integration predicts human gut microbiome attributes across a gradient of economic development. mSystems 3, e00122–17 ( 10.1128/mSystems.00122-17) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Schnorr SL, Sankaranarayanan K, Lewis CM, Warinner C. 2016. Insights into human evolution from ancient and contemporary microbiome studies. Curr. Opin. Genet. Dev. 41, 14–26. ( 10.1016/j.gde.2016.07.003) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Warinner C, et al. 2014. Pathogens and host immunity in the ancient human oral cavity. Nat. Genet. 46, 336–344. ( 10.1038/ng.2906) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Demarchi B, et al. 2016. Protein sequences bound to mineral surfaces persist into deep time. Elife 5, e17092 ( 10.7554/eLife.17092) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Weyrich LS, et al. 2017. Neanderthal behaviour, diet, and disease inferred from ancient DNA in dental calculus. Nature 544, 357–361. ( 10.1038/nature21674) [DOI] [PubMed] [Google Scholar]
  • 26.Salazar-García DC, Debono Spiteri C, Demarchi B.. 2013. Integrated novel applications for dietary reconstructions in Prehistory. The European Archaeologist 40, Winter.
  • 27.Spiteri CD, et al. 2016. Regional asynchronicity in dairy production and processing in early farming communities of the northern Mediterranean. Proc. Natl Acad. Sci. USA 113, 13 594–13 599. ( 10.1073/pnas.1607810113) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Tito RY, et al. 2012. Insights from characterizing extinct human gut microbiomes. PLoS ONE 7, e51146 ( 10.1371/journal.pone.0051146) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Søe MJ, et al. 2018. Ancient DNA from latrines in Northern Europe and the Middle East (500 BC–1700 AD) reveals past parasites and diet. PLoS ONE 13, e0195481 ( 10.1371/journal.pone.0195481) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Dietrich C, Köhler T, Brune A. 2014. The cockroach origin of the termite gut microbiota: patterns in bacterial community structure reflect major evolutionary events. Appl. Environ. Microbiol. 80, 2261–2269. ( 10.1128/AEM.04206-13) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Moeller AH, Li Y, Mpoudi Ngole E, Ahuka-Mundeke S, Lonsdorf EV, Pusey AE, Peeters M, Hahn BH, Ochman H. 2014. Rapid changes in the gut microbiome during human evolution. Proc. Natl Acad. Sci. USA 111, 16 431–16 436. ( 10.1073/pnas.1419136111) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Li H, et al. 2016. Age polyethism drives community structure of the bacterial gut microbiota in the fungus-cultivating termite Odontotermes formosanus. Environ. Microbiol. 18, 1440–1451. ( 10.1111/1462-2920.13046) [DOI] [PubMed] [Google Scholar]
  • 33.Ayeni FA, et al. 2018. Infant and adult gut microbiome and metabolome in rural Bassa and urban settlers from Nigeria. Cell Rep. 23, 3056–3067. ( 10.1016/j.celrep.2018.05.018) [DOI] [PubMed] [Google Scholar]
  • 34.Dubois G, Girard C, Lapointe F-J, Shapiro JB. 2017. The Inuit gut microbiome is dynamic over time and shaped by traditional foods. Microbiome 5, 1–2. ( 10.1186/s40168-017-0370-7) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Li H, Li T, Li X, Wang G, Lin Q, Qu J. 2018. Gut microbiota in Tibetan herdsmen reflects the degree of urbanization. Front. Microbiol. ( 10.3389/fmicb.2018.01745) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Obregon-Tito AJ, et al. 2015. Subsistence strategies in traditional societies distinguish gut microbiomes. Nat. Commun. 6, 1–9. ( 10.1038/ncomms7505) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Pires ES, et al. 2019. The gut microbiome and metabolome of two riparian communities in the Amazon. Front. Microbiol. 10, 2003 ( 10.3389/fmicb.2019.02003) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Schnorr SL, et al. 2014. Gut microbiome of the Hadza hunter-gatherers. Nat. Commun. 5, 3654 ( 10.1038/ncomms4654) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Smits SA, et al. 2017. Seasonal cycling in the gut microbiome of the Hadza hunter-gatherers of Tanzania. Science 357, 802–806. ( 10.1126/science.aan4834) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Bourguignon T, Lo N, Dietrich C, Sobotnik J, Sidek S, Roisin Y, Brune A, Evans TA. 2018. Rampant host switching shaped the termite gut microbiome. Curr. Biol. 28, 1–6. ( 10.1016/j.cub.2018.01.035) [DOI] [PubMed] [Google Scholar]
  • 41.Chew YM, Lye S, Salleh MM, Yahya A. 2018. 16S rRNA metagenomic analysis of the symbiotic community structures of bacteria in foregut, midgut, and hindgut of the wood-feeding termite Bulbitermes sp. Symbiosis 76, 187–197. ( 10.1007/s13199-018-0544-5) [DOI] [Google Scholar]
  • 42.Köhler T, Dietrich C, Scheffrahn RH, Brune A. 2012. High-resolution analysis of gut environment and bacterial microbiota reveals functional compartmentation of the gut in wood-feeding higher termites (Nasutitermes spp.). Appl. Environ. Microbiol. 78, 4691–4701. ( 10.1128/AEM.00683-12) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Schnorr SL, Hofman CA, Netshifhefhe SR, Duncan FD, Honap TP, Lesnik J, Lewis CM. 2019. Taxonomic features and comparisons of the gut microbiome from two edible fungus-farming termites (Macrotermes falciger; M. natalensis) harvested in the Vhembe district of Limpopo, South Africa. BMC Microbiol. 19, 164 ( 10.1186/s12866-019-1540-5) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Delgado-Baquerizo M, Oliverio AM, Brewer TE, Benavent-González A, Eldridge DJ, Bardgett RD, Maestre FT, Singh BK, Fierer N. 2018. A global atlas of the dominant bacteria found in soil. Science 359, 320–325. ( 10.1126/science.aap9516) [DOI] [PubMed] [Google Scholar]
  • 45.Edgar RC. 2013. UPARSE: highly accurate OTU sequences from microbial amplicon reads. Nat. Methods 10, 996–998. ( 10.1038/nmeth.2604) [DOI] [PubMed] [Google Scholar]
  • 46.Rognes T, Flouri T, Nichols B, Quince C, Mahe F. 2016. VSEARCH: a versatile open source tool for metagenomics. PeerJ 4, e2584 ( 10.7717/peerj.2584) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Moeller AH, Peeters M, Ndjango J-B, Li Y, Hahn BH, Ochman H. 2013. Sympatric chimpanzees and gorillas harbor convergent gut microbial communities. Genome Res. 23, 1715–1720. ( 10.1101/gr.154773.113) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Yatsunenko T, et al. 2012. Human gut microbiome viewed across age and geography. Nature 486, 222–227. ( 10.1038/nature11053) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Makonde HM, Mwirichia R, Oslemo Z, Boga HI, Klenk H-P. 2015. 454 Pyrosequencing-based assessment of bacterial diversity and community structure in termite guts, mounds and surrounding soils. Springerplus 4, 471 ( 10.1186/s40064-015-1262-6) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Knights D, Kuczynski J, Charlson ES, Zaneveld J, Mozer MC, Gollman RG, Bushman FD, Knight R, Kelley ST. 2011. Bayesian community-wide culture-independent microbial source tracking. Nat. Methods 8, 761–763. ( 10.1038/nmeth.1650) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Hansen ME, et al. 2019. Population structure of human gut bacteria in a diverse cohort from rural Tanzania and Botswana. Genome Biol. 20, 1–21. ( 10.1186/s13059-018-1616-9) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Jha AR, et al. 2018. Gut microbiome transition across a lifestyle gradient in Himalaya. PLoS Biol. 16, e2005396 ( 10.1371/journal.pbio.2005396) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Martinez I, Stegen JC, Maldonado-Gomez M, Eren AM, Siba PM, Greenhill AR, Walter J. 2015. The gut microbiota of rural Papua New Guineans: composition, diversity patterns, and ecological processes. Cell Rep. 11, 527–538. ( 10.1016/j.celrep.2015.03.049) [DOI] [PubMed] [Google Scholar]
  • 54.Perez-Perez GI, Gao Z, Jourdain R, Ramirez J, Gany F, Clavaud C, Demaude J, Breton L, Blaser MJ. 2016. Body site is a more determinant factor than human population diversity in healthy skin microbiome. PLoS ONE 11, e0151990 ( 10.1371/journal.pone.0151990) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Leff JW, et al. 2015. Consistent responses of soil microbial communities to elevated nutrient inputs in grasslands across the globe. Proc. Natl Acad. Sci. USA 112, 10 967–10 972. ( 10.1073/pnas.1508382112) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Quast C, Pruesse E, Yilmaz P, Gerken J, Schweer T, Yarza P, Peplies J, Glockner F. 2013. The SILVA ribosomal RNA gene database project: improved data processing and web-based tools. Nucleic Acids Res. 41, D590–D596. ( 10.1093/nar/gks1219) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Caporaso JG, et al. 2010. QIIME allows analysis of high-throughput community sequencing data. Nat. Methods 7, 335–336. ( 10.1038/nmeth0510-335) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.R Core Team. 2017. R: a language and environment for statistical computing. https://www.R-project.org/.
  • 59.Oksanen J, et al. 2013. vegan: Community ecology package. R package version. 2.0-10. CRAN.
  • 60.Wickham H. 2016. Flexibly Reshape Data: a reboot of the Reshape Package. https://github.com/hadley/reshape.
  • 61.Culhane AC, Thioulouse J, Perrière G, Higgins DG. 2005. MADE4: an R package for multivariate analysis of gene expression data. Bioinformatics 21, 2789–2790. ( 10.1093/bioinformatics/bti394) [DOI] [PubMed] [Google Scholar]
  • 62.Quinn T, Richardson MF, Lovell D, Crowley T. 2017. propr: An R-package for identifying proportionally abundant features using compositional data analysis. Sci. Rep. 7, 1–9. ( 10.1038/s41598-017-16520-0) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Wickham H. 2016. Ggplot2: elegant graphics for data analysis. New York, NY: Springer-Verlag. [Google Scholar]
  • 64.Zeileis A, Fisher JC, Hornik K, Ihaka R, McWhite CD, Murrell P, Stauffer R, Wilke CO. 2019. colorspace: A toolbox for manipulating and assessing colors and palettes. arXiv 1903.06490.
  • 65.Wickham H, Seidel D.. 2019. Scales: scale functions for visualization. R package version 1.1. 0.
  • 66.Adler D, Kelly ST. 2020. vioplot: violin plot. R package version 0.3.5. https://github.com/TomKellyGenetics/vioplot.
  • 67.Mikaelyan A, Dietrich C, Köhler T, Poulsen M, Sillam-Dusses D, Brune A. 2015. Diet is the primary determinant of bacterial community structure in the guts of higher termites. Mol. Ecol. 24, 5284–5295. ( 10.1111/mec.13376) [DOI] [PubMed] [Google Scholar]
  • 68.Crittenden AN, Schnorr SL. 2017. Current views on hunter-gatherer nutrition and the evolution of the human diet. Am. J. Phys. Anthropol. 162, 84–109. ( 10.1002/ajpa.23148) [DOI] [PubMed] [Google Scholar]
  • 69.Moeller AH, et al. 2016. Cospeciation of gut microbiota with hominids. Science 353, 380–382. ( 10.1126/science.aaf3951) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Ley RE, Peterson DA, Gordon JI. 2006. Ecological and evolutionary forces shaping microbial diversity in the human intestine. Cell 124, 837–848. ( 10.1016/j.cell.2006.02.017) [DOI] [PubMed] [Google Scholar]
  • 71.Ruby EG. 1996. Lessons from a cooperative bacterial-animal association: the Vibrio fischeri–Euprymna scolopes light organ symbiosis. Annu. Rev. Microbiol. 50, 591–624. ( 10.1146/annurev.micro.50.1.591) [DOI] [PubMed] [Google Scholar]
  • 72.Leuthold RH, Badertscher S, Imboden H. 1989. The inoculation of newly formed fungus comb with Termitomyces in Macrotermes colonies (Isoptera, Macrotermitinae). Insectes Soc. 36, 328–338. ( 10.1007/BF02224884) [DOI] [Google Scholar]
  • 73.Liu W, Zhang J, Wu C, Cai S, Huang W, Chen J. 2016. Unique features of ethnic Mongolian gut microbiome revealed by metagenomic analysis. Sci. Rep. 6, 1–11. ( 10.1038/srep34826) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Hohmann G, Potts K, N'Guessan A, Fowler A, Mundry R, Ganzhorn JU, Ortmann S. 2010. Plant foods consumed by Pan: exploring the variation of nutritional ecology across Africa. Am. J. Phys. Anthropol. 141, 476–485. ( 10.1002/ajpa.21168) [DOI] [PubMed] [Google Scholar]
  • 75.Amato KR, et al. 2019. Convergence of human and Old World monkey gut microbiomes demonstrates the importance of human ecology over phylogeny. Genome Biol. 20, 201 ( 10.1186/s13059-019-1807-z) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Wrangham RW, Conklin-Brittain NL. 1998. Dietary response of chimpanzees and cercopithecines to seasonal variation in fruit abundance. I. Antifeedants. Int. J. Primatol. 19, 949–970. ( 10.1023/A:1020318102257) [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Fig. S1
rstb20190577supp1.pdf (984.4KB, pdf)
Fig. S2
rstb20190577supp2.pdf (288.8KB, pdf)
Supplementary tables
rstb20190577supp3.xlsx (178.9KB, xlsx)

Data Availability Statement

Data accessibility is provided from accessions in the original manuscripts.


Articles from Philosophical Transactions of the Royal Society B: Biological Sciences are provided here courtesy of The Royal Society

RESOURCES