Skip to main content
Frontiers in Microbiology logoLink to Frontiers in Microbiology
. 2014 Jun 19;5:298. doi: 10.3389/fmicb.2014.00298

Communities of microbial eukaryotes in the mammalian gut within the context of environmental eukaryotic diversity

Laura Wegener Parfrey 1,*,, William A Walters 2, Christian L Lauber 3, Jose C Clemente 1,, Donna Berg-Lyons 1, Clotilde Teiling 4,, Chinnappa Kodira 4,, Mohammed Mohiuddin 4, Julie Brunelle 4, Mark Driscoll 4, Noah Fierer 3,5, Jack A Gilbert 6,7, Rob Knight 1,8
PMCID: PMC4063188  PMID: 24995004

Abstract

Eukaryotic microbes (protists) residing in the vertebrate gut influence host health and disease, but their diversity and distribution in healthy hosts is poorly understood. Protists found in the gut are typically considered parasites, but many are commensal and some are beneficial. Further, the hygiene hypothesis predicts that association with our co-evolved microbial symbionts may be important to overall health. It is therefore imperative that we understand the normal diversity of our eukaryotic gut microbiota to test for such effects and avoid eliminating commensal organisms. We assembled a dataset of healthy individuals from two populations, one with traditional, agrarian lifestyles and a second with modern, westernized lifestyles, and characterized the human eukaryotic microbiota via high-throughput sequencing. To place the human gut microbiota within a broader context our dataset also includes gut samples from diverse mammals and samples from other aquatic and terrestrial environments. We curated the SILVA ribosomal database to reflect current knowledge of eukaryotic taxonomy and employ it as a phylogenetic framework to compare eukaryotic diversity across environment. We show that adults from the non-western population harbor a diverse community of protists, and diversity in the human gut is comparable to that in other mammals. However, the eukaryotic microbiota of the western population appears depauperate. The distribution of symbionts found in mammals reflects both host phylogeny and diet. Eukaryotic microbiota in the gut are less diverse and more patchily distributed than bacteria. More broadly, we show that eukaryotic communities in the gut are less diverse than in aquatic and terrestrial habitats, and few taxa are shared across habitat types, and diversity patterns of eukaryotes are correlated with those observed for bacteria. These results outline the distribution and diversity of microbial eukaryotic communities in the mammalian gut and across environments.

Keywords: protist, microbial ecology, microbial diversity, salinity, host-associated eukaryotes, parasites, intestinal protozoa, human microbiome

Introduction

A rich understanding of the distribution of microbial diversity across environments has emerged from high-throughput sequencing studies in the past decade. These studies have described many spatial and temporal patterns of variability within environments and have defined the major divisions in microbial community composition (Nemergut et al., 2013). Salinity represents the primary division among environmental samples for bacterial and archaeal communities (Lozupone and Knight, 2007; Auguet et al., 2010; Wang et al., 2011), while the vertebrate gut has the most distinct bacterial communities (Ley et al., 2008b). Studies characterizing microbial diversity deeply across hundreds to thousands of samples are now common for bacteria (e.g., the Human Microbiome Project, the Earth Microbiome Project, MetaHIT), but are just beginning for microbial eukaryotes (Tara Oceans, ICOMM, BioMarks). As a result, progress characterizing the distribution of protist diversity lags behind our knowledge of bacteria, but morphological surveys (Larsen and Patterson, 1990; Patterson, 1996; Foissner, 2006; Weisse, 2008) combined with recent molecular data (Amaral-Zettler et al., 2009; Caron, 2009; Baldwin et al., 2013; Bates et al., 2013) provide a foundation of knowledge on the biogeography of protists across environments.

Our understanding of the diversity and function of host-associated microbial communities has grown exponentially in recent years, fueled by high-throughput sequencing and motivated by the realization that microbes have a profound influence on their host (McFall-Ngai et al., 2013; Sommer and Backhed, 2013). There are many commonalities in the bacterial taxa that comprise the microbiota across mammals, with the phyla Bacteroidetes and Firmicutes being predominant components (Ley et al., 2008b; Muegge et al., 2011). Overall, the mammalian gut harbors lower bacterial diversity and fewer phyla-level taxa than other environments (Ley et al., 2006). Across mammals, microbiota composition varies according to host phylogeny and diet (Ley et al., 2008b; Russell et al., 2014), and the composition of the human microbiota resembles that of our primate relatives (Ley et al., 2008b). Within humans gut microbiota is influenced by diet, health status, and age (Fierer et al., 2012; Lozupone et al., 2012). In addition, adoption of a western lifestyle, characterized by diets rich in processed food, antibiotic usage, and hygienic habits, has a particularly strong influence on the microbiota (De Filippo et al., 2010; Yatsunenko et al., 2012; Ursell et al., 2013). Diversity of the human bacterial microbiota has clearly declined in Western populations compared to populations with traditional agrarian lifestyles (De Filippo et al., 2010; Cho and Blaser, 2012; Lozupone et al., 2012; Yatsunenko et al., 2012).

Progress characterizing the eukaryotic component of the mammalian microbiome lags behind bacteria because high-throughput sequencing based investigations into the diversity of the mammalian microbiota have focused almost exclusively on bacteria (Parfrey et al., 2011; Andersen et al., 2013). The mammalian intestinal tract is home to many eukaryotes, including animals (e.g., helminths) and protists (e.g., amoebae and flagellates), and these taxa have been investigated for decades from a parasitological point of view with microscopy and targeted molecular approaches (Bogitsh et al., 2005). Studies of the eukaryotic component of the mammalian microbiota from a community perspective are beginning to come online, though many questions remain to be investigated (Andersen et al., 2013). Although sample sizes are generally small to date, these studies have shown that anaerobic fungi are dominant in mice (Scupham et al., 2006). Western human fecal communities include Blastocystis (Scanlan and Marchesi, 2008) and fungi (Dollive et al., 2012), while a survey of a single African individual revealed higher microbial eukaryote diversity (Hamad et al., 2012). The diversity of the eukaryotic microbiota in the human gut has not yet been systematically investigated from a community perspective in non-western populations. These populations provide an important perspective for understanding the eukaryotic microbiota that humans have co-evolved with over millions of years.

Eukaryotic microbes in the gut are generally considered parasites, and have long been recognized to contribute to host morbidity and mortality (Bogitsh et al., 2005). However, many are commensal (Bogitsh et al., 2005), or play beneficial roles as probiotics (McFarland and Bernasconi, 1993) or cellulose degraders (Kittelmann and Janssen, 2011). Further, increasing evidence suggests that eliminating the diverse microbial community that co-evolved with mammals over millions of years is detrimental to host health (Cho and Blaser, 2012; Lozupone et al., 2012), in support of the Old Friends Hypothesis (or hygiene hypothesis) (Rook, 2012). Eukaryotic microbes were part of our ancestral gut community and intestinal helminths were nearly universal (Goncalves et al., 2003). In humans, the transition to modern lifestyles is associated with dramatically lower diversity and prevalence of intestinal helminths, and with a rise in the prevalence of autoimmune disease (Rook, 2012). Yet, we know little about their role in healthy people. Recent analyses of common protists in the gut suggests that they may be part of the healthy microbiota in humans (Petersen et al., 2013).

Here, we use high-throughput sequencing to characterize eukaryotic communities found in the vertebrate gut from a diverse collection of mammalian fecal samples, including humans from the US and from remote communities in Malawi. To provide a broader context for understanding of the diversity of microbial eukaryotes in the gut, we also characterized a collection of samples from a wide range of other environments, including human skin, marine water, freshwater, soil, and air. The bacterial communities in these samples were also characterized to enable comparison of eukaryotic and bacterial biodiversity. In order to gain deeper insight into the distribution of eukaryotic diversity, we curated the SILVA reference database (Pruesse et al., 2007) so that both the taxonomy assigned to reference sequences and the phylogenetic tree constructed from these reference sequences reflects current knowledge. Eukaryotic environmental sequences are placed within this explicit phylogenetic context and assess the distribution of eukaryotic clades across environments.

Methods

Sample set

We selected 185 samples that span a wide range of environments in order to assess broad patterns in eukaryotic communities (Table S1). The dataset analyzed here was chosen to include individuals from geographically diverse populations with contrasting lifestyles to enable testing the hypothesis that the transition to modern, highly hygienic lifestyles are correlated with low levels of diversity of eukaryotic microbes. We included samples from 23 individuals that reside in agrarian communities in Malawi that follow traditional lifestyles and 16 samples from 13 individuals residing in the US (Boulder, CO and Philadelphia, PA) and follow modern lifestyles (Table 1). Three individuals from Boulder were sampled at two time points 2 months apart (Costello et al., 2009). The US populations live in urban or suburban areas, consumed typical western diets, and did not report any health problems at the time of sampling (Costello et al., 2009; Yatsunenko et al., 2012). Individuals from populations in Malawi ate diets rich in maize, legumes, and other plants (Table S1 from Yatsunenko et al., 2012) and were healthy and well-nourished at the time of sampling (Yatsunenko et al., 2012; Smith et al., 2013). These samples have been described in detail previously and bacterial diversity was previously reported (Costello et al., 2009; Yatsunenko et al., 2012; Smith et al., 2013). In addition, we included 22 samples from other mammals, also previously described and characterized for bacteria (Ley et al., 2008a; Muegge et al., 2011), to gain insight into the diversity of eukaryotic human microbiota relative to other mammals. Collection of the human fecal samples for these previously published studies was done according to protocols approved by Human Research Committees at the institutions involved which allow samples to be used for further research. De-identified DNA was sent to the University of Colorado for amplification. Collection of skin and oral samples was approved by the University of Colorado Human Research Committee (protocol 0109.23), which allows the samples to be used for further research. Finally, we included samples from wide variety of environments, many of which have been previously characterized for bacterial or fungal communities (Table S1). These include air sampled over terrestrial environments (Bowers et al., 2011a, 2012), soil (Lauber et al., 2009; Ramirez et al., 2010; Eilers et al., 2012), freshwater (Shade et al., 2012), marine water, lichens (Bates et al., 2011), leaf litter (McGuire et al., 2012), and human oral and skin samples (Costello et al., 2009; Verhulst et al., 2011). The sequence data and MiMARKs (Yilmaz et al., 2011) compliant metadata is available for this study at the QIIME database (http://www.microbio.me/qiime/: study #1519 for eukaryotes and #1517 for bacteria) and at EBI (accession numbers ERP006039 and ERP005135).

Table 1.

Human fecal samples.

Sample name Villagea Age (years) Original studyb Total seqs Filtered seqsc Blastocystis Entamoeba
h101M M: Mbiza 24.4 2 1041 982 ST3 coli, hartmanii
k57B.6Post M: M: Mbiza 2.0 1 649 638 ST3 coli
h101A.4 M: Mbiza 2.3 2 719 546 ST3
h101B.4 M: Mbiza 2.3 2 901 521 ST3
k84M M: Mayaka 30.6 1 821 493 ST1, ST3 coli, dispar, hartmanii
h186M M: Mayaka 31.6 2 400 367 ST1, ST3 coli, hartmanii
k26M.1 M: Mitondo 29.5 1 929 361 ST3 coli, dispar, histolytica, hartmanii
h186A.1 M: Mayaka 2.0 2 1024 305 coli
h146B.2 M: Mayaka 1.7 2 319 261 ST2, ST3 hartmanii
h146M M: Mayaka 33.5 2 246 233 ST1, ST3 hartmanii
m55M M: Mbiza adult 1 277 229 ST1, ST2 coli, hartmanii
k57M M: Mbiza 30.8 1 233 212 ST1, ST2, ST3 coli
k80M M: Mayaka 27.2 1 256 168 ST1 coli
m55S M: Mbiza child 1 263 165 ST2 coli, hartmanii
h47M M: Chamba adult 2 526 144 ST2 coli
k80A.7 M: Mayaka 1.9 1 285 118 dispar, histolytica
k84A.1 M: Mayaka 0.9 1 774 45 ST1, ST2, ST3
h186B.1 M: Mayaka 2.0 2 900 44 hartmanii
h146A.2 M: Mayaka 1.7 2 935 10 ST3 hartmanii
h18A.3 M: Chamba 1.1 2 1028 8 coli
h47A.1 M: Chamba 0.6 2 1032 6
h47B.1 M: Chamba 0.6 2 400 3
h18B.5 M: Chamba 1.6 2 193 1
USBldChld5 U: Boulder 3 2 485 123 coli
USchp60Mom U: Philadelphia 33 2 1006 47
USchp18Child U: Philadelphia 3 2 977 35
USBldChld8 U: Boulder 1.6 2 671 29 coli
M22Fcsw U: Boulder adult 3 935 14
USBldChld4 U: Boulder 6 2 1159 7
M21Fcsw U: Boulder adult 3 825 5
USBldChld10 U: Boulder 1.3 2 913 1
USBldChld2 U: Boulder 4.5 2 492 0
USchp33ChildA U: Philadelphia 5 2 378 0
USchp33Mom U: Philadelphia 45 2 781 0
F11Fcsw U: Boulder adult 3 139 0
M11Fcsw U: Boulder adult 3 502 0
M23Fcsw U: Boulder adult 3 156 0
M24Fcsw U: Boulder adult 3 221 0
M31Fcsw U: Boulder adult 3 269 0
a

Country where village is located: M, Malawi and U, USA.

b

Original study: 1 = Smith et al., 2013; 2 = Yatsunenko et al., 2012; 3 = Costello et al., 2009.

c

Filtered sequences have the following removed: Bacteria, Archaea, non-18S rDNA, mammalian DNA, plants.

Microbial community characterization

Sequences were PCR amplified with primers 515f and 1119r (Bates et al., 2012). The forward primer 515f (5' GTGCCAGCMGCCGCGGTAA 3') is 3-domain universal and 1119r (5' GGTGCCCTTCCGTCA 3') is targeted toward eukaryotes. Primer specificity to eukaryotes and predicted amplification efficiency of eukaryotic lineages was assessed with the taxa coverage module in PrimerProspector (Walters et al., 2011). This program assesses the complementarity between the primer sequence and a reference database, in this case SILVA 111, and assigns a score based on the number of mismatches or gaps between the primer sequence and the reference, and mismatches as the 3' end of the primer are more heavily penalized (http://pprospector.sourceforge.net/tutorial.html). Taxa coverage was assessed at three thresholds corresponding to three levels of specificity (Table S2). A threshold of 0.5 is predicted to generate efficient amplification and allows up to one mismatch at the 5' end of the primer. The threshold of 1 allows one mismatch at the 3' end of the primer or two mismatches in other primer regions, and threshold 2 allows 2–5 mismatches at the 3' or 5' ends of the primer respectively and amplification is expected to be poor or non-existent. This primer pair has high predicted specificity to eukaryotes, matching 86–90% of eukaryotic sequences but less than 0.5% of bacterial and archaeal sequences at a threshold of 0.5 and 1, respectively (Table S2). Many of the taxa expected to be in the mammalian gut based on parasitological studies are predicted to amplify well, including Dientamoeba, Entamoeba, Blastocystis, Balantidium, parabasalids, and nematodes (Table S2). However, there are two mismatches between the Giardia 18S sequence and the reverse primer suggesting a low efficiency (Table S2).

DNA was extracted with the MoBio PowerSoil kit following EMP standard protocols. PCR amplification was done in triplicate with an annealing temperature of 50C for 40 cycles. These permissive conditions were used to amplify the broadest range of eukaryotic taxa. Quantitation and pooling were done according to EMP standard protocols. The final pool was sent to Roche Core Facility. The libraries were amplified, sequenced and processed at the Roche Core Facility. Amplification was done according to the emPCR Amplification Method Manual—Lib-A LV GS FLX Titanium Series with the following edits for long amplicons. Using the Titanium Lib-A emPCR kit, the emulsions were made with A beads and A amp primers only and the following reagents: 1050 μL MBGW, 1500 μL emPCR additive, 860 μL 5× amplification mix, 300 μL Primer (A), 200 μL Enzyme mix, and 5 μL PPiase. The cycling conditions were 4 min at 94C followed by 50 cycles of 30 s at 94C and 10 min at 60C, ending with a hold at 10C. The library was then run as a standard XL+ run. This FLX+ run was sequenced with the standard flow order (400 cycles of TACG nucleotide flows), following the instructions in the Sequencing Method Manual—GS FLX+ Series—XL+ kit, as can be found on the www.my454.com website.

Data processing and quality filtering

Data processing was done at the Roche Core Facility according to the GS FLX System Software Manual modified to optimize performance for metagenomic amplicon sequences. In order to generate high quality data for amplicons metagenomic applications, the default pipeline was tuned to meet the data quality requirements of the QIIME pipeline. The data was processed using 26amp_sl1000 pipeline which has the following tuning steps modified: (1) vfScanLimit was increased from the default of 700 to 1000, (2) the valley filter setting vfTrimBackScaleFactor was increased from the default value by a factor of 0.5, and (3) the quality filter setting QscoreTrimFactor was modified from the default value to a more stringent value. The Amplicon pipeline template was used to generate the modified pipeline XML file with the rCAFIE algorithm turned on.

Usearch version 6.1 was used to screen sequence for chimeras (Edgar, 2010). Sequences were additionally filtered for quality using split_libraries within QIIME version 1.5.0 (Caporaso et al., 2010b). Quality filtering excluded sequences with an average quality score of 25 or lower, reads longer than 1200 bp or shorter than 200 bp and reads with more than 5 ambiguous bases. We found that sequence quality dropped off significantly toward the end of the read, so we employed a strategy truncating sequences when quality scores that fell below 25 in a sliding window of 50 bp. These truncated reads were retained as long as they passed other quality filters and these averaged 444 bp in length.

In order to quantify concordance in the diversity patterns of bacterial and eukaryotic communities we sequenced the bacterial communities as well as the eukaryotic communities. Bacteria were sequenced with the 515f/806r primers (Walters et al., 2011) on the Illumina GAIIx platform at Washington University. Bacterial data was processed using standard protocols within the QIIME database (www.microbio.me/qiime). Archaea are also amplified with this primer set, but were excluded from the analysis in order to focus on the eukaryote to bacteria comparison and because there were too few Archaea OTUs for meaningful comparison. Low abundance OTUs, those containing less than 0.05% of the total reads in the dataset, were filtered out as recommended for Illumina sequence data (Bokulich et al., 2013). The samples were filtered to only include those 113 samples that had at least 150 sequences per samples in the eukaryotic data, and of these, samples with fewer than 3000 sequences were excluded from the analysis. The full dataset was used for taxon-based analyses and all samples were rarefied to 3000 sequences per sample for diversity analyses.

OTU picking and taxonomy assignment

Eukaryotic sequence reads from the 454 FLX+ run were clustered into OTUs with a 97% similarity threshold, which was chosen to minimize the impact of sequencing error in inflating OTU numbers (Stoeck et al., 2010; Bates et al., 2013). Reads were clustered into OTUs according to the open reference protocol (http://qiime.org/tutorials/open_reference_illumina_processing.html) using UCLUST (Edgar, 2010) within QIIME. This involves first clustering reads against the curated SILVA 108 eukaryotic database clustered at 97%, and these OTUs inherited the reference taxonomy. Sequences that failed to assign to the reference dataset were then clustered at 97% de novo with UCLUST. Taxonomy was assigned to these de novo sequences in one of two ways in order to maximize the taxonomic information and reliability. First, taxonomy was assigned using BLAST against the SILVA 108 97% reference database with an e-value cutoff of e-100. In cases where the e-value was less than e-100 taxonomy was assigned using the RDP classifier trained with the SILVA 108 97% reference set at genus level. Taxonomy assignments were also confirmed in using the PR2 reference database (Guillou et al., 2013). The resulting OTUs were filtered to exclude bacteria, archaea, vertebrates (thus removing host DNA), and plants (to exclude dietary sources) as well as non-SSU rDNA sequences. Finally, singleton sequences were excluded from the analysis to reduce the likelihood of including PCR and sequencing artifacts. After filtering, we excluded samples from further analysis that had fewer than 150 eukaryotic sequences/sample. This left 3883 OTUs from 113 samples (out of 185 total samples), corresponding to 84,576 sequences. Downstream diversity analyses used data rarefied to 150 sequences per sample, and taxonomy plots used the full dataset. In order to take full advantage of this dataset we assessed the taxonomic composition of human gut samples falling below the 150 sequences per sample threshold. In this case, a taxon (OTU) was considered present if the OTU was represented by least 5 sequences in the sample in question.

Although 150 sequences per sample is a low number by high-throughput sequencing standards, this sequencing depth adequately captures the diversity present (Figure S1). Direct comparison of numbers of bacterial and eukaryotic taxa is not possible because two different sequencing platforms were used here and the number of sequences per sample is much lower for eukaryotes. However, we can compare the relative differences in alpha diversity between sample types for eukaryotes and bacteria respectively, and sequencing depth for both domains adequately sample diversity. Rarefaction curves of Faith's Phylogenetic Diversity metric level off by 150 sequences per sample, particularly for host-associated samples (Figure S1). Similarly, we have adequate sampling of bacterial diversity and rarefaction curves are leveling off by 3000 sequences per sample for host-associated samples (Figure S1).

A phylogenetic tree reflecting the current understanding of eukaryotic relationships was constructed using the curated SILVA alignment as a template and the SILVA 108 tree as a constraint on the backbone relationships (see SILVA curation below). The representative set of sequences from this study was first aligned to the SILVA 108 97% representative set with PyNAST (Caporaso et al., 2010a). Representative sequences for each of the 3883 OTUs that aligned to the SILVA reference alignment were used to build a phylogenetic tree for diversity analysis and to assess patterns of phylogenetic groups by environment. The resulting alignment was dynamically filtered to remove the 10% most entropic positions and positions with greater than 95% gaps. This alignment was then used to build a phylogenetic tree with the topology constrained to the SILVA 108 97% tree (see below) in RAxML (Stamatakis, 2006). This tree was used for visualization in TopiaryExplorer (Pirrung et al., 2011), which allows branches to be colored according to sample metadata or taxonomy. The p-test from Martin (2002) and UniFrac test (Lozupone and Knight, 2005) were performed on the tree to assess whether the distribution of sequences from particular environments across the tree were significantly different than random, implemented in the beta_significance script within QIIME. In order to visually compare the diversity in the vertebrate gut to other environments, we filtered the tree to include equal sample numbers and equal (rarefied) sequences per sample. This was done by first filtering the OTU table to include the 32 fecal samples with more than 150 sequences per sample and a subsampled set of 32 environmental samples spanning the range of environments, and then rarefied to 150 sequences per sample for both eukaryotic 18S and bacterial 16S. This normalized OTU table was used to filter tips from the 16S and 18S trees.

Diversity analyses were carried out in QIIME using data rarefied to 150 sequences per sample for eukaryotes and 3000 sequences per sample for bacteria. The differences in rarefaction level are a result of the different sequencing platforms used for these datasets. Phylogenetically informed analyses of alpha and beta diversity [phylogenetic distance and unweighted UniFrac (Lozupone and Knight, 2005), respectively] utilized the tree described above. Non-phylogenetic beta diversity metrics performed poorly because very few OTUs were found across multiple sample types (Table 2). Unweighted UniFrac distance matrices were used in Analysis of variance tests (ANOSIM) to assess statistical differences across environments within QIIME. To assess the impact of unbalanced numbers of samples across habitat types, we randomly subsampled the dataset to include equal numbers of samples from each environment and then recalculated diversity metrics and performed ANOSIM tests. This procedure was repeated 1000 times. We visualized the differences in beta-diversity across sample types with non-metric multidimensional scaling (NMDS) plots, which were constructed in the software Primer E (Clarke and Gorley, 2006).

Table 2.

Proportion of shared eukaryotic OTUs.*

Environment Fecal Skin Terrestrial Freshwater Marine
Fecal 190 1 3 1 0
Skin 1 68 34 6 1
Terrestrial 3 34 1796 80 2
Freshwater 1 6 80 354 4
Marine 0 1 2 4 482
Total OTUs 190 68 1796 354 482
% Unique 97% 38% 93% 74% 99%
*

Calculations were done based on the full dataset, and exclude fungi. Fungi have low taxonomic resolution for 18S rRNA (Schoch et al., 2012), thus shared fungal 97% OTUs may be quite divergent.

We took advantage of the long sequence reads from the 454 FLX+ to further investigate the phylogenetic position of Entamoeba and Blastocystis, the two most common taxa detected in the gut. We aligned Entamoeba and Blastocystis representative sequences to the reference taxa from the PR2 database, and then constructed maximum likelihood phylogenies with RAxML. These trees were constrained to the reference phylogeny for these clades, which was derived from the literature (Stensvold et al., 2011; Alfellani et al., 2013). The placement of Entamoeba and Blastocystis sequences was used to confirm the taxonomic identities of these OTUs (Table 1).

Curation of the SILVA eukaryotic database

The SILVA 108 ribosomal database (Pruesse et al., 2007) was downloaded from SILVA (http://www.arb-SILVA.de/). Sequences were initially filtered to remove unclassified environmental sequences. The remaining ~55,000 sequences were dereplicated by clustering at 97% with UCLUST, resulting in ~11,000 sequences. A representative set was then chosen for these OTUs based on the longest sequence. The filtered out environmental sequences were then clustered against the representative set of 97% OTUs using UCLUST ref within QIIME. Those sequences that did not match the reference dataset were then clustered at 97% de novo and the longest representative sequence chosen for each cluster. This resulted in a final SILVA eukaryotic 97% representative set with 14,236 sequences.

The 97% reference dataset was aligned with PyNAST (Caporaso et al., 2010a) in QIIME with a threshold of 70% similarity and a template alignment from Katz et al. (2011) [TreeBase study 11336, matrix M8584; (Katz et al., 2011)]. The resulting alignment was dynamically filtered to remove the 20% most entropic positions and positions with more than 90% gaps. A phylogenetic tree was constructed with RAxML version 7.3.0 (Stamatakis et al., 2008), using the tree topology from the multigene study of Parfrey et al. (2010) with updates based on subsequent papers (e.g., Adl et al., 2012) as a constraint.

The database taxonomy was curated to reflect current views of eukaryotic taxonomy and maximize the taxonomic information available for environmental sequences. Major clade information was added based on Parfrey et al. (2010) and Adl et al. (2012). To maximize the informativeness of the SILVA data set, high-level taxonomy was assigned to uncultured environmental sequences by placing these uncultured reads into the tree of SILVA representative sequences with the RAxML EPA algorithm (Berger et al., 2011) and assessing their position in a phylogenetic tree. Sequences that were nested within clades were assigned taxonomy based on that clade at a high level (e.g., Ciliate or Fungi). Sequences that were mislabeled (i.e., sequence labeled as fungi that fell within the plants) were identified in the tree, confirmed by BLAST and then removed from the representative set. The curated SILVA 108 database is available at http://qiime.org/home_static/dataFiles.html.

Results and discussion

Eukaryotic diversity in the human gut

Eukaryotic microbes are common components of the human gut microbiota in healthy individuals. Blastocystis, Entamoeba, trichomonads, and yeast were frequently detected in human gut samples (Figure 1). Closer inspection of the taxa reveals that most are likely commensal rather than pathogens. For example, Entamoeba was detected in both populations. While the genus Entamoeba includes E. histolytica, the causative agent of the deadly amoebic dysentery (Bogitsh et al., 2005), the vast majority of Entamoeba sequences detected here fall within the commensal species Entamoeba coli, E. dispar, and E. hartmanni (Table 1). Entamoeba histolytica was detected in low abundance in two individuals that also harbored E. dispar.

Figure 1.

Figure 1

Relative taxon abundance of mammalian (including human) fecal samples demonstrates heterogeneity in the presence of eukaryotic lineages across mammals, while the same bacterial lineages are consistently dominant. (A) Eukaryotes, (B) bacteria. Each bar represents an individual fecal sample from humans and other mammals, and only samples with at least 150 sequences in the 18S are represented.

Blastocystis was abundant in many samples (Figure 1), and represented by subtypes ST1, ST2, and ST3 (Table 1). Historically, Blastocystis has been considered a pathogen and it is associated with Irritable Bowel Syndrome (Yakoob et al., 2010; Poirier et al., 2012). However, the clinical importance of Blastocystis, its pathogenicity, and variation in pathogenicity among subtypes, is widely debated (Tan et al., 2010; Coyle et al., 2012; Scanlan and Stensvold, 2013). Some evidence suggests that Blastocystis is a normal component of the microbiota in many individuals—perhaps even a beneficial component—as it has been detected at high prevalence in healthy people (Scanlan and Marchesi, 2008; Petersen et al., 2013; Andersen et al., submitted), its presence is negatively correlated with intestinal disease (Petersen et al., 2013), but see Cekin et al. (2012). High prevalence of Blastocystis has been reported in other epidemiological studies of African countries, up to 100% reported in a Senegalese cohort, half of which had no gastrointestinal symptoms (El Safadi et al., 2014). Many other taxa that populate parasitology textbooks were also detected at lower levels, including Chilomastix, nematodes, and other parabasalids. We do not detect common gut symbionts such as Dientamoeba (Parabasalia), Cryptosporidium (Apicomplexa), or Giardia (Diplomonadida). The primers used here are a poor match for Giardia (Table S2) and may have failed to amplify Giardia DNA. The primers are predicted to work well with Cryptosporidium, but our DNA extraction method (bead beating rather than freeze thaw cycles) may have been insufficient to break open the robust spores of Cryptosporidium (and similar problems may further hinder our ability to detect Giardia). Dientamoeba is also predicted to amplify with our primers (Table S2). While prevalence is generally quite high in Europe, Dientamoeba prevalence is variable worldwide and generally low (less than 5%) in Africa (Barratt et al., 2011). However, specific diagnostic assays would be necessary to rule out presence of these taxa with any confidence.

We assessed eukaryotic diversity across two geographically distant populations whose inhabitants follow either traditional, agrarian lifestyles (Malawi) or modern, urban lifestyles (US). However, our ability to compare eukaryotic diversity across populations is hampered by low counts of eukaryotic sequences in US individuals and young children. Taxa presence above was calculated based on OTUs represented by at least five sequences in a given sample. In order to compare diversity across populations and across sample types more broadly, we filtered out samples with fewer than 150 eukaryotic sequences. While all but three human fecal samples had greater than 150 sequences per sample in total, 27 samples fell below this threshold after removing sequences from bacteria, host, and dietary plants. These non-target taxa account for 94–100% of the sequences from all but one US samples and most children age two and younger (Table 1). One samples from a three-year-old US child had a large portion of sequences derived from Entamoeba coli. The primer set used here targets eukaryotic 18S has a low affinity for vertebrate 18S sequences, and successfully amplified the eukaryotic community in most samples, including environmental samples and mammalian feces (Table S1). We suspect that the high proportion of non-target sequences amplified in samples from the US and from small children reflects a lower eukaryotic biomass and/or diversity in these samples. This hypothesis requires further investigation, but is inline with other results. Previous studies report lower bacterial diversity in western populations and in young children (reviewed in Lozupone et al., 2012). Further, lower prevalence of gut symbionts is associated with the adoption of western lifestyles (Rook, 2012), and prevalence and diversity are lower in temperate regions compared to the tropics (Bogitsh et al., 2005; Harhay et al., 2010).

Eukaryotic microbiota in the mammalian gut

Mammals as a whole harbor a diverse community of eukaryotic microbes in their gut, and compositional differences follow host phylogeny and diet. The human gut microbioes is similar that of other mammals, particularly of primates.

Diet drives differences in bacterial community composition across mammalian species (Ley et al., 2008b; Muegge et al., 2011). We also see compositional differences according to diet in the eukaryotic communities. Herbivores make up most of our mammalian samples that successfully amplified, and are differentiated between hindgut and foregut fermenters. The presence and absence of entire lineages varies according to dietary group, for example only hindgut fermenting herbivores harbor litostome ciliates and anaerobic fungi (e.g., Neocallimastix; Figure 1). Lineages that are present in multiple host species such as Blastocystis and Entamoeba show species level divergence that tracks host phylogeny. Artiodactyls harbor Entamoeba bovis, while primates have Entamoeba coli and E. hartmanii (Table S1). Host-specificity is also observed in the distribution of Blastocystis subtypes (Table S1). We detected Blastocystis ST1, ST2, and ST3 in humans (Table 1) and also in the primates (baboon and orangutan) (Table S1). Kangaroos, foregut-fermenting herbivores, had large numbers of Blastocystis ST8 (Figure 1; Table S1).

Diversity patterns for eukaryotic microbes within the mammalian gut differ in two ways from those of bacteria. First, eukaryotic microbes show a patchy distribution across samples, such that the most abundant lineages in some samples are completely absent from others (Figure 1). In contrast, bacterial community composition at comparably high taxonomic levels is broadly consistent across individuals and across populations; e.g., Bacteroidetes and Firmicutes are generally the dominant phyla (Figure 1; Ley et al., 2008b; Consortium, 2012; Yatsunenko et al., 2012). Second, within a phylum-level lineage there is less diversity at the strain and species level for eukaryotes, even after controlling for differences in sequencing depth (Figure 2). This suggests that presence or absence of deep lineages may be more informative than variation at lower taxonomic levels for eukaryotes.

Figure 2.

Figure 2

Comparison of the phylogenetic distribution of taxa from mammalian gut to other environments. Sequences detected in the mammalian gut come from a smaller number of lineages and have lower overall diversity compared to other environments, reflecting the limited number of lineages that have successfully colonized animal hosts. Tree contains sequences from 32 mammalian gut samples (red) and 32 samples total from skin, terrestrial, and aquatic habitats (blue). Tips present correspond to the data rarefied to 150 sequences per sample for comparison. (A) Eukaryotic 18S rRNA tree constructed using RAxML with the topology constrained to the SILVA 108 reference tree. (B) Bacterial 16S rRNA tree from Greengenes 2011 release. Branches are colored according to the environment that contributed the majority of the sequences.

Diversity of gut microbiota compared to other environments

The microbial eukaryotic communities detected in the mammalian gut are quite distinct from environmental communities both at the OTU level, as seen in the low numbers of shared OTUs (Table 2) and at higher taxonomic levels (Figures 2, 3). Just 3% of non-fungal OTUs from the gut are shared with skin, terrestrial, and aquatic environments (Table 2). The composition eukaryotic communities in the mammalian gut is significantly different than the composition found in environmental samples (ANOSIM p = 0.001, R = 0.76), and this is true for bacteria as well (ANOSIM p = 0.001, R = 0.94). Overall, beta-diversity patterns observed for eukaryotes are significant similar to bacterial beta-diversity as assessed by Mantel tests comparing the unweighted UniFrac distance matrices (p = 001, R = 0.658; N = 113). The distinctiveness of gut communities can also be seen when the branches of the 18S and 16S trees are colored according to the environment where the sequences were detected (Figure 2). Sequences from the gut are significantly clustered in both 16S and 18S (Figure 2) as assessed by the phylogenetic test [p-test p < 0.001; (Martin, 2002)] and UniFrac significance test (p < 0.001).

Figure 3.

Figure 3

Bar chart of the relative abundance of sequences falling into the major clades of eukaryotes depicts the overall divergence in community composition across sample types. Major clades are the deepest divisions within eukaryotes (Parfrey et al., 2010; Adl et al., 2012) and are roughly equal to the phyla or superphyla level of bacteria.

In accordance with previous observations, fewer lineages of eukaryotes reside in the mammalian gut than in other habitats, and those lineages that have successfully colonized the vertebrate gut have diversified as they have co-evolved with their hosts over millions of years (Parfrey et al., 2011). Similar patterns have also been observed for bacteria (Ley et al., 2006). Here, we see significantly lower levels of alpha diversity in gut communities compared to other environments for eukaryotes (t-test comparing Faith's phylogenetic distance in the gut vs. environmental samples: p < 0.001), and bacteria (p < 0.001).

Eukaryotic communities associated with human skin resemble terrestrial samples

Eukaryotic communities associated with human skin are composed mostly of fungi and have low diversity overall, in line with expectations from other studies (Paulino et al., 2006; Findley et al., 2013). Skin samples group with terrestrial samples in NMDS plots of unweighted UniFrac (Figure 4). Similarity in the fungi detected in skin and terrestrial samples accounts for much of this similarity; 70% of the OTUs on skin are fungi, and of these more than 80% (113 OTUs) are shared with soil or other terrestrial samples. The low taxonomic resolution of fungi with the 18S marker may inflate the number shared OTUs to some extent (Schoch et al., 2012). Non-fungal OTUs detected on skin correspond to mites and a handful of low abundance OTUs that are commonly found in soil such as cercozoan flagellates. The overlap between skin and soil communities may reflect frequent contact between skin and soil, or with airborne microbes, which can have high abundances of soil-associated taxa (Bowers et al., 2011b). In support of this hypothesis, skin bacterial communities also frequently group with environmental samples (Figure 4). These results are suggestive, but are drawn from skin and soil samples taken in different locations within different studies (see Methods). Testing the hypothesis that skin communities resemble terrestrial environments because contact enables frequent dispersal requires samples from human skin and the surrounding environment, including dust and soil, collected at the same time.

Figure 4.

Figure 4

NMDS plot of unweighted UniFrac reveal separation across major environmental categories. Plots (A) Eukaryotes and (B) Bacteria show the distinction between fecal samples (red and orange) and those from other environments, including skin (pink). Air samples were collected over terrestrial habitats.

Comparison of eukaryotic communities in other habitats

Our dataset includes samples from a range of environments and enables us to compare eukaryotic communities across environmental habitats. Microbial eukaryotic communities are highly differentiated across host marine, freshwater, and terrestrial habitats as assessed by ANOSIM (Figure 4; ANOSIM R = 0.78, p = 0.001). The sample set analyzed here includes more soil and other terrestrial samples, such as lichens and leaf litter than water samples (Table S1), but the differences across habitat types persist when the data is subsampled to equal sample numbers across habitat types (see Methods). For each of the 1000 sub-sampled trials, the divide between freshwater, marine, and terrestrial environments was highly significant and explains much of the variation (ANOSIM ranges: p = 0.001 to 0.005 and R = 0.65 to 0.60). These habitats were also significantly clustered in the 18S tree (p-test p = 0.001 for each pair of environments).

Beta-diversity differences across environments are underlain by a strong differentiation in the high-level clades present across environments (Figure 3). Some clades are restricted to one type of sample, for example, Amoebozoa (Entamoeba) and parabasalids are characteristic of fecal samples and cryptophytes comprise a large portion of the freshwater community, while the recently identified Picozoa clade (formerly “picobiliphytes”; Seenivasan et al., 2013) is restricted to marine environments. Yet, across all environments, diversity is dominated by just a few clades. Animals, fungi, alveolates, Cercozoa, and stramenopiles make up 79% of all sequences (Figure 3). At the OTU level very few taxa are shared across habitats (Table 2).

Communities from environmental samples show a distinct separation between terrestrial and water samples, and between marine and freshwater samples in beta-diversity plots (Figure 4). In accordance with previous studies that report salinity as the most important factor structuring bacterial and archaeal community composition (Lozupone and Knight, 2007; Auguet et al., 2010; Wang et al., 2011), and we also see a major divide in bacterial community composition between freshwater vs. marine habitats (Figure 4). Eukaryotic taxa also cross the saline/non-saline boundary infrequently (e.g., Shalchian-Tabrizi et al., 2008; Logares et al., 2009; Brate et al., 2010). In our data, compositional differences between freshwater and marine eukaryotic communities are highly significant (ANOSIM p < 0.001, R = 0.58), though our dataset includes a limited number of samples. Interestingly, the difference between aquatic and terrestrial environments are also significant and explain more variation in community structure (ANOSIM R = 0.71 for terrestrial vs. freshwater and R = 0.85 for marine vs. terrestrial comparisons). Further studies that include large numbers of samples from all three habitat types, preferably from consistent geographic locations, will be necessary to determine the deepest divisions in eukaryotic community composition across environments.

Conclusions

Our results demonstrate clearly that microbial eukaryotes are a normal component of the mammalian microbiota, and that the communities they form, although not as diverse as bacterial communities in the gut, are nonetheless diverse and correlate with key features of their hosts. Interestingly, humans with non-western diets and lifestyles are comparable to other mammals in the microbial eukaryote diversity they harbor. In contrast, humans living Western lifestyles instead have very low diversity of gut microbial eukaryotes. Whether these differences are due to diet, hygiene, level of contact with animals, host genetics, or other lifestyle factors that differ among the populations surveyed remains a topic for further work: of particular interest is whether the loss of the microbial eukaryote diversity with which we as mammals have co-evolved is a trigger for the autoimmune diseases that are far more prevalent in Western populations.

One intriguing difference between eukaryotic and bacterial communities is that eukaryotic communities in the vertebrate gut are heterogeneous across samples, whereas the dominant bacterial lineages are consistently recovered across individuals and across species. The patchy distribution of eukaryotes across individuals, combined with the host-species specificity of resident eukaryotic microbes, suggests that it will be difficult to clearly identify the healthy, or “normal,” core eukaryotic microbiota of the human gut, just as is it is also difficult to identify a core gut bacterial community shared across humans (Li et al., 2013). Consequently, future studies of microbial eukaryote communities should focus more on identifying variation that is associated with different phenotypic states, including disease states.

Finally, comparison of the mammalian gut to other environments shows that fewer deep lineages are associated with the gut than in free-living communities, and alpha diversity is lower. This pattern resembles the pattern found in bacteria in the same environments. Eukaryotes have less diversification within lineages at shallow levels than observed for bacteria, however, suggesting that although the big picture of high-level diversification is the same across these taxa, the fine-grained patterns may differ. With the improved tools for eukaryotic surveys presented here, we are now poised to characterize microbial eukaryotes across environments on a large scale in projects such as the Earth Microbiome Project, providing a much richer understanding of the relationships between pathogens, commensals, and beneficial members of our microbial eukaryote community.

Conflict of interest statement

Several authors on this manuscript are, or were at the time of the study, employees of 454 Life Sciences, a Roche Company (Branford, CT), whose technology is used in this study.

Acknowledgments

Many thanks to Jessica Metcalf for helpful discussions and comments throughout. Thanks to Se Jin Song and Jon Leff for comments and consultations on statistics and to Antonio Gonzalez for technical help. We wish to thank the National Ecological Observatory Network (a project sponsored by the National Science Foundation and managed under cooperative agreement by NEON, Inc.) for donation of the soil samples. Thanks to Jeffrey Gordon for providing access to fecal samples. Funding was provided the Crohn's and Colitis Foundation of America to Rob Knight (WU-09-72), National Institutes of Health to Rob Knight (NHGRI HG004872-03) and the Howard Hughes Medical Institute.

Supplementary material

The Supplementary Material for this article can be found online at: http://www.frontiersin.org/journal/10.3389/fmicb.2014.00298/abstract

Figure S1

Rarefaction curves with alpha diversity metric PD Whole tree. Rarefaction curves are approaching an asymptote indicating diversity has been adequately captured, especially for fecal samples. Error bars are standard deviation. (A) Eukaryotes and (B) Bacteria.

References

  1. Adl S. M., Simpson A. G. B., Lane C. E., Lukeš J., Bass D., Bowser S. S., et al. (2012). The revised classification of eukaryotes. J. Euk. Microbiol. 59, 429–514 10.1111/j.1550-7408.2012.00644.x [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Alfellani M. A., Stensvold C. R., Vidal-Lapiedra A., Onuoha E. S., Fagbenro-Beyioku A. F., Clark C. G. (2013). Variable geographic distribution of Blastocystis subtypes and its potential implications. Acta Trop. 126, 11–18 10.1016/j.actatropica.2012.12.011 [DOI] [PubMed] [Google Scholar]
  3. Amaral-Zettler L. A., McCliment E. A., Ducklow H. W., Huse S. M. (2009). A method for studying protistan diversity using massively parallel sequencing of v9 hypervariable regions of small-subunit ribosomal RNA genes. PLoS ONE 4:e6372 10.1371/journal.pone.0006372 [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Andersen L. O., Vedel Nielsen H., Stensvold C. R. (2013). Waiting for the human intestinal eukaryotome. ISME J. 7, 1253–1255 10.1038/ismej.2013.21 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Auguet J. C., Barberan A., Casamayor E. O. (2010). Global ecological patterns in uncultured archaea. ISME J. 4, 182–190 10.1038/ismej.2009.109 [DOI] [PubMed] [Google Scholar]
  6. Baldwin D. S., Colloff M. J., Rees G. N., Chariton A. A., Watson G. O., Court L. N., et al. (2013). Impacts of inundation and drought on eukaryote biodiversity in semi-arid floodplain soils. Mol. Ecol. 22, 1746–1758 10.1111/mec.12190 [DOI] [PubMed] [Google Scholar]
  7. Barratt J. L. N., Harkness J., Marriott D., Ellis J. T., Stark D. (2011). A review of Dientamoeba fragilis carriage in humans: several reasons why this organism should be considered in the diagnosis of gastrointestinal illness. Gut Microbes 2, 3–12 10.4161/gmic.2.1.14755 [DOI] [PubMed] [Google Scholar]
  8. Bates S. T., Berg-Lyons D., Lauber C. L., Walters W. A., Knight R., Fierer N. (2012). A preliminary survey of lichen associated eukaryotes using pyrosequencing. Lichenologist 44, 137–146 10.1017/S0024282911000648 [DOI] [Google Scholar]
  9. Bates S. T., Clemente J. C., Flores G. E., Walters W. A., Parfrey L. W., Knight R., et al. (2013). Global biogeography of highly diverse protistan communities in soil. ISME J. 7, 652–659 10.1038/ismej.2012.147 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Bates S. T., Cropsey G. W., Caporaso J. G., Knight R., Fierer N. (2011). Bacterial communities associated with the lichen symbiosis. Appl. Environ. Microbiol. 77, 1309–1314 10.1128/AEM.02257-10 [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Berger S. A., Krompass D., Stamatakis A. (2011). Performance, accuracy, and web server for evolutionary placement of short sequence reads under maximum likelihood. Syst. Biol. 60, 291–302 10.1093/sysbio/syr010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Bogitsh B., Carter C., Oeltmann T. (2005). Human Parasitology. Amsterdam: Elsevier [Google Scholar]
  13. Bokulich N. A., Subramanian S., Faith J. J., Gevers D., Gordon J. I., Knight R., et al. (2013). Quality-filtering vastly improves diversity estimates from illumina amplicon sequencing. Nat. Methods 10, 57–59 10.1038/nmeth.2276 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Bowers R. M., McCubbin I., Hallar A., Fierer N. (2012). Seasonal variability in airborne bacterial communities at a high-elevation site in the Colorado rocky mountains. Atmos. Environ. 50, 41–49 10.1016/j.atmosenv.2012.01.005 [DOI] [Google Scholar]
  15. Bowers R. M., McLetchie S., Knight R., Fierer N. (2011a). Spatial variability in airborne bacterial communities across land-use types and their relationship to the bacterial communities of potential source environments. ISME J. 5, 601–612 10.1038/ismej.2010.167 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Bowers R. M., Sullivan A. P., Costello E. K., Collett J. L., Jr., Knight R., Fierer N. (2011b). Sources of bacteria in outdoor air across cities in the midwestern united states. Appl. Environ. Microbiol. 77, 6350–6356 10.1128/AEM.05498-11 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Brate J., Klaveness D., Rygh T., Jakobsen K. S., Shalchian-Tabrizi K. (2010). Telonemia-specific environmental 18s rDNA pcr reveals unknown diversity and multiple marine-freshwater colonizations. BMC Microbiol. 10:168 10.1186/1471-2180-10-168 [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Caporaso J. G., Bittinger K., Bushman F. D., DeSantis T. Z., Andersen G. L., Knight R. (2010a). PyNAST: a flexible tool for aligning sequences to a template alignment. Bioinformatics 26, 266–267 10.1093/bioinformatics/btp636 [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Caporaso J. G., Kuczynski J., Stombaugh J., Bittinger K., Bushman F. D., Costello E. K., et al. (2010b). QIIME allows analysis of high-throughput community sequencing data. Nat. Methods 7, 335–336 10.1038/nmeth.f.303 [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Caron D. A. (2009). New accomplishments and approaches for assessing protistan diversity and ecology in natural ecosystems. Bioscience 59, 287–299 10.1525/bio.2009.59.4.7 [DOI] [Google Scholar]
  21. Cekin A. H., Cekin Y., Adakan Y., Tasdemir E., Koclar F. G., Yolcular B. O. (2012). Blastocystosis in patients with gastrointestinal symptoms: a case-control study. BMC Gastroenterol. 12:122 10.1186/1471-230X-12-122 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Cho I., Blaser M. J. (2012). The human microbiome: at the interface of health and disease. Nat. Rev. Genet. 13, 260–270 10.1038/nrg3182 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Clarke K., Gorley R. (2006). Primer v6: User Manual. Plymouth: PRIMER-E [Google Scholar]
  24. Consortium H. M. P. (2012). Structure, function and diversity of the healthy human microbiome. Nature 486, 207–214 10.1038/nature11234 [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Costello E. K., Lauber C. L., Hamady M., Fierer N., Gordon J. I., Knight R. (2009). Bacterial community variation in human body habitats across space and time. Science 326, 1694–1697 10.1126/science.1177486 [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Coyle C. M., Varughese J., Weiss L. M., Tanowitz H. B. (2012). Blastocystis: to treat or not to treat. Clin. Infect. Dis. 54, 105–110 10.1093/cid/cir810 [DOI] [PubMed] [Google Scholar]
  27. De Filippo C., Cavalieri D., Di Paola M., Ramazzotti M., Poullet J. B., Massart S., et al. (2010). Impact of diet in shaping gut microbiota revealed by a comparative study in children from europe and rural africa. Proc. Natl. Acad. Sci. U.S.A. 107, 14691–14696 10.1073/pnas.1005963107 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Dollive S., Peterfreund G. L., Sherrill-Mix S., Bittinger K., Sinha R., Hoffmann C., et al. (2012). A tool kit for quantifying eukaryotic rRNA gene sequences from human microbiome samples. Genome Biol. 13:R60 10.1186/gb-2012-13-7-r60 [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Edgar R. C. (2010). Search and clustering orders of magnitude faster than BLAST. Bioinformatics 26, 2460–2461 10.1093/bioinformatics/btq461 [DOI] [PubMed] [Google Scholar]
  30. Eilers K., Debenport S., Anderson S., Fierer N. (2012). Digging deeper to find unique microbial communities: the strong effect of depth on the structure of bacterial and archaeal communities. Soil Biol. Biochem. 50, 58–65 10.1016/j.soilbio.2012.03.011 [DOI] [Google Scholar]
  31. El Safadi D., Gaayeb L., Meloni D., Cian A., Poirier P., Wawrzyniak I., et al. (2014). Children of Senegal river basin show the highest prevalence of Blastocystis sp. Ever observed worldwide. BMC Infect. Dis. 14:164 10.1186/1471-2334-14-164 [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Fierer N., Ferrenberg S., Flores G. E., Gonzalez A., Kueneman J., Legg T., et al. (2012). From animalcules to an ecosystem: application of ecological concepts to the human microbiome. Annu. Rev. Ecol. Evol. Syst. 43, 137–155 10.1146/annurev-ecolsys-110411-160307 [DOI] [Google Scholar]
  33. Findley K., Oh J., Yang J., Conlan S., Deming C., Meyer J. A., et al. (2013). Topographic diversity of fungal and bacterial communities in human skin. Nature 498, 367–370 10.1038/nature12171 [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Foissner W. (2006). Biogeography and dispersal of micro-organisms: a review emphasizing protists. Acta Protozool. 45, 111–136 [Google Scholar]
  35. Goncalves M. L., Araujo A., Ferreira L. F. (2003). Human intestinal parasites in the past: new findings and a review. Mem. Inst. Oswaldo Cruz 98(Suppl. 1), 103–118 10.1590/S0074-02762003000900016 [DOI] [PubMed] [Google Scholar]
  36. Guillou L., Bachar D., Audic S., Bass D., Berney C., Bittner L., et al. (2013). The protist ribosomal reference database (PR2): a catalog of unicellular eukaryote small sub-unit rRNA sequences with curated taxonomy. Nucleic Acids Res. 41, D597–D604 10.1093/nar/gks1160 [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Hamad I., Sokhna C., Raoult D., Bittar F. (2012). Molecular detection of eukaryotes in a single human stool sample from Senegal. PLoS ONE 7:e40888 10.1371/journal.pone.0040888 [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Harhay M. O., Horton J., Olliaro P. L. (2010). Epidemiology and control of human gastrointestinal parasites in children. Expert Rev. Anti Infect. Ther. 8, 219–234 10.1586/eri.09.119 [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Katz L. A., Grant J., Parfrey L. W., Gant A., O'Kelly C. J., Anderson O. R., et al. (2011). Subulatomonas tetraspora nov. Gen. Nov. Sp. Is a member of a previously unrecognized major clade of eukaryotes. Protist 162, 762–773 10.1016/j.protis.2011.05.002 [DOI] [PubMed] [Google Scholar]
  40. Kittelmann S., Janssen P. H. (2011). Characterization of rumen ciliate community composition in domestic sheep, deer, and cattle, feeding on varying diets, by means of PCR-DGGE and clone libraries. FEMS Microbiol. Ecol. 75, 468–481 10.1111/j.1574-6941.2010.01022.x [DOI] [PubMed] [Google Scholar]
  41. Larsen J., Patterson D. J. (1990). Some flagellates (Protista) from tropical marine sediments. J. Nat. Hist. 24, 801–937 10.1080/00222939000770571 [DOI] [Google Scholar]
  42. Lauber C. L., Hamady M., Knight R., Fierer N. (2009). Pyrosequencing-based assessment of soil pH as a predictor of soil bacterial community structure at the continental scale. Appl. Environ. Microbiol. 75, 5111–5120 10.1128/AEM.00335-09 [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Ley R. E., Hamady M., Lozupone C., Turnbaugh P. J., Ramey R. R., Bircher J. S., et al. (2008a). Evolution of mammals and their gut microbes. Science 320, 1647–1651 10.1126/science.1155725 [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Ley R. E., Lozupone C. A., Hamady M., Knight R., Gordon J. I. (2008b). Worlds within worlds: evolution of the vertebrate gut microbiota. Nat. Rev. Microbiol. 6, 776–788 10.1038/nrmicro1978 [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Ley R. E., Peterson D. A., Gordon J. I. (2006). Ecological and evolutionary forces shaping microbial diversity in the human intestine. Cell 124, 837–848 10.1016/j.cell.2006.02.017 [DOI] [PubMed] [Google Scholar]
  46. Li K., Bihan M., Methe B. A. (2013). Analyses of the stability and core taxonomic memberships of the human microbiome. PLoS ONE 8:e63139 10.1371/journal.pone.0063139 [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Logares R., Brate J., Bertilsson S., Clasen J. L., Shalchian-Tabrizi K., Rengefors K. (2009). Infrequent marine-freshwater transitions in the microbial world. Trends Microbiol. 17, 414–422 10.1016/j.tim.2009.05.010 [DOI] [PubMed] [Google Scholar]
  48. Lozupone C., Knight R. (2005). UniFrac: a new phylogenetic method for comparing microbial communities. Appl. Environ. Microbiol. 71, 8228–8235 10.1128/AEM.71.12.8228-8235.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Lozupone C. A., Knight R. (2007). Global patterns in bacterial diversity. Proc. Natl. Acad. Sci. U.S.A. 104, 11436–11440 10.1073/pnas.0611525104 [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Lozupone C. A., Stombaugh J. I., Gordon J. I., Jansson J. K., Knight R. (2012). Diversity, stability and resilience of the human gut microbiota. Nature 489, 220–230 10.1038/nature11550 [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Martin A. P. (2002). Phylogenetic approaches for describing and comparing the diversity of microbial communities. Appl. Environ. Microbiol. 68, 3673–3682 10.1128/AEM.68.8.3673-3682.2002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. McFall-Ngai M., Hadfield M. G., Bosch T. C., Carey H. V., Domazet-Loso T., Douglas A. E., et al. (2013). Animals in a bacterial world, a new imperative for the life sciences. Proc. Natl. Acad. Sci. U.S.A. 110, 3229–3236 10.1073/pnas.1218525110 [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. McFarland L., Bernasconi P. (1993). Saccharomyces boulardii: a review of an innovative biotherapeutic agent. Microb. Ecol. Health Dis. 6, 157–171 10.3109/08910609309141323 [DOI] [Google Scholar]
  54. McGuire K. L., Fierer N., Bateman C., Treseder K. K., Turner B. L. (2012). Fungal community composition in neotropical rain forests: the influence of tree diversity and precipitation. Microb. Ecol. 63, 804–812 10.1007/s00248-011-9973-x [DOI] [PubMed] [Google Scholar]
  55. Muegge B. D., Kuczynski J., Knights D., Clemente J. C., Gonzalez A., Fontana L., et al. (2011). Diet drives convergence in gut microbiome functions across mammalian phylogeny and within humans. Science 332, 970–974 10.1126/science.1198719 [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Nemergut D. R., Schmidt S. K., Fukami T., O'Neill S. P., Bilinski T. M., Stanish L. F., et al. (2013). Patterns and processes of microbial community assembly. Microbiol. Mol. Biol. Rev. 77, 342–356 10.1128/MMBR.00051-12 [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Parfrey L. W., Grant J., Tekle Y. I., Lasek-Nesselquist E., Morrison H. G., Sogin M. L., et al. (2010). Broadly sampled multigene analyses yield a well-resolved eukaryotic tree of life. Syst. Biol. 59, 518–533 10.1093/sysbio/syq037 [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Parfrey L. W., Walters W. A., Knight R. (2011). Microbial eukaryotes in the human microbiome: ecology, evolution, and future directions. Front. Microbiol. 2:153 10.3389/fmicb.2011.00153 [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Patterson D. J. (1996). Free-Living Freshwater Protozoa. New York, NY: John Wiley & Sons, Inc [Google Scholar]
  60. Paulino L. C., Tseng C. H., Strober B. E., Blaser M. J. (2006). Molecular analysis of fungal microbiota in samples from healthy human skin and psoriatic lesions. J. Clin. Microbiol. 44, 2933–2941 10.1128/JCM.00785-06 [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Petersen A. M., Stensvold C. R., Mirsepasi H., Engberg J., Friis-Moller A., Porsbo L. J., et al. (2013). Active ulcerative colitis associated with low prevalence of Blastocystis and Dientamoeba fragilis infection. Scand. J. Gastroenterol. 48, 638–639 10.3109/00365521.2013.780094 [DOI] [PubMed] [Google Scholar]
  62. Pirrung M., Kennedy R., Caporaso J. G., Stombaugh J., Wendel D., Knight R. (2011). TopiaryExplorer: visualizing large phylogenetic trees with environmental metadata. Bioinformatics 27, 3067–3069 10.1093/bioinformatics/btr517 [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Poirier P., Wawrzyniak I., Vivares C. P., Delbac F., El Alaoui H. (2012). New insights into Blastocystis spp.: a potential link with irritable bowel syndrome. PLoS Pathog. 8:e1002545 10.1371/journal.ppat.1002545 [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Pruesse E., Quast C., Knittel K., Fuchs B. M., Ludwig W., Peplies J., et al. (2007). SILVA: a comprehensive online resource for quality checked and aligned ribosomal RNA sequence data compatible with arb. Nucleic Acids Res. 35, 7188–7196 10.1093/nar/gkm864 [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Ramirez K. S., Lauber C. L., Knight R., Bradford M. A., Fierer N. (2010). Consistent effects of nitrogen fertilization on soil bacterial communities in contrasting systems. Ecology 91, 3463–3470 discussion: 3503–3414. 10.1890/10-0426.1 [DOI] [PubMed] [Google Scholar]
  66. Rook G. A. (2012). Hygiene hypothesis and autoimmune diseases. Clin. Rev. Allergy Immunol. 42, 5–15 10.1007/s12016-011-8285-8 [DOI] [PubMed] [Google Scholar]
  67. Russell J. A., Dubilier N., Rudgers J. A. (2014). Nature's microbiome: introduction. Mol. Ecol. 23, 1225–1237 10.1111/mec.12676 [DOI] [PubMed] [Google Scholar]
  68. Scanlan P. D., Marchesi J. R. (2008). Micro-eukaryotic diversity of the human distal gut microbiota: qualitative assessment using culture-dependent and -independent analysis of faeces. ISME J. 2, 1183–1193 10.1038/ismej.2008.76 [DOI] [PubMed] [Google Scholar]
  69. Scanlan P. D., Stensvold C. R. (2013). Blastocystis: getting to grips with our guileful guest. Trends Parasitol. 29, 523–529 10.1016/j.pt.2013.08.006 [DOI] [PubMed] [Google Scholar]
  70. Schoch C. L., Seifert K. A., Huhndorf S., Robert V., Spouge J. L., Levesque C. A., et al. (2012). Nuclear ribosomal internal transcribed spacer (its) region as a universal DNA barcode marker for fungi. Proc. Natl. Acad. Sci. U.S.A. 109, 6241–6246 10.1073/pnas.1117018109 [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Scupham A. J., Presley L. L., Wei B., Bent E., Griffith N., McPherson M., et al. (2006). Abundant and diverse fungal microbiota in the murine intestine. Appl. Environ. Microbiol. 72, 793–801 10.1128/AEM.72.1.793-801.2006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Seenivasan R., Sausen N., Medlin L. K., Melkonian M. (2013). Picomonas judraskeda gen. Et sp. Nov.: the first identified member of the picozoa phylum nov., a widespread group of picoeukaryotes, formerly known as ‘picobiliphytes’. PLoS ONE 8:e59565 10.1371/journal.pone.0059565 [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Shade A., Read J. S., Youngblut N. D., Fierer N., Knight R., Kratz T. K., et al. (2012). Lake microbial communities are resilient after a whole-ecosystem disturbance. ISME J. 6, 2153–2167 10.1038/ismej.2012.56 [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Shalchian-Tabrizi K., Brate J., Logares R., Klaveness D., Berney C., Jakobsen K. S. (2008). Diversification of unicellular eukaryotes: cryptomonad colonizations of marine and fresh waters inferred from revised 18s rRNA phylogeny. Environ. Microbiol. 10, 2635–2644 10.1111/j.1462-2920.2008.01685.x [DOI] [PubMed] [Google Scholar]
  75. Smith M. I., Yatsunenko T., Manary M. J., Trehan I., Mkakosya R., Cheng J., et al. (2013). Gut microbiomes of malawian twin pairs discordant for kwashiorkor. Science 339, 548–554 10.1126/science.1229000 [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Sommer F., Backhed F. (2013). The gut microbiota–masters of host development and physiology. Nat. Rev. Microbiol. 11, 227–238 10.1038/nrmicro2974 [DOI] [PubMed] [Google Scholar]
  77. Stamatakis A. (2006). RAxML-vi-hpc: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics 22, 2688–2690 10.1093/bioinformatics/btl446 [DOI] [PubMed] [Google Scholar]
  78. Stamatakis A., Hoover P., Rougemont J. (2008). A rapid bootstrap algorithm for the RAxML web-servers. Syst. Biol. 57, 758–771 10.1080/10635150802429642 [DOI] [PubMed] [Google Scholar]
  79. Stensvold C. R., Lebbad M., Victory E. L., Verweij J. J., Tannich E., Alfellani M., et al. (2011). Increased sampling reveals novel lineages of Entamoeba: consequences of genetic diversity and host specificity for taxonomy and molecular detection. Protist 162, 525–541 10.1016/j.protis.2010.11.002 [DOI] [PubMed] [Google Scholar]
  80. Stoeck T., Bass D., Nebel M., Christen R., Jones M. D., Breiner H. W., et al. (2010). Multiple marker parallel tag environmental DNA sequencing reveals a highly complex eukaryotic community in marine anoxic water. Mol. Ecol. 19(Suppl. 1), 21–31 10.1111/j.1365-294X.2009.04480.x [DOI] [PubMed] [Google Scholar]
  81. Tan K. S., Mirza H., Teo J. D., Wu B., Macary P. A. (2010). Current views on the clinical relevance of Blastocystis spp. Curr. Infect. Dis. Rep. 12, 28–35 10.1007/s11908-009-0073-8 [DOI] [PubMed] [Google Scholar]
  82. Ursell L. K., Treuren W. V., Metcalf J. L., Pirrung M., Gewirtz A., Knight R. (2013). Replenishing our defensive microbes. Bioessays 35, 810–817 10.1002/bies.201300018 [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Verhulst N. O., Qiu Y. T., Beijleveld H., Maliepaard C., Knights D., Schulz S., et al. (2011). Composition of human skin microbiota affects attractiveness to malaria mosquitoes. PLoS ONE 6:e28991 10.1371/journal.pone.0028991 [DOI] [PMC free article] [PubMed] [Google Scholar]
  84. Walters W. A., Caporaso J. G., Lauber C. L., Berg-Lyons D., Fierer N., Knight R. (2011). PrimerProspector: de novo design and taxonomic analysis of barcoded polymerase chain reaction primers. Bioinformatics 27, 1159–1161 10.1093/bioinformatics/btr087 [DOI] [PMC free article] [PubMed] [Google Scholar]
  85. Wang J. J., Yang D. M., Zhang Y., Shen J., van der Gast C., Hahn M. W., et al. (2011). Do patterns of bacterial diversity along salinity gradients differ from those observed for macroorganisms? PLoS ONE 6:e27597 10.1371/journal.pone.0027597 [DOI] [PMC free article] [PubMed] [Google Scholar]
  86. Weisse T. (2008). Distribution and diversity of aquatic protists: an evolutionary and ecological perspective. Biodivers. Conserv. 17, 243–259 10.1007/s10531-007-9249-4 [DOI] [Google Scholar]
  87. Yakoob J., Jafri W., Beg M. A., Abbas Z., Naz S., Islam M., et al. (2010). Blastocystis hominis and Dientamoeba fragilis in patients fulfilling irritable bowel syndrome criteria. Parasitol. Res. 107, 679–684 10.1007/s00436-010-1918-7 [DOI] [PubMed] [Google Scholar]
  88. Yatsunenko T., Rey F. E., Manary M. J., Trehan I., Dominguez-Bello M. G., Contreras M., et al. (2012). Human gut microbiome viewed across age and geography. Nature 486, 222–227 10.1038/nature11053 [DOI] [PMC free article] [PubMed] [Google Scholar]
  89. Yilmaz P., Kottmann R., Field D., Knight R., Cole J. R., Amaral-Zettler L., et al. (2011). Minimum information about a marker gene sequence (MiMARKS) and minimum information about any (x) sequence (MiXS) specifications. Nat. Biotechnol. 29, 415–420 10.1038/nbt.1823 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Figure S1

Rarefaction curves with alpha diversity metric PD Whole tree. Rarefaction curves are approaching an asymptote indicating diversity has been adequately captured, especially for fecal samples. Error bars are standard deviation. (A) Eukaryotes and (B) Bacteria.


Articles from Frontiers in Microbiology are provided here courtesy of Frontiers Media SA

RESOURCES