Skip to main content
mSystems logoLink to mSystems
. 2020 Nov 10;5(6):e00522-20. doi: 10.1128/mSystems.00522-20

Fermented-Food Metagenomics Reveals Substrate-Associated Differences in Taxonomy and Health-Associated and Antibiotic Resistance Determinants

John Leech a,b,c, Raul Cabrera-Rubio a,b, Aaron M Walsh a, Guerrino Macori a,b, Calum J Walsh a,b, Wiley Barton a, Laura Finnegan a,b,c, Fiona Crispie a,b, Orla O’Sullivan a,b, Marcus J Claesson a,c, Paul D Cotter a,b,
Editor: Lawrence A Davidd
PMCID: PMC7657593  PMID: 33172966

Fermented foods are regaining popularity worldwide due in part to a greater appreciation of the health benefits of these foods and the associated microorganisms. Here, we use state-of-the-art approaches to explore the microbiomes of 58 of these foods, identifying the factors that drive the microbial composition of these foods and potential functional benefits associated with these populations. Food type, i.e., dairy-, sugar-, or brine-type fermented foods, was the primary driver of microbial composition, with dairy foods found to have the lowest microbial diversity and, notably, potential health promoting attributes were more common in fermented foods than nonfermented equivalents. The information provided here will provide significant opportunities for the further optimization of fermented-food production and the harnessing of their health-promoting potential.

KEYWORDS: diversity, fermented, shotgun metagenomics

ABSTRACT

Fermented foods have been the focus of ever greater interest as a consequence of purported health benefits. Indeed, it has been suggested that consumption of these foods helps to address the negative consequences of “industrialization” of the human gut microbiota in Western society. However, as the mechanisms via which the microbes in fermented foods improve health are not understood, it is necessary to develop an understanding of the composition and functionality of the fermented-food microbiota to better harness desirable traits. Here, we considerably expand the understanding of fermented-food microbiomes by employing shotgun metagenomic sequencing to provide a comprehensive insight into the microbial composition, diversity, and functional potential (including antimicrobial resistance and carbohydrate-degrading and health-associated gene content) of a diverse range of 58 fermented foods from artisanal producers from a number of countries. Food type, i.e., dairy-, sugar-, or brine-type fermented foods, was the primary driver of microbial composition, with dairy foods found to have the lowest microbial diversity. From the combined data set, 127 high-quality metagenome-assembled genomes (MAGs), including 10 MAGs representing putatively novel species of Acetobacter, Acidisphaera, Gluconobacter, Companilactobacillus, Leuconostoc, and Rouxiella, were generated. Potential health promoting attributes were more common in fermented foods than nonfermented equivalents, with water kefirs, sauerkrauts, and kvasses containing the greatest numbers of potentially health-associated gene clusters. Ultimately, this study provides the most comprehensive insight into the microbiomes of fermented foods to date and yields novel information regarding their relative health-promoting potential.

IMPORTANCE Fermented foods are regaining popularity worldwide due in part to a greater appreciation of the health benefits of these foods and the associated microorganisms. Here, we use state-of-the-art approaches to explore the microbiomes of 58 of these foods, identifying the factors that drive the microbial composition of these foods and potential functional benefits associated with these populations. Food type, i.e., dairy-, sugar-, or brine-type fermented foods, was the primary driver of microbial composition, with dairy foods found to have the lowest microbial diversity and, notably, potential health promoting attributes were more common in fermented foods than nonfermented equivalents. The information provided here will provide significant opportunities for the further optimization of fermented-food production and the harnessing of their health-promoting potential.

INTRODUCTION

Fermentation is a form of food preservation with origins that can be traced back to the Neolithic age (1). Despite recent advances in food preservation and processing, fermentation continues to be widely used as a means of preservation and is the focus of renewed interest due to increased appreciation of the organoleptic, nutritive, and—especially—health-promoting properties attributed to many fermented foods (2, 3). Indeed, various fermented foods have been shown to have enhanced attributes relative to the corresponding raw ingredients by virtue of the microbial metabolites produced (48), the removal of allergens (9), other desirable biological activities (10, 11), and/or the presence of microbes that have the potential to confer benefits following consumption (12, 13). Furthermore, although antibiotic use, sanitation, and food processing have greatly reduced the number of deaths due to infectious diseases, these activities have also minimized our exposure to microbes and are thought to have contributed to the “industrialization” of the human microbiome and associated increases in chronic diseases (14, 15). It has been suggested that fermented foods offer a means of safe microbial exposure to compensate for the absence/removal of desirable host microbes (15, 16). Due to these potential benefits and to an increasing appreciation that the study of these foods provides valuable fundamental insights into simple microbial communities (17, 18), developing an even greater understanding of the microbiology of these foods has the potential to be of considerable value.

Advances in high-throughput sequencing technology have revolutionized the study of microbial populations, including those present in foods. Although, to date, the vast majority of studies relating to fermented foods have employed amplicon sequencing to study bacterial and fungal composition (1936), there have been some exceptional studies in which shotgun sequencing has been used to gain a greater insight into the taxonomy and functional potential of specific fermented foods (3750). Despite this, studies across a broad variety of such foods using this approach have been lacking to date. Here, we address this issue by using shotgun metagenomic sequencing to investigate the microbiota of a broad range of artisanal fermented foods, including many that were previously unexplored.

RESULTS

Fermented-food microbiomes can be distinguished on the basis of substrate type.

The microbiomes of the consumable portion of 58 fermented-food samples (347,841,507 total reads; with an average of 5,997,267 reads per sample), the majority of which represented three main substrate types, i.e., dairy (such as kefir and cheese; n = 11), brine (such as sauerkraut and kimchi; n = 26), and sugar (such as kombucha and water kefir; n = 18), was investigated through shotgun metagenomic. Analysis of these data and other associated metadata (i.e., country of origin [“country”], specific source of product [“producer”], presence or absence of starter culture [“fermentation”], solid or liquid foods [“state”], and [“substrate”]) (Table 1), revealed that the microbiomes of these foods most significantly clustered on the basis of food substrate (Table 2 and Fig. 1). Ten characteristics of the food microbiome were defined, and differences across these characteristics were statistically examined (Table 2).

TABLE 1.

Fermented foods and metadataa

Sample ID Origin Producer Substrate State Fermentation
Wagashi rind FS00a Benin 1 Dairy Solid Starter
Wagashi core FS00b Benin 1 Dairy Solid Starter
Bread kvass FS01 Russia 2 Sugar Liquid Starter
Carrot kimchi FS02 UK 2 Brine Solid Spontaneous
Boza FS03 UK 2 Sugar Liquid Starter
Turnip FS05 UK 2 Brine Solid Spontaneous
Orange FS06 UK 2 Sugar Solid Spontaneous
Krauthehi (sauerkraut) FS07 Germany 2 Brine Solid Spontaneous
Tepache FS08 Mexico 2 Sugar Liquid Spontaneous
Ginger beer FS09 UK 2 Sugar Liquid Spontaneous
Tempeh FS10 UK 2 Soy Solid Starter
Cucumber FS11 UK 2 Brine Solid Spontaneous
Milk kefir FS12 UK 2 Dairy Liquid Starter
Water kefir FS13 UK 2 Sugar Liquid Starter
Tofu chili FS16 China 3 Soy Solid Spontaneous
Daikon FS17 China 3 Brine Solid Spontaneous
Pickled vegetables FS19 China 3 Brine Solid Spontaneous
Raw sauerkraut and juniper berries FS22 Ireland 4 Brine Solid Spontaneous
Brown rice amazake FS23 Japan 4 Brine Solid Spontaneous
Beetroot kvass FS24 Ireland 5 Brine Liquid Starter
Kefir and fennel soup FS25 Ireland 5 Dairy Liquid Starter
Mead FS26 Ireland 5 Sugar Liquid Spontaneous
Sauerkraut FS27 Ireland 5 Brine Solid Spontaneous
Dill dearg (sauerkraut) FS28 Ireland 6 Brine Solid Spontaneous
Kimchi FS29 Ireland 6 Brine Solid Spontaneous
Golden child (sauerkraut) FS30 Ireland 6 Brine Solid Spontaneous
Water kefir hibiscus FS31 Ireland 6 Sugar Liquid Starter
Water kefir lemon FS32 Ireland 6 Sugar Liquid Starter
Water kefir ginger FS33 Ireland 6 Sugar Liquid Starter
Kombucha vinegar FS34 Ireland 6 Sugar Liquid Starter
Ryazhenka FS35 Russia 7 Dairy Liquid Starter
Agousha FS36 Russia 7 Dairy Liquid Starter
Rostagroèkport vorožnyj FS37 Russia 7 Dairy Solid Starter
Ruž’a FS38 Russia 7 Dairy Solid Starter
Sauerkraut FS39 Ireland 8 Brine Solid Spontaneous
Kombucha FS40 Ireland 8 Sugar Liquid Starter
Apple cider vinegar FS41 Ireland 8 Sugar Liquid Starter
Raw milk kefir FS42 Ireland 9 Dairy Liquid Starter
Pasteurized milk kefir FS43 Ireland 9 Dairy Liquid Starter
Water kefir (pear, ginger, and honey) FS44 Ireland 9 Sugar Liquid Starter
Water kefir (pear, ginger, and sugar) FS45 Ireland 9 Sugar Liquid Starter
Dilly carrots FS46 Ireland 10 Brine Solid Spontaneous
Brussels sprout kimchi FS47 Ireland 10 Brine Solid Spontaneous
Kimchi FS48 Ireland 10 Brine Solid Spontaneous
Garlic kraut FS49 Ireland 10 Brine Solid Spontaneous
Dukkah kraut FS50 Ireland 10 Brine Solid Spontaneous
Ginger sliced in 2% brine FS51 Ireland 10 Brine Solid Spontaneous
Daikon radish in 2% brine FS52 Ireland 10 Brine Solid Spontaneous
Okra in 2% brine FS53 Ireland 10 Brine Solid Spontaneous
Tomatoes and mustard seeds in 2% brine FS54 Ireland 10 Brine Solid Spontaneous
Kombucha FS55 Ireland 10 Sugar Liquid Starter
Cherry water kefir FS56 Ireland 10 Sugar Liquid Starter
Beet kvass FS57 Ireland 10 Brine Liquid Starter
Coconut kefir FS58 Ireland 5 Coconut_kefir Liquid Starter
Carrot sticks FS59 Ireland 5 Brine Solid Spontaneous
Labne FS60 Ireland 5 Dairy Solid Starter
Lemon and ginger fizz FS61 Ireland 5 Sugar Liquid Starter
Scallion kimchi FS62 Ireland 5 Brine Solid Spontaneous
a

“Origin” indicates country of origin, “Producer” is a numeric code for each producer who supplied foods, “Substrate” lists the main ingredient fermented, “State” discriminates between solid and liquid foods, and “Fermentation” refers to whether a starter culture was used (starter) or not (spontaneous).

TABLE 2.

ANOSIM results in order by descending R statistica

R statistic Level Variable P Padj
0.651 Family Type 0.001 0.008
0.551 Genus Type 0.001 0.013
0.514 Carbs Type 0.001 0.004
0.436 Species Type 0.001 0.050
0.345 Superfocus level 3 Type 0.001 0.004
0.289 Superfocus level 1 Type 0.001 0.005
0.280 Phylum Type 0.001 0.006
0.221 Carbs Producer 0.001 0.004
0.210 Superfocus level 2 Type 0.001 0.005
0.202 Family Fermentation 0.001 0.006
0.171 Species Fermentation 0.001 0.017
0.169 Species State 0.001 0.025
0.167 Family State 0.001 0.007
0.163 AMR Type 0.004 0.010
0.160 Species Producer 0.003 0.008
0.154 Carbs Fermentation 0.001 0.003
0.149 Genus Fermentation 0.001 0.010
0.117 Superfocus level 1 State 0.002 0.006
0.111 Superfocus level 3 Fermentation 0.002 0.006
0.106 AMR Fermentation 0.005 0.012
0.097 Genus State 0.007 0.015
0.094 Superfocus level 3 State 0.006 0.013
0.093 Superfocus level 1 Fermentation 0.002 0.006
0.080 Superfocus level 2 Fermentation 0.006 0.014
0.076 Superfocus level 2 State 0.012 0.024
0.073 Carbs State 0.019 0.035
0.070 Bacteriocin State 0.018 0.035
a

Only results that remained significant (P < 0.05) after Benjamini-Hochberg corrections (i.e., Benjamini-Hochberg adjusted P values [Padj]) are included here (see the full table in Data Set S1, sheet 8). AMR, antimicrobial resistance; Carbs, carbohydrates.

FIG 1.

FIG 1

Beta diversity. (A) Nonmetric multidimensional scaling (NMDS) of Bray-Curtis distances between 58 samples, calculated for species-level composition. Samples are colored by substrate. (B) NMDS of Bray-Curtis distances between 58 samples, calculated for the Superfocus level 3 composition. Samples are colored by substrate. (C) NMDS of Bray-Curtis distances of carbohydrate pathways assigned with HUMAnN2. Samples are colored by substrate. (D) Maximum-likelihood phylogenetic tree of 16 Lactococcus lactis strains from different food samples. Strains are colored according to food substrate source. All figures show clear shifts in samples/strains by substrate.

DATA SET S1

(Sheet 1) Species level relative abundance of 58 foods. (Sheet 2) Superfocus level 1 relative abundance of 58 foods. (Sheet 3) Superfocus level 2 relative abundance of 58 foods. (Sheet 4) Superfocus level 3 relative abundance of 58 foods. (Sheet 5) Bacteriocin profile (in gene counts) of 58 foods. (Sheet 6) Fermented-food groups used for statistical analyses including N number of samples in each group. (Sheet 7) The genes that were shown in previous studies to be necessary for some probiotic attribute. These genes were then searched for in the metagenomes in this study. (Sheet 8) ANOSIM results for all 50 profiles. (Sheet 9) Table showing foods included in this study and the recipes used to make them. Some of the recipes are directly from the producer. When this information was not available, recipes were created from other online resources. Download Data Set S1, XLSX file, 1.0 MB (1,000.2KB, xlsx) .

Copyright © 2020 Leech et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

Taxonomy was the most distinguishing feature of the food substrates, as measured by the R statistic, supported by nonmetric multidimensional scaling (NMDS) plots and partial least-squares discriminant analysis (PLS-DA) (Fig. 1 and 2; Table 2). Substrate-related differences were greatest at the family level but were also significant at the species, genus, and phylum levels (Table 2). To further determine whether taxonomic differences at species level across substrates extend to the strain level, a further analysis of Lactococcus lactis-assigned reads, selected on the basis of being the species present across the greatest number of food samples, revealed that strains also phylogenetically cluster according to food substrate (Fig. 1), with samples of the same type having a lower cophenetic distance than samples of different types (P < 0.05). There was no clustering of L. lactis strains according to any other factor. Functional analysis revealed that substrate had the most considerable impact on the functional profile of the foods (Table 2 and Fig. 1). Carbohydrate pathways also most considerably differed across the food groups (Table 2). Indeed, of the features examined, the bacteriocin gene profile was the only characteristic that was not statistically different across the food substrates.

FIG 2.

FIG 2

PLS-DA variance of sample clustering according to fermentation process and primary substrate. Constrained PLS-DA ordination of samples according to fermentation process illustrates that not all samples exhibit coordination of detected species composition that is dependent on the classification of the fermentation process. Samples deviating from the core fermentation-type clusters show unique compositions. PLS-DA, partial least-squares discriminant analysis. Ellipses represent confidence levels of 0.9 for the respective data. Axis plots are boxplots of the plotted data, illustrating the distribution of samples according to axis.

Three foods tested did not correspond to one of the three main food substrates, and their microbiomes were also distinct (Fig. 1). Two of these were derived from soy-based fermentations, which are known for their alkaline fermentation environment (51), and the third was a coconut kefir, i.e., a dairy kefir grain-based fermentation but of a coconut carbohydrate.

Starter presence/absence, solid/liquid state, and producer contribute to differences in microbiota.

Although less obvious from a clustering perspective, other factors, such as starter presence/absence, solid/liquid state, and producer, were also significant drivers of microbiome differences (Fig. S1, Table 2). The presence or absence of a starter culture was associated with differences in family, species, carbohydrate, genus, Superfocus level 3 (SF3), and the antimicrobial resistance (AMR) profile of foods (in order of descending effect size), but to a lesser extent than substrate. Superfocus software assigns function to a metagenome and collapses the functions into 3 levels of specificity, with level 3 being the most specific. Solid/liquid state was significant at three taxonomic levels and all four functional profiles (three Superfocus levels and HUMAnN2 carbohydrate pathways), but again with a smaller effect size than substrate and starter status (Table 2). However, it was the only factor that was associated with significant differences across bacteriocin profiles. The specific producer of the foods was reflected by the carbohydrate-related functions and species composition, but the country of origin, in instances where a sufficiently large number of samples were sourced from a specific country, did not influence any of the factors investigated (Table 2).

FIG S1

NMDS plots of the Bray-Curtis distances of the species in 58 fermented foods colored by substrate (A), fermentation starter or spontaneous (B), country of origin (C), producer (D), and solid or liquid foods (E). Download FIG S1, TIF file, 2.3 MB (2.4MB, tif) .

Copyright © 2020 Leech et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

Microbial diversity differs between dairy foods and other food types.

Overall, 476 unique species, present at above 0.1% relative abundance, were assigned to the 58 foods, 301 different species of which were detected in brine foods, 242 in sugar foods, and 70 in dairy foods. This corresponded to an average of 11.5, 13.5, and 6.4 different species per sample for brine, sugar, and dairy foods, respectively. In line with these results, alpha-diversity analyses demonstrated that the microbiomes of dairy-based fermented foods had significantly lower alpha diversity than those of either brine or sugar foods (Fig. 3), which did not significantly differ from one another. It was also evident that, as expected, the alpha diversity of spontaneously fermented foods was significantly higher than those produced using starter cultures (Fig. 4). Across the specific foods, a spontaneously fermented orange preserve contained the highest number of species (n = 67), while a sample of tepache, a slightly alcoholic spontaneously fermented drink from Mexico, contained the lowest number of observed species (n = 12).

FIG 3.

FIG 3

Alpha diversity by substrate. (A) Number of species (abundance >0.1%) per sample. Analysis of variance (ANOVA) was used since the data had a normal distribution. (B) Shannon index of samples. Kruskal-Wallis was used since the data were nonparametric. (C) Simpson’s diversity index of samples. Kruskal-Wallis was used since the data were nonparametric. For all three panels, pairwise tests were carried out between dairy, brine, and sugar (t test for parametric and Wilcoxon pairwise test for nonparametric). Coconut kefir and soy had insufficient sample sizes for pairwise comparisons.

FIG 4.

FIG 4

Differences by fermentation. (A) AMR profile of spontaneous fermented foods and starter culture foods. The AMR classes are normalized by counts per million per sample (CPM). (B) Alpha-diversity boxplots examined across fermentation type (spontaneous or starter). A t test was used for number of species since the data were parametric; a Wilcoxon test was used for the Shannon diversity index and Simpson’s index since the data were nonparametric.

Lactic acid bacteria dominate brine foods.

The brine-type foods tested comprised 26 plant substrate-derived foods fermented in a saline solution. Unlike both dairy- and sugar-type fermented foods, the majority of the brine-based foods undergo a spontaneous fermentation and therefore rely on fermentation by autochthonous microbes (52). Brine foods mostly contained bacteria, with fungal assignment being 3.9% of the phylum level relative abundance. Archaea accounted for less than 0.5%. Among brine-type foods, Lactobacillus was the most abundant genus, comprising 46.8% of all reads assigned at the genus level. Lactiplantibacillus plantarum was the most abundant species (9.6% relative abundance on average), followed by L. brevis (7.9%), L. mucosae (4.7%), L. xianfangensis (4.1%), and L. sakei (3%). Leuconostoc mesenteroides (4.7%) and Pediococcus parvulus (4.3%) were also present in significant quantities. Across the brine-type foods, Bifidobacteriaceae were detected at a relative abundance of 1.6%. At the species level, 0.8% of species were assigned as Bifidobacterium longum, and 0.01% were assigned to B. breve. No other bifidobacteria were assigned at the species level.

The seven sauerkraut samples and five kimchi samples analyzed contained many of the genera regarded as being typical of these foods, such as Lactobacillus, Leuconostoc, the yeast Pichia, Rahnella, and the yeast Kazachstania (21, 24). Some more unusual species were found at low abundance, including Perkinsus marinus (0.33%), an oyster pathogen, which was detected in a scallion kimchi sample and possibly originated from the shrimp paste used in the manufacture of this kimchi. Other brine-type fermented foods were investigated using this approach for the first time. These included a lacto-fermented cucumber, fermented green tomatoes, and beet kvass, the latter being classified as a brine-type food since, unlike other typical kvass, this product contained large quantities of salt and no added sugar. These foods contained many plant-associated taxa, including large proportions of Pseudomonas, Lactobacillus, and Pediococcus. Brown rice amazake was particularly notable by virtue of containing a high relative abundance of B. longum (7.3%). A corresponding metagenome assembled genome (MAG) was recovered and found to be a 98.8% match with a previously sequenced B. longum E18 genome. In total, B. longum was detected across 4 of the 7 sauerkraut samples, 3 of the 5 kimchi samples, and 9 of the 13 other brine samples. The microbiota of other vegetables fermented in brine (Table 1) is described in greater depth in Data Set S1, sheet 1, in the supplemental material.

From a functional potential perspective, 18.4% of Superfocus level 1 (SF1) functions within the brine food microbiome were predicted to relate to carbohydrate metabolism. When functional pathways were investigated at a deeper level, xylose utilization (0.6%, SF3), fermentation (1.4%, SF2), and response to osmotic stress (1%, SF2) were among the most common functionalities (Data Set S1, sheets 2 to 4). A complete list of the relative abundances of the Superfocus pathways, for all foods, can be found in Data Set S1, sheets 2 to 4.

The microbiota composition of dairy foods is more homogeneous than that of other fermented foods.

Eleven dairy-type fermented foods were studied. Information supplied by the producers established that all of these foods were produced through the use of starter cultures to initiate fermentation, thus likely contributing to their reduced diversity relative to other foods (21). Dairy foods contained the least eukaryote-assigned reads, with 1.6% of the phylum-level relative abundance assigned to fungal groups. Firmicutes (89.2%) and Proteobacteria (7.9%) dominated. L. lactis dominated at the species level, corresponding to, on average, 44.8% of relative abundance and was present at a relative abundance at or above 90% in three of the dairy foods, all of which were kefir or kefir-type foods. The next most abundant species was Streptococcus thermophilus (16%), followed by S. infantarius (5.7%), the yeast Kluyveromyces marxianus (3.7%), Escherichia coli (3.5%), Lactococcus raffinolactis (3%), and L. mesenteroides (2.9%). It is notable that viruses [including (pro)phage] also made up a significant portion of the dairy food microbiota (7.8%).

Kefir composition, including that of agousha, a Russian beverage akin to kefir, was consistent with previous studies (25, 53, 54), although some taxa not previously associated with kefir, such as Bifidobacterium mongoliense, were detected. Two samples of kefir were fermented with the same kefir grain but differed in that one was made with pasteurized milk (FS43), while the other was made with unpasteurized milk (FS42) (Table 1). While there were a number of species found in only one of these two samples, only one, Pseudomonas helleri, found in pasteurized milk, was present at above 1% abundance (3%). Profiling of a samples of wagashi, a cheese from Benin, for the first time revealed that the core and the rind had a similar taxonomic composition, and included S. infantarius, K. marxianus, and a worrying abundance of Escherichia coli. Ryazhenka, a fermented baked milk product from Russia, was enriched with S. thermophilus (33.2%). Other species, such as S. macedonius (2.8%), were also present, but the most striking feature of this food was the 54% assignment to viruses. Kraken (55) was used to investigate the viral component in more detail. A total of 97% of these reads were assigned to the order Caudovirales and, in turn, 99.9% of these Caudovirales were assigned as Streptococcus phage, with Streptococcus phage DT1 (57%) being the most abundant. Ruž’a, a Russian cheese, had a large relative abundance of L. lactis (50.1%), followed by L. mesenteroides (25%) and L. raffinolactis (15.9%). Another Russian sample, rostagroèkport vorožnyj, a quark-like fermented snack, was high in S. thermophilus (50.5%). Irish labne consisted mainly of S. thermophilus (86%) but also contained Lacticaseibacillus paracasei (3.2%) and L. casei (2.3%).

At a functional level, carbohydrate metabolism (16.7%) was the most abundant SF1 pathway in fermented dairy. SF2 results highlighted the presence of genes with homology to those encoding resistance to antibiotics and the production of toxic compounds (2.8% of the reads). Several of the most abundant SF3 pathways in dairy foods had phage related functions, including the most abundant function, i.e., phage head and packaging (3.2%).

Sugar foods are dominated by Acetobacteraceae.

Eighteen sugar-type fermented foods were assessed, including fermented fruit, kombucha, and water kefir. Some of these foods, such as kombucha, kvass, and water kefir, contained large quantities of added table sugar, whereas the substrates used for the production of fermented orange or mead, honey, and water, had naturally high levels of sugar. Furthermore, although these foods were all assigned to the “sugar foods” category (Table 1), they encompassed a wide variety of raw ingredients and fermentation methods, including examples of both spontaneous and starter-type fermentations.

Sugar based fermentations contained the highest abundance of reads assigned to fungi, with 19.7% of phylum level reads assigned to Ascomycota. Similar to the other foods, Proteobacteria (48.9%) and Firmicutes (28.2%) dominated. Sugar foods contained many species previously associated with alcohol-generating fermentations, such as the yeasts Saccharomyces eubayanus (2.7%), Brettanomyces bruxellensis (5.2%), Hanseniaspora valbyensis (9.3%), and the bacterial species Oenococcus oeni (5%). Many of the other species were well-known kombucha-associated species such as Gluconobacter oxydans (5%), Acetobacter cerevisiae (2.5%), and Komagataeibacter rhaeticus (2%). At the species level, H. valbyensis was the most abundant (9.3% average abundance). However, this reflects very high abundance in specific instances, e.g., the relative abundance in mead was 93.7%, whereas this species was not detected in 10 of the other 18 sugar-type fermented foods. Lactobacillus was the most abundant genus (25.8%), but its abundance was lower than that found for dairy and brine foods. Within this genus, L. mali (7.6%) and L. plantarum (5.3%) were the most common species. Acetobacter was the next most abundant genus (10.9%), and its distribution, along with other members of the Acetobacteraceae, made it the most abundant family (33.3%).

Among specific sugar food types, seven samples of water kefir were analyzed, and typical water kefir-associated taxa, including Kluyvera, Gluconobacter, Brettanomyces, Acetobacter, and Lactobacillus (27), were found. In addition, Ethanoligenens harbinense, a species previously found in the wastewater that results from molasses production (56), was present in three of the water kefir samples. Two kombucha samples and a kombucha vinegar sample were examined. Typical kombucha microorganisms were identified (22, 23, 26). However, while the genera were similar, the abundance and type of species differed, e.g., Komagataeibacter and Acetobacter, were present in both kombucha samples, but one sample contained 13.7% K. xylinus and 5.2% A. okinawensis, while the other sample had 34.2% K. rhaeticus and 4.1% A. senegalensis (see Data Set S1, sheet 1). The microbiota of tepache, a slightly alcoholic Mexican fermented beverage, was investigated through the use of shotgun sequencing for the first time. Tepache contained the lowest number of species of all foods, consisting mainly of L. plantarum (85%), Levilactobacillus brevis (4.6%), and Acetobacter syzygii (3.6%). Mead, produced using autochthonous microbes present in honey, contained four different species of the yeast Hanseniaspora, including the aforementioned H. valbyensis. Hanseniaspora has not been described in mead previously, with Saccharomyces generally being the most common genus (57), and is widely used for industrial-scale mead production (58). The mead sample was also notable by virtue of generating by far the highest relative abundance of reads assigned to eukaryotes (>96%). In addition to Hanseniaspora, Zygosaccharomyces rouxii, Torulaspora delbrueckii, Saccharomyces cerevisiae, and a rare yeast, Saitoella complicate, were the other fungal species identified. The microbiomes of boza, orange preserve, apple cider vinegar, ginger beer, lemon and ginger fizz, bread kvass, and beet kvass are also presented in Data Set S1, sheet 1.

The most abundant SF1 function found in sugar foods was carbohydrate metabolism (14.5%). Resistance to antibiotics and toxic compounds (3.8%) and osmotic stress (1%) were the most common SF2 functions, while analysis of SF3 pathways highlighted the frequency of several pathways involved in the synthesis of amino acids, such as both methionine (0.79%) and purine (0.68%) biosynthesis.

The fermented-food resistome differs according to food and fermentation type.

Large variability in both the counts per million of antimicrobial genes (CPM) and of antimicrobial resistance (AMR) class were apparent across the different foods, with AMR profiles significantly differing across substrate and in line with the presence/absence of a starter (Fig. 4A, Fig. 5D, and Table 2). Dairy had an average of 3,686 CPM per sample, brine had 426 CPM, and sugar had 261 CPM. However, the core and the rind of wagashi inflated the dairy results and, if these are excluded, the average CPM for dairy foods dropped considerably to 1,947.

FIG 5.

FIG 5

Descriptive plots. (A) Heatmap showing the square root of the relative abundance of the top 25 species across all foods. Metadata categories are shown along the top x axis. Both rows and columns are clustered according to similarity. (B) Heatmap showing the relative abundance of the bacteriocin profile binned according to food substrate. (C) Heatmap showing the square root of the relative abundance of the Superfocus level 1 pathways. (D) Antimicrobial resistance (AMR) genes in CPM per food (pink), per milk sample (blue), and per human sample (green). Thirteen of the sixteen milk samples and nine fermented-food samples are not shown since no AMR genes were detected in these samples. Metadata for the food substrate are indicated by the boxes on the left of the CPM bars.

With respect to specific AMR classes, multidrug resistance was the most commonly assigned gene category across all three food substrates, corresponding to 2,422, 293, and 133 CPM per sample on average for dairy-, brine-, and sugar-type foods, respectively. Beta-lactam resistance genes were the next most common class in dairy (718 CPM) and sugar (101 CPM) foods, while tetracycline resistance genes were the second most numerous category of AMR genes in brine (45 CPM). It was also noted that a 5-fold-higher abundance of AMR genes occurred in starter culture fermentations relative to spontaneous fermentations. Multidrug resistance genes again dominated, corresponding to 1,326 CPM for starter cultures and 236 CPM for spontaneous fermentations. Beta-lactam resistance genes were the next highest in foods containing starter cultures (428 CPM), whereas tetracycline resistance genes were next highest in spontaneously fermented foods (48 CPM). The high CPM for both dairy and starter containing foods are consistent with the fact that dairy foods were those for which starters were most extensively used. When gene distribution was investigated from the perspective of specific food substrates, the wagashi cheese rind was found to have the highest CPM, i.e., 17,381, with tempeh being next highest at 5,657 CPM. AMR genes counts in kombucha and water kefirs were generally low, and no known AMR genes were identified in 9 of the 58 foods, i.e., 1 kombucha, 2 water kefirs, 3 kimchi, 1 pickled carrot, 1 pickled vegetable, and 1 apple cider vinegar. Of the nine fermented foods for which no AMR genes were assigned, four were sugar-type (including two water kefirs) and five were brine-type (including three kimchis). It is notable that very few AMR genes were assigned in the two other kimchis studied (<42 CPM), while across the five other water kefir samples, three contained very few AMR genes (<6 CPM) but two had relatively high counts (>1,000 CPM). Across the two samples of kombucha, one did not contain assigned AMR genes, while the other contained 1.6 CPM.

To provide context, the frequency with which AMR genes are detected in fermented foods was compared to that across human stool samples and unfermented milk samples (Fig. 5D). Milk samples (n = 16) were from various stages of processing to produce skim milk powder, including unpasteurized milk silos and dry milk powders. Although a subset of three milk (unpasteurized) samples had relatively high AMR CPM, milk samples generally had lower AMR CPM than did fermented-food or human gut samples (P > 0.01). In contrast, human gut samples (29 random stool samples from the Human Gut Microbiome Project [59]) had significantly more AMR CPM than fermented foods and milk (P > 0.01). However, eight fermented foods had particularly high CPM of AMR genes. These eight foods were the two wagashi cheese samples, tempeh, fermented ginger, three milk kefirs, and labne. Of these eight foods, six were dairy, and seven were starter-generated foods. For a further 12 foods, the AMR CPM were similar to those for human samples, whereas 38 foods had AMR CPM that were lower than those for human samples.

The presence of putative health promoting genes differs markedly across fermented foods but exceeds that of nonfermented foods.

Bacteriocins are ribosomally synthesized antimicrobial peptides, many producers of which have been sourced from fermented foods. The bacteriocin-producing potential across the 58 fermented-food samples was investigated, with 55 putative bacteriocin-encoding gene clusters being assigned across 54 of the foods (no gene clusters identified in 4 samples (Data Set S1, sheet 5). Zoocin A- and enterolysin A-like gene clusters were highly abundant across all three fermented-food substrates. Clusters corresponding to another bacteriolysin subclass, the helveticin J-like proteins, were more frequently detected in dairy and sugar-type foods than in brine-type foods (Fig. 3B). Carocin D- and colicin A-like clusters had a high abundance in brine and sugar, but not dairy, foods. As noted above, there was a significant difference in the distribution of bacteriocins between solid and liquid food types (Table 2), with liquid foods having a higher relative abundance of helveticin J, propioncin F-like and pediocin clusters, and solid foods having more carnocin CP52-like and microsin 24-like clusters. Examining the pediocin sequences in more detail revealed homology with pedA and pedB.

Given that bacteriocin production is regarded as a probiotic trait, these findings prompted an investigation of other potentially health-associated gene clusters (PHAGCs) within these fermented-food microbiomes. PHAGCs were divided into three broad categories. Gene clusters binned as “survival” are genes that were shown to be important for surviving the low pH of the stomach or the bile salts of the small intestine (60). Gene clusters binned as “colonization” are genes that were shown to be vital for colonizing the gut microbiome. These included genes responsible for surface proteins and exopolysaccharide production. “Modulation” gene clusters were all of the other potentially health promoting gene clusters that did not fit the previous two bins. These genes were shown to affect the host phenotype in other ways, such as stimulating the host immune system in the case of d-phenyl-lactic acid (13) or the production of γ-aminobutyric acid (GABA) (61, 62). The majority of these PHAGCs genes are based on studies reviewed in reference 60). Shotgun metagenomic data from nonfermented foods, i.e., unpasteurized whole milk, pasteurized skimmed milk, and milk powder, was used for comparative purposes. In general, the fermented foods contained considerably more PHAGCs than the nonfermented substrates. Among the fermented foods, a larger number of PHAGCs were found in brine- and sugar-type foods than in dairy foods, with several water kefirs, sauerkrauts, beet kvasses, and one kombucha being the foods with highest levels of PHAGCs (Fig. 6). With respect to the individual PHAGC subcategories, all fermented foods contained more colonization-type PHAGCs than the nonfermented controls. In the case of the modulation and survival clusters, the number of PHAGCs in some fermented foods, such as scallion kimchi, labne, agousha, and mead, were no greater than those in the nonfermented foods.

FIG 6.

FIG 6

Heatmap showing the presence of potentially health-associated gene clusters (PHAGCs) across all 58 foods and 16 unfermented milk samples. Gene clusters are binned as potentially inferring an ability of the metagenome to colonize the gastrointestinal tract, survive transit to the gut, and modulate the host phenotype. Each row is normalized across all samples, thus only comparing foods to one another.

Metagenomic assembly reveals 10 putative new species.

Metagenome assembled genomes (MAGs) were assembled from the reads and quality checked. A total of 443 MAGs were assembled in total, with 127 genomes above 80% completeness and having less than 10% contamination (Fig. 7A). Traitar (63) was used to predict the growth phenotypes of the 127 MAGs. The outputs were concatenated into a single output for each food substrate (Fig. 7B) and provided intuitive results, such as a high correlation between lactose utilization and dairy foods and high glucose oxidation potential in sugar food microbiomes. Consilience between the Traitar and taxonomic output is supported by the abundance of L. lactis in dairy and brine samples. FastANI (64) was used to assign taxonomy and to assess novelty and established that 10 of these MAGs had an <95% identity to known NCBI prokaryote genomes. Seven of these potentially novel MAGs are acetic acid bacteria, two are lactic acid bacteria, and one belongs to the family Yersiniaceae (Table 3). The highest identity match for three of the novel MAGs was Acidisphaera rubrifaciens. All three of these MAGs came from water kefir. The four remaining acetic acid bacteria were best matched with Acetobacter aceti (MAG from water kefir), Gluconobacter cerinus (MAG from bread kvass), and Acetobacter malorum (MAGs from rostagroèkport vorožnyj and apple cider vinegar). The two novel LABs were best matched with Leuconostoc gelidium (sauerkraut MAG) and Companilactobacillus kimchiensis (boza MAG). The final novel MAG, from the water kefir microbiome, most closely resembled Rouxiella chamberiensis.

FIG 7.

FIG 7

Metagenome assembled genomes. (A) Phylogenetic tree of the 127 high-quality MAGs, with outer rings showing the metadata for the food. The green arrows indicate which MAGs are potentially novel species. (B) Predicted phenotypes of the 127 MAGs concatenated into their respective substrate. Both rows and columns are clustered according to similarity.

TABLE 3.

Putatively novel MAGs with FastANI identity scores to the closest genome in the NCBI database

Food Sample Closest NCBI match % Identity
Bread kvass FS01 Gluconobacter cerinus 93.4228
Raw milk kefir FS41 Acetobacter malorum 86.3852
Sauerkraut FS39 Acetobacter malorum 85.9458
Boza FS03 Companilactobacillus kimchiensis 82.2453
Water kefir lemon FS32 Rouxiella chamberiensis 81.3335
Golden child (sauerkraut) FS30 Leuconostoc gelidum subsp. gasicomitatum 81.0244
Cherry water kefir FS56 Acetobacter aceti ATCC 23746 78.5186
Water kefir hibiscus FS31 Acidisphaera rubrifaciens HS-AP3 78.4976
Water kefir ginger FS33 Acidisphaera rubrifaciens HS-AP3 78.475
Water kefir lemon FS32 Acidisphaera rubrifaciens HS-AP3 78.0727

DISCUSSION

The practice of fermenting foods can be traced back over many millennia (65). Recently, shifts in consumer preference have resulted in a renewed interest in fermented foods, with the associated global market estimated to reach $40 billion USD by 2024 (66). The development of a better understanding of the microbial composition and functional potential of these foods provides an insight into features that are common among, and different between, fermented foods and ascertain potential roles of individual species, including novel species and strains. Importantly, the taxonomic resolution of shotgun metagenomics allows strain level identification of the microbiome but also facilitates an assessment of functional profile, bacteriocin and AMR gene distribution, determination of PHAGCs, the assembly of MAGs, and the determination of predicted phenotypes. It is important to note that samples were not replicated in order to facilitate the inclusion of a larger overall number of fermented foods. Although this limits the conclusions that can be made for specific foods, a number of notable observations were made across broader fermented-food types. Furthermore, since over 5,000 varieties of fermented foods exist around the world, often with multiple varieties of each of these foods, it was not possible to represent all fermented foods. For this reason, this study has focused on a subset of artisanal fermented foods that are easy to produce at home, and thus generalizations relating to specific substrates are limited to the samples examined here.

Fermentation substrate is the strongest driver of the composition and functional potential of the microbiomes of fermented foods. The type of nutrients available to the microbes determined the diversity within each food to the greatest extent. The biggest effect of substrate was found between the families present in each food substrate, with Lactobacillaceae (linear discriminant analysis [LDA] = 5.68) most persistent in brine foods, Streptococcaceae (LDA = 5.92) in dairy foods, and Acetobacteraceae (LDA = 5.5) in sugar-based foods (Fig. S2 to S8). The different substrates impose functional requirements on the microbes, such as a necessity for osmotic stress tolerance in both brine- and sugar-type foods. While the study focused on substrates that are used commonly for small-scale fermented-food production, including those made most easily in the home, it will be interesting in the future to extend the analysis to other important fermentation substrates such as meat, fish, and grains. The analyses here reflect the composition of the foods typically consumed and not the starter cultures used to produce these foods or the intermediary microbiomes that eventually produce the end product.

FIG S2

LDA plots reveal the attributes driving the differences between the food substrates at the species level. Download FIG S2, TIF file, 2.1 MB (2.1MB, tif) .

Copyright © 2020 Leech et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

FIG S3

LDA plots reveal the attributes driving the differences between the food substrates at the genus level. Download FIG S3, TIF file, 2.0 MB (2.1MB, tif) .

Copyright © 2020 Leech et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

FIG S4

LDA plots reveal the attributes driving the differences between the food substrates at family level. Download FIG S4, TIF file, 2.0 MB (2.1MB, tif) .

Copyright © 2020 Leech et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

FIG S5

LDA plots reveal the attributes driving the differences between the food substrates at the phylum level. Download FIG S5, TIF file, 2.0 MB (2.1MB, tif) .

Copyright © 2020 Leech et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

FIG S6

LDA plots reveal the attributes driving the differences between the food substrates at Superfocus level 1. Download FIG S6, TIF file, 2.0 MB (2.1MB, tif) .

Copyright © 2020 Leech et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

FIG S7

LDA plots reveal the attributes driving the differences between the food substrates at Superfocus level 2. Download FIG S7, TIF file, 2.0 MB (2.1MB, tif) .

Copyright © 2020 Leech et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

FIG S8

LDA plots reveal the attributes driving the differences between the food substrates at Superfocus level 3. Download FIG S8, TIF file, 2.0 MB (2.1MB, tif) .

Copyright © 2020 Leech et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

Other factors, such as the presence or absence of a starter culture, also contributed to differences in that starter culture-derived foods had the lowest alpha diversity, likely a result of adding a community of specialist microbes to the food that have been selected to outcompete the autochthonous microbes. The similar microbiome profiles of two kefir samples made from the same starter, but using raw or pasteurized milk, respectively, highlight this point. The differences in diversity between solid and liquid foods is likely due to the selective pressures of mobility, nutrient availability (in a homogenous liquid compared to a less homogenous solid food), and moisture content in solid foods compared to liquid foods. Unsurprisingly given the diversity of fermented foods, the country of origin did not significantly influence any of the characteristics examined. Outside of composition and top-level functionalities, other traits did vary in line with other categories, in that the bacteriocin gene cluster profile differed significantly across solid and liquid foods, and AMR-encoding genes differed across food substrate and between spontaneous and starter-type fermentations. It is unclear why bacteriocin gene clusters differed across solid and liquid foods, but perhaps the matrices of solid foods require different ecological tools for competitive advantage than liquid substrates.

Analysis revealed that the microbiomes of starter culture-type fermentations contain more assigned AMR-associated genes. However, this difference could represent the more extensive characterization of starter culture microbes and their associated genomes and AMR profiles, leading to better assignment of AMR genes from starter cultures strains than those involved in spontaneous fermentations. In comparison with human gut metagenomes, the majority of the fermented foods had a lower AMR CPM. Of the eight foods with higher AMR CPM, only three stood out as having considerably higher CPM; two were subsamples of the same food, i.e., wagashi cheese. In contrast, kimchi and kombucha samples were notable by virtue of either lacking detectable AMR genes or having very low CPM. Kimchi shared many taxa with other brine-type foods, so the differences observed may reflect strain-level differences. Comparisons with the unfermented milk samples showed that fermented foods have less AMR CPM than raw milk but more than pasteurized milk and its biproducts. Metagenomic sequencing of a larger collection of these fermented foods, coupled with antibiotic resistance assessments of isolated strains, will be necessary to determine how representative these results are.

Bacteriocin production is regarded as a probiotic trait. These peptides and, in the case of bacteriolysins, proteins are thought to be produced by bacteria to gain a competitive advantage over other taxa, typically those occupying the same environmental niche. Bacteriocin production can contribute to the quality and safety of foods through the removal of spoilage and pathogenic bacteria, but bacteriocin production in situ in the gut can also enable the producing bacteria to become established, compete against undesirable taxa, and contribute to host-microbe dialogue (67, 68). The bacteriocin profile did not differ according to food substrate, with zoocin A- and enterolysin A-like genes being most abundant across all food substrates. However, the bacteriocin-associated genes present in solid and liquid foods differed significantly from one another in that liquid foods were enriched with pediocin-like genes. After a further analysis of the pediocin sequences, homology with pedA and pedB, required for the production of pediocin AcH/PA-1, was apparent. These bacteriocins are best known for their strong antilisterial effects (69). Pediocin AcH/PA-1 has also been shown to be active against enterococci and staphylococci (70), and the presence of these genes potentially adds to the safety of these foods and their potential to be health promoting. Solid foods had a higher abundance of carnocin CP52-like bacteriocins, which are known for activity against Listeria and Enterococcus, again potentially adding to the safety of these foods (71).

Across a broader range of PHAGCs, it was apparent that these gene clusters were more common in fermented than in nonfermented foods. Sugar and brine foods were found to contain the highest levels of PHAGCs. Microbes in sugar-type foods generally must persist in low-pH environments, with some kombucha fermentations dropping to as low as pH 3 (72). In contrast, although also somewhat acidic, a milk kefir fermentation is regarded as complete when the pH reaches 4.5 (73), while the pH of most cheeses is between pH 5.1 and 5.9. Many of the sugar foods also contained colonization-associated PHAGCs. It was also noted that brine-type foods had the highest abundance of Lactobacillaceae, specific representatives of which have been exploited for their probiotic activity. A combination of these various factors likely contributes to the higher abundance of PHAGCs in both of these foods relative to dairy foods. However, even within the respective food substrate groups, the PHAGCs present varied considerably, with foods such as water kefirs, sauerkrauts, pickled veg, ginger, kvass, and kombucha being enriched in PHAGCs. These foods all contained colonization and survival PHAGCs at a higher frequency, e.g., glycotransferases for colonization in kombucha and pickled veg, and bile salt metabolism genes in water kefir and fermented sliced ginger. d-Lactate dehydrogenase pathways were consistently identified in these foods but were absent from other such as scallion kimchi, carrot sticks, and agousha. This observation is notable as d-lactate dehydrogenase is the enzyme responsible for producing d-phenyl-lactic acid (d-PLA), a metabolite known to modulate the host immune system (13). Glutamate decarboxylase, which converts glutamate into γ-aminobutyric acid (GABA), was present in some (kombucha, kvass, coconut kefir, and some water kefir samples), but not all, PHAGC-enriched foods. GABA is a well-known modulator of mood (74), while this enzymatic reaction also consumes protons and thus contributes to acid resistance (75). Although in vivo studies are required to directly examine the health benefits of specific fermented foods, these insights can undoubtedly help to identify foods and strains that are more likely to be health promoting, facilitate the production of fermented foods optimized for health promotion, and direct the experimental design of human intervention studies.

Finally, this study discovered 127 high quality MAGs, of which 10 are putative novel species. Three putative new Acetobacter species from water kefir, milk kefir, and sauerkraut, a Gluconobacter from bread kvass, a Leuconostoc from sauerkraut, and a Companilactobacillus from boza were assembled from the shotgun data. While these species are apparently novel, the corresponding genera are found in fermented foods at a high frequency. However, two MAGs representing genera that have not been found in fermented foods before were assembled, i.e., a Rouxiella species and three Acidisphaera species, all from water kefir samples. Rouxiella chamberiensis and Acidisphaera rubrifaciens are the only previously known members of their respective genera. Rouxiella chamberiensis was isolated from parenteral nutrition bags and has been shown to ferment d-glucose but not sucrose (76) and Acidisphaera rubrifaciens has been found in acidic hot springs and mine drainage systems and, like many of the other sugar taxa, is acidophilic (77). The assembly of these and other MAGs in the future will contribute toward the building of fermented food, and other food, microbe databases, equivalent to those available for the more complex human gut microbiome (78), to enable the more accurate and rapid identification of food microbes. Such databases will be key in the application of metagenomics-based approaches on a widespread basis by the food industry.

Overall, this study combines many novel insights into fermented-food microbiomes. First, the taxonomic composition of the 58 foods has been described, including many foods that have not been described using next generation sequencing (NGS) previously. Second, the functional profile of these foods has been characterized and, like the taxonomic profile, highlights the differences between starting material and microbial composition. Importantly, given the current interest in fermented foods as a healthy food choice and the role diet plays in modulating the gut microbiome, the health-promoting potential of the microbes in these various foods has been explored. Finally, genomes, including potentially novel taxa, were assembled from these foods and will contribute to the better assignment of reads from fermented food, and indeed broader food chain microbiome studies, in the future.

MATERIALS AND METHODS

A total of 58 samples of fermented foods were collected from various artisanal producers (Table 1). Foods were sampled when they were ready for consumption. In all cases, the edible portion of the fermentation was sampled. For example, for foods such as milk kefir or kombucha, the starter grain or SCOBY (symbiotic colony of bacteria and yeast), respectively, were not sampled, but rather sampling focused on the liquid portion of these products that is consumed. Portions (5 g) of foods were placed in a stomacher bag. Then, 50 ml of sterile Maximum Recovery Diluent (MRD) was added to the bag. The contents were homogenized in a stomacher (BagMixer 400; Interscience) for 20 min. After this step, both solid and liquid foods were extracted using the same method. Next, 50 ml of the homogenized solution was centrifuged at 10,000 rpm, at room temperature, for 10 min. The supernatant was discarded. The pellet was resuspended in 550 μl of SL buffer in a 2-ml tube (SL buffer from the GeneAll kit described below). Then, 33 μl of proteinase K was added to the tube, followed by incubation at 55°C for 30 min. The solution was then transferred to a bead-beating tube and placed in a Qiagen TissueLyser II for 10 min at 20/s. The GeneAll Exgene extraction protocol in step 4 was then followed until the final elution step; 30 μl of elution buffer (EB) was used here instead of the 50 μl suggested in the protocol.

Sequencing.

Library preparation was carried out according to the Nextera XT protocol (Illumina) (79). DNA was quantified by using a Qubit high-sensitivity dsDNA assay. The final library quality was assessed by using Agilent high-sensitivity DNA chromatin immunoprecipitation, and quantification was done by qPCR using a KAPA library quantification kit (Illumina; Roche). Sequencing was carried out on the NextSeq500 using a 300-cycle High Output v2 kit.

Bioinformatics.

A total of 347,841,507 reads were obtained from the Nextseq sequencing run in the form of Bcl files, which were converted to fastq format using bcl2fastq software. Quality trimming was performed using the trimBWAstyle.usingBAM.pl script. Using Picard (https://github.com/broadinstitute/picard), fastq was converted to Sam format. Picard was also used to remove duplicates. The sequences were then quality checked and trimmed using the trimBWAstyle.usingBam.pl script from the Bioinformatics Core at UC Davis Genome Center (https://github.com/genome/genome/blob/master/lib/perl/Genome/Site/TGI/Hmp/HmpSraProcess/trimBWAstyle.usingBam.pl). Host reads accounted for >8% of the reads. They were not removed since reference genomes were not available for all food substrates. All metagenomes were dealt with consistently, and the low abundance of nonmicrobial reads was low. Forward and reverse reads were then combined into a single fasta file for each sample using the fq2fa command from IDBA-UD (80).

Ten profiles of each microbiome were described. These included four taxonomic levels (species, genus, family, and phylum), four functional profiles (Superfocus 1, Superfocus 2, Superfocus 3, and Carbohydrate functions, which are a subset of HUMAnN2 output), the bacteriocin gene profile, and the antimicrobial resistance gene profile.

Kaiju v1.5.0 (81) was used to assign taxonomy to the reads, using the NCBI BLAST nonredundant protein database, including fungi and microbial eukaryotes, discarding taxa with a relative abundance of <0.1%. This setting was chosen since other studies have shown a high false-positive discovery rate below this threshold (82). All percentages reported at all taxonomic levels are percentages of the assigned reads only. Species-level assignment was updated for lactic acid bacteria, as previously described (83). Superfocus (84) was used to assign functionality to the reads. Superfocus assigns reads to homologues gene families to determine functionality. It collapses these gene families to higher levels of organization for a more generic function. Superfocus level 1 is the highest level of organization, followed by levels 2 and 3, with 3 having the most specific function. Data Set S1, sheet 1, shows the complete list of microbes and their relative abundance for each food. The phylogenetic tree of L. lactis was created in GraPhlAn (85), using the StrainPhlAn (86) output, which used Metaphlan2 (87) taxonomic assignment.

Statistical analyses were carried out in R-3.2.2 (88) using vegan (89). Analysis of similarities (ANOSIM) was carried out between each metadata category containing six or more samples (Data Set S1, sheet 6). The Benjamini-Hochberg false discovery rate was applied to the ANOSIM results. The linear discriminant analysis (LDA) effect size (LEfSe) (90) method was used to determine whether any taxa or pathways were differentially abundant between groups (see Text S1 in the supplemental material).

TEXT S1

LDA of differences between different food substrates. Download Text S1, DOCX file, 0.01 MB (14.2KB, docx) .

Copyright © 2020 Leech et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

Antimicrobial resistance.

Antimicrobial resistome analysis was performed by aligning paired-end metagenomes reads against the MEGAres database (v1.0.1) (91). To reduce type I errors, this database was first manually curated to remove any genes corresponding to antimicrobial resistance arising from point mutations. The alignment was performed using the –very-sensitive-local preset of Bowtie2 (v2.3.4). The Resistome Analyser tool (https://github.com/cdeanj/resistomeanalyzer) was used to format the output, and the results were normalized for sequencing depth across samples as counts per million reads (CPM).

Bacteriocin assignment.

Bacteriocin assignment was performed with the BLAST analysis of the bacteriocin genome mining tool (BAGEL) of the predicted genes with the Prodigal tool against the BAGEL4 bacteriocin databases (92).

Carbohydrate pathways.

The carbohydrate function was assigned to reads with the HUMAnN2 pipeline (93), which assigned the function based on the ChocoPhlan databases and genes based on UniRef (94). To further simplify the exploration of the abundance data of the gene family were grouped into the functional category Gene Ontology (GO), specifically carbohydrate-related functions, performing a more in-depth analysis.

Metagenomic assembled genomes.

Metagenome assembly was carried out using IDBA-UD. MetaBAT 2 (95) was used for genome binning, with default settings. CheckM (96) was implemented to check the quality of metagenome assembled genomes (MAGs). Low-quality MAGs, i.e., <80% completeness and/or >10% contamination, were removed from downstream analysis. Kaiju (81) and PhyloPhlAn (97) were used to assign taxonomy to the MAGs. The average nucleotide identity (ANI) of MAGs to reference genomes, which were downloaded from RefSeq (98), was calculated using FastANI (64). Putatively novel MAGs were assigned as potentially new species using the ANI threshold described previously (78). The phenotypes of MAGs were predicted using Traitar (63). MAGs were annotated using Prokka (99).

PLS-DA analyses.

Partial least-squares discriminant analysis (PLS-DA) plots were generated using the KODAMA R package (v1.5) (100). Default parameters of the KODAMA software were used on species from the taxonomic profile with the semisupervised constraining of data ordination according to the fermentation process of samples. The final visualization of data was performed in R (v3.5.1) using ggplot2 (v3.1.1) (101).

PHAGC screening.

Shotgun sequences for 16 nonfermented dairy samples were downloaded from ENA (study accession number PRJEB31110) with a median of 18,041 reads per sample, after removing Bos taurus reads. The 16 dairy samples were as follows: raw tanker milk, n = 2; skimmed milk powder, n = 6; pasteurized skimmed milk, n = 4; and raw silo whole milk, n = 4. The fermented- and nonfermented-food sequences were then assigned UniRef clusters (90) using the HUMAnN2 software (93). Using the UniRef clusters obtained from HUMAnN2 output, the presence or absence of clusters shown to influence the potential health-promoting properties of bacteria was determined (13, 60, 102). The list of search terms can be found in Data Set S1, sheet 7. The total numbers of PHAGCs present in each food were binned into one of the following three categories: survival, modulation, and colonization. A heatmap was created using Pheatmap (103). The rows of the heatmap were scaled, so that the values are comparative between the foods and not an absolute count of the numbers of gene clusters found in each food.

Data availability.

All raw reads can be accessed from the ENA under project accession number PRJEB35321.

ACKNOWLEDGMENTS

We thank the fermented-food producers for kindly supplying samples.

This research was conducted with the financial support of Science Foundation Ireland (SFI) under grant SFI/12/RC/2273.

P.D.C. conceived the study idea and design. J.L. and G.M. collected samples and extracted DNA. F.C. and L.F. conducted sequencing. J.L., R.C.-R., A.M.W., C.J.W., and W.B. conducted bioinformatics analysis. P.D.C. and J.L. wrote the manuscript with contributions from everybody else. O.O., M.J.C., and P.D.C. supervised the project.

We declare that we have have no competing interests.

REFERENCES

  • 1.Bourdichon F, Casaregola S, Farrokh C, Frisvad JC, Gerds ML, Hammes WP, Harnett J, Huys G, Laulund S, Ouwehand A, Powell IB, Prajapati JB, Seto Y, Ter Schure E, Van Boven A, Vankerckhoven V, Zgoda A, Tuijtelaars S, Hansen EB. 2012. Food fermentations: microorganisms with technological beneficial use. Int J Food Microbiol 154:87–97. doi: 10.1016/j.ijfoodmicro.2011.12.030. [DOI] [PubMed] [Google Scholar]
  • 2.Marco ML, Heeney D, Binda S, Cifelli CJ, Cotter PD, Foligné B, Gänzle M, Kort R, Pasin G, Pihlanto A, Smid EJ, Hutkins R. 2017. Health benefits of fermented foods: microbiota and beyond. Curr Opin Biotechnol 44:94–102. doi: 10.1016/j.copbio.2016.11.010. [DOI] [PubMed] [Google Scholar]
  • 3.Nielsen ES, Garnås E, Jensen KJ, Hansen L, Olsen PS, Ritz C, Krych L, Nielsen DSS. 2018. Lacto-fermented sauerkraut improves symptoms in IBS patients independent of product pasteurization: a pilot study. Food Funct 9:5323–5335. doi: 10.1039/C8FO00968F. [DOI] [PubMed] [Google Scholar]
  • 4.Gahruie HH, Eskandari MH, Mesbahi G, Hanifpour MA. 2015. Scientific and technical aspects of yogurt fortification: a review. Food Sci Human Wellness 4:1–8. [. doi: 10.1016/j.fshw.2015.03.002.]. [DOI] [Google Scholar]
  • 5.Villarreal‐Soto SA, Beaufort S, Bouajila J, Souchard JP, Taillandier P. 2018. Understanding kombucha tea fermentation: a review. J Food Sci 83:580–588. doi: 10.1111/1750-3841.14068. [DOI] [PubMed] [Google Scholar]
  • 6.Jayabalan R, Malbaša RV, Sathishkumar M. 2017. Kombucha tea: metabolites, p 965–978. In Fungal metabolites. Springer, New York, NY. [Google Scholar]
  • 7.Hong K-J, Lee C-H, Kim SW. 2004. Aspergillus oryzae GB-107 fermentation improves nutritional quality of food soybeans and feed soybean meals. J Medicinal Food 7:430–435. doi: 10.1089/jmf.2004.7.430. [DOI] [PubMed] [Google Scholar]
  • 8.Tamang JP, Tamang N, Thapa S, Dewan S, Tamang B, Yonzan H, Rai AK, Chettri R, Chakrabarty J, Kharel N. 2012. Microorganisms and nutritional value of ethnic fermented foods and alcoholic beverages of North East India. NISCAIR Online Periodicals http://nopr.niscair.res.in/handle/123456789/13415.
  • 9.Hertzler SR, Clancy SM. 2003. Kefir improves lactose digestion and tolerance in adults with lactose maldigestion. J Am Diet Assoc 103:582–587. doi: 10.1053/jada.2003.50111. [DOI] [PubMed] [Google Scholar]
  • 10.Xiao Y, Wang L, Rui X, Li W, Chen X, Jiang M, Dong M. 2015. Enhancement of the antioxidant capacity of soy whey by fermentation with Lactobacillus plantarum B1-6. J Funct Foods 12:33–44. [. doi: 10.1016/j.jff.2014.10.033.]. [DOI] [Google Scholar]
  • 11.Shin D, Jeong D. 2015. Korean traditional fermented soybean products: Jang. J Ethnic Foods 2:2–7. [. doi: 10.1016/j.jef.2015.02.002.]. [DOI] [Google Scholar]
  • 12.Slattery C, Cotter PD, O’Toole PW. 2019. Analysis of health benefits conferred by Lactobacillus species from kefir. Nutrients 11:1252. doi: 10.3390/nu11061252. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Peters A, Krumbholz P, Jäger E, Heintz-Buschart A, Çakir MV, Rothemund S, Gaudl A, Ceglarek U, Schöneberg T, Stäubert C. 2019. Metabolites of lactic acid bacteria present in fermented foods are highly potent agonists of human hydroxycarboxylic acid receptor 3. PLoS Genet 15:e1008145. doi: 10.1371/journal.pgen.1008145. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Sonnenburg ED, Sonnenburg JL. 2019. The ancestral and industrialized gut microbiota and implications for human health. Nat Rev Microbiol 17:383–390. doi: 10.1038/s41579-019-0191-8. [DOI] [PubMed] [Google Scholar]
  • 15.Sonnenburg JL, Sonnenburg ED. 2019. Vulnerability of the industrialized microbiota. Science 366:eaaw9255. doi: 10.1126/science.aaw9255. [DOI] [PubMed] [Google Scholar]
  • 16.Chilton S, Burton J, Reid G. 2015. Inclusion of fermented foods in food guides around the world. Nutrients 7:390–404. doi: 10.3390/nu7010390. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Wolfe BE, Dutton RJ. 2015. Fermented foods as experimentally tractable microbial ecosystems. Cell 161:49–55. doi: 10.1016/j.cell.2015.02.034. [DOI] [PubMed] [Google Scholar]
  • 18.Wuyts S, Van Beeck W, Oerlemans EF, Wittouck S, Claes IJ, De Boeck I, Weckx S, Lievens B, De Vuyst L, Lebeer S. 2018. Carrot juice fermentations as man-made microbial ecosystems dominated by lactic acid bacteria. Appl Environ Microbiol 84:e00134-18. doi: 10.1128/AEM.00134-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Knight S, Klaere S, Morrison‐Whittle P, Goddard MR. 2018. Fungal diversity during fermentation correlates with thiol concentration in wine. Austral J Grape Wine Res 24:105–112. doi: 10.1111/ajgw.12304. [DOI] [Google Scholar]
  • 20.Liu X, Kuda T, Takahashi H, Kimura B. 2018. Bacterial and fungal microbiota of spontaneously fermented Chinese products, Rubing milk cake and Yan-Cai vegetable pickles. Food Microbiol 72:106–111. doi: 10.1016/j.fm.2017.11.014. [DOI] [PubMed] [Google Scholar]
  • 21.Du R, Ge J, Zhao D, Sun J, Ping W, Song G. 2018. Bacterial diversity and community structure during fermentation of Chinese sauerkraut with Lactobacillus casei 11MZ-5-1 by Illumina MiSeq sequencing. Lett Appl Microbiol 66:55–62. doi: 10.1111/lam.12824. [DOI] [PubMed] [Google Scholar]
  • 22.De Filippis F, Troise AD, Vitaglione P, Ercolini D. 2018. Different temperatures select distinctive acetic acid bacteria species and promotes organic acids production during Kombucha tea fermentation. Food Microbiol 73:11–16. doi: 10.1016/j.fm.2018.01.008. [DOI] [PubMed] [Google Scholar]
  • 23.Coton M, Pawtowski A, Taminiau B, Burgaud G, Deniel F, Coulloumme-Labarthe L, Fall A, Daube G, Coton E. 2017. Unraveling microbial ecology of industrial-scale Kombucha fermentations by metabarcoding and culture-based methods. FEMS Microbiol Ecol 93. [. doi: 10.1093/femsec/fix048.]. [DOI] [PubMed] [Google Scholar]
  • 24.Lee M, Song JH, Jung MY, Lee SH, Chang JY. 2017. Large-scale targeted metagenomics analysis of bacterial ecological changes in 88 kimchi samples during fermentation. Food Microbiol 66:173–183. doi: 10.1016/j.fm.2017.05.002. [DOI] [PubMed] [Google Scholar]
  • 25.Marsh AJ, O’Sullivan O, Hill C, Ross RP, Cotter PD. 2013. Sequencing-based analysis of the bacterial and fungal composition of kefir grains and milks from multiple sources. PLoS One 8:e69371. doi: 10.1371/journal.pone.0069371. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Marsh AJ, O’Sullivan O, Hill C, Ross RP, Cotter PD. 2014. Sequence-based analysis of the bacterial and fungal compositions of multiple kombucha (tea fungus) samples. Food Microbiol 38:171–178. doi: 10.1016/j.fm.2013.09.003. [DOI] [PubMed] [Google Scholar]
  • 27.Marsh AJ, O’Sullivan O, Hill C, Ross RP, Cotter PD. 2013. Sequence-based analysis of the microbial composition of water kefir from multiple sources. FEMS Microbiol Lett 348:79–85. doi: 10.1111/1574-6968.12248. [DOI] [PubMed] [Google Scholar]
  • 28.Dertli E, Çon AH. 2017. Microbial diversity of traditional kefir grains and their role on kefir aroma. LWT Food Sci Technol 85:151–157. doi: 10.1016/j.lwt.2017.07.017. [DOI] [Google Scholar]
  • 29.Zalewska B, Kaevska M, Slana I. 2018. Sequence analysis of changes in microbial composition in different milk products during fermentation and storage. Curr Microbiol 75:202–205. doi: 10.1007/s00284-017-1366-z. [DOI] [PubMed] [Google Scholar]
  • 30.Zhong Z, Hou Q, Kwok L, Yu Z, Zheng Y, Sun Z, Menghe B, Zhang H. 2016. Bacterial microbiota compositions of naturally fermented milk are shaped by both geographic origin and sample type. J Dairy Sci 99:7832–7841. doi: 10.3168/jds.2015-10825. [DOI] [PubMed] [Google Scholar]
  • 31.Lee M, Song JH, Jung MY, Lee SH, Chang JY. 2017. Large-scale targeted metagenomics analysis of bacterial ecological changes in 88 kimchi samples during fermentation. Food Microbiol 66:173–183. doi: 10.1016/j.fm.2017.05.002. [DOI] [PubMed] [Google Scholar]
  • 32.Lee SH, Jung JY, Jeon CO. 2015. Source tracking and succession of kimchi lactic acid bacteria during fermentation. J Food Sci 80:M1871–M1877. doi: 10.1111/1750-3841.12948. [DOI] [PubMed] [Google Scholar]
  • 33.Kim MJ, Kwak HS, Kim SS. 2018. Effects of salinity on bacterial communities, Maillard reactions, isoflavone composition, antioxidation, and antiproliferation in Korean fermented soybean paste (doenjang). Food Chem 245:402–409. doi: 10.1016/j.foodchem.2017.10.116. [DOI] [PubMed] [Google Scholar]
  • 34.Medina E, Brenes M, García-García P, Romero C, de Castro A. 2018. Microbial ecology along the processing of Spanish olives darkened by oxidation. Food Control 86:35–41. doi: 10.1016/j.foodcont.2017.10.035. [DOI] [Google Scholar]
  • 35.Wang Z-M, Lu Z-M, Shi J-S, Xu Z-H. 2016. Exploring flavour-producing core microbiota in multispecies solid-state fermentation of traditional Chinese vinegar. Sci Rep 6:26818. doi: 10.1038/srep26818. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Diaz M, Kellingray L, Akinyemi N, Adefiranye OO, Olaonipekun AB, Bayili GR, Ibezim J, Du Plessis AS, Houngbédji M, Kamya D, Mukisa IM, Mulaw G, Manthi Josiah S, Onyango Chienjo W, Atter A, Agbemafle E, Annan T, Bernice Ackah N, Buys EM, Joseph Hounhouigan D, Muyanja C, Nakavuma J, Odeny DA, Sawadogo-Lingani H, Tesfaye Tefera A, Amoa-Awua W, Obodai M, Mayer MJ, Oguntoyinbo FA, Narbad A. 2019. Comparison of the microbial composition of African fermented foods using amplicon sequencing. Sci Rep 9:1–8. doi: 10.1038/s41598-019-50190-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Wu L-H, Lu Z-M, Zhang X-J, Wang Z-M, Yu Y-J, Shi J-S, Xu Z-H. 2017. Metagenomics reveals flavour metabolic network of cereal vinegar microbiota. Food Microbiol 62:23–31. doi: 10.1016/j.fm.2016.09.010. [DOI] [PubMed] [Google Scholar]
  • 38.Lu Z-M, Liu N, Wang L-J, Wu L-H, Gong J-S, Yu Y-J, Li G-Q, Shi J-S, Xu Z-H. 2016. Elucidating and regulating the acetoin-producing role of microbial functional group in multispecies acetic acid fermentation. Appl Environ Microbiol 82:5860–5868. doi: 10.1128/AEM.01331-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Zepeda MM, Edwards N, Madsen M, Abel-Kistrup M, Puetz L, Sicheritz-Ponten T, Swiegers J. 2017. Influence of Oenococcus oeni and Brettanomyces bruxellensis on aged wine microbial taxonomic and functional profiles. bioRxiv doi: 10.1101/216390v1.full. [DOI]
  • 40.Illeghems K, Weckx S, De Vuyst L. 2015. Applying meta-pathway analyses through metagenomics to identify the functional properties of the major bacterial communities of a single spontaneous cocoa bean fermentation process sample. Food Microbiol 50:54–63. doi: 10.1016/j.fm.2015.03.005. [DOI] [PubMed] [Google Scholar]
  • 41.Lyu C, Chen C, Ge F, Liu D, Zhao S, Chen D. 2013. A preliminary metagenomic study of puer tea during pile fermentation. J Sci Food Agric 93:3165–3174. doi: 10.1002/jsfa.6149. [DOI] [PubMed] [Google Scholar]
  • 42.Sulaiman J, Gan HM, Yin W-F, Chan K-G. 2014. Microbial succession and the functional potential during the fermentation of Chinese soy sauce brine. Front Microbiol 5:556. doi: 10.3389/fmicb.2014.00556. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Ferrocino I, Bellio A, Giordano M, Macori G, Romano A, Rantsiou K, Decastelli L, Cocolin L. 2017. Shotgun metagenomics and volatilome profile of the microbiota of fermented sausages. Appl Environ Microbiol 84:e02120-17. doi: 10.1128/AEM.02120-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Jung JY, Lee SH, Kim JM, Park MS, Bae J-W, Hahn Y, Madsen EL, Jeon CO. 2011. Metagenomic analysis of kimchi, the Korean traditional fermented food. Appl Environ Microbiol 77:2264–2274. doi: 10.1128/AEM.02157-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Yao G, Yu J, Hou Q, Hui W, Liu W, Kwok L-Y, Menghe B, Sun T, Zhang H, Zhang W. 2017. A perspective study of koumiss microbiome by metagenomics analysis based on single-cell amplification technique. Front Microbiol 8:165. doi: 10.3389/fmicb.2017.00165. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Walsh AM, Crispie F, Daari K, O’Sullivan O, Martin JC, Arthur CT, Claesson MJ, Scott KP, Cotter PD. 2017. Strain-level metagenomic analysis of the fermented dairy beverage Nunu highlights potential food safety risks. Appl Environ Microbiol 83:e01144-17. doi: 10.1038/s43016-020-0129-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Nalbantoglu U, Cakar A, Dogan H, Abaci N, Ustek D, Sayood K, Can H. 2014. Metagenomic analysis of the microbial community in kefir grains. Food Microbiol 41:42–51. doi: 10.1016/j.fm.2014.01.014. [DOI] [PubMed] [Google Scholar]
  • 48.Wolfe BE, Button JE, Santarelli M, Dutton RJ. 2014. Cheese rind communities provide tractable systems for in situ and in vitro studies of microbial diversity. Cell 158:422–433. doi: 10.1016/j.cell.2014.05.041. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Verce M, De Vuyst L, Weckx S. 2019. Shotgun metagenomics of a water kefir fermentation ecosystem reveals a novel Oenococcus species. Front Microbiol 10:479. doi: 10.3389/fmicb.2019.00479. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Walsh AM, Macori G, Kilcawley KN, Cotter PD. 2020. Meta-analysis of cheese microbiomes highlights contributions to multiple aspects of quality. Nat Food 1:500–510. doi: 10.1038/s43016-020-0129-3. [DOI] [PubMed] [Google Scholar]
  • 51.Parkouda C, Nielsen DS, Azokpota P, Ouoba LII, Amoa-Awua WK, Thorsen L, Hounhouigan JD, Jensen JS, Tano-Debrah K, Diawara B, Jakobsen M. 2009. The microbiology of alkaline-fermentation of indigenous seeds used as food condiments in Africa and Asia. Crit Rev Microbiol 35:139–156. doi: 10.1080/10408410902793056. [DOI] [PubMed] [Google Scholar]
  • 52.Miller ER, Kearns PJ, Niccum BA, Schwartz JOM, Ornstein A, Wolfe BE. 2019. Establishment limitation constrains the abundance of lactic acid bacteria in the Napa cabbage phyllosphere. Appl Environ Microbiol 85:e00269-19. doi: 10.1128/AEM.00269-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Tu M-Y, Chen H-L, Tung Y-T, Kao C-C, Hu F-C, Chen C-M. 2015. Short-term effects of kefir-fermented milk consumption on bone mineral density and bone metabolism in a randomized clinical trial of osteoporotic patients. PLoS One 10:e0144231. doi: 10.1371/journal.pone.0144231. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Walsh AM, Crispie F, Kilcawley K, O’Sullivan O, O’Sullivan MG, Claesson MJ, Cotter PD. 2016. Microbial succession and flavor production in the fermented dairy beverage kefir. mSystems 1:e00052-16. doi: 10.1128/mSystems.00052-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Wood DE, Salzberg SL. 2014. Kraken: ultrafast metagenomic sequence classification using exact alignments. Genome Biol 15:R46. doi: 10.1186/gb-2014-15-3-r46. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Xing D, Ren N, Li Q, Lin M, Wang A, Zhao L. 2006. Ethanoligenens harbinense gen. nov., sp. nov., isolated from molasses wastewater. Int J Syst Evol Microbiol 56:755–760. doi: 10.1099/ijs.0.63926-0. [DOI] [PubMed] [Google Scholar]
  • 57.Bahiru B, Mehari T, Ashenafi M. 2006. Yeast and lactic acid flora of Tej, an indigenous Ethiopian honey wine: variations within and between production units. Food Microbiol 23:277–282. doi: 10.1016/j.fm.2005.05.007. [DOI] [PubMed] [Google Scholar]
  • 58.Mendes-Ferreira A, Cosme F, Barbosa C, Falco V, Inês A, Mendes-Faia A. 2010. Optimization of honey-must preparation and alcoholic fermentation by Saccharomyces cerevisiae for mead production. Int J Food Microbiol 144:193–198. doi: 10.1016/j.ijfoodmicro.2010.09.016. [DOI] [PubMed] [Google Scholar]
  • 59.Turnbaugh PJ, Ley RE, Hamady M, Fraser-Liggett CM, Knight R, Gordon JI. 2007. The human microbiome project. Nature 449:804–810. doi: 10.1038/nature06244. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Lebeer S, Vanderleyden J, De Keersmaecker SC. 2008. Genes and molecules of lactobacilli supporting probiotic action. Microbiol Mol Biol Rev 72:728–764. doi: 10.1128/MMBR.00017-08. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Hayakawa K, Kimura M, Kasaha K, Matsumoto K, Sansawa H, Yamori Y. 2004. Effect of a γ-aminobutyric acid-enriched dairy product on the blood pressure of spontaneously hypertensive and normotensive Wistar-Kyoto rats. Br J Nutr 92:411–417. doi: 10.1079/BJN20041221. [DOI] [PubMed] [Google Scholar]
  • 62.Mombereau C, Kaupmann K, Froestl W, Sansig G, van der Putten H, Cryan JF. 2004. Genetic and pharmacological evidence of a role for GABA B receptors in the modulation of anxiety-and antidepressant-like behavior. Neuropsychopharmacology 29:1050–1062. doi: 10.1038/sj.npp.1300413. [DOI] [PubMed] [Google Scholar]
  • 63.Weimann A, Mooren K, Frank J, Pope PB, Bremges A, McHardy AC. 2016. From genomes to phenotypes: Traitar, the microbial trait analyzer. mSystems 1:e00101-16. doi: 10.1128/mSystems.00101-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Jain C, Rodriguez-R LM, Phillippy AM, Konstantinidis KT, Aluru S. 2018. High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries. Nat Commun 9:5114. doi: 10.1038/s41467-018-07641-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Nair BM, Prajapati JB. 2003. The history of fermented foods, p 17–42. In Handbook of fermented functional foods. CRC Press, Boca Raton, FL. [Google Scholar]
  • 66.Persistence Market Research. 2017. Global market study on fermented ingredients: pharmaceutical applications of fermented ingredients to gain traction during 2017–2022. Persistence Market Research, New York, NY: https://www.prnewswire.com/news-releases/global-market-study-on-fermented-ingredients-pharmaceutical-applications-of-fermented-ingredients-to-gain-traction-during-2017-2022-300532114.html. [Google Scholar]
  • 67.Cotter PD, Hill C, Ross RP. 2005. Food microbiology: bacteriocins: developing innate immunity for food. Nat Rev Microbiol 3:777–788. doi: 10.1038/nrmicro1273. [DOI] [PubMed] [Google Scholar]
  • 68.Dobson A, Cotter PD, Ross RP, Hill C. 2012. Bacteriocin production: a probiotic trait? Appl Environ Microbiol 78:1–6. doi: 10.1128/AEM.05576-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Halami PM, Ramesh A, Chandrashekar A. 2005. Fermenting cucumber, a potential source for the isolation of pediocin-like bacteriocin producers. World J Microbiol Biotechnol 21:1351–1358. doi: 10.1007/s11274-005-4858-0. [DOI] [Google Scholar]
  • 70.Papagianni M, Anastasiadou S. 2009. Pediocins: the bacteriocins of pediococci—sources, production, properties, and applications. Microb Cell Fact 8:3. doi: 10.1186/1475-2859-8-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Herbin S, Mathieu F, Brulé F, Branlant C, Lefebvre G, Lebrihi A. 1997. Characteristics and genetic determinants of bacteriocin activities produced by Carnobacterium piscicola CP5 isolated from cheese. Curr Microbiol 35:319–326. doi: 10.1007/s002849900262. [DOI] [PubMed] [Google Scholar]
  • 72.Lončar E, Djurić M, Malbaša R, Kolarov L, Klašnja M. 2006. Influence of working conditions upon kombucha conducted fermentation of black tea. Food Bioprod Process 84:186–192. doi: 10.1205/fbp.04306. [DOI] [Google Scholar]
  • 73.Guzel-Seydim Z, Seydim A, Greene A. 2000. Organic acids and volatile flavor components evolved during refrigerated storage of kefir. J Dairy Sci 83:275–277. doi: 10.3168/jds.S0022-0302(00)74874-0. [DOI] [PubMed] [Google Scholar]
  • 74.Jacobson LH, Vlachou S, Slattery DA, Li X, Cryan JF. 2018. The gamma-aminobutyric acid B receptor in depression and reward. Biol Psychiatry 83:963–976. doi: 10.1016/j.biopsych.2018.02.006. [DOI] [PubMed] [Google Scholar]
  • 75.Cotter PD, Hill C. 2003. Surviving the acid test: responses of gram-positive bacteria to low pH. Microbiol Mol Biol Rev 67:429–453. doi: 10.1128/mmbr.67.3.429-453.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Le Flèche-Matéos A, Levast M, Lomprez F, Arnoux Y, Andonian C, Perraud M, Vincent V, Ar Gouilh M, Thiberge J-M, Vandenbogaert M, Diancourt L, Caro V, Burguière AM, Manuguerra J-C. 2015. Rouxiella chamberiensis gen. nov., sp. nov., a new Enterobacteriaceae isolated from parenteral nutrition bags. Int J Syst Evol Microbiol 65:1812–1818. doi: 10.1099/ijs.0.000179. [DOI] [PubMed] [Google Scholar]
  • 77.Hiraishi A, Matsuzawa Y, Kanbe T, Wakao N. 2000. Acidisphaera rubrifaciens gen. nov., sp. nov., an aerobic bacteriochlorophyll-containing bacterium isolated from acidic environments. Int J Syst Evol Microbiol 50:1539–1546. doi: 10.1099/00207713-50-4-1539. [DOI] [PubMed] [Google Scholar]
  • 78.Pasolli E, Asnicar F, Manara S, Zolfo M, Karcher N, Armanini F, Beghini F, Manghi P, Tett A, Ghensi P, Collado MC, Rice BL, DuLong C, Morgan XC, Golden CD, Quince C, Huttenhower C, Segata N. 2019. Extensive unexplored human microbiome diversity revealed by over 150,000 genomes from metagenomes spanning age, geography, and lifestyle. Cell 176:649–662. doi: 10.1016/j.cell.2019.01.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Clooney AG, Fouhy F, Sleator RD, O’ Driscoll A, Stanton C, Cotter PD, Claesson MJ. 2016. Comparing apples and oranges?: next generation sequencing and its impact on microbiome analysis. PLoS One 11:e0148028. doi: 10.1371/journal.pone.0148028. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Peng Y, Leung HC, Yiu S-M, Chin FY. 2012. IDBA-UD: a de novo assembler for single-cell and metagenomic sequencing data with highly uneven depth. Bioinformatics 28:1420–1428. doi: 10.1093/bioinformatics/bts174. [DOI] [PubMed] [Google Scholar]
  • 81.Menzel P, Ng K, Krogh A. 2016. Fast and sensitive taxonomic classification for metagenomics with Kaiju. Nat Commun 7:11257. doi: 10.1038/ncomms11257. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Walsh AM, Crispie F, O’Sullivan O, Finnegan L, Claesson MJ, Cotter PD. 2018. Species classifier choice is a key consideration when analysing low-complexity food microbiome data. Microbiome 6:50. doi: 10.1186/s40168-018-0437-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Zheng J, Wittouck S, Salvetti E, Franz CMAP, Harris HMB, Mattarelli P, O’Toole PW, Pot B, Vandamme P, Walter J, Watanabe K, Wuyts S, Felis GE, Gänzle MG, Lebeer S. 2020. A taxonomic note on the genus Lactobacillus: description of 23 novel genera, emended description of the genus Lactobacillus Beijerinck 1901, and union of Lactobacillaceae and Leuconostocaceae. Int J Syst Evol Microbiol 70:2782–2858. doi: 10.1099/ijsem.0.004107. [DOI] [PubMed] [Google Scholar]
  • 84.Silva GGZ, Green KT, Dutilh BE, Edwards RA. 2016. SUPER-FOCUS: a tool for agile functional analysis of shotgun metagenomic data. Bioinformatics 32:354–361. doi: 10.1093/bioinformatics/btv584. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Asnicar F, Weingart G, Tickle TL, Huttenhower C, Segata N. 2015. Compact graphical representation of phylogenetic data and metadata with GraPhlAn. PeerJ 3:e1029. doi: 10.7717/peerj.1029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Truong DT, Tett A, Pasolli E, Huttenhower C, Segata N. 2017. Microbial strain-level population structure and genetic diversity from metagenomes. Genome Res 27:626–638. doi: 10.1101/gr.216242.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Truong DT, Franzosa EA, Tickle TL, Scholz M, Weingart G, Pasolli E, Tett A, Huttenhower C, Segata N. 2015. MetaPhlAn2 for enhanced metagenomic taxonomic profiling. Nat Methods 12:902–903. doi: 10.1038/nmeth.3589. [DOI] [PubMed] [Google Scholar]
  • 88.Team RC. 2014. R: a language and environment for statistical computing (version 3.1.2). R Foundation for Statistical Computing, Vienna, Austria: ISBN 3-900051-07-0 http://www.R-project.org. [Google Scholar]
  • 89.Oksanen J, Kindt R, Legendre P, O’Hara B, Stevens MHH, Oksanen MJ. 2007. Suggests M: the vegan package. Community Ecol Package 10:631–637. [Google Scholar]
  • 90.Segata N, Izard J, Waldron L, Gevers D, Miropolsky L, Garrett WS, Huttenhower C. 2011. Metagenomic biomarker discovery and explanation. Genome Biol 12:R60. doi: 10.1186/gb-2011-12-6-r60. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91.Lakin SM, Dean C, Noyes NR, Dettenwanger A, Ross AS, Doster E, Rovira P, Abdo Z, Jones KL, Ruiz J, Belk KE, Morley PS, Boucher C. 2017. MEGARes: an antimicrobial resistance database for high throughput sequencing. Nucleic Acids Res 45:D574–D580. doi: 10.1093/nar/gkw1009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.van Heel AJ, de Jong A, Song C, Viel JH, Kok J, Kuipers OP. 2018. BAGEL4: a user-friendly web server to thoroughly mine RiPPs and bacteriocins. Nucleic Acids Res 46:W278–W281. doi: 10.1093/nar/gky383. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.Abubucker S, Segata N, Goll J, Schubert A, Izard J, Cantarel B, Rodriguez-Mueller B, Zucker J, Thiagarajan M, Henrissat B. 2017. HUMAnN2: the HMP Unified Metabolic Analysis Network 2. https://huttenhower.sph.harvard.edu/humann.
  • 94.Suzek BE, Huang H, McGarvey P, Mazumder R, Wu CH. 2007. UniRef: comprehensive and nonredundant UniProt reference clusters. Bioinformatics 23:1282–1288. doi: 10.1093/bioinformatics/btm098. [DOI] [PubMed] [Google Scholar]
  • 95.Kang DD, Froula J, Egan R, Wang Z. 2015. MetaBAT, an efficient tool for accurately reconstructing single genomes from complex microbial communities. PeerJ 3:e1165. doi: 10.7717/peerj.1165. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96.Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. 2015. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res 25:1043–1055. doi: 10.1101/gr.186072.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Segata N, Börnigen D, Morgan XC, Huttenhower C. 2013. PhyloPhlAn is a new method for improved phylogenetic and taxonomic placement of microbes. Nat Commun 4:2304. doi: 10.1038/ncomms3304. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98.Pruitt KD, Tatusova T, Maglott DR. 2007. NCBI reference sequences (RefSeq): a curated nonredundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res 35:D61–D65. doi: 10.1093/nar/gkl842. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99.Seemann T. 2014. Prokka: rapid prokaryotic genome annotation. Bioinformatics 30:2068–2069. doi: 10.1093/bioinformatics/btu153. [DOI] [PubMed] [Google Scholar]
  • 100.Cacciatore S, Tenori L, Luchinat C, Bennett PR, MacIntyre DA. 2017. KODAMA: an R package for knowledge discovery and data mining. Bioinformatics :621–623. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 101.Wickham H. 2016. ggplot2: elegant graphics for data analysis. Springer, New York, NY. [Google Scholar]
  • 102.Abriouel H, Pérez Montoro B, Casimiro-Soriguer CS, Pérez Pulido AJ, Knapp CW, Caballero Gómez N, Castillo-Gutiérrez S, Estudillo-Martínez MD, Gálvez A, Benomar N. 2017. Insight into potential probiotic markers predicted in Lactobacillus pentosus MP-10 genome sequence. Front Microbiol 8:891. doi: 10.3389/fmicb.2017.00891. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 103.Kolde R, Kolde MR. 2015. Package ‘pheatmap’. R Package 1(7). https://CRAN.R-project.org/package=pheatmap. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

DATA SET S1

(Sheet 1) Species level relative abundance of 58 foods. (Sheet 2) Superfocus level 1 relative abundance of 58 foods. (Sheet 3) Superfocus level 2 relative abundance of 58 foods. (Sheet 4) Superfocus level 3 relative abundance of 58 foods. (Sheet 5) Bacteriocin profile (in gene counts) of 58 foods. (Sheet 6) Fermented-food groups used for statistical analyses including N number of samples in each group. (Sheet 7) The genes that were shown in previous studies to be necessary for some probiotic attribute. These genes were then searched for in the metagenomes in this study. (Sheet 8) ANOSIM results for all 50 profiles. (Sheet 9) Table showing foods included in this study and the recipes used to make them. Some of the recipes are directly from the producer. When this information was not available, recipes were created from other online resources. Download Data Set S1, XLSX file, 1.0 MB (1,000.2KB, xlsx) .

Copyright © 2020 Leech et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

FIG S1

NMDS plots of the Bray-Curtis distances of the species in 58 fermented foods colored by substrate (A), fermentation starter or spontaneous (B), country of origin (C), producer (D), and solid or liquid foods (E). Download FIG S1, TIF file, 2.3 MB (2.4MB, tif) .

Copyright © 2020 Leech et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

FIG S2

LDA plots reveal the attributes driving the differences between the food substrates at the species level. Download FIG S2, TIF file, 2.1 MB (2.1MB, tif) .

Copyright © 2020 Leech et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

FIG S3

LDA plots reveal the attributes driving the differences between the food substrates at the genus level. Download FIG S3, TIF file, 2.0 MB (2.1MB, tif) .

Copyright © 2020 Leech et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

FIG S4

LDA plots reveal the attributes driving the differences between the food substrates at family level. Download FIG S4, TIF file, 2.0 MB (2.1MB, tif) .

Copyright © 2020 Leech et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

FIG S5

LDA plots reveal the attributes driving the differences between the food substrates at the phylum level. Download FIG S5, TIF file, 2.0 MB (2.1MB, tif) .

Copyright © 2020 Leech et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

FIG S6

LDA plots reveal the attributes driving the differences between the food substrates at Superfocus level 1. Download FIG S6, TIF file, 2.0 MB (2.1MB, tif) .

Copyright © 2020 Leech et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

FIG S7

LDA plots reveal the attributes driving the differences between the food substrates at Superfocus level 2. Download FIG S7, TIF file, 2.0 MB (2.1MB, tif) .

Copyright © 2020 Leech et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

FIG S8

LDA plots reveal the attributes driving the differences between the food substrates at Superfocus level 3. Download FIG S8, TIF file, 2.0 MB (2.1MB, tif) .

Copyright © 2020 Leech et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

TEXT S1

LDA of differences between different food substrates. Download Text S1, DOCX file, 0.01 MB (14.2KB, docx) .

Copyright © 2020 Leech et al.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

Data Availability Statement

All raw reads can be accessed from the ENA under project accession number PRJEB35321.


Articles from mSystems are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES