Abstract
Accurate diagnosis of the components of our food and a standard lexicon for clear communication is essential for regulating global food trade and identifying food frauds. Reliable identification of wild collected foods can be particularly difficult, especially when they originate in under-documented regions or belong to poorly known groups such as Fungi. Porcini, one of the most widely traded wild edible mushrooms in the world, are large and conspicuous and they are used as a food both on their own and in processed food products. China is a major exporter of porcini, most of it ending up in Europe. We used DNA-sequencing to identify three species of mushroom contained within a commercial packet of dried Chinese porcini purchased in London. Surprisingly, all three have never been formally described by science and required new scientific names. This demonstrates the ubiquity of unknown fungal diversity even in widely traded commercial food products from one of the most charismatic and least overlooked groups of mushrooms. Our rapid analysis and description makes it possible to reliably identify these species, allowing their harvest to be monitored and their presence tracked in the food chain.
Keywords: Coalescence, Contaminant, Biodiversity, Conservation, Global trade, Naming, Turbo-taxonomy, Phylogenetics, Species
Introduction
Kingdom Fungi is one of the most diverse groups of eukaryotes with estimates ranging from 500,000 to nearly 10 million species, yet they remain vastly underdocumented (Bass & Richards, 2011). Present rates of description, which add on average about 1,200 new species annually (Hibbett et al., 2011), are grossly inadequate for the task. Given that human society has derived tremendous benefit from the foods, medicines, and ecological services provided by as little as 1% of the fungi we know of, the impact of this missing diversity on human livelihoods is potentially profound. Importantly, this missing diversity is not just restricted to remote, underexplored regions of the world, but is a pervasive phenomenon where even our foods can harbor unknown species.
Although taxonomists regard new fungal taxa as commonplace, they are often of little apparent consequence to human society and largely go unnoticed by the public. Like all groups of organisms, our knowledge of fungal diversity is biased towards taxa of greatest concern to ourselves, such as edible fungi. For example, wild mushrooms collected and sold as food around the world generally belong to a handful of well-known taxa (e.g., truffles and chanterelles), most of which have long histories of use in European cuisine. However, even some of these well-known groups have been shown to contain underappreciated levels of diversity. One of these, porcini, has recently been shown to be far more diverse than previously thought (Dentinger et al., 2010; Feng et al., 2012), suggesting the potential for unknown species to end up in the international food supply chain. Although no porcini are known to be poisonous, food allergens have been reported from them (Torricelli, Johansson & Wütrich, 1997; Helbling et al., 2002; Castillo et al., 2013). Therefore, insufficient knowledge of the porcini species contained in food products could pose a health concern.
Porcini are estimated to have an annual worldwide consumption up to 100,000 metric tons (Hall et al., 1998). However, their harvest is restricted to wild foraging since, to date, their cultivation has failed. The high prices for this wild food foraged locally in Europe and North America has driven the market towards less costly sources, such as China (Sitta & Floriani, 2008). According to the official website of Yunnan Province (www.yunnan.cn), the major exporter of wild mushrooms in China, locally-sourced porcini have been exported to Europe since 1973, and mushrooms of Chinese origin now account for approximately half of all dried porcini in Italy (Sitta & Floriani, 2008). The Chinese species of porcini have been shown previously to be more closely related to European Boletus aereus than they are to the core commercial species, B. edulis, with which they last shared a common ancestor up to ∼56 million years ago (Dentinger et al., 2010; Feng et al., 2012).
Given what was previously known about the commercial porcini originating in China, we hypothesized that the contents of a commercially available packet of porcini in the UK would contain multiple species. We set out to rapidly diagnose these species using molecular-based ‘turbo-taxonomy’ (Butcher et al., 2012) that employs a combination of modern tools and approaches. Our results show that, with a combination of phylogenetic taxonomy and e-published nomenclature, three previously unnamed species of porcini could be quickly recognized and formally named from a single packet sold in a London grocer.
Material and Methods
A packet of dried porcini was purchased from a grocer in southwest greater London in October 2013. Fifteen pieces of mushroom were removed arbitrarily from the packet and DNA was extracted using the Sigma Extract-N-Amp kit. The full ITS region of the nrDNA was PCR-amplified using primers ITS1F and ITS4 (White et al., 1990; Gardes & Bruns, 1993). Successful amplicons were purified using ExoSAP-IT (USB, Cleveland, OH) and sequenced bidirectionally using BigDye3.1 with an ABI 3730 (Applied Biosystems, Foster City, CA). Complementary unidirectional reads were aligned and edited using Sequencher 4.2 (GeneCodes, Ann Arbor, MI).
New sequences were combined with 22 related sequences identified using a combination of BLAST searches and the corresponding top hits’ putative species clades reported by Dentinger et al. (2010) and Feng et al. (2012). These related sequences were downloaded from GenBank and correspond to “Boletus sp. nov. 2” (EU231965, EU231966; Dentinger et al., 2010), “Boletus sp. nov. 6” (JN563907, JN563908, JN563909, JN563911, JN563912, JN563913, JN563917; Feng et al., 2012), “Boletus sp. nov. 3” (EU231964; Dentinger et al., 2010), “Boletus sp. nov. 7” (JQ172782, JQ172783, JN563901, JN563902, JN563903, JN563904, JN563905, JN563906; Feng et al., 2012), and “Boletus sp. nov. 5” (JQ563914, JQ563915, JQ563916, JQ563918, JQ563919; Feng et al., 2012). A total of 38 ingroup sequences and one outgroup sequence (Boletus aereus, UDB000940) were aligned using MUSCLE (Edgar, 2004) in SeaView v4.4.0 (Galtier, Gouy & Gautier, 1996) and the terminal gaps converted to missing data. The final matrix consisted of 802 aligned positions, of which 742 were constant and 26 were parsimony uninformative (34 autapomorphic). Minimum and maximum intra- and inter-specific uncorrected “p” distances were calculated using PAUP∗v4.0 (Swofford, 2002). A maximum likelihood tree was generated under a GTR + G substitution model using the Pthreads parallelized version of RAxML v7.0.3 (Stamatakis, 2006; Ott et al., 2007) with nonparametric rapid bootstrapping set to automatically terminate with the ‘autoMRE’ function. A GMYC analysis using the single method (Pons et al., 2006; Fujisawa & Barraclough, 2013) was conducted with the ‘splits’ package (v1.0-18) in R version 2.15.0 (R Core Team, 2014) on an ultrametric tree generated using BEAST v1.8.0 (Drummond et al., 2012). The BEAST analysis applied a rate-smoothing algorithm using an uncorrelated lognormal relaxed clock model (Drummond et al., 2006), the GTR+G substitution model, speciation under a Yule process, the ‘ucld.mean’ prior set to a gamma distribution with a shape of .001 and a scale of 1,000 with all other priors set to default values, and 10 million generations sampling every 1,000 generations. An ultrametric starting tree was provided using the best ML tree from RAxML with branches transformed using non-parametric rate smoothing in TreeEdit v1.0a10. The perl script Burntrees (Nylander JAA, http://www.abc.se/~nylander/burntrees/burntrees.html) was used to sample every 98 trees from the stationary posterior distribution in the BEAST analysis after the first 250 were discarded as the burn-in. These 100 trees were imported for Bayesian GMYC (bGMYC) analysis in R (Reid & Carstens, 2012). Twenty-six GMYC models were evaluated within the 95% confidence and significant clusters were described as new taxa using the ‘turbo-taxonomy’ approach (Butcher et al., 2012), facilitated by the rapid e-publishing tool available through Index Fungorum (www.indexfungorum.org). Voucher material was deposited in the fungarium at the Royal Botanic Gardens, Kew (K) and all sequences were submitted to GenBank (KF815926–KF815937, KF854281, KF854282, KF854283).
Results and Discussion
The GMYC model with the greatest significant ML score included three ML clusters (1–10 clusters with 95% confidence) plus the root (4 ML entities; 2–23 with 95% confidence). GMYC supports for the three ML clusters were weak, low bGMYC posterior probabilities indicated a substantial level of phylogenetic uncertainty, while the maximum likelihood bootstraps supported reciprocal monophyly (79%, 76% and 100% for each cluster respectively; Fig. 1). Percent sequence similarity did not support distinction between any of the three species detected by GMYC and bootstrapping, where the minimum uncorrected pairwise distances between clades was greater than the maximum uncorrected pairwise distances within clades (Table 1). This result suggests that, while GMYC may be particularly sensitive to phylogenetic uncertainty as revealed by the low support values, for this dataset it performs better at diagnosing phylogenetic units than the commonly used percent similarity threshold of 97% (e.g., O’Brien et al., 2005). The phylogenetic uncertainty observed is almost certainly caused by a high ratio of parsimony uninformative variable sites (60 variable positions, 34 parsimony uninformative) to phylogenetically informative changes (26 positions). Of the informative characters, only 11 of them correspond to variable positions between the two closest taxa, B. bainiugan and B. meiweiniuganjun, with five sequences showing heterozygous bases at 6 positions (possibly due to incomplete lineage sorting) and only three of these corresponding to synapomorphic substitutions (Fig. 1). Five sequences contained autapomorphic substitutions in 18 positions, representing more than half of all parsimony uninformative characters, with up to 9 autapomorphies occurring in a single sequence (JN563917). These autapomorphies translate into longer terminal branch lengths relative to internal nodes, which reduces the distinction of within and between cluster branching patterns, a phenomenon that is known to affect GMYC supports (Fujisawa & Barraclough, 2013). These substitutions may indicate true variation in the ITS region, yet 94% come from sequences downloaded from GenBank, with only two sequences (JN563906, JN563917) contributing 83% of the autapomorphies. We suspect that, rather than true variation, these substitutions may instead be the result of sequencing and editing errors. Such errors can have large impacts on phylogenetic inference when the number of phylogenetically informative sites is small, such as in ITS sequences of recently diverged fungi, underscoring the importance of careful scrutiny during sequence preparation.
Table 1. Intra- and inter-specific uncorrected ITS barcode sequence distances of the three unnamed species discovered in a commercial packet of dried porcini.
B. bainiugan | B. meiweiniuganjun | B. shiyong | |
---|---|---|---|
Boletus bainuigan | 0–2.2 | ||
B. meiweiniuganjun | 0.4–3.3 | 0–1.5 | |
B. shiyong | 1.9–4.2 | 1.5–3.3 | 0–3.6 |
Three species could be identified based on corroboration of ML-supported reciprocal monophyly and GMYC clustering, and these corresponded to lineages previously reported in phylogenetic analyses (Dentinger et al., 2010; Feng et al., 2012; Sitta & Floriani, 2008), but none of which were formally named or described. Review of recent treatments of Chinese boletes also did not provide names for these taxa, which have been treated as a handful of species that occur in Europe and North America (Zang, 2006). New names were formally published on 12 October 2013 (see http://www.indexfungorum.org/Publications/Index%20Fungorum%20no.29.pdf for terse descriptions,1 voucher information, and GenBank accessions corresponding to these taxa). We hope that by naming these taxa and providing reference sequences for comparison, we will encourage mycologists with ready access to fresh collections of these species to record and document their characteristics and discover new features that may help to distinguish them.
Together with improvements in single-locus diagnosis leading to more robust inferences of evolutionary significant units (Butcher et al., 2012), rapid survey and diagnosis of vast communities of undescribed diversity is initiating a revolution in taxonomy (Riedel et al., 2013). This is particularly true for Fungi, which are hyperdiverse and largely cryptic, requiring indirect detection with environmental sequencing for documenting their true diversity (Taylor et al., 2014; Lücking et al., 2014). As a consequence, a vast quantity of fungal diversity is only known from DNA sequences, and these are accumulating in public databases at incredibly rapid rates (Hibbett et al., 2011). Although recent attempts to accelerate species description using short, unique DNA sequences ‘DNA barcoding’ (Hebert et al., 2003) and rapid, short description ‘turbo-taxonomy’ (Butcher et al., 2012) hold promise for meeting the enormous challenge of documenting hyperdiverse and largely unknown groups of organisms (Riedel et al., 2013), they still remain marginal to traditional methods for formal diagnosis of fungal diversity.
Turbo-taxonomy is an important improvement to efficiency in reconciling molecular diagnosis with a standard application of names that enable universal communication about biodiversity. Together, DNA sequence-based diagnosis and turbo-taxonomy catalyze description of new species, thereby greatly accelerating the rate at which diversity can be documented and recognized. Although descriptions based on features of organisms that are readily observed without specialized techniques are ideal, this is not always possible and descriptions based on features of DNA sequences could be automated to satisfy rules on naming. Automated pipelines that integrate analysis, taxonomy, and nomenclature will soon accelerate this revolution, enabling us to capture the most comprehensive baseline information on global organismal diversity possible. Given estimated rates of species extinction from 0.1 to 5% per year (Costello, May & Stork, 2013), and using recent estimates of global fungal diversity of ∼6 million species (Taylor et al., 2014), extinction rates may exceed description rates in Fungi by up to 5 times. An ‘integrative fast track’ approach (Riedel et al., 2013) offers the only tractable solution presently available to filling this knowledge gap. And as has been shown here with the three new species of porcini in a widely available commercial product, this knowledge gap can and does have direct impacts on our lives.
Conclusions
Our analysis of 15 pieces of dried porcini mushrooms from a single commercial packet showed three species corresponding to lineages that although previously reported in phylogenetic analyses have never been formally named or described until now. The recognition of these species enables them to be monitored in foods and facilitates countries’ adherence to international agreements on exploitation of wildlife, i.e., the Convention on Biological Diversity.
Supplemental Information
Acknowledgments
We are grateful to Rachel Mason Dentinger, who serendipitously supported this research through a spontaneous purchase of dried porcini for our dinner, and to Paul Kirk for nomenclatural advice and for facilitating the e-publication of the taxonomic treatments cited in this study. Meredith Oyen helped locate and translate the Chinese website. We are also grateful for the comments from the editor and four reviewers that improved this report.
Funding Statement
Funding was provided by the Royal Botanic Gardens, Kew. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Footnotes
The numbers reported in the original descriptions should be multiplied by 2.43 to achieve correct measurements of cells and spores.
Additional Information and Declarations
Competing Interests
Bryn T.M. Dentinger is an Academic Editor for PeerJ.
Author Contributions
Bryn T.M. Dentinger conceived and designed the experiments, performed the experiments, analyzed the data, contributed reagents/materials/analysis tools, wrote the paper, prepared figures and/or tables, reviewed drafts of the paper.
Laura M. Suz performed the experiments, wrote the paper, reviewed drafts of the paper.
DNA Deposition
References
- Bass & Richards (2011).Bass D, Richards TA. Three reasons to re-evaluate fungal diversity ‘on Earth and in the ocean’. Fungal Biology Reviews. 2011;25:159–164. doi: 10.1016/j.fbr.2011.10.003. [DOI] [Google Scholar]
- Butcher et al. (2012).Butcher BA, Smith MA, Sharkey MJ, Quicke DLJ. A turbo-taxonomic study of Thai Aleiodes (Aleiodes) and Aleiodes (Arcaleiodes) (hymenoptera: braconidae: rogadinae) based largely on COI barcoded specimens, with rapid descriptions of 179 new species. Zootaxa. 2012;3457:1–232. [Google Scholar]
- Castillo et al. (2013).Castillo C, Lara B, Cruz M-J, Muñoz X. Protein identification of two allergens of Boletus edulis causing occupational asthma. American Journal of Respiratory and Critical Care Medicine. 2013;187:1146–1148. doi: 10.1164/rccm.201211-2110LE. [DOI] [PubMed] [Google Scholar]
- Costello, May & Stork (2013).Costello MJ, May RM, Stork NE. Can we name Earth’s species before they go extinct? Science. 2013;339:413–416. doi: 10.1126/science.1230318. [DOI] [PubMed] [Google Scholar]
- Dentinger et al. (2010).Dentinger BTM, Ammirati JF, Both EE, Desjardin DE, Halling RE, Henkel TW, Moreau P-A, Nagasawa E, Soytong K, Taylor AF, Watling R, Moncalvo J-M, McLaughlin DJ. Molecular phylogenetics of porcini mushrooms (Boletus section Boletus) Molecular Phylogenetics and Evolution. 2010;57:1276–1292. doi: 10.1016/j.ympev.2010.10.004. [DOI] [PubMed] [Google Scholar]
- Drummond et al. (2006).Drummond AJ, Ho SYW, Phillips MJ, Rambaut A. Relaxed phylogenetics and dating with confidence. PLoS Biology. 2006;4:e570. doi: 10.1371/journal.pbio.0040088. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Drummond et al. (2012).Drummond AJ, Suchard MA, Xie D, Rambaut A. Bayesian phylogenetics with BEAUti and the BEAST 1.7. Molecular Biology and Evolution. 2012;29:1969–1973. doi: 10.1093/molbev/mss075. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Edgar (2004).Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Research. 2004;32:1792–1797. doi: 10.1093/nar/gkh340. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Feng et al. (2012).Feng B, Xu J, Wu G, Zeng N-K, Li Y-C, Tolgor B, Kost GW, Yang Z-L. DNA sequence analyses reveal abundant diversity, endemism and evidence for Asian origin of the porcini mushrooms. PLoS ONE. 2012;7:e570. doi: 10.1371/journal.pone.0037567. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fujisawa & Barraclough (2013).Fujisawa T, Barraclough TG. Delimiting species using single-locus data and the generalized mixed Yule coalescent approach: a revised method and evaluation on simulated data sets. Systematic Biology. 2013;62:707–724. doi: 10.1093/sysbio/syt033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Galtier, Gouy & Gautier (1996).Galtier N, Gouy M, Gautier C. SEAVIEW and PHYLO_WIN: two graphic tools for sequence alignment and molecular phylogeny. Computer Applications in the Biosciences. 1996;12:543–548. doi: 10.1093/bioinformatics/12.6.543. [DOI] [PubMed] [Google Scholar]
- Gardes & Bruns (1993).Gardes M, Bruns TD. ITS primers with enhanced specificity for basidiomycetes—application to the identification of mycorrhizae and rusts. Molecular Ecology. 1993;2:113–118. doi: 10.1111/j.1365-294X.1993.tb00005.x. [DOI] [PubMed] [Google Scholar]
- Hall et al. (1998).Hall IR, Lyon AJE, Wang Y, Sinclair L. Ectomycorrhizal fungi with edible fruiting bodies 2. Boletus edulis. Economic Botany. 1998;52:44–56. doi: 10.1007/BF02861294. [DOI] [Google Scholar]
- Hebert et al. (2003).Hebert PDN, Cywinska A, Ball SL, De Waard JR. Biological identifications through DNA barcodes. Proceedings of the Royal Society of London. Series B, Biological Sciences. 2003;270:313–321. doi: 10.1098/rspb.2002.2218. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Helbling et al. (2002).Helbling A, Bonadies N, Brander KA, Pichler WJ. Boletus edulis: a digestion-resistant allergen may be relevant for food allergy. Clinical & Experimental Allergy. 2002;32:771–775. doi: 10.1046/j.1365-2222.2002.01400.x. [DOI] [PubMed] [Google Scholar]
- Hibbett et al. (2011).Hibbett DS, Ohman A, Glotzer D, Nuhn M, Kirk PM, Nilsson RH. Progress in molecular and morphological taxon discovery in Fungi and options for formal classification of environmental sequences. Fungal Biology Reviews. 2011;39:147–182. [Google Scholar]
- Lücking et al. (2014).Lücking R, Dal-Forno M, Sikaroodi M, Gillevet PM, Bungartz F, Moncada B, Yánez-Ayabaca A, Chaves JL, Coca LF, Lawrey JD. A single macrolichen constitutes hundreds of unrecognized species. Proceedings of the National Academy of Sciences of the United States of America. 2014;111:11091–11096. doi: 10.1073/pnas.1403517111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Maddison & Maddison (2011).Maddison WP, Maddison DR. Mesquite: a modular system for evolutionary analysis. (Version 2.75) 2011 Available at http://mesquiteproject.org .
- O’Brien et al. (2005).O’Brien HE, Parrent JL, Jackson JA, Moncalvo J-M, Vilgalys R. Fungal community analysis by large-scale sequencing of environmental samples. Applied and Environmental Microbiology. 2005;71:5544–5550. doi: 10.1128/AEM.71.9.5544-5550.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ott et al. (2007).Ott M, Zola J, Aluru S, Stamatakis A. Large-scale maximum likelihood-based phylogenetic analysis on the IBM BlueGene/L. Proceedings of ACM/IEEE supercomputing conference; 2007. Article No. 4. [Google Scholar]
- Pons et al. (2006).Pons J, Barraclough TG, Gómez-Zurita J, Cardoso A, Duran DP, Hazell S, Kamoun S, Sumlin WD, Vogler AP. Sequence-based species delimitation for the DNA taxonomy of undescribed insects. Systematic Biology. 2006;55:595–609. doi: 10.1080/10635150600852011. [DOI] [PubMed] [Google Scholar]
- R Core Team (2014).R Core Team . R: A language and environment for statistical computing. Vienna: R Foundation for Statistical Computing; 2014. Available at http://www.R-project.org . [Google Scholar]
- Reid & Carstens (2012).Reid NM, Carstens BC. Phylogenetic estimation error can decrease the accuracy of species delimitation: a Bayesian implementation of the general mixed Yule-coalescent model. BMC Evolutionary Biology. 2012;12:196. doi: 10.1186/1471-2148-12-196. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Riedel et al. (2013).Riedel A, Sagata K, Suhardjono YR, Tänzler R, Balke M. Integrative taxonomy on the fast track—towards more sustainability in biodiversity research. Frontiers in Zoology. 2013;10:15. doi: 10.1186/1742-9994-10-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sitta & Floriani (2008).Sitta N, Floriani M. Nationalization and globalization trends in the wild mushroom commerce of Italy with emphasis on porcini (Boletus edulis and allied species) Economic Botany. 2008;62:307–322. doi: 10.1007/s12231-008-9037-4. [DOI] [Google Scholar]
- Stamatakis (2006).Stamatakis A. RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics. 2006;22:2688–2690. doi: 10.1093/bioinformatics/btl446. [DOI] [PubMed] [Google Scholar]
- Swofford (2002).Swofford DL. PAUP*: Phylogenetic Analysis using Parsimony (* and other methods) Version 4.0. Sunderland, MA: Sinauer Associates; 2002. [Google Scholar]
- Taylor et al. (2014).Taylor DL, Hollingsworth TN, McFarland JW, Lennon NJ, Nusbaum C, Ruess RW. A first comprehensive census of fungi in soil reveals both hyperdiversity and fine-scale niche partitioning. Ecological Monographs. 2014;84:3–20. doi: 10.1890/12-1693.1. [DOI] [Google Scholar]
- Torricelli, Johansson & Wütrich (1997).Torricelli R, Johansson SG, Wütrich B. Ingestive and inhalative allergy to the mushroom Boletus edulis. Allergy. 1997;52:747–751. doi: 10.1111/j.1398-9995.1997.tb01232.x. [DOI] [PubMed] [Google Scholar]
- White et al. (1990).White TJ, Bruns T, Lee S, Taylor J. Amplification and direct sequencing of fungal ribosomal RNA genes for phylogenetics. In: Innis MA, Gelfand DH, Shinsky JJ, White TJ, editors. PCR protocols: a guide to methods and applications. San Diego: Academic Press; 1990. [Google Scholar]
- Zang (2006).Zang M. Flora fungorum sinicorum: boletaceae (I) Beijing: Science Press; 2006. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.