Abstract
The nuclear ribosomal internal transcribed spacer (nrITS) region has been widely used in fungal diversity studies. Environmental metabarcoding has increased the importance of the fungal DNA barcode in documenting fungal diversity and distribution. The DNA barcode gap is seen as the difference between intra- and inter-specific pairwise distances in a DNA barcode. The current understanding of the barcode gap in macrofungi is limited, inhibiting the development of best practices in applying the nrITS region toward research on fungal diversity. This study examined the barcode gap using 5146 sequences representing 717 species of macrofungi from eleven genera, eight orders and two phyla in datasets assembled by taxonomic experts. Intra- and inter-specific pairwise distances were measured from sequence and phylogenetic data. The results demonstrate that barcode gaps are influenced by differences in intra- and inter-specific variance in pairwise distances. In terms of DNA barcode behavior, variance is greater in the ITS1 than ITS2, and variance is greater in both relative to the combined nrITS region. Due to the difference in variance, the barcode gaps in the ITS2 region are greater than in the ITS1. Additionally, the taxonomic approach of “splitting” taxa into numerous taxonomic units produces greater barcode gaps when compared to “lumping”. The results show variability in the barcode gaps between fungal taxa, demonstrating a need to understand the accuracy of DNA barcoding in quantifying species richness. For taxonomic studies, variability in nrITS sequence data supports the application of multiple molecular markers to corroborate the taxonomic and systematic delineation of species.
Keywords: DNA barcode, pairwise distances, nrITS, ITS1, ITS2
1. Introduction
The nuclear ribosomal internal transcribed spacer (nrITS) region is the DNA barcode for fungi [1]. Since its establishment as the fungal DNA barcode, the nrITS region has become the de facto first step in identifying fungi using the phylogenetic species concept [2]. In establishing the nrITS region as the DNA barcode for fungi, prior and subsequent studies evaluated the region’s efficacy for this purpose. In 2008, Nilsson et al. [3] examined within-species (intra-specific) variations within fungi. Using data obtained from the International Nucleotide Sequence Database (INSD), they evaluated satisfactory sequence data from 4185 fungal species to determine intraspecific variations in the entirety of the nrITS sequence data and between the parts of the nrITS (the ITS1 and ITS2 regions). This evaluation showed that 75% or more of species across the kingdom fungi were ≤3% variable in the nrITS region, regardless of the region (whole, ITS1, or ITS2). When Schoch et al. [1] declared the nrITS region to be the appropriate region for the fungal DNA barcode in 2012, they evaluated its efficacy in terms of the relative ease of producing sequence data from it, the availability of existing data, and its ability to recognize species. This later determination was made through the evaluation of the DNA barcode gap in the nrITS and other regions by examining 142 species of Pezizomycotina (Ascomycota) and 43 species of Basidiomycota.
The DNA barcode gap is defined as the region (e.g., gap) that separates the distribution of intra-specific pairwise distances from inter-specific distances among related taxa using a molecular barcode (Figure 1) [4]. Discussions surrounding the barcode gap, its definition and its appropriate application began in the early 2000s at the height of the debate about the efficacy of using DNA barcodes in diversity studies [5]. At this time, the discussion around the behavior and interpretation of the barcode gap was somewhat hypothetical as sufficient sequence data to explore this phenomenon were limited. Current interest in fungal DNA barcodes invites scrutiny toward their application, and barcode gap analysis could be used toward this end [6].
Many studies that have used the fungal DNA barcode to explore anything from fungal taxonomy to the documentation of fungal species richness. Fewer studies have evaluated the barcode gap. This feature in fungal barcodes has been evaluated in Ascomycete groups like Morchella [7] and Cryptococcus [8]. In 2017, Badotti et al. performed the most comprehensive evaluation of Basidiomycota to date and identified variations in the barcode gap among genera such as Agaricus, Hebeloma and Lactarius [9]. This study relied on the taxonomic identifications supplied by the databases to detect barcode gaps in some genera. A study that produces side-by-side comparisons of the barcode gap among multiple macrofungal genera using expertly curated datasets can yield important insights into the efficacy of the barcode gap for documenting and quantifying fungal diversity.
The evaluation of DNA barcode data in fungi has increased in as high-throughput sequencing technology (e.g., Illumina, 454 pyrosequencing, etc.) has increased the availability of barcode data from environmental samples to quantify and characterize fungal diversity [10]. Most of the early high-throughput technologies produced short-read sequences that were limited to either the ITS1 or ITS2 region of the nrITS. Improvements in sequencing quality and depth allowed for a deep exploration of fungal richness in environmental samples via metabarcoding studies. However, debate surrounding which portion of the nrITS region, ITS1 or ITS2, was better had developed [11,12,13]. Adding to the discussion around metabarcoding studies is that the identification of fungi depends upon adequate representations of nrITS data from accurately identified fungal taxa in sequence databases [14,15,16]. This is one of the many reasons for the GenBank Fungi RefSeq ITS Project (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA177353/).
Increasing the representation of sequences from reference specimens contained within fungaria will continue to improve such databases [17]. Unfortunately, time is not kind to DNA, making it increasingly difficult to obtain a whole DNA barcode from older material [17,18,19]. The adoption of high-throughput sequencing technology for the purpose of DNA barcoding macrofungal collections is a potential solution to the problem of old DNA [18,20,21]. Both Forin et al., 2018, and Miller et al., 2022, demonstrated the efficacy of Illumina MiSeq sequencing in targeting the ITS2 region of old and ancient fungarium specimens [21,22]. This approach has also been demonstrated in lichens [23,24,25]. Adoption of high-throughput sequencing technology has proven to be more successful than Sanger methods in sequencing ancient DNA from specimens and the ability to multiplex hundreds of samples increases the efficiency in terms of time and money [19,22]. The ability to acquire sequence data from numerous, ancient fungarium collections adds to a collection’s value by incorporating more specimens into future systematic and taxonomic studies.
The effectiveness of using either ITS1 or ITS2 to identify taxonomic diversity has been explored in many studies (e.g., [3,9,26,27]). Many of these identify ITS1 as the most effective portion of the nrITS region for species delineation, while some metabarcoding studies found that the ITS2 region is more successful at documenting fungal richness in mixed communities [13,28]. The existence of introns in the 5′ end of the small subunit in certain Ascomycota lineages likely limits primer annealing and makes amplifying the ITS1 region challenging [29,30]. Similarly, the length heterogeneity of the ITS1 region complicates sequencing in genera like Cantharellus [31], Astraeus, Russula and Lactarius [32]. These challenges tend to undermine the utility of the ITS1 region. Regardless, the higher rate of variation in ITS1 relative to ITS2 argues in favor of its efficacy in DNA barcoding [26]. How the nrITS, ITS1 and ITS2 regions differ in efficacy across taxa can be explored by evaluating intra- vs. inter-specific pairwise distances. The increased availability of nrITS sequence data, along with the availability of comprehensive taxonomic studies, provides an opportunity for such a study.
Despite technical and physical challenges of obtaining full length nrITS sequence data from ancient specimens, its continued use, either in whole or in parts, will continue to aid taxonomic and systematic studies of many groups of macrofungi (e.g., the fungi which produce macroscopic sexual reproductive structures in the phyla Ascomycota and Basidiomycota—primarily the subphyla Pezizomycotina and Agaricomycotina, respectively). Its application has even expanded to members of the amateur mycological community who have taken up DNA barcoding in order to contribute to fungal identification [33,34,35].
This study aims to explore intra- and inter-specific variations in the nrITS, ITS1 and ITS2 regions of numerous macrofungal lineages to evaluate the barcode gap. This will be accomplished using the pairwise distances of nucleotide sequences and phylogenetic data using datasets that were assembled by taxonomic experts or defined in recent and comprehensive taxonomic studies. The results will help provide perspective on the variation in nrITS DNA barcode sequence data in macrofungi toward developing best practices for fungal taxonomic and environmental research.
2. Materials and Methods
Datasets for macrofungal genera representing multiple phyla and orders were assembled according to recent taxonomy and the availability of relevant sequence data. These datasets represent Agaricales (Hebeloma [36,37,38,39,40,41,42,43,44,45,46], Laccaria [47,48], and Marasmius), Boletales (Suillus [49]), Russulales (Russula, Stereum [50]), Polyporales (Trametes [51]), Thelephorales (Sarcodon), Cantharellales (Hydnum), Pezizales (Morchella) and Eurotiales (Elaphomyces [52,53]) (Figure 2). The assignation of nrITS sequences to species followed the conclusions made by the authors or the conclusions of the referenced taxonomic studies. The author assignation of species to sequences followed taxonomic delineations defined in the taxonomic literature through phylogenetic and morphological characters. These were combined with the instincts of the authors, who have expert knowledge of these groups. All efforts were made to include ≥3 sequences per species to capture intraspecific variation. However, if only one sequence was available for a bona fide species, it was included to assist in measuring interspecific variation.
The genus Morchella was split into two datasets based on challenges with aligning nrITS sequence data across the entire genus. The two datasets represent the M. elata group and the M. esculenta group. The genus Stereum was analyzed as a single dataset using two different taxonomic approaches based on the study by DeLong-Duhon in 2020 [50]. One approach is identified as the “lumping” model, which categorized sequences into twenty taxa. The other is the “splitting” model, which categorized sequences into thirty taxa.
For each genus studied, sequence data were assembled in manual alignment editors, Mesquite [54] or AliView [55], with automated alignment performed using Muscle [56] or MAAFT [57]. Matrices were then inspected, and additional adjustments were performed manually to correct misalignments. Sequence data that did not include ≥50% of the ITS1 or ITS2 regions were excluded from the analysis. Aligned matrices were saved into three files: (1) the entire nuclear ribosomal internal transcribed spacer region (ITS1+5.8S+ITS2) with 18S and 28S portions trimmed (nrITS), (2) the ITS1 only, and (3) the ITS2 only. These files were each saved in an aligned fasta format for a pairwise distance analysis. Phylogenies of the nrITS, ITS1, and ITS2 datasets were produced in RAxML [58], using default parameters with a GTR model of evolution, implemented via the CIPRES Web Portal [59], or run on local computers. The results of the bipartition files were saved as newick files. Altogether, six data files were created for each of the eleven genera. These files represent the nrITS, ITS1, and ITS2 datasets in fasta and newick formats.
Analyses of the data from the fasta and newick files were performed in R. A pairwise distance analysis of the aligned nucleotide fasta files was carried out using the dist.dna function from the ape package, with the parameter pairwise.deletion set to TRUE to prevent the removal of sites with missing data (i.e., to keep the characters with gaps) [60]. Analyses were performed with the default K80 distance parameters (different rates of transitions and transversions), as well as the raw parameters to approximate p-distance values. A phylogenetic pairwise distance analysis was performed on the RAxML newick files using cophenetic.phylo() from the ape package, which provided values of the distances between tips.
The distributions of the pairwise distance data were evaluated using boxplots, calculations of the mean, median, variance, and quantiles. A “stress test” of the barcode gap was performed by adjusting the quantiles around the distribution at 85%, 90% and 95% quantiles (Figure 1b). For each of these probabilities the upper quantile of the intra-specific pairwise distance distribution was subtracted from the lower quantile of the inter-specific pairwise distance distribution. A positive value indicates the size of the gap between the intra- and inter-specific distributions. A negative number indicates the absence of a gap. This barcode gap assessment was tested on the pairwise distributions of the nrITS, ITS1 and ITS2 datasets from each genus. Shrinking or loss of the barcode gap was observed as the quantiles around the distributions increased.
3. Results
The R markdown files used for analysis, including the data from the aligned fasta and RAxML newick format results, are publicly available through the Open Science Framework (OSF) and on our GitHub repository. For access, please see the Data Availability Statement at the end of the manuscript. The files and the R scripts are annotated and provide access to the data necessary for the absolute reproducibility of the results in this study.
This study assembled twelve datasets representing a total of 5147 sequences and 716 species. These datasets represent individual macrofungal genera from eight orders (Agaricales, Boletales, Russulales, Polyporales, Thelephorales, Cantharellales, Pezizales and Eurotiales) and two phyla (Ascomycota and Basidiomycota). The number of species per dataset consisted of as few as 11 (Sarcodon) to as many as 148 (Russula), with the number of sequences ranging from 62 (Sarcodon) to 1616 (Hebeloma). The nrITS datasets ranged from 656 (Trametes) to 1356 (Morchella esculenta clade) nucleotides in length, the ITS1 datasets ranged from 183 (Stereum) to 756 (Morchella esculenta clade) nucleotides in length and the ITS2 datasets ranged from 235 (Stereum) to 455 (Morchella esculenta clade) nucleotides in length. The mean aligned lengths of the datasets were 820.1 (nrITS), 333.3 (ITS1) and 311.7 (ITS2) nucleotides.
The results for the pairwise distance analyses using the K80 and raw models and for the phylogenetic pairwise distances are comparatively similar. Because of this, most of the data presented below reflect the analyses using the K80 model unless otherwise specified. The results for the raw model and phylogenetic pairwise distance analyses are presented in the Supplementary Materials.
The results of the pairwise distance analysis of the sequence data are summarized in Figure 3 as boxplots. The overall means from 5147 sequences comparing the intra-specific nrITS, ITS1, and ITS2 pairwise distance results are 0.006, 0.013 and 0.008 respectively. The inter-specific pairwise distance means are 0.061, 0.099 and 0.079, which are displayed as horizontal dotted lines for the nrITS (black), ITS1 (red) and ITS2 (gray) datasets in Figure 3. These mean values are likely skewed by the Hebeloma data from this genus comprise around one-third of the entire dataset. The mean differences in the sequence data between the inter- and intra-specific distances for nrITS, ITS1 and ITS2 are 0.055, 0.087 and 0.071, respectively. Similar results are demonstrated when the pairwise distances were analyzed using a “raw” model of evolution (Supplemental Materials Table S1) or phylogenetic distances (Supplemental Materials Table S2).
The variances in the intra-specific pairwise distance distributions for these regions are 0.0001, 0.0004 and 0.0002, with the ratio of ITS1 to ITS2 nearly double at 1.93. The inter-specific distribution variances were 0.0032, 0.0112 and 0.0071, with an ITS1 to ITS2 ratio of 1.57. The ITS1 region had a higher mean and more variation than the ITS2 region in both inter and intra-specific pairwise distances of the sequence data given the above ratios. This is visually observed in Figure 3 and Figure 4. The latter presents a bar graph that displays the variances in pairwise distances, analyzed using the “raw” model, for macrofungal genera.
The barcode gap is defined as the difference between the highest intra-specific and lowest inter-specific distance measurements. This is what is expected when using a dataset that does not contain aberrations or outliers in the data. However, in most datasets in this study, the lower limits of the inter-specific pairwise distances contained a zero, negating any evaluation of the barcode gap. The barcode gap “stress test” presented in Table 1 was performed to address this. The expectation was that the quantiles will drop the outlying tails of the intra- and inter-specific distributions and provide an opportunity to evaluate the barcode gap (Figure 1b).
Table 1.
85% Quantiles | 90% Quantiles | 95% Quantiles | ||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
nrITS | ITS1 | ITS2 | nrITS | ITS1 | ITS2 | nrITS | ITS1 | ITS2 | ||||||||||||
Name | Ntaxa | Nseq | Gap? | Size | Gap? | Size | Gap? | Size | Gap? | Size | Gap? | Size | Gap? | Size | Gap? | Size | Gap? | Size | Gap? | Size |
Hebeloma | 103 | 1616 | TRUE | 0.0051 | TRUE | 0.0043 | TRUE | 0.0052 | TRUE | 0.0016 | FALSE | TRUE | 0.0000 | FALSE | FALSE | FALSE | ||||
Laccaria | 73 | 268 | TRUE | 0.0121 | TRUE | 0.0094 | TRUE | 0.0105 | TRUE | 0.0063 | FALSE | TRUE | 0.0100 | TRUE | 0.0018 | FALSE | TRUE | 0.0000 | ||
Marasmius | 132 | 220 | TRUE | 0.0152 | TRUE | 0.0128 | TRUE | 0.0046 | TRUE | 0.0075 | FALSE | FALSE | FALSE | FALSE | FALSE | |||||
Suillus | 56 | 767 | TRUE | 0.0230 | TRUE | 0.0153 | TRUE | 0.0331 | TRUE | 0.0142 | FALSE | TRUE | 0.0235 | TRUE | 0.0013 | FALSE | TRUE | 0.0039 | ||
Russula | 148 | 295 | TRUE | 0.0771 | TRUE | 0.0720 | TRUE | 0.1192 | TRUE | 0.0648 | TRUE | 0.0531 | TRUE | 0.0900 | TRUE | 0.0380 | TRUE | 0.0274 | TRUE | 0.0515 |
Stereum | ||||||||||||||||||||
“lumping” | 20 | 460 | TRUE | 0.0036 | FALSE | FALSE | FALSE | FALSE | FALSE | FALSE | FALSE | |||||||||
“splitting” | 30 | 460 | TRUE | 0.0156 | FALSE | TRUE | 0.0322 | TRUE | 0.0114 | FALSE | TRUE | 0.0230 | TRUE | 0.0058 | FALSE | TRUE | 0.0125 | |||
Trametes | 29 | 152 | FALSE | FALSE | TRUE | 1E-04 | FALSE | FALSE | FALSE | FALSE | FALSE | FALSE | ||||||||
Sarcodon | 11 | 62 | TRUE | 0.0413 | TRUE | 0.0472 | TRUE | 0.0527 | TRUE | 0.0152 | TRUE | 0.0423 | TRUE | 0.0134 | FALSE | TRUE | 0.0277 | TRUE | 0.0000 | |
Hydnum | 20 | 362 | TRUE | 0.0425 | TRUE | 0.0552 | TRUE | 0.0503 | TRUE | 0.0349 | TRUE | 0.0418 | TRUE | 0.0450 | TRUE | 0.0267 | TRUE | 0.0258 | TRUE | 0.0313 |
Morchella | ||||||||||||||||||||
elata group | 37 | 233 | FALSE | FALSE | FALSE | FALSE | FALSE | FALSE | FALSE | FALSE | FALSE | |||||||||
esculenta group | 21 | 107 | TRUE | 0.0306 | TRUE | 0.0607 | TRUE | 0.0043 | TRUE | 0.0196 | TRUE | 0.0407 | FALSE | TRUE | 0.0029 | TRUE | 0.0082 | FALSE | ||
Elaphomyces | 35 | 144 | TRUE | 0.0397 | TRUE | 0.0380 | TRUE | 0.0393 | TRUE | 0.0103 | FALSE | TRUE | 0.0199 | FALSE | FALSE | FALSE | ||||
Count | AVE | Count | AVE | Count | AVE | Count | AVE | Count | AVE | Count | AVE | Count | AVE | Count | AVE | Count | AVE | |||
11 | 0.0278 | 9 | 0.0350 | 11 | 0.0320 | 10 | 0.0186 | 4 | 0.0445 | 8 | 0.0281 | 6 | 0.0127 | 4 | 0.0223 | 6 | 0.0166 |
In summary, when assessing the barcode gap using quantiles, the entire nrITS region and the ITS2 region produce similar results at the 85, 90 and 95% quantiles using distance analyses under the K80 model (Table 1). These datasets retained their barcode gaps at the 90 and 95% quantiles more often in macrofungal taxa relative to the ITS1 datasets. The entire nrITS and ITS2 region produced the most consistent barcode gaps across the studied datasets. The ITS1 region resolved the fewest number of barcode gaps among the sequence data.
At the 85% quantile, the nrITS region produced a barcode gap for the majority of the datasets (11/13). The exceptions to this are the Trametes and the Morchella elata groups (avg. size size among existing barcode gaps: 0.0278, Table 1). The results were similar when assessing the 85% quantile for the pairwise distance of ITS2, in which the Stereum “lumping” and Morchella elata groups were the only datasets lacking a barcode gap (avg. size size among existing barcode gaps: 0.0320). For the 85% quantile for the pairwise distance of the ITS1 region, only 9 of the 13 datasets had sufficient barcode gaps. The Stereum “lumping” and “splitting”, Trametes and Morchella elata group were the datasets where a barcode gap was not observed (avg. size size among existing barcode gaps: 0.0350). At the 90% quantile, a majority of the nrITS region (10/13) and the ITS2 region (8/13) were retained. At the 95% quantile, these two datasets retained slightly less than half of the barcode gaps across all taxa (6/13). In contrast, the ITS1 region retained gaps in only four datasets (the Russula, Trametes, Sarcodon and Morchella esculenta groups) at both the 90% and 95% quantiles.
The variances in intra- and inter-specific pairwise distance distributions were evaluated across sequences and taxa. Under all models evaluated, Marasmius has the most intra-specific variation, while Hydnum has the lowest (Figure 3). For inter-specific variation, Elaphomyces has highest while the Stereum “lumping” model has the lowest. Figure 3 compares intra-specific and inter-specific variances for datasets across nrITS, ITS1 and ITS2 data. All distribution results of means and variances under K80, raw, and phylogenetic analysis can be found in Supplementary Material Tables S3–S5.
The genus Stereum was analyzed using two different approaches to species recognition, “lumping” and “splitting”, according to DeLong-Duhon et al. (unpublished but available on bioRχiv [50]). Their study applied Assemble Species by Automatic Partitioning (ASAP) method using MAFFT [57], and the species partitioning with the lowest (e.g., best) scores estimated either 20 or 30 species as optimal partitions. A pairwise distance analysis of Stereum was performed with both these approaches, and similar results were produced using the K80 and raw models and using phylogenetic distance data. In Figure 3, the lumping partitioning results are displayed in colored outlines and white-filled boxplots, while the splitting partioning results are presented as color-filled boxplots. Each result is displayed side by side for intra- and inter-specific nrITS, ITS1 and ITS2 comparisons. In Figure 4, Stereum under the lumping model (Stereum_20) has the second-highest variance for intra-specific pairwise distributions (Figure 4a). By comparison the splitting model (Stereum_30) is number eight on this list of intra-specific variances. In the inter-specific comparisons this relationship is inversed where the lumping model has the lowest variance and the splitting model has the second-lowest variance (Figure 4b).
4. Discussion
Using the barcode gap to analyze nrITS sequence data across macrofungal genera produced a number of interesting observations. First, variance in distance scores appears to have the greatest influence on the presence of a barcode gap. This can be seen in comparisons of nrITS, ITS1, and ITS2 sequence data, along with approaches used to recognize species diversity. Secondly, using the barcode gap to identify cutoffs for sequence clustering is challenging. One reason for this is that barcode gaps differ widely across taxonomic groups. The other reason is the fact that intra- and inter-specific distributions often overlap, eliminating the ability to use the barcode gap to infer cutoffs. This supports the discussion that nrITS sequence data has limited efficacy for taxonomic identification and verification across macrofungal genera [2].
ITS1 vs. ITS2–The ITS1 sequence data alone are nearly twice as variable as that of the ITS2 data (Figure 4). This difference between the two regions has been widely observed in studies that evaluate the DNA barcode in fungi [11,12,13]. It is also correlated with fewer observations of barcode gaps from the ITS1 datasets in the barcode gap assessment (Table 1). In the Laccaria, Russula, Suillus, Sarcodon, Hydnum and Stereum “splitting” datasets a barcode gap was retained for ITS2 all the way up through the 95% quantiles. In contrast, only four ITS1 datasets were retained at the 95% quantile. One difference being that the Morchella esculenta group was retained for ITS1 and not ITS2 at this quantile. Of all the taxa, Marasmius has the highest intra-specific variance. It retained a barcode gap for all sequence data at the 85% quantile and for the nrITS at the 90% quantile. For all other comparisions the barcode gap was lost (Table 1).
The fact that the ITS1 region struggles to maintain its barcode gap should affect the interpretation of ITS1′s role as a superior barcode marker [26]. Regardless, even though the presence or absence of a barcode gap can affect species recognition, it is still a theoretical approach to assessing species. It is not the final determination for species recognition. What these results do reinforce is the need to gather additional data in order corroborate evidence of species in taxonomic studies [61]. For environmental metabarcoding studies, it would be useful to broadly screen multiple taxonomic groups to understand the relative thresholds for recognizing MOTUs from the environment. This is discussed more below.
“Lumping” vs. “Splitting”–How species are partitioned within a taxonomic group affects the variance in pairwise distance measurements. As expected, this also influences the barcode gap. The Stereum dataset, was analyzed using “lumping” and “splitting” approaches in which the same dataset was divided into 20 species or as many as 30 species, respectively. Figure 5 illustrates how intra-specific variance is broader in the “lumping” hypothesis and narrower in the “splitting” hypothesis (Figure 5a,b respectively). In contrast, inter-specific variation is only slightly narrower for the “lumping” approach than the “splitting” (Figure 5c,d). When looking at the barcode gap stress test of Stereum, the wider variance in intra-specific variation in the “lumping” model appears to have a greater affect than the widening of the inter-specific variance in the “splitting” model.
Variance values for all K80 analyses are presented in Supplementary Material Table S3 (raw and phylogenetic results are in Tables S4 and S5 respectively). In comparing the variances for Stereum for the “lumping” and “splitting” approaches, the intra-specific variances are 1.54 × 10−4 and 4.52 × 10−5 respectively. The intra-specific variances are 1.75 × 10−4 and 2.04 × 10−4. If we take the high/low variance ratio between “lumping” and “splitting”, the ratio for intra-specific variances is near 3.5, while for inter-specific variances it is 1.2. This demonstrates how intra-specific variance is influenced under the “lumping” and “splitting” approaches with Stereum. Whether this observation is universal needs to be evaluated further in additional taxa.
Estimating cutoffs for MOTUs using barcode gaps–The ideal cutoff for recognizing MOTUs in environmental sequence data has been evaluated [3], but the challenge is in applying such universal cutoffs to a pool of sequences that likely represent fungi from multiple phyla [62]. Table 2 shows that establishing cutoffs for the genera in this study using the barcode gap is challenging, even for expertly curated datasets. Regardless, the fact that the middle of the proposed barcode gap for the taxa in this study varies from less than 2% to nearly 6% shows that a universal cutoff runs the risk of over- or underestimating fungal diversity, depending on the taxa.
Table 2.
ITS | ITS1 | ITS2 | ||||
---|---|---|---|---|---|---|
Names | Gap? | Mean | Gap? | Mean | gap? | Mean |
Hebeloma | FALSE | 0.7% | FALSE | 1.1% | FALSE | 1.2% |
Laccaria | TRUE | 1.8% | FALSE | 2.5% | FALSE | 2.5% |
Marasmius | FALSE | 5.0% | FALSE | 10.1% | FALSE | 5.9% |
Suillus | TRUE | 1.7% | FALSE | 2.9% | TRUE | 2.1% |
Russula | TRUE | 4.0% | TRUE | 4.5% | TRUE | 5.7% |
Stereum | ||||||
“lumping” | FALSE | 3.5% | FALSE | 6.9% | FALSE | 5.4% |
“splitting” | TRUE | 2.5% | FALSE | 5.8% | TRUE | 3.3% |
Trametes | FALSE | 1.8% | FALSE | 2.4% | FALSE | 2.9% |
Sarcodon | FALSE | 3.7% | TRUE | 5.5% | TRUE | 3.7% |
Hydnum | TRUE | 2.0% | TRUE | 2.8% | TRUE | 2.9% |
Morchella | ||||||
elata group | FALSE | 1.8% | FALSE | 5.0% | FALSE | 1.8% |
esculenta group | TRUE | 1.6% | TRUE | 2.4% | FALSE | 1.1% |
Elaphomyces | FALSE | 2.4% | FALSE | 4.0% | FALSE | 4.2% |
Count | 6 | 4 | 5 | |||
Max | 4.0% | 5.5% | 5.7% | |||
Min | 1.6% | 2.4% | 2.1% |
These results are precisely what was described by Ryberg (2015) when looking at the misrepresentation of species when assessing MOTUs [62]. He pointed out that several studies have observed an overlap of within- (intra-) and between-species (inter-) distances. The reality is that this overlap eliminates the barcode gap in nrITS sequence data, severely challenging the use of the marker for species recognition.
While the risk of over- or underestimating species diversity in environmental studies is real, the efforts needed to address these issues are likely to be impractical. Understanding community diversity and composition can be an abstract exercise. To explore how fungal communities contribute to ecosystem health, understanding the approximate diversity of a taxonomic group is sufficient if you have an understanding of the functional roles of the group. In such cases, working with sequence similarity cutoffs of 95-98% in forming clusters for MOTUs ought to provide researchers with sufficient resolution to characterize fungal communities.
Applying nrITS for taxonomy–Regarding describing new species, nrITS or its parts can aid in identifying novel taxa, but multiple molecular markers are necessary and strongly advised when delineating new species [61]. Fungal species are more than their sequence data. As a result, the current recommendation is refrain from using under-complex methods for species identification and to use multi-dimensional approaches such as the “consolidated species concept” [63]. Approaches that combine morphological, ecological and phylogenetic species concepts, using evidence from multiple molecular markers, are critical for achieving an accurate interpretation of taxonomic diversity [64]. Studies that describe new species of fungi would be of greater value if they provided a proper exploration of the fungal barcode gap by defining intra- and inter-specific distances. This would improve the utility of nrITS sequence data in the future.
Applying nrITS for species identification–This study evaluated the variance in nrITS sequence data across macrofungi because there is a growing need to produce DNA sequence data that improves reference sequences in databases [65,66]. Two approaches, which are not mutually exclusive, can be made toward this end: (1) increasing the representation of sequence data from curated fungarium specimens (quantity) and (2) increasing the expert evaluation of existing and new sequence data (quality). Ideally, reference sequences would be vastly improved by both, but in many cases, it has been the lack of quality, not quantity, that limits the usefulness of reference databases. In 2019, Hofstetter et al. highlighted the difficulty of using databases like GenBank nucleotide (nt) due to 30% of the sequences being erroneous in their identifications [14]. Olds et al., 2023, likewise estimated that 65% of the fungal specimens in global collections are misidentified, have outdated identifications, or have not been identified [19]. Additionally, metabarcoding studies frequently encounter dark taxa from environmental samples which are potentially a symptom of missing representation of known fungal taxa in existing sequence databases [67,68,69]. These dark taxa are even encountered in fungaria [70]. To address this, some researchers request that only sequence data from types be used [71]. However, as mentioned previously, the DNA from type specimens might not be obtainable, and the type sequence alone does not capture intra-specific variation for the species. The ability to provide quality reference data requires the preservation of source specimens from which additional taxonomic data can be evaluated. This emphasizes the importance of sequencing macrofungal collections to enable the recognition and identification of fungal taxa.
Macrofungi are a low-hanging fruit with respect to applying the fungal DNA barcode because these fungi produce sporocarps that are easily vouchered, referenced, and sequenced relative to microfungi and dark fungal taxa (DFT). For the latter, the calls to utilize sequence data as the type reference for DFT have increased in the last decade [72,73,74]. In 2023, Nilsson et al. revised the debate by establishing a set of best practices for defining species from DNA sequence data only [74]. While the current study addresses macrofungi, it provides some context for the discussion of DFT in the need to understand the boundaries around intra-specific sequence variation when establishing species. This would speak to the reproducibility of and confidence in recognizing taxa in future sequencing efforts.
The continued application of the barcode gap to fungi–The DNA barcode gap offers insight into the efficacy of nrITS sequence data or any additional barcode sequence data in macrofungi. Based on the work of this and other studies, the application of the barcode gap in fungi has its limitations [9,62]. Phillips et al., 2022, have cautioned that too many studies apply the DNA barcode to their organisms without sufficient statistical scrutiny of the barcode gap [6]. While they do provide strong arguments for scrutinizing the barcode gap, in fungi this may be moot where the barcode gap does not exist using nrITS sequence data. The definition of a fungal species is governed by so much more than a single molecular region. As a result, scrutinizing barcode gaps in secondary and tertiary barcodes for fungi is a practical next step in order to satisfy the desire of using molecular data for species recognition and identification.
In this study we have demonstrated that several macrofungal groups lack a clear barcode gap. This reality likely stems from wide variances in their intra- and inter-specific distance distributions. Compounding the issue is how variable distance measurements can be between taxa and between nrITS, ITS1 and ITS2 regions. If seeking a barcode gap signal from sequence data, the ITS2 region was more reliable than the ITS1 region across the taxa used in this study. This study adds to the already significant wealth of information about the strengths and weaknesses of the fungal DNA barcode and the growing emphasis of using additional molecular markers in taxonomic and systematic studies of fungi.
Acknowledgments
A.W.W. wishes to thank Michelle DePrenger-Levin for help with the R scripts used in this study. We appreciate the contribution of images from Mike Wood (Mykoweb.com, May 2023). S. D.-D. acknowledges Andrew Forbes and Robin Bagley as advisors and mentors. Several datasets were analyzed using the CIPRES Science Web Portal.
Supplementary Materials
The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/jof9080788/s1, Figure S1: Pairwise distances of DNA barcode sequence data using a raw substitution model. Horizontal lines are the inter-specific means for nrITS, ITS1 and ITS2 pairwise distance distributions from 5146 sequences. Stereum is evaluated under two partitions. The “lumping” partition has 20 species and is represented by open boxplots. The “splitting” partition has 30 species and is represented in the filled boxplots; Figure S2: Pairwise distances of DNA barcode sequence data using phylogenetic pairwise distances. Horizontal lines are the inter-specific means for nrITS, ITS1 and ITS2 pairwise distance distributions from 5146 sequences. Stereum is evaluated under two partitions. The “lumping” partition has 20 species and is represented by open boxplots. The “splitting” partition has 30 species and is represented in the filled boxplots; Figure S3: Variances in pairwise distance distributions (K80 model) ranked nrITS sequence data; Figure S4: Variances in phylogenetic pairwise distances ranked nrITS sequence data; Table S1: Bar Code Gap Assessment of DNA sequence data from Macrofungal Genera (raw model data); Table S2: Bar Code Gap Assessment of DNA sequence data from Macrofungal Genera (phylo model data); Table S3: Means and Variances from Pairwise Distances model K80; Table S4: Means and Variances from Pairwise Distances model raw; Table S5: Means and Variances from Phylogenetic Pairwise Distances.
Author Contributions
The identified authors contributed the following: A.W.W. project conception, coordination, data, analysis, writing; U.E. data, analysis, writing; N.N. data, analysis; C.R.N. data, analysis, writing; R.A.S. data, analysis; J.L.L. data, analysis; B.A.P. data, analysis, writing; M.H. data, analysis; T.W.O. data, analysis; S.D.-D. data, analysis; H.J.B. data; G.M.M. data. All authors have read and agreed to the published version of the manuscript.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Data and scripts used in this study can be found through the OSF link: https://osf.io/rvk42/; and through the GitHub repository: https://github.com/DenverBotanicGardens/DNA-Barcode-Gap-Project.
Conflicts of Interest
The authors declare no conflict of interest.
Funding Statement
The participation of C.R.N. was supported by the National Science Foundation, grant DEB-2030779. The participation of J.J.L was supported by National Science Foundation, grant DEB-2106105.
Footnotes
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
References
- 1.Schoch C.L., Seifert K.A., Huhndorf S., Robert V., Spouge J.L., Levesque C.A., Chen W., Consortium F.B. Nuclear ribosomal internal transcribed spacer (ITS) region as a universal DNA barcode marker for Fungi. Proc. Natl. Acad. Sci. USA. 2012;109:6241–6246. doi: 10.1073/pnas.1117018109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Lücking R., Aime M.C., Robbertse B., Miller A.N., Ariyawansa H.A., Aoki T., Cardinali G., Crous P.W., Druzhinina I.S., Geiser D.M. Unambiguous identification of fungi: Where do we stand and how accurate and precise is fungal DNA barcoding? IMA Fungus. 2020;11:1–32. doi: 10.1186/s43008-020-00033-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Nilsson R.H., Kristiansson E., Ryberg M., Hallenberg N., Larsson K.-H. Intraspecific ITS variability in the Kingdom Fungi as expressed in the international sequence databases and its implications for molecular species identification. Evol. Bioinform. 2008;4:193–201. doi: 10.4137/EBO.S653. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Meyer C.P., Paulay G. DNA barcoding: Error rates based on comprehensive sampling. PLoS Biol. 2005;3:e422. doi: 10.1371/journal.pbio.0030422. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Moritz C., Cicero C. DNA barcoding: Promise and pitfalls. PLoS Biol. 2004;2:e354. doi: 10.1371/journal.pbio.0020354. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Phillips J.D., Gillis D.J., Hanner R.H. Lack of Statistical Rigor in DNA Barcoding Likely Invalidates the Presence of a True Species’ Barcode Gap. Front. Ecol. Evol. 2022;10:859099. doi: 10.3389/fevo.2022.859099. [DOI] [Google Scholar]
- 7.Sa W., Qiao J., Gao Q., Li Z., Shang Q. DNA Barcoding and Species Classification of Morchella. Genes. 2022;13:1806. doi: 10.3390/genes13101806. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Hoang M.T.V., Irinyi L., Chen S.C., Sorrell T.C., ISHAM Barcoding of Medical Fungi Working Group. Meyer W. Dual DNA barcoding for the molecular identification of the agents of invasive fungal infections. Front. Microbiol. 2019;10:1647. doi: 10.3389/fmicb.2019.01647. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Badotti F., de Oliveira F.S., Garcia C.F., Vaz A.B.M., Fonseca P.L.C., Nahum L.A., Oliveira G., Góes-Neto A. Effectiveness of ITS and sub-regions as DNA barcode markers for the identification of Basidiomycota (Fungi) BMC Microbiol. 2017;17:42. doi: 10.1186/s12866-017-0958-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Taylor D.L., Walters W.A., Lennon N.J., Bochicchio J., Krohn A., Caporaso J.G., Pennanen T. Accurate estimation of fungal diversity and abundance through improved lineage-specific primers optimized for Illumina amplicon sequencing. Appl. Environ. Microbiol. 2016;82:7217–7226. doi: 10.1128/AEM.02576-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Blaalid R., Kumar S., Nilsson R.H., Abarenkov K., Kirk P., Kauserud H. ITS 1 versus ITS 2 as DNA metabarcodes for fungi. Mol. Ecol. Resour. 2013;13:218–224. doi: 10.1111/1755-0998.12065. [DOI] [PubMed] [Google Scholar]
- 12.Mello A., Napoli C., Murat C., Morin E., Marceddu G., Bonfante P. ITS-1 versus ITS-2 pyrosequencing: A comparison of fungal populations in truffle grounds. Mycologia. 2011;103:1184–1193. doi: 10.3852/11-027. [DOI] [PubMed] [Google Scholar]
- 13.Bazzicalupo A.L., Bálint M., Schmitt I. Comparison of ITS1 and ITS2 rDNA in 454 sequencing of hyperdiverse fungal communities. Fungal Ecol. 2013;6:102–109. doi: 10.1016/j.funeco.2012.09.003. [DOI] [Google Scholar]
- 14.Hofstetter V., Buyck B., Eyssartier G., Schnee S., Gindro K. The unbearable lightness of sequenced-based identification. Fungal Divers. 2019;96:243–284. doi: 10.1007/s13225-019-00428-3. [DOI] [Google Scholar]
- 15.Hilário S., Santos L., Phillips A.J., Alves A. Caveats of the internal transcribed spacer region as a barcode to resolve species boundaries in Diaporthe. Fungal Biol. 2021;126:54–74. doi: 10.1016/j.funbio.2021.10.005. [DOI] [PubMed] [Google Scholar]
- 16.Lücking R., Truong B.V., Huong D.T.T., Le N.H., Nguyen Q.D., Nguyen V.D., Von Raab-Straube E., Bollendorff S., Govers K., Di Vincenzo V. Caveats of fungal barcoding: A case study in Trametes s. lat.(Basidiomycota: Polyporales) in Vietnam reveals multiple issues with mislabelled reference sequences and calls for third-party annotations. Willdenowia. 2020;50:383–403. doi: 10.3372/wi.50.50302. [DOI] [Google Scholar]
- 17.Osmundson T.W., Robert V.A., Schoch C.L., Baker L.J., Smith A., Robich G., Mizzan L., Garbelotto M.M. Filling gaps in biodiversity knowledge for macrofungi: Contributions and assessment of an herbarium collection DNA barcode sequencing project. PLoS ONE. 2013;8:e62419. doi: 10.1371/journal.pone.0062419. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Runnel K., Abarenkov K., Copoț O., Mikryukov V., Kõljalg U., Saar I., Tedersoo L. DNA barcoding of fungal specimens using PacBio long-read high-throughput sequencing. Mol. Ecol. Resour. 2022;22:2871–2879. doi: 10.1111/1755-0998.13663. [DOI] [PubMed] [Google Scholar]
- 19.Olds C.G., Berta-Thompson J.W., Loucks J.J., Levy R.A., Wilson A.W. Applying a modified metabarcoding approach for the sequencing of macrofungal specimens from fungarium collections. Appl. Plant Sci. 2023;11:e11508. doi: 10.1002/aps3.11508. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.McRae B.H., Beier P. Circuit theory predicts gene flow in plant and animal populations. Proc. Natl. Acad. Sci. USA. 2007;104:19885–19890. doi: 10.1073/pnas.0706568104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Miller A.N., Karakehian J., Raudabaugh D.B. Next-Generation Sequencing of Ancient and Recent Fungarium Specimens. J. Fungi. 2022;8:932. doi: 10.3390/jof8090932. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Forin N., Nigris S., Voyron S., Girlanda M., Vizzini A., Casadoro G., Baldan B. Next generation sequencing of ancient fungal specimens: The case of the Saccardo mycological herbarium. Front. Ecol. Evol. 2018;6:129. doi: 10.3389/fevo.2018.00129. [DOI] [Google Scholar]
- 23.Gueidan C., Li L. A long-read amplicon approach to scaling up the metabarcoding of lichen herbarium specimens. MycoKeys. 2022;86:195. doi: 10.3897/mycokeys.86.77431. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Kistenich S., Halvorsen R., Schrøder-Nielsen A., Thorbek L., Timdal E., Bendiksby M. DNA sequencing historical lichen specimens. Front. Ecol. Evol. 2019;7:5. doi: 10.3389/fevo.2019.00005. [DOI] [Google Scholar]
- 25.Leavitt S.D., Kueler R., Newberry C.C., Rosentreter R., St Clair L.L. Shotgun sequencing decades-old lichen specimens to resolve phylogenomic placement of type material. Plant Fungal Syst. 2019;64:237–247. doi: 10.2478/pfs-2019-0020. [DOI] [Google Scholar]
- 26.Wang X.C., Liu C., Huang L., Bengtsson-Palme J., Chen H., Zhang J.H., Cai D., Li J.Q. ITS 1: A DNA barcode better than ITS 2 in eukaryotes? Mol. Ecol. Resour. 2015;15:573–586. doi: 10.1111/1755-0998.12325. [DOI] [PubMed] [Google Scholar]
- 27.Garnica S., Schön M.E., Abarenkov K., Riess K., Liimatainen K., Niskanen T., Dima B., Soop K., Frøslev T.G., Jeppesen T.S. Determining threshold values for barcoding fungi: Lessons from Cortinarius (Basidiomycota), a highly diverse and widespread ectomycorrhizal genus. FEMS Microbiol. Ecol. 2016;92:fiw045. doi: 10.1093/femsec/fiw045. [DOI] [PubMed] [Google Scholar]
- 28.Ihrmark K., Bödeker I., Cruz-Martinez K., Friberg H., Kubartova A., Schenck J., Strid Y., Stenlid J., Brandström-Durling M., Clemmensen K.E. New primers to amplify the fungal ITS2 region–evaluation by 454-sequencing of artificial and natural communities. FEMS Microbiol. Ecol. 2012;82:666–677. doi: 10.1111/j.1574-6941.2012.01437.x. [DOI] [PubMed] [Google Scholar]
- 29.Perotto S., Nepote-Fus P., Saletta L., Bandi C., Young J.P.W. A diverse population of introns in the nuclear ribosomal genes of ericoid mycorrhizal fungi includes elements with sequence similarity to endonuclease-coding genes. Mol. Biol. Evol. 2000;17:44–59. doi: 10.1093/oxfordjournals.molbev.a026237. [DOI] [PubMed] [Google Scholar]
- 30.Vrålstad T., Myhre E., Schumacher T. Molecular diversity and phylogenetic affinities of symbiotic root-associated ascomycetes of the Helotiales in burnt and metal polluted habitats. New Phytol. 2002;155:131–148. doi: 10.1046/j.1469-8137.2002.00444.x. [DOI] [PubMed] [Google Scholar]
- 31.Feibelman T.P., Bayman P., Cibula W.G. Length variation in the internal transcribed spacer of ribosomal DNA in chanterelles. Mycol. Res. 1994;98:614–618. doi: 10.1016/S0953-7562(09)80407-3. [DOI] [Google Scholar]
- 32.Rosenblad M.A., Martín M.P., Tedersoo L., Ryberg M., Larsson E., Wurzbacher C., Abarenkov K., Nilsson R.H. Detection of signal recognition particle (SRP) RNAs in the nuclear ribosomal internal transcribed spacer 1 (ITS1) of three lineages of ectomycorrhizal fungi (Agaricomycetes, Basidiomycota) MycoKeys. 2016;13:21–33. [Google Scholar]
- 33.Irga P.J., Barker K., Torpy F.R. Conservation mycology in Australia and the potential role of citizen science. Conserv. Biol. 2018;32:1031–1037. doi: 10.1111/cobi.13121. [DOI] [PubMed] [Google Scholar]
- 34.Irga P.J., Dominici L., Torpy F.R. The mycological social network a way forward for conservation of fungal biodiversity. Environ. Conserv. 2020;47:243–250. doi: 10.1017/S0376892920000363. [DOI] [Google Scholar]
- 35.Sheehan B., Stevenson R., Schwartz J. Crowdsourcing Fungal Biodiversity: Approaches and standards used by an all-volunteer community science project. Biodivers. Inf. Sci. Stand. 2021;5:e74225. doi: 10.3897/biss.5.74225. [DOI] [Google Scholar]
- 36.Eberhardt U., Schütz N., Bartlett P., Beker H.J. 96 North American taxa sorted–Peck’s Hebeloma revisited. Mycologia. 2022;114:337–387. doi: 10.1080/00275514.2021.2012063. [DOI] [PubMed] [Google Scholar]
- 37.Eberhardt U., Schütz N., Bartlett P., Hosaka K., Kasuya T., Beker H.J. Revisiting Hebeloma (Hymenogastraceae, Agaricales) in Japan: Four species recombined into other genera but three new species discovered. Mycol. Prog. 2022;21:447–472. doi: 10.1007/s11557-021-01757-x. [DOI] [Google Scholar]
- 38.Eberhardt U., Beker H.J., Vesterholt J. Decrypting the Hebeloma crustuliniforme complex: European species of Hebeloma section Denudata subsection Denudata (Agaricales) Pers.-Mol. Phylogeny Evol. Fungi. 2015;35:101–147. doi: 10.3767/003158515X687704. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Eberhardt U., Beker H.J., Vesterholt J., Dukik K., Walther G., Vila J., Fernández Brime S. European species of Hebeloma section Theobromina. Fungal Divers. 2013;58:103–126. doi: 10.1007/s13225-012-0188-3. [DOI] [Google Scholar]
- 40.Eberhardt U., Beker H.J., Borgen T., Knudsen H., Schütz N., Elborne S.A. A survey of Hebeloma (Hymenogastraceae) in Greenland. MycoKeys. 2021;79:17. doi: 10.3897/mycokeys.79.63363. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Beker H.J., Eberhardt U., Schütz N., Gulden G. A review of the genus Hebeloma in Svalbard. Mycoscience. 2018;59:303–309. doi: 10.1016/j.myc.2017.12.001. [DOI] [Google Scholar]
- 42.Eberhardt U., Beker H.J., Schütz N., Pedersen O.S., Sysouphanthong P., Læssøe T. Adventurous cuisine in Laos: Hebeloma parvisporum, a new species in Hebeloma section Porphyrospora. Mycologia. 2020;112:172–184. doi: 10.1080/00275514.2019.1680220. [DOI] [PubMed] [Google Scholar]
- 43.Grilli E., Beker H.J., Eberhardt U., Schütz N., Leonardi M., Vizzini A. Unexpected species diversity and contrasting evolutionary hypotheses in Hebeloma (Agaricales) sections Sinapizantia and Velutipes in Europe. Mycol. Prog. 2016;15:1–46. doi: 10.1007/s11557-015-1148-6. [DOI] [Google Scholar]
- 44.Eberhardt U., Beker H.J., Schuetz N., Mikami M., Kasuya T. Rooting Hebelomas: The Japanese ‘Hebeloma radicosum’ is a distinct species, Hebeloma sagarae sp. nov.(Hymenogastraceae, Agaricales) Phytotaxa. 2020;456:125–144. doi: 10.11646/phytotaxa.456.2.1. [DOI] [Google Scholar]
- 45.Eberhardt U., Schütz N., Beker H.J., Lee S.S., Horak E. Hebeloma in the Malay Peninsula: Masquerading within Psathyrella. MycoKeys. 2021;77:117. doi: 10.3897/mycokeys.77.57394. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Eberhardt U., Beker H.J., Vesterholt J., Schütz N. The taxonomy of the European species of Hebeloma section Denudata subsections Hiemalia, Echinospora subsect. nov. and Clepsydroida subsect. nov. and five new species. Fungal Biol. 2016;120:72–103. doi: 10.1016/j.funbio.2015.09.014. [DOI] [PubMed] [Google Scholar]
- 47.Wilson A.W., Hosaka K., Mueller G.M. Evolution of ectomycorrhizae as a driver of diversification and biogeographic patterns in the model mycorrhizal mushroom genus Laccaria. New Phytol. 2017;213:1862–1873. doi: 10.1111/nph.14270. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Wilson A.W., May T.W., Mueller G.M. Biogeography of Mycorrhizal Symbiosis. Springer; Berlin/Heidelberg, Germany: 2017. Biogeography of the ectomycorrhizal mushroom genus Laccaria; pp. 273–297. [Google Scholar]
- 49.Nguyen N., Vellinga E.C., Bruns T.D., Kennedy P. Phylogenetic assessment of global Suillus ITS sequences supports morphologically defined species and reveals synonymous and undescribed taxa. Mycologia. 2016;108:1216–1228. doi: 10.3852/16-106. [DOI] [PubMed] [Google Scholar]
- 50.DeLong-Duhon S., Bagley R.K., Forbes A.A. DNA, Morphology, and Ecology Resurrect Previously Synonymized Species of North American Stereum and Suggest Extensive Undescribed Global Diversity. bioRxiv. 2020 doi: 10.1101/2020.10.16.342840. [DOI] [Google Scholar]
- 51.Justo A., Hibbett D.S. Phylogenetic classification of Trametes (Basidiomycota, Polyporales) based on a five–marker dataset. Taxon. 2011;60:1567–1583. doi: 10.1002/tax.606003. [DOI] [Google Scholar]
- 52.Paz A., Bellanger J.-M., Lavoise C., Molia A., Ławrynowicz M., Larsson E., Ibarguren I., Jeppson M., Læssøe T., Sauve M. The genus Elaphomyces (Ascomycota, Eurotiales): A ribosomal DNA-based phylogeny and revised systematics of European ‘deer truffles’. Pers.-Mol. Phylogeny Evol. Fungi. 2017;38:197–239. doi: 10.3767/003158517X697309. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Molia A., Larsson E., Jeppson M., Læssøe T., Larsson K. Elaphomyces section Elaphomyces (Eurotiales, Ascomycota)—Taxonomy and phylogeny of North European taxa, with the introduction of three new species. Fungal Syst. Evol. 2020;5:283–300. doi: 10.3114/fuse.2020.05.14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Maddison W.P., Maddison D.R. Mesquite: A Modular System for Evolutionary Analysis. Version 2.75. Scientific Research; Atlanta, GA, USA: 2011. [Google Scholar]
- 55.Larsson A. AliView: A fast and lightweight alignment viewer and editor for large datasets. Bioinformatics. 2014;30:3276–3278. doi: 10.1093/bioinformatics/btu531. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Edgar R. MUSCLE: A multiple sequence alignment method with reduced time and space complexity. BMC Bioinform. 2004;5:113. doi: 10.1186/1471-2105-5-113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Katoh K., Standley D.M. MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol. Biol. Evol. 2013;30:772–780. doi: 10.1093/molbev/mst010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Stamatakis A. RAxML version 8: A tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30:1312–1313. doi: 10.1093/bioinformatics/btu033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Miller M.A., Holder M.T., Vos R., Midford P.E., Liebowitz T., Chan L., Hoover P., Warnow T. The CIPRES Portals. 2009. [(accessed on 19 July 2023)]. Available online: https://scholar.google.com/citations?view_op=view_citation&hl=en&user=BYZtDXEAAAAJ&cstart=20&pagesize=80&citation_for_view=BYZtDXEAAAAJ:_kc_bZDykSQC.
- 60.Paradis E., Claude J., Strimmer K. APE: Analyses of phylogenetics and evolution in R language. Bioinformatics. 2004;20:289–290. doi: 10.1093/bioinformatics/btg412. [DOI] [PubMed] [Google Scholar]
- 61.Aime M.C., Miller A.N., Aoki T., Bensch K., Cai L., Crous P.W., Hawksworth D.L., Hyde K.D., Kirk P.M., Lücking R. How to publish a new fungal species, or name, version 3.0. IMA Fungus. 2021;12:1–15. doi: 10.1186/s43008-021-00063-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Ryberg M. Molecular operational taxonomic units as approximations of species in the light of evolutionary models and empirical data from Fungi. Mol. Ecol. 2015;24:5770–5777. doi: 10.1111/mec.13444. [DOI] [PubMed] [Google Scholar]
- 63.Quaedvlieg W., Binder M., Groenewald J., Summerell B., Carnegie A., Burgess T., Crous P.W. Introducing the consolidated species concept to resolve species in the Teratosphaeriaceae. Pers.-Mol. Phylogeny Evol. Fungi. 2014;33:1–40. doi: 10.3767/003158514X681981. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Hawksworth D.L., Lücking R. Fungal diversity revisited: 2.2 to 3.8 million species. Microbiol. Spectr. 2017;5:10. doi: 10.1128/microbiolspec.FUNK-0052-2016. [DOI] [PubMed] [Google Scholar]
- 65.Tedersoo L., Abarenkov K., Nilsson R.H., Schüssler A., Grelet G.-A., Kohout P., Oja J., Bonito G.M., Veldre V., Jairus T. Tidying up international nucleotide sequence databases: Ecological, geographical and sequence quality annotation of ITS sequences of mycorrhizal fungi. PLoS ONE. 2011;6:e24940. doi: 10.1371/journal.pone.0024940. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Nilsson R.H., Ryberg M., Kristiansson E., Abarenkov K., Larsson K.-H., Kõljalg U. Taxonomic reliability of DNA sequences in public sequence databases: A fungal perspective. PLoS ONE. 2006;1:e59. doi: 10.1371/journal.pone.0000059. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Tedersoo L., Bahram M., Puusepp R., Nilsson R.H., James T.Y. Novel soil-inhabiting clades fill gaps in the fungal tree of life. Microbiome. 2017;5:42. doi: 10.1186/s40168-017-0259-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Ryberg M., Nilsson R.H. New light on names and naming of dark taxa. MycoKeys. 2018;30:31. doi: 10.3897/mycokeys.30.24376. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Nilsson R.H., Larsson K.-H., Taylor A.F.S., Bengtsson-Palme J., Jeppesen T.S., Schigel D., Kennedy P., Picard K., Glöckner F.O., Tedersoo L. The UNITE database for molecular identification of fungi: Handling dark taxa and parallel taxonomic classifications. Nucleic Acids Res. 2019;47:D259–D264. doi: 10.1093/nar/gky1022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Brock P.M., Döring H., Bidartondo M.I. How to know unknown fungi: The role of a herbarium. New Phytol. 2009;181:719–724. doi: 10.1111/j.1469-8137.2008.02703.x. [DOI] [PubMed] [Google Scholar]
- 71.Raja H.A., Oberlies N.H., Stadler M. Occasional comment: Fungal identification to species-level can be challenging. Phytochemistry. 2021;190:112855. doi: 10.1016/j.phytochem.2021.112855. [DOI] [PubMed] [Google Scholar]
- 72.Parra L.A., Zamora J.C., Hawksworth D.L., Hibbett D.S., Kirk P.M., Lücking R., Rambold G., Bensch K., Yao Y.-J., Robert V. Proposals for consideration at IMC11 to modify provisions related solely to fungi in the International Code of Nomenclature for algae, fungi, and plants. IMA Fungus. 2018;9:i–vii. doi: 10.1007/BF03449481. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Lücking R., Kirk P.M., Hawksworth D.L. Sequence-based nomenclature: A reply to Thines et al. and Zamora et al. and provisions for an amended proposal “from the floor” to allow DNA sequences as types of names. IMA Fungus. 2018;9:185–198. doi: 10.5598/imafungus.2018.09.01.12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Nilsson R.H., Ryberg M., Wurzbacher C., Tedersoo L., Anslan S., Põlme S., Spirin V., Mikryukov V., Svantesson S., Hartmann M. How, not if, is the question mycologists should be asking about DNA-based typification. MycoKeys. 2023;96:143–157. doi: 10.3897/mycokeys.96.102669. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Data and scripts used in this study can be found through the OSF link: https://osf.io/rvk42/; and through the GitHub repository: https://github.com/DenverBotanicGardens/DNA-Barcode-Gap-Project.