Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2019 Dec 16;14(12):e0226527. doi: 10.1371/journal.pone.0226527

Increased performance of DNA metabarcoding of macroinvertebrates by taxonomic sorting

Kevin K Beentjes 1,2,*, Arjen G C L Speksnijder 1, Menno Schilthuizen 1,2, Marten Hoogeveen 1, Rob Pastoor 1, Berry B van der Hoorn 1
Editor: Sebastian D Fugmann3
PMCID: PMC6913968  PMID: 31841568

Abstract

DNA-based identification through the use of metabarcoding has been proposed as the next step in the monitoring of biological communities, such as those assessed under the Water Framework Directive (WFD). Advances have been made in the field of metabarcoding, but challenges remain when using complex samples. Uneven biomass distributions, preferential amplification and reference database deficiencies can all lead to discrepancies between morphological and DNA-based taxa lists. The effects of different taxonomic groups on these issues remain understudied. By metabarcoding WFD monitoring samples, we analyzed six different taxonomic groups of freshwater organisms, both separately and combined. Identifications based on metabarcoding data were compared directly to morphological assessments performed under the WFD. The diversity of taxa for both morphological and DNA-based assessments was similar, although large differences were observed in some samples. The overlap between the two taxon lists was 56.8% on average across all taxa, and was highest for Crustacea, Heteroptera, and Coleoptera, and lowest for Annelida and Mollusca. Taxonomic sorting in six basic groups before DNA extraction and amplification improved taxon recovery by 46.5%. The impact on ecological quality ratio (EQR) scoring was considerable when replacing morphology with DNA-based identifications, but there was a high correlation when only replacing a single taxonomic group with molecular data. Different taxonomic groups provide their own challenges and benefits. Some groups might benefit from a more consistent and robust method of identification. Others present difficulties in molecular processing, due to uneven biomass distributions, large genetic diversity or shortcomings of the reference database. Sorting samples into basic taxonomic groups that require little taxonomic knowledge greatly improves the recovery of taxa with metabarcoding. Current standards for EQR monitoring may not be easily replaced completely with molecular strategies, but the effectiveness of molecular methods opens up the way for a paradigm shift in biomonitoring.

Introduction

Now that the use of DNA barcoding for the identification of species [1] has proven its merit, research is shifting towards the integration of molecular identifications in ecological and biodiversity assessments across different biomes [25]. Integration of molecular techniques can provide a significant added value for the monitoring of biological quality elements (BQEs) in fields such as the quality monitoring of freshwater under the European Framework Directive (WFD) [6]. To date, many of the BQEs analyzed for WFD monitoring are still assessed using traditional morphology-based methods [7]. These traditional methods, however, are known to be hampered by difficulties in identification and substantial differences between assessors [810], and can be expensive due to their time-consuming nature [1113].

Recent advances have shown the efficacy of DNA metabarcoding to assess macroinvertebrate samples [5,14] and to obtain metrics for bioassessments [1517]. Although DNA-based methods are generally perceived as an improvement over the traditional morphological assessments [18], challenges remain to be solved before DNA-based methods can be fully incorporated into routine bio-monitoring. Studies employing metabarcoding of aquatic macroinvertebrates are often limited to single samples [19], a select subset of taxa [20] or rely on mock communities [2124]. Research that does cover a broader variety of WFD monitoring samples often deals with differences in taxonomic resolution between morphological and DNA analyses [16,25]. One of the main confounding effects in the use of molecular approaches is the effect of primer bias and preferential amplification in complex samples, leading to taxonomic bias [26,27]. Interactions between taxa from various organism groups, of varying sizes and in varying biomass ratios remain understudied, and implications can be severe, limiting the possibility to relate metabarcoding read data to actual taxon abundances [28], even though these actual abundances might not be as important for simple ecological quality ratio calculations used by water monitoring agencies [29].

In this paper, we assess the implementation of DNA metabarcoding for species identification in bulk samples collected under the WFD. We evaluate the performance of DNA metabarcoding-based identification of taxa across six different taxonomic groups that collectively cover most of the traditional macroinvertebrate samples collected for WFD freshwater quality assessments: Annelida, Crustacea, Heteroptera/Coleoptera, Mollusca, Trichoptera/Odonata/Ephemeroptera, and Diptera. Our aim is to assess the effects of taxonomic sorting on the recovery of taxa from bulk metabarcoding, and the impact of replacing these groups with molecular data on ecological quality ratio (EQR) scoring. While EQRs are a simplified way to look at community compositions, they provide an insight in water quality, and are widely used by water monitoring agencies to assess the status of surface waters under the WFD [7,29]. We also discuss some concerns on DNA reference databases that may hinder successful application of molecular methodology in biomonitoring.

Materials and methods

Sample selection and processing

Freshwater macroinvertebrate samples were collected in the Hoogheemraadschap Rijnland monitoring district in 2010 and 2012 by ecological survey company Aquon (Leiden, the Netherlands). Samples were collected and analyzed according to standardized WFD monitoring guidelines [30]. Specimens were sorted by Aquon taxonomists into seven different categories during morphological analysis, and stored separately in ethanol per taxon group: ANNE (Annelida), ACA (Hydrachnidia, stored in Koenike’s fluid), CRUS (Crustacea), HECO (Heteroptera and Coleoptera), MOLL (Mollusca), TOE (Trichoptera, Odonata and Ephemeroptera), and REST (miscellaneous, predominantly Chironomidae and other Diptera). Specimens were identified to lowest possible level, preferably species level. For this study, we selected 25 samples out of 138 from the monitoring cycles of 2010 and 2012. More recent samples could not be used, as there is a five-year retention period for WFD monitoring samples. Samples were selected based on the WFD ecological quality ratio (EQR) scores (range 0.158–0.759), as well as the Shannon-index (range 0.840–4.326), to represent a broad range of sample diversities and complexities (for all 138 samples, EQR ranged from 0.059 to 0.847 and Shannon-index ranged from 0.602 to 4.326). EQR scores in the Dutch WFD monitoring range from 0.0 to 1.0, and are divided into 5 categories ranging from “bad” (EQR 0.0–0.2) to “high”(EQR 0.8–1.0) (for more detail, see [25]). The 25 selected samples represented four out of five quality classes, in the 138 samples there was only one sample that was scored as “high”. The full taxon lists with specimen counts have been included in the supplementary data (S1 File).

Not all of the seven groups were present in all samples. The water mites (ACA) were excluded from the analysis, as they were preserved in Koenike’s fluid (45% water, 45%, glycerin, 10% glacial acid acetic), which had a negative impact on the preservation of DNA and we were unable to obtain useable DNA extracts from the samples. To account for the missing taxa, water mites were also removed from the morphological lists during the comparison of DNA and morphology.

DNA extraction and amplification

Specimens were homogenized in 15 ml sterile tubes containing 10 steel beads (5 mm diameter), using the IKA Ultra Turrax Tube Drive (IKA, Staufen, Germany) in a fixed volume of 5.0 ml 96% ethanol. Each tube was ground three times for one minute on the maximum speed setting (6000 rpm). A tube with only 5.0 ml of 96% ethanol was used as an extraction blank. After homogenization, 500 μl of the ethanol with ground specimens was transferred to a 2 ml tube, and the ethanol was evaporated using a Concentrator plus vacuum centrifuge (Eppendorf, Nijmegen, the Netherlands). DNA was extracted from the remaining dry debris using the Nucleomag 96 Tissue kit (Macherey-Nagel, Düren, Germany) on the Kingfisher Flex Purification System (Thermo Fisher, Waltham, MA, US), with a final elution in 150 μl. To simulate a total DNA extraction on all taxa of one sampling location combined, 5.0 μl of DNA extract from each of the taxonomically sorted samples belonging to one location was combined into a pool, which was amplified and sequences in the same way as the sorted samples.

A two-step PCR protocol was used to create a dual index amplicon library, using primers BF1 and BR2 [22] to amplify a 316 base pair fragment of the COI barcoding region. These primers have been shown to successfully amplify a wide range of freshwater macroinvertebrates. All 183 samples (158 individually extracted tubes and 25 pools) were amplified and labeled separately, using two PCR replicates for each sample. First round PCRs were performed in 20 μl reactions containing 1x Phire Green Reaction Buffer, 10 μg BSA (Promega, Madison, WI, US), 0.5 mM dNTPs, 0.4 μl Phire Hot Start II DNA Polymerase (Thermo Fisher, Waltham, MA, US), 0.65 μM of each primer and 2.0 μl of template DNA. Initial denaturation was performed at 98°C for 30 seconds, followed by 30 cycles at 98°C for 5 seconds, 50°C for 5 seconds and 72°C for 15 seconds, followed by final elongation at 72°C for 5 minutes. PCR success was checked on an E-Gel 96 pre-cast agarose gel (Thermo Fisher, Waltham, MA, USA). PCR products were then cleaned with a one-sided size selection using NucleoMag NGS-Beads (Macherey-Nagel, Düren, Germany), at a 1:0.9 ratio.

Second round PCRs to add the individual P5 and P7 Illumina labels (Nextera XT Index Kit; Illumina, San Diego, CA, USA) were performed using 3.0 μl of cleaned PCR product from the first round in a 20 μl reaction containing 1x TaqMan Environmental Master Mix 2.0 (Thermo Fisher, Waltham, MA, USA) and 0.5 μM of each primer. Initial denaturation was performed at 95°C for 10 minutes, followed by 14 cycles at 95°C for 30 seconds, 55°C for 60 seconds and 72°C for 30 seconds, followed by final elongation at 72°C for 7 minutes. All PCRs were performed in 96-well plates, with replicates in separate plates. Each plate contained two wells with an artificial internal control (AIC) sample that was used to gauge the amount of cross-contamination between samples in the amplification process in the laboratory. The artificial control was based on the COI barcode region of a Reeve’s muntjac (Muntiacus reevesi) with several primer sets built into the sequence, and synthesized by IDT (Leuven, Belgium) (S1 Fig). Second round PCR products were quantified on the QIAxcel (Qiagen, Venlo, the Netherlands) and pooled equimolarly per PCR plate using the QIAgility (Qiagen, Venlo, the Netherlands). Pools were cleaned with a one-sided size selection using NucleoMag NGS-Beads (ratio 1:0.9) then quantified on the Bioanalyzer 2100 (Agilent Technologies, Santa Clara, CA, USA) with the DNA High Sensitivity Kit. The pools were then combined equimolarly into one sample and sequenced in one run of Illumina MiSeq (v3 Kit, 2x300 paired-end) at Baseclear (Leiden, the Netherlands). Sequence data is available from the NCBI Sequence Read Archive (Bioproject accession PRJNA550542).

Bioinformatics

Quality filtering and clustering of the entire dataset was performed in a custom pipeline on the OpenStack environment of Naturalis Biodiversity Center through a Galaxy instance [31]. Raw sequences were merged using FLASH v1.2.11 [32] (minimum overlap 50, mismatch ratio 0.2); non-merged reads were discarded. Primers were trimmed from both ends of the merged reads using Cutadapt v1.16 [33] (minimum match 10, mismatch ratio 0.2). Any read without both primers present and anchored was discarded. PRINSEQ v0.20.4 [34] was used to remove reads with length below 313 bp and above 319 bp, to allow for natural variations in coding sequence as well as potential primer slippage [35]. Sequences were dereplicated and clustered into Molecular Operational Taxonomic Units (MOTUs) using VSEARCH v2.10.3 [36] with a cluster identity of 98% and a minimal accepted abundance of 2 before clustering. The presence of AIC reads in the regular (non-control) samples, as well as the presence of non-AIC DNA in the control samples was used to determine the MOTU filtering threshold; only MOTUs with read abundances above 0.025% were retained for each replicate. Samples with fewer than 4,000 reads were discarded and PCR replicates were combined according to the additive strategy, counting all MOTUs, irrespective of how many replicates they occurred in [37], as the intent was to recover as many taxa as possible.

MOTU sequences were compared to a custom reference database using an extended BLAST+ script (https://github.com/naturalis/galaxy-tool-BLAST). The custom reference dataset included 2,757 COI barcodes obtained from WFD species collected in the Netherlands as part of the national DNA barcoding campaign [38], supplemented with sequences obtained from BOLD [39] belonging to the 795 genera listed on the Dutch WFD species list. A total of 350,449 public sequences of 679 genera were retrieved from BOLD using the package ‘bold’ [40] in R [41] (sequences downloaded 28 June 2018). The remaining genera were either not present in the BOLD database (107 genera) or had no public sequences linked to them (9 genera). The exclusion of sequences not identified to at least genus level allowed for linking taxa to the Dutch Species Register (https://www.nederlandsesoorten.nl/) based on genus names, making all taxonomic data compatible for use in lowest common ancestor analysis. The final database was dereplicated, removing all entries that had 100% identical DNA sequences and species names. MOTUs were also compared to a second custom reference library containing COI sequences and bacterial genomes downloaded from NCBI GenBank [42] (sequences downloaded 21 August 2018), to help filter out non-macroinvertebrate MOTUs and correct for misidentifications based on contaminated (e.g. Homo sapiens or Wolbachia) or otherwise erroneous sequences in the BOLD database.

The top 100 hits were obtained for both BLAST comparisons. Anticipating gaps in the DNA database, we developed a custom lowest common ancestor (LCA) tool to be able to assign higher-level taxonomic assignments for MOTUs without direct hits (>98% match and 100% coverage) in the reference database. The LCA tool was based on MEGAN [43], with adaptations to allow for the use of custom taxonomic databases and integration into the Galaxy infrastructure (https://github.com/naturalis/galaxy-tool-lca). The LCA script was performed on the top 5% hits, with bit-score >170, a minimum identity of 80% and a minimum coverage of 80%. The LCA tool was set to identify MOTUs no further than genus level. All direct hits (>98% match) were retrieved directly and accumulated based on taxon name associated with the sequences. To check for non-Dutch taxa and synonyms, a custom taxon matcher tool (https://github.com/naturalis/galaxy-tool-taxonmatcher) was used to compare all the names obtained to taxa recorded in the Dutch Species Register. In case of multiple taxa having a direct hit, the names were manually checked and taxonomy was determined based on the following set of rules: (1) non-Dutch species were removed, (2) synonyms were resolved, (3) sub-species level identifications were set to species level, (4) when a MOTU matched both genus level sequences and species level reference sequences of the same genus, species level identifications were retained, (5) putative misidentifications or contaminations were removed, based on expert judgment and the top 100 BLAST hits, (6) if one species matched consistently higher than another, the species with a better match was retained, (7) in case of equal matches with multiple species, all species names were retained (e.g. species complexes that could not be resolved with the available reference sequences).

Comparison morphology versus molecular identification

After applying the LCA script, MOTUs with the same taxonomic assignment were aggregated. Individual samples were then accumulated into their respective locations, with exception of the pool sample. Taxa lists obtained from the molecular analysis were compared to the WFD taxa lists based on conventional morphological identifications provided by Aquon. Morphological taxa lists were first matched to the Dutch Species Register using the same script that was used to compare the taxa lists retrieved from metabarcoding, to make the species names in both lists compatible. Before the comparison, redundancy was removed from both taxa lists, to exclude uncertainties in identifications or potential duplicates (i.e., a genus level identification was omitted if the list also contained specimens from that genus that were identified to species level).

DNA-based taxon lists from the pools and the separately sequenced samples added together were both compared to the morphological list manually. Each entry on the combined lists was classed into one of the following categories: (1) “found”, where there was an exact match between both lists; (2) “identified at a different level”, when there was a match, but either one of the lists had a higher-level identification; (3), “putative misidentification”, in cases where two different species from the same genus were listed on the respective lists; (4) “missing in reference” when the morphologically identified species was not covered by the DNA reference database; (5) “not found”, when the taxon was covered in the reference database, but only encountered in the morphological list; (6) “extra”, when the taxon was only encountered in the DNA list. To calculate the overlap between morphology and DNA, the first three categories were grouped together as being found in both lists, the taxa missing from the reference were counted towards the taxa only found in the morphology.

To analyze if uneven sequencing depth between samples pooled prior to amplification and the separately sequenced samples added together had any effect on taxonomic recovery, and to allow for better comparison between samples, all data was rarefied to the lowest read count available. Pooled samples were all rarefied to 15,000 reads, separately sequence samples representing the different taxon groups were each rarefied to 2,500 reads to adjust for the fact that most pools consisted of six taxon groups.

Ecological quality ratio (EQR) scores were calculated according to the Dutch standards for both morphological and DNA-based taxon lists, using the QBWat software version 5.33 [44] (with redundancies removed as described previously). Scores were calculated based on presence/absence data (with all specimen counts set to one) for both morphological and molecular data. Previous research has shown that abundances had limited impact on the EQR score [29].

Results

Sequence run statistics

Sequencing resulted in a total of 9,998,809 read pairs. After merging and quality filtering, 9,081,986 sequences were retained for MOTU clustering. AIC reads were detected in several non-control samples. A 0.025% threshold for filtering low-abundance MOTUs from each sample removed control reads from all samples. After filtering the MOTU table 2,460 MOTUs were retained in the non-control samples, representing 8,200,488 reads. Out of 366 replicates (158 sorted samples, 25 pools, all in duplicate), 77 with fewer than 4,000 reads were discarded. On average, PCR replicates had 28,345 reads (range 4,197–69,919), and 43.0 MOTUs (range 2–132). There was no correlation between number of reads and number of MOTUs in each sample.

Taxonomic composition

Using the two reference libraries, 1,837 MOTUs were identified as macrofauna taxa listed on the Dutch WFD taxon list on at least order level. A total of 319 MOTUs had direct matches above 98% percent, representing 213 distinct species or species complexes. The remaining MOTUs were identified to genus (1,394 MOTUs, 121 genera), family (93 MOTUs, 12 families) or order level (31 MOTUs, 11 orders). MOTUs that were not identified to at least order level were discarded. The final dataset of the sorted and separately amplified groups represented 208 species, 159 genera, 75 families and 34 orders. The data for the pools that were combined before the PCR amplification contained 172 species, 139 genera, 65 families, and 31 orders. The morphological lists covered 214 species, 151 genera, 73 families, and 30 orders (excluding the water mites) (S1 File). DNA-based taxon richness was significantly correlated with morphological taxon richness for both sorted samples (r = 0.662, p = 0.001) and pooled samples (r = 0.602, p = 0.002), where redundant taxa had been removed (Fig 1A). An additional 13 macroinvertebrates identified at species level were lost by the 0.025% threshold filtering (and only observed in the data that was discarded by this filter step). Seven of these were also recorded in the morphological assessment, the other six were only found using DNA. One of the species found in the discarded DNA-based data was Musculium lacustre, which was present in four samples where it was also detected morphologically, but only with one or two reads in each case.

Fig 1. Taxon and MOTU richness.

Fig 1

Relation between the morphological richness of samples and the (A) DNA taxon richness and (B) MOTU richness, for both the sorted samples (green triangles) and the pooled samples (orange circles), with a 95% confidence interval. Taxon richness was based on the taxon lists where redundant taxa had been removed. Correlations were significant for the taxon richness for both sorted samples (r = 0.662, p = 0.001) and pooled samples (r = 0.602, p = 0.002), but not for MOTU richness (r = 0.365, p = 0.072 and r = 0.331, p = 0.115, respectively).

To exclude the influence of sequencing depth (as sorted samples combined represented more sequencing depth than the pooled samples), we rarefied the samples to such an extent that sorted samples represented only one sixth of the pooled samples (most pools consisted of six combined extracts). Without rarefaction, the sorted samples had an average of 272,914 reads (range 170,637–424,726), which was 4.8 times more than the pools had (57,283 on average, range 15,061–122,002). They also had 2.67 times as many MOTUs and 1.52 times as many taxa as the pools. With rarefaction the sorted samples still had 2.22 times as many MOTUs and 1.40 times as many taxa; neither was significantly lower than without rarefaction.

Comparison morphology versus molecular identification

Retaining the redundant taxa, the average richness of pooled samples (32.5 on average, range 16–56) was significantly lower than that of the sorted samples (47.6 on average, range 22–76) (Dunn’s test, p = 0.005). When redundant taxa were removed, the richness of the pooled samples (22.9 on average, range 10–38) was again lower than the sorted samples (30.6 on average, range 12–54), but not significantly. Compared to the morphological richness with redundant taxa (46.7 on average, range 16–89), the richness of the pooled samples was significantly lower (Dunn’s test, p = 0.027). The richness of the pooled samples was also significantly lower than the morphological richness when redundancy was removed (40.8 on average, range 14–75) (Dunn’s test, p < 0.001). The richness of the sorted samples was not significantly different from the morphological richness in either situation.

For 13 out of 24 separately processed mollusc samples (one sample did not include molluscs) we were unable to amplify molluscs using the standard approach for DNA extraction and PCR. Additionally, four annelid samples, three Heteroptera/Coleoptera (HECO) samples, one crustacean sample, and one TOE sample failed to amplify, although the latter two only contained three and two species, respectively. The failed mollusc samples on average contained 13.2 morphologically identified taxa (range 5–20), the failed HECO samples 18.3 taxa (range 7–25) and the missing annelids accounted for 6.3 taxa (range 2–12). If taxa from the failed samples are excluded from the analysis (as they can only count towards the fraction of taxa not found by DNA), the overlap between the taxon list from sorted samples added together and the morphological taxon list was 56.8% on average (range 32.5–91.7%). On average, 22.9% of taxa were only found in morphology (range 0–50.0%), and 20.3% were only recovered using DNA (range 5.6–35.0%) (Fig 2A). If failed samples are included, the overlap between morphology and DNA was 47.6% on average (range 22.9–73.3%). For the pooled samples, the combined taxon lists contained an average of 47.3 taxa, with a 40.3% overlap between morphology and DNA (range 13.0–62.8%). 48.1% of taxa were only recorded in the morphological list (range 27.9–84.1%), and only 11.7% were found exclusively with DNA (range 0.0–27.8%) (Fig 2B). In 14 out of 24 samples (one pooled sample failed to amplify), the fraction of taxa only found with morphology was higher than the fraction of overlap between the two taxon lists, the fraction of taxa only found using DNA was never higher than the fraction of taxa found only in the morphological analysis. In contrast, in 14 out of 25 samples where taxa were sorted prior to DNA analysis, the fraction of taxa exclusively found with DNA was higher than the morphology-only fraction.

Fig 2. Overlap between morphological and DNA-based taxa.

Fig 2

The fractions of total observed diversity present in both morphological assessment and DNA-based methods (yellow) and the fractions only represented in morphology (red) and DNA (blue), for the sorted samples (A) and pooled DNA analysis (B). The fractions were also assessed for each of the sorted taxa groups separately according to the following sorting of taxa: ANNE (annelids), CRUS (crustaceans), HECO (Heteroptera and Coleoptera), MOLL (molluscs), REST (rest groups, almost exclusively chironomids and other dipterans) and TOE (Trichoptera, Odonata and, Ephemeroptera). Error bars indicate the standard error.

The three categories that were counted towards the overlap contained 402 entries (72.6%) where there was a direct match between the species recorded in the morphological analysis, and the species identification obtained from metabarcoding. In 124 cases (22.4%) there was a match between morphology and metabarcoding, but the entries on both lists were not identified to the same taxonomic level. The majority of these were annelids not covered in the reference database at species level (but were identified from molecular data at higher level using LCA) and dipterans identified to species level in the metabarcoding analysis but only identified at genus level or higher in the morphological data. The remainder were 28 cases of putative misidentifications (5.1%), where both list contained a different species from the same genus.

Looking at the six taxa groups separately (again excluding the failed samples), the overlap varies. The highest overlap was found in the crustaceans and HECO samples (71.4% and 72.6%, respectively), even though for HECO in one case the morphological and DNA-based taxon lists did not overlap at all (both, however, only contained one species each). The lowest overlap was found in the annelid samples (47.8% on average). Overlap for the MOLL, REST and TOE samples was 53.9%, 56.4% and 64.3% on average, respectively (Fig 2A). For the REST samples, the fraction of taxa found only in the DNA was larger than the fraction of taxa only recorded morphologically, for all other groups there were more taxa in the morphology list than there were on the DNA list. In 18 samples, more taxa were found with DNA than with morphology, in 26 samples more taxa were obtained with morphology. For 18 samples the morphology and DNA taxon lists was a complete match, although some taxa were not identified up to the same taxonomic level for both methods. In addition to the previously mentioned HECO sample, there was one other sample in the TOE set where DNA and morphology were mutually exclusive (S2 Fig). In the pooled samples, the overlap between morphology and DNA was considerably lower for most taxa groups, but most noticeable in the HECO and mollusc samples, where most taxa were only present on the morphological list. For all groups, more taxa were found with morphology than were found with DNA metabarcoding (Fig 2B).

Ecological quality ratios

The EQR scores based on the DNA data differed considerably from the morphology-based EQR scores for both the pooled and the sorted samples (Fig 3A and 3B). There was only a moderate correlation between the morphology- and DNA-based scores (Pearson correlation, r = 0.596 and 0.545, respectively). The scores obtained from the pooled samples were usually lower than the morphological scores (16 out of 24). For the sorted samples, half the samples (13 of 25) had a lower score using molecular identifications, the other half (12 of 25) scored higher based on DNA data. The average absolute difference in EQR score was similar for both datasets: 0.12 for the pooled samples (range 0.007–0.302) and 0.11 for the sorted samples (range 0.007–0.310). Using the pooled samples, 15 out of 24 locations scored in a different quality class (five higher, ten lower), and for the sorted samples, 12 of 25 ended up in a different quality class (five higher, seven lower). When replacing just one of the groups with molecular data for the EQR calculations, the correlations between the two scores were much stronger (ranging from r = 0.937 for REST to 0.989 for CRUS, p < 0.001 for all groups), even with the complete removal of some groups due to failed samples (Fig 3C–3H).

Fig 3. EQR scores.

Fig 3

Comparison for the EQR score calculated on morphological data versus the score calculated on DNA data for (A) the pooled and (B) the sorted and separately sequenced samples. Both showed a moderate correlation between the scores (Pearson correlation, p = 0.002 and 0.005, respectively). Better correlations were found when only replacing one of six taxon groups with molecular data (on the y-axis): (C) Annelida, (D) Crustacea, (E) Heteroptera and Coleoptera, (F) Mollusca, (G) Chironomidae and other Diptera and (H) Trichoptera, Odonata, and Ephemeroptera (black circles, Pearson correlation values provided in the panels, p < 0.001 for all groups). To assess the influence of each of the respective groups on the EQR score, the original scores (x-axis) were also compared to EQR scores where one taxon group was completely removed (y-axis) from the taxon list before the analysis (C-H, red triangles, Pearson correlation values provided in the panels, p < 0.001 for all groups).

Discussion

We found that pre-sorting of samples into six basic taxon groups vastly improved the recovery of taxa using metabarcoding of bulk samples, with 46.5% more taxa found as compared to the samples where DNA was pooled prior to amplification and sequencing (47.6 versus 32.5 on average). The average overlap between the morphological and molecular (for the sorted samples) taxon lists was 56.8%, with the fractions of taxa found in only the morphology and only the DNA roughly equal (22.9% and 20.3%, respectively) (Fig 2A). Discrepancies between morphology and DNA-based species lists were expected, based on missing taxa from the reference database, known difficulties with morphological identification of taxa [9,10], and primer biases [28] as contributing factors. Even though they were tested mainly on insects, the primers used in this study showed good in silico potential for all taxonomic groups included in our samples [22], especially compared to some other oft-used primers. While there may be primers that perform better for a specific group, a single, broad-range primer set that perform equally well on all taxa will most likely never exist [27]. Large differences were already observed between two morphological assessments in freshwater monitoring samples in previous studies, where there was more than 30% difference in identification of taxa. All taxon groups seemed to be equally prone to errors in morphological identification, even those deemed difficult to identify [45]. In our data we see that the overlap between morphology and DNA varies between the different groups, being highest for the Crustacea and the Heteroptera /Coleoptera. The poor performance of the mollusc samples may not be entirely attributable to the primers, as molluscs are the group that is most affected by differences in biomass between the different species in a sample.

There have been few studies comparing morphological identifications and DNA-based identifications on actual samples, instead of relying on mock communities. A study assessing the taxa detected by morphology and DNA on Finnish WFD samples (using the same primers as this study) found considerably more taxa with DNA than they did with morphology [16], but morphological assessments did not include species or genus level identifications for certain groups, such as the species-rich Chironomidae. The taxonomic resolution in the present study was comparable between morphology and DNA metabarcoding, and explains why richness estimations were more comparable on average. Still, we found some differences between taxon lists caused by disparity in resolution for certain taxa. On the side of the morphology, higher-level taxonomic identifications have been made due to the difficulty of distinguishing taxa, especially those in larval stages. For example, none of the Ceratopogonidae had been identified beyond family level using morphology, but five different genera were detected with DNA. On the other hand, the DNA reference database did not cover all the taxa that were listed in the morphological dataset (6.5% of the morphologically identified species had no DNA reference). For instance, every specimen of Alboglossiphonia was only identified up to genus level using the LCA tool in the DNA analysis, as all three species recorded in the morphological analysis were unaccounted for in the reference database (sequences could still be identified to genus level based on matches to congeneric species).

Some groups that were examined in this study consists of considerably more taxa than others. This difference in group size inevitably leads to a larger number of “lost taxa” when one taxon dominates the reads due to the effects of preferential amplification. In the majority of the pooled samples (15 of 24) more than half of the reads is provided by one of the six groups (S3 Fig), and in eleven samples more than half the reads even belonged to a single taxon. While some have argued that for general patterns in biodiversity, the effects of primer bias may be limited, the taxonomic bias caused by primer mismatches in certain taxonomic groups can be an issue when trying to reconstruct taxa lists [27]. Taxonomic sorting can improve the recovery of taxa, as witnessed by the improved performance of the sorted and separately sequenced samples in comparison to the pooled samples, which represent a broader range taxa. In the sorted samples, 46.5% more taxa were found than in the pooled samples (47.6 versus 32.5 on average), also leading to more overlap with the morphological list (56.8% versus 40.3% on average). Similar improvements have been found when using a size-based sorting of specimens prior to DNA extraction and amplification, where around 30% more taxa were found compared to non-sorted samples [23], although others report that amplification bias across size ranges may be limited with deep sequencing [27]. When assessing the separate groups, the effect of the pooling of samples prior to DNA amplification and sequencing has the largest effect on the HECO and mollusc samples, where 65.6% and 46.6% fewer taxa were found in comparison to the sorted and separately sequenced samples (Fig 2B). Rarefaction showed that the reduced sequencing depth of the pools, when compared to the combined separately sequenced samples, was not solely responsible for the reduction in detected taxa. Even when rarefied to the same sequencing depth, we still obtained 40.0% more taxa in the sorted samples. A study assessing the taxonomic recovery of tropical forest arthropod communities showed similar findings, where there was some decline in MOTUs recovered for specific taxon groups in increasingly complex mixtures. This was mostly caused by the introduction of other groups, which were apparently amplified preferentially [27], comparable to our observations with HECO and mollusc taxa. Taxonomic sorting into the groups presented in this study is relatively straightforward and would require only superficial knowledge of taxonomy. Compared to genus or species level sorting and identification, both the time and costs involved are between one and two orders of magnitude lower [13,46].

The difficulties in identifying specimens using morphology can also express themselves in the DNA-based identities, by way of having erroneously identified specimens within the DNA reference library. We encountered a variety of unresolved taxa and putative identification errors in the reference data downloaded from BOLD. In the 350,449 public sequences we found 554 cases where congeneric species had identical sequences. These are not necessarily identification errors, as some closely-related species are known to be indistinguishable by the DNA barcode region [47], but do highlight the need to not look at just the “top 1” or “best hit” matches when comparing sequences to a reference database. When multiple hits with the same scores are found, matching algorithms do not always consistently place the same match at the top of the list, introducing random variation between analyses when only looking at the first hit. Additionally, 47 cases of identical sequences with different species from different genera were found, some of which could be traced back to actual contaminated sequences (e.g. Homo sapiens or Wolbachia). Most of such misidentified records have been flagged by BOLD curators, which was verified by manually checking a random selection of records. Moreover, recent analysis of the BOLD data revealed a relatively high number of specimens that had been identified using “reverse BIN taxonomy”, adding further levels of uncertainty to the reference datasets retrieved from BOLD [48]. To improve the use of public data such as the sequences deposited in BOLD, the ability to filter data based on record flags or identification method is essential.

An incomplete reference databases is a major issue that limits the use of metabarcoding for species identification [49,50]. While 93.5% of the morphologically identified species of this study had reference sequences, database coverage for all Dutch WFD taxa is only 86.1%. There are large differences for each of the taxa groups as defined by this study, with 63.5% of annelids covered by reference sequences (54 of 85 species), while 96.5% of the TOE group has been barcoded (273 out of 283 species). Additionally, we still observe difficulties in identification for species known to have high genetic diversity. For example, 14 MOTUs were identified as Asellus aquaticus, another 109 were identified at genus level as Asellus (for which only A. aquaticus is recorded in the Netherlands). The tendency to overestimate richness based solely on MOTUs (see also Fig 1) has already been reported in the past, with population and haplotype differences increasing richness estimates [25,51]. The length threshold used in this study (allowing for sequences which were three base pairs shorter or longer than the 316 bp target to pass quality filtering) may have contributed to an overestimation of richness based on MOTUs. We aimed to mitigate this effect by aggregating all MOTUs with identical taxonomic identification and discarding any unidentified MOTU from the analysis. While alternate clustering methods may exaggerate or downplay this effect of overestimation, the difference in intraspecific variation between taxonomic groups will lead to either overestimations for taxa with high intraspecific variation or underestimations by lumping taxa with low interspecific variation depending on the cluster settings. The observed variation also suggests that the DNA reference library could be improved by better geographical coverage, incorporating a wider range of haplotype variation. Another phenomenon that may have caused an overabundance of Asellus and other genera in the MOTUs, is the presence of pseudogenes that have been amplified [52,53], especially with deep sequencing of highly abundant taxa. Many of the MOTUs identified at genus level have fewer reads (1,768 on average) compared to the MOTUs with species level identification (75,128 reads on average). Similar patterns were seen for other genera as well (e.g. Helobdella, Limnomysis), including genera that have more than one species recorded for the Netherlands, and which were all represented in the reference database (e.g. Cymatia, Erythromma, Noterus).

Haplotypes and pseudogenes aside, we should be wary of the fact that many taxon groups still contain undescribed diversity and cryptic species [54], which may be perceived as overestimations of taxa when using DNA-based identification methods. This information can still be valuable, as it has been shown in mayflies that cryptic species exhibit a wide variety of tolerances and responses to ecosystem stressors [55]. Furthermore, MOTU level analysis of Chironomidae has demonstrated that even without binomial names, different putative taxa could be identified, showing different response patterns [56]. This opens possibilities to use DNA-based delimitations for comparative quality assessments and impact studies even for those taxon groups that are poorly defined in reference databases, which is still hampering the use of DNA-based identification in various groups [57]. DNA-based identification may not always exactly reflect the observations made by traditional morphological methods, but at least may provide a more consistent way of identifying taxa [18]. Morphological assignments are prone to discrepancies between assessors, as shown by large differences between identification made in audits of WFD assessments [9,10]. The choice of tools and parameters used in the processing of raw sequence data (such as filtering and clustering) can have a significant impact on taxonomic inferences as well [37,58], but in comparison with morphological assessments should be easier to report and standardize. Molecular data is also more easily re-analyzed when new insights are developed, and is backwards compatible with updated reference databases.

The impact of DNA-based identifications on the EQR scoring is considerable, for both the pooled and the sorted samples, with neither giving a better approximation of the morphology-based score (Fig 3A and 3B). The correlation between the two scores was only moderate (Pearson correlation, r = 0.596 and 0.545, respectively). EQR scores for the pooled samples were generally lower than the morphology-based EQRs (16 out of 24), whereas the sorted samples provided scores that were lower in half the samples, higher in the other half. The average absolute difference between the EQRs obtained from morphological data and the DNA-based scores was similar for both datasets (0.12 and 0.11 for the pooled and sorted samples, respectively). When replacing just one of the six groups from the morphological taxon list with DNA-based identifications, the impact on the EQR score was considerably lower. This allows for the use of DNA metabarcoding for a select group of taxa, for example in cases where morphological assessments are difficult or time-consuming (such as Chironomidae), without the need to recalibrate the entire EQR scoring method. In such cases, it would also be possible to use primers that are tailored more specifically to the investigated taxa, in order to limit primer bias. The largest deviation was seen in the mollusc samples, where the absolute difference was 0.033 on average (range 0–0.091), but the scores were still strongly correlated (Pearson correlation, r = 0.966, p < 0.001). Molluscs were also the group for which most samples failed to amplify (13 out of 24, S2 Fig), but this did not seem to have too much of an impact on the scoring. However, complete removal of groups can have a substantial impact on the EQR score, especially for the annelids (Fig 3C–3H). Removal of the water mites, which were excluded from the analysis due to inability to obtain DNA, had a comparable impact on the EQR scoring to some of the other groups (S4 Fig). To minimize the effect of stacking these impacts, the water mites were completely discarded from all EQR analyses.

The fact that DNA-based EQR scores are so different from the scores based on traditional morphological surveys can partly be attributed to the changes in taxonomic resolution and deviations between the two taxon lists, but the differences are likely exaggerated by the changes in richness as well. For the Dutch EQR calculations, the percentage of characteristic taxa, and positive and negative indicator species (as a fraction of the total richness) play an important role for the final score [29,59]. While the average taxon richness was not significantly different between the morphological assessment and the sorted and separately sequenced samples used for the DNA-based calculations, the differences for each sample were considerable (Fig 1A), with an average difference in richness between morphology and DNA of 12.0 (range 1–48). Together, the changes in the number of negative and positive indicators, and the changes in the ratios of these indicators can have a significant impact on the final EQR score (Fig 3A and 3B).

Molecular techniques may not directly replace traditional morphology under the current WFD monitoring standards, and future monitoring requires a paradigm shift to fully incorporate the potential of DNA-based methodology. Time is needed for new techniques to prove their worth in the field of biomonitoring and be accepted by monitoring agencies and policy makers. Being able to replace morphological assessment for only one or a few taxon groups without needing to redefine the framework for BQE monitoring, opens possibilities for gradual implementation of DNA-based identifications for those groups that are most difficult to identify, time-consuming or where taxonomic expertise is getting scarce.

Conclusions

There are considerable differences when directly comparing the outcome of traditional morphological assessment and DNA metabarcoding-based identifications of bulk samples, including their effects on the EQR score under current standards. Our data shows that DNA metabarcoding compares better to morphological assessments for some taxonomic groups than for others, partly based on the underlying DNA reference database, or lack thereof. Mismatches were observed between morphology and metabarcoding, but the latter will be less reliant on individual biases introduced by different assessors, and therefore lead to more consistent assessments. Taxonomic sorting into basic groups improves the taxon recovery, as shown in this study, where 46.5% more taxa were found when samples were sorted into six basic groups prior to DNA amplification and sequencing. Even when corrected for sequencing depth, sorted samples still produce around 40% more taxa as non-sorted samples. DNA-based assessments may not directly replace traditional monitoring in the near future, but can certainly contribute to the current methodology, especially for those groups that are perceived as difficult to identify, to allow for more consistent and faster identifications. Metabarcoding would greatly improve with addition of vouchered specimens to reference databases. Furthermore, we show that replacing only one of six taxa groups assessed in this study by molecular data has limited impact on the EQR scoring, opening possibilities for gradual replacement of traditional identification, or supplementing the traditional identification with DNA-based tools, which will help with the acceptance of molecular methodology in WFD monitoring.

Supporting information

S1 Fig. Artificial internal control.

The artificial control (AIC) used to measure cross-contamination. The sequence is based on the COI barcode region of a Reeve’s muntjac (Muntiacus reevesi) with several primer sets built into the sequence (forward strand shown). Binding sites for COI primers are shown, BF1 and BR2 used in this study are highlighted in red.

(EPS)

S2 Fig. Overlap between morphology and DNA per sample.

The overlap between morphology and DNA (in yellow), as well as the fractions of taxa only detected with DNA (blue) and morphology (red), for each of the 25 samples separately, as well as averages (last column), separated for each of the six taxa groups.

(EPS)

S3 Fig. Specimen and read abundances.

Relative abundances of (A) specimens in the traditional morphological assessment and reads in the metabarcoding data of (B) separately sequenced taxa groups combined and (C) samples pooled prior to amplification. In addition to the six groups assessed in this study, the fractions of water mites (in morphology), as well as vertebrates and unidentified MOTUs (in DNA data) have been included.

(EPS)

S4 Fig. EQR scores without water mites.

Comparison of EQR scores for the morphological data with and without water mites (ACA). No DNA was obtained from water mites due to the buffer they were stored in. Pearson correlation value provided in the panel, p < 0.001.

(EPS)

S1 File. Taxon lists.

Taxon lists for the three datasets: Morphologically identified taxa (with specimen counts), DNA-based identifications from the sorted samples, and DNA-based identifications form the pooled samples (both with read counts).

(XLSX)

Acknowledgments

We thank Aquon and Wouter Balster for supplying the samples and morphological identifications, as well as Bart Schaub of Hoogheemraadschap van Rijnland for providing us with access to the collection of WFD samples.

Data Availability

All relevant data are within the paper and its Supporting Information files. Raw sequence data is available from the NCBI Sequence Read Archive (Bioproject accession PRJNA550542).

Funding Statement

This study was part of the DNA Waterscan project, funded by the Gieskes-Strijbis Fonds (https://gieskesstrijbisfonds.nl/). The funder provided support in the form of material costs and the salaries for authors KKB, MH and RP. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Hebert PDN, Cywinska A, Ball SL, DeWaard JR. Biological identifications through DNA barcodes. Proc R Soc B Biol Sci. 2003;270: 313–21. 10.1098/rspb.2002.2218 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Taberlet P, Coissac E, Pompanon F, Brochmann C, Willerslev E. Towards next-generation biodiversity assessment using DNA metabarcoding. Mol Ecol. 2012;21: 2045–50. 10.1111/j.1365-294X.2012.05470.x [DOI] [PubMed] [Google Scholar]
  • 3.Leray M, Yang JY, Meyer CP, Mills SC, Agudelo N, Ranwez V, et al. A new versatile primer set targeting a short fragment of the mitochondrial COI region for metabarcoding metazoan diversity: application for characterizing coral reef fish gut contents. Front Zool. 2013;10: 34 10.1186/1742-9994-10-34 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Pauls SU, Alp M, Bálint M, Bernabò P, Čiampor F, Čiamporová-Zaťovičová Z, et al. Integrating molecular tools into freshwater ecology: developments and opportunities. Freshw Biol. 2014;59: 1559–1576. 10.1111/fwb.12381 [DOI] [Google Scholar]
  • 5.Pawlowski J, Kelly-Quinn M, Altermatt F, Apothéloz-Perret-Gentil L, Beja P, Boggero A, et al. The future of biotic indices in the ecogenomic era: Integrating (e)DNA metabarcoding in biological assessment of aquatic ecosystems. Science of the Total Environment. 2018. pp. 1295–1310. 10.1016/j.scitotenv.2018.05.002 [DOI] [PubMed] [Google Scholar]
  • 6.European Union. Directive 2000/60/EC of the European Parliament and of the Council of 23 October 2000 establishing a framework for Community action in the field of water policy. Off J Eur Parliam. 2000;L327: 1–82. 10.1039/ap9842100196 [DOI] [Google Scholar]
  • 7.Birk S, Bonne W, Borja A, Brucet S, Courrat A, Poikane S, et al. Three hundred ways to assess Europe’s surface waters: An almost complete overview of biological methods to implement the Water Framework Directive. Ecol Indic. 2012;18: 31–41. 10.1016/j.ecolind.2011.10.009 [DOI] [Google Scholar]
  • 8.Sweeney BW, Battle JM, Jackson JK, Dapkey T. Can DNA barcodes of stream macroinvertebrates improve descriptions of community structure and water quality? J North Am Benthol Soc. 2011;30: 195–216. 10.1899/10-016.1 [DOI] [Google Scholar]
  • 9.Haase P, Murray-Bligh J, Lohse S, Pauls S, Sundermann A, Gunn R, et al. Assessing the impact of errors in sorting and identifying macroinvertebrate samples. Hydrobiologia. 2006;566: 505–521. 10.1007/s10750-006-0075-6 [DOI] [Google Scholar]
  • 10.Stribling JB, Pavlik KL, Holdsworth SM, Leppo EW. Data quality, performance, and uncertainty in taxonomic identification for biological assessments. J North Am Benthol Soc. 2008;27: 906–919. 10.1899/07-175.1 [DOI] [Google Scholar]
  • 11.Darling JA, Mahon AR. From molecules to management: adopting DNA-based methods for monitoring biological invasions in aquatic environments. Environ Res. 2011;111: 978–88. 10.1016/j.envres.2011.02.001 [DOI] [PubMed] [Google Scholar]
  • 12.Stein ED, Martinez MC, Stiles S, Miller PE, Zakharov E V. Is DNA barcoding actually cheaper and faster than traditional morphological methods: results from a survey of freshwater bioassessment efforts in the United States? PLoS One. 2014;9: e95525 10.1371/journal.pone.0095525 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Marshall JC, Steward AL, Harch BD. Taxonomic resolution and quantification of freshwater macroinvertebrate samples from an Australian dryland river: the benefits and costs of using species abundance data. Hydrobiologia. 2006;572: 171–194. 10.1007/s10750-005-9007-0 [DOI] [Google Scholar]
  • 14.Gibson J, Shokralla S, Porter TM, King I, van Konynenburg S, Janzen DH, et al. Simultaneous assessment of the macrobiome and microbiome in a bulk sample of tropical arthropods through DNA metasystematics. Proc Natl Acad Sci. 2014;111: 8007–12. 10.1073/pnas.1406468111 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Aylagas E, Borja Á, Irigoien X, Rodríguez-Ezpeleta N. Benchmarking DNA metabarcoding for biodiversity-based monitoring and assessment. Front Mar Sci. 2016;3: 96 10.3389/fmars.2016.00096 [DOI] [Google Scholar]
  • 16.Elbrecht V, Vamos EE, Meissner K, Aroviita J, Leese F. Assessing strengths and weaknesses of DNA metabarcoding-based macroinvertebrate identification for routine stream monitoring. Methods Ecol Evol. 2017;8: 1265–1275. 10.1111/2041-210X.12789 [DOI] [Google Scholar]
  • 17.Aylagas E, Borja Á, Muxika I, Rodríguez-Ezpeleta N. Adapting metabarcoding-based benthic biomonitoring into routine marine ecological status assessment networks. Ecol Indic. 2018;95: 194–202. 10.1016/j.ecolind.2018.07.044 [DOI] [Google Scholar]
  • 18.Bush A, Compson Z, Monk W, Porter T, Steeves R, Emilson E, et al. Studying ecosystems with DNA metabarcoding: lessons from aquatic biomonitoring. bioRxiv. 2019; 578591 10.1101/578591 [DOI] [Google Scholar]
  • 19.Hajibabaei M, Shokralla S, Zhou X, Singer GAC, Baird DJ. Environmental barcoding: a next-generation sequencing approach for biomonitoring applications using river benthos. PLoS One. 2011;6: e17497 10.1371/journal.pone.0017497 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Carew ME, Pettigrove VJ, Metzeling L, Hoffmann AA. Environmental monitoring using next generation sequencing: rapid identification of macroinvertebrate bioindicator species. Front Zool. 2013;10: 45 10.1186/1742-9994-10-45 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Bista I, Carvalho GR, Tang M, Walsh K, Zhou X, Hajibabaei M, et al. Performance of amplicon and shotgun sequencing for accurate biomass estimation in invertebrate community samples. Molecular Ecology Resources. 21 May 2018. 10.1111/1755-0998.12888 [DOI] [PubMed] [Google Scholar]
  • 22.Elbrecht V, Leese F. Validation and Development of COI Metabarcoding Primers for Freshwater Macroinvertebrate Bioassessment. Front Environ Sci. 2017;5: 11 10.3389/fenvs.2017.00011 [DOI] [Google Scholar]
  • 23.Elbrecht V, Peinert B, Leese F. Sorting things out: Assessing effects of unequal specimen biomass on DNA metabarcoding. Ecol Evol. 2017;7: 6918–6926. 10.1002/ece3.3192 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Lobo J, Shokralla S, Costa MH, Hajibabaei M, Costa FO. DNA metabarcoding for high-throughput monitoring of estuarine macrobenthic communities. Sci Rep. 2017;7: 15618 10.1038/s41598-017-15823-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Gibson JF, Shokralla S, Curry C, Baird DJ, Monk WA, King I, et al. Large-scale biomonitoring of remote and threatened ecosystems via high-throughput sequencing. PLoS One. 2015;10: e0138432 10.1371/journal.pone.0138432 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Pawluczyk M, Weiss J, Links MG, Egaña Aranguren M, Wilkinson MD, Egea-Cortines M, et al. Quantitative evaluation of bias in PCR amplification and next-generation sequencing derived from metabarcoding samples. Anal Bioanal Chem. 2015;407: 1841–1848. 10.1007/s00216-014-8435-y [DOI] [PubMed] [Google Scholar]
  • 27.Creedy TJ, Ng WS, Vogler AP. Toward accurate species-level metabarcoding of arthropod communities from the tropical forest canopy. Ecol Evol. 2019;9: 3105–3116. 10.1002/ece3.4839 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Elbrecht V, Leese F. Can DNA-based ecosystem assessments quantify species abundance? Testing primer bias and biomass—sequence relationships with an innovative metabarcoding protocol. PLoS One. 2015;10: e0130324 10.1371/journal.pone.0130324 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Beentjes KK, Speksnijder AGCL, Schilthuizen M, Schaub BEM, van der Hoorn BB. The influence of macroinvertebrate abundance on the assessment of freshwater quality in The Netherlands. Metabarcoding and Metagenomics. 2018;2: e26744 10.3897/mbmg.2.26744 [DOI] [Google Scholar]
  • 30.Bijkerk R, editor. Handboek Hydrobiologie. Biologisch onderzoek voor de ecologische beoordeling van Nederlandse zoete en brakke oppervlaktewateren. Rapport 2014–02 [Internet]. 2014. Available: https://www.stowa.nl/publicaties/handboek-hydrobiologie [Google Scholar]
  • 31.Afgan E, Baker D, Batut B, Van Den Beek M, Bouvier D, Ech M, et al. The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update. Nucleic Acids Res. 2018;46: W537–W544. 10.1093/nar/gky379 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Magoč T, Salzberg SL. FLASH: Fast length adjustment of short reads to improve genome assemblies. Bioinformatics. 2011;27: 2957–2963. 10.1093/bioinformatics/btr507 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet.journal. 2011;17: 10 10.14806/ej.17.1.200 [DOI] [Google Scholar]
  • 34.Schmieder R, Edwards R. Quality control and preprocessing of metagenomic datasets. Bioinformatics. 2011;27: 863–864. 10.1093/bioinformatics/btr026 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Elbrecht V, Hebert PDN, Steinke D. Slippage of degenerate primers can cause variation in amplicon length. Sci Rep. 2018;8: 10999 10.1038/s41598-018-29364-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Rognes T, Flouri T, Nichols B, Quince C, Mahé F. VSEARCH: a versatile open source tool for metagenomics. PeerJ. 2016;4: e2584 10.7717/peerj.2584 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Alberdi A, Aizpurua O, Gilbert MTP, Bohmann K. Scrutinizing key steps for reliable metabarcoding of environmental samples. Mahon A, editor. Methods Ecol Evol. 2018;9: 134–147. 10.1111/2041-210X.12849 [DOI] [Google Scholar]
  • 38.Beentjes KK, Speksnijder AGCL, van der Hoorn BB, van Tol J. DNA barcoding program at Naturalis Biodiversity Center, the Netherlands. Genome. 2015;58: 193 10.1139/gen-2015-0087 [DOI] [Google Scholar]
  • 39.Ratnasingham S, Hebert PDN. BOLD: the Barcode of Life Data System (www.barcodinglife.org). Mol Ecol Notes. 2007;7: 355–364. 10.1111/j.1471-8286.2007.01678.x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Chamberlain S. bold: Interface to Bold Systems ‘API’. R package version 0.5.0. [Internet]. 2017. Available: https://cran.r-project.org/package=bold
  • 41.RStudio. RStudio: Integrated development environment for R (Version 0.99.902) [Internet]. Boston, MA; 2015. Available: http://www.rstudio.com/
  • 42.Benson DA, Karsch-Mizrachi I, Lipman DJ, Ostell J, Wheeler DL. GenBank. Nucleic Acids Res. 2005;33: D34–8. 10.1093/nar/gki063 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Huson DH, Auch AF, Qi J, Schuster SC. MEGAN analysis of metagenomic data. Genome Res. 2007;17: 377–86. 10.1101/gr.5969107 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Pot R. QBWat, programma voor beoordeling van de biologische waterkwaliteit volgens de Nederlandse maatlatten voor de Kaderrichtlijn Water, versie 5.33. [Internet]. 2015. Available: http://www.roelfpot.nl/qbwat [Google Scholar]
  • 45.Haase P, Pauls SU, Schindehütte K, Sundermann A. First audit of macroinvertebrate samples from an EU Water Framework Directive monitoring program: human error greatly lowers precision of assessment results. J North Am Benthol Soc. 2010;29: 1279–1291. [Google Scholar]
  • 46.Jones FC. Taxonomic sufficiency: The influence of taxonomic resolution on freshwater bioassessments using benthic macroinvertebrates. Environ Rev. 2008;16: 45–69. 10.1139/A07-010 [DOI] [Google Scholar]
  • 47.Huemer P, Mutanen M, Sefc KM, Hebert PDN. Testing DNA barcode performance in 1000 species of European Lepidoptera: Large geographic distances have small genetic impacts. PLoS One. 2014;9: e115774 10.1371/journal.pone.0115774 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Weigand H, Beermann AJ, Čiampor F, Costa FO, Csabai Z, Duarte S, et al. DNA barcode reference libraries for the monitoring of aquatic biota in Europe: Gap-analysis and recommendations for future work. bioRxiv. 2019; 576553 10.1101/576553 [DOI] [PubMed] [Google Scholar]
  • 49.Wangensteen OS, Palacín C, Guardiola M, Turon X. DNA metabarcoding of littoral hard-bottom communities: high diversity and database gaps revealed by two molecular markers. PeerJ. 2018;6: e4705 10.7717/peerj.4705 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Kvist S. Barcoding in the dark?: A critical view of the sufficiency of zoological DNA barcoding databases and a plea for broader integration of taxonomic knowledge. Mol Phylogenet Evol. 2013;69: 39–45. 10.1016/j.ympev.2013.05.012 [DOI] [PubMed] [Google Scholar]
  • 51.Elbrecht V, Vamos EE, Steinke D, Leese F. Estimating intraspecific genetic diversity from community DNA metabarcoding data. PeerJ. 2018;6: e4644 10.7717/peerj.4644 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Brown EA, Chain FJJ, Crease TJ, MacIsaac HJ, Cristescu ME. Divergence thresholds and divergent biodiversity estimates: can metabarcoding reliably describe zooplankton communities? Ecol Evol. 2015;5: 2234–2251. 10.1002/ece3.1485 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Song H, Buhay JE, Whiting MF, Crandall KA. Many species in one: DNA barcoding overestimates the number of species when nuclear mitochondrial pseudogenes are coamplified. Proc Natl Acad Sci. 2008;105: 13486–13491. 10.1073/pnas.0803076105 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Hebert PDN, Ratnasingham S, Zakharov E V., Telfer AC, Levesque-Beaudin V, Milton MA, et al. Counting animal species with DNA barcodes: Canadian insects. Philos Trans R Soc B Biol Sci. 2016;371: 20150333 10.1098/rstb.2015.0333 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Macher JN, Salis RK, Blakemore KS, Tollrian R, Matthaei CD, Leese F. Multiple-stressor effects on stream invertebrates: DNA barcoding reveals contrasting responses of cryptic mayfly species. Ecol Indic. 2016;61: 159–169. 10.1016/j.ecolind.2015.08.024 [DOI] [Google Scholar]
  • 56.Beermann AJ, Zizka VMA, Elbrecht V, Baranov V, Leese F. DNA metabarcoding reveals the complex and hidden responses of chironomids to multiple stressors. Environ Sci Eur. 2018;30: 26 10.1186/s12302-018-0157-x [DOI] [Google Scholar]
  • 57.Curry CJ, Gibson JF, Shokralla S, Hajibabaei M, Baird DJ. Identifying North American freshwater invertebrates using DNA barcodes: are existing COI sequence libraries fit for purpose? Freshw Sci. 2018;37: 178–189. 10.1086/696613 [DOI] [Google Scholar]
  • 58.Porter TM, Hajibabaei M. Automated high throughput animal CO1 metabarcode classification. Sci Rep. 2018;8: 219675 10.1038/s41598-018-22505-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Van der Molen D, Pot R, Evers C, Van Herpen F, Van Nieuwerburgh L. Referenties en maatlatten voor natuurlijke watertypen voor de KRW 2015–2021. STOWA rapportnummer 2012–31. 2016. [Google Scholar]

Decision Letter 0

Sebastian D Fugmann

29 Jul 2019

PONE-D-19-18500

Increased performance of DNA metabarcoding of macroinvertebrates by taxonomic sorting

PLOS ONE

Dear Drs. Beentjes,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

1) Both reviewers have concerns and comments on the approach to handle the differences in the sequence read depths between different samples.   

2) Reviewer #2 raises an important point in that it seems a single primer pair was used for the amplification reactions that may work very inefficiently for some of the phyla being investigated.  The authors should elobarate on the potential shortcomings of this approach, and (if feasible) evaluate the differences of using a different primer pair on a test sample.  

3) All minor concerns of both reviewers should be addressed by making appropriate changes to the text of the manuscript.  

We would appreciate receiving your revised manuscript by Sep 12 2019 11:59PM. When you are ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter.

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). This letter should be uploaded as separate file and labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. This file should be uploaded as separate file and labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. This file should be uploaded as separate file and labeled 'Manuscript'.

Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

We look forward to receiving your revised manuscript.

Kind regards,

Sebastian D. Fugmann, Ph.D.

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

http://www.journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and http://www.journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. We note that you are reporting an analysis of a microarray, next-generation sequencing, or deep sequencing data set. PLOS requires that authors comply with field-specific standards for preparation, recording, and deposition of data in repositories appropriate to their field. Please upload these data to a stable, public repository (such as ArrayExpress, Gene Expression Omnibus (GEO), DNA Data Bank of Japan (DDBJ), NCBI GenBank, NCBI Sequence Read Archive, or EMBL Nucleotide Sequence Database (ENA)). In your revised cover letter, please provide the relevant accession numbers that may be used to access these data. For a full list of recommended repositories, see http://journals.plos.org/plosone/s/data-availability#loc-omics or http://journals.plos.org/plosone/s/data-availability#loc-sequencing

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

Reviewer #2: Partly

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The authors present a methodological analysis of DNA metabarcoding, using a set of 25 freshwater arthropod community samples, with a particular focus on taxonomic composition of samples. They present a comprehensive pipeline by which they identify their OTUs to species level, using this data to compile molecular species lists and community indices. They have particular power in this analysis as these samples have been previously sorted and morphologically identified, against which data the authors validate their species lists and diversity/composition indices. Their main methodology is the separate DNA metabarcoding of separate taxonomic groups within each sample, compared against DNA metabarcoding of all of these groups combined within each sample. Their metabarcoding results suggest that in general, morphological and molecular species lists have considerable overlap, but metabarcoding misses a substantial proportion of species. Their key conclusion is that this observation is exacerbated in pooled samples containing all taxa, even when rarefying to ensure standardisation of effective sequencing coverage, and thus that metabarcoding taxa separately is important for full community recovery.

This research is largely well presented, although I feel the depth of analysis is somewhat shallow in places. In particular, the authors do not analyse or discuss the possible effect of sequencing depth on their findings, both in terms of the extent to which they can compare between samples and the extent to which their findings are able to be generalised. While they present the total numbers of reads recovered from sequencing, they only passingly report some of the post-filtering and bioinformatics per-sample reads and ranges. Rarefaction is only passingly discussed (in the results section), and the reader's understanding of sample treatment is confounded by unclear description of the way that PCR replicates were combined. While it is clear, and largely justified, that the authors rarefied the read counts of the pooled samples to be equal to 6 times those of the separate-taxon samples, it is not stated what these rarefaction targets were, nor unequivocally stated whether all separate-taxon samples were rarefied to the same target. I am sure the authors agree that this is a crucial step if comparing lists of recovered species or summary indices. I suggest that the authors clarify how exactly technical replicates are treated as well, as it is unclear whether data from these were combined pre- or post- rarefaction. The authors' description of the 'additive' combination of PCR replicates implies that only species lists were combined; if so, this is somewhat concerning given they report several sequencing failures - I don't feel that species lists from samples with two successful replicates would necessarily be comparable to samples with only one. A more valid approach would see the data from multiple replicates being combined before each independent sample is rarefied to the same rarefaction target (or discarded for having lower than the target number of reads). Finally on this, the authors report that some of the source 25 samples did not have all six taxa: should the pooled samples from these not be rarefied to the appropriate lower multiple to make this more comparable?

More widely, beyond simply reporting per-sample read numbers and rarefaction targets, the authors could improve the extent to which their results can be generalised by examining the effect of different read numbers on their findings - assuming they have sufficient coverage in the first place. This could be achieved by utilising a range of rarefaction targets to simulate reduced sequencing coverage, and examining how the values presented in figure 2 respond. This could all be simulated based on existing data. The authors may be interested in a recent paper (doi: 10.1111/1755-0998.13008) that suggests that protocol variation can have substantial effects on community recovery if coverage does not reach an asymptote (although not specifically regarding taxon composition)

Otherwise, I have no other major comments on this nice paper, but there are various presentational and detail aspects which hinder the reader's easy understanding of the subject matter and the authors' comprehensive discussion of their findings:

Abstract:

The authors define WFD (line 20) but not EQR (line 30). "Such as" on line 22 seems misplaced. Otherwise clear and succinct.

Introduction:

The introduction is short, but this is fine, it provides a clear background and well stated aims without waffle. I would suggest that line 46: "Until this day" be changed to "To date" as a more standard English phrase.

Methods:

>Sample selection and processing:

Line 77: Citation needed for "identified according to national WFD monitoring standards" - what are these methods/standards?

Line 79: Some explanation for the EQR score range and quality classes would improve reader understanding here - at the very least a citation for the reader to find more, but an in-text explanation preferred.

It is unclear to the non-Dutch reader what exactly Aquon is - in line 76 I assumed it was a specific location, and was confused later that a location was performing identification. It took context clues from later (and a google search) to understand that it was a freshwater ecological surveying company. I suggest reorganising this paragraph to make this clearer: for example, "Freshwater arthropod samples were collected using [method] in [location] by [ecological survey company] Aquon on [dates] according to [standardised WFD monitoring guidelines]. Aquon taxonomists sorted and identified these samples into [groups]. For this study, we selected 25 samples based on ratio ... etc"

It's not crucial, but some useful context could be supplied if the authors report the total number of samples from which the 25 were selected, and more importantly the overall EQR and quality class ranges from which their sample EQR and quality class ranges were decided. It's currently unclear to what extent their samples provide a wide sample.

On line 90, the authors report 158 tubes. 25 samples of 6 sample groups used gives a total of 150 - where do the extra 8 come from?

>DNA extraction and amplification:

The authors do not report the number of species, or more crucially the numbers of individuals or the biomass ranges of species within tubes, nor the extent to which this varies between samples of the same taxon or different taxa. Considerable variation in these values may introduce biases to molecular recovery rates and diminish the validity of their findings. The authors mention the effect of biomass on OTU recovery in passing in their introduction, and again in their discussion, but do not report any attempts to mitigate these effects.

>Bioinformatics

This section could be improved by more detail to improve reproducability of the methods. For each step of the bioinformatics process, the authors should fully report the parameters used, or that default parameters were used. Furthermore, there should be some justification for software and parameter choices made. No denoising was performed, and reads shorter than the target length were permitted but not reads longer. Length variants were permitted that allowed frameshift mutant sequences (length variants not a multiple of 3 bases different from the target length) - these are likely to be sequencing errors rather than true sequences. It is not clear at what point sequences were converted from fasta to fastq? It is also unclear whether OTU clustering was performed on a by-sample basis or over the entire dataset.

Line 161: "Anticipating on" should just be "Anticipating"

>Comparison morphology versus DNA

The title of this section is not good English. It should be something along the lines of "Comparison of morphology vs molecular identification"

Line 181: what is meant by "accumulated" in the context of shared taxonomic identity. Does this just mean merged? It would be illuminating if the authors provided some analysis or commentary on what the recovery of multiple OTUs per species says about the OTU clustering methods, particularly whether this varied between taxa

Line 182: Again, what is meant by "accumulated" in this context?

There could be a little more discussion of ecological quality ratios: what are these actually measuring? Was morphological data also converted to presence-absence? Studies into methodological considerations in metabarcoding are relevant to ecologists from a wide range of backgrounds, not just those working in freshwater or marine monitoring. Some of this is discussed in more detail in the discussion, but until then a reader unfamiliar with EQRs is left in the dark.

Results

As supplied, having figure captions in the text but the figures at the end of the document is a bit of a pain to read. It would be easier to have the figures in the text, or at least the captions with the figures at the end.

The results are by large comprehensively and clearly reported. There is a substantial amount of detail here that is perhaps unnecessary, and the reader can easily lose track of the main finding, the comparison between the combined taxon-specific samples and the pooled samples in terms of recovered community composition. In particular the reportage of taxonomic composition is a bit dry and repetitive, and figure one is not particularly surprising or informative. Perhaps the authors could find a more engaging way to present taxonomic composition and move figure one to the supplement - entirely their choice though.

Figure two as supplied is portrait and difficult to read. Furthermore, it appears to be missing the lower standard error bars? The response axis should read richness rather than diversity, to correspond with the language in the text and be unequivocal.

Line 267 is missing a closing parathesis

The rate of uncertainty in taxonomic identification at both morphological and molecular level is unsurprising, and the authors should be applauded for undertaking considerable work in designing a pipeline to achieve the best possible molecular identifications. However, this does beg the question: to what extent might the findings be different if the authors focused only on species/OTUs that can be definitively identified to species level by both morphological and molecular methods. I.e. rather than

What happens if drop out comparisons of presence/absence of any uncertain taxon, and focus solely on clearly identified species at both DNA and morphological level? I.e. the dropping of any entry in category 2-4 from the lines 191-200. At very least, it would be useful if the authors could report the number of entries falling into each category to reassure the reader that uncertainty levels were low

I would suggest bringing Supplemental Figure 4 into the main text, this quite strikingly shows the contribution of the annelids to the variation in EQR scores between Fig 3B and 3C-H. The authors may wish to consider simplifying and perhaps even merging Figures 3 and 4 by removal of the coloured blocks and simplification of axes etc.

Discussion

The discussion is detailed, thoughtful and well referenced, with the conclusions well supported by the results. It could perhaps be reduced in length somewhat.

In line 364, the authors report that size-based sorting affects recovery rates, yet they do not justify why they did not do this.

The authors do not compare their findings to recent work also looking at the effects of taxonomic composition on OTU recovery in metabarcoding, which did not use morphological data as a comparison but instead used pools of increasing taxonomic complexity, finding somewhat similar results (doi: 10.1002/ece3.4839).

I congratulate the authors on an interesting paper, and am sure it will be an important contribution to metabarcoding and molecular ecology with the above comments addressed.

Reviewer #2: The authors present an interesting paper evaluating the performance of metabarcoding versus morphological assessment of aquatic invertebrate samples. Although the general premise of the paper is interesting, I have a few concerns on the methodological and technical aspects of the paper. My first concern is the use of only a single primer set (BF1/BR2) to assess a community not only containing insects, but crustaceans, annelids, and molluscs. This primer is not well suited for organisms from different phylums and I would argue that is a poor primer choice for non-insects. The poor performance of this primer pair on the pooled sample is unsurprising given the diversity of organism (or templates) present in the samples. My other major concern is that the authors try to account for uneven read depth by taking one six of the read depths to create pseudoreplicates of pooled samples from sorted samples (at least that is my understanding from the text). The simplest solution and most comparable solution would be to subsamples reads to have an equivalent read depth for all samples. This would allow a more meaningful comparison of recovery amongst all the samples. My third major concern is that text can be a bit confusing on the discrepancies between morphological and molecular identifications. I think it would be best to clearly state the errors in either data set (preferably morphology first) and then elaborate on the where mistakes are made in the other approach. I.e. I want to see some sort of assess of the quality of morphological ids and the reference library. This will make it easier to then sort out differences in recovery between the two approaches. Otherwise, I did appreciate the authors use of the top 100 hits to assign taxonomy to MOTUs. This is a great approach that will hopefully become more common in the field and is a strength of the paper. I have a few other comments below:

Line 56: could probably add a few more references for mock communities and maybe argue that there is a greater need to use actual samples because they tend to be a bit naïve. I.e. the diversity present in your samples is a testament to the limitations in diversity often present in mock samples.

Line 100 – 102: This sentence is quite awkward!

Lines 92 – 102: why was no estimate of DNA concentration done prior to pooling? Depending on extraction efficiency this could have introduced another massive bias. It would have also allowed you to assess the quality of samples following extraction because what if one community was more degraded then another?

Lines 116 -130: This cannot be changed after the fact but why so many PCR cycles? Also the use of AIC was great in addition to extraction blanks!

Line 142-144: Although I am good with using an additive approach, it could be argued that some of these are stochastic in nature and this approach can over count OTUs. I think this approach can be justified and maybe a sentence or two would be useful.

Line 146-159: Why not query and remove contaminants as the first step? And why not screen and remove them from your reference library first?

Figure 1: What exactly is the difference between DNA taxon richness versus MOTU richness. I think there is a mistake in the caption that should be reviewed. I am assuming one is supposed to be morphological richness? I would just review terminology through the material and methods as well as the results.

Figure 2: have “observed diversity observed” maybe change the second observed to present?

Line 298: So the results are mutually exclusive, which on was correct or are they both wrong?

Line 351-353: what were the quality of hits?

Lines 435-439: I would argue that this is due to poor primer choice for this group.

Conclusions: I agree that things are not always clear cut but can you make a more concrete statement. Does metabarcoding provide reliable data for EQR scores or do both approaches have fundamental flaws …

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files to be viewed.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2019 Dec 16;14(12):e0226527. doi: 10.1371/journal.pone.0226527.r002

Author response to Decision Letter 0


12 Sep 2019

We would like to thank the editor and reviewers for considering our submission, and for their constructive feedback on the manuscript. We have incorporated some changed based on their suggestions in the revised manuscript.

One of the main issues was the lack of clarity regarding the rarefaction of data we performed to address the uneven sequencing depths between pooled and sorted samples. We have added a paragraph to the methods section detailing our approach.

While we can understand the concerns about the primer choice by reviewer #2, we wanted to perform amplifications with a single set of primers, mostly since we are trying to work towards a standard way of processing monitoring samples that should not be more complicated and time consuming than the current morphological assessments. By our knowledge, the primers selected perform well on a wide variety of macroinvertebrates, both in theory and in practice. We have assessed several primer sets prior to this project, and these primers showed most potential across the different taxonomic groups. We have included a statement about the primers in the discussion.

Based on suggestion by reviewer #1, we have combined the data from figure S4 into figure 3. This was certainly a useful suggestion, as both figures displayed similar information. Figure S4 has been changed to only detail the comparison of EQR scores with and without water mites, as that group was not included in the DNA analysis. We have also changed figure 2, to also provide the lower error bars, and we have removed the values of the fractions listed in the figure.

A few minor other edits were made to the manuscript and the figures and their legends to address the other concerns of the reviewers, and have provided a response to all comments in the "Response to Reviewers" document.

Attachment

Submitted filename: Response to Reviewers.docx

Decision Letter 1

Sebastian D Fugmann

28 Oct 2019

PONE-D-19-18500R1

Increased performance of DNA metabarcoding of macroinvertebrates by taxonomic sorting

PLOS ONE

Dear Drs. Beentjes,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

1)  The Reviewer raises only minor points that should be addressed by making appropriate changes to the text of this manuscript.  

We would appreciate receiving your revised manuscript by Dec 12 2019 11:59PM. When you are ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter.

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). This letter should be uploaded as separate file and labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. This file should be uploaded as separate file and labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. This file should be uploaded as separate file and labeled 'Manuscript'.

Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

We look forward to receiving your revised manuscript.

Kind regards,

Sebastian D. Fugmann, Ph.D.

Academic Editor

PLOS ONE

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: (No Response)

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Thank you to the authors for providing a detailed response to my comments and supplying a revision of their manuscript. I am satisfied that the vast majority of my concerns have been addressed.

I have only relatively minor comments on this revision that is otherwise suitable for publication. Note that my line numbers refer to the tracked changes version of the manuscript.

Firstly, there is inconsistency in the reporting of the fragment length. On line 127 you state the length of the fragment amplified is 316bp, in agreement with Elbrecht and Leese and supplemental figure S1. However, in the response to reviewers you state the length is 313 and that you allowed a +/- 3-base variation in length around this, giving the 310-316bp stated on line 160. Which is it? It seems as if you may have allowed sequences up to 6bp shorter than the target, but no longer. In the slippage paper cited, the forward primer was observed to reduce the length by 1-2 bases, and they did not study the reverse primer. While allowing for primer slippage by using a loose length threshold may retain true biological sequences, it may simultaneously allow PCR or sequencing errors through to create erroneous MOTUs that may inflate your MOTU richness. In my opinion this concern should at least be noted and discussed in the paper, even if the authors choose to retain this less conservative length variation threshold.

Otherwise, I suggest the authors review the paper for language once again, I noticed relatively frequent typos or slight English errors. For example on line 30 "ration" should be "ratio" and line 60 "organisms groups" is not correct English. Similar minor errors are found throughout the paper. This doesn't detract from the understanding of the paper, simply an aesthetic concern.

On line 116, the authors should report the exact speed or frequency of the homogeniser used, rather than stating the maximum was used, as different devices may have different maxima. According to this https://www.ika.com/en/Products-Lab-Eq/Dispersers-Homogenizer-csp-177/ULTRA-TURRAX-Tube-Drive-Technical-Data-cptd-3646000/ it's 6000rpm

I should note that I broadly agree with reviewer 2's comments on primer choice, something I clearly didn't think about enough on my first review. I believe the current methodology provides sufficiently supported and interesting results for publication, however there is certainly some scope for discussion of primer choice. After all, the authors advocate for separate amplification of taxon pools, negating the need for a single general multi-taxon-appropriate primer-pair (or even locus). In the response to reviewers, the authors provide good reasoning for why the primers are not to blame for poor amplification of molluscs, however there seems no reason why such a discussion cannot be also included in the text of the paper itself, to acknowledge that primer choice may have been an source of error in the study, discuss whether this is the case and propose future methodological directions to ameliorate this issue.

On the other hand, I agree with the authors that their pooling on volume instead of concentration was the appropriate route to take for accurate reflection of a multi-taxon extraction.

Otherwise, the authors have improved the clarity of the text, especially on the issue of rarefaction, and overall the paper reads well, the work is well-justified and results, discussion and conclusion well-supported.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files to be viewed.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2019 Dec 16;14(12):e0226527. doi: 10.1371/journal.pone.0226527.r004

Author response to Decision Letter 1


27 Nov 2019

Response to reviewers

We would like to thank the editor and reviewers for considering our re-submission. We have adjusted the manuscript based on the remaining points raised by the reviewer.

We ran some of the analysis again based on an inconsistency noticed by the reviewer in the length used to trim the sequence data. To be certain we went back to the data and noticed it was not a typo in the manuscript, but we did indeed use incorrect values for trimming. After comparing the new results to the earlier ones, we found only small differences in the MOTU clustering, leading to a few additional MOTUs and changes in the number of reads for a selection of MOTUs. However, we found no additional taxa, other than some MOTUs identified at genera level that were already found at species level, and rare cases vice versa. Since we used a taxonomic aggregation and removed redundant taxa (as described in the manuscript) prior to comparison with morphology, we observed no changes in the analyses beyond the creation of the MOTU table. Please note that we have adjusted the numbers in the manuscript to reflect the new MOTU table with the corrected sequence filtering, and also updated Figure 1 to show the new MOTU richness, as well as supplementary File S1.

We have provided a response to the other comments by the reviewer below.

Review Comments to the Author

Reviewer #1: Thank you to the authors for providing a detailed response to my comments and supplying a revision of their manuscript. I am satisfied that the vast majority of my concerns have been addressed. I have only relatively minor comments on this revision that is otherwise suitable for publication. Note that my line numbers refer to the tracked changes version of the manuscript.

Firstly, there is inconsistency in the reporting of the fragment length. On line 127 you state the length of the fragment amplified is 316bp, in agreement with Elbrecht and Leese and supplemental figure S1. However, in the response to reviewers you state the length is 313 and that you allowed a +/- 3-base variation in length around this, giving the 310-316bp stated on line 160. Which is it? It seems as if you may have allowed sequences up to 6bp shorter than the target, but no longer. In the slippage paper cited, the forward primer was observed to reduce the length by 1-2 bases, and they did not study the reverse primer. While allowing for primer slippage by using a loose length threshold may retain true biological sequences, it may simultaneously allow PCR or sequencing errors through to create erroneous MOTUs that may inflate your MOTU richness. In my opinion this concern should at least be noted and discussed in the paper, even if the authors choose to retain this less conservative length variation threshold.

Our response: We checked the raw data and we indeed made a mistake in the analysis rather than in the manuscript. We have ran the analysis again as discussed above.

Otherwise, I suggest the authors review the paper for language once again, I noticed relatively frequent typos or slight English errors. For example on line 30 "ration" should be "ratio" and line 60 "organisms groups" is not correct English. Similar minor errors are found throughout the paper. This doesn't detract from the understanding of the paper, simply an aesthetic concern.

Our response: Several mistakes throughout the paper were corrected and sentences were rewritten for clarity.

On line 116, the authors should report the exact speed or frequency of the homogeniser used, rather than stating the maximum was used, as different devices may have different maxima. According to this https://www.ika.com/en/Products-Lab-Eq/Dispersers-Homogenizer-csp-177/ULTRA-TURRAX-Tube-Drive-Technical-Data-cptd-3646000/ it's 6000rpm

Our response: We have added the speed used (indeed 6000 rpm) to the methods section.

I should note that I broadly agree with reviewer 2's comments on primer choice, something I clearly didn't think about enough on my first review. I believe the current methodology provides sufficiently supported and interesting results for publication, however there is certainly some scope for discussion of primer choice. After all, the authors advocate for separate amplification of taxon pools, negating the need for a single general multi-taxon-appropriate primer-pair (or even locus). In the response to reviewers, the authors provide good reasoning for why the primers are not to blame for poor amplification of molluscs, however there seems no reason why such a discussion cannot be also included in the text of the paper itself, to acknowledge that primer choice may have been an source of error in the study, discuss whether this is the case and propose future methodological directions to ameliorate this issue.

Our response: We have added several parts of the discussion in the rebuttal letter on this topic to the main manuscript.

On the other hand, I agree with the authors that their pooling on volume instead of concentration was the appropriate route to take for accurate reflection of a multi-taxon extraction.

Otherwise, the authors have improved the clarity of the text, especially on the issue of rarefaction, and overall the paper reads well, the work is well-justified and results, discussion and conclusion well-supported.

Attachment

Submitted filename: Response to Reviewers V2.docx

Decision Letter 2

Sebastian D Fugmann

3 Dec 2019

Increased performance of DNA metabarcoding of macroinvertebrates by taxonomic sorting

PONE-D-19-18500R2

Dear Dr. Beentjes,

We are pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it complies with all outstanding technical requirements.

Within one week, you will receive an e-mail containing information on the amendments required prior to publication. When all required modifications have been addressed, you will receive a formal acceptance letter and your manuscript will proceed to our production department and be scheduled for publication.

Shortly after the formal acceptance letter is sent, an invoice for payment will follow. To ensure an efficient production and billing process, please log into Editorial Manager at https://www.editorialmanager.com/pone/, click the "Update My Information" link at the top of the page, and update your user information. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, you must inform our press team as soon as possible and no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

With kind regards,

Sebastian D. Fugmann, Ph.D.

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Acceptance letter

Sebastian D Fugmann

9 Dec 2019

PONE-D-19-18500R2

Increased performance of DNA metabarcoding of macroinvertebrates by taxonomic sorting

Dear Dr. Beentjes:

I am pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please notify them about your upcoming paper at this point, to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

For any other questions or concerns, please email plosone@plos.org.

Thank you for submitting your work to PLOS ONE.

With kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Sebastian D. Fugmann

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. Artificial internal control.

    The artificial control (AIC) used to measure cross-contamination. The sequence is based on the COI barcode region of a Reeve’s muntjac (Muntiacus reevesi) with several primer sets built into the sequence (forward strand shown). Binding sites for COI primers are shown, BF1 and BR2 used in this study are highlighted in red.

    (EPS)

    S2 Fig. Overlap between morphology and DNA per sample.

    The overlap between morphology and DNA (in yellow), as well as the fractions of taxa only detected with DNA (blue) and morphology (red), for each of the 25 samples separately, as well as averages (last column), separated for each of the six taxa groups.

    (EPS)

    S3 Fig. Specimen and read abundances.

    Relative abundances of (A) specimens in the traditional morphological assessment and reads in the metabarcoding data of (B) separately sequenced taxa groups combined and (C) samples pooled prior to amplification. In addition to the six groups assessed in this study, the fractions of water mites (in morphology), as well as vertebrates and unidentified MOTUs (in DNA data) have been included.

    (EPS)

    S4 Fig. EQR scores without water mites.

    Comparison of EQR scores for the morphological data with and without water mites (ACA). No DNA was obtained from water mites due to the buffer they were stored in. Pearson correlation value provided in the panel, p < 0.001.

    (EPS)

    S1 File. Taxon lists.

    Taxon lists for the three datasets: Morphologically identified taxa (with specimen counts), DNA-based identifications from the sorted samples, and DNA-based identifications form the pooled samples (both with read counts).

    (XLSX)

    Attachment

    Submitted filename: Response to Reviewers.docx

    Attachment

    Submitted filename: Response to Reviewers V2.docx

    Data Availability Statement

    All relevant data are within the paper and its Supporting Information files. Raw sequence data is available from the NCBI Sequence Read Archive (Bioproject accession PRJNA550542).


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES