. 2021 Jan 11;11(3):1392–1398. doi: 10.1002/ece3.7146

TABLE 2.

Percent similarity to the respective Sanger sequence for the datasets B1 to BC7 from Maestri et al. (2019)

SampleID	BC1 ^a	BC2	BC3	BC4 ^a	BC5	BC6 ^a	BC7 ^a
NGSpeciesID
ONT Medaka	100% (0/651)	100% (0/658)	99.9% (1/649)	100% (0/606)	100% (0/658)	99.8% (1/576)	100% (0/536)
ONT Racon	99.5% (3/651)	99.5% (3/658)	98.9% (7/649)	99.2% (5/606)	99.8% (1/658)	99.7% (2/576)	99.4% (3/536)
ONTrack
ONT	99.9% (1/651)	100% (1 ^b /658)	100% (2 ^b /649)	100% (0/606)	100% (2 ^b /658)	99.8% (1/576)	100% (0/536)
Mixed
NGSpeciesID
ONT Medaka	100% (0/651)	100% (0/658)	99.7% (2/649)	99.3% (4/606)	100% (0/658)	99.8% (1/576)	99.6% (2/536)

For the mixed samples, 300 reads of each of the seven DNA barcodes were combined into a single file, from which NGSpeciesID generated multiple consensus sequences. NGSpeciesID was run using Medaka polishing.

^{^a}

Here, the Sanger sequence from Maestri et al. (2019) was shorter than the expected fragment length and all the consensus sequences. In these cases, we only calculated the percentage similarity for the region covered by the respective Sanger sequence.

^{^b}

The consensus sequences from Maestri et al. (2019) are missing one or two bases at the start, which could be due to a consensus calling error, or deletion of one additional base during the primer removal. For the percentage accuracy, we assumed them to be incorrectly trimmed. The highest similarity scores are highlighted in bold. The numbers in the brackets provide the amount of mismatches to the Sanger reference and the length of the reference sequence.