Skip to main content
. 2021 Jan 11;11(3):1392–1398. doi: 10.1002/ece3.7146

TABLE 2.

Percent similarity to the respective Sanger sequence for the datasets B1 to BC7 from Maestri et al. (2019)

SampleID BC1 a BC2 BC3 BC4 a BC5 BC6 a BC7 a
NGSpeciesID
ONT Medaka 100% (0/651) 100% (0/658) 99.9% (1/649) 100% (0/606) 100% (0/658) 99.8% (1/576) 100% (0/536)
ONT Racon 99.5% (3/651) 99.5% (3/658) 98.9% (7/649) 99.2% (5/606) 99.8% (1/658) 99.7% (2/576) 99.4% (3/536)
ONTrack
ONT 99.9% (1/651) 100% (1 b /658) 100% (2 b /649) 100% (0/606) 100% (2 b /658) 99.8% (1/576) 100% (0/536)
Mixed
NGSpeciesID
ONT Medaka 100% (0/651) 100% (0/658) 99.7% (2/649) 99.3% (4/606) 100% (0/658) 99.8% (1/576) 99.6% (2/536)

For the mixed samples, 300 reads of each of the seven DNA barcodes were combined into a single file, from which NGSpeciesID generated multiple consensus sequences. NGSpeciesID was run using Medaka polishing.

a

Here, the Sanger sequence from Maestri et al. (2019) was shorter than the expected fragment length and all the consensus sequences. In these cases, we only calculated the percentage similarity for the region covered by the respective Sanger sequence.

b

The consensus sequences from Maestri et al. (2019) are missing one or two bases at the start, which could be due to a consensus calling error, or deletion of one additional base during the primer removal. For the percentage accuracy, we assumed them to be incorrectly trimmed. The highest similarity scores are highlighted in bold. The numbers in the brackets provide the amount of mismatches to the Sanger reference and the length of the reference sequence.