Skip to main content
. 2020 Sep 5;48(18):e106. doi: 10.1093/nar/gkaa727

Table 1.

Contamination test summary

Variant type %Contamination %Correct %Fragmented %Mismatches
None 0 100% 0% 12.29%
1 SNP 20% 100% 0% 13.38%
35% 96% 0% 14.41%
50% 44% 0% 14.68%
2 SNPs 20% 100% 0% 14.35%
35% 96% 0% 14.82%
50% 32% 0% 15.96%
3 SNPs 20% 100% 0% 14.04%
35% 96% 0% 15.21%
50% 35% 0% 16.58%
4 SNPs 20% 100% 0% 14.85%
35% 90% 0% 16.83%
50% 18% 0% 17.99%
608 bp indel 10% 99% 1% 13.49%
20% 89% 11% 14.78%
35% 16% 84% 17.28%
50% 2% 95% 21.00%
Unrelated 10% 86% 4% 18.24%
20% 28% 72% 23.96%
35% 0% 100% NA
50% 0% 100% NA

Each row of this table represents the average result from a single simulation experiment (repeated 500 times). Each experiment differed by the amount and type of contamination simulated. Variant Type denotes by what metric the ‘contaminating’ sequence (reads from a different sample that were artificially spiked) differs from the ‘primary’ sequence (the reference sample). %Contamination refers to the percentage of the read library derived from contaminating reads. %Correct denotes the percentage of correct assemblies yielded. %Fragmented denotes the percentage of fragmented assemblies yielded. %Mismatches denotes the percentage of assemblies that did not exactly match the reference.