Table 10.
Performance evaluation to remove duplicated sequences from the real life dataset SRR921889.
Tool | Prefix | Mismatches | Percentage removed | Time | Memory |
---|---|---|---|---|---|
G-CNV | 10 | 1 | 11.2 | 2 h | 17.3 GB |
10 | 3 | 11.5 | 1 h 50 min | 17.3 GB | |
25 | 1 | 11.9 | 16 min | 17.3 GB | |
25 | 3 | 12.1 | 8 min | 17.3 GB | |
Fulcrum | 10 | 1 | 11.3 | 4 h 01 min | 1.6 GB |
10 | 3 | 11.4 | 3 h 23 min | 1.6 GB | |
25 | 1 | 11.6 | 1 h 24 min | 1.6 GB | |
25 | 3 | 11.9 | 1 h 33 min | 1.6 GB |
The first column reports the tool. The second column reports the length of the prefixes used for clustering. Column third reports the allowed mismatches. The fourth column reports percentage of removed sequences. Columns fifth and sixth report the computing time and the memory consumption, respectively.