Skip to main content
. Author manuscript; available in PMC: 2024 Jun 4.
Published in final edited form as: Res Comput Mol Biol. 2020 Apr 21;12074:37–53. doi: 10.1007/978-3-030-45257-5_3

Table 1.

Summary of theoretical results for the second phase of different algorithms for generating a set of k-mers hitting all L-long sequences. PDOCKS is DOCKS with the improved hitting number calculation, i.e. greedy removal of one vertex at each iteration. pD, pDA denote the total number of picked vertices for DOCKS/PDOCKS and DOCKSany, respectively. m denotes the number of parallel threads used, Tmax the maximum vertex hitting number, and ϵ and δ PASHA’s randomization parameters.

Algorithm DOCKS PDOCKS DOCKSany PASHA
Theoretical runtime O((1+pD)|Σ|k+1L) O((1+pD)|Σ|k+1L/m) O((1+pDA)|Σ|k+1) O((L2|Σ|k+1log2(|Σ|k))/(εδ3m))
Approximation ratio 1+logTmax 1+logTmax N/A (1+logTmax)/(14δ2ε)