2020 Jul 13;36(Suppl 1):i12–i20. doi: 10.1093/bioinformatics/btaa458

Table 2.

Reference sequences after over-representation filtering

Reference set     Base pairs    # species      # leaf taxids  # assemblies  # sequences
RefSeq-CG-top-3   2.9e10 (62%)  11 464 (100%)  14 071 (100%)  15 171 (77%)  24 290 (74%)
RefSeq-ALL-top-3  2.1e11 (36%)  29 061 (100%)  51 292 (100%)  56 805 (38%)  4 400 402 (29%)

Note: Percentages in brackets show the share of data retained relative to the original set (Table 1). Information on the protein data can be found in Supplementary Table S5.