Table 1.
The five species included in this benchmarking study, with the size and version of the reference genome used to simulate the read datasets.
| Species | Arabidopsis thaliana | Brassica napus | Glycine max | Solanum tuberosum | Zea mays |
|---|---|---|---|---|---|
| Genome Size (bp) | 135,670,229 | 738,357,821 | 955,377,461 | 727,424,546 | 2,104,350,183 |
| Genome version | TAIR10 | AST_PRJEB5043_v1 | Glycine_max_v2.1 | SolTub_3.0 | B73 RefGen_v4 |
| Repeats (%) | <23 (Flutre et al., 2011) | ~48 (Liu et al., 2018) | ~57 (Schmutz et al., 2010) | ~49 (Mehra et al., 2015) | ~75 (Wolf et al., 2015) |
| Citation | Lamesch et al., 2012 | Chalhoub et al., 2014 | Schmutz et al., 2010 | Xu et al., 2011 | Schnable et al., 2009 |
The proportion of repetitive sequences is an estimate for the whole genome.