Table 2.
Number of non-redundant terminators (No. Term.) per genome accession used for training
| Data source | Species | Phylum | Genome | No. | GC | Avg. |
|---|---|---|---|---|---|---|
| Accession | Term. | (%) | Pre. | |||
| [6, 54] | Bacillus subtilis 168 | Bacillota | AL009126.3 | 1800 | 42.9 | 0.88 |
| [55–57], | ||||||
| [23, 58, 59] | Bacillus subtilis 168 | Bacillota | NC_000964.3 | 4872 | 42.9 | 0.72 |
| [57] | Caulobacter vibrioides NA1000 | Pseudomonadota | CP001340.1 | 341 | 66.2 | 0.93 |
| [60] | Clostridioides difficile 630 | Bacillota | CP010905.2 | 1646 | 28.6 | 0.84 |
| [61] | Dickeya dadantii 3937 | Pseudomonadota | NC_014500.1 | 1786 | 55.5 | 0.49 |
| [62] | Escherichia coli BW25113 | Pseudomonadota | CP009273.1 | 1095 | 50.1 | 0.81 |
| [7, 57, 58, 63], | ||||||
| [8, 37, 64] | E. coli str. K-12 substr. MG1655 | Pseudomonadota | NC_000913.3 | 4139 | 50.1 | 0.62 |
| [65] | Pseudomonas aeruginosa PAO1 | Pseudomonadota | NC_002516.2 | 805 | 65.6 | 0.89 |
| [66] | Staphylococcus aureus JKD6009 | Bacillota | LR027876.1 | 978 | 32.4 | 0.81 |
| [67] | Staphylococcus aureus NCTC 8325 | Bacillota | NC_007795.1 | 566 | 32.4 | 0.90 |
| [68] | Streptococcus pneumoniae D39V | Bacillota | CP027540.1 | 747 | 39.1 | 0.94 |
| [69] | Streptococcus pneumoniae TIGR4 | Bacillota | NC_003028.3 | 1810 | 39.1 | 0.67 |
| [70, 71] | Streptomyces avermitilis MA-4680 | Actinomycetota | BA000030.4 | 2006 | 69.7 | 0.63 |
| [71, 38] | Streptomyces clavuligerus ATCC27064 | Actinomycetota | CP027858.1 | 1583 | 71.6 | 0.62 |
| [71] | Streptomyces coelicolor M145 | Actinomycetota | NC_003888.3 | 1308 | 71.1 | 0.73 |
| [71, 72] | Streptomyces griseus NBRC13350 | Actinomycetota | NC_010572.1 | 2724 | 71.2 | 0.76 |
| [71, 73] | Streptomyces lividans TK24 | Actinomycetota | CP009124.1 | 1999 | 71.2 | 0.59 |
| [71] | Streptomyces tsukubaensis NBRC108819 | Actinomycetota | CP020700.1 | 1283 | 70.8 | 0.55 |
| [57] | Vibrio natriegens ATCC 14048 | Pseudomonadota | CP009977.1 | 905 | 44.7 | 0.97 |
| [57] | Vibrio natriegens ATCC 14048 | Pseudomonadota | CP009978.1 | 257 | 44.1 | 0.90 |
| [74] | Vibrio parahaemolyticus RIMD 2210633 | Pseudomonadota | NC_004603.1 | 1852 | 44.7 | 0.67 |
| [75] | Zymomonas mobilis ZM4 = ATCC 31821 | Pseudomonadota | CP023715.1 | 2040 | 45.7 | 0.54 |
There is a total of 36 542 terminator sequences. Avg. Pre. shows the average precision obtained during cross-validation as in Fig. 5.