Skip to main content
. 2022 Nov 18;12:19899. doi: 10.1038/s41598-022-24356-6

Table 2.

Summary of mean accuracy/R2 (± standard deviation) for the baseline models when the seed and training set order is changed during training.

Dataset (Task) Seed Shuffle Overall SotA performance
KAIMRC (Classification) 82.576 ± 0.4668 83.381 ± 0.1811 83.12 ± 0.4779 83.22
KAIMRC (Regression) 0.5927 ± 0.0113 0.579 ± 0.0122 0.5858 ± 0.01326 n/a
BCW 92.185 ± 1.7315 91.5 ± 2.8319 91.843 ± 2.3172 99.04
Codon usage (Kingdom) 85.280 ± 1.8029 85.38 ± 1.1778 85.33 ± 1.4367 84.25
Codon usage (DNA) 99.268 ± 0.0950 99.166 ± 0.0921 99.217 ± 0.1033 99.15
MIMIC-IV 76.362 ± 2.5808 79.736 ± 1.8906 78.049 ± 2.7769 84.72

The state of the art (SotA) model performance is also reported to confirm the models are properly trained.