2021 Nov 1;19:6009–6019. doi: 10.1016/j.csbj.2021.10.034

Table 1.

Dataset Statistics.

| Species | Available Train Dataset | Train Dataset Size | Test Dataset Size | Updated Train Dataset | Updated Test Dataset |
| --- | --- | --- | --- | --- | --- |
| Caenorhabditis elegans (C. elegans) | iDNA4mC (Chen et al. [21]) | 4mC = 1554; Non-4mC = 1554 | 4mC = 0; Non-4mC = 0 | 4mC = 7939; Non-4mC = 82033 | 4mC = 2352; Non-4mC = 2660 |
| | DeepTorrent (Liu et al. [24]) | 4mC = 55729; Non-4mC = 55729 | 4mC = 2667; Non-4mC = 2667 | | |
| | Zeng et al. [27] | 4mC = 11173; Non-4mC = 6635 | 4mC = 0; Non-4mC = 0 | | |
| | Rao et al. [26] | 4mC = 20000; Non-4mC = 20000 | 4mC = 0; Non-4mC = 0 | | |
| Drosophila melanogaster (D. melanogaster) | iDNA4mC (Chen et al. [21]) | 4mC = 1769; Non-4mC = 1769 | 4mC = 0; Non-4mC = 0 | 4mC = 72127; Non-4mC = 75460 | 4mC = 3332; Non-4mC = 3521 |
| | DeepTorrent (Liu et al. [24]) | 4mC = 53970; Non-4mC = 53970 | 4mC = 3684; Non-4mC = 3684 | | |
| | Rao et al. [26] | 4mC = 20000; Non-4mC = 20000 | 4mC = 0; Non-4mC = 0 | | |
| Arabidopsis thaliana (A. thaliana) | iDNA4mC (Chen et al. [21]) | 4mC = 1978; Non-4mC = 1978 | 4mC = 0; Non-4mC = 0 | 4mC = 81143; Non-4mC = 85456 | 4mC = 10388; Non-4mC = 11172 |
| | DeepTorrent (Liu et al. [24]) | 4mC = 63720; Non-4mC = 63720 | 4mC = 11307; Non-4mC = 11307 | | |
| | Rao et al. [26] | 4mC = 20000; Non-4mC = 20000 | 4mC = 0; Non-4mC = 0 | | |
| Escherichia coli (E. coli) | iDNA4mC (Chen et al. [21]) | 4mC = 388; Non-4mC = 388 | 4mC = 0; Non-4mC = 0 | 4mC = 1959; Non-4mC = 2156 | 4mC = 126; Non-4mC = 126 |
| | DeepTorrent (Liu et al. [24]) | 4mC = 1941; Non-4mC = 1941 | 4mC = 126; Non-4mC = 126 | | |
| Geoalkalibacter subterraneus (G. subterraneus) | iDNA4mC (Chen et al. [21]) | 4mC = 905; Non-4mC = 905 | 4mC = 0; Non-4mC = 0 | 4mC = 10583; Non-4mC = 10780 | 4mC = 5263; Non-4mC = 5263 |
| | DeepTorrent (Liu et al. [24]) | 4mC = 9934; Non-4mC = 9934 | 4mC = 5263; Non-4mC = 5263 | | |
| Geobacter pickeringii (G. pickeringii) | iDNA4mC (Chen et al. [21]) | 4mC = 569; Non-4mC = 569 | 4mC = 0; Non-4mC = 0 | 4mC = 4703; Non-4mC = 4900 | 4mC = 1210; Non-4mC = 1210 |
| | DeepTorrent (Liu et al. [24]) | 4mC = 4514; Non-4mC = 4514 | 4mC = 1210; Non-4mC = 1210 | | |
| Mus musculus | 4mCpred-EL [28] | 4mC = 800; Non-4mC = 800 | 4mC = 180; Non-4mC = 180 | 4mC = 800; Non-4mC = 800 | 4mC = 180; Non-4mC = 180 |
| Casuarina equisetifolia (C. equisetifolia) | iDNA-MS [30] | 4mC = 183; Non-4mC = 183 | 4mC = 183; Non-4mC = 183 | 4mC = 183; Non-4mC = 183 | 4mC = 183; Non-4mC = 183 |
| Saccharomyces cerevisiae (S. cerevisiae) | iDNA-MS [30] | 4mC = 990; Non-4mC = 990 | 4mC = 989; Non-4mC = 989 | 4mC = 990; Non-4mC = 990 | 4mC = 989; Non-4mC = 989 |
| Tolypocladium sp SUP5-1 (Tolypocladium) | iDNA-MS [30] | 4mC = 7664; Non-4mC = 7664 | 4mC = 7663; Non-4mC = 7663 | 4mC = 7664; Non-4mC = 7664 | 4mC = 7663; Non-4mC = 7663 |
| Fragaria vesca (F. vesca) | i4mC-ROSE [29] | 4mC = 4854; Non-4mC = 4854 | 4mC = 1617; Non-4mC = 1617 | 4mC = 12298; Non-4mC = 12152 | 4mC = 8819; Non-4mC = 9015 |
| | iDNA-MS [30] | 4mC = 7899; Non-4mC = 7899 | 4mC = 7898; Non-4mC = 7898 | | |
| Rosa chinensis (R. chinensis) | i4mC-ROSE [29] | 4mC = 2337; Non-4mC = 2337 | 4mC = 779; Non-4mC = 779 | 4mC = 2337; Non-4mC = 2337 | 4mC = 779; Non-4mC = 779 |