2023 Feb 26;15:29. doi: 10.1186/s13321-023-00698-9

Table 4.

The performance comparison for classification and regression tasks

Classification (the higher the better)ᵃ

Model       Johnson et al.   Tox21            Clintox          ToxCast          HIV
RF          0.252 ± 0.014    0.818 ± 0.005    0.721 ± 0.088    -ᵇ               0.798 ± 0.040
FFN         0.258 ± 0.015    0.837 ± 0.010    0.837 ± 0.062    0.738 ± 0.009    0.803 ± 0.045
MPNN        0.258 ± 0.013    0.859 ± 0.011    0.873 ± 0.051    0.752 ± 0.010    0.788 ± 0.050
D-MPNN      0.281 ± 0.028    0.855 ± 0.015    0.895 ± 0.037    0.749 ± 0.013    0.788 ± 0.039
Deeper GCN  0.272 ± 0.022    0.853 ± 0.013    0.870 ± 0.042    0.751 ± 0.010    0.789 ± 0.031
GEM         0.280 ± 0.018    0.864 ± 0.010    0.825 ± 0.091    0.757 ± 0.013    0.769 ± 0.038
ABT-MPNN    0.295 ± 0.021    0.857 ± 0.010    0.904 ± 0.034    0.760 ± 0.013    0.809 ± 0.036

Regression (the lower the better)ᵃ

Model       Johnson et al.   ESOL             Lipophilicity    Freesolv         QM8
RF          1.315 ± 0.021    1.230 ± 0.066    0.846 ± 0.039    2.467 ± 0.570    0.014 ± 0.000
FFN         1.321 ± 0.016    0.614 ± 0.109    0.674 ± 0.043    1.275 ± 0.352    0.016 ± 0.000
MPNN        1.309 ± 0.017    0.575 ± 0.086    0.585 ± 0.044    1.042 ± 0.220    0.010 ± 0.000
D-MPNN      1.307 ± 0.024    0.594 ± 0.066    0.558 ± 0.044    0.915 ± 0.142    0.010 ± 0.000
Deeper GCN  1.325 ± 0.015    0.601 ± 0.056    0.580 ± 0.035    0.970 ± 0.368    0.012 ± 0.000
GEM         1.315 ± 0.021    0.632 ± 0.062    0.599 ± 0.035    0.962 ± 0.257    0.010 ± 0.000
ABT-MPNN    1.305 ± 0.017    0.566 ± 0.075    0.554 ± 0.041    0.902 ± 0.157    0.009 ± 0.000

ᵃ The evaluation metrics are reported as mean ± standard deviation over fivefold cross-validation (CV). The best performance values are highlighted in bold.

ᵇ The results of RF on ToxCast are not presented because of the substantial computational cost.
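
As a rough illustration of how a "mean ± standard deviation" entry of the kind reported in the table can be produced from per-fold scores, here is a minimal Python sketch. The function name `summarize_folds` and the fold scores are hypothetical and invented for illustration; they are not taken from the paper. The sketch assumes the sample standard deviation (`statistics.stdev`); the table footnote does not say whether the sample or population form was used.

```python
from statistics import mean, stdev


def summarize_folds(scores, digits=3):
    """Format per-fold scores as 'mean ± std' with a fixed number of decimals.

    Assumption: sample standard deviation (n - 1 in the denominator).
    Swap in statistics.pstdev for the population form.
    """
    if len(scores) < 2:
        raise ValueError("need at least two fold scores")
    return f"{mean(scores):.{digits}f} ± {stdev(scores):.{digits}f}"


# Hypothetical scores from five CV folds (illustrative only, not from the paper).
fold_scores = [0.80, 0.82, 0.78, 0.81, 0.79]
print(summarize_folds(fold_scores))  # prints "0.800 ± 0.016"
```

With only five folds, the sample and population standard deviations differ noticeably, so it is worth checking which convention a given toolkit uses before comparing numbers across papers.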