2020 Jun 24;11(28):7335–7348. doi: 10.1039/d0sc01637c

Average statistical performance for models evaluated on test sets generated by chemical clustering and on test sets generated randomly. Clustered statistics are taken from Table 3, and random statistics are generated from Table 5. The difference shown is the change in performance when moving from random to clustered test sets (a).

                     ACC (%)   MCC      ROC-AUC
Clustered test set   92.2      0.814    0.96
Random test set      92.8      0.832    0.96
Difference           −0.6      −0.018   0

(a) ACC = accuracy, MCC = Matthews correlation coefficient, ROC-AUC = area under the receiver operating characteristic curve.
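
The Difference row is the clustered value minus the random value for each metric. The short sketch below reproduces that arithmetic from the numbers in the table. The dictionary layout and the function name `performance_difference` are illustrative assumptions, not part of the original work.

```python
# Values copied from the table; the data layout is an illustrative assumption.
clustered = {"ACC": 92.2, "MCC": 0.814, "ROC-AUC": 0.96}
random_split = {"ACC": 92.8, "MCC": 0.832, "ROC-AUC": 0.96}


def performance_difference(clustered, random_split, ndigits=3):
    """Return clustered minus random for each metric.

    Results are rounded to `ndigits` decimals to suppress
    floating-point noise from the subtraction.
    """
    return {
        metric: round(clustered[metric] - random_split[metric], ndigits)
        for metric in clustered
    }


difference = performance_difference(clustered, random_split)
for metric, delta in difference.items():
    # Print each metric with an explicit sign.
    print(f"{metric}: {delta:+.3f}")
```

A negative value means the model scored lower on the clustered test set than on the random one.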