Skip to main content
. 2021 Feb 8;13:7. doi: 10.1186/s13321-021-00488-1

Table 2.

The R2 between the actual solubility scores and those predicted by GraphSol based on individual feature groups, removing each feature group from the final GraphSol model, and recursively adding feature groups according to their importance, respectively

Feature groupsa CVd Ind. test Feature groupsb CVd Ind. test Features groupsc CVd Ind. test
GraphSol 0.476 ± 0.014 0.483
BLOSUM 0.329 ± 0.016 0.317 − BLOSUM 0.460 ± 0.011 0.465 BLOSUM 0.329 ± 0.016 0.317
AAPHY7 0.293 ± 0.014 0.289 − AAPHY7 0.465 ± 0.012 0.479 + SPIDER3 0.413 ± 0.012 0.409
PSSM 0.333 ± 0.012 0.332 − PSSM 0.457 ± 0.017 0.467 + PSSM 0.456 ± 0.011 0.453
HMM 0.337 ± 0.015 0.341 − HMM 0.455 ± 0.016 0.458 + HMM 0.465 ± 0.012 0.479
SPIDER3 0.231 ± 0.019 0.227 − SPIDER3 0.428 ± 0.018 0.449 + AAPHY7 0.476 ± 0.014 0.483

Italic values indicate the performance of using all feature groups in our model

a Performances based on individual feature groups

b by removing each feature group from all feature

c by adding feature groups recursively

d Performances by the fivefold cross-validation