Skip to main content
. 2025 Feb 21;65(5):2456–2475. doi: 10.1021/acs.jcim.4c01902

Table 9. 5-Fold Cross-Validation Performance of Different Descriptors Using Gradient Boosted Trees and a RF Regressor on the BASF Data Seta.

XGBoost    
descriptor type R2 MAE
MFP 0.716 ± 0.046 0.398 ± 0.018
GRADEOPD 0.448 ± 0.060 0.590 ± 0.025
MFP + GRADEOPD 0.704 ± 0.062 0.421 ± 0.045
GRADE 0.519 ± 0.061 0.551 ± 0.031
MFP + GRADE 0.716 ± 0.047 0.416 ± 0.023
RDKitPhysChem 0.659 ± 0.038 0.472 ± 0.027
MFP + RDKitPhysChem 0.716 ± 0.044 0.423 ± 0.028
PLEC 0.722 ± 0.018 0.402 ± 0.017
MFP + PLEC 0.732 ± 0.039 0.404 ± 0.035
RF Regressor    
descriptor type R2 MAE
MFP 0.665 ± 0.020 0.457 ± 0.020
GRADEOPD 0.483 ± 0.014 0.579 ± 0.016
MFP + GRADEOPD 0.668 ± 0.028 0.464 ± 0.026
GRADE 0.509 ± 0.022 0.565 ± 0.020
MFP + GRADE 0.661 ± 0.029 0.467 ± 0.027
RDKitPhysChem 0.587 ± 0.028 0.510 ± 0.027
MFP + RDKitPhysChem 0.643 ± 0.019 0.477 ± 0.025
PLEC 0.687 ± 0.020 0.448 ± 0.016
MFP + PLEC 0.689 ± 0.021 0.448 ± 0.016
a

OPD = Only Pose-Dependent. The best performing ones are marked by the use of bold letters.