. 2021 Jul 17;35(8):901–909. doi: 10.1007/s10822-021-00405-6

Table 2.

Overview of the performance of the final multitask ensemble model (12_Full), used for the challenge, the singletask ensemble model (11_Full), and several commercial logP prediction tools on the SAMPL7, SAMPL6 and Martel et al. data sets [4]

Method	Dataset	R²	RMSE	Spearman ρ
AlogP	SAMPL7	− 0.30 [− 1.78,0.34]	0.82 [0.59,1.01]	0.42 [− 0.09,0.73]
XlogP3	SAMPL7	0.01 [− 1.12,0.46]	0.72 [0.55,0.87]	0.52 [0.07,0.78]
S+ logP	SAMPL7	0.06 [− 1.23,0.64]	0.70 [0.41,0.93]	0.62 [0.19,0.87]
Model 11_Full	SAMPL7	− 0.17 [− 1.49,0.38]	0.78 [0.52,1.01]	0.60 [0.13,0.86]
Model 12_Full	SAMPL7	0.17 [− 0.95,0.65]	0.66 [0.40,0.89]	0.63 [0.20,0.91]
AlogP	SAMPL6	0.56 [− 0.73,0.84]	0.44 [0.25,0.62]	0.83 [0.32,0.97]
XlogP3	SAMPL6	0.54 [− 0.69,0.82]	0.45 [0.29,0.58]	0.71 [0.05,0.94]
S+ logP	SAMPL6	0.42 [− 1.17,0.80]	0.51 [0.32,0.65]	0.71 [0.03,0.94]
Model 11_Full	SAMPL6	0.71 [− 0.25,0.90]	0.36 [0.24,0.46]	0.85 [0.40,0.99]
Model 12_Full	SAMPL6	0.75 [− 0.08,0.93]	0.34 [0.17,0.46]	0.82 [0.30,0.99]
AlogP	Martel et al.	− 0.15 [− 0.34,− 0.00]	1.27 [1.19,1.34]	0.73 [0.69,0.76]
XlogP3	Martel et al.	0.04 [− 0.11,0.16]	1.16 [1.10,1.21]	0.78 [0.75,0.81]
S+ logP	Martel et al.	− 0.26 [− 0.45,− 0.10]	1.33 [1.26,1.39]	0.71 [0.67,0.75]
Model 11_Full	Martel et al.	− 0.33 [− 0.51,− 0.18]	1.36 [1.31,1.41]	0.74 [0.70,0.77]
Model 12_Full	Martel et al.	− 0.00 [− 0.14,0.12]	1.18 [1.13,1.23]	0.76 [0.73,0.80]

The 95% confidence interval for the different performance metrics is shown between square brackets.