. 2020 Jul 24;18:1968–1979. doi: 10.1016/j.csbj.2020.07.011

Table 3.

Performance evaluation reported for each method, highlighting possible biases.

Method	Validation	Training data	Test Data	Correlation (ρ)	Anti-symmetry (ρ_dir-inv)	Sequence Identity/homologs
FoldX [18]	none	S339	S625	0.82	Biased* (−0.38)	not declared
MUpro [51]	20‐fold CV and LOO	S1615	S1615, S388	0.13–0.76	Biased* (−0.02)	Homologs removed from S1615 (SR1496, SR1135, SR1023, SR1539)
CUPSAT [36]	3/4/5-fold CV	S1538, S1603	S1538, S1603	0.55–0.78	Biased* (−0.54)	not declared
I-Mutant(3.0/2.0) [26], [40]	10/20/30-fold CV	S1948	S1948	0.62–0.71	Biased* (0.02)	not declared
iPTREE-STAB [25]	4/10/20-fold CV	S1859	S1859	0.7	not evaluated	not declared
AUTO-MUTE (2.0) [69]	20‐fold CV	Subsets of S1948, S1615, S388,S1791, S1396, S2204	Subsets of S1948, S1615, S388,S1791, S1396	0.74–0.79	Biased* (−0.06)	not declared
Prethermut [29]	10-fold CV	S3366, S2156	S3366, S2156	0.67–0.72	not evaluated	not declared
POPMUSIC(3.1/2.1) [28]	5-fold CV	S2648	S2648	0.63–0.79	Biased* (−0.29); unbiased version POPMUSICsym (−0.77)	not declared
Pro-Maya [52]	5/10-fold CV and LOO	S2648, S2156	S2648, S2156	0.59–0.8	not evaluated	<30% identity and keeping information on the mutation site
PROTS-RF [51]	5/10-fold CV	S2156	S2156 + D180, D140 (27 and 19 proteins)	0.62–0.86	Unbiased	<30% identity in CV
iStable(2.0) [30], [53]	5-fold CV v2.0: 10-fold CV	S2648, S1948 v2.0: S3528	M1311, M1820 (from S2648, S1948, no redundancies) v2.0: S630	0.85 v2.0: 0.67–0.71	Biased* (−0.05) v2.0: not evaluated	Meta predictor combining several predictors and using the same protein variant to train the combined model
NeEMO [54]	10-fold CV	S2399 (113 proteins)	IM_631 (from S2399)	0.5–0.79	Biased* (0.09)	Evaluated but not used in CV
DUET [41]	S350, p53	S2648	p53,S350	0.71–0.82	Biased* (−0.21)	not declared
mCSM [20]	5/10/20-fold CV	S2648, S1925	S350	0.51–0.82	Biased* (−0.26)	5-fold cross-validation separating by protein (Pearson = 0.51)
EASE-MM [24]	10-fold CV and blind	1676 mutations from S1948	S543, S236	0.51–0.59	not evaluated	max 25% sequence identity between folds and train/test sets
INPS(3D) [43], [55]	5-fold CV	S2648, p53	S2648, p53	0.53–0.71	Unbiased for sequence-only* (−0.99 1D, −0.86 3D)	5-fold cross-validation separating by protein as in mCSM
STRUM [22]	5-fold CV	Q3421	S2648, S350, Q306 (subset of S2648)	0.4–0.8	Biased* (0.34)	Q306 as test, with sequence identity < 60%
ELASPIC [56]	20‐fold CV and LOO	S3463 (159 proteins)	S2636 (134 proteins), S2104 (79 proteins)	0.77	not evaluated	90% sequence identity redundancy reduction
SAAFEC [39]	5-fold CV	983 mutations from Protherm (42 proteins)	983 mutations from Protherm (42 proteins)	0.61	not evaluated	not declared
MAESTRO(web) [57]	5/10/20-fold CV	S2648, S1925 (from S1948), S1765, S2244	S2648, S350	0.63–0.76	Biased* (−0.34)	Lowest correlation (0.63) when separating by protein in 5-fold CV as in mCSM
SDM(2) [46]	None	None	S2648, S350, p53, S140	0.52–0.63	Biased* (−0.75)	not declared
TML-MP [58]	5-fold cross validation	S2648 S350, M233	2648 S350, M233	0.54–0.82	not evaluated	not declared
ThreeFoil [16]	2-fold CV with 1000 reshuffling	Broom2017	Broom2017	0.73	not evaluated	not declared
DynaMut [59]	10-fold CV and blind TS	S2648	S351	0.58–0,70	Biased	Homology included, use in input Duet, trained on the same S2640
DDGun [21]	S2648, Ssym, p53, Myoglobin	Untrained	S2648, Ssym, p53, Myoglobin	0.45–0.71 (0.54–0.68)	Unbiased* (−0.99)	not applicable
DeepDDG [17]		Cao Train	Cao Test	0.56	not evaluated	Homologs (>25%) removed from test set
ProTstab [60]	10-fold CV	Varibench	Varibench	0.79	not evaluated	not declared

Possible biases can be due to anti-symmetry and lack of checking for sequence identity on training/test data. CV = cross-validation, LOO = leave-one-out, TS = test-set, MCC = Matthews Correlation Coefficient, ρ = correlation, ρ_dir-inv = correlation between direct and inverse variation. *Evaluated on Ssym database [11].