. 2016 Mar 31;7:50. doi: 10.3389/fpsyt.2016.00050

Table 2.

Factors that influence the ML effect size and classification accuracy.

Source	Relevant factor	Acting on	Effect in study:
			Training + LOOCV	Train → test
Reference: homogeneous sample (“0”)	Separation strength, Δy₀ Spread, σ_y₀	–	d_ML0 = Δy₀/σ_y₀ acc₀ = Φ(d_ML0/2)	–
Heterogeneity of the disease^a	f	Separation strength, Δy → d_ML	d_ML = √[(1 + f)/2] d_ML0	d_ML = fd_ML0
Heterogeneity: biological variation	σ_BIOL	broadening, σ_y → d_ML	d_ML = (σ_y₀/σ_y)d_ML0	d_MLT = (σ_y/σ_y_T)d_ML
Measurement noise	σ_EXP	broadening, σ_y → d_ML	d_ML = (σ_y₀/σ_y) d_ML0	d_MLT = (σ_y/σ_y_T)d_ML
Sampling effects (finite N)	N, σ_y	uncertainty in accuracy	≤ SD(acc) in train/test case	SD(acc) = √(acc_true × (100%−acc_true)/N)
Gold → silver standard	Intra-class kappa, κ	ceiling of accuracy		acc = κ × acc₀ +(1−κ)/2

^ai.e., related to the prediction model.

f = cos(θ), the relative amount of heterogeneity;σ²_y = σ²_BIOL + σ²_EXP; acc = accuracy; subscripts “T” refer to values in the Test sample.