Skip to main content
. Author manuscript; available in PMC: 2022 Oct 1.
Published in final edited form as: Br J Psychiatry. 2022 Jan 24;220(Spec Iss 4 Themed Iss Precision Medicine and Personalised Healthcare in Psychiatry):179–191. doi: 10.1192/bjp.2021.219

Table 3. Performance metrics for best model per outcome in each study.

Study ID Outcome Discrimination C- Statistic Calibration Other Global Performance Metrics Classification Metrics
Studies Reporting Apparent Validity
BHATTACHARYYA 2021 Psychiatric rehospitalisation 0.749 Calibration plot only; No α or β Brier score - 0.192 N.R.
CHUA 2019 EET Status at 2 years 0.759 (95%CI: 0.728, 0.790) N.R. N.R. Classification Accuracy - 0.759; PPV - 0.64; NPV - 0.78; PSI - 0.42
DERKS 2010 Andreasen symptom remission (6 months duration) with 1 year f/u N.R. N.R. N.R. Classification Accuracy - 0.63; Balanced Accuracy - 0.665; Sensitivity - 0.73; Specificity - 0.60; PPV - 0.73; NPV - 0.61; PSI - 0.34
FLYCKT 2006 Global functioning (Independent living, EET status, GAF ≥60) at mean 5.4 years N.R. N.R. N.R. Classification Accuracy - 0.81; Balanced Accuracy - 0.805; Sensitivity - 0.84; Specificity - 0.77
GONZALEZ-BLANCH 2010 Global functioning (EET status, DAS ≤1) at 1 year N.R. Hosmer–Lemeshow test - p = >0.05 N.R. Classification Accuracy - 0.750; Balanced Accuracy - 0.587; Sensitivity - 0.261; Specificity - 0.913; PPV - 0.500; NPV - 0.788; PSI - 0.288
Studies Reporting Internal Validity
AJNAKINA 2020 Early treatment resistance from illness onset with 5 years f/u 0.77 α - 0.028; β - 1.264; No calibration plot N.R. Balanced Accuracy - 0.5; Sensitivity - 0; Specificity - 1.00; PPV - 0.48, NPV - 0.84; PSI - 0.32
Later treatment resistance with 5 years f/u 0.77 α - 0.504; β - 1.838; No calibration plot N.R. Balanced Accuracy - 0.81; Sensitivity - 0.62; Specificity - 1.00; PPV - 0.42; NPV - 1.00; PSI - 0.42
BHATTACHARYYA 2021 Psychiatric rehospitalisation 0.66 Calibration plot only; No α or β Brier score - 0.232 N.R.
DEMJAHA 2017 Early Treatment Resistance from Illness Onset with 10 years f/u N.R. N.R. Brier score - 0.146; McFadden pseudo R2 - 0.1 N.R.
DENIJS 2019 Andreasen Symptom Remission (6 months duration) at 3 years N.R. N.R. N.R. Balanced Accuracy - 0.644; Sensitivity - 0.76; Specificity - 0.50; PPV - 0.722; NPV - 0.548; PSI - 0.27
GAF ≥65 at 3 years N.R. N.R. N.R. Balanced Accuracy - 0.676; Sensitivity - 0.749; Specificity - 0.584; PPV - 0.701; NPV - 0.642; PSI - 0.343
Andreasen symptom remission (6 months duration) at 6 years N.R. N.R. N.R. Balanced Accuracy - 0.647; Sensitivity - 0.787; Specificity - 0.465; PPV - 0.690; NPV - 0.590; PSI - 0.28
GAF ≥65 at 6 years N.R. N.R. N.R. Balanced Accuracy - 0.676; Sensitivity - 0.818; Specificity - 0.477; PPV - 0.718; NPV - 0.616; PSI - 0.334
KOUTSOULERIS 2016 GAF ≥65 at 1 year N.R. N.R. N.R. Balanced Accuracy - 0.738; Sensitivity - 0.667; Specificity - 0.809; PPV - 0.515; NPV - 0.888; PSI - 0.403
LEIGHTON 2021 Andreasen symptom remission (6 months duration) at 1 year 0.74 (0.73, 0.75) β - 0.84 (95%CI: 0.81, 0.86); No calibration plot N.R. N.R.
PUNTIS 2021 Psychiatric hospitalisation after discharge from early intervention 0.76 (0.75, 0.77) α - 0.01 (95%CI: -0.25, 0.24); β - 0.89 (95%CI: 0.88, 0.89); Calibration plot Brier score - 0.078 N.R.
Studies Reporting Internal-External Validity
DENIJS 2019 Andreasen symptom remission (6 months duration) at 3 years N.R. N.R. N.R. Balanced Accuracy - 0.638; Sensitivity - 0.629; Specificity - 0.647; PPV - 0.758; NPV - 0.485; PSI - 0.243
GAF ≥65 at 3 years N.R. N.R. N.R. Balanced Accuracy - 0.648; Sensitivity - 0.658; Specificity - 0.638; PPV - 0.727; NPV - 0.565; PSI - 0.292
Andreasen symptom remission (6 months duration) at 6 years N.R. N.R. N.R. Balanced Accuracy - 0.625; Sensitivity - 0.685; Specificity - 0.565; PPV - 0.743; NPV - 0.493; PSI - 0.236
GAF ≥65 at 6 years N.R. N.R. N.R. Balanced Accuracy - 0.640; Sensitivity - 0.718; Specificity - 0.561; PPV - 0.732; NPV - 0.553; PSI - 0.285
KOUTSOULERIS 2016 GAF ≥65 at 1 year N.R. N.R. N.R. Balanced Accuracy - 0.711; Sensitivity - 0.641; Specificity - 0.781; PPV - 0.472; NPV - 0.877; PSI - 0.349
LEIGHTON 2019 (2) EET Status at 1 year 0.736 (95%CI: 0.702 - 0.771) N.R. N.R. Classification Accuracy - 0.693 (95%CI: 0.660, 0.725); Balanced Accuracy - 0.694 (95%CI: 0.562, 0.812); Sensitivity - 0.722 (95%CI: 0.573, 0.821); Specificity - 0.666 (95%CI: 0.550, 0.803); PPV - 0.719 (95%CI: 0.673, 0.785); NPV - 0.668 (95%CI: 0.606, 0.736); PSI - 0.387 (95%CI: 0.279, 0.521)
GAF ≥65 at 1 year 0.731 (95%CI: 0.697, 0.765) N.R. N.R. Classification Accuracy - 0.687 (95%CI: 0.657, 0.718); Balanced Accuracy - 0.691 (95%CI: 0.541, 0.825); Sensitivity - 0.722 (95%CI: 0.487, 0.778); Specificity - 0.660 (95%CI: 0.594, 0.871); PPV - 0.650 (95%CI: 0.616, 0.769); NPV - 0.726 (95%CI: 0.655, 0.766); PSI - 0.376 (95%CI: 0.271 - 0.535)
Andreasen symptom remission (6 months duration) at 1 year 0.703 (95%CI: 0.664, 0.742) N.R. N.R. Classification Accuracy - 0.670 (95%CI: 0.636, 0.703); Balanced Accuracy - 0.668 (95%CI: 0.518, 0.827); Sensitivity - 0.584 (95%CI: 0.491, 0.827); Specificity - 0.751 (95%CI: 0.544, 0.827); PPV - 0.679 (95%CI: 0.601, 0.739); NPV - 0.667 (95%CI: 0.631, 0.734); PSI - 0.346 (95%CI: 0.232, 0.473)
Quality of life at 1 year 0.704 (95%CI: 0.667, 0.742) N.R. N.R. Classification Accuracy - 0.668 (95%CI: 0.632, 0.704); Balanced Accuracy - 0.667 (95%CI: 0.532, 0.789); Sensitivity - 0.623 (95%CI: 0.512, 0.774); Specificity - 0.711 (95%CI: 0.551, 0.803); PPV - 0.633 (95%CI: 0.575, 0.701); NPV 0.700 (95%CI: 0.659, 0.759); PSI - 0.333 (95%CI: 0.234, 0.460)
Studies Reporting External Validity
LEIGHTON 2019 (1) EET status at 1 year 0.876 (95%CI: 0.864, 0.887) N.R. N.R. Classification Accuracy - 0.851; Balanced Accuracy - 0.845; Sensitivity - 0.815; Specificity - 0.875; PPV - 0.815; NPV - 0.875; PSI - 0.690
Andreasen symptom remission (no duration criteria) at 1 year 0.652 (95%CI: 0.635, 0.670) N.R. N.R. Classification Accuracy - 0.612; Balanced Accuracy - 0.623; Sensitivity - 0.578; Specificity - 0.667; PPV - 0.794; NPV - 0.424; PSI - 0.218
Andreasen symptom remission (6 months duration) at 1 year 0.630 (95%CI: 0.612, 0.647) N.R. N.R. Classification Accuracy - 0.625; Balanced Accuracy - 0.626; Sensitivity - 0.606; Specificity - 0.645; PPV - 0.645; NPV - 0.606; PSI - 0.251
LEIGHTON 2019 (2) - Validated in UK EET Status at 1 year 0.867 (95%CI: 0.805, 0.930) N.R. N.R. Classification Accuracy - 0.838 (95%CI: 0.775, 0.894); Balanced Accuracy - 0.853 (95%CI: 0.740, 0.935); Sensitivity - 0.898 (95%CI: 0.780, 0.966); Specificity - 0.807 (95%CI: 0.699, 0.904); PPV - 0.766 (95%CI: 0.679, 0.867); NPV - 0.911 (95%CI: 0.840, 0.971); PSI - 0.677 (95%CI: 0.519, 0.838)
Andreasen symptom remission (6 months duration) at 1 year 0.680 (95%CI: 0.587, 0.773) N.R. N.R. Classification Accuracy - 0.695 (95%CI: 0.618, 0.771); Balanced Accuracy - 0.695 (95%CI: 0.535, 0.841); Sensitivity - 0.621 (95%CI: 0.455, 0.773); Specificity - 0.769 (95%CI: 0.615, 0.908); PPV - 0.729 (95%CI: 0.636, 0.854); NPV - 0.667 (95%CI: 0.593, 0.759); PSI - 0.396 (95%CI: 0.229, 0.613)
Quality of life at 1 year 0.679 (95%CI: 0.522, 0.836) N.R. N.R. Classification Accuracy - 0.702 (95%CI: 0.596, 0.809); Balanced Accuracy - 0.729 (95%CI: 0.407, 0.917); Sensitivity - 0.957 (95%CI: 0.564, 1.000); Specificity - 0.500 (95%CI: 0.250, 0.833); PPV - 0.640 (95%CI: 0.561, 0.800); NPV - 0.900 (95%CI: 0.643, 1.000); PSI - 0.540 (95%CI: 0.204, 0.800)
LEIGHTON 2019 (2) - Validated in Denmark EET Status at 1 year 0.660 (95%CI: 0.610, 0.710) N.R. N.R. Classification Accuracy - 0.680 (95%CI: 0.609, 0.725); Balanced Accuracy - 0.655 (95%CI: 0.516, 0.774); Sensitivity - 0.584 (95%CI: 0.457, 0.723); Specificity - 0.726 (95%CI: 0.574, 0.824); PPV - 0.490 (95%CI: 0.421, 0.563); NPV - 0.793 (95%CI: 0.760, 0.831); PSI - 0.283 (95%CI: 0.181, 0.394)
GAF ≥65 at 1 year 0.573 (95%CI: 0.504, 0.643) N.R. N.R. Classification Accuracy - 0.456 (95%CI: 0.328, 0.817); Balanced Accuracy - 0.589 (95%CI: 0.234, 0.926); Sensitivity - 0.781 (95%CI: 0.233, 0.945); Specificity - 0.396 (95%CI: 0.234, 0.906); PPV - 0.179 (95%CI: 0.158, 0.333); NPV - 0.914 (95%CI: 0.876, 0.967); PSI - 0.093 (95%CI: 0.034, 0.300)
Andreasen symptom remission (6 months duration) at 1 year 0.616 (95%CI: 0.553, 0.679) N.R. N.R. Classification Accuracy - 0.618 (95%CI: 0.524, 0.704); Balanced Accuracy - 0.621 (95%CI: 0.342, 0.864); Sensitivity - 0.612 (95%CI: 0.306, 0.843); Specificity - 0.629 (95%CI: 0.378, 0.885); PPV - 0.476 (95%CI: 0.412, 0.636); NPV - 0.742 (95%CI: 0.687, 0.829); PSI - 0.217 (95%CI: 0.099, 0.465)
Quality of life at 1 year 0.556 (95%CI: 0.481, 0.631) N.R. N.R. Classification Accuracy - 0.589 (95%CI: 0.540, 0.637); Balanced Accuracy - 0.589 (95%CI: 0.312, 0.845); Sensitivity - 0.876 (95%CI: 0.419, 0.947); Specificity - 0.301 (95%CI: 0.204, 0.743); PPV - 0.559 (95%CI: 0.527, 0.642); NPV - 0.706 (95%CI: 0.555, 0.841); PSI - 0.265 (95%CI: 0.081, 0.483)
LEIGHTON 2021 Andreasen symptom remission (6 months duration) 0.73 (95%CI: 0.71, 0.75) α - 0.12 (95%CI: 0.02, 0.22); β - 0.98 (95%CI: 0.85, 1.11); Calibration plot N.R. N.R.
PUNTIS 2021 Psychiatric hospitalisation after discharge from early intervention 0.70 (95%CI: 0.66, 0.75) α - -0.01 (95%CI: -0.17, 0.167); β - 1.00 (95%CI: 0.78, 1.22); Calibration plot Brier score - 0.094 N.R.

N.R. – not reported; EET – employment, education or training; GAF – Global Assessment of Functioning; DAS – Disability Assessment Schedule; PPV – positive predictive value; NPV – negative predictive value; PSI – prognostic summary index; f/u – follow-up