Skip to main content
. 2022 Jan 11;37(7):1763–1770. doi: 10.1007/s11606-021-07333-z

Table 2.

Summary of Model Characteristics

Development only with or without internal validation
(n = 40 studies)
Development and external validation
(n = 15 studies)
External validation
(n = 5 studies)
Modelling method*
  Regression analysis, no. of studies (%) 31 (78%) 13 (87%) 3 (60%)
  Artificial intelligence, no. of studies (%) 9 (23%) 5 (33%) 0 (0%)
  Other, no. of studies (%) 4 (10%) 0 (0%) 2 (40%)
Internal validation
  Split sample, no. of studies (%) 17 (43%) 1 (7%)
  Bootstrapping or cross-validation, no. of studies (%) 9 (23%) 1 (7 %)
  None, no. of studies (%) 14 (35%) 13 (87%)
Performance measures
  Explained variance
    R2, no. of studies (%) 21 (53%) 5 (33%) 2 (40%)
  Discrimination 24 (60%) 10 (67%) 4 (80%)
    C-statistic/AUC, no. of studies (%) 22 (55%) 10 (67%) 4 (80%)
    Discrimination slope, no. of studies (%) 2 (5%) 0 (0%) 0 (0%)
    Other§, no. of studies (%) 2 (5%) 0 (0%) 1 (20%)
Calibration 9 (23%) 6 (40%) 4 (80%)
  Goodness of fit, no. of studies (%) 4 (10%) 3 (20%) 1 (20%)
  Calibration plot, no. of studies (%) 0 (0%) 3 (20%) 4 (80%)
  Other, no. of studies (%) 5 (13%) 4 (27%) 1 (20%)
Classification
  Sensitivity/specificity, no. of studies (%) 14 (35%) 7 (47%) 1 (20%)
  Clinical usefulness 1 (3%) 1 (7%) 0 (0%)
  Net reclassification index, no. of studies (%) 1 (3%) 0 (0%) 0 (0%)
  Decision curve, no. of studies (%) 0 (0%) 1 (7%) 0 (0%)

AUC area under the receiver operating curve

*As some studies use multiple modelling strategies when presenting multiple models, totals may add up to > 100%

†For example, risk stratification with predefined risk tiers

‡As some studies use multiple measures of discrimination, totals may add up to > 100%

§For example, D-statistic, Brier score, Integrated Discrimination Improvement (IDI)

As some studies use multiple measures of calibration, totals may add up to > 100%

For example, calibration slope, root mean square of approximation (RMSEA), cost capture