Skip to main content
. 2021 Jul 11;120:108168. doi: 10.1016/j.patcog.2021.108168

Table 4.

The 12 metrics for evaluating the performance of various segmentation models.

Metric Formula Description
Receiver operating characteristic (ROC) curve TPR=TPTP+FN,FPR=FPFP+TN TPR and FPR measure the proportion of correctly identified actual positives and actual negatives, respectively
Precision-recall (PR) curve P=TPTP+FP,R=TPTP+FN PR curve mainly evaluates the comprehensiveness of the detected lung infection pixels
F-measure curve Fm=(1+β2)P·Rβ2·P+R
β2 is set to 0.3 to emphasize the effect of P
F-measure is computed by the weighted harmonic mean of precision and recall, which can reflect the quality of detection
DICE score DICE=2×|SMGT||SM|+|GT| DICE score measures the similarity between the predicted map and the ground truth
Sensitivity score Sen.=TPTP+FN Sensitivity score measures the rate of missed detection
Specificity score Spec.=TNFP+TN Specificity score measures the rate of false detection
Mean absolute error (MAE) score MAE=1W×H1W1H|SMGT|
W and H denote the width and height of the image, respectively
MAE score indicates the similarity between the segmentation map and the ground truth
Area under curve (AUC) score AUC=t0E0t1E1I[f(t0)f(t0)]|E0|·|E1|
E0 and E1 denote the set of negative and positive examples, respectively
AUC score gives an intuitive indication of how well the segmentation map predicts the true lung infection regions
Weighted F-measure (WF) score [45] WF=(1+β2)WP·WRβ2·WP+WR
WP and WR denote the weighted precision and weighted recall, respectively
WP and WP measure the exactness and completeness, respectively
Overlapping ratio (OR) score OR=|SBMGT||SBMGT|
SBM denotes the binary segmentation map
OR score measures the completeness of lung infection pixels and the correctness of non-lung infection pixels
Structure-measure (S-M) score [46] (1α)×SO(SM,GT)+α×SR(SM,GT)
SO and SRdenote the object-aware similarity and region-aware similarity, respectively
S-M score measures the structural similarity between the segmentation map and the ground truth
Execution time Average execution time per image (in second) All experiments were performed with the same equipment and settings