Table 4.
Metric | Formula | Description |
---|---|---|
Receiver operating characteristic (ROC) curve | TPR and FPR measure the proportion of correctly identified actual positives and actual negatives, respectively | |
Precision-recall (PR) curve | PR curve mainly evaluates the comprehensiveness of the detected lung infection pixels | |
F-measure curve |
is set to 0.3 to emphasize the effect of |
F-measure is computed by the weighted harmonic mean of precision and recall, which can reflect the quality of detection |
DICE score | DICE score measures the similarity between the predicted map and the ground truth | |
Sensitivity score | Sensitivity score measures the rate of missed detection | |
Specificity score | Specificity score measures the rate of false detection | |
Mean absolute error (MAE) score |
and denote the width and height of the image, respectively |
MAE score indicates the similarity between the segmentation map and the ground truth |
Area under curve (AUC) score |
and denote the set of negative and positive examples, respectively |
AUC score gives an intuitive indication of how well the segmentation map predicts the true lung infection regions |
Weighted F-measure (WF) score [45] |
and denote the weighted precision and weighted recall, respectively |
WP and WP measure the exactness and completeness, respectively |
Overlapping ratio (OR) score |
denotes the binary segmentation map |
OR score measures the completeness of lung infection pixels and the correctness of non-lung infection pixels |
Structure-measure (S-M) score [46] |
and denote the object-aware similarity and region-aware similarity, respectively |
S-M score measures the structural similarity between the segmentation map and the ground truth |
Execution time | Average execution time per image (in second) | All experiments were performed with the same equipment and settings |