TABLE 1.
The four metrics of model evaluation considered in this paper, their formulations, ranges, and descriptions.
Metric | Formula | [Range] (random expectation) | Description | ||
---|---|---|---|---|---|
Area under the operator curve (AUC) |
where the indicator function obtains the value of one if , whereas otherwise it obtains the value of zero |
[0…1] (0.5) |
The proportion of cases for which the occurrence probability for a randomly chosen occupied sampling unit is higher than the occurrence probability for a randomly chosen empty sampling unit. Can be equivalently defined as the integral of the receiver operating characteristic (ROC) curve, plotting sensitivity against 1—specificity over all thresholds (Hanley & McNeil, 1982) | ||
Tjur's R 2 |
|
[−1…1] (0) |
The difference in the average occupancy probabilities between occupied and empty sampling units (Tjur, 2009). Can be also interpreted as variance explained by the binary model, hence being to some extent equivalent of the usual R 2 of linear models (Tjur, 2009) | ||
True skill statistic (TSS) | Sensitivity + specificity – 1 |
[−1…1] (0) |
TSS compares the number of correct predictions, minus those attributable to random guessing, to that of a hypothetical set of perfect predictions (Allouche et al., 2006) | ||
Cohen's Kappa |
where and are, respectively, the observed and expected total accuracy |
[−1…1] (0) |
Overall accuracy of model predictions, corrected by the accuracy expected to occur by chance (Allouche et al., 2006; Shao & Halpin, 1995) |