Table 5. P-Values of a Pairwise Wilcoxon Signed-Rank Test for AUC and BEDROC Results, Comparing All Three Similarity Methods (GED-Based, FP-Based, and SED-Based)a.
| Wilcoxon test (AUC) |
Wilcoxon test (BEDROC) |
|||||
|---|---|---|---|---|---|---|
| SED | FP | GED | SED | FP | GED | |
| SED | 1.20841 × 10–13* | 7.4432 × 10–21* | 2.08456 × 10–16* | 1.18024 × 10–21* | ||
| FP | 1.20841 × 10–13* | 0.000748788* | 2.08456 × 10–16* | 0.000585085* | ||
| GED | 7.4432 × 10–21* | 0.000748788* | 1.18024 × 10–21* | 0.000585085* | ||
The test is applied to all targets in the datasets combined. Here, a confidence level of α = 0.05 is used, so p-values lower than α indicate statistically significant differences, which are marked with an asterisk (*) in the table.