Figure 4.
Statistical analysis of models on all datasets based on the F1-score and outcomes of the 10 top and 10 worst performing models. (A) Friedman ranks, (B) Holm post-hoc test, (C) prediction results of the top performing models and (D) prediction results of the worst performing models. MCC* indicates the mean of two-class datasets only.
