Table 7.
Robustness evaluation of our classifiers
Data set | Accuracy | AUC | F1-score |
---|---|---|---|
0.52 | 0.49 | 0.36 | |
0.92 | 0.91 | 0.89 | |
0.92 | 0.91 | 0.89 | |
0.93 | 0.92 | 0.91 | |
0.93 | 0.92 | 0.90 | |
0.92 | 0.91 | 0.90 | |
0.63 | 0.51 | 0.26 | |
0.93 | 0.89 | 0.86 | |
0.94 | 0.90 | 0.88 | |
0.93 | 0.90 | 0.86 | |
0.92 | 0.89 | 0.85 | |
0.93 | 0.89 | 0.86 |
corresponds to an experiment where class labels (i.e., or ) are randomly affected to drugs. correspond to experiments where negative examples (i.e., ) are replaced by drugs randomly picked in the knowledge graph. Indices i from 1 to 5 refer to 5 different draws