Table 1.
BiolarkGSC+ | COPD-HPO | |||||
---|---|---|---|---|---|---|
Method/Metric | Precision | Recall | F1-score | Precision | Recall | F1-score |
OBO Anotator [9] | 0.810 | 0.568 | 0.668 | 0.318 | 0.282 | 0.299 |
NCBO [10] | 0.777 | 0.521 | 0.624 | 0.756 | 0.763 | 0.760 |
MonarchInitiative [16] | 0.751 | 0.608 | 0.672 | 0.741 | 0.747 | 0.744 |
Doc2hpo-Ensemble [15] | 0.754 | 0.608 | 0.673 | 0.779 | 0.755 | 0.767 |
MetaMap [12] | 0.707 | 0.599 | 0.649 | 0.640 | 0.781 | 0.704 |
Clinphen [11] | 0.590 | 0.418 | 0.489 | 0.377 | 0.328 | 0.351 |
NeuralCR [14] | 0.736 | 0.610 | 0.667 | 0.543 | 0.719 | 0.619 |
TrackHealth | 0.757 | 0.595 | 0.666 | 0.719 | 0.669 | 0.693 |
PhenoTagger [17] | 0.720 | 0.760 | 0.740 | 0.623 | 0.820 | 0.708 |
MMRerank | 0.754 | 0.599 | 0.668 | 0.822 | 0.779 | 0.800 |
MNIRerank | 0.789 | 0.603 | 0.683 | 0.802 | 0.736 | 0.768 |
PTRerank | 0.843 | 0.708 | 0.770 | 0.836 | 0.771 | 0.802 |
Note. MMRerank, MNIRerank, and PTRerank represent the re-ranking models based on MetaMap, MonarchInitiative methods, and PhenoTagger. The digits in bold indicate the best scores in terms of the corresponding metrics.