Table 1. Pre-trained concept tagging tools used in ezTag.
Pre-trained tagger | Bio-entity | Nomenclature | F1 score (normalization) |
---|---|---|---|
TaggerOne | Chemical | MeSH | 0.895 |
Disease | MEDIC | 0.807 | |
GNormPlus | Gene | NCBI Gene | 0.867 |
Species | NCBI Taxonomy | 0.854 | |
tmVar | Sequence variation | NCBI dbSNP | 0.903 |
MEDIC is a disease vocabulary created by Comparative Toxicogenomics Database. All other vocabularies are products of National Library Medicine. F1 scores are taken from their corresponding publications.