Table 2.
Evaluation scores obtained for the DDI classification task on the DDI corpus and on each type of document, comparing different configurations of the model
DDI test | DrugBank | Medline | |||||||
---|---|---|---|---|---|---|---|---|---|
Configuration | P | R | F | P | R | F | P | R | F |
Word embeddings | 0.5819 | 0.5291 | 0.5542 | 0.5868 | 0.5512 | 0.5685 | 0.5000 | 0.2951 | 0.3711 |
+ WordNet | 0.5754 | 0.5574 | 0.5663 | 0.5845 | 0.5745 | 0.5795 | 0.4600 | 0.3770 | 0.4144 |
+ Common Anc. | 0.5968 | 0.5248 | 0.5585 | 0.6045 | 0.5481 | 0.5749 | 0.5152 | 0.2787 | 0.3617 |
+ Concat. Anc. | 0.5282 | 0.5589 | 0.5431 | 0.5286 | 0.5590 | 0.5434 | 0.4921 | 0.5082 | 0.5000 |
+ WordNet + Anc. | 0.5182 | 0.6454 | 0.5749 | 0.5171 | 0.6568 | 0.5787 | 0.4590 | 0.4590 | 0.4590 |
Evaluation metrics used: Precision (P), Recall (R) and F1-score (F). Each row represents the addition of an information source to the initial configuration
Boldface indicates the configuration with highest score for each measure