Skip to main content
. 2015 Feb 21;16:57. doi: 10.1186/s12859-015-0487-2

Table 3.

Dataset used for training SVM classifiers

Headwords Positive Negative SemCat catogories
Gene 3532163 1631676 GENE_OR_PROTEIN
DNA_MOLECULE
Protein 3533621 1630690 GENE_OR_PROTEIN
PROTEIN_MOLECULE
Disease 88653 5096888 DISEASE_OR_SYNDROME
INJURY_OR_POISONING
SIGN_OR_SYMPTOM
Cell(s) 14581 5178142 CELL

For each keyword, terms from relevant SemCat categories were merged and used for the classifiers.