. 2012 Apr 1;3:3. doi: 10.1186/2041-1480-3-3

Table 9. Lemmatization performance of the BioLemmatizer resources on the CRAFT set

Silver Standard
Resource	Recall	Precision	F-score
Base (MorphAdorner lexicon)	94.37% (5532/5862)	94.16% (5532/5875)	94.26%
Base + GENIA	94.20% (5522/5862)	93.90% (5522/5881)	94.05%
Base + BioLexicon	98.41% (5769/5862)	98.23% (5769/5873)	98.32%
Entire Lexicon	98.60% (5780/5862)	98.42% (5780/5873)	98.51%
Rule Only	97.83% (5735/5862)	97.83% (5735/5862)	97.83%
Rule + Lexicon Validation	98.67% (5784/5862)	98.67% (5784/5862)	98.67%

Gold Standard
Resource	Recall	Precision	F-score
Base (MorphAdorner lexicon)	53.71% (311/579)	53.34% (311/583)	53.52%
Base + GENIA	62.69% (363/579)	61.95% (363/586)	62.32%
Base + BioLexicon	64.77% (375/579)	64.10% (375/585)	64.43%
Entire Lexicon	76.68% (444/579)	75.90% (444/585)	76.29%
Rule Only	85.84% (497/579)	85.84% (497/579)	85.84%
Rule + Lexicon Validation	90.85% (526/579)	90.85% (526/579)	90.85%
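
The fractions in the table can be turned back into the reported percentages with a few lines of arithmetic. The sketch below is a minimal illustration, not the authors' evaluation code. It assumes that the recall denominator is the number of reference lemmas, that the precision denominator is the number of lemmas the system produced, and that the F-score is the balanced harmonic mean of the two. The table does not state these definitions, and the function name `prf` is hypothetical.

```python
def prf(correct: int, reference_total: int, predicted_total: int) -> tuple[float, float, float]:
    """Return (recall, precision, F-score) as fractions in [0, 1].

    Assumed definitions (not stated in the table):
      recall    = correct / reference_total
      precision = correct / predicted_total
      F-score   = harmonic mean of precision and recall
    """
    recall = correct / reference_total
    precision = correct / predicted_total
    # Guard against 0/0 when nothing was lemmatized correctly.
    f_score = 2 * precision * recall / (precision + recall) if correct else 0.0
    return recall, precision, f_score


# "Entire Lexicon" row of the gold standard: 444/579 recall, 444/585 precision.
recall, precision, f_score = prf(444, 579, 585)
print(f"{recall:.2%} {precision:.2%} {f_score:.2%}")  # prints "76.68% 75.90% 76.29%"
```

Under these assumptions, rows whose recall and precision share a denominator (Rule Only, Rule + Lexicon Validation) necessarily report three identical values.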