2015 Aug 31;17(8):e212. doi: 10.2196/jmir.4612

Table 4.

Performance (in %) of automatic failure detection and its individual components.

Failure type	Causes of failure	Precision	Recall	Accuracy	F1 score
1. Boundary failures	1.1 Splitting a phrase	82.00	78.85	96.78	80.39
2. Missed term failures	2.1 Community specific nomenclatures	88.00	100.00	99.02	93.62
2. Missed term failures	2.2 Misspellings	80.00	93.02	97.88	86.02
3. Word sense ambiguity failures	3.1 Abbreviations and contractions	82.00	95.35	98.20	88.17
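3. Word sense ambiguity failures	3.2 Colloquial language	100.00	100.00	100.00	100.00
3. Word sense ambiguity failures	3.3 Numbers	100.00	100.00	100.00	100.00
3. Word sense ambiguity failures	3.4 Email addresses and URLs	100.00	100.00	100.00	100.00
3. Word sense ambiguity failures	3.5 Internet slang and SMS language	100.00	100.00	100.00	100.00
3. Word sense ambiguity failures	3.6 Names	66.00	100.00	97.21	79.52
3. Word sense ambiguity failures	3.7 Narrative style of pronoun “I”	100.00	100.00	100.00	100.00
3. Word sense ambiguity failures	3.8 Mismapped verbs	32.00	100.00	94.43	48.48
3. Word sense ambiguity failures	3.9 Inconsistent mappings	66.00	53.23	92.80	58.93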
Total		83.00	92.57	88.17	87.52
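
The F1 column can be cross-checked against the precision and recall columns. The sketch below assumes the standard definition of F1 as the harmonic mean of precision and recall; the table itself does not state the formula. The names `f1_score`, `ROWS`, and `check_rows` are illustrative and do not come from the article. The rows with 100.00 in every column are omitted because they agree trivially.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (both given in percent).

    Assumption: this is the standard F1 definition, not one quoted from the table.
    """
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (precision, recall, reported F1) copied from Table 4.
ROWS = {
    "1.1 Splitting a phrase": (82.00, 78.85, 80.39),
    "2.1 Community specific nomenclatures": (88.00, 100.00, 93.62),
    "2.2 Misspellings": (80.00, 93.02, 86.02),
    "3.1 Abbreviations and contractions": (82.00, 95.35, 88.17),
    "3.6 Names": (66.00, 100.00, 79.52),
    "3.8 Mismapped verbs": (32.00, 100.00, 48.48),
    "3.9 Inconsistent mappings": (66.00, 53.23, 58.93),
    "Total": (83.00, 92.57, 87.52),
}


def check_rows(rows, tolerance=0.01):
    """Return the labels whose recomputed F1 differs from the reported value."""
    mismatches = []
    for label, (precision, recall, reported) in rows.items():
        if abs(f1_score(precision, recall) - reported) > tolerance:
            mismatches.append(label)
    return mismatches


if __name__ == "__main__":
    for label, (p, r, reported) in ROWS.items():
        print(f"{label}: recomputed {f1_score(p, r):.2f}, reported {reported:.2f}")
    print("Mismatched rows:", check_rows(ROWS))
```

Under this assumption every listed row reproduces its reported F1 score to two decimal places. The low F1 for 3.8 Mismapped verbs (48.48) therefore comes entirely from its precision of 32.00, since recall for that row is 100.00.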