Skip to main content
. 2015 Aug 31;17(8):e212. doi: 10.2196/jmir.4612

Table 4.

Performance (in %) of automatic failure detection and its individual component.

Failure type Causes of failure Precision Recall Accuracy F1 score
1. Boundary failures 1.1 Splitting a phrase 82.00 78.85 96.78 80.39
2. Missed term failures 2.1 Community specific nomenclatures 88.00 100.00 99.02 93.62
2.2 Misspellings 80.00 93.02 97.88 86.02
3. Word sense ambiguity failures 3.1 Abbreviations and contractions 82.00 95.35 98.20 88.17
3.2 Colloquial language 100.00 100.00 100.00 100.00
3.3 Numbers 100.00 100.00 100.00 100.00
3.4 Email addresses and URLs 100.00 100.00 100.00 100.00
3.5 Internet slang and SMS language 100.00 100.00 100.00 100.00
3.6 Names 66.00 100.00 97.21 79.52
3.7 Narrative style of pronoun “I” 100.00 100.00 100.00 100.00
3.8 Mismapped verbs 32.00 100.00 94.43 48.48
3.9 Inconsistent mappings 66.00 53.23 92.80 58.93
Total 83.00 92.57 88.17 87.52