Skip to main content
. 2015 Jun 15;15(Suppl 2):S4. doi: 10.1186/1472-6947-15-S2-S4

Table 11.

Sentence detection.

Top 10 [1-4] w 2 [1-5] w 2
1 Capitalization 11.34 LT 674.21
2 LT 10.52 Mean-LT 674.21
3 Mean-LT 10.52 Capitalization 637.85
4 No "\n" 4.82 RippenanteileRC 627.54
5 ∈ CCDict 4.08 LymphknotenRC 356.25
6 Double "\n" 3.77 Double "\n" 336.64
7 All upper case 0.97 LungengerüstzeichnungRC 332.50
8 Contains digit 0.71 IntegumentRC 321.86
9 > b3 0.31 No "\n" 300.18
10 Contains period 0.19 NormaleRC 277.68

Top 10 [1-6] w 2 [1-7] w 2

1 Capitalization 971.25 Abbreviation 1326.41
2 Mean-LT 840.45 Capitalization 867.06
3 LT 840.45 o.B. 382.83
4 Double "\n" 341.46 Double "\n" 374.57
5 No "\n" 324.25 No "\n" 364.32
6 o.B. 259.13 bds. 282.13
7 RippenanteileRC 254.91 CTRC 266.54
8 mitresez. 254.91 LeberlappenRC 225.08
9 CTRC 251.41 A. 206.77
10 LeberlappenRC 236.26 NarbigeRC 191.01

Top 10 feature rankings per feature set (1 Language features; 2 Rule-based features; 3 Text format features; 4 Word length features; 5 Right context word type features (RC); 6 Word type features; 7 Abbreviation feature). Length (LT); w2: Weight based feature relevance criterion.