Skip to main content
. 2012 Feb 8;28(7):991–1000. doi: 10.1093/bioinformatics/bts071

Table 1.

Statistics on the training data (ART/CoreSC corpus)

Measure Bac Con Exp Goa Met Mot Obs Res Mod Obj Hyp Total
Number of sentences 7606 3636 3858 582 4281 541 5410 8404 3656 1161 780 39 915
Number of words 193 930 102 173 93 882 16 564 107 309 13 737 123 394 224 353 99 313 29 215 21 315 1 025 185
Percentage of sentences 19 9 10 1 11 1 14 21 9 3 2
Number of words p/s (mean) 25.5 28.1 24.33 28.46 25.07 25.39 22.81 26.7 27.16 25.16 27.33
Number of words p/s (SD) 12.32 12.49 20.6 12.69 11.4 10.34 11.44 12.65 14.76 11.16 12.01
κ-IAA 0.87 0.89 0.65 0.60 0.74 0.46 0.79 0.78 0.43 0.81 0.46