Table 1.
Statistics on the training data (ART/CoreSC corpus)
| Measure | Bac | Con | Exp | Goa | Met | Mot | Obs | Res | Mod | Obj | Hyp | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Number of sentences | 7606 | 3636 | 3858 | 582 | 4281 | 541 | 5410 | 8404 | 3656 | 1161 | 780 | 39 915 |
| Number of words | 193 930 | 102 173 | 93 882 | 16 564 | 107 309 | 13 737 | 123 394 | 224 353 | 99 313 | 29 215 | 21 315 | 1 025 185 |
| Percentage of sentences | 19 | 9 | 10 | 1 | 11 | 1 | 14 | 21 | 9 | 3 | 2 | |
| Number of words p/s (mean) | 25.5 | 28.1 | 24.33 | 28.46 | 25.07 | 25.39 | 22.81 | 26.7 | 27.16 | 25.16 | 27.33 | |
| Number of words p/s (SD) | 12.32 | 12.49 | 20.6 | 12.69 | 11.4 | 10.34 | 11.44 | 12.65 | 14.76 | 11.16 | 12.01 | |
| κ-IAA | 0.87 | 0.89 | 0.65 | 0.60 | 0.74 | 0.46 | 0.79 | 0.78 | 0.43 | 0.81 | 0.46 |