Table 3.
Distribution of the raw text, lexical, and morpho-syntactic features in the complex and simple sets of sentences for the three corpora.
| Feature | Terence Compl | Terence Simp | Terence Diff | Teacher Compl | Teacher Simp | Teacher Diff | PaCCSS–IT Compl | PaCCSS–IT Simp | PaCCSS–IT Diff |
|---|---|---|---|---|---|---|---|---|---|
| Raw text features | |||||||||
| Sentence length | 19.92 | 18.61 | 1.31 | 21.25 | 18.56 | 2.70 | 8.97 | 8.0 | 0.97 |
| Word length | 4.89 | 4.80 | 0.09 | 4.74 | 4.70 | 0.04 | 4.70 | 4.54 | 0.16 |
| Lexical features | |||||||||
| % BIV | 75.59 | 77.31 | −1.72 | 78.53 | 77.77 | 0.75 | 72.19 | 77.08 | −4.88 |
| % FO | 78.14 | 79.82 | −1.67 | 80.21 | 82.73 | −2.51 | 75.03 | 75.76 | −0.73 |
| % HU | 13.08 | 12.15 | 0.93 | 11.98 | 9.68 | 2.30 | 20.19 | 19.82 | 0.37 |
| % HA | 8.77 | 8.03 | 0.74 | 7.81 | 7.60 | 0.21 | 4.78 | 4.42 | 0.36 |
| Type/Token ratio | 0.942 | 0.941 | −0.001 | 0.921 | 0.913 | 0.008 | 0.97 | 0.99 | −0.02 |
| Morpho–syntactic features | |||||||||
| Morpho–syntactic information | |||||||||
| Adjectives | 5.87 | 5.97 | −0.01 | 5.34 | 5.11 | 0.23 | 5.74 | 7.90 | −2.15 |
| Adverbs | 6.82 | 6.97 | −0.15 | 7.62 | 6.73 | 0.89 | 12.26 | 9.95 | 2.31 |
| Articles | 8.79 | 8.73 | 0.07 | 8.24 | 8.69 | −0.45 | 11.04 | 12.71 | −1.67 |
| Conjunctions—coordinating | 3.57 | 3.76 | −0.19 | 3.98 | 4.72 | −0.74 | 2.66 | 3.45 | −0.79 |
| Conjunctions—subordinating | 1.75 | 2.16 | −0.41 | 1.73 | 1.09 | 0.64 | 0.32 | 0.30 | 0.02 |
| Prepositions | 13.31 | 12.50 | 0.81 | 10.77 | 10.51 | 0.25 | 5.98 | 6.21 | −0.23 |
| Pronouns | 5.33 | 5.04 | 0.28 | 17.69 | 17.15 | 0.54 | 7.23 | 4.14 | 3.09 |
| Pronouns—relative | 0.87 | 0.81 | 0.06 | 0.85 | 0.28 | 0.57 | 0.27 | 0.1 | 0.17 |
| Pronouns—clitic | 2.78 | 2.61 | 0.17 | 5.25 | 2.74 | 2.51 | 2.47 | 1.60 | 0.87 |
| Punctuation | 11.57 | 11.54 | 0.03 | 15.53 | 15.52 | 0.01 | 20.5 | 15.13 | 5.36 |
| Numbers | 1.07 | 0.91 | 0.15 | 2.25 | 2.47 | −0.22 | 2.25 | 2.47 | −0.22 |
| Lexical density | 0.59 | 0.60 | −0.00 | 0.58 | 0.62 | −0.04 | 0.61 | 0.60 | 0.00 |
| Inflectional morphology | |||||||||
| Indicative mood | 61.23 | 64.4 | −3.17 | 57.14 | 70.87 | −13.73 | 68.14 | 68.31 | −0.17 |
| Participial mood | 6.95 | 4.63 | 2.32 | 3.95 | 2.84 | 1.11 | 3.65 | 2.42 | 1.23 |
| Gerundive mood | 3.44 | 2.62 | 0.83 | 1.56 | – | 1.56 | 0.46 | 0.04 | 0.42 |
| Infinitive mood | 15.98 | 17.64 | −1.66 | 22.1 | 19.67 | 2.43 | 12.04 | 11.65 | 0.39 |
| Subjunctive mood | 1.00 | 0.57 | 0.42 | 0.58 | – | 0.58 | 0.78 | 0.05 | 0.73 |
| Conditional mood | 0.19 | 0.12 | 0.07 | 0.84 | 0.18 | 0.66 | 3.34 | 0.001 | 3.33 |
| Present tense | 6.21 | 4.74 | 1.47 | 43.31 | 90.19 | −46.87 | 79.18 | 80.91 | −1.73 |
| Imperfect tense | 50.66 | 52.97 | −2.31 | 16.39 | 0.82 | 15.57 | 2.89 | 4.29 | −1.40 |
| Past tense | 40.98 | 39.97 | 1.01 | 27.45 | – | 27.45 | 1.33 | 1.57 | −0.24 |
| 2nd person, singular | 0.44 | 0.51 | −0.07 | 2.77 | 0.37 | 2.4 | 0.60 | 0.44 | 0.15 |
| 3rd person, singular | 64.9 | 66.09 | −1.19 | 48.59 | 53.31 | −4.72 | 62.31 | 58.13 | 4.18 |
| 1st person, plural | – | 0.09 | −0.09 | 2.95 | 4.13 | −1.18 | 1.51 | 1.84 | −0.33 |
| 2nd person, plural | – | – | – | 0.42 | 0.32 | 0.10 | 0.30 | 0.19 | 0.11 |
| 3rd person, plural | 18.69 | 19.14 | −0.45 | 13.86 | 16.55 | −2.69 | 8.12 | 7.83 | 0.28 |
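Compl and Simp denote the complex and simple sets of sentences; Diff is their difference (Compl − Simp). Variations that are statistically significant according to the Wilcoxon signed-rank test (p < 0.05) are shown in bold.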
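As an illustration of how a Diff value and the signed-rank statistic behind the significance marking could be obtained for one feature, the following is a minimal sketch, not the procedure actually used to build the table. It assumes each feature is measured per sentence and that complex and simple sentences are aligned in pairs. The helper names `mean_diff` and `wilcoxon_w` and the sentence-length values are hypothetical.

```python
from statistics import mean


def mean_diff(complex_vals, simple_vals):
    """Diff as in the table: mean over complex minus mean over simple sentences."""
    return mean(complex_vals) - mean(simple_vals)


def wilcoxon_w(complex_vals, simple_vals):
    """Wilcoxon signed-rank statistic W = min(W+, W-) for paired samples.

    Zero differences are dropped; tied absolute differences share their
    average rank.
    """
    diffs = [c - s for c, s in zip(complex_vals, simple_vals) if c != s]
    ordered = sorted(diffs, key=abs)
    ranks = [0.0] * len(ordered)
    i = 0
    while i < len(ordered):
        # Find the end of the run of equal absolute differences.
        j = i
        while j + 1 < len(ordered) and abs(ordered[j + 1]) == abs(ordered[i]):
            j += 1
        avg_rank = (i + j) / 2 + 1  # average of the 1-based ranks i+1 .. j+1
        for k in range(i, j + 1):
            ranks[k] = avg_rank
        i = j + 1
    w_plus = sum(r for d, r in zip(ordered, ranks) if d > 0)
    w_minus = sum(r for d, r in zip(ordered, ranks) if d < 0)
    return min(w_plus, w_minus)


# Hypothetical sentence lengths (in tokens) for six aligned sentence pairs.
complex_len = [22, 18, 25, 19, 30, 17]
simple_len = [20, 18, 21, 20, 24, 14]

diff = mean_diff(complex_len, simple_len)
w = wilcoxon_w(complex_len, simple_len)
print(f"Diff = {diff:.2f}, W = {w}")
```

Deciding significance at p < 0.05 additionally requires comparing W against the critical value for the number of non-zero pairs, or computing a p-value, for example with a statistics library.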