Table 4.
Measure | European Parliament (25–30M) | Cosmas II (2000M) | deWac (1411M) | Google (set to 1) | DLex (100M) |
---|---|---|---|---|---|
Past tense verb forms | 0.091 | 3.787 | 0.032 | 37175.20 | 8.034 |
Present tense verb forms | 1.123 | 5.439 | 0.034 | 105627.80 | 3.498 |
Verb difference scores | −1.032 | −1.652 | −0.002 | −68452.6 | 4.536 |
Lower/upper 95% CI of the verb difference scores | −2.779/0.715 | −5.843/2.541 | −0.0171/0.0130 | −247001.4/110096.2 | −1.750/10.822 |
Adverbs indicating the past | 58.879 | 27.774 | 0.183 | 417298.0 | 18.680 |
Adverbs indicating the future | 11.537 | 17.012 | 0.184 | 421338.4 | 23.805 |
Adverb difference scores | 47.343 | 10.762 | −0.001 | −4040.404 | −5.125 |
Lower/upper 95% CI of the adverb difference scores | 1.126/93.559 | −20.667/42.191 | −0.239/0.237 | −528146.7/520065.9 | −45.065/34.815 |
“Past tense verb forms” and “Present tense verb forms” give the averaged, normalized frequencies of the verbs used in the recent and future conditions, respectively. “Adverbs indicating the past” and “Adverbs indicating the future” give the averaged, normalized frequencies of the adverbs used in the recent and future conditions. “Verb difference scores” and “Adverb difference scores” are obtained by subtracting the score for the verbs/adverbs in the future condition from the score for the verbs/adverbs in the recent condition. Negative difference scores therefore indicate lower frequencies for past than for present tense verbs, and for past than for future adverbs. For each corpus we show the number of tokens in millions (M) in brackets. Below each row of difference scores we list first the lower and then the upper bound of the 95 percent confidence interval.
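The subtraction described in the note can be reproduced directly from the tabulated means. The following is a minimal sketch, not the authors' analysis code: the dictionary layout and the helper names `difference_score` and `ci_excludes_zero` are assumptions made for illustration, and the values are copied from Table 4. The second helper only reports whether a listed interval lies entirely on one side of zero; the intervals themselves are taken from the table rather than recomputed.

```python
# Mean normalized frequencies copied from Table 4 as
# (recent condition, future condition) pairs. Layout is illustrative.
VERBS = {
    "European Parliament": (0.091, 1.123),
    "Cosmas II": (3.787, 5.439),
    "deWac": (0.032, 0.034),
    "Google": (37175.20, 105627.80),
    "DLex": (8.034, 3.498),
}
ADVERBS = {
    "European Parliament": (58.879, 11.537),
    "Cosmas II": (27.774, 17.012),
    "deWac": (0.183, 0.184),
    "Google": (417298.0, 421338.4),
    "DLex": (18.680, 23.805),
}
# Lower and upper 95% CI bounds of the adverb difference scores (Table 4).
ADVERB_CI = {
    "European Parliament": (1.126, 93.559),
    "Cosmas II": (-20.667, 42.191),
    "deWac": (-0.239, 0.237),
    "Google": (-528146.7, 520065.9),
    "DLex": (-45.065, 34.815),
}


def difference_score(recent: float, future: float) -> float:
    """Recent-condition score minus future-condition score (hypothetical helper)."""
    return recent - future


def ci_excludes_zero(lower: float, upper: float) -> bool:
    """True if the interval lies entirely above or entirely below zero."""
    return lower > 0 or upper < 0


if __name__ == "__main__":
    for corpus, (recent, future) in VERBS.items():
        print(f"verbs   {corpus}: {difference_score(recent, future):.3f}")
    for corpus, (recent, future) in ADVERBS.items():
        flag = ci_excludes_zero(*ADVERB_CI[corpus])
        print(f"adverbs {corpus}: {difference_score(recent, future):.3f} "
              f"(CI excludes zero: {flag})")
```

Because the tabulated means are rounded to three decimals, a recomputed difference can deviate from the printed one in the last digit, as with the European Parliament adverb score (47.342 recomputed versus 47.343 in the table).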