Table 2.
General statistics for the Spanish and English corpora and sub-corpora
| | Whole Corpus ES | Whole Corpus EN | 1stW ES | 1stW EN | 2ndW ES | 2ndW EN | 3rdW ES | 3rdW EN |
|---|---|---|---|---|---|---|---|---|
| Tokens | 963,212 | 1,807,845 | 280,236 | 364,657 | 364,881 | 477,991 | 269,271 | 224,459 |
| Words | 837,772 | 1,574,108 | 245,001 | 317,268 | 293,885 | 415,257 | 216,010 | 195,907 |
| Sentences | 30,871 | 64,967 | 9,662 | 13,535 | 10,082 | 17,258 | 7,164 | 7,665 |
| Types | 37,671 | 44,834 | 18,445 | 20,463 | 20,54 | 23,234 | 18,132 | 15,511 |
| TTR | 3.86 | 2.12 | 6.58 | 5.61 | 5.62 | 4.86 | 6.73 | 6.91 |
| STTR | 43.26 | 45.68 | 44.01 | 45.37 | 43.11 | 45.90 | 42.62 | 45.90 |
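
The TTR row appears to be the type-token ratio expressed as a percentage (types divided by tokens, times 100); the sketch below reproduces the 1stW figures under that assumption. The `sttr` helper is a hypothetical illustration of a standardized TTR: it assumes the ratio is averaged over consecutive full chunks of 1,000 tokens, a common default that the table itself does not state.

```python
def ttr(types: int, tokens: int) -> float:
    """Type-token ratio as a percentage (assumed definition: types / tokens * 100)."""
    return 100.0 * types / tokens


def sttr(tokens: list[str], chunk_size: int = 1000) -> float:
    """Mean TTR over consecutive full chunks of `chunk_size` tokens.

    chunk_size=1000 is an assumption; a trailing partial chunk is ignored.
    """
    starts = range(0, len(tokens) - chunk_size + 1, chunk_size)
    chunks = [tokens[i:i + chunk_size] for i in starts]
    if not chunks:
        raise ValueError("text is shorter than one chunk")
    # TTR of each chunk: distinct tokens over chunk length, as a percentage.
    return sum(100.0 * len(set(c)) / len(c) for c in chunks) / len(chunks)


# Types and tokens taken from the 1stW columns of Table 2.
print(round(ttr(18_445, 280_236), 2))  # ES
print(round(ttr(20_463, 364_657), 2))  # EN
```

Because raw TTR falls as a text grows longer, the chunk-averaged variant is the more suitable figure for comparing sub-corpora of different sizes.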