Skip to main content
. 2021 Feb 9;7(1):20190063. doi: 10.1515/lingvan-2019-0063

Table 1:

Basic information on corpora and languages, with Glottolog language identification codes (Hammarström et al. 2018). Note that the language we call “Texistepec” here is often also referred to as “Texistepec Popoluca”.

Language Typology Corpus
Language Glottocode Family Word order Stress versus tone Vowel length Av. Segments/word Av. Morphemes/word Texts Speakers Words (total) Words (duration study) Reference
Baure baur1253 Arawakan VSO Stress No 5.73 1.86 34 9 17 563 2 992 Danielsen et al. (2009)
Bora bora1263 Boran SOV Tone Yes 7.13 2.21 37 32 29 795 7 080 Seifart (2009)
Chintang chhi1245 Sino-Tibetan SOV Stress No 5.14 1.81 40 51 37 731 5 096 Bickel et al. (2011)
Dutch dutc1256 Indo-European SOV Stress Yes 3.85 n/a 17 42 39 448 8 128 CGN-consortium (2003)
English stan1293 Indo-European SVO Stress (Tenseness) 3.70 1.09 47 80 56 136 8 544 Godfrey et al. (1992)
Even lamu1253 Tungusic SOV Stress Yes 5.79 1.91 67 31 37 394 12 116 Pakendorf et al. (2010)
Hoocąk hoch1243 Siouan SOV Stress Yes 6.64 1.71 62 26 23 176 7 440 Hartmann (2013)
N||ng nuuu1241 ǃUi-Taa SVO tone No 3.45 1.14 33 7 25 850 5 112 Güldemann et al. (2011)
Sakha yaku1245 Turkic SOV Stress Yes 5.77 1.68 16 22 31 139 8 560 Pakendorf (2007)
Texistepec texi1237 Mixe-Zoquean SOV Stress Yes 5.14 1.81 6 1 21 315 4 044 Wichmann (1996)