Table 2.
Description of lexical databases
Language | Database | Number of examined wordforms | Number of minimal pairs | Percentage of words with phonological neighbors |
---|---|---|---|---|
German | WebCelex | 50,655 | 13,289 | 22.41 |
English | WebCelex | 38,890 | 24,063 | 29.01 |
Dutch | WebCelex | 117,237 | 30,503 | 17.16 |
Swedish | NST lexical database for Swedish | 97,325 | 18,842 | 13.38 |
Norwegian | NST lexical database for Norwegian | 65,142 | 20,239 | 17.63 |
French | Lexique 3.81 | 40,138 | 22,893 | 31.53 |
Italian | PhonItalia 1.10 | 42,232 | 11,617 | 22.39 |
Spanish | BuscaPalabras | 26,349 | 10,494 | 26.41 |
Czech | Phonological Corpora of Czech | 44,869 | 11,123 | 25.09 |
Greek | GreekLex 2.1 | 35,047 | 5,964 | 17.58 |
Turkish | Turkish Electronic Living Lexicon (TELL) | 15,259 | 9,079 | 41.88 |
Korean | K-SPAN | 55,599 | 45,019 | 44.01 |
All lexical databases can be accessed freely online: German, English and Dutch: WebCelex (http://celex.mpi.nl/); Swedish: NST lexical database for Swedish (https://www.nb.no/sprakbanken/en/resource-catalogue/oai-nb-no-sbr-22/); Norwegian: NST lexical database for Norwegian (https://www.nb.no/sprakbanken/en/resource-catalogue/oai-nb-no-sbr-23/); French: Lexique 3.81 (http://www.lexique.org) (66); Italian: PhonItalia1.10 (67); Spanish: BuscaPalabras (68); Czech: Phonological Corpus of Czech (https://ujc.avcr.cz/phword/); Greek: GreekLex 2.1 (69); Turkish: TELL (http://linguistics.berkeley.edu/TELL/); Korean: K-SPAN (70).