Skip to main content
. 2023 Aug 28;120(36):e2215710120. doi: 10.1073/pnas.2215710120

Table 2.

Description of lexical databases

Language Database Number of examined wordforms Number of minimal pairs Percentage of words with phonological neighbors
German WebCelex 50,655 13,289 22.41
English WebCelex 38,890 24,063 29.01
Dutch WebCelex 117,237 30,503 17.16
Swedish NST lexical database for Swedish 97,325 18,842 13.38
Norwegian NST lexical database for Norwegian 65,142 20,239 17.63
French Lexique 3.81 40,138 22,893 31.53
Italian PhonItalia 1.10 42,232 11,617 22.39
Spanish BuscaPalabras 26,349 10,494 26.41
Czech Phonological Corpora of Czech 44,869 11,123 25.09
Greek GreekLex 2.1 35,047 5,964 17.58
Turkish Turkish Electronic Living Lexicon (TELL) 15,259 9,079 41.88
Korean K-SPAN 55,599 45,019 44.01

All lexical databases can be accessed freely online: German, English and Dutch: WebCelex (http://celex.mpi.nl/); Swedish: NST lexical database for Swedish (https://www.nb.no/sprakbanken/en/resource-catalogue/oai-nb-no-sbr-22/); Norwegian: NST lexical database for Norwegian (https://www.nb.no/sprakbanken/en/resource-catalogue/oai-nb-no-sbr-23/); French: Lexique 3.81 (http://www.lexique.org) (66); Italian: PhonItalia1.10 (67); Spanish: BuscaPalabras (68); Czech: Phonological Corpus of Czech (https://ujc.avcr.cz/phword/); Greek: GreekLex 2.1 (69); Turkish: TELL (http://linguistics.berkeley.edu/TELL/); Korean: K-SPAN (70).