Skip to main content
. 2024 Apr 9;56(7):6826–6861. doi: 10.3758/s13428-024-02395-3

Fig. 6.

Fig. 6

The increase in the number of sublexical units occurring at least twice (y-axis) as the corpus size increases (x-axis). Corpora were randomly resampled 100 times each with sizes of n = 10, 20, 50, 100, 500, 1000, 2000, 4000, 6000, and 10000 words. Top row: phonological units; middle row: graphemic units; bottom row: phonographemic units. Left column: phonographeme level; middle and right columns: onset and rime level, respectively. The solid blue line reflects the mean number of units across the 100 iterations per sample size; dashed lines reflect the 95% confidence interval. The horizontal red line reflects the number of units actually observed in the full corpus of 13338 words