Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2021 Oct 1.
Published in final edited form as: J Mem Lang. 2020 Jun 7;114:104145. doi: 10.1016/j.jml.2020.104145

Individual differences in learning the regularities between orthography, phonology and semantics predict early reading skills

Noam Siegelman 1, Jay G Rueckl 1,2, Laura M Steacy 3, Stephen J Frost 1, Mark van den Bunt 1, Jason D Zevin 1,4, Mark S Seidenberg 5, Kenneth R Pugh 1,2,6, Donald L Compton 3, Robin D Morris 7
PMCID: PMC7373223  NIHMSID: NIHMS1601855  PMID: 32694882

Abstract

Statistical views of literacy development maintain that proficient reading requires the assimilation of myriad statistical regularities present in the writing system. Indeed, previous studies have tied statistical learning (SL) abilities to reading skills, establishing the existence of a link between the two. However, some issues are currently left unanswered, including questions regarding the underlying bases for these associations as well as the types of statistical regularities actually assimilated by developing readers. Here we present an alternative approach to study the role of SL in literacy development, focusing on individual differences among beginning readers. Instead of using an artificial task to estimate SL abilities, our approach identifies individual differences in children’s reliance on statistical regularities as reflected by actual reading behavior. We specifically focus on individuals’ reliance on regularities in the mapping between print and speech versus associations between print and meaning in a word naming task. We present data from 399 children, showing that those whose oral naming performance is impacted more by print-speech regularities and less by associations between print and meaning have better reading skills. These findings suggest that a key route by which SL mechanisms impact developing reading abilities is via their role in the assimilation of sub-lexical regularities between printed and spoken language -and more generally, in detecting regularities that are more reliable than others. We discuss the implications of our findings to both SL and reading theories.

Keywords: Statistical learning, Reading acquisition, Individual differences, Print-speech regularities


One of the basic motivations to study Statistical Learning (SL) mechanisms is their presumed role across cognition. Thus, researchers study SL processes not just to understand the “SL mechanism(s)” in isolation, but rather as a way to gain new insights as to the processes involved in other cognitive domains. This premise is rooted in two research traditions. The first comprises works of pioneering computational modeling that demonstrate how incorporating statistical learning principles can account for complex linguistic behavior (Rumelhart & McClelland, 1986; Seidenberg & McClelland, 1989). The second are experimental demonstrations showing that SL computations can potentially account for multiple cognitive functions, including segmentation of continuous input (e.g. Aslin, Saffran, & Newport, 1998), categorization (e.g. Maye, Werker, & Gerken, 2002), and prediction of upcoming signal (e.g. Dale, Duran, & Morehead, 2012). Together, these two lines of research led to the view that SL plays a role in multiple cognitive domains, from low level perception (e.g. Barakat, Seitz, & Shams, 2013), to complex cognitive functions such as music appreciation (e.g. Salimpoor, Zald, Zatorre, Dagher, & McIntosh, 2015), face recognition (e.g. Dotsch, Hassin, & Todorov, 2017), and the acquisition and processing of spoken and written language (see Arciuli, 2018; Romberg & Saffran, 2010 for reviews).

In this vein, over the course of the past decades reading research has become increasingly grounded in the idea that proficient reading requires the assimilation of statistical regularities present in the writing system. This statistical view of writing systems had an effect on virtually all sub-domains of reading research. As such, multiple studies document statistical regularities that are available to readers as they acquire and use their writing systems. These include associations between letters and speech sounds (Harm & Seidenberg, 2004; Seidenberg & McClelland, 1989) and between different orthographic units (including the correlations between different letters, e.g. Gingras & Sénéchal, 2019, and between different words, e.g. Fine & Jaeger, 2013), as well as more complex relationships between orthographic units and morphological and semantic representations (Seidenberg & Gonnerman, 2000; Ulicheva, Harvey, Aronoff, & Rastle, 2018), and between orthographic units and stress patterns (Arciuli, 2018), among others (see Sawi & Rueckl, 2019 for review).

Indeed, studies from the reading literature propose that patterns of reading behavior can be best explained when considering this rich array of statistical regularities that are embedded in the written input (see, e.g., Frost, 2012; Sawi & Rueckl, 2019; Seidenberg, 2011 for reviews). This already suggests that SL -the mechanism(s) responsible for extracting regularities from the sensory input -plays an important role in the process of literacy acquisition and later on in guiding efficient reading. Attempts to further solidify this claim come from studies that use separate tasks to estimate individual’s SL abilities (using for example the embedded triplet task or artificial grammar learning paradigms) and examine the relation between performance on these tasks and reading behavior. Thus, studies of individual differences have tied SL performance to variation in different aspects of reading skills both in first language (Arciuli & Simpson, 2012; Spencer, Kaschak, Jones, & Lonigan, 2014; Torkildsen, Arciuli, & Wie, 2019), and second language (Frost, Siegelman, Narkiss, & Afek, 2013). Still, the magnitude of the correlations between SL and reading varies across studies (see Arnon, 2019b; Siegelman, Bogaerts, Christiansen, & Frost, 2017), and recent studies suggest that correlations are observed only when using specific SL tasks and/or reading measures (Elleman, Steacy, & Compton, 2019; Qi, Sanchez Araujo, Georgan, Gabrieli, & Arciuli, 2019; Schmalz, Moll, Mulatti, & Schulte-Körne, 2019). Relatedly, studies of clinical populations show that individuals with reading disabilities are generally characterized by lower SL abilities (Gabay, Thiessen, & Holt, 2015; Kahta & Schiff, 2019; Sigurdardottir et al., 2017), although here again, there is some debate regarding the actual magnitude of these effects (see Schmalz, Altoè, & Mulatti, 2017).

Together, these studies provide an important proof of concept regarding the existence of some link between SL computations and reading skills. Importantly, however, questions remain not only in regards to why some studies fail to observe a relationship between SL performance and reading, but also as to the exact SL computations involved in reading and how variation in SL abilities gives rise to variability in literacy acquisition. These questions arise at least in part due to limited external validity of SL studies, which typically deal with learning of a single type of regularity over a short period of time (see Erickson & Thiessen, 2015; Frost, Armstrong, & Christiansen, 2019, for detailed discussion). In the context of reading, it is unclear whether such simplified form of learning can scale-up to account for the complex nature of the statistical knowledge that is required to account for reading behavior, which consists of multiple types of regularities, concurrently presented in various grain sizes and across modalities, and learned over extended periods of time.

In the present paper we report two studies that use an alternative approach to study the link between SL and reading, focusing on individual differences in beginning readers. Namely, instead of documenting individual differences in SL by using a simplified artificial task and then relating the individual differences observed on this task to some linguistic function (e.g. some component skill of reading), our approach focuses on identifying individual differences in children’s reliance on statistical regularities as reflected directly in their word naming behavior. We believe that this approach has the promise of informing SL research as well as the study of reading. On the SL side, it can help inform researchers about the subtle regularities that humans are able to assimilate “in the wild”. In parallel, it has implications for understanding reading and reading acquisition by providing insights into the processes involved in typical (or impaired) reading acquisition. In other words, the current study demonstrates the theoretical power of a statistical approach to reading and serves to illustrate the directions that the SL literature should be extended to in order to address the issues of interest to reading researchers.

More concretely, in the experiments reported below we assessed how lexical and sub-lexical statistical regularities impact the word naming behavior of individual children. We then asked whether children whose word naming performance exhibits more sensitivity to the reliable statistical regularities are better readers overall (as reflected in standardized reading measures) compared to those who display lesser sensitivity to this statistical information. In parallel, we examined whether there is a relation between reading skill and individuals’ reliance on more arbitrary associations that provide less efficient source of information for word naming. Given variability in SL abilities (e.g. Siegelman & Frost, 2015), and their presumed relation to reading abilities (e.g. Arciuli & Simpson, 2012; Frost et al., 2013), we hypothesized that there would be systematic variability in the degree to which individuals rely on different statistical regularities in word naming, and that those who show greater sensitivity to the reliable regularities, and lesser sensitivity to arbitrary associations, would display better reading skills.

Literacy acquisition and print-speech correspondences

As a starting point, we focus on statistical regularities that are particularly important for the development of skilled reading: the correspondences between letters and sounds. These regularities are considered to play a key role in learning the mappings between an orthographic string to the spoken form it represents (phonological decoding), which is widely considered to be one of the fundamental skills underlying proficient reading (e.g. Ehri, 2005; Share, 1999). This claim is supported by an extensive body of evidence, including correlations between phonological decoding and overall reading skills (e.g., Kearns, Rogers, Koriakin, & Al Ghanem, 2016; Perfetti, Beck, Bell, & Hughes, 1987; Shankweiler et al., 1999), impaired phonological decoding skills in populations with reading disabilities (e.g., Scarborough, 1998; Vellutino, Fletcher, Snowling, & Scanlon, 2004), and the impact of orthographic-phonological (O-P) regularities on word reading in skilled adult readers (e.g. Glushko, 1979; Jared, McRae, & Seidenberg, 1990).

Operationally, the strength of the association between an orthographic form and its phonological realization can be quantified by various measures (see Borleffs, Maassen, Lyytinen, & Zwarts, 2017; Siegelman, Kearns, & Rueckl, 2020, for reviews). A common measure is O-P consistency, which is a function of the number of words with a similar pronunciation of a given orthographic unit (‘friends’) and the number of words where the same orthographic unit is realized differently (‘enemies’). Typically, this measure focuses on the body-rime level (body-rime consistency), and is defined as the ratio friends / (friends + enemies) of a word body (e.g. ead in the word head; e.g. Jared, McRae, & Seidenberg, 1990). Alternatively, a consistency measure at a smaller-grain size focuses on regularities at the grapheme-phoneme level (e.g. Chateau & Jared, 2003; Treiman, Mullennix, Bijeljac-Babic, & Richmond-Welty, 1995). Thus, the consistency of the grapheme i in the word mint (where i → /ɪ/) is a function of the number of words where i is similarly pronounced as /ɪ/ (e.g., bin, sing, etc.) and the number of words where i is pronounced otherwise (e.g., pint). Indeed, a variety of findings make clear that skilled readers read faster and more accurately words with O-P mappings that are more “regular” or consistent at multiple grain-sizes, and that adults are particularly impacted by body-rime regularities (e.g. Cortese & Simpson, 2000; Jared, 2002). Developmentally, sensitivity to O-P correspondences is gradually acquired over the course of typical reading acquisition (e.g., Sénéchal, Gingras, & L’Heureux, 2016; Weekes, Castles, & Davies, 2006), again with increased reliance on larger grain sizes later in development (Treiman & Kessler, 2006).

Additional sources of information for word reading

Importantly, sub-lexical O-P regularities are not the only source of information available to readers. One additional factor is word frequency -the rate of occurrence of an orthographic form. Although measures of word frequency typically ignore phonological and semantic ambiguity, to a first approximation they can be taken as an index of how often the representation of an orthographic word form is associated with representations of how that word is pronounced and what it means. Thus, frequency captures some of the statistical structure in the mappings from orthography to phonology (O-P) and from orthography to semantics (O-S) (as well as the mapping between semantics and phonology, S-P) at the whole-word level. Other factors have to do specifically with the mapping between orthography to semantics. Generally, although O-S mapping is far less systematic than the mapping from orthography to phonology, it does embody statistical regularities and skilled readers assimilate and make use of these regularities. To a large extent, given the relatively arbitrary nature of the mapping, the O-S regularities that underlie word reading largely involve lexical-level correspondences between word forms and word meanings, as manifest in the effects of lexical ambiguity (e.g. Borowsky & Masson, 1996). Note, however, that because some words that are similar in form are also similar in meaning, the O-S mapping also embodies sub-lexical regularities (Marelli & Amenta, 2018; Monaghan, Chater, & Christiansen, 2005). This is perhaps most apparent in the ubiquitous effects of morphological structure on word reading (e.g. Grainger, Colé, & Segui, 1991; Rastle, Davis, Marslen-Wilson, & Tyler, 2000). From a statistical perspective, these effects arise because morphologically related words are typically related in both form and meaning and these form-meaning regularities are assimilated over the course of reading acquisition (Rueckl, 2010; Seidenberg & Gonnerman, 2000). Regardless of these important morphological regularities, however, the structure of the O-S mapping is generally more arbitrary than the mapping between orthography and phonology.

The impact of different sources of information related to O-P and O-S, as well as their interactions, are well demonstrated by the results of the word naming study by Strain, Patterson, and Seidenberg (1995). Strain et al. investigated the effect of imageability (the ease of eliciting a mental image, Paivio, Yuille, & Madigan, 1968), frequency, and O-P consistency on word naming in adult readers of English. Because imageability is a characteristic of word meaning, Strain et al. treated the imageability effect as a marker of the contribution of O-S processes. Strain et al. found that high-imageability words were named faster and more accurately than low-imageability words. Importantly, the magnitude of this effect varied with both word frequency and O-P consistency and was largest for low frequency, inconsistent words. These results illustrate how multiple statistical regularities interact to jointly determine behavior in a given task. In this particular case, the impact of imageability depends on the degree to which a word can be efficiently named on the basis of lexical and sub-lexical O-P regularities.

It should be noted that in contrast to word frequency and O-P consistency, which are regularities related to correspondences in the mappings between orthography, phonology, and semantics per se, imageability is related to statistical structure within the semantic domain. Specifically, imageability is thought to be related to degree of intercorrelation among semantic features (Harm & Seidenberg, 2004; Woollams, Lambon Ralph, Madrid, & Patterson, 2016). The degree of intercorrelation (or ‘coherent covariation’, McClelland & Rogers, 2003) among semantic features determines how strongly and/or quickly a semantic representation can be activated (Woollams et al., 2016). Therefore, higher imageability of a word leads to more efficient computation of its meaning, and hence to stronger involvement of O-S(-P) processes.

From a developmental perspective, just as readers must learn to balance the influence of O-P regularities at different grain sizes, so too must they learn to make use of both O-S and O-P regularities. The question is then how do children eventually achieve the efficient division of labor between O-P and O-S processes exhibited by skilled readers. Computational models suggest that this developmental change is a consequence of how differences in the structure of the O-P and O-S mappings impact the statistical learning mechanism that attunes the reader to this structure (Harm & Seidenberg, 2004). Behaviorally, then, early readers are expected to exhibit a different weighing of the regularities in O-P and O-S mappings compared to adult readers in the same task.

Individual differences in the sensitivity to O-P and O-S processes

Even though there is an expected developmental trajectory at the group-level, not all individuals are predicted to follow it and similarly rely on the different regularities available to them in the input. As noted above, recent research using SL paradigms has revealed that there is substantial variability in SL ability. This suggests that individuals may differ in the degree to which they rely on different associations in a given task. That is, because not all individuals are similarly adept at learning the statistical regularities embedded in their environment, some may be better than others in finding the optimal balance in the use of different types of regularities (specifically O-P and O-S associations) for solving a particular reading task. In the context of a word naming task, which involves the mapping between orthographic and phonological word forms, we predict that some individuals will be better able to take advantage of the more systematic structure of the O-P mapping, whereas others will be more reliant on relatively arbitrary O-S associations. Furthermore, because individual differences in SL ability are correlated with differences in reading achievement (see above), we might further hypothesize that differences in the relative influence of O-P and O-S associations (i.e. the division of labor, Harm & Seidenberg, 2004) will be associated with differences in reading skill in general. Specifically, we hypothesize that readers who display greater sensitivity to O-P regularities in a naming task will have better reading skills than those who display less sensitivity to these regularities. In contrast, readers who rely more heavily on OS processes are expected to have lower reading skills. This hypothesis is compatible with computational models that simulate reading behavior at the individual level, suggesting that it may explain observed variance in behavioral studies (Plaut, McClelland, Seidenberg, & Patterson, 1996; Rueckl, Zevin, & Wolf VII, 2019; Zevin & Seidenberg, 2006).

To re-iterate, our approach focuses on estimating individuals’ reliance on different types of regularities in the context of a word naming task, where O-P regularities provide a more direct (and hence more efficient) source of information than O-S. Thus, sensitivity to O-P and O-S in word naming should not be taken to reflect an individual’s sensitivity to these processes in general (see more on this issue in the General Discussion). Our claim is that the reliance on O-P and O-S(-P) in a word naming task reflects an individual’s general tendency to rely on reliable (vs. arbitrary) regularities in the writing system, and will thus be related to overall reading skills. This hypothesis is consistent with a number of previous studies documenting variation in individuals’ reliance on different cues in word naming as a function of reading skill. In fact, preliminary suggestions regarding individual differences in code utilization among skilled readers can be traced back to the 1970’s: In their work on the effects of O-P regularities in the reading of English, Baron and Strawson (1976) distinguished between “Phoenician” readers who are relatively reliant on sub-lexical spelling to sound rules, and “Chinese” readers who are more dependent on lexical-level processing. Although the Phoenician vs. Chinese distinction did not hold up to further scrutiny (e.g. Brown, Lupker, & Colombo, 1994; Yap, Balota, Sibley, & Ratcliff, 2012), the core intuition underlying this distinction—that readers differ in the degree to which they rely on different types of information—is central to more recent investigations concerned with individual differences in the division of labor. For example, Strain and Herdman (1999) demonstrated that when college students were grouped into tertiles based on phonological decoding skills (as measured by a standardized test), those in the lowest tertile of decoding skill showed the strongest imageability effect. Larger imageability effects were also found for groups of children with lower skill levels compared to groups of better early readers (Coltheart, Laxon, & Keating, 1988; Schwanenflugel & Noyes, 1996), and in parallel, early empirical and theoretical work on the effects of O-P consistency pointed towards differences between groups of adult readers (e.g. slower and faster readers, Seidenberg, 1985; readers with and without reading difficulties, Plaut et al., 1996). Relatedly, children with poorer comprehension skills are worse in reading irregular O-P words compared to children with similar decoding skills but better comprehension ability (Nation & Snowling, 1998). More recently, studies further explored individual differences in O-S processing among adult readers, as reflected in the strength of imageability effects while reading irregular words. Specifically, it was found that larger imageability effects are negatively correlated with overall word reading performance (i.e. readers who rely more on semantic imageability generally read words more slowly; Woollams et al., 2016). In the same vein, a recent paper investigating visual word recognition through the lifespan showed that imageability effects decrease with age (in both naming and lexical decision tasks) and with overall reading skill (in naming; Davies, Birchenough, Arnell, Grimmond, & Houlson, 2017). Lastly, individual differences in the magnitude of the imageability effects were also found to be correlated with patterns of neural activity and structural connectivity (Graves et al., 2014; Hoffman, Lambon Ralph, & Woollams, 2015).

The current study

It is notable that the vast majority of prior studies of individual differences in the division of labor have investigated individual differences among adult readers. To our knowledge, there are no prior studies investigating the joint influences of O-P and O-S regularities on word reading in beginning readers even at the group level. The present study addresses this gap in the literature. By investigating individual differences in children’s sensitivity to O-P and O-S regularities in a word naming task and how differences in this division of labor are related to their reading skills, we sought to illuminate the processes by which developing readers become attuned to the structure of a writing system and why some individuals do this more successfully than others. Importantly, from the perspective of SL theory, understanding how sensitivity to different types of information develops can inform us regarding the actual statistics assimilated by readers as they learn to read.

Concretely, below we present data collected using a word naming task from two samples. The first sample includes a group of 3rd and 4th graders, with an over-sampling of children with reading disabilities (RD). The second sample comprises a larger group of 2nd to 5th graders, sampled from the full spectrum of reading abilities. Children in both samples participated in a word naming task in which O-P regularities, imageability, and frequency were manipulated. We used this task to estimate each individual’s reliance on O-P and O-S processes using logistic models predicting accuracy from measures of O-P regularities and imageability. In addition, standardized measures of component reading skills were collected for each participant. To preview our findings, we show that in line with our predictions, individuals substantially vary in their sensitivity to O-P and O-S regularities, and that those two processes account for very large portions of individual variation in reading skills among beginning readers of English.

Study 1

Methods

Participants

Children were recruited to a larger study examining response to intervention for reading disabilities (RD), which includes behavioral and neural measures at two time points, before and after an intensive phonics-based intervention. Here we only report baseline behavioral data collected prior to intervention. The sample includes 123 children (69 boys) in the third and fourth grade (mean age in years: 9.2, SD = 0.7, range: 7.8 – 11.3). Because the larger study focuses on response to intervention, this sample includes over-sampling of children diagnosed with RD: 101 out of the 123 children had a RD diagnosis, whereas the rest of the sample (22) were typically developing (TD) readers. Participants come from public and charter schools from a large urban community (68% African-American, 25% Caucasian, 7% Mixed Race/other). Diagnosis of RD was made using a study-based criterion, defined as scoring at least one standard deviation below age-norm expectations on at least one standard reading assessment (see Arrington et al., 2019 for details). Due to the over-sampling of children with RD, participants had generally low average standard reading scores on Woodcock-Johnson III sub-tests (Woodcock, McGrew, & Mather, 2001): Letter-Word Identification: M = 92.08, SD = 12.24; Word Attack: M = 91.21, SD = 11.71; Passage Comprehension: M = 84.55, SD = 13.19; and Reading Fluency: M = 90.76, SD = 14.41).

Materials, Design and Procedure

Within a larger battery of behavioral and neural assessments, each child participated in a word naming task, as well as four sub-tests of the Woodcock-Johnson III, which are the focus of the analyses reported here.

The word naming task included 160 trials presented to children in a fixed order. In each trial, a fixation cross appeared for 500ms, and was then replaced by a monosyllabic word which was presented in the center of the screen until response. Participants were asked to read out loud each word as accurately and quickly as possible, and their responses were coded by an experimenter who sat in the experiment room. In addition, children’s responses were recorded and in cases when the experimenter could not decide during the experimental session whether a correct response was made they went back and listen to the recordings. Words were selected so they would be generally familiar to children in the 2nd grade and up, and so they would vary along the three independent variables: frequency, imageability, and O-P regularity. Frequency (log-transformed) was estimated for each word based on the Zeno corpus, grades 1–8 (Zeno, Ivens, Millard, & Duvvuri, 1995), and words’ imageability was based on standard ratings (Paivio et al., 1968). O-P regularity was operationalized as vowel surprisal (i.e. −log(p(i))) of the vowel pronunciation, which is a function of the relative likelihood of the pronunciation of a vowel grapheme (see Siegelman et al., 2020). Hence, for example, the word pint has a higher surprisal value than mint, since p(i → /aɪ/) is smaller than p(i → /ɪ/)1. Note that the decision to focus on regularities at vowel level differs from previous investigations of the imageability by consistency interaction, which have generally operationalized O-P consistency in terms of body-rime correspondences. Our choice of focusing on the vowel-level was based on results indicating that young readers rely more on grapheme-phoneme than body-rime regularities (Steacy et al., 2018; Treiman & Kessler, 2006)2. Items were selected to minimize the correlations between the three independent variables. Thus, the correlations between the three independent variables across the 160 items were small (O-P surprisal and imageability: r=−0.08; O-P surprisal and frequency: 0.04; imageability and frequency: 0.16). Accuracy in each trial was used as the dependent variable, and was coded as 1 (correct) or 0 (incorrect) in each trial. Trials with microphone malfunctions or unclear responses were disregarded from further analysis (14.5% of all trials)3. Please refer to the Supplementary Material for the full list of items and their frequency, imageability, and O-P surprisal values, and mean by-item accuracy levels in Studies 1 and 2.

Each participant also completed four sub-tests of the Woodcock-Johnson III Tests of Achievement battery: Letter-word Identification (measuring letter and word decoding), Word-Attack (pseudo-word reading), Reading Fluency, and Passage Comprehension. We use the raw scores in each of these four sub-tests as outcome measures.

Results and Discussion

Group-level analysis.

Before turning to the main focus of the paper – analyses of individual differences – we first report the group-level effects in the word naming task. To examine these, we ran a logistic mixed-effect model (Jaeger, 2008) using the lme4 package in R (Bates, Maechler, Bolker, & Walker, 2015), with correctness in each trial as the dependent variable. The model included frequency, imageability, and O-P surprisal as well as all interactions as fixed effects. The three predictors were scaled and centered to avoid collinearity. The model also included random intercepts for subjects and words, and random slopes for O-P surprisal and frequency (the maximal random effect structure that converged; Barr, Levy, Scheepers, & Tily, 2013). Collinearity in the model was small (all |r|<0.25). Table 1 presents the estimated fixed effects. As can be seen, and in line with previous studies, we found significant effects of all three main-effects on accuracy: words were named more accurately if they were higher on imageability, higher on frequency, and lower on O-P surprisal (i.e. more regular in terms of O-P). In addition to the main effects, we found two significant interactions: O-P regularity by frequency, and imageability by frequency. The O-P regularity by frequency interaction resembled that in adult studies, with a stronger O-P effect in infrequent compared to frequent words. In contrast, the imageability by frequency interaction went in an opposite direction compared to adult studies: The effect of imageability was stronger in frequent compared to infrequent words (i.e. super-additive interaction of imageability by frequency). The significant interactions are visually depicted in Figure 1. We return to the pattern of interactions in the General Discussion.

Table 1:

Group-level effects on accuracy in the word naming task in Study 1.

Predictor β (coefficient) SE z value p value
IMG .234 .081 2.90 .003
O-P REG −.500 .085 −5.86 <.001
FREQ 1.03 .085 3.22 <.001
IMG*O-P REG −.135 .083 −1.64 .10
IMG*FREQ .324 .093 3.48 <.001
O-P REG*FREQ .269 .085 3.15 .002
IMG*O-P REG*FREQ .154 .089 1.72 .09

Notes: IMG: imageability; O-P REG: O-P regularity (surprisal); FREQ: Frequency.

Figure 1.

Figure 1.

Significant interactions in the group-level analysis of Study 1. Left panel: greater impact of frequency in higher levels of surprisal (i.e. when O-P regularities are less consistent). Right panel: greater impact of frequency in higher levels of imageability. Plots created using the interact_plot() function in interactions package in R (Long, 2019).

Individual-differences analysis.

Our main analyses focused on the effects of the three independent variables – O-P regularity, imageability, and frequency – on the word naming behavior of each subject. To estimate these individual-level effects, we ran a set of three logistic regression models on the data of each participant, including the subject’s trial-by-trial data from the word naming task. All three models had accuracy in each trial as the dependent variable, while each model included one of the three predictors (centered and scaled) as the independent variable. The output of these models were used to estimate the impact of each of the three predictors on each individual’s word naming accuracy. Thus, each child had one slope score that estimates sensitivity to O-P regularity, one slope score that quantifies the effect of semantic imageability, and one slope score that indexes reliance on word frequency. Also note that because our individual-level models were based on predicting accuracy, at least some naming errors are required (i.e., no slopes can be estimated for a subject whose performance is perfect). We therefore excluded from the individual-level analysis data from two subjects who had a mean word naming accuracy of more than 98% (unsurprisingly, both subjects were from the TD group). Our individual-differences analyses below are based on the data of remaining 121 participants, whose mean accuracy varied between 10% to 97.8% (mean = 69.4%, SD = 18.5%). Note that due to inter-individual variance in the number of correct responses, estimates of slope scores are expected to vary in their reliability across subjects: Less reliable estimates are expected for subjects with very high or low accuracy rates, compared to subjects with a large enough number of correct and incorrect trials. Nonetheless, we do not expect this variability in reliability to systematically bias the correlation between the slope scores and reading skill – which are the center of the current analyses. Rather, if anything, the increased measurement error for some subjects should result in an under-estimation of the true correlation due to attenuation4. Also note that although in the surprisal measure of O-P regularities higher values represent more surprising (i.e. unpredictable) readings, for simplicity, we ‘flipped’ the O-P surprisal slope scores in all further analyses. Thus, the slope score of sensitivity to O-P regularity was coded such that higher, positive, values represent more sensitivity to O-P regularities, in the same direction as the slope scores reflecting sensitivity to imageability and frequency.

Given our theoretical focus, we first focused on the individual differences in sensitivity to O-P regularity and imageability, and their relation to individual differences in reading skills. These results are summarized in Figure 2, which depicts the spread of individuals according to each child’s sensitivity to O-P regularity (x-axis) and imageability (y-axis). As can be seen, and as expected given the group-level analysis above, readers exhibited sensitivity to both O-P regularity and imageability on average (vertical and horizontal dashed lines, both significantly higher than zero; O-P regularities: one-sample t(120) = 15.35, p < 0.001; imageability: t(120) = 14.01, p < 0.001). Importantly, however, there was substantial variability along the two axes, suggesting that not all individuals are similarly impacted by these two properties. Also note that sensitivity to OP regularity and imageability were significantly negatively correlated (r=−0.32, p < 0.001), although this correlation was moderate in magnitude, suggesting that the effects of O-P regularity and imageability reflect related, yet separate, processes.

Figure 2.

Figure 2.

Variability among participants in Study 1 in sensitivity to imageability (y-axis) and O-P regularities (x-axis). Vertical and horizontal dashed lines show mean sensitivity to O-P regularity and imageability, respectively. Dashed trend line show correlation between the two slope measures. Color scale presents raw scores on Letter-Word Identification task.

Next, we examined the relations between each individual’s reliance on O-P and O-S associations as assessed by the slope scores and their overall reading skills, estimated by the standardized reading measures. We thus ran four multiple regression models, all including the two slope scores as independent variables, and raw scores in one of the Woodcock Johnson sub-tests as a dependent variable. The results of these models are summarized in Table 2. Across all four sub-tests, we found that greater sensitivity to O-P regularity and lower sensitivity to imageability were associated with better reading performance. Note that the effects were not only significant, but also of large magnitude (adj-R2 values: Word-Attack: 53%; Letter-Word Identification: 46%; Reading Fluency: 44%; Passage Comprehension: 32%). As an illustration of the strength of this association, Figure 2 presents letter-word identification scores in color scale: It can be clearly seen that as individuals’ sensitivity to O-P regularity increases (i.e. higher scores along the a-axis), and sensitivity to imageability decreases (i.e. lower scores along the y-axis), Letter-Word Identification scores increases. Next, we also examined whether sensitivity to frequency adds to the predictive value of O-P regularity and imageability, by adding to the multiple regression models the slope scores reflecting reliance on frequency. Notably, in three out of four models (letter-word identification, word attack, passage comprehension) adding frequency to the models did not result in significant improvement in model fit (p > 0.05; Δadj-R2<1%). Sensitivity to frequency was significantly associated with Passage Comprehension scores beyond sensitivity to O-P regularity and imageability (β=2.32, SE=1.01, p = 0.024), albeit with a small improvement in model fit (Δadj-R 2=2.35%). Together, then, measures of sensitivity to O-P regularity and imageability based on the word naming task were better predictors of participants’ reading skills than sensitivity to frequency.

Table 2:

Regression models predicting reading skill from sensitivity to imageability and O-P regularity in Study 1.

Predictor β (coefficient) SE z value p value
Dependent variable: Letter-Word Identification (adj-R2=46%)
O-P REG 4.57 0.55 8.26 <.001
IMG −1.63 0.55 −2.95 .004
Dependent variable: Word Attack (adj-R2=53%)
O-P REG 3.65 0.43 8.55 <.001
IMG −2.15 0.43 −5.03 <.001
Dependent variable: Reading Fluency (adj-R2=32%)
O-P REG 5.92 1.02 5.80 <.001
IMG −3.06 1.02 −2.99 .003
Dependent variable: Passage Comprehension (adj-R2=44%)
O-P REG 2.87 0.39 7.35 <.001
IMG −1.47 0.39 −3.77 <.001

Notes: IMG: imageability; O-P REG: O-P regularity (surprisal). Predictors are centered and scaled.

The associations of reading skills with the individual-level effects of O-P regularity and imageability (and the numerically smaller magnitude of the correlations with that of frequency) is also apparent in Table 3 which shows all zero-order correlations between the three slope measures and the reading sub-tests. Whereas reading skills were strongly positively correlated with sensitivity to O-P regularity (all 0.67 ≥ r’s ≥ 0.53), and negatively correlated with imageability (all −0.31 ≥ r’s ≥ −0.58), their correlations with sensitivity to frequency were generally lower (−0.04 ≥ r’s ≥ −0.33). Nonetheless, and despite the fact that reliance on frequency was generally not significantly related to reading performance above and beyond reliance on O-P regularity and imageability, it is worth mentioning that sensitivity to frequency was generally negatively associated with reading skills. This aligns with findings with adult readers showing that higher reading skills are correlated with lesser impact by word frequency in different tasks (e.g. Kuperman & Van Dyke, 2013; Schilling, Rayner, & Chumbley, 1998). Nonetheless, our data suggest that O-P regularity and imageability are more strongly correlated with early reading skills compared to sensitivity to frequency.

Table 3:

Bi-variate Pearson correlations of sensitivity to O-P regularity, imageability, and frequency with standardized reading measures in Study 1. Significant correlations (p < 0.05) are shown in bold.

Measure 2 3 4 5 6 7
1. O-P regularity −0.32 −0.30 0.65 0.67 0.62 0.53
2. Imageability 0.26 −0.39 −0.51 −0.44 −0.38
3. Frequency −0.16 −0.33 −0.10 −0.04
4. Letter-Word Identification 0.78 0.83 0.85
5. Word Attack 0.74 0.65
6. Passage Comprehension 0.80
7. Reading Fluency

Lastly, we examined whether sensitivity to O-S and O-P processes is associated not only with continuous measures of reading, but also with categorical TD/RD diagnosis – that is, whether the two slope scores can distinguish between children with and without RD diagnosis. A logistic model with TD/RD diagnosis of each child and slope scores of O-P regularities and imageability revealed that indeed, higher sensitivity to O-P regularities and lower reliance on imageability were associated with a TD (rather than a RD) diagnosis (sensitivity to O-P regularities: B=2.21, SE=0.52, Z=4.21, p < 0.001; imageability B=−0.78, SE=0.35, Z=−2.24, p = 0.02; McFadden’s multiple-R2 = 45%). Figure 3 visually depicts these results, showing that participants without reading disabilities (TD readers) are those who show greater sensitivity to O-P regularities and lower sensitivity to imageability compared to their peers (i.e. TD readers are concentrated in a limited portion of the regularity by imageability space, generally exhibiting larger than average sensitivity to regularity, and smaller than average sensitivity to imageability). Among RD readers, however, there is substantial variability in the extent of sensitivity to the two sources of information.

Figure 3.

Figure 3.

RD diagnosis (in color) as a function of sensitivity to imageability (y-axis) and O-P regularity (x-axis).

Study 2

The primary goal of Study 2 is to replicate the findings of Study 1. We utilize a similar task to estimate individuals’ reliance on O-S and O-P associations during word naming, and examine the relation between these processes and individual differences in reading skills. Study 2 includes an even larger sample of children compared to Study 1. Moreover, this sample consists of a cohort of children which are more representative of typical variability in reading skills, as they were sampled from the general population of early readers without over-sampling of children with RD. This enables us to examine whether reliance on the two processes is associated with variability in reading skill also among children without a formal diagnosis of RD.

Methods

Participants

A total of 282 children (143 boys) participated in Study 2. All children were in the 2nd to 5th grade, and their mean age was 9.43 (SD=1.17, range=7.3–13.08). Participants came from both public and private schools, and the sample was made up of 51% African American, 36% Caucasian, 11.5% Hispanic, and 1.5% Mixed Race students. As expected given the differences in the samples, participants had higher average standardized reading assessment scores compared to those of Study 1 (Letter-Word Identification: M = 97.59, SD = 11.47; Word Attack: M = 98.12, SD = 9.56; Passage Comprehension: 90.40, SD = 11.41; all p’s < 0.001).

Materials, Design and Procedure.

The word naming task contained the same items as Study 1. Students were asked to read the words in a list format as quickly and accurately as they could and a research assistant coded the child’s responses. Audio of the experimental session was recorded and experimenters could go back and listen to the recordings in cases where they could not decide whether a correct response was made during the experimental session. Children also participated in three sub-tests of reading skills from the Woodcock-Johnson III: Letter-Word Identification, Word Attack, and Passage Comprehension.

Results and Discussion

Group-level analysis

As in Study 1, we first ran a logistic mixed-model to estimate the group-level effects in the word naming task. The model again included frequency, imageability, and O-P surprisal (all standardized) as well as all interactions as fixed effects, with accuracy as the dependent variable. The model included by-subject and by-item random intercepts, and by-subject random slopes for O-P surprisal and frequency (the maximal random effect structure that converged; Barr, Levy, Scheepers, & Tily, 2013). Collinearity in the model was again small (all |r|<0.25). As can be seen in Table 4, higher frequency and lower O-P surprisal (i.e. higher O-P regularity) were both associated with higher naming accuracy. In contrast, the effect of imageability on group level accuracy did not reach significance. Thus, on average, naming performance for this group of readers was not significantly impacted by imageability, as opposed to performance in the sample of Study 1. One possible reason for this arises from the fact that on average Study 2’s participants are better readers than the mostly RD sample of Study 1 (and see aggregated analysis section, below). The effects of the interactions mirrored those in Study 1: There was a sub-additive frequency by O-P surprisal interaction (i.e. a stronger O-P regularity effect in infrequent words), and a super-additive interaction between frequency and imageability (a stronger frequency effect in highly imageable words).

Table 4:

Group-level effects on accuracy in the word naming task in Study 2.

Predictor β (coefficient) SE z value p value
IMG .128 .087 1.47 .14
O-P REG −.584 .091 −6.40 <.001
FREQ 1.028 .089 11.46 <.001
IMG*O-P REG −.067 .088 −0.76 .44
IMG*FREQ .269 .099 2.70 .007
O-P REG*FREQ .218 .090 2.41 .016
IMG*O-P REG*FREQ .172 .095 1.82 .07

Notes: IMG: imageability; O-P REG: O-P regularity (surprisal); FREQ: Frequency.

Individual-differences analysis

Four children had mean word naming accuracy of more than 98% and their data were therefore removed from further analysis. For each of the remaining 278 children (accuracy range: 9%−97%, mean = 78.9%, SD = 19.8%), we again estimated sensitivity to O-P regularity, imageability, and frequency, using logistic models predicting accuracy of each subject at each trial from each predictor. Note that we again ‘flipped’ the slope scores of O-P surprisal such that higher scores reflect more sensitivity to O-P regularities.

Figure 4 shows the spread of individuals along the two critical dimensions: sensitivity to O-P regularity and imageability. Despite the fact that there was no group-level effect of imageability in the mixed-model above, mean slope scores of sensitivity to O-P regularity and imageability both significantly differed from 0 (vertical and horizontal dashed lines: O-P regularities: one-sample t(277) = 26.41, p < 0.001; imageability: t(277) = 13.72, p < 0.001). This suggests that the lack of observed sensitivity to imageability at the group level is related to the inclusion in the model of the higher order interactions, substantial shrinkage, and/or substantial variance across items (which mixed-models take into account; see Jaeger, 2008 for discussion). Note also that although sensitivity to O-P regularity and imageability were again numerically negatively correlated, this correlation was very weak and failed to reach significance (r=−0.06, p=0.29; and see aggregated analysis section, below).

Figure 4.

Figure 4

Variability among participants in Study 2 in sensitivity to imageability (y-axis) and O-P regularities (x-axis). Vertical and horizontal dashed lines show mean sensitivity to O-P regularity and imageability, respectively. Dashed trend line show correlation between the two slope measures. Color scale presents raw score on Letter-Word Identification task.

Most importantly, however, and in line with the results of Study 1, sensitivity to imageability and O-P regularity were again associated with reading skills. This is reflected in Figure 4 which depicts letter-word identification scores (in color scale) as a function of the two slope scores. The full results are shown in Table 5, which shows the outputs of three multiple regression models with the three Woodcock-Johnson sub-tests as dependent variables. As in Study 1, in all three models we found that higher sensitivity to O-P regularity and less sensitivity to imageability were associated with better reading skills. Also note that again effect sizes were large (adj-R2 values: Word Attack: 39%; Letter-Word Identification: 38%; Passage Comprehension: 24%), albeit somewhat lower than in Study 1 (and see aggregated analysis section for further discussion). As for sensitivity to frequency, we again found that it had only limited relations to reading skills beyond sensitivity to imageability and O-P regularity: Adding slopes of sensitivity to frequency did not improve model fit for Letter-Word Identification and Passage Comprehension (p>0.05), although it did have a significant yet small added effect on Word Attack scores (β=−0.84, SE=0.28, p = 0.003, Δadj-R2=1.7%). Table 6 includes the full correlation matrix with the three slope scores and the Woodcock-Johnson sub-tests. In line with the results of Study 1, the reading skills scores were significantly positively correlated with sensitivity to O-P regularity, significantly negatively correlated with imageability, and negatively (yet generally insignificantly) correlated with sensitivity to frequency.

Table 5:

Regression models predicting reading skill from sensitivity to imageability and O-P regularities in Study 2.

Predictor β (coefficient) SE z value p value
Dependent variable: Letter-Word Identification (adj-R2=38%)
O-P REG 4.59 0.36 12.73 <.001
IMG −1.03 0.36 −2.85 .005
Dependent variable: Word Attack (adj-R2=39%)
O-P REG 3.51 0.28 12.60 <.001
IMG −1.08 0.28 −3.89 <.001
Dependent variable: Passage Comprehension (adj-R2=24%)
O-P REG 2.35 0.27 8.81 <.001
IMG −0.74 0.27 −2.77 .006

Notes: IMG: imageability; O-P REG: O-P regularity (surprisal); FREQ: Frequency.

Table 6:

Bi-variate Pearson correlations of sensitivity to O-P regularities, imageability, and frequency with standardized reading measures in Study 2.

Measure 2 3 4 5 6
1. O-P regularity −0.06 −0.17 0.61 0.60 0.47
2. Imageability 0.13 −0.17 −0.22 −0.18
3. Frequency −0.10 −0.26 −0.04
4. Letter-Word Identification 0.76 0.74
5. Word Attack 0.62
6. Passage Comprehension

Significant correlations (p < 0.05) are shown in bold.

Aggregated Analysis

As noted above, participants in the two studies were sampled from two different populations: Although the sample of Study 1 consisted mostly of RD readers, participants in Study 2 were sampled from the general population of early readers. Indeed, this different sampling was reflected in significant differences in the three sub-components of the Woodcock-Johnson sub-tests that were administered in both samples (see above). Importantly, the difference between the samples was also reflected in the slope scores extracted from the word naming task. Thus, in line with the results above, the better readers of Study 2 showed greater mean slopes of O-P regularity and smaller slopes of imageability than the mostly-RD participants of Study 1 (O-P regularity: t(397) = 3.02; p = 0.003; imageability: t(397) = 3.01; p = 0.002).

Despite the different samples, the results of Studies 1 and 2 point to a similar general conclusion: In both studies greater sensitivity to O-P regularity and lower sensitivity to imageability were associated with higher reading skills. Unsurprisingly, aggregating the data from the two samples (n = 399) showed that sensitivity to O-P regularity and imageability were both strongly associated with reading skills, as expected given the separate results of Study 1 and 2 (see Table 7 for output of regression models predicting performance on the Woodcock-Johnson sub-tests, and Figure 5 for an aggregated scatter plot).

Table 7:

Regression models predicting reading skill from sensitivity to imageability and O-P regularities in the aggregated analyses (n=399).

Predictor β (coefficient) SE z value p value
Dependent variable: Letter-Word Identification (adj-R2=43%)
O-P REG 4.83 0.31 15.76 <.001
IMG −1.39 0.31 −4.53 <.001
Dependent variable: Word Attack (adj-R2=45%)
O-P REG 3.84 0.25 15.67 <.001
IMG −1.61 0.25 −6.57 <.001
Dependent variable: Passage Comprehension (adj-R2=32%)
O-P REG 2.69 0.23 11.91 <.001
IMG −1.12 0.23 −4.95 <.001

Notes: IMG: imageability; O-P REG: O-P regularity (surprisal); FREQ: Frequency.

Figure 5.

Figure 5

Variability among participants across the two studies in sensitivity to imageability (y-axis) and O-P regularities (x-axis). Vertical and horizontal dashed lines show mean sensitivity to O-P regularity and imageability, respectively. Dashed trend line show correlation between the two slope measures. Color scale presents raw score on Letter-Word Identification task.

Despite the similarities in the effects of the slope scores on reading across the two studies, there were two differences between the samples that warrant additional investigation. First, whereas in Study 1 we observed a significant negative correlation (of r = −0.32) between O-P regularity and imageability, no significant correlation between the slope scores was observed in Study 2 (r = −0.06), and these two correlation coefficients significantly differed from one another (Z = 2.47, p = 0.01). Second, effects sizes (i.e., percent of variance explained) were generally higher among the lower-skilled readers of Study 1 compared to those of Study 2 (see R2 values in Table 2 vs. Table 5, above).

These two discrepant findings raise the possibility that the development of the two processes and their relation to reading behavior is non-linear. For example, it is possible that knowledge of the correspondences between graphemes and phonemes follows a non-linear trajectory, developing rapidly (and showing substantial individual differences) among early/poor readers, while its growth saturates (or even decreases) among readers that are more further along the literacy acquisition trajectory. This prediction conforms with developmental studies documenting a non-linear trajectory of O-P consistency effects (e.g. Weekes et al., 2006), and with non-linear effects in language and skill acquisition more broadly (e.g. Farnia & Geva, 2011; Newell, 1991). Importantly, aggregating the data from the two studies allowed us to achieve sufficient power to examine the possibility of such non-linear effects using more sensitive statistical analysis.

To do so, we ran Generalized Additive Models (GAM) on the aggregated data from the two studies (n=399 children). In contrast to linear models, in GAMs the relation between a dependent variable and a predictor or a set of predictors does not have to be linear. Instead, GAMs use smoothed terms of the predictor(s) to find the function that best fits the outcome. Then, these models can be compared to models without smoothed terms for one or more of the predictors, to examine whether the non-linear smoothing indeed accounts for the data better than a simple linear function (i.e. whether the non-linear transformation is warranted by the data). Here we report two GAMs using the mgcv package in R (Wood, 2011), both with O-P regularity and imageability as predictors. The dependent variable in the first model was Letter-Word Identification scores, and Word Attack scores in the second.

The results of the GAMs are presented in Figure 6. As can be seen in the top panel, the estimated effect of imageability on the two dependent variables was negative as expected, and importantly -almost entirely linear. Indeed, a Chi-square test comparing the GAMs with smoothed terms for both O-P regularity and imageability to models without a smoothed term for imageability revealed no significant difference in models’ fit (p > 0.05 for both dependent variables). In contrast, the positive effect of O-P regularities on reading skills was non-linear: Model comparison revealed a significant increase in model fit for models with a non-linear term for O-P regularity (for both models: p < 0.001). More specifically, the impact of sensitivity to O-P regularity on reading skills had a seemingly logarithmic trajectory, with a strong positive impact on reading at lower skills, which saturates at higher skill level (see Figure 5, top panel). The joint impact of sensitivity to imageability and O-P regularities on the two dependent variables are depicted in the bottom panel of Figure 5, which presents predicted reading scores given a reader’s sensitivity to O-P regularities and imageability. This again shows the clear relation between the two dimensions and reading skills, while also taking into account the non-linear impact of sensitivity to O-P regularity on reading.

Figure 6.

Figure 6

GAM results based on the aggregated data (n=399). Panel A: Estimated effects of the smoothed terms of sensitivity to imageability and O-P regularity on (residualized) Letter-Word Identification (left) and Word Attack (right) scores. Panel B: Predicted scores in Letter-Word Identification (left) and Word Attack (right) as a function of sensitivity to O-P regularity (x-axis) and imageability (y-axis).

Lastly, we wished to examine the role of vocabulary knowledge in the development of the reliance on O-P and O-S processes and their associations with reading skills. This analysis is motivated by behavioral studies showing that reading development is related not only to decoding skills but to oral learning skills more broadly (e.g. Nation & Snowling, 2004), and by recent computational work suggesting that vocabulary skill is associated with a more efficient division of labor (as better oral language skills reflect better knowledge of S-P and P-S mappings, which is then beneficial for the development of an efficient division of labor between O-P and O-S; Chang & Monaghan, 2019; Chang, Monaghan, & Welbourne, 2019). Indeed, we found significant correlations between a measure of vocabulary knowledge (raw scores in the vocabulary sub-test of the WASI-II, Wechsler, 2011) and sensitivity to O-P and O-S: a positive correlation with reliance on O-P (r = 0.2), and a negative association with O-S (r = −0.27 both p’s < 0.001). Importantly, however, even after controlling for vocabulary skills we found significant associations (with large effect sizes) between sensitivity to O-P and O-S and reading outcomes (Table 8). Thus, although better knowledge of S-P/P-S does support the development of a more efficient division of labor as reflected in word naming (i.e. more reliance on O-P and less reliance on O-S), the associations between sensitivity to O-P and O-S and reading skill cannot be explained solely by variance in vocabulary knowledge.

Table 8:

Regression models predicting reading skill from sensitivity to imageability and O-P regularities in the aggregated analyses (n=399), while controlling for vocabulary knowledge.

Predictor β (coefficient) SE z value p value ΔR2
Dependent variable: Letter-Word Identification (adj-R2=47%)
O-P REG 4.55 0.30 15.24 <.001 34%
IMG −0.97 0.31 −3.17 .002
Vocabulary 1.79 0.31 5.81 <.001 5%
Dependent variable: Word Attack (adj-R2=48%)
O-P REG 3.63 0.24 15.05 <.001 36%
IMG −1.29 0.25 −5.24 <.001
Vocabulary 1.30 0.25 5.22 <.001 4%
Dependent variable: Passage Comprehension (adj-R2=49%)
O-P REG 2.32 0.20 11.65 <.001 20%
IMG −0.53 0.20 −2.60 .009
Vocabulary 2.33 0.21 11.31 <.001 17%

Notes: IMG: imageability; O-P REG: O-P regularity (surprisal); FREQ: Frequency. ΔR2 values are the difference in R2 between the full model and a partial model that includes only the relevant predictor(s).

General Discussion

The current work revisits a basic question: What are the processes that lead to successful early reading acquisition? From an SL perspective, achieving proficient reading requires the assimilation of statistical regularities present across varied linguistic dimensions in the writing system. Thus, early readers who are better in assimilating these regularities are expected to be more proficient readers than those with poorer SL abilities. The current results provide strong evidence for this theoretical claim. Namely, data from almost 400 early readers of English show that children who show greater sensitivity to the correspondences between print and speech in word naming have better reading skills. In parallel, individuals who display greater reliance on associations between print and meaning, and particularly those who are also not sensitive to O-P correspondences, are characterized by lower reading abilities.

A small number of previous studies have investigated differences in the imageability effect between sub-groups of children who vary in their reading skills. For example, Coltheart, Laxon, and Keating (1988) found that imageability facilitated word naming for the poorer readers among a sample of nine-year-old children but had no effect on the performance of the better readers. Relatedly, although Schwanenflugel and Noyes (1996) found limited evidence that imageability affects word naming by 3rd-and 5th-grade children, their results also suggested that imageability effects are larger for poorer readers. Joining these studies is the work by Davies et al. (2017) showing that the magnitude of the imageability effect is modulated by both age and reading skill in a sample of 8 to 83 year-old readers, with larger imageability effects for younger, and poorer, readers. Our results extend these findings in an important way: Because we manipulated both imageability and O-P regularity, we documented the joint contribution of the reliance on regularities in O-P as well as O-S over a large sample of children with diverse reading skills. Similarly, a handful of previous studies have investigated individual differences in the division of labor between O-P and O-S contributions to word naming in adults (e.g. Strain & Herdman, 1999; Woollams, Lambon Ralph, Madrid, & Patterson, 2016). Our study extends this work by investigating these processes during a period of the developmental trajectory for which we have theoretical and computational insights (e.g. Harm & Seidenberg, 2004) but a dearth of empirical data. In line with this computational work, our results suggest that developing an efficient division of labor between O-P and O-S is crucial for early reading success.

Although the main focus of the present investigation is individual-level variation, the current results also present intriguing group-level effects that shed additional light on the development of the division of labor between O-P and O-S processes. Specifically, in both studies we observed a similar pattern of interactions between frequency, imageability, and O-P regularity on group-level performance: We found that whereas the O-P regularity effect was stronger with infrequent compared to frequent words, the imageability effect was stronger with frequent compared to infrequent words. Further, neither the interaction between imageability and O-P regularity nor the three-way interaction of frequency, regularity, and frequency were statistically reliable. Thus, whereas the O-P regularity by frequency interaction resembles the effect typically observed in adults, the lack of interactions between imageability and regularity, and the opposite interaction between imageability and frequency, differed from previous studies with adult samples (e.g. Strain et al., 1995). This pattern of interactions may be due to differences in the rate of learning of O-P and O-S associations: O-S associations are learned more slowly and thus at the age range of the children in our sample the O-S associations for lower-frequency and/or O-P irregular words may be too weak to contribute significantly to word naming (see Duff & Hulme, 2012, and Laing & Hulme, 1999, for similar results in a word learning paradigm). From this perspective, the group-level results compliment the individual-level findings: O-P associations are learned earlier and are strongly related to reading skills among early readers; full knowledge of the more arbitrary O-S associations is expected to emerge only later and at this developmental stage general reliance on O-S (when other sources of information are present) is associated with lower reading skills (see also Harm & Seidenberg, 2004).

Although the results of our study shed light on an important aspect of literacy acquisition, they also raise several issues that should be addressed by further research. One issue concerns the nature of the mechanism that gives rise to individual differences in sensitivity to O-P regularities. As mentioned above, some portion of the variance in reliance on O-P (as well as on O-S) was related to vocabulary skills: A measure of vocabulary skill was positively associated with the magnitude of the O-P effect, and negatively associated with the imageability effect. This suggests that one factor that contributes to the development of O-P knowledge (and a more efficient division of labor more broadly) is better knowledge of the associations between phonology and semantics, in line with recent computational work (Chang & Monaghan, 2019; Chang et al., 2019). At the same time, oral language skills did not account for variability in O-P fully, and an open question therefore is what additional mechanism(s) give rise to these individual differences. One possibility is that these differences are related to the effectiveness of the mechanism that tracks the statistical relationships between orthographic and phonological units. That is, perhaps readers who are less impacted by O-P regularities are less adept at forming associations between well-established orthographic and phonological representations based on the reliability of these relationships. Alternatively, it is possible that these readers’ failure in forming these associations is the result of a deficiency in the orthographic or phonological representations themselves – for example, unstable phonological representations, resulting in poor associations with orthographic representations. These two possibilities reflect the principles of computational models of reading (Harm, McCandliss, & Seidenberg, 2003; Harm & Seidenberg, 1999; Rueckl et al., 2019), which suggest that differences in the quality of O-P relations are related to both learning-related parameters (e.g. learning rate, number of hidden units) and parameters related to the quality of the phonological representations (e.g. phonological noise). These two possibilities also relate to the model of SL by Frost and colleagues, which posits that individual differences in a given SL task are the result of the interplay between individuals’ ability to compute regularities between established representations, and their encoding abilities that enable the efficient formation of these representations to begin with (Frost, Armstrong, Siegelman, & Christiansen, 2015). Also of note is that given the presumed involvement of SL in word segmentation (Saffran, Aslin, & Newport, 1996), impaired phonological representations may also be related to SL deficits. To dissociate between these two possible explanations (and their relative contribution), future studies can consider measures that directly assess the quality of orthographic and phonological representations (e.g. phonological awareness), and further explore the relations between sensitivity to O-P regularities, quality of representations, oral language skills, and reading.

A second issue concerns the relationship between the attunement of the reading system to statistical regularities and the manifestation of those regularities in behavior. From our perspective, reading behavior is jointly determined by both O-P and O-S regularities; the influence of either depends on how well both have been assimilated as well as how reliable they are in a particular task. In the context of word naming, the behavioral manifestation of O-P regularities (as assessed by the O-P regularity effect) can be taken as a reasonably direct index of the degree to which those regularities have been assimilated into the organization of the reading system. Thus, with regard to our experiments, the magnitude of the O-P regularity effect can be taken as an indicator of how well an individual had assimilated the statistical regularities of the O-P mapping. In contrast, as illustrated by the modulation of the imageability effect on word naming by factors such as word frequency and O-P consistency, the manifestation of semantic regularities in this task is dependent on whether other regularities (in particular O-P regularities) serve as an efficient source of information to guide word reading. Thus, although differences in the magnitude of the imageability effect clearly reveal differences in the overall organization of the reading system, they should not be taken as a (context-free) measure of the strength of an individual’s O-S (and S-P) knowledge. Rather, an individual’s imageability effect is a proxy of the use of unreliable information despite the availability of more reliable information. Indeed, although we observed that poorer readers exhibited larger imageability effects, there is reason to believe that poorer readers are less adept at assimilating O-S regularities in general (see Keenan & Betjemann, 2008). Going forward, it will be vital to develop methods that allow us to assess how well O-S regularities have been assimilated across different reading tasks, to identify the factors related to individual differences in the ability to assimilate them, and to determine whether these are the same factors underlying variability in the assimilation of O-P regularities as well (see Sawi & Rueckl, 2019, for discussion).

Beyond these specific issues, we believe that our study has broader implications for the study of reading, as it provides a glimpse of an important but understudied aspect of literacy acquisition. More research is needed not only to gain a more complete picture of the division of labor between O-P regularities and imageability across the course of literacy acquisition, but also to reveal the role of other regularities (e.g. morphological regularities, O-P regularities at different grain sizes) over this same time course. In addition, whereas the experiments reported here focused on the results of long-term SL—that is, the behavioral consequence of learning that took place prior to the experiment—it is critical that future research target the learning mechanisms themselves. In this regard, it is interesting to note that recent findings suggest that O-P regularities and imageability play a facilitatory role in word learning, a role that further varies as a function of individual-level reading skill (e.g., Duff & Hulme, 2012; Steacy & Compton, 2019).

Finally, our study has implications for the study of SL more broadly. It is useful to consider the differences between the results of the current study and those of previous studies examining the correlations between SL tasks and reading skills. Of particular note is the strong predictive power of the measures examining reliance on O-P and O-S compared to the much weaker correlations observed in correlational studies of ‘typical’ SL tasks and reading outcomes. Some of the reasons for this difference are methodological. Specifically, SL tasks are oftentimes characterized by compromised reliability, especially with children (Arnon, 2019a; West, Vadillo, Shanks, & Hulme, 2018), which may limits the tasks’ predictive power. Importantly, however, we believe that this discrepancy also originates from theoretical reasons. First, there is growing evidence that SL is not (or at least not exclusively) a domain-general construct, but rather shows patterns of modality and information-specificity (e.g. Arciuli, 2017; Siegelman, Bogaerts, Christiansen, & Frost, 2017). Per this domain-specific view of SL, learning one type of regularity (e.g. learning the transitional probability between two shapes) may be different than learning another (e.g. an association between a letter and a sound). Second, and relatedly, our results highlight the different nature of learning of regularities in ‘typical’ SL tasks and those that are actually assimilated by learners in real-world tasks (e.g. reading). Namely, typical SL studies focus on constrained, highly (if not fully) regular, and uni-modal statistical contingencies that are learned over a short period of time. In contrast, the current approach examines reliance on more complex types of regularities, which vary in their reliability, that are concurrently presented to learners, and that are assimilated over long time-scales. Our results suggest that these more complex regularities are those that play a role in reading acquisition, more so than the simplified regularities typically studied in SL paradigms. We therefore believe that in order for SL theory to scale-up to the actual challenges presented by real-world learning tasks, it must go beyond simplified learning scenarios, and focus on these types of more complex and subtle regularities.

Highlights.

  • Reading requires assimilating regularities between print, speech, and meaning.

  • We quantify early readers’ reliance on these regularities during word naming.

  • Children who rely more on print-speech regularities have better reading skills.

  • Children who rely more on print-meaning associations are poorer readers.

Acknowledgements

Research reported in this publication was supported by the Eunice Kennedy National Institute of Child Health & Human Development of the National Institutes of under Awards P01HD070837, P20HD091013, and R37HD090153. The content is solely responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

Supplementary Material

The full list of items (with their order of presentation, frequency, imageability, O-P surprisal values, and mean accuracy levels in Studies 1 and 2) is available at: https://osf.io/hkpy2/.

1

We used the values provided in the recent work by Siegelman et al. (2020), which quantifies grapheme-phoneme regularities based on a corpus of 10093 monosyllabic English words. To ensure that our estimates of individuals’ reliance on O-P regularities (see below) were not skewed by utilizing a corpus that is based on adult reading materials, we examined the item-level estimates of O-P surprisal for the 160 items in the word naming task based only on words that appear in the Zeno corpus (Zeno et al., 1995) in grades 1 and 2 (3632 words). This calculation revealed a minimal impact of the type of corpus used with a near-perfect correlation between the values based on the child-directed input and the original values provided by Siegelman et al. (2020), of r = 0.985.

2

Also note that generally there are high correlations between vowel-level and body-level regularities across items (i.e. words that are more regular at one grain size are also more regular at the other; see Siegelman et al. 2020). This was also the case in the current items, which were not selected to isolate sensitivity to vowel-level vs. body-level information. Differentiating between sensitivity to regularities in the two grain-sizes requires selection of items in which the two properties are uncorrelated.

3

Because participants in this age range produced a large number of unclear or idiosyncratic responses (e.g., ‘hmmm’, ‘when is this done’) leading to unclear or inaccurate speech onsets, we a-priori decided not to further analyze latency data from the reading task and focus on accuracy, which tends to be more reliable in this age range.

4

To account for this issue, future modifications to the task can adopt an adaptive design, where items are chosen online based on each child’s performance in the naming task, so a similar mean accuracy rate is achieved for all subjects.

References

  1. Arciuli J (2017). The multi-component nature of statistical learning. Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences, 372(1711), 20160058 10.1098/rstb.2016.0058 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Arciuli J (2018). Reading as Statistical Learning. Language, Speech, and Hearing Services in Schools, 49(3), 634–643. 10.1044/2018_lshss-stlt1-17-0135 [DOI] [PubMed] [Google Scholar]
  3. Arciuli J, & Simpson IC (2012). Statistical Learning Is Related to Reading Ability in Children and Adults. Cognitive Science, 36(2), 286–304. 10.1111/j.1551-6709.2011.01200.x [DOI] [PubMed] [Google Scholar]
  4. Arnon I (2019a). Do current statistical learning tasks capture stable individual differences in children? An investigation of task reliability across modality. Behavior Research Methods. 10.3758/s13428-019-01205-5 [DOI] [PubMed] [Google Scholar]
  5. Arnon I (2019b). Statistical Learning, Implicit Learning, and First Language Acquisition: A Critical Evaluation of Two Developmental Predictions. Topics in Cognitive Science, 11(3), 12428 10.1111/tops.12428 [DOI] [PubMed] [Google Scholar]
  6. Arrington CN, Malins JG, Winter R, Mencl WE, Pugh KR, & Morris R (2019). Examining individual differences in reading and attentional control networks utilizing an oddball fMRI task. Developmental Cognitive Neuroscience, 38, 100674 10.1016/j.dcn.2019.100674 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Aslin RN, Saffran JR, & Newport EL (1998). Computation of Conditional Probability Statistics by 8-Month-Old Infants. Psychological Science, 9(4), 321–324. 10.1111/1467-9280.00063 [DOI] [Google Scholar]
  8. Barakat BK, Seitz AR, & Shams L (2013). The effect of statistical learning on internal stimulus representations: Predictable items are enhanced even when not predicted. Cognition, 129(2), 205–211. 10.1016/j.cognition.2013.07.003 [DOI] [PubMed] [Google Scholar]
  9. Baron J, & Strawson C (1976). Use of orthographic and word-specific knowledge in reading words aloud. Journal of Experimental Psychology: Human Perception and Performance, 2(3), 386–393. 10.1037/0096-1523.2.3.386 [DOI] [Google Scholar]
  10. Barr DJ, Levy R, Scheepers C, & Tily HJ (2013). Random effects structure for confirmatory hypothesis testing: Keep it maximal. Journal of Memory and Language, 68, 255–278. 10.1016/j.jml.2012.11.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Bates D, Maechler M, Bolker B, & Walker S (2015). lme4: Linear mixed-effects models using Eigen and S4. R package version 1. 1–8. Retrieved from http://cran.r-project.org/package=lme4 [Google Scholar]
  12. Borleffs E, Maassen BAM, Lyytinen H, & Zwarts F (2017). Measuring orthographic transparency and morphological-syllabic complexity in alphabetic orthographies: a narrative review. Reading and Writing, 30(8), 1617–1638. 10.1007/s11145-017-9741-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Borowsky R, & Masson MEJ (1996). Semantic ambiguity effects in word identification. Journal of Experimental Psychology: Learning Memory and Cognition, 22(1), 63–85. 10.1037/0278-7393.22.1.63 [DOI] [Google Scholar]
  14. Brown P, Lupker SJ, & Colombo L (1994). Interacting Sources of Information in Word Naming: A Study of Individual Differences. Journal of Experimental Psychology: Human Perception and Performance, 20(3), 537–554. 10.1037/0096-1523.20.3.537 [DOI] [Google Scholar]
  15. Chang YN, & Monaghan P (2019). Quantity and Diversity of Preliteracy Language Exposure Both Affect Literacy Development: Evidence from a Computational Model of Reading. Scientific Studies of Reading, 23(3), 235–253. 10.1080/10888438.2018.1529177 [DOI] [Google Scholar]
  16. Chang YN, Monaghan P, & Welbourne S (2019). A computational model of reading across development: Effects of literacy onset on language processing. Journal of Memory and Language, 108, 104025 10.1016/j.jml.2019.05.003 [DOI] [Google Scholar]
  17. Chateau D, & Jared D (2003). Spelling-sound consistency effects in disyllabic word naming. Journal of Memory and Language, 48(2), 255–280. 10.1016/S0749-596X(02)00521-1 [DOI] [Google Scholar]
  18. Coltheart V, Laxon VJ, & Keating C (1988). Effects of word imageability and age of acquisition on children’s reading. British Journal of Psychology, 79(1), 1–12. 10.1111/j.2044-8295.1988.tb02270.x [DOI] [Google Scholar]
  19. Cortese MJ, & Simpson G (2000). Regularity effects in word naming: What are they? Memory & Cognition, 28(8), 1269–1276. 10.3758/bf03211827 [DOI] [PubMed] [Google Scholar]
  20. Dale R, Duran ND, & Morehead JR (2012). Prediction during statistical learning, and implications for the implicit/explicit divide. Advances in Cognitive Psychology, 8(2), 196–209. 10.2478/v10053-008-0115-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Davies RAI, Birchenough JMH, Arnell R, Grimmond D, & Houlson S (2017). Reading through the life span: Individual differences in psycholinguistic effects. Journal of Experimental Psychology: Learning Memory and Cognition, 43(8), 1298–1338. 10.1037/xlm0000366 [DOI] [PubMed] [Google Scholar]
  22. Dotsch R, Hassin RR, & Todorov A (2017). Statistical learning shapes face evaluation. Nature Human Behaviour, 1, 1 10.1038/s41562-016-0001 [DOI] [Google Scholar]
  23. Duff FJ, & Hulme C (2012). The Role of Children’s Phonological and Semantic Knowledge in Learning to Read Words. Scientific Studies of Reading, 16(6), 504–525. 10.1080/10888438.2011.598199 [DOI] [Google Scholar]
  24. Ehri LC (2005). Learning to read words: Theory, findings, and issues. Scientific Studies of Reading, 9(2), 167–188. 10.1207/s1532799xssr0902_4 [DOI] [Google Scholar]
  25. Elleman AM, Steacy LM, & Compton DL (2019). The Role of Statistical Learning in Word Reading and Spelling Development: More Questions Than Answers. Scientific Studies of Reading, 23, 1–7. 10.1080/10888438.2018.1549045 [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Erickson LC, & Thiessen ED (2015). Statistical learning of language: Theory, validity, and predictions of a statistical learning account of language acquisition. Developmental Review, 37, 66–108. [Google Scholar]
  27. Farnia F, & Geva E (2011). Cognitive correlates of vocabulary growth in English language learners. Applied Psycholinguistics, 32, 711–738. 10.1017/S0142716411000038 [DOI] [Google Scholar]
  28. Fine AB, & Florian Jaeger T (2013). Evidence for Implicit Learning in Syntactic Comprehension. Cognitive Science, 37(3), 578–591. 10.1111/cogs.12022 [DOI] [PubMed] [Google Scholar]
  29. Frost R (2012). Towards a universal model of reading. Behavioral and Brain Sciences, 35(5), 263–279. 10.1017/S0140525X11001841 [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Frost R, Armstrong BC, & Christiansen MH (2019). Statistical Learning Research: A Critical Review and Possible New Directions. Psychological Bulletin, 145(12), 1128–1153. 10.1037/bul0000210 [DOI] [PubMed] [Google Scholar]
  31. Frost R, Armstrong BC, Siegelman N, & Christiansen MH (2015). Domain generality versus modality specificity: the paradox of statistical learning. Trends in Cognitive Sciences, 19(3), 117–125. 10.1016/j.tics.2014.12.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Frost R, Siegelman N, Narkiss A, & Afek L (2013). What predicts successful literacy acquisition in a second language? Psychological Science, 24(7), 1243–1252. 10.1177/0956797612472207 [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Gabay Y, Thiessen ED, & Holt LL (2015). Impaired statistical learning in developmental dyslexia. Journal of Speech, Language, and Hearing Research, 58, 934–945. 10.1044/2015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Gingras M, & Sénéchal M (2019). Evidence of Statistical Learning of Orthographic Representations in Grades 1–5: The Case of Silent Letters and Double Consonants in French. Scientific Studies of Reading, 23(1), 37–48. 10.1080/10888438.2018.1482303 [DOI] [Google Scholar]
  35. Glushko RJ (1979). The organization and activation of orthographic knowledge in reading aloud. Journal of Experimental Psychology: Human Perception and Performance, 5(4), 674–691. 10.1037/0096-1523.5.4.674 [DOI] [Google Scholar]
  36. Grainger J, Colé P, & Segui J (1991). Masked morphological priming in visual word recognition. Journal of Memory and Language, 30, 370–384. 10.1016/0749-596X(91)90042-I [DOI] [Google Scholar]
  37. Graves WW, Binder JR, Desai RH, Humphries C, Stengel BC, & Seidenberg MS (2014). Anatomy is strategy: Skilled reading differences associated with structural connectivity differences in the reading network. Brain and Language, 133, 1–13. 10.1016/j.bandl.2014.03.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Harm MW, McCandliss BD, & Seidenberg MS (2003). Modeling the Successes and Failures of Interventions for Disabled Readers. Scientific Studies of Reading, 7(2), 155–182. 10.1207/s1532799xssr0702_3 [DOI] [Google Scholar]
  39. Harm MW, & Seidenberg MS (1999). Phonology, reading acquisition, and dyslexia: insights from connectionist models. Psychological Review, 106, 491–528. 10.1037/0033-295X.106.3.491 [DOI] [PubMed] [Google Scholar]
  40. Harm MW, & Seidenberg MS (2004). Computing the meanings of words in reading: cooperative division of labor between visual and phonological processes. Psychological Review, 111, 662–720. 10.1037/0033-295X.111.3.662 [DOI] [PubMed] [Google Scholar]
  41. Hoffman P, Lambon Ralph MA, & Woollams AM (2015). Triangulation of the neurocomputational architecture underpinning reading aloud. Proceedings of the National Academy of Sciences, 112(28), E3719–E3728. 10.1073/pnas.1502032112 [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Jaeger TF (2008). Categorical data analysis: Away from ANOVAs (transformation or not) and towards logit mixed models. Journal of Memory and Language, 59(4), 434–446. 10.1016/j.jml.2007.11.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Jared D (2002). Spelling-sound consistency and regularity effects in word naming. Journal of Memory and Language, 46(4), 723–750. 10.1006/jmla.2001.2827 [DOI] [Google Scholar]
  44. Jared D, McRae K, & Seidenberg MS (1990). The basis of consistency effects in word naming. Journal of Memory and Language, 29(6), 687–715. 10.1016/0749-596X(90)90044-Z [DOI] [Google Scholar]
  45. Kahta S, & Schiff R (2019). Deficits in statistical leaning of auditory sequences among adults with dyslexia. Dyslexia, 25(2), 142–157. 10.1002/dys.1618 [DOI] [PubMed] [Google Scholar]
  46. Kearns DM, Rogers HJ, Koriakin T, & Al Ghanem R (2016). Semantic and Phonological Ability to Adjust Recoding: A Unique Correlate of Word Reading Skill? Scientific Studies of Reading, 20(6), 455–470. 10.1080/10888438.2016.1217865 [DOI] [Google Scholar]
  47. Keenan JM, & Betjemann RS (2008). Comprehension of single words: The role of semantics in word identification and reading disability In Grigorenko EL & Naples AJ (Eds.), Single-word reading: Behavioral and biological perspectives. (pp. 191–209). Denver, CO: Lawrence Erlbaum Associates Publishers. [Google Scholar]
  48. Kuperman V, & Van Dyke JA (2013). Reassessing word frequency as a determinant of word recognition for skilled and unskilled readers. Journal of Experimental Psychology: Human Perception and Performance, 39(3), 802–823. 10.1037/a0030859 [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Laing E, & Hulme C (1999). Phonological and Semantic Processes Influence Beginning Readers’ Ability to Learn to Read Words. Journal of Experimental Child Psychology, 73, 183–207. 10.1006/jecp.1999.2500 [DOI] [PubMed] [Google Scholar]
  50. Long JA (2019). Interactions: Comprehensive, user-friendly toolkit for probing interactions (R package version 1.1.0). Retrieved from https://cran.r-project.org/package=interactions
  51. Marelli M, & Amenta S (2018). A database of orthography-semantics consistency (OSC) estimates for 15,017 English words. Behavior Research Methods, 50(4), 1482–1495. 10.3758/s13428-018-1017-8 [DOI] [PubMed] [Google Scholar]
  52. Maye J, Werker JF, & Gerken L (2002). Infant sensitivity to distributional information can affect phonetic discrimination. Cognition, 82(3), 101–111. 10.1016/S0010-0277(01)00157-3 [DOI] [PubMed] [Google Scholar]
  53. McClelland JL, & Rogers TT (2003). The parallel distributed processing approach to semantic cognition. Nature Reviews. Neuroscience, 4(4), 310–322. 10.1038/nrn1076 [DOI] [PubMed] [Google Scholar]
  54. Monaghan P, Chater N, & Christiansen MH (2005). The differential role of phonological and distributional cues in grammatical categorisation. Cognition, 96(2), 143–182. 10.1016/j.cognition.2004.09.001 [DOI] [PubMed] [Google Scholar]
  55. Nation K, & Snowling MJ (1998). Semantic processing and the development of word-recognition skills: Evidence from children with reading comprehension difficulties. Journal of Memory and Language, 39, 85–101. 10.1006/jmla.1998.2564 [DOI] [Google Scholar]
  56. Nation K, & Snowling MJ (2004). Beyond phonological skills: Broader language skills contribute to the development of reading. Journal of Research in Reading, 27(4), 342–356. 10.1111/j.1467-9817.2004.00238.x [DOI] [Google Scholar]
  57. Newell KM (1991). Motor Skill Acquisition. Annual Review of Psychology, 42, 213–237. 10.1146/annurev.ps.42.020191.001241 [DOI] [PubMed] [Google Scholar]
  58. Paivio A, Yuille JC, & Madigan SA (1968). Concreteness, imagery, and meaningfulness values for 925 nouns. Journal of Experimental Psychology, 76(1), 1–25. 10.1037/h0025327 [DOI] [PubMed] [Google Scholar]
  59. Perfetti CA, Beck I, Bell LC, & Hughes C (1987). Phonemic knowledge and learning to read are reciprocal: A longitudinal study of first grade children. Merrill-Palmer Quarterly, 33, 283–319. [Google Scholar]
  60. Plaut DC, McClelland JL, Seidenberg MS, & Patterson K (1996). Understanding normal and impaired word reading: computational principles in quasi-regular domains. Psychological Review, 103(1), 56–115. 10.1037/0033-295X.103.1.56 [DOI] [PubMed] [Google Scholar]
  61. Qi Z, Sanchez Araujo Y, Georgan WC, Gabrieli JDE, & Arciuli J (2019). Hearing Matters More Than Seeing: A Cross-Modality Study of Statistical Learning and Reading Ability. Scientific Studies of Reading, 23(1), 101–115. 10.1080/10888438.2018.1485680 [DOI] [Google Scholar]
  62. Rastle K, Davis MH, Marslen-Wilson WD, & Tyler LK (2000). Morphological and semantic effects in visual word recognition: A time-course study. Language and Cognitive Processes, 15(4), 507–537. 10.1080/01690960050119689 [DOI] [Google Scholar]
  63. Romberg AR, & Saffran JR (2010). Statistical learning and language acquisition. Wiley Interdisciplinary Reviews. Cognitive Science, 1, 906–914. 10.1002/wcs.78 [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Rueckl JG (2010). Connectionism and the role of morphology in visual word recognition. The Mental Lexicon, 5(3), 371–400. 10.1075/ml.5.3.07rue [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Rueckl JG, Zevin JD, & Wolf VII H (2019). Using computational techniques to model and better understand developmental word-reading disorders (i.e., dyslexia) In Washington J, Compton D, & McCardle P (Eds.), Dyslexia: Revisiting Etiology, Diagnosis, Treatment, and Policy. Baltimore, MD: Brookes Publishing Co. [Google Scholar]
  66. Rumelhart DE, & McClelland JL (1986). On learning the past tenses of English verbs In McClelland JL & Rumelhart DE (Eds.), Parallel Distributed Processing: Explorations in the Microstructure of Cognition (Vol. 2) (pp. 216–271). Cambridge, MA: MIT Press. [Google Scholar]
  67. Saffran JR, Aslin RN, & Newport EL (1996). Statistical Learning by 8-Month-Old Infants. Science, 274(5294), 1926–1928. 10.1126/science.274.5294.1926 [DOI] [PubMed] [Google Scholar]
  68. Salimpoor VN, Zald DH, Zatorre RJ, Dagher A, & McIntosh AR (2015). Predictions and the brain: How musical sounds become rewarding. Trends in Cognitive Sciences, 19(2), 86–91. 10.1016/j.tics.2014.12.001 [DOI] [PubMed] [Google Scholar]
  69. Sawi OM, & Rueckl J (2019). Reading and the Neurocognitive Bases of Statistical Learning. Scientific Studies of Reading, 23(1), 8–23. 10.1080/10888438.2018.1457681 [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Scarborough HS (1998). Predicting the future achievement of second graders with reading disabilities: Contributions of phonemic awareness, verbal memory, rapid naming, and IQ. Annals of Dyslexia, 48(1), 115–136. 10.1007/s11881-998-0006-5 [DOI] [Google Scholar]
  71. Schilling HEH, Rayner K, & Chumbley JI (1998). Comparing naming, lexical decision, and eye fixation times: Word frequency effects and individual differences. Memory and Cognition, 26(6), 1270–1281. 10.3758/BF03201199 [DOI] [PubMed] [Google Scholar]
  72. Schmalz X, Altoè G, & Mulatti C (2017). Statistical learning and dyslexia: a systematic review. Annals of Dyslexia, 67(2), 147–162. 10.1007/s11881-016-0136-0 [DOI] [PubMed] [Google Scholar]
  73. Schmalz X, Moll K, Mulatti C, & Schulte-Körne G (2019). Is Statistical Learning Ability Related to Reading Ability, and If So, Why? Scientific Studies of Reading, 23(1), 64–76. 10.1080/10888438.2018.1482304 [DOI] [Google Scholar]
  74. Schwanenflugel PJ, & Noyes CR (1996). Context Availability and the Development of Word Reading Skill. Journal of Literacy Research, 28(1), 35–54. 10.1080/10862969609547909 [DOI] [Google Scholar]
  75. Seidenberg MS (1985). The time course of phonological code activation in two writing systems. Cognition, 19(1), 1–30. 10.1016/0010-0277(85)90029-0 [DOI] [PubMed] [Google Scholar]
  76. Seidenberg MS (2011). Reading in different writing systems: One architecture, multiple solutions In McCardle P, Miller B, Lee JR, & Tzeng OJL (Eds.), yslexia across languages: Orthography and the brain–gene–behavior link (pp. 146–168). Baltimore, MD: Paul H Brookes Publishing. [Google Scholar]
  77. Seidenberg MS, & Gonnerman LM (2000). Explaining derivational morphology as the convergence of codes. Trends in Cognitive Sciences, 4(9), 353–361. 10.1016/S1364-6613(00)01515-1 [DOI] [PubMed] [Google Scholar]
  78. Seidenberg MS, & McClelland JL (1989). A Distributed, Developmental Model of Word Recognition and Naming. Psychological Review, 96(4), 523–568. 10.1037/0033-295X.96.4.523 [DOI] [PubMed] [Google Scholar]
  79. Sénéchal M, Gingras M, & L’Heureux L (2016). Modeling Spelling Acquisition: The Effect of Orthographic Regularities on Silent-Letter Representations. Scientific Studies of Reading, 20(2), 155–162. 10.1080/10888438.2015.1098650 [DOI] [Google Scholar]
  80. Shankweiler D, Lundquist E, Katz L, Stuebing KK, Fletcher JM, Brady S, … Shaywitz BA (1999). Comprehension and Decoding: Patterns of Association in Children With Reading Difficulties. Scientific Studies of Reading, 3(1), 69–94. 10.1207/s1532799xssr0301_4 [DOI] [Google Scholar]
  81. Share DL (1999). Phonological Recoding and Orthographic Learning: A Direct Test of the Self-Teaching Hypothesis. Journal of Experimental Child Psychology, 72(2), 95–129. 10.1006/jecp.1998.2481 [DOI] [PubMed] [Google Scholar]
  82. Siegelman N, Bogaerts L, Christiansen MH, & Frost R (2017). Towards a theory of individual differences in statistical learning. Philosophical Transactions of the Royal Society B: Biological Sciences, 372(1711). 10.1098/rstb.2016.0059 [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Siegelman N, & Frost R (2015). Statistical learning as an individual ability: Theoretical perspectives and empirical evidence. Journal of Memory and Language, 81, 105–120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  84. Siegelman N, Kearns DM, & Rueckl JG (2020). Using information-theoretic measures to characterize the structure of the writing system: the case of orthographic-phonological regularities in English. Behavior Research Methods, 1–21. 10.3758/s13428-019-01317-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  85. Sigurdardottir HM, Danielsdottir HB, Gudmundsdottir M, Hjartarson KH, Thorarinsdottir EA, & Kristjánsson Á (2017). Problems with visual statistical learning in developmental dyslexia. Scientific Reports, 7(1), 606 10.1038/s41598-017-00554-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  86. Spencer M, Kaschak MP, Jones JL, & Lonigan CJ (2014). Statistical learning is related to early literacy-related skills. Reading and Writing, 28(4), 467–490. 10.1007/s11145-014-9533-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. Steacy LM, & Compton DL (2019). Examining the role of imageability and regularity in word reading accuracy and learning efficiency among first and second graders at risk for reading disabilities. Journal of Experimental Child Psychology, 178, 226–250. 10.1016/j.jecp.2018.09.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Steacy LM, Compton DL, Petscher Y, Elliott JD, Smith K, Rueckl JG, … Pugh KR (2018). Development and prediction of context-dependent vowel pronunciation in elementary readers. Scientific Studies of Reading, 23(1). 10.1080/10888438.2018.1466303 [DOI] [PMC free article] [PubMed] [Google Scholar]
  89. Strain E, & Herdman CM (1999). Imageability effects in word naming: An individual differences analysis. Canadian Journal of Experimental Psychology, 53(4), 347–359. 10.1037/h0087322 [DOI] [PubMed] [Google Scholar]
  90. Strain E, Patterson K, & Seidenberg MS (1995). Semantic Effects in Single-Word Naming. Journal of Experimental Psychology: Learning, Memory, and Cognition, 21(5), 1140–1154. 10.1037/0278-7393.21.5.1140 [DOI] [PubMed] [Google Scholar]
  91. Torkildsen J. von K., Arciuli J, & Wie OB (2019). Individual differences in statistical learning predict children’s reading ability in a semi-transparent orthography. Learning and Individual Differences, 69, 60–68. 10.1016/J.LINDIF.2018.11.003 [DOI] [Google Scholar]
  92. Treiman R, & Kessler B (2006). Spelling as Statistical Learning: Using Consonantal Context to Spell Vowels. Journal of Educational Psychology, 98(3), 642–652. 10.1037/0022-0663.98.3.642 [DOI] [Google Scholar]
  93. Treiman R, Mullennix J, Bijeljac-Babic R, & Richmond-Welty ED (1995). The Special Role of Rimes in the Description, Use, and Acquisition of English Orthography. Journal of Experimental Psychology: General, 124(2), 107–136. 10.1037/0096-3445.124.2.107 [DOI] [PubMed] [Google Scholar]
  94. Ulicheva A, Harvey H, Aronoff M, & Rastle K (2018). Skilled readers’ sensitivity to meaningful regularities in English writing. Cognition. 10.1016/j.cognition.2018.09.013 [DOI] [PubMed] [Google Scholar]
  95. Vellutino FR, Fletcher JM, Snowling MJ, & Scanlon DM (2004). Specific reading disability (dyslexia): What have we learned in the past four decades? Journal of Child Psychology and Psychiatry and Allied Disciplines, 45(1), 2–40. 10.1046/j.0021-9630.2003.00305.x [DOI] [PubMed] [Google Scholar]
  96. Wechsler D (2011). Wechsler Abbreviated Scale of Intelligence, Second Edition (WASI-II). San Antonio, TX: NCS Pearson. [Google Scholar]
  97. Weekes BS, Castles AE, & Davies RA (2006). Effects of consistency and age of acquisition on reading and spelling among developing readers. Reading and Writing, 19(2), 133–169. 10.1007/s11145-005-2032-6 [DOI] [Google Scholar]
  98. West G, Vadillo MA, Shanks DR, & Hulme C (2018). The procedural learning deficit hypothesis of language learning disorders: we see some problems. Developmental Science, 21(2). 10.1111/desc.12552 [DOI] [PMC free article] [PubMed] [Google Scholar]
  99. Wood SN (2011). Fast stable restricted maximum likelihood and marginal likelihood estimation of semiparametric generalized linear models. Journal of the Royal Statistical Society. Series B: Statistical Methodology, 73(1), 3–36. 10.1111/j.1467-9868.2010.00749.x [DOI] [Google Scholar]
  100. Woodcock RW, McGrew KS, & Mather N (2001). Woodcock-Johnson III Tests of Achievement. Itasca, IL: Riverside Publishing. [Google Scholar]
  101. Woollams AM, Lambon Ralph MA, Madrid G, & Patterson KE (2016). Do you read how I read? Systematic individual differences in semantic reliance amongst normal readers. Frontiers in Psychology, 7, 1757 10.3389/fpsyg.2016.01757 [DOI] [PMC free article] [PubMed] [Google Scholar]
  102. Yap MJ, Balota DA, Sibley DE, & Ratcliff R (2012). Individual differences in visual word recognition: Insights from the English Lexicon Project. Journal of Experimental Psychology: Human Perception and Performance, 38(1), 53–79. 10.1037/a0024177 [DOI] [PMC free article] [PubMed] [Google Scholar]
  103. Zeno S, Ivens SH, Millard RT, & Duvvuri R (1995). The educator’s word frequency guide. Brewster, NY: Touchstone Applied Science Associates. [Google Scholar]
  104. Zevin JD, & Seidenberg MS (2006). Simulating consistency effects and individual differences in nonword naming: A comparison of current models. Journal of Memory and Language, 54(2), 145–160. 10.1016/j.jml.2005.08.002 [DOI] [Google Scholar]

RESOURCES