Abstract
Evidence shows that skilled readers extract information about upcoming words in the parafovea. Using the boundary paradigm, we investigated native Arabic readers’ processing of orthographic, morphological, and semantic information available parafoveally. Target words were embedded in frame sentences, and prior to readers fixating them, one of the following previews were made available: (a) Identity preview; (b) Preview that shared the pattern morpheme with the target; (c) Preview that shared the root morpheme with the target; (d) Preview that was a synonym with the target word; (e) Preview with two of the root letters were transposed thus creating a new root, while preserving all letter identities of the target; (f) Preview with two of the root letters were transposed thus creating a pronounceable pseudo root, while also preserving all letter identities of the target; and (g) Previews that was unrelated to the target word and shared no information with it. The results showed that identity, root-preserving, and synonymous preview conditions yielded preview benefit. On the other hand, no benefit was obtained from the pattern-preserving previews, and significant disruption to processing was obtained from the previews that contained transposed root letters, particularly when this letter transposition created a new real root. The results thus reflect Arabic readers’ dependance on morphological and semantic information, and suggest that these levels of representation are accessed as early as orthographic information. Implications for theory- and model-building, and the need to accommodate early morphological and semantic processing activities in more comprehensive models are further discussed.
Introduction
Arabic is a Semitic language that is read from right to left. It features Semitic morphology where words are built from non-concatenated combination of root and pattern morphemes. That is, the root letters are typically diffuse within the word and can be interrupted by inserting letters from the pattern morpheme (so-called infixes, e.g., مكتوب /mktub/ is written, where the root كتب /ktb/ is interrupted by the letter و /u/ of the pattern morpheme /مـ ـ ـ و - / /m_ _ u _/, see e.g., [1–3]). The root morpheme indicates the main semantic family to which the word belongs (e.g., in the example above, the root كتب /ktb/ refers predominantly to writing-related meanings), whereas the pattern morpheme provides the detailed phonological and syntactic information that are necessary for complete and accurate word identification (e.g., [2]). Therefore, the tightly-knitted orthographic, morphological and semantic information within words in Arabic make it a very interesting medium to investigate how this information is extracted from the parafovea during sentence reading.
A brief overview of ortho-morphological and semantic processing in Arabic
In alphabetic orthographies, like English and other European languages, evidence shows that letter identity and position information are important variables in word identification. Evidence from transposed letter (TL) studies suggested that these two variables are encoded independently during early lexical processing. Typically, primes with transposed letter order (thus preserving letter identity but not letter position information) result in facilitation in identifying the target word comparable with identity primes (e.g., JUGDE as a prime for judge), suggesting that there is a good deal of flexibility in letter position encoding. In single word paradigms, TL prime benefits are widely reported in languages such as English, French, Dutch, and Spanish [4–14]. In these languages morphology is sequential and concatenated in nature, and lexical organization is believed to be orthography-based, with entries that share letters, or are orthographic neighbors being clustered together in the lexicon (see [15] for review). Findings from these European languages informed the construction of multiple word identification models, where flexible letter position encoding was thought to be a fundamental feature of the cognitive system (e.g., SOLAR model [16]; Spatial Coding model [17–19]; the SERIOL model [20–23]).
Findings from Semitic languages such as Arabic and Hebrew did not, however, conform with these findings. The root morphemes in Semitic words were found to play an important role in Arabic word identification and in lexical organization, with words that share the same root being clustered together [15,24–35]. For example, primes that shared root information with the target resulted in faster identification of the target, regardless of whether the semantic relationship between the prime and target was transparent (e.g., كتابة /kitaabat/ writing and كاتب /katib/ writer), or not (e.g., كتيبة /katibat/ squadron and كاتب /katib/ writer, see [1,36,37]). Importantly, robust findings showed clearly that transposing root letter order in primes (thus preserving root letter identities, but not their order), did not yield a processing benefit as was found in European languages [33,38–40]. Rather, significant processing costs were reported, especially when transposing the root letters created another existing real root, the activation of which through priming is thought to interfere with processing the actual root present in the target [41]. In Arabic transposing root letters results in creating new roots that exist in the lexicon around 54% of the time (e.g., the root بدل /bdl/ switched becomes بلد /bld/ dulled/numbed with transposing the second and third letters, see [15,29,38] for reviews). By contrast, transposing letters in English was estimated to result in creating new words on only 7% of the time [38]. The rigidity in letter position encoding in Semitic languages, compared to European languages, is thus understood to be a product of the orthographic density of Semitic languages ([15] for review).
With regards to word patterns, as mentioned earlier, Arabic words feature pattern morphemes that are interwoven, or infixed into the letters of the root morpheme. Compared to word roots, the exact role of word patterns in word identification, and the time course of processing word patterns remains to be fully delineated. In Hebrew, for instance, whereas verbal word patterns resulted in priming facilitation, nominal word patterns did not [32]. This was attributed to the fact that whereas Hebrew has over one hundred nominal word patterns [28], there are only seven verbal patterns [42], with the smaller number of the latter helping narrow down potential word candidates. On the other hand, investigations of pattern processing in Arabic reported benefit from both verbal and nominal pattern-preserving primes [1,2,36,43,44]. Furthermore, the benefit from pattern-preserving primes in Arabic were found to be influenced by variables such as stimulus onset asynchronies (SOA, or the delay between the onset of the masked prime and the onset of the target, [2]). Interestingly, the benefit from pattern-preserving primes in Arabic were also found to be influenced by root-related variables such as the presence of a transparent semantic relation between the root of the prime and the target; and the productivity (family size) of the root of the target word [44]. Boudelaa and Marslen-Wilson [44] suggested that the difference in how root and pattern information are processed may be influenced by the fact that the Semitic consonantal roots in Arabic are always fully specified, that is, present in print. By contrast, pattern information is mostly only partially specified, given that pattern morphemes often incorporate vowels that are typically represented by diacritics, and that these diacritics are typically omitted from print [45].
Arabic readers were also found to extract semantic information from primes such that facilitation for processing target words was obtained when these targets were preceded by semantically-related primes (e.g., سرير bed as prime for وسادة pillow) relative to when the prime and target were not semantically related (e.g., عسكري military as prime for وسادة pillow, see [46,47]). Interestingly, in the task presented by Mountaj et al. [46] the prime remained on the screen for 500 ms. After another 300 ms blank screen, the participants were required to decide whether the prime and target were semantically related. Thus, more remains to be learnt about whether Arabic readers extract meaning from upcoming words during sentence reading (i.e., in time intervals that are typically shorter than what was allowed by Mountaj et al.).
Parafoveal processing in sentence reading
I. Preview benefit and processing of orthographic information in the parafovea
A great deal of evidence from eye movement investigations has established that during a fixation, readers process the fixated word as well as pre-process the upcoming word in the parafovea. Parafoveal pre-processing is thought to resemble the partial processing that takes place in priming [48–51]. The boundary paradigm (Rayner, 1975) [53] is typically used to investigate the extent to which readers extract information from the upcoming word. In this paradigm, an invisible boundary is inserted in the text immediately before a target word. Prior to crossing this boundary, the reader is presented with a preview of the upcoming word that may, or may not, be identical to the target word, or that may share certain linguistic characteristics with it (e.g., orthographic, bench as preview of beach). As the reader’s eyes saccade across that boundary, forward into the text, the display changes and the target word is displayed correctly when the reader fixates it. Importantly, the reader is typically unaware of the display change because of the suppression of visual processing during saccades [52]. Available evidence shows that when readers are given a valid (i.e., identical) parafoveal preview of the upcoming word, fixation durations on this word are reduced—the so-called preview benefit, compared to when the previews are not valid (e.g., the string dmaeb as a preview of beach, e.g., [53,54]. Schotter [50] suggested that preview benefit effects are due to costs to processing being less severe when the preview and target are similar, that is, share relevant sublexical information, compared to when they are dissimilar. Indeed, investigations in many languages reported that giving readers parafoveal previews that share orthographic and/or phonological information with the targets results in preview benefit, relative to when previews lack such information (e.g., [55–60]).
II. Morphological processing in the parafovea: The case of Semitic roots and patterns
In studies that aimed to investigate readers’ parafoveal processing of morphology, readers are typically presented with previews that shared a morpheme with the upcoming target word. In English and other European languages, the results showed no greater benefit from morphologically-related previews relative to orthographic control previews that shared as many letters with the target (see [61–63], and [64] for review). By contrast, findings from parafoveal processing of Hebrew morphology reported that previews that shared the Semitic root with the target resulted in processing benefit (i.e., reduced fixation durations) for the target, compared to orthographic control previews [28,65].
Furthermore, additional findings further supported the idea of rigid letter position encoding in Semitic languages. Specifically, in non-Semitic languages, parafoveal previews that featured transposed letters of the target word (e.g., cpatain as preview of captain) yielded the typical preview benefit once that target word was fixated (see e.g., [6,66]; and also in children reading [67]), akin to what was reported in single word paradigms (see above). By contrast, and in line with the above discussion, disrupting the order of the Semitic root letters in the parafoveal preview did not lead to processing benefit, rather, it results in inflated fixation durations on the target words [68].
Parafoveal processing of Semitic word patterns was also investigated. Deutsch, Frost, Pollatsek, and Rayner [69] reported facilitation in Hebrew word identification when the parafoveal preview shared the pattern morpheme with the target. However, this was only the case for verbal, not nominal, patterns, thus replicating the findings discussed above from single word processing tasks in Hebrew. However, a more recent investigation using both the boundary paradigm and fast priming techniques, Deutsch, Velan, and Michaly [70] reported that readers do benefit from nominal word patterns in Hebrew. The recorded facilitation represented a subtle effect that took place early during word identification (on average 8 ms in first fixation duration and 17 ms in gaze duration).
III. Processing of Semantic information in the Parafovea
Semantic processing was typically presumed to take place later, subsequent to processing other word representations (e.g., orthography and phonology). Given this, and because parafoveal processing, by definition, takes place very early prior to fixating the target word, the absence of evidence for semantic preview benefit in studies of parafoveal processing was not considered surprising [71–74]. Later studies, however, found evidence for semantic preview benefit in German and in Chinese, where semantically related previews resulted in more facilitation in target word processing relative to previews that were not semantically related to the target [75–78]. Differences in orthographic transparency between the languages (German being more transparent in its letter-sound correspondences than English), and the nature of ortho-semantic relationship (being closer for Chinese compared to English, with the semantically transparent Chinese radicals) were suggested as potential explanations for German and Chinese readers’ ability to access semantic information parafoveally. By contrast, English readers were suggested not to reach this depth of processing until later stages of word identification [51,79].
In later investigations, researchers reported that readers of English also perform parafoveal semantic processing. In one investigation, preview benefit was obtained from synonymous parafoveal previews (e.g., video as preview of movie, [80]), while no significant benefit was obtained from previews that were only semantically related to the target (e.g., audio as preview of movie). The benefit obtained from the closeness of the semantic relationship between the preview and target in English was found to be modulated by visual and orthographic word-level variables such as initial letter capitalization in the previews [81]. This benefit was also found to be modulated by sentence level variables such as the degree of sematic constraining of pre-target context (e.g., the word scrub after the neutral context “My roommate will continuously …” vs. the constraining context “For hygienic purposes, doctors must …,” see [82]); and the plausibility of the preview in the sentence context (e.g., plausible “… scared that a freak twister/ tsunami …” vs. implausible “… scared that a freak booster …”, see [83–85], also [78,86] for similar findings in Chinese). As yet, processing semantic information parafoveally during reading Arabic has not been investigated.
The current research
In the study reported here we aimed to examine the parafoveal processing of orthography, morphology and semantics in Arabic sentence reading. As illustrated above, the tight knitting of these levels of representation in Arabic words makes it a fertile medium for studying readers’ access and processing of the information conveyed in printed text in a comprehensive way. There are numerous outstanding questions regarding the extent, time course, and effects of accessing various ortho-morphological, and semantic information parafoveally. Specifically, we aimed to investigate the following: (a) Parafoveal processing of Arabic orthography: Replicating the basic findings that identity previews result in preview benefit, relative to orthographically unrelated previews; (b) Parafoveal ortho-morphological processing: The effects of transposing root letters in the parafoveal preview, when the letter transposition creates a new (real) root, and when it creates a pseudo root (pronounceable but meaningless), while preserving letter identities in both cases; (c) Morphological parafoveal processing: The effects of providing readers with parafoveal previews that preserve the root morpheme, and previews that preserve the pattern morpheme; and finally, (d) Semantic processing in the parafovea: Exploring whether readers would obtain preview benefit if the preview was a synonymous word with the target. The experimental conditions and statistical contrasts we utilized to investigate these questions are detailed in the following sections.
Based on the previous findings discussed above, we expected to obtain the classic preview benefit for identity previews, and benefits from root-preserving previews. Similarly, we expected that transposing root letters in the previews would result in processing costs, particularly when the letter transpositions create a new real root, given the rigid letter position encoding in Semitic languages (e.g., [15,29,38]). By including both TL preview conditions, however, we aimed to learn more about the costs of creating a TL real root—a lexical competitor to the root embedded in the actual target, compared to a TL pseudo root that arguably does not generate lexical competition to the root embedded in the actual target. Additionally, we had no clear expectations for the effects of pattern-preserving previews, given the inconsistent findings obtained so far in silent reading and single word tasks. Indeed, pattern-based processing facilitation reported in studies of Hebrew [69–70] have yet to be replicated in Arabic, where, so far the reported effects of preserving word patterns seem to be modulated by experimental variables (e.g., SOA, see [2] and discussion above), and root-related variables (e.g., root productivity, see conclusions in [44]). Similarly, we had no clear expectations of whether previews that are semantically identical to the targets (i.e., synonyms) would yield preview benefit. However, in the light of previous findings, it is plausible that Arabic readers may extract semantic information parafoveally, perhaps by virtue of the transparency of the Arabic consonants’ letter-sound correspondences that would facilitate earlier access to meaning (e.g., [80]).
Method
Participants
A total of seventy-seven adult native Arabic readers, all undergraduates at Zayed University, UAE, took part in the eye tracking procedure (M = 21.4, SD = 2.1, Range = 18–24). All participants had normal or correct to normal vision as determined by the Bailey-Lovie chart [87]. Participants received AED20 (~ $5.30) coffeeshop vouchers as incentive for participation.
For stimuli norming (more details below), a total of 15 participants were recruited from the same population (undergraduate, native Arabic readers). They also received AED20 coffeeshop vouchers for participation. These participants, however, did not take part in the eye tracking procedure.
Stimuli
Sixty-three target words were embedded in frame sentences (see Fig 1 for an example). The target words were 4–8 letters long (M = 5.4, SD = 1.1) and each contained a 3-letter root. For the target words, and all words used as previews, Table 1 provides orthographic frequency (counts per million, CPM) in the Aralex database [88], as well as ratings of words’ and roots’ commonness of occurrence in the language. The commonness of occurrence ratings for each word and each root were collected from 5 participants who did not take part in the eye tracking procedure on a 5-point scale (1 indicating very rare occurrence, 5 indicating very common occurrence).
Fig 1. Sample stimuli set.
The dashed line indicated the location of the invisible boundary, followed by the different preview strings in the different conditions.
Table 1. Preview conditions details and an example.
| Preview Condition | Example | Translation | Lexicality | Average Number Shared Letters with Target (SD) | Average Word Frequency CPM in Aralex (SD) | Average Root Token Frequency CPM in Aralex (SD) | Average Pattern Token Frequency CPM in Aralex (SD) | Average Ratings of word Commonness of Occurrence (SD) | Average Ratings of Root Commonness of Occurrence (SD) |
|---|---|---|---|---|---|---|---|---|---|
| Identity | تبديل/tbdil/ | Switching/Change | Word | All | 12.6 (40.4) | 741.4 (1070.0) | 15307.1 (11559.6) | 4.1 (0.6) | 4.4 (0.7) |
| Pattern | تعمير/tʕmir/ | Constructing | Word | 2.6 (1.1) | 29.7 (77.1) | 1059.1 (1380.7) | 15307.1 (11559.6) | 3.9 (0.7) | 4.3 (0.6) |
| Root | بدائل/bdaal/ | Alternatives | Word | 3.6 (0.7) | 6.2 (13.4) | 741.4 (1070.0) | 16594.7 (11966.0) | 4.2 (0.5) | 4.0 (0.6) |
| TL New Root | تبليد/tblid/ | Dulling/Numbing | Word | All | 8.7 (23.9) | 517.7 (837.1) | 15307.1 (11559.6) | 4.2 (0.7) | 4.5 (0.7) |
| TL Non-root | تلديب/tldib/ | NA | Pseudo Word | All | NA | NA | 15307.1 (11559.6) | NA | NA |
| Synonym | تغيير/tgyir/ | Change | Word | 2.9 (1.2) | 13.2 (38.1) | 953.8 (1559.0) | 15449.9 (12669.6) | 3.9 (0.8) | 4.4 (0.5) |
| Unrelated | مخطوف/mxtuf/ | Kidnapped [noun] | Word | 0 | 9.4 (22.6) | 667.9 (1112.1) | 14633.1 (14646.7) | 4.0 (0.7) | 4.3 (0.7) |
The root letters in the Example column are underlined for illustration. SD = standard deviation, CPM = raw counts per million in the Aralex corpus (Boudelaa & Marslen-Wilson, 2010 [88]).
To allow us to answer the research questions detailed above, seven parafoveal preview conditions were created for each of the target words. These were: (a) Identity preview, where the target word itself without any alterations was the preview; (b) Pattern-preserving preview, where the pattern of the target word was preserved and presented in the preview with a new root (29 patters, around 40%, were verbal, and the rest were nominal); (c) Root-preserving preview, where the root of the target word was preserved and combined with a new pattern; (d) Synonym preview, where the preview was a synonym of the target word, without sharing any root information with the target (i.e., the synonym was derived from a different root), the synonyms were chosen from the major nine Arabic language dictionaries (these are:لسان العرب، مقاييس اللغة، الصحاح في اللغة، القاموس المحيط، العباب الزاخر، مختار الصحاح، المعجم الوسيط، تاج العروس، ومعجم اللغة العربية المعاصرة. We used the electronic searchable versions of these dictionaries available at http://www.maajim.com and http://www.baheth.info); (e) TL New Root preview, where the position of two root letters was transposed, creating a new real root in a new real word; (f) TL Pseudo Root preview, where two root letters were transposed, creating a pseudo root, that is, a pronounceable but meaningless root in a pronounceable pseudo word, the created root and word did not correspond to any root entries in any of the major nine Arabic language dictionaries; and finally, (g) an Unrelated preview that shared no information with the target word.
Note that in the two TL conditions, the preview shared all target word letter identities, including the same unaltered word pattern (see e.g., [40]). Preserving the word pattern in the TL conditions meant that the manipulation is not contaminated by crossing morphemic boundaries between root and pattern [89]. Previous investigations reported that transposing non-adjacent letters may result in weakening or delaying the onset of the recorded transposed letter effect [90,91]. However, given the relative shortness of Arabic words compared to other languages (see discussions in [92,93]), we were compelled to use letter transpositions where the transposed letters were adjacent or separated by either one or two letters. Specifically, in the TL New-root condition, 44.4% of the letter transpositions involved adjacent letters, also about 44.4% involved letters separated by 1 letter, and the remaining (11.1%) contained transposed letters that were separated by 2 letters. In the TL Pseudo-root condition, 60.3% of the letter transpositions involved adjacent letters, 23.8% involved letters separated by 1 letter, and the remaining (15.9%) contained transposed letters that were separated by 2 letters.
In all stimuli sentences, the target words (and previews) always occurred near the middle of the sentence and were never the first or last three words of the sentence. The sentences contained, on average 68.7 characters (SD = 11.0). The sentences were displayed on a single line at the center of the monitor.
Stimuli matching and norming
The target word and its previews were matched on number of letters and spatial extent [94]. With the exception of the Unrelated and TL Pseudo Root previews, all preview conditions were of the same syntactic case as the target. We aimed to match the Pattern, the Root and Synonym previews, as much as possible, on how many letters they shared with the target (see Table 1). It was inevitable however that Root previews shared more letters with the target than the Pattern or Synonym previews (F(2,186) = 14.4, p < .001; post-hoc Tukey test for Root vs. Pattern p < .001; Root vs. Synonym p = .001; and Pattern vs. Synonym p = .23). This is because Root previews and the target may by necessity share the root letters plus one more letter that belongs to the pattern morpheme, given the relatively small number of letters that can be part of pattern morphemes [95].
We also aimed to match the target and previews on word, root and word pattern frequencies (based on Aralex database), as much as possible. In addition, we obtained subjective ratings of word and root commonness as additional indicators of frequency for the targets and previews (see Table 1). For all six preview conditions that were real words, there was no significant difference between log-transformed word orthographic frequencies (all Fs(5,325) < 1.7, all ps >.09). For root frequencies, only the difference between the Aralex log-transformed root token frequency of the Identity and TL New Root conditions reached significance (t = 3.0, p < .01). None of the remaining root token frequency contrasts were significant (all ts < 1.4, ps >.50). For pattern frequencies, only the difference between the Aralex log-transformed pattern token frequency of the Identity and the Unrelated previews reached significance (t = 2.7, p < .01). None of the remaining pattern token frequency contrasts were significant (all ts < 1.1, ps >.29). Importantly, participants’ ratings of commonness of word and root occurrence in the language did not differ significantly between any of the real word preview conditions (all Fs(5,325) < 1, all ps >.80).
As part of the stimuli norming, target and preview words’ predictability was determined using a cloze procedure. Five participants were given the stem sentences up to, and not including, the target word and were asked to produce the word they thought would occur next in the sentence. If the participants produced either the target word or any of its previews, the stem sentence was changed. In the sentences used as stimuli none of the target words or previews were produced by the participants, indicating that none of these words were predictable from the sentence stem (i.e., the target and preview words were produced on zero occasions by the participants). Finally, we obtained ratings of sentence structure naturalness for all target sentences containing the target word on a 5-point scale (1 = structure is highly unusual, 5 = structure is highly natural). 5 ratings per sentence were obtained from 5 participants, and these indicated that sentence structure for all stimuli in all conditions was highly natural with average ratings of 4.1 (SD = 0.5).
Apparatus
EyeLink 1000+ eye tracker was used to sample readers’ eye movements during reading. Viewing was binocular, but eye movements were recorded from the right eye only. The eye tracker sampling rate was set to 1000Hz. The eye tracker was tower-mounted, and interfaced with a Silverstone computer, and with a 24-inch BenQ monitor. Monitor resolution was set at 1920 × 1080 pixels, with the maximum vertical refresh rate (144Hz) to minimize display refresh time. The participants leaned on a headrest to reduce head movements. The sentences were displayed in black on a light grey background. The participants viewed the screen from 80 cm, and at this distance each character subtended approximately 0.14° of visual angle, and average number of pixels = 9.4 (SD = 1.1).
Design
The parafoveal preview condition was the within-participant independent variable. The order of sentence presentation was randomized, and the preview condition presentation was counterbalanced such that each participant saw each sentence only once, and only in one of the 7 preview conditions.
Procedure
This experiment was approved by Zayed University Research Ethics Committee. At the beginning of the testing session, the participants were given the consent form package (including information sheet). Consenting participants subsequently took part in a vision acuity test before taking part in the eye tracking procedure.
The eye tracker was calibrated using a horizontal 3-point calibration at the beginning of the experiment, and the calibration was validated. Calibration accuracy was always ≤ 0.25°, otherwise calibration and validation were repeated. Prior to the onset of the target sentence, a circular fixation target (diameter = 1°) appeared on the screen in the location of the first character of the sentence. When the tracker registered a stable fixation on the circle, the sentence was displayed.
The participants were told to read silently and press a button on the button box when finished reading each sentence. Additionally, they would periodically be required to use the button box to provide a yes/no answer to the questions that followed approximately one-third of the sentences. For the sentence given in Fig 1, for example, the question would be “The manager wanted to reward all the farmers: yes/no.” Before being exposed to the experimental sentences, the participants read 7 practice sentences (3 followed by yes/no questions) to become fully acquainted with the procedure. In total, thus, the participants read 70 sentences (63 experimental sentences + 7 practice sentences).
The participants were allowed to take breaks followed by re-calibration of the tracker. The testing session lasted around 30 minutes, depending on how many breaks a participant took.
Results
Seven participants were excluded from the analyses given that their sentence comprehension scores fell below 80%. Thus, the reported results are based on data collected from 70 participants. These participants had an average sentence comprehension score of 96% (SD = 3.2, range = 84–100%).
For all reported analyses, fixations with durations shorter than 80 ms, or longer than 800 ms were removed. However, fixations shorter than 80 ms that were located within 10 pixels or less from another longer fixation were merged with the longer fixation. Trials where blinks occurred were removed. In addition, we also removed trials where the display change occurred too early during a fixation on the pre-boundary (pre-target) word, or too late after the onset of a fixation on the target word. These procedures resulted in removing about 4.5% of all data points (a total of 4,211 trials remaining).
From the remaining dataset, to ensure that participants had access to high quality linguistic information in the parafovea, two further steps were implemented. First, only trials where the pre-boundary (pre-target) word was fixated immediately before fixating the target (i.e., not skipped) were included in the analyses. This resulted in removing about 6.2% of all data points (a total of 3,949 trials remaining). Subsequently, we removed all trials where the pre-target word was fixated but the fixation was too far from the target word. Specifically, fixations on the target word that had a launch site of ≥ 65 pixels (around 6.9 characters) from the space before the target word were removed. This resulted in removing around 5.4% of the remaining data points (a total of 3,735 trials remaining). Launch distance is an important variable when investigating parafoveal processing (e.g., [96]). This applies particularly to Arabic script [93,97] given its visual confusability and complexity (e.g., depending on very small, letter-part, identifying features of such as number and location of small dots, or the number of very short vertical strokes, etc.).
We report a number of eye movement measures on the target word region. The first measure is word skipping probability (the probability that the target word was not fixated during first pass reading). We also report measures of first pass reading, namely: First fixation duration (the duration of the first fixation on the target word, regardless of the number of fixations the word received overall); single fixation duration (the duration of the fixation on the target in instances where the target received exactly one fixation during sentence reading); and gaze duration (the sum of fixation durations the target word received during first pass reading and before exiting the target word to go forward or backwards in the text). We also report the later measure of total fixation time (the sum of all fixation durations the target word received in all passes). In addition, we also report analyses of spillover, or the duration of the first fixation on the post-target word, subsequent to fixating the target word. This allowed for determining whether there were any spillover effects of processing the parafoveal previews of the target. Table 2 contains the descriptive statistics for all reported measures for both the target word and the post-target (spillover) regions.
Table 2. Descriptive statistics of eye movement measures for the target and post-target regions.
| Preview Condition | ||||||||
|---|---|---|---|---|---|---|---|---|
| Identity | Pattern | Root | Synonym | TL New Root | TL Pseudo Root | Unrelated | ||
| Interest Region | Eye Movement Measure | Mean (SD) | Mean (SD) | Mean (SD) | Mean (SD) | Mean (SD) | Mean (SD) | Mean (SD) |
| Target Word | Skipping Probability | 0.011 (0.106) | 0.004 (0.061) | 0.010 (0.097) | 0.008 (0.086) | 0.004 (0.061) | 0.007 (0.086) | 0.009 (0.096) |
| First Fixation Duration (ms) | 278 (118) | 318 (142) | 279 (118) | 283 (105) | 360 (149) | 328 (149) | 320 (147) | |
| Single Fixation Duration (ms) | 281 (108) | 337 (143) | 285 (117) | 294 (107) | 404 (142) | 345 (141) | 337 (142) | |
| Gaze Duration (ms) | 386 (254) | 419 (244) | 383 (211) | 389 (207) | 475 (251) | 416 (233) | 423 (223) | |
| Total Fixation Time (ms) | 505 (365) | 570 (378) | 503 (314) | 514 (363) | 608 (343) | 559 (397) | 573 (395) | |
| Post-Target Word | First Fixation Duration (Spillover, ms) | 278 (119) | 284 (122) | 281 (114) | 284 (110) | 283 (129) | 283 (124) | 283 (123) |
In order to answer the research questions outlined above, we constructed three contrast matrices. The first two matrices were used as a first-stage analysis to answer the primary research questions. Specifically, in the first contrast matrix the Identity preview condition was prespecified as the baseline condition against which all other preview conditions were contrasted. In the second contrast matrix, the Unrelated preview condition was prespecified as the baseline condition to be contrasted with the other preview conditions (excluding the Identity preview given that the two conditions have already been contrasted in the first matrix). To explicitly map the contrasts from the first and second matrices on to the specific research questions outlined above: (a) Contrasting the Identity vs. Unrelated previews allowed us to answer the question regarding parafoveal orthographic processing in reading Arabic; (b) Contrasting the two TL conditions vs. both the Identity and Unrelated preview conditions allowed for investigating parafoveal ortho-morphological processing, and if these root letter transpositions in the preview would result in costs to processing the target, thus replicating previous findings; (c) Contrasting the Root and Pattern preview conditions to both the Identity and Unrelated previews allowed for investigating parafoveal morphological processing, namely the effect of the availability of these two morphological units in the parafovea; and finally (d) Contrasting the Synonym preview to both the Identity and Unrelated previews allowed for investigating semantic processing in the parafovea, by exploring whether readers would obtain preview benefit if the preview word was synonymous with the target.
The third contrast matrix was built to answer secondary questions through contrasting specific pairs of preview conditions. This matrix contained the following pre-specified contrasts: (i) Root vs. Pattern previews, to compare the effects of preserving these two morphological constituents in the parafoveal preview on target word processing; (ii) Root vs. Synonym previews, to compare the effects of preserving morphological (root, which also preserves partial semantic information) relative to preserving the full semantic information (synonym) parafoveally; and finally (iii) TL New Root vs. TL Pseudo Root, to investigate the potential costs of the lexical competition resulting from the letter transposition that creates a new existing root, relative to the letter transposition that merely disrupts root letter order, but does not create a lexical competitor.
We used the lme4 package (version 1.1–23, [98]) within the R environment for statistical computing [99] to analyze the raw fixation duration measures by fitting generalized linear mixed-effects models (GLMMs), with Gamma-distribution assumed for the fixation durations that were the dependent variables. Using GLMMs to analyze raw positively-skewed response times, including fixation durations, maintains the transparency of the reported analyses while satisfying the necessary normality assumptions, without the need to transform data [100]. In these models the preview condition was the fixed factor and subjects and items were the random variables. For word skipping probability we used GLMMs with a binomial-distribution assumed for the dependent variable. We always started by running models with maximal random structure [101]. We trimmed the models when failure to converge occurred, or when singular boundaries (suggesting overparameterization) were identified. All findings reported here are thus from successfully converging models. For each eye movement measure we report beta values (b), standard error (SE), t statistic for fixation duration measures, and z statistic for skipping probability, and the p value associated with the t or z statistic.
Bonferroni correction was applied to reduce family-wise error rate resulting from running multiple contrasts on the eye movement measures for the analyses conducted at both the target word, and the post-target word regions [102]. For the target word region, the Bonferroni-corrected α = .05 ÷ (5 eye movement measures × 3 contrast matrices) ≤.003. For the post-target word (spillover), the Bonferroni-corrected α ≤.05 ÷ (1 eye movement measures × 3 contrast matrices) = .016.
I. Parafoveal orthographic processing
Target word region
Identity preview vs. Unrelated preview (matrix 1). The small difference in skipping probability between these two conditions was not statistically significant. For the fixation durations measures, Identity previews yielded significantly shorter fixation in all reported measures relative to Unrelated previews (Tables 2 & 3). The results thus replicate the classic preview benefit findings for orthographically identical previews, and also indicate that our experimental manipulation is functioning as expected.
Table 3. Linear mixed models outputs for the contrast matrix of measures with Identity preview as baseline.
| Eye Movement Measure | Contrast Preview vs. Identity Preview | b | SE | t/z | p |
|---|---|---|---|---|---|
| Skipping | (Intercept) | -4.911 | 0.565 | -8.687 | < .001 |
| Pattern | -1.103 | 0.821 | -1.344 | .179 | |
| Root | -0.146 | 0.613 | -0.238 | .812 | |
| Synonym | -0.389 | 0.652 | -0.597 | .550 | |
| TL New Root | -1.130 | 0.821 | -1.377 | .169 | |
| TL Pseudo Root | -0.383 | 0.652 | -0.587 | .557 | |
| Unrelated | -0.182 | 0.613 | -0.297 | .767 | |
| First Fixation Duration | (Intercept) | 280.382 | 6.285 | 44.614 | < .001 |
| Pattern | 38.914 | 4.531 | 8.589 | < .001 | |
| Root | -0.263 | 4.858 | -0.054 | .957 | |
| Synonym | 5.619 | 4.488 | 1.252 | .211 | |
| TL New Root | 84.914 | 5.451 | 15.579 | < .001 | |
| TL Pseudo Root | 45.800 | 4.704 | 9.737 | < .001 | |
| Unrelated | 36.599 | 5.020 | 7.291 | < .001 | |
| Single Fixation Duration | (Intercept) | 284.590 | 7.215 | 39.445 | < .001 |
| Pattern | 55.781 | 6.205 | 8.990 | < .001 | |
| Root | 3.161 | 5.810 | 0.544 | .587 | |
| Synonym | 12.326 | 5.831 | 2.114 | .035 | |
| TL New Root | 125.817 | 7.030 | 17.898 | < .001 | |
| TL Pseudo Root | 60.609 | 5.997 | 10.106 | < .001 | |
| Unrelated | 50.049 | 6.314 | 7.926 | < .001 | |
| Gaze Duration | (Intercept) | 386.081 | 7.032 | 54.903 | < .001 |
| Pattern | 37.975 | 8.779 | 4.326 | < .001 | |
| Root | 11.059 | 7.788 | 1.420 | .156 | |
| Synonym | 11.005 | 11.413 | 0.964 | .335 | |
| TL New Root | 97.918 | 13.151 | 7.446 | < .001 | |
| TL Pseudo Root | 33.881 | 10.416 | 3.253 | .001 | |
| Unrelated | 43.001 | 6.063 | 7.093 | < .001 | |
| Total Fixation Time | (Intercept) | 498.449 | 27.411 | 18.184 | < .001 |
| Pattern | 62.353 | 18.919 | 3.296 | .001 | |
| Root | 4.714 | 18.982 | 0.248 | .804 | |
| Synonym | 9.179 | 18.911 | 0.485 | .627 | |
| TL New Root | 101.542 | 18.796 | 5.402 | < .001 | |
| TL Pseudo Root | 56.149 | 18.880 | 2.974 | .003 | |
| Unrelated | 73.924 | 18.857 | 3.920 | < .001 | |
| Spillover | (Intercept) | 267.020 | 6.352 | 42.034 | < .001 |
| Pattern | 7.991 | 5.481 | 1.458 | .145 | |
| Root | 8.135 | 5.426 | 1.499 | .134 | |
| Synonym | 6.380 | 5.584 | 1.143 | .253 | |
| TL New Root | 4.907 | 5.003 | 0.981 | .327 | |
| TL Pseudo Root | 9.115 | 5.345 | 1.705 | .088 | |
| Unrelated | 8.723 | 5.294 | 1.648 | .099 |
TL = transposed letters of the root of the target word. p values are marked in boldface as significant only when Bonferroni-corrected α ≤.003 for the target word, and α ≤.016 for the Spillover measure. Spillover corresponds to the first fixation duration on the post-target word. Final reported models in S1 File.
Post-target region
Identity preview vs. Unrelated preview (matrix 1). The spillover measure (first fixation on post-target word) did not show a significant difference between these two previews conditions (see Tables 2 & 3). This indicates that the processing cost of the Unrelated previews was incurred only locally on the target word itself.
II. Parafoveal ortho-morphological processing
Target word region
Identity preview vs. TL New Root preview (matrix 1). The difference between the two conditions in skipping probability was not statistically significant (Tables 2 & 3). On the other hand, the TL New Root preview condition yielded significantly longer fixation durations in all reported measures relative to the Identity preview. Recall that this TL condition shared all the orthographic (letter identity) information with the target, and differed only in the letter order of root morpheme.
Unrelated preview vs. TL New Root preview (matrix 2). The difference in skipping probability between the two conditions was not statistically significant (Tables 2 & 4). On the other hand, the TL New Root preview yielded significantly longer fixation durations in all reported measures relative to the Unrelated preview, indicating a substantial processing cost for the former.
Table 4. Linear mixed models outputs for the contrast matrix of measures with unrelated preview as baseline.
| Eye Movement Measure | Contrast Preview vs. Unrelated Preview | b | SE | t/z | p |
|---|---|---|---|---|---|
| Skipping | (Intercept) | -5.366 | 0.430 | -12.472 | < .001 |
| Pattern | -0.491 | 0.576 | -0.852 | .394 | |
| Root | 0.373 | 0.417 | 0.895 | .371 | |
| Synonym | 0.146 | 0.450 | 0.323 | .746 | |
| TL New Root | -0.518 | 0.576 | -0.899 | .368 | |
| TL Pseudo Root | 0.153 | 0.451 | 0.339 | .735 | |
| First Fixation Duration | (Intercept) | 310.613 | 8.213 | 37.819 | < .001 |
| Pattern | 3.503 | 3.981 | 0.880 | .379 | |
| Root | -34.420 | 4.069 | -8.458 | < .001 | |
| Synonym | -28.656 | 4.196 | -6.829 | < .001 | |
| TL New Root | 48.135 | 4.352 | 11.061 | < .001 | |
| TL Pseudo Root | 10.114 | 5.016 | 2.016 | .044 | |
| Single Fixation Duration | (Intercept) | 328.622 | 5.721 | 57.445 | < .001 |
| Pattern | 4.399 | 5.313 | 0.828 | .408 | |
| Root | -45.947 | 4.611 | -9.965 | < .001 | |
| Synonym | -37.005 | 4.451 | -8.314 | < .001 | |
| TL New Root | 70.992 | 5.620 | 12.632 | < .001 | |
| TL Pseudo Root | 8.950 | 5.121 | 1.748 | .081 | |
| Gaze Duration | (Intercept) | 417.434 | 5.415 | 77.090 | < .001 |
| Pattern | -2.028 | 4.996 | -0.406 | .685 | |
| Root | -30.792 | 4.287 | -7.183 | < .001 | |
| Synonym | -26.521 | 4.567 | -5.807 | < .001 | |
| TL New Root | 62.074 | 5.432 | 11.428 | < .001 | |
| TL Pseudo Root | -2.460 | 4.478 | -0.549 | .583 | |
| Total Fixation Time | (Intercept) | 561.041 | 7.281 | 77.051 | < .001 |
| Pattern | 5.078 | 5.966 | 0.851 | .395 | |
| Root | -40.140 | 5.642 | -7.114 | < .001 | |
| Synonym | -37.250 | 4.716 | -7.900 | < .001 | |
| TL New Root | 71.781 | 5.710 | 12.572 | < .001 | |
| TL Pseudo Root | -0.538 | 5.956 | -0.090 | .928 | |
| Spillover | (Intercept) | 273.485 | 7.167 | 38.160 | < .001 |
| Pattern | 0.457 | 4.231 | 0.108 | .914 | |
| Root | 0.557 | 4.261 | 0.131 | .896 | |
| Synonym | -1.169 | 4.235 | -0.276 | .783 | |
| TL New Root | -2.581 | 4.070 | -0.634 | .526 | |
| TL Pseudo Root | 1.552 | 4.220 | 0.368 | .713 |
TL = transposed letters of the root of the target word. p values are marked in boldface as significant only when Bonferroni-corrected α ≤.003 for the target word, and α ≤.016 for the Spillover measure. Spillover corresponds to the first fixation duration on the post-target word. Final reported models in S1 File.
Identity preview vs. TL Pseudo Root preview (matrix 1). The difference in skipping probability between the two conditions was not statistically significant (Tables 2 & 3). The TL Pseudo Root preview yielded significantly longer fixation durations in all reported measures relative to the Identity preview. Recall that this TL condition also shared all the orthographic (letter identity) information with the target, and differed only in the letter order of root morpheme.
Unrelated preview vs. TL Pseudo Root preview (matrix 2). The differences between the two conditions in skipping probability and all the reported fixation duration measures were not statistically significant (Tables 2 & 4).
TL Pseudo Root vs. TL New Root (matrix 3). The difference in skipping probability between the two conditions was not statistically significant (Tables 2 & 5). TL New Root previews, on the other hand, yielded significant increases in all reported fixation durations relative to TL Pseudo Root previews. Overall, thus, these findings suggest a substantial disruption to processing when root letters were transposed and created a new root—a lexical competitor to the root embedded in the actual target, relative to when the root letter transposition does not create such a competitor.
Table 5. Linear mixed models outputs for the additional pre-specified contrasts.
| Eye Movement Measure | Contrast Preview Conditions | b | SE | t/z | p |
|---|---|---|---|---|---|
| Skipping | (Intercept) | -5.355 | 0.429 | -12.491 | < .001 |
| Root vs. Pattern | -0.497 | 0.471 | -1.055 | .291 | |
| Root vs. Synonym | 0.135 | 0.412 | 0.328 | .743 | |
| TL Pseudo Root vs. TL New Root | -0.282 | 0.359 | -0.785 | .432 | |
| First Fixation Duration | (Intercept) | 311.475 | 4.854 | 64.166 | < .001 |
| Root vs. Pattern | 27.425 | 5.082 | 5.397 | < .001 | |
| Root vs. Synonym | -10.601 | 4.115 | -2.576 | .010 | |
| TL Pseudo Root vs. TL New Root | 15.224 | 4.179 | 3.642 | < .001 | |
| Single Fixation Duration | (Intercept) | 327.851 | 5.937 | 55.225 | < .001 |
| Root vs. Pattern | 36.168 | 4.714 | 7.672 | < .001 | |
| Root vs. Synonym | -13.107 | 4.429 | -2.959 | .003 | |
| TL Pseudo Root vs. TL New Root | 24.564 | 3.526 | 6.966 | < .001 | |
| Gaze Duration | (Intercept) | 417.230 | 6.526 | 63.933 | < .001 |
| Root vs. Pattern | 19.264 | 4.525 | 4.257 | < .001 | |
| Root vs. Synonym | -7.219 | 4.529 | -1.594 | .111 | |
| TL Pseudo Root vs. TL New Root | 28.374 | 3.934 | 7.212 | < .001 | |
| Total Fixation Time | (Intercept) | 560.591 | 6.696 | 83.718 | < .001 |
| Root vs. Pattern | 32.486 | 4.895 | 6.637 | < .001 | |
| Root vs. Synonym | -14.449 | 5.944 | -2.431 | .015 | |
| TL Pseudo Root vs. TL New Root | 31.813 | 5.384 | 5.909 | < .001 | |
| Spillover | (Intercept) | 273.774 | 6.842 | 40.016 | < .001 |
| Root vs. Pattern | 0.443 | 4.491 | 0.099 | .922 | |
| Root vs. Synonym | -1.939 | 4.748 | -0.408 | .683 | |
| TL Pseudo Root vs. TL New Root | -2.380 | 4.195 | -0.567 | .570 |
TL = transposed letters of the root of the target word. p values are marked in boldface as significant only when Bonferroni-corrected α ≤.003 for the target word, and α ≤.016 for the Spillover measure. Spillover corresponds to the first fixation duration on the post-target word. Final reported models in S1 File.
Post-target region
None of the contrasts listed above yielded statistically significant difference in the spillover measure (see Tables 2–5). This indicates that the processing costs associated with TL previews were incurred only locally on the target word itself.
III. Parafoveal morphological processing
Target word region
Identity preview vs. Pattern preview (matrix 1). The difference between the two conditions in skipping probability was not statistically significant. Pattern preview however resulted in significant increase in fixation durations, in all reported measures, relative to Identity preview (Tables 2 & 3).
Unrelated preview vs. Pattern preview (matrix 2). There was no statistically significant difference between the two preview conditions in the skipping probability or in any of the fixation duration measures (Tables 2 & 4).
Identity preview vs. Root preview (matrix 1). There was no statistically significant difference between the two preview conditions in the skipping probability or in any of the fixation duration measures (Tables 2 & 3).
Unrelated preview vs. Root preview (matrix 2). The difference between the two conditions in skipping probability was not statistically significant. Fixation durations on the target following Root previews, however, were significantly shorter, in all reported measures, relative to Unrelated previews (Tables 2 & 4).
Root preview vs. Pattern preview (matrix 3). The difference between the two conditions in skipping probability was not statistically significant. Fixation durations on the target following Root previews, however, were significantly shorter, in all reported measures, relative to Pattern previews (Tables 2 & 5). Thus, overall, the results point at preview benefit from Root, but not from Pattern previews.
Post-target region
None of the contrasts listed above yielded statistically significant difference in the spillover measure (see Tables 2–5).
IV. Parafoveal semantic processing
Target word region
Identity preview vs. Synonym preview (matrix 1). There was no statistically significant difference between the two preview conditions in the skipping probability or in any of the fixation duration measures (Tables 2 & 3).
Unrelated preview vs. Synonym preview (matrix 2). The difference between the two conditions in skipping probability was not statistically significant. Fixation durations on the target following Synonym previews, however, were significantly shorter, in all reported measures, relative to Unrelated previews (Tables 2 & 4). This indicates the presence of a preview benefit from Synonym previews.
Root preview vs. Synonym preview (matrix 3). The difference between the two conditions in skipping probability was not statistically significant. Fixations durations were on the whole numerically shorter following Root previews compared to Synonym previews. The difference reached statistical significance (at the Bonferroni-corrected α level) only at single fixation duration (Tables 2 & 5).
Post-target region
None of the contrasts listed above yielded statistically significant difference in the spillover measure (Tables 2–5).
Discussion
In the current study we aimed to investigate orthographic, morphological, and semantic parafoveal processing during Arabic sentence reading. The results obtained replicated some classic preview benefit effects, and expanded existing knowledge about parafoveal processing during reading Arabic.
Parafoveal orthographic processing
The reported results replicated the preview benefit findings (see above) for orthographically identical previews (e.g., [55–59]). Identity previews indeed yielded significantly shorter fixation durations relative to Unrelated previews. In addition to indicating that the experimental manipulation was working as expected, these results clearly indicate that Arabic readers performed parafoveal orthographic processing such that the information extracted from identical previews facilitated the processing of the target word, whereas extracting information from previews that do not share orthographic information with the target resulted in processing costs. The results show that the significant processing costs for Unrelated previews were observable as early as first fixation duration, and persisted into later processing measures.
Parafoveal ortho-morphological processing
The reported results also replicate the findings from various experimental paradigms where transposing the letters of a Semitic root morpheme resulted in significant disruption to target word processing (e.g., [33,39–41,68,103]). Indeed, both TL preview conditions, despite preserving all root (and word) letter identities, yielded significantly inflated fixation durations relative to the Identity previews.
Importantly, the results suggested that the disruption to target word processing was greater for TL previews that created a new real roots. The TL New Root previews yielded significantly longer fixation durations relative to TL Pseudo Root previews. Furthermore, and whereas the processing costs associated with the TL Pseudo Root condition did not differ significantly from those of the Unrelated preview condition, the costs associated with the TL New Root preview significantly exceeded the costs of the Unrelated preview condition. The costs associated with both TL preview conditions appeared as early as first fixation duration and persisted into later processing measures. The same applies to the significant costs obtained from the TL New Root condition compared to the TL Pseudo Root condition.
These results lend strong support to the idea that lexical organization in Semitic languages is root- and not orthography-based, with root-sharing words clustering close together in the lexicon (e.g., [15,24,25,28,29,39]). Thus, the activation of the new root, created by transposing the letters of the original root in the target, led to a costly competition that inflated processing time on the target word and delayed its identification. Additionally, and given that in both TL conditions the preview strings shared all letter identities with the target, the results reported here further support the idea that the degree to which readers benefit from orthographic similarity is contingent upon the morphological characteristics of the language being read (see e.g., [15,29]), and the degree to which these properties of the linguistic environment permit flexibility, or demand rigidity, in letter position coding.
Parafoveal morphological processing
The reported results also replicated the preview benefit obtained from previews that preserve the root morpheme information (e.g., [28,65]). The results also replicate findings from investigations using single word tasks (e.g., [2,24,25,27,43]) where primes that shared the root of the target word resulted in processing benefit for the target word relative to other conditions where the same number of letters was shared between the target word and the prime. In the current experiment root-preserving previews yielded comparable fixation durations to Identity previews, and a significant processing facilitation compared to Unrelated previews. These results thus further support the suggestion that lexical organization in Semitic languages is root-based. Arabic readers do indeed extract root morpheme information parafoveally, and this early access to the root information facilitates the identification of the up-coming target word. The results show that processing benefits from the Root preview condition relative to Unrelated previews appeared as early as first fixation duration and persisted into later processing measures.
By contrast, Pattern-preserving previews did not yield processing facilitation. Rather, these previews yielded significant processing cost relative to the Identity preview condition, and a comparable cost to processing as that observed for the Unrelated previews. In fact, additional contrasts revealed that Root previews resulted in significant processing facilitation relative to Pattern previews. In selecting the target words and previews, we included both nominal and verbal patterns given that both were found to yield processing benefit as primes in single word tasks (e.g., [1,2,36,43,44]). Although somewhat surprising, these results are broadly in line with the literature. It will be recalled that the benefits from pattern morpheme primes in Arabic were of more precarious nature, and were strongly affected by other variables such as stimulus onset asynchrony (SOA), as well as root productivity.
The greater benefit obtained from Root relative to Pattern previews can perhaps be explained by one (or a combination) of these two mechanisms. The first mechanism can be summarized as follows: Any small benefit obtained from pattern-preserving parafoveal preview may be obliterated given the sizable disruption to processing that results from embedding a completely new root in that preview [70]. The second possible mechanism was put forward by Boudelaa and Marslen-Wilson [44] (see also [24]). In this account, the way that root and pattern information is approached or relied upon for word identification by Arabic readers is essentially different given the ubiquitous presence of consonantal roots in Arabic print, as compared to the absence of vowel representations which make up a significant part of Arabic word patterns. Clearly, further investigation into the role pattern morphemes play in word identification is necessary in order to clarify the extent to which readers make use of them, and, importantly, to clearly delineate the time course of these processes. The ultimate goal is to use such findings to inform a more comprehensive theory of morphological processing in Semitic languages.
Parafoveal semantic processing
The results reported expanded our knowledge of parafoveal processing of Arabic and suggests that Arabic readers extract semantic information from the parafovea. Synonym previews yielded comparable fixation durations to Identity previews, and a significant processing facilitation compared to Unrelated previews. As far as we are aware, this is the first time semantic preview benefit is reported in reading Arabic. This benefit appeared as early as first fixation duration and persisted into later processing measures (similar pattern of results in English reported by Schotter [80]). These results also complement findings from previous research that documented that Arabic readers show semantic priming benefit (e.g. [46,47]). We will forward two accounts that potentially accommodate these results.
Starting with a parafoveal integration account (see, e.g., [83]), it is possible that these results point at Arabic readers accessing the semantic representation of the preview, and the integration between the semantically-identical preview and target facilitated the identification of this target once it was fixated. This facilitation from the Synonym previews is not likely to be attributable to the shared orthographic representation with the target (the number of shared letters), or the shared pattern morpheme letters with the target word either. This is because the Synonym previews do not share more letters with the target relative to other preview conditions (see Table 1), and also, as the results show, sharing pattern letters with the target did not result in any preview benefit. Similarly, given that the Synonym previews did not share root information with the target, root-based facilitation of target word identification can be ruled out. It is also worth noting that the obtained benefit from the synonymous previews was not facilitated by sentential contextual constraints, that is, predictability, given that the target word and its previews were all unpredictable from previous sentence context, as we established during the sentences’ norming.
A second plausible account is potential faciliatory effect of contextual fit of the Synonym preview, that is, its plausibility (we wish to thank an anonymous reviewer for their helpful comments and guidance with regards to this issue.) Specifically, the facilitation reported was because the Synonym preview is plausible in the sentence context, regardless of its semantic closeness to the target word. Previous research showed a considerable effect of preview plausibility such that plausible previews resulted in processing benefit, regardless of their semantic links with the target (e.g., [83,84]), and independently of the orthographic relatedness between the preview and target [104], especially when the previews were not predictable from the immediate preceding context ([105] also [106] for additional empirical evidence, and [107] for a review). In essence, when the upcoming word (in the parafovea) is plausible, the information extracted from it facilitates the incremental sentence processing, regardless of the relatedness between the target and the preview, hence plausible previews generate more skipping of the target (e.g., [106]), and shorter fixation duration on the target in early processing measures when it is fixated and not skipped (e.g., [83]). As we have not controlled for target and preview plausibility, the possible effects of plausibility on the reported results were not part of our a priori research question. Attempting to explore, post hoc, the possible effects of plausibility, we have collected plausibility ratings from 19 additional participants, from the same population, for all the preview conditions, except the TL Pseudo Word condition. The results of this activity are summarized in S2 File. The preview conditions varied significantly in their plausibility ratings (F(5,1128) = 287.7, p < .001). Post hoc Tukey test for selected pairwise contrasts revealed that with the exception of the Synonym preview condition, all other preview conditions were rated significantly less plausible than the Identity preview (all ps < .001, see Table A in S2 File for descriptive statistics of the plausibility ratings). Similarly, all preview conditions, including the Synonym condition, were rated more plausible than the Unrelated condition (all ps < .001). As such, it is not possible to rule out that the facilitation observed for the Synonym preview condition may reflect, at least to some degree, facilitation from this preview being a plausible continuation of the sentence.
Clearly, further investigation is needed to clarify if the effects reported here are due to integration of the preview and target semantic identity, or if the high plausibility of Synonym previews was sufficient to result in the reported preview benefit effects. As a speculation, however, we will suggest that the plausibility of the preview on its own was not sufficient to generate the observed facilitation for the Synonym previews. To begin with, similar to the Identity previews, the Synonym previews were rated as significantly more plausible than all other remaining conditions (see S2 File, all ps < .001). Importantly, this included the Root previews. Yet, as reported above, Synonym and Root previews yielded highly comparable facilitation patterns, the discrepancy in preview plausibility ratings between these two conditions notwithstanding. Thus, a plausible speculation is that the significant preview benefit obtained from the Synonym previews perhaps reflects both benefits, combined: Integrating the preview information with the semantically-identical target, plus the facilitation afforded by this preview being a good fit with the context. Other contrasts also indicated that preview plausibility cannot solely explain the patterns of results reported. Rather, morphological parafoveal processing (e.g., preserving or violating the root information) was the main driver of the observed effects. For instance, the Root and TL New Root conditions were rated as similarly plausible (see S2 File, p = .55), yet, there was a sizable cost associated with the TL New Root previews, whereas the Root previews yielded the expected preview benefit. Similarly, the Pattern previews’ significantly higher plausibility rating relative to the Unrelated and the Root previews (ps < .001), did not translate into a preview benefit. This perhaps can be seen as further support for the integration accounts of parafoveal processing, at least as far as processing of Semitic root morphology is concerned.
Put together, the patterns of findings reported thus far suggest that in attempting to translate text into meaning, readers of Arabic prioritize these two levels of representation, root morphology and semantic/contextual information. We return to and expand upon this suggestion below.
Theoretical implications and future directions
To begin with, our results replicate previous findings where orthographically identical previews yield processing benefit relative to orthographically unrelated previews. These findings further support the suggestion that skilled readers extract information from upcoming, parafoveal, words [48,49,51], and readers of Arabic are no exception. In addition to extracting orthographic information from the parafovea, the results indicated that they also extract root morpheme information. The importance of Semitic roots, as the base of lexical organization and their role in word identification has long been accepted and advocated in the literature (see [93] for a review). Extracting such morphological information prior to fixating the word can be seen as additional support to models of word identification in Semitic languages that postulate early and compulsory morphemic decomposition and root identification processes (e.g., [24,32,44] also [92] for further discussion). When the root information available in the parafovea was valid, processing benefit was obtained. By contrast, inaccurate (i.e., transposed) previews of root letters resulted in delaying target word identification, especially when the root letter transposition instantiated a real root—a lexical competitor.
The results reported here further challenge models of word identification that postulate flexible letter position encoding as a universal property of the cognitive system (see above). Extensive discussion of this topic and implications for modelling word identification is available elsewhere (e.g., [15,38–41]) and need not be repeated here. An issue of equal importance, however, is that current models of reading do not provide full accounts of the role of morphology in word identification as they, predominantly, focus on single-morpheme words or assume that polymorphemic words are represented as static, whole units, in the lexicon (e.g., [108,109]). Strong and consistent empirical evidence support the idea that compulsory morphological decomposition and root identification processes take place early on during word identification (see above), and this is followed by root and pattern re-combination that allows for complete word identification [92,110]. As such, models that adopt a compositional outlook (e.g., [111]) are potentially more capable of accommodating the empirical findings. In such models, mapping of patterns of orthographic features onto their corresponding phonological and semantic representations allows the activation of the compositional meaning of the word being read. This mechanism can be the core on which models that feature fully-specified morphological processing are constructed [93]. Full construction and specification of such models will need to be informed by clearer findings concerning pattern morpheme processing, as mentioned above, and a careful accommodation of the existing robust empirical findings concerning root morpheme processing.
The findings reported here indicate that in addition to benefiting from early (parafoveal) access to root morpheme information, readers of Arabic benefit from early access to semantic information. If we adopt the parafoveal integration account of these results, this deep level of linguistic processing may have been facilitated by the relative orthographic transparency of Arabic consonants (e.g., [79,80]). It is plausible that this transparency permits earlier start of semantic processing of Arabic words. This hypothesis, however, requires further investigation in carefully designed experiments. Essentially, the transparency of consonant-phoneme correspondences in Arabic is not perfect given the absence of vowel information from Arabic print—the letter ب, for instance, will always produce the /b/ sound, however, it may not be immediately obvious to the reader if the correct pronunciation of this letter in a string should be /ba/ or /bu/. In fact, lexical ambiguity is very commonplace in Arabic text such that readers as a matter of course rely on sentence context for lexical disambiguation and word identification during text reading [45,97,112]. In essence, this forces the readers of Arabic to prioritize access to morpho-semantic information in word identification, and to be guided throughout reading by sentence context (i.e., meaning comprehension) and word-context integrative processes [93,113]. It is thus unsurprising that readers of Arabic begin extracting and processing morphological and semantic information early on in the time course of translating text to meaning. Arguably, similar conclusions can be reached for previews that are plausible and fit with sentence context (at least if they do not include a violation of the Semitic root information, as the discussion above indicated). Consequently, parafoveal previews that facilitate this processing by providing early and accurate root morpheme information, highly plausible, or identical (synonymous) semantic information with the target word, result in processing benefit in the identification of this target. Future investigations will need to further explore the variables that influence semantic parafoveal processing in Semitic languages, not least of all the role of pre-target sentence context and target plausibility.
Early access to semantic information in the course of word identification, is not a novel idea. Multiple existing models of reading include semantic feedback routes to phonology and orthography (e.g., when this route provides feedback for disambiguating the pronunciations of homophonic competitors as in the Triangle Model [111,114]. In a similar manner, other models assume the presence of a semantic-to-orthography feedback route, akin to a spelling-checking mechanism [115]. The inclusion of semantic feedback allows these models to accommodate findings which suggest that accessing the semantic representation of a letter string facilitates its identification, particularly for low-frequency letter strings [116], and for morphologically complex and/or compound words [117]. However, most of the currently available models (e.g., [114]) acknowledge that the contribution of word-based semantic processing is underspecified, much less the role of context-based semantic information utilized by readers of Arabic (and other similar Semitic languages like Hebrew, see [93]). The empirical findings reported here further exemplify the need to accommodate the involvement of semantics in early word identification, and emphasize the necessity of developing comprehensive models that fully specify the roles of semantic processing at the level of single-word, and at the level of word-context semantic integration.
Finally, and with regards to eye movements control during reading, one point concerning the pattern of word skipping data that was reported here requires a brief discussion. The word skipping probabilities reported were generally in line with the average skipping rates previously reported in Arabic reading studies [93]. Hermena and Reichle [93] suggested that the informational density of Arabic words (with too many pieces of morpho-syntactic information represented in relatively short single words, see [118]; also [92]), as well as the visual complexity of the Arabic script, combined, force Arabic readers to skip fewer words (on average 8%) compared to readers of European languages (around 30%, e.g., [49,119]). With such infrequent skipping events (almost a floor effect), it is not surprising that no statistically reliable differences between the different preview conditions were found in the measure of word skipping. Thus far, word skipping rate in Arabic has reliably indexed the effect of the physical width (spatial extent) of printed words, with wider words typically skipped significantly less than narrower ones, regardless of the number of letters encompassed in that space [94,120].
A limitation of the ambitious design adopted in this study is the relatively small number of items per condition that resulted from the number of conditions we wanted to include, and the strict matching criteria adopted. Importantly, the reported results replicated stable and well-documented findings in the literature (e.g., benefits from identity and root-preserving previews). We are currently working to replicate, further clarify, and expand upon the more novel findings reported here (e.g., semantic preview benefit).
To summarize, the reported study investigated orthographic, morphological and semantic parafoveal processing in reading Arabic. The obtained results replicated the well-established findings that readers benefit from orthographically-identical previews, and that Arabic readers benefit from preserving the Semitic root of the Arabic words in the parafoveal preview. The results did not show benefit from preserving the pattern morpheme in the preview. Furthermore, the results replicated the findings that transposing the letter order of the Semitic root in the parafoveal preview results in sizable disruption to processing, especially when the letter transposition created a new real root. Thus, the reported findings further highlight how Semitic morphology, notably the consonantal root system, plays a key role in word identification, lexical organization, and modulates the degree to which the orthographic code is relied upon during early word identification processes. Finally, the results suggested that Arabic readers extract semantic information from parafoveal previews such that preview benefit was obtained from previews that were synonymous with the target word. It is likely that the high plausibility of the synonymous previous further amplified this semantic preview benefit. Further investigation into these processes is necessary. We concluded that these results can contribute towards building more comprehensive models of word identification and reading where full accounts of morphological processing and early semantic access should be further delineated and incorporated.
Supporting information
The final reported LMM models for the contrast matrices in Tables 3–5.
(DOCX)
The procedure and outcome of obtaining plausibility ratings for previews.
(DOCX)
Acknowledgments
We would like to thank Ascensión Pagán for the valuable suggestions she provided for structuring some sections of the introduction. We would also like to thank Ascensión Pagán and Chelsea Parlett-Pelleriti for the valuable guidance they provided with some aspects of the data analysis.
Data Availability
Data, stimuli and code are available at OSF: 10.17605/OSF.IO/GRP8K.
Funding Statement
EWH was awarded grant RIF R17035 from Zayed University, UAE, https://www.zu.ac.ae/main/en/index.aspx The university had no role in the study design, data collection, decision to publish or preparing the manuscript.
References
- 1.Boudelaa S., & Marslen-Wilson W. D. (2001). Morphological units in the Arabic mental lexicon. Cognition, 81, 65–92. doi: 10.1016/s0010-0277(01)00119-6 [DOI] [PubMed] [Google Scholar]
- 2.Boudelaa S., & Marslen-Wilson W. D. (2005). Discontinuous morphology in time: Incremental masked priming in Arabic. Language and Cognitive Processes, 20, 207–260. doi: 10.1080/01690960444000106 [DOI] [Google Scholar]
- 3.Schulz E. (2004). A student grammar of Modern Standard Arabic. Cambridge, UK: Cambridge University Press. [Google Scholar]
- 4.Brysbaert M. (2001). Prelexical phonological coding of visual words in Dutch: Automatic after all. Memory and Cognition, 29, 765–73. doi: 10.3758/bf03200479 [DOI] [PubMed] [Google Scholar]
- 5.Duñabeitia J. A., Perea M. & Carreiras M. (2007). Do transposed-letter similarity effects occur at a morpheme level? Evidence for morpho-orthographic decomposition. Cognition, 105, 691–703. doi: 10.1016/j.cognition.2006.12.001 [DOI] [PubMed] [Google Scholar]
- 6.Johnson R. L., Perea M. & Rayner K. (2007). Transposed-letter effects in reading. Evidence from eye movements and parafoveal preview. Journal of Experimental Psychology: Human Perception and Performance, 33, 209–29. doi: 10.1037/0096-1523.33.1.209 [DOI] [PubMed] [Google Scholar]
- 7.Kinoshita S. & Norris D. (2009). Transposed-letter priming of prelexical orthographic representations. Journal of Experimental Psychology: Learning, Memory, and Cognition, 35, 1–18. doi: 10.1037/a0014277 [DOI] [PubMed] [Google Scholar]
- 8.Perea M. & Carreiras M. (2006a). Do transposed-letter similarity effects occur at a prelexical phonological level? Quarterly Journal of Experimental Psychology, 59, 1600–13. doi: 10.1080/17470210500298880 [DOI] [PubMed] [Google Scholar]
- 9.Perea M. & Carreiras M. (2006b). Do transposed-letter similarity effects occur at a syllable level? Experimental Psychology, 53, 308–15. doi: 10.1027/1618-3169.53.4.308 [DOI] [PubMed] [Google Scholar]
- 10.Perea M. & Carreiras M. (2008). Do orthotactics and phonology constrain the transposed-letter effect? Language and Cognitive Processes, 23, 69–92. doi: 10.1080/01690960701578146 [DOI] [Google Scholar]
- 11.Perea M. & Lupker S. J. (2003). Transposed-letter confusability effects in masked form priming. In Kinoshita S. & Lupker S. J. (Eds.), Masked priming: The state of the art (pp. 97–120). NY: Psychology Press. [Google Scholar]
- 12.Perea M. & Lupker S. J. (2004). Can CANISO activate CASINO? Transposed-letter similarity effects with nonadjacent letter positions. Journal of Memory and Language, 51, 231–46. doi: 10.1016/j.jml.2004.05.005 [DOI] [Google Scholar]
- 13.Rayner K., White S. J., Johnson R. L. & Liversedge S. P. (2006). Raeding wrods with jubmled lettres: There is a cost. Psychological Science, 17, 192–193. doi: 10.1111/j.1467-9280.2006.01684.x [DOI] [PubMed] [Google Scholar]
- 14.Schoonbaert S. & Grainger J. (2004). Letter position coding in printed word perception: Effects of repeated and transposed letters. Language and Cognitive Processes, 19, 333–367. doi: 10.1080/01690960344000198 [DOI] [Google Scholar]
- 15.Frost R. (2012). Towards a universal model of reading. Behavioral and Brain Sciences, 35, 263–279. doi: 10.1017/S0140525X11001841 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Davis, C. J. (1999). The self-organising lexical acquisition and recognition (SOLAR) model of visual word recognition. Unpublished doctoral dissertation, University of New South Wales.
- 17.Davis C. J. (2010). The spatial coding model of visual word identification. Psychological Review, 117, 713–58. doi: 10.1037/a0019738 [DOI] [PubMed] [Google Scholar]
- 18.Davis C. J. & Bowers J. (2004). What do letter migration errors reveal about letter position coding in visual word recognition? Journal of Experimental Psychology: Human Perception and Performance, 30, 923–41. doi: 10.1037/0096-1523.30.5.923 [DOI] [PubMed] [Google Scholar]
- 19.Davis C. J. & Bowers J. (2006). Contrasting five different theories of letter position coding: Evidence from orthographic similarity effects. Journal of Experimental Psychology: Human Perception and Performance, 32, 535–57. doi: 10.1037/0096-1523.32.3.535 [DOI] [PubMed] [Google Scholar]
- 20.Grainger J. & Whitney C. (2004). Does the huamn mnid raed wrods as a wlohe? Trends in Cognitive Sciences, 8, 58–59. doi: 10.1016/j.tics.2003.11.006 [DOI] [PubMed] [Google Scholar]
- 21.Whitney C. (2001). How the brain encodes the order of letters in a printed word: The SERIOL model and selective literature review. Psychonomic Bulletin and Review, 8, 221–43. doi: 10.3758/bf03196158 [DOI] [PubMed] [Google Scholar]
- 22.Whitney C. (2008). Supporting the serial in the SERIOL model. Language and Cognitive Processes, 23, 824–65. doi: 10.1080/01690960701828202 [DOI] [Google Scholar]
- 23.Whitney C. & Cornelissen P. (2008). SERIOL reading. Language and Cognitive Processes, 23, 143–64. doi: 10.1080/01690960701579771 [DOI] [Google Scholar]
- 24.Boudelaa S. (2014). Is the Arabic mental lexicon morpheme-based or stem-based? Implications for spoken and written word recognition. In Saiegh-Haddad E. & Joshi R. M., (Eds.), Handbook of Arabic literacy (pp. 31–54). London: Springer. [Google Scholar]
- 25.Boudelaa S. & Marslen-Wilson W. D. (2015). Structure, form, and meaning in the mental lexicon: Evidence from Arabic. Language, Cognition and Neuroscience, 30, 955–992. doi: 10.1080/23273798.2015.1048258 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Boudelaa S., Pulvermüller F., Hauk O., Shtyrov Y., & Marslen-Wilson W. D. (2010). Arabic morphology in the neural language system: A mismatch negativity study. Journal of Cognitive Neuroscience, 22, 998–1010. doi: 10.1162/jocn.2009.21273 [DOI] [PubMed] [Google Scholar]
- 27.Deutsch A., Frost R., & Forster K. I. (1998). Verbs and nouns are organized and accessed differently in the mental lexicon: Evidence from Hebrew. Journal of Experimental Psychology: Learning, Memory, and Cognition, 24, 1238–1255. doi: 10.1037//0278-7393.24.5.1238 [DOI] [PubMed] [Google Scholar]
- 28.Deutsch A., Frost R., Pollatsek A. & Rayner K. (2000). Early morphological effects in word recognition in Hebrew: Evidence from parafoveal preview benefit. Language and Cognitive Processes, 15, 487–506, doi: 10.1080/01690960050119670 [DOI] [Google Scholar]
- 29.Frost R. (2009). Reading in Hebrew vs. reading in English: Is there a qualitative difference? In Pugh K. & McCradle P. (eds.), How children learn to read: Current issues and new directions in the integration of cognition, neurobiology and genetics of reading and dyslexia research and practice (pp. 235–54). New York: Psychology Press. [Google Scholar]
- 30.Frost R., Deutsch A. & Forster K. I. (2000). Decomposing morphologically complex words in a nonlinear morphology. Journal of Experimental Psychology Learning Memory, and Cognition, 26, 751–65. doi: 10.1037/0278-7393.26.3.751 [DOI] [PubMed] [Google Scholar]
- 31.Frost R., Deutsch A., Gilboa O., Tannenbaum M. & Marslen-Wilson W. (2000). Morphological priming: Dissociation of phonological, semantic, and morphological factors. Memory and Cognition, 28, 1277–88. doi: 10.3758/bf03211828 [DOI] [PubMed] [Google Scholar]
- 32.Frost R., Forster K. I., & Deutsch A. (1997). What can we learn from the morphology of Hebrew: A masked priming investigation of morphological representation. Journal of Experimental Psychology: Learning, Memory, and Cognition, 23, 829–56. doi: 10.1037//0278-7393.23.4.829 [DOI] [PubMed] [Google Scholar]
- 33.Frost R., Kugler T., Deutsch A., & Forster K. I. (2005). Orthographic structure versus morphological structure: Principles of lexical organization in a given language. Journal of Experimental Psychology: Learning, Memory, and Cognition, 31, 1293–1326. doi: 10.1037/0278-7393.31.6.1293 [DOI] [PubMed] [Google Scholar]
- 34.Gwilliams L. & Marantz A. (2015). Tracking non-linear prediction in a linear speech stream: Influence of morphological structure on spoken word recognition. Brain and Language, 147, 1–13. [DOI] [PubMed] [Google Scholar]
- 35.Prunet J. F., Béland R., & Idrissi A. (2000). The mental representation of Semitic words. Linguistic Inquiry, 31, 609–648. doi: 10.1162/002438900554497 [DOI] [Google Scholar]
- 36.Boudelaa S. & Marslen-Wilson W. D. (2000). Non-concatenative morphemes in language processing: Evidence from Modern Standard Arabic. In McQueen J., & Cutler A. (Eds.), Proceedings of the workshop on spoken word access processes (Vol. 1, pp. 23–26). Nijmegen: Nijmegen Publishing. [Google Scholar]
- 37.Frost R. (1998). Toward a strong phonological theory of visual word recognition: True issues and false trails. Psychological Bulletin, 123, 71–99. doi: 10.1037/0033-2909.123.1.71 [DOI] [PubMed] [Google Scholar]
- 38.Boudelaa S., Norris D., Mahfoudhi A., & Kinoshita S. (2019). Transposed letter priming effects and allographic variation in Arabic: Insights from lexical decision and the same–different task. Journal of Experimental Psychology: Human Perception and Performance, 45, 729–757. doi: 10.1037/xhp0000621 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Perea M., Abu Mallouh R., & Carreiras M. (2010). The search of an input coding scheme: Transposed-letter priming in Arabic. Psychonomic Bulletin & Review, 17, 375–380. doi: 10.3758/PBR.17.3.375 [DOI] [PubMed] [Google Scholar]
- 40.Velan H. & Frost R. (2007). Cambridge University vs. Hebrew University: The impact of letter transposition on reading English and Hebrew. Psychonomic Bulletin & Review, 14, 913–18. doi: 10.3758/BF03194121 [DOI] [PubMed] [Google Scholar]
- 41.Velan H. & Frost R. (2009). Letter-transposition effects are not universal: The impact of transposing letters in Hebrew. Journal of Memory and Language, 61, 285–320. doi: 10.1016/j.jml.2009.05.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Shimron J. (2006). Reading Hebrew: The language and the psychology of reading it. London: Erlbaum. [Google Scholar]
- 43.Boudelaa S. & Marslen-Wilson W. D. (2004). Allomorphic variation in Arabic: Implications for lexical processing and representation. Brain and Language, 90, 106–116. doi: 10.1016/S0093-934X(03)00424-3 [DOI] [PubMed] [Google Scholar]
- 44.Boudelaa S. & Marslen-Wilson W. D. (2011). Productivity and priming: Morphemic decomposition in Arabic. Language and Cognitive Processes, 26, 624–652. doi: 10.1080/01690965.2010.521022 [DOI] [Google Scholar]
- 45.Hermena E. W., Drieghe D., Hellmuth S., & Liversedge S. P. (2015). Processing of Arabic diacritical marks: Phonological-syntactic disambiguation of homographic verbs and visual crowding effects. Journal of Experimental Psychology: Human Perception and Performance, 41, 494–507. doi: 10.1037/xhp0000032 [DOI] [PubMed] [Google Scholar]
- 46.Mountaj N., El Yagoubi R., Himmi M., Ghazal F. L., Besson M., & Boudelaa S. (2015). Vowelling and semantic priming effects in Arabic. International Journal of Psychophysiology, 95, 46–55. doi: 10.1016/j.ijpsycho.2014.12.009 [DOI] [PubMed] [Google Scholar]
- 47.Ibrahim R., & Aharon-Peretz J. (2005). Is Literary Arabic a Second Language for Native Arab Speakers? Evidence from Semantic Priming Study. Journal of Psycholinguistic Research, 34, 51–70. doi: 10.1007/s10936-005-3631-8 [DOI] [PubMed] [Google Scholar]
- 48.Rayner K. (1998). Eye movements in reading and information processing: 20 years of research. Psychological Bulletin, 124, 372–422. doi: 10.1037/0033-2909.124.3.372 [DOI] [PubMed] [Google Scholar]
- 49.Rayner K. (2009). Eye movements and attention in reading, scene perception, and visual search. Quarterly Journal of Experimental Psychology, 62, 1457–1506. doi: 10.1080/17470210902816461 [DOI] [PubMed] [Google Scholar]
- 50.Schotter E.R. (2018). Reading ahead by hedging our bets on seeing the future: Eye tracking and electrophysiology evidence for parafoveal lexical processing and saccadic control by partial word recognition. Psychology of Learning and Motivation, 68, 263–298. doi: 10.1016/bs.plm.2018.08.011 [DOI] [Google Scholar]
- 51.Schotter E. R., Angele B., & Rayner K. (2012). Parafoveal processing in reading. Attention, Perception, & Psychophysics, 74, 5–35. doi: 10.3758/s13414-011-0219-2 [DOI] [PubMed] [Google Scholar]
- 52.Matin E. (1974). Saccadic suppression: A review and an analysis. Psychological Bulletin, 81, 899–917. doi: 10.1037/h0037368 [DOI] [PubMed] [Google Scholar]
- 53.Rayner K. (1975). The perceptual span and peripheral cues in reading. Cognitive Psychology, 7, 65–81. doi: 10.1016/0010-0285(75)90005-5 [DOI] [Google Scholar]
- 54.Rayner K., Well A. D., Pollatsek A., & Bertera J. H. (1982). The availability of useful information to the right of fixation in reading. Perception & Psychophysics, 31, 537–550. doi: 10.3758/bf03204186 [DOI] [PubMed] [Google Scholar]
- 55.Ashby J., Treiman R., Kessler B., & Rayner K. (2006). Vowel processing during silent reading: Evidence from eye movements. Journal of Experimental Psychology: Learning, Memory, and Cognition, 32, 416–424. doi: 10.1037/0278-7393.32.2.416 [DOI] [PubMed] [Google Scholar]
- 56.Henderson J. M., Dixon P., Petersen A., Twilley L. C., & Ferreira F. (1995). Evidence for the use of phonological representations during transsaccadic word recognition. Journal of Experimental Psychology: Human Perception and Performance, 21, 82–97. doi: 10.1037/0096-1523.21.1.82 [DOI] [Google Scholar]
- 57.Miellet S., & Sparrow L. (2004). Phonological codes are assembled before word fixation: Evidence from boundary paradigm in sentence reading. Brain and Language, 90, 299–310. doi: 10.1016/S0093-934X(03)00442-5 [DOI] [PubMed] [Google Scholar]
- 58.Pollatsek A., Lesch M., Morris R. K., Rayner K. (1992). Phonological codes are used in integrating information across saccades in word identification and reading. Journal of Experimental Psychology: Human Perception and Performance, 18, 148–162. doi: 10.1037//0096-1523.18.1.148 [DOI] [PubMed] [Google Scholar]
- 59.Rayner K., McConkie G. W., & Ehrlich S. (1978). Eye movements and integrating information across fixations. Journal of Experimental Psychology: Human Perception and Performance, 4, 529–544. doi: 10.1037//0096-1523.4.4.529 [DOI] [PubMed] [Google Scholar]
- 60.Reingold E. M., Reichle E. D., Glaholt M. G., & Sheridan H. (2012). Direct lexical control of eye movements in reading: Evidence from survival analysis of fixation durations. Cognitive Psychology, 65,177–206. doi: 10.1016/j.cogpsych.2012.03.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Bertram R. and Hyönä J. (2007). The interplay between parafoveal preview and morphological processing in reading. In van Gompel R.G., Fischer M.H., Murray W.S. and Hill R.L. (eds.) Eye movements: A window on mind and brain (pp. 391–407). Oxford: Elsevier Science. [Google Scholar]
- 62.Juhasz B. J., White S. J., Liversedge S. P., & Rayner K. (2008). Eye movements and the use of parafoveal word length information in reading. Journal of Experimental Psychology: Human Perception and Performance, 34, 1560–1579. doi: 10.1037/a0012319 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Kambe G. (2004). Parafoveal processing of prefixed words during eye fixations in reading: Evidence against morphological influences on parafoveal preprocessing. Perception and Psychophysics, 66, 279–292. doi: 10.3758/bf03194879 [DOI] [PubMed] [Google Scholar]
- 64.Hyönä J. (2011). Foveal and parafoveal processing during reading. In Liversedge S. P., Gilchrist I. D., & Everling S. (Eds.), The Oxford handbook of eye movements (pp. 819–838). Oxford, England: Oxford University Press. [Google Scholar]
- 65.Deutsch A., Frost R., Pelleg S., Pollatsek A., Rayner K. (2003). Early morphological effects in reading: Evidence from parafoveal preview benefit in Hebrew. Psychonomic Bulletin & Review, 10, 415–22. doi: 10.3758/bf03196500 [DOI] [PubMed] [Google Scholar]
- 66.Johnson R. L., Dunne M.D. (2012). Parafoveal processing of transposed-letter words and nonwords: Evidence against parafoveal lexical activation. Journal of Experimental Psychology: Human Perception and Performance, 38, 191–212. doi: 10.1037/a0025983 [DOI] [PubMed] [Google Scholar]
- 67.Pagán A., Blythe H. I., & Liversedge S. P. (2016). Parafoveal preprocessing of word initial trigrams during reading in adults and children. Journal of experimental psychology: Learning, memory, and cognition, 42, 411432. doi: 10.1037/xlm0000175 [DOI] [PubMed] [Google Scholar]
- 68.Velan H., Deutsch A., & Frost R. (2013). The flexibility of letter-position flexibility: Evidence from eye movements in reading Hebrew. Journal of Experimental Psychology: Human Perception and Performance, 39, 1143–1152. doi: 10.1037/a0031075 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Deutsch A., Frost R., Pollatsek A. & Rayner K. (2005). Morphological parafoveal preview benefit effects in reading: Evidence from Hebrew. Language and Cognitive Processes, 20, 341–371. doi: 10.1080/01690960444000115 [DOI] [Google Scholar]
- 70.Deutsch A., Velan H., & Michaly T. (2018). Decomposition in a non-concatenated morphological structure involves more than just the roots: Evidence from fast priming. Quarterly Journal of Experimental Psychology, 71, 85–92. doi: 10.1080/17470218.2016.1250788 [DOI] [PubMed] [Google Scholar]
- 71.Rayner K., Balota D. A., & Pollatsek A. (1986). Against parafoveal semantic preprocessing during eye fixations in reading. Canadian Journal of Psychology, 40, 473–483. doi: 10.1037/h0080111 [DOI] [PubMed] [Google Scholar]
- 72.Altarriba J., Kambe G., Pollatsek A., & Rayner K. (2001). Semantic codes are not used in integrating information across eye fixations in reading: evidence from fluent Spanish-English bilinguals. Perception & psychophysics, 63, 875–890. doi: 10.3758/bf03194444 [DOI] [PubMed] [Google Scholar]
- 73.Hyönä J., & Häikiö T. (2005). Is emotional content obtained from parafoveal words during reading? An eye movement analysis. Scandinavian Journal of Psychology, 46, 475–483. doi: 10.1111/j.1467-9450.2005.00479.x [DOI] [PubMed] [Google Scholar]
- 74.Rayner K., Schotter E. R., & Drieghe D. (2014). Lack of semantic parafoveal preview benefit in reading revisited. Psychonomic Bulletin & Review, 21, 1067–1072. doi: 10.3758/s13423-014-0582-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Hohenstein S., & Kliegl R. (2014). Semantic preview benefit during reading. Journal of Experimental Psychology. Learning, Memory, and Cognition, 40, 166–190. doi: 10.1037/a0033670 [DOI] [PubMed] [Google Scholar]
- 76.Hohenstein S., Laubrock J., & Kliegl R. (2010). Semantic preview benefit in eye movements during reading: A parafoveal fast-priming study. Journal of Experimental Psychology. Learning, Memory, and Cognition, 36, 1150–1170. doi: 10.1037/a0020233 [DOI] [PubMed] [Google Scholar]
- 77.Yan M., Richter E. M., Shu H., & Kliegl R. (2009). Readers of Chinese extract semantic information from parafoveal words. Psychonomic Bulletin & Review, 16, 561–566. doi: 10.3758/PBR.16.3.561 [DOI] [PubMed] [Google Scholar]
- 78.Yang J., Wang S., Tong X., & Rayner K. (2012). Semantic and plausibility effects on preview benefit during eye fixations in Chinese reading. Reading and Writing, 25, 1031–1052. doi: 10.1007/s11145-010-9281-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Laubrock J., & Hohenstein S. (2012). Orthographic consistency and parafoveal preview benefit: a resource-sharing account of language differences in processing of phonological and semantic codes. The Behavioral and Brain Sciences, 35, 292–293. doi: 10.1017/S0140525X12000209 [DOI] [PubMed] [Google Scholar]
- 80.Schotter E. R. (2013). Synonyms provide semantic preview benefit in English. Journal of Memory and Language, 69, 619–633. doi: 10.1016/j.jml.2013.09.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Rayner K., & Schotter E. R. (2014). Semantic preview benefit in reading English: The effect of initial letter capitalization. Journal of Experimental Psychology: Human Perception and Performance, 40, 1617–1628. doi: 10.1037/a0036763 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Schotter E. R., Lee M., Reiderman M., & Rayner K. (2015). The effect of contextual constraint on parafoveal processing in reading. Journal of Memory and Language, 83, 118–139. doi: 10.1016/j.jml.2015.04.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Schotter E. R., & Jia A. (2016). Semantic and plausibility preview benefit effects in English: Evidence from eye movements. Journal of Experimental Psychology: Learning, Memory, and Cognition, 42, 1839–1866. doi: 10.1037/xlm0000281 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Veldre A., & Andrews S. (2016a). Is semantic preview benefit due to relatedness or plausibility? Journal of Experimental Psychology: Human Perception and Performance, 42, 939–952. doi: 10.1037/xhp0000200 [DOI] [PubMed] [Google Scholar]
- 85.Veldre A., & Andrews S. (2016b). Semantic preview benefit in English: Individual differences in the extraction and use of parafoveal semantic information. Journal of Experimental Psychology: Learning, Memory, and Cognition, 42, 837–854. doi: 10.1037/xlm0000212 [DOI] [PubMed] [Google Scholar]
- 86.Yang J., Li N., Wang S., Slattery T. J., & Rayner K. (2014). Encoding the target or the plausible preview word? The nature of the plausibility preview benefit in reading Chinese. Visual Cognition, 22, 193–213. doi: 10.1080/13506285.2014.890689 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Bailey I. L., & Lovie J. E. (1980). The design and use of a new near- vision chart. American Journal of Optometry and Physiological Optics, 57, 378–387. doi: 10.1097/00006324-198006000-00011 [DOI] [PubMed] [Google Scholar]
- 88.Boudelaa S., & Marslen-Wilson W. D. (2010). Aralex: A lexical database for Modern Standard Arabic. Behavior Research Methods, 42, 481–487. doi: 10.3758/BRM.42.2.481 [DOI] [PubMed] [Google Scholar]
- 89.Christianson K., Johnson R. L., & Rayner K. (2005). Letter transpositions within and across morphemes. Journal of Experimental Psychology: Learning, Memory, and Cognition, 31, 1327–1339. doi: 10.1037/0278-7393.31.6.1327 [DOI] [PubMed] [Google Scholar]
- 90.Ktori M., Kingma B., Hannagan T., Holcomb P. J., & Grainger J. (2014). On the time-course of adjacent and non-adjacent transposed-letter priming. Journal of Cognitive Psychology, 26, 491–505. doi: 10.1080/20445911.2014.922092 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Perea M., Duñabeitia J. A., & Carreiras M. (2008). Transposed-letter priming effects for close versus distant transpositions. Experimental Psychology, 55, 384–393. doi: 10.1027/1618-3169.55.6.384 [DOI] [PubMed] [Google Scholar]
- 92.Hermena E. W., Liversedge S. P., Bouamama S., & Drieghe D. (2019). Orthographic and root frequency effects in Arabic: Evidence from eye movements and lexical decision. Journal of Experimental Psychology: Learning, Memory, and Cognition, 45, 934–954. doi: 10.1037/xlm0000626 [DOI] [PubMed] [Google Scholar]
- 93.Hermena E.W. & Reichle E.D. (2020), Insights from the study of Arabic reading. Language and Linguistics Compass, 14, 1–26.e12400. doi: 10.1111/lnc3.12400 [DOI] [Google Scholar]
- 94.Hermena E. W., Liversedge S. P., & Drieghe D. (2017). The influence of a word’s number of Letters, spatial Extent, and initial bigram characteristics on eye movement control during reading: Evidence from Arabic. Journal of Experimental Psychology: Learning, Memory, and Cognition, 43, 451–471. doi: 10.1037/xlm0000319 [DOI] [PubMed] [Google Scholar]
- 95.Holes C. (2004). Modern Arabic: Structures, functions and varieties. Washington, DC: Georgetown University Press. [Google Scholar]
- 96.Fitzsimmons G., & Drieghe D. (2011). The influence of number of syllables on word skipping during reading. Psychonomic Bulletin & Review, 18, 736–741. doi: 10.3758/s13423-011-0105-x [DOI] [PubMed] [Google Scholar]
- 97.Hermena E. W., Liversedge S. P., & Drieghe D. (2016). Parafoveal processing of Arabic diacritical marks. Journal of Experimental Psychology: Human Perception and Performance, 42, 2021–2038. doi: 10.1037/xhp0000294 [DOI] [PubMed] [Google Scholar]
- 98.Bates D., Mächler M., Bolker B., & Walker S. (2015). Fitting Linear Mixed-Effects Models Using lme4. Journal of Statistical Software, 67, 1–48. doi: 10.18637/jss.v067.i01 [DOI] [Google Scholar]
- 99.R Development Core Team. 2016. R: a language and environment for statistical computing. Vienna: R foundation for statistical computing. http://www.Rproject.org.
- 100.Lo S., & Andrews S. (2015). To transform or not to transform: using generalized linear mixed models to analyse reaction time data. Frontiers in Psychology, 6, 1171. doi: 10.3389/fpsyg.2015.01171 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 101.Barr D. J., Levy R., Scheepers C., & Tily H. J. (2013). Random effects structure for confirmatory hypothesis testing: Keep it maximal. Journal of Memory and Language, 68, 255–278. doi: 10.1016/j.jml.2012.11.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 102.von der Malsburg T., & Angele B. (2017). False Positives and Other Statistical Errors in Standard Analyses of Eye Movements in Reading. Journal of Memory and Language, 94, 119–133. doi: 10.1016/j.jml.2016.10.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 103.Perea M., Abu Mallouh R., García-Orza J., & Carreiras M. (2011). Masked priming effects are modulated by expertise in the script. Quarterly Journal of Experimental Psychology, 64, 902–919. doi: 10.1080/17470218.2010.512088 [DOI] [PubMed] [Google Scholar]
- 104.Veldre A., Andrews S. (2017). Parafoveal preview benefit in sentence reading: Independent effects of plausibility and orthographic relatedness. Psychonomic Bulletin and Review, 24, 519–528. doi: 10.3758/s13423-016-1120-8 [DOI] [PubMed] [Google Scholar]
- 105.Veldre A., & Andrews S. (2018). Parafoveal preview effects depend on both preview plausibility and target predictability. Quarterly Journal of Experimental Psychology, 71, 64–74. doi: 10.1080/17470218.2016.1247894 [DOI] [PubMed] [Google Scholar]
- 106.Veldre A., Reichle E. D., Wong R., & Andrews S. (2020). The effect of contextual plausibility on word skipping during reading. Cognition, 197, 104184. doi: 10.1016/j.cognition.2020.104184 [DOI] [PubMed] [Google Scholar]
- 107.Andrews S., & Veldre A. (2019). What is the most plausible account of the role of parafoveal processing in reading? Language and Linguistics Compass, 13. doi: 10.1111/lnc3.12344 [DOI] [Google Scholar]
- 108.Coltheart M., Rastle K., Perry C., Langdon R., & Ziegler J. (2001). DRC: A dual route cascaded model of visual word recognition and reading aloud. Psychological Review, 108, 204–256. doi: 10.1037/0033-295x.108.1.204 [DOI] [PubMed] [Google Scholar]
- 109.Reichle E. D. (2020). Computational models of reading: A handbook. Oxford, UK: Oxford University Press. [Google Scholar]
- 110.Taft M. (2004). Morphological decomposition and the reverse base frequency effect. Quarterly Journal of Experimental Psychology, 57, 745–765. doi: 10.1080/02724980343000477 [DOI] [PubMed] [Google Scholar]
- 111.Seidenberg M. S., & McClelland J. L. (1989). A distributed, developmental model of word recognition and naming. Psychological Review, 96, 523–568. doi: 10.1037/0033-295x.96.4.523 [DOI] [PubMed] [Google Scholar]
- 112.Abu-Rabia S. (2001). The role of vowels in reading Semitic scripts: Data from Arabic and Hebrew. Reading and Writing, 14, 39–59. doi: 10.1023/A:1008147606320 [DOI] [Google Scholar]
- 113.Saiegh-Haddad E. (2017). MAWRID: A model of Arabic word reading in development. Journal of Learning Disabilities, 51, 454–462. doi: 10.1177/0022219417720460 [DOI] [PubMed] [Google Scholar]
- 114.Harm M. W., & Seidenberg M. S. (2004). Computing the meanings of words in reading: Cooperative division of labor between visual and phonological processes. Psychological Review, 111, 662–720. doi: 10.1037/0033-295X.111.3.662 [DOI] [PubMed] [Google Scholar]
- 115.Van Orden G. C., Pennington B. F., & Stone G. O. (1990). Word identification in reading and the promise of subsymbolic psycholinguistics. Psychological Review, 97, 488–522. doi: 10.1037/0033-295x.97.4.488 [DOI] [PubMed] [Google Scholar]
- 116.Strain E., Patterson K., & Seidenberg M. S. (1995). Semantic effects in single-word naming. Journal of Experimental Psychology: Learning, Memory, and Cognition, 21, 1140–1154. doi: 10.1037//0278-7393.21.5.1140 [DOI] [PubMed] [Google Scholar]
- 117.Schmidtke D., Van Dyke J. A., & Kuperman V. (2018). Individual variability in the semantic processing of English compound words. Journal of Experimental Psychology: Learning, Memory, and Cognition, 44, 421–439. doi: 10.1037/xlm0000442 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 118.Brysbaert M. (2019). How many words do we read per minute? A review and meta-analysis of reading rate. Journal of Memory and Language, 109, 1–30. doi: 10.1016/j.jml.2019.104047 [DOI] [Google Scholar]
- 119.Rayner K., & Pollatsek A. (1981). Eye movement control during reading: Evidence for direct control. Quarterly Journal of Experimental Psychology, 33A, 351–373. doi: 10.1080/14640748108400798 [DOI] [PubMed] [Google Scholar]
- 120.Paterson K. B., Almabruk A. A., McGowan V. A., White S. J., & Jordan T. (2015). Effects of word length on eye movement control: The evidence from Arabic. Psychonomic Bulletin & Review, 22, 1443–1450. doi: 10.3758/s13423-015-0809-4 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
The final reported LMM models for the contrast matrices in Tables 3–5.
(DOCX)
The procedure and outcome of obtaining plausibility ratings for previews.
(DOCX)
Data Availability Statement
Data, stimuli and code are available at OSF: 10.17605/OSF.IO/GRP8K.

