Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2018 Oct 1.
Published in final edited form as: Trends Cogn Sci. 2017 Jul 1;21(10):721–724. doi: 10.1016/j.tics.2017.06.004

Chinese vs. English: Insights on cognition during reading

Lili Yu 1,*, Erik D Reichle 1
PMCID: PMC5610629  NIHMSID: NIHMS890037  PMID: 28676283

Abstract

Chinese reading experiments have introduced important caveats to theories of reading that have been largely informed by studies of English reading—especially in relation to our understanding of lexical processing and eye-movement control. This article provides a brief primer on Chinese reading and examples of questions that arise from its study.

Keywords: Chinese, eye movements, reading, word identification


Recent interest in Chinese reading reflects a growing appreciation that the language and writing system can inform our understanding of the perceptual, cognitive, and motor processes involved in reading—an understanding that has largely been informed by studies of English and other Western languages and writing systems [1]. This article will review what is known about the reading of Chinese versus English, focusing on the Chinese logographic writing system and how its properties affect two important aspects of skilled reading—word identification and eye-movement control (for reviews, see [23]).

As Figure 1 shows, Chinese is visually denser than English. Unlike English words, which consist of letter strings, Chinese words are composed of characters—the smallest pronounceable and meaningful units in Chinese, corresponding to morphemic syllables having one of four possible tones (in Mandarin). Each character consists of 1–36 overlapping strokes occupying a uniformly-sized, two-dimensional box-shaped spatial layout in text. Strokes can be further arranged into radicals, some of which can also be characters, but most being within-character subunits. Additionally, the words (most of which consist of 1–4 characters) are not demarcated by clear word boundaries.

Figure 1.

Figure 1

Panel A shows a single sentence (meaning: “She collected the sophora flowers to make a crown.”) containing words comprised of a variable number of characters, which in turn are composed of radicals and strokes (the numbers next to the pronunciations of characters denote the four possible tones). Individual radicals can convey meaning and phonological information, but in ways that are often complex. For example, in Panel A, character “花” is a regular phonogram because it has an identical pronunciation as its phonetic radical “化” (ignoring tonal differences), while character “槐” is an irregular phonogram. Panel B shows two groups of characters, one that is pronounced consistently with its component phonological radical and another where this relationship is inconsistent. Similarly, Panel C shows that there is a large degree of homophony in Chinese that varies in terms of its density. Finally, note that some simple characters can also be radicals embedded within other characters (e.g., “木”, “鬼” and “化” in Panel A are characters but also radicals in the characters “槐” and “花”), and that some radicals (e.g., the rightmost radical in “化” in Panel A) actually have neither a meaning nor pronunciation.

As Table 1 shows, with both English and Chinese, factors that increase the length or complexity of words also slow their identification. However, whereas word length and complexity are defined by the number of letters or morphemes in English, length and complexity are defined by the number of strokes, radicals, or characters in Chinese. The latter has enabled demonstrations that complexity modulates lexical processing when controlling for visual acuity in a manner not possible in English; for example, readers tend to fixate longer on complex than simple characters that are equally proximal to the center of vision [4]. Visual processing also likely plays a less important role in English word identification because their visual features (e.g., line segments) can be rapidly converted into abstract orthographic codes that represent individual letters devoid of case or font (e.g., “cat” and “CAT” are represented identically). In contrast, the visually dense, two-dimensional and hierarchical arrangement of features in Chinese words probably make their conversion to abstract orthographic codes more difficult (see [5]).

Table 1.

Factors affecting wordN processing during reading of English vs. Chinese.

Factors1 Observed Effects2 on WordN English Chinese
Sub-Lexical & Lexical Characteristics of WordN Orthography Length Inhibitory letters & morpheme s characters
Complexity Inhibitory morphemes strokes & radicals
Frequency/Familiarity Facilitative morphemes & words radicals, characters, & words
Neighborhood Density Inconsistent defined using letters defined using strokes, radicals, or characters
Phonology Grapheme-Phoneme Regularity Facilitative defined relative to other words defined by nature of phonogram
Grapheme-Phoneme Consistency Facilitative
Meaning Ambiguity Inhibitory morpheme s & words words, but predicted for radicals & characters
Concreteness Facilitative
Acquisition Age Inhibitory
Inter- & Supra-Lexical Factors Related to: WordN−1 Processing Difficulty (i.e., Spillover) Inhibitory orthographic & phonologic al, limited semantic orthographic, phonologic al, & semantic
WordN Pre-Processing Difficulty (i.e., Preview)
Word Boundaries Preferred-Viewing Location present absent
Predictability/Plausibility Facilitative meaning & syntactic category of words meaning & syntactic category of characters & words
WordN+1 Processing Difficulty Inhibitory (Parafove-a-on-Fovea Effect) absent present

Notes:

1

N is the index of the word being processed.

2

The effects are for increasing factor values (e.g., increasing word length inhibits wordN processing).

Despite the marked visual differences between English and Chinese, factors that influence how well the orthographic forms are represented in memory similarly affect their identification in both writing systems. For example, with Chinese, as the frequency of radicals, characters, and words increases, so too does word-identification efficiency. And as with English, the orthographic-neighborhood (i.e., number and/or frequency of orthographically similar words) influences Chinese word identification, although there might be more inconsistencies because there are more ways to define orthographic similarity (e.g., shared strokes vs. radicals vs. characters) [6].

There are also unexpected similarities between the two writing systems in terms of how the pronunciations and meanings of words are represented in and accessed from memory. For example, English word pronunciations can be generated by direct retrieval from memory and/or spelling-to-sound conversion rules. Additionally, whereas these spelling-to-sound rules are completely consistent in languages like Finnish, they are only “quasi-regular” in English; words that are pronounced according to the rules are regular (e.g., “cat” |kæt|) but others are irregular (e.g., “yacht” |jat|), and words can be pronounced in a manner that is consistent (e.g., “cat” rhymes with “bat”, “rat”, etc.) or inconsistent (e.g., “pint” does not rhyme with “hint”, “mint”, etc.) with similarly spelled words. Although Chinese characters are not pronounced using spelling-to-sound conversion rules, there is a remarkable degree of consistency in how characters are pronounced. For example, approximately 80% of Chinese characters are phonograms that contain one radical expressing pronunciation and one radical expressing meaning, allowing a character’s pronunciation to be generated by retrieving the pronunciation associated with its radical.

As Figure 1A shows, phonograms can be regular if the character is pronounced the same (ignoring possible tonal differences) as the phonological radical embedded within it; otherwise the phonogram is irregular. And as Figure 1B–C show, a phonogram’s pronunciation can be consistent or inconsistent with those of other phonograms. As with English, regular/consistent Chinese words are identified more rapidly than irregular/inconsistent words, showing that readers of Chinese are able to abstract grapheme-phoneme correspondences despite their complex nature (see [7]). Similarly, word identification is affected by homophony, or the number of words pronounced the same, in a way that is still poorly understood [8]. This may reflect the fact that Chinese word pronunciations become available in a “threshold-style” manner, directly from memory (see [9]), whereas most English word pronunciations can also be generated using spelling-to-sound conversion rules.

Perhaps not surprisingly, semantic variables known to affect English word identification also affect Chinese word identification. For example, with all else being equal, words with unambiguous meanings, concrete meanings, or learned early in life tend to be more rapidly identified. As Table 1 shows, such effects have been demonstrated in both writing systems, but with Chinese are also predicted for radicals and characters—at least for those that are also words. And as Table 1 shows, supra-lexical effects of word predictability are also evident in both languages—with all else being equal, wordN is easier to identify to the extent that it is constrained by its preceding sentence context.

Finally, as Table 1 shows, several inter- and supra-lexical variables similarly influence the reading of English and Chinese. For example, the difficulty associated with processing wordN−1 “spills over” to slow the processing of wordN, possibly by reducing preview of wordN [10]. However, several important differences also result from the fact that English is written with spaces between words whereas Chinese is not. These spaces allow English words to be rapidly identified as discrete “objects” for the purposes of lexical processing and eye-movement control during reading. One implication of this is that readers tend to direct their eyes slightly left of a word’s center because that is the preferred-viewing location that affords efficient lexical processing. In contrast, the lack of clear word boundaries in Chinese text increases the difficulty associated with both the segmentation of words for the purpose of their identification, and the selection of saccade targets for the purpose of moving the eyes from one word to the next. Both of these differences may contribute to reports of larger preview effects, where the parafoveal preview of wordN modulates the time spent looking at wordN, as well as the presence of parafovea-on-fovea effects in Chinese (which are absent in English), where the processing difficulty associated with wordN+1 modulates the fixation time on wordN [11]. Both effects likely reflect how the absence of word boundaries makes it more difficult to accurately coordinate the eyes and attention, so that, for example, there are more instances of attention being on a region corresponding to wordN+1 with the eyes inadvertently being on a region corresponding to wordN. The lack of clear word boundaries also likely contributes to the absence of preferred-viewing locations, with fixation landing-site distributions being fairly uniform [12].

These differences between the reading of English versus Chinese are intriguing and allow us to compare how attention, word identification, and saccadic programming operate under conditions that respectively impose less versus more extreme demands on the perceptual, cognitive, and motor systems supporting reading. (The separation of words using spaces is a recent cultural convention adopted to make reading easier.) To date, efforts to explain how these systems operate and are coordinated have focused largely on English, with current models explaining only limited aspects of Chinese reading, such as how characters are identified, how words are segmented, or how saccade targets are selected (for a review, see [13]). More comprehensive models are necessary to determine how these processes are coordinated—to understand how characteristics of the Chinese writing system affect word identification in continuous text, and how this in turn influences (and is influenced by) the progression of the eyes during reading [14]. Another question concerns the role of phonology in Chinese reading: Given the evidence that phonology becomes available in an all-or-none manner, what role does it play in reading, where words are typically identified by integrating parafoveal information with information that becomes available after the words are fixated? Finally, understanding how two-dimensional arrays of strokes are configured to produce radicals and characters might provide new insights into the alignment problem, or the question of how the relative order of letters in alphabetic languages is encoded and represented. We suspect that future research will reveal a plethora of new questions not anticipated by this review.

Acknowledgments

This research was support by funding from the U.S. National Institute of Health (RO1HD075800).

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

References

  • 1.Share DL. On the Anglocentricities of current reading research and practice: The perils of overreliance on an “outlier” orthography. Psychol Bull. 2008;134:584–615. doi: 10.1037/0033-2909.134.4.584. [DOI] [PubMed] [Google Scholar]
  • 2.Li X, Bicknell K, Liu P, Wei W, Rayner K. Reading is fundamentally similar across disparate writing systems: A systematic characterization of how words and characters influence eye movements in Chinese reading. J Exp Psychol Gen. 2014;143:895–913. doi: 10.1037/a0033580. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Li X, Zang C, Liversedge S, Pollatsek A. The role of words in Chinese reading. In: Pollatsek A, Treisman R, editors. The Oxford handbook of reading. Oxford, UK: Oxford University Press; 2015. pp. 232–244. [Google Scholar]
  • 4.Liversedge SP, Zang C, Zhang M, Bai X, Yan G, Drieghe D. The effect of visual complexity and word frequency on eye movements during Chinese reading. Vis Cogn. 2014;22:441–457. [Google Scholar]
  • 5.Yu L, Zhang Q, Priest C, Reichle ED, Sheridan H. Character-complexity effects in Chinese reading and visual search: A comparison and theoretical implications. Q J Exp Psychol. 2017 doi: 10.1080/17470218.2016.1272616. http://dx.doi.org/10.1080/17470218.2016.1272616. [DOI] [PMC free article] [PubMed]
  • 6.Chang YN, Hsu CH, Tsai JL, Chen CL, Lee CY. A psycholinguistic database for traditional Chinese character naming. Behav Res Methods. 2016;48:112–122. doi: 10.3758/s13428-014-0559-7. [DOI] [PubMed] [Google Scholar]
  • 7.Chang YN, Welbourne S, Lee CY. Exploring orthographic neighborhood size effects in a computational model of Chinese character naming. Cognitive Psychol. 2016;91:1–23. doi: 10.1016/j.cogpsych.2016.09.001. [DOI] [PubMed] [Google Scholar]
  • 8.Sze WP, Yap MJ, Rickard Liow SJ. The role of lexical variables in the visual recognition of Chinese characters: A megastudy analysis. Q J Exp Psychol. 2015;68:1541–1570. doi: 10.1080/17470218.2014.985234. [DOI] [PubMed] [Google Scholar]
  • 9.Perfetti CA, Liu Y, Tan LH. The Lexical Constituency model: Some implications of research on Chinese for general theories of reading. Psychol Rev. 2006;112:43–59. doi: 10.1037/0033-295X.112.1.43. [DOI] [PubMed] [Google Scholar]
  • 10.Yan M, Kliegl R, Shu H, Pan J, Zhou X. Parafoveal load of word N+1 modulates preprocessing effectiveness of word N+2 in Chinese reading. J Exp Psychol Human. 2010;36:1669. doi: 10.1037/a0019329. [DOI] [PubMed] [Google Scholar]
  • 11.Schotter ER, Angele B, Rayner K. Parafoveal processing in reading. Atten Percept Psycho. 2012;74:5–35. doi: 10.3758/s13414-011-0219-2. [DOI] [PubMed] [Google Scholar]
  • 12.Liu Y, Reichle ED, Li X. The effect of word frequency and parafoveal preview on saccade length during the reading of Chinese. J Exp Psychol Human. 2016;42:1008–1025. doi: 10.1037/xhp0000190. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Reichle ED, Yu L. Models of Chinese reading: Review and Analysis. 2017 doi: 10.1111/cogs.12564. Manuscript submitted for review. [DOI] [PubMed] [Google Scholar]
  • 14.Liu Y, Huang R, Gao DG, Reichle ED. Further tests of a dynamic-adjustment account of sacccade targeting during the reading of Chinese. Cognitive Sci. 2017 doi: 10.1111/cogs.12487. Manuscript in press. [DOI] [PubMed] [Google Scholar]

RESOURCES