Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2015 Apr 23.
Published in final edited form as: Nat Commun. 2012;3:1284. doi: 10.1038/ncomms2220

Sequential then Interactive Processing of Letters and Words in the Left Fusiform Gyrus

Thomas Thesen 1,2, Carrie R McDonald 2, Chad Carlson 1, Werner Doyle 1, Syd Cash 3, Jason Sherfey 2, Olga Felsovalyi 2, Holly Girard 2, William Barr 1, Orrin Devinsky 1, Ruben Kuzniecky 1, Eric Halgren 2,4
PMCID: PMC4407686  NIHMSID: NIHMS425656  PMID: 23250414

Abstract

Despite decades of cognitive, neuropsychological, and neuroimaging studies, it is unclear if letters are identified prior to word-form encoding during reading, or if letters and their combinations are encoded simultaneously and interactively. Here, using functional magnetic resonance imaging, we show that a ‘letter-form’ area (responding more to consonant strings than false fonts) can be distinguished from an immediately anterior ‘visual word-form area’ in ventral occipitotemporal cortex (responding more to words than consonant strings). Letter-selective magnetoencephalographic responses begin in the letter-form area ~60ms earlier than word-selective responses in the word-form area. Local field potentials confirm the latency and location of letter-selective responses. This area shows increased high gamma power for ~400ms, and strong phase-locking with more anterior areas supporting lexico-semantic processing. These findings suggest that during reading, visual stimuli are first encoded as letters before their combinations are encoded as words. Activity then rapidly spreads anteriorly, and the entire network is engaged in sustained integrative processing.

Keywords: fMRI, magnetoencephalography, intracranial EEG, memory, language, high gamma, reading

Introduction

Fluent readers distinguish between thousands of subtly different visual stimuli, associating each with a different meaning within a few hundred milliseconds. Some models of reading suppose that visual stimuli are identified as letters before their ordered combinations are identified as words, noting that brain lesions can specifically impair the ability to recognize letters[1], or to identify single letters but not whole words [2]. Such cases are countered by studies in healthy subjects showing that letters are more quickly and accurately identified within the context of words (the ‘word superiority effect’), suggesting that letter- and word-recognition may not be sequential and separable, but rather simultaneous and integrated [3].

More recently, neuroimaging studies have identified a ‘visual word-form area’ (VWFA), showing increased hemodynamic activation to words compared to sensory controls, and centered in the left posterior fusiform gyrus (lpFg; for review see[4], for limitations to this concept see[5]). Critically, activation in this area to letter-strings increases with their similarity to actual words[6, 7], especially in more anterior VWFA[8], suggested that it actually comprises a succession of detectors responding to progressively more abstract lexico-semantic aspects of the letter-strings. A word-selective response can also be recorded with EEG, peaking over the left occipital scalp at ~140–220ms[9]. This response has been localized to lpFg with magnetoencephalography (MEG)[10, 11] and intracranial local field potentials (LFP)[1214].

In contrast to the strong multimodal evidence for word-form processing in VWFA, the evidence for separable letter-form processing is equivocal. Although several studies have reported larger EEG responses to letter-strings as compared to false fonts over left lateral occipital scalp, it is not clear if these differ in either latency or location from word-form responses[9, 15]. Functional magnetic resonance imaging (fMRI) provides more certain localization, but has not identified areas where letter-strings reliably evoke more activity than false fonts within lpFg, nor has it been able to provide information regarding the timing of these processes[8, 16].

Here we identify a putative letterform area immediately posterior to the VWFA with fMRI in healthy subjects, and show with MEG that letter-selective activation estimated to the putative letterform area precedes the word-selective activation in the VWFA. Next, we use local field potentials recorded directly from the letterform area using pial electrodes in epileptic patients to confirm and extend the non-invasive measures, providing converging evidence for a separate letterform area preceding in time and anatomy the VWFA. Finally, we show using intracranial recordings that activation of the putative letterform area is prolonged, overlapping and phase-locked with anterior language areas during later, but not earlier, stages of reading.

Results

Letter- and word-selectivity

We recorded brain activity in English readers evoked by false fonts (FF) arranged in a string like a word, by consonant strings (CS), and by real words (RW). We reasoned that if separate letter-form and word-form processing stages exist, they would be indexed by CS>FF, and RW>CS contrasts, respectively. Stimuli were presented every 600ms with no gap, and the subject responded to rare (<5%) animal names. This task required the subject to attempt to read each stimulus, the cognitive process under examination. Although non-word stimuli would thus be subjected to less processing once they were identified as such, our main focus was on the first pass of neural activity occurring before definitive word identification.

Hemodynamic responses

First, we used fMRI in 12 healthy subjects to isolate candidate areas in lpFg. Letter-selective (CS>FF) hemodynamic activation was restricted to lpFg, and word- selective (RW>CS) processing was immediately anterior, with very little overlap (Fig 1A, Fig. 2). Word-selective areas extended beyond the lpFg to traditional language areas (Wernicke’s and Broca’s), as well as cingulate gyrus and contralateral sites. In order to maximize single subject SNR we used a block design for the fMRI modality only. Thus the subjects may have used shallower processing for the non-word stimuli, accentuating their difference from words. Furthermore, the contrast RW>CS would be expected to reveal areas processing more abstract lexical and semantic properties as well as those processing word-forms. Nonetheless, the fMRI study accomplished its goal, to localize for further study candidate structures in lpFg that might underlie letter-form and word-form processing.

Figure 1. Putative letter-form area identified with fMRI and MEG.

Figure 1

A. fMRI. Hemodynamic activation to letter-selective (red) and word-selective (orange) contrasts, or both (yellow). B. MEG. Estimated time-courses of activation (F-values) in 4 Regions of Interest (ROI) in the left ventral occipitotemporal and orbital cortices. ROIs, centered at the ends of the arrows, were chosen based on fMRI activation. Colors (A) and asterisks (B) mark cluster-corrected differences, t-test, p<.05; n=12 healthy subjects. MNI coordinates of the maximum activation clusters: letter-form area (−40 −78 −18), word-form area (−46 −52 −20).

Figure 2. Interaction of BOLD response to factors of task contrast and ROI.

Figure 2

A. Location of putative letterform and wordform areas used for this analysis. B. BOLD response in these areas to the letterform contrast (CS, consonant strings, as compared to false fonts, FF) and wordform contrast (N, novel words, as compared to CS). BOLD signal in the letterform area (left) is very sensitive to the CS versus FF contrast but not to the N versus CS contrast, i.e., it is sensitive to whether the stimulus is composed of letters but not to whether the letters compose a word. In contrast, BOLD signal in the wordform area is somewhat sensitive to whether the stimulus is composed of letters (CSvFF), but is more sensitive to whether the letters compose a word. ANOVA for area (letterform, wordform) × contrast (CSvFF, NvCS) showed a significant area × contrast interaction (p<.05, F(11)=5.05). The BOLD response is in arbitrary units.

Magnetoencephalographic responses

Due to the nature of neurovascular coupling, hemodynamic measures cannot distinguish the onsets of neural processing stages that differ by less than about a second. Consequently, we turned to the millisecond accuracy of MEG to examine the time-course of processing evoked by FF, CS and RW within the regions identified by fMRI in the lpFg. By using a random stimulus order, and concentrating on first-pass processing, we were able to determine when CS>FF and RW>CS effects initially occur, prior to potentially confounding effects of differential processing, which could occur only after stimulus identification.

MEG is mainly generated by currents within apical dendrites of cortical pyramidal cells. Currents were estimated with noise-normalized minimum norm constrained by each subject’s MRI[17]. At 160ms, the first letter-, but not word-selective differences peak in lpFg (Figure 1B, area 1). Word-selective activation emerges later, peaking at 225ms in an immediately anterior location (Figure 1B, area 2). At this latency, letter-selective responses are also estimated to this area. Thus, like hemodynamic activation, the earliest neural currents which were letter-selective but not word-selective were estimated to occur only in the most posterior part of lpFg. Furthermore, these letter-selective currents peaked earlier than more anterior word-selective responses. Unlike its hemodynamic response, currents in anterior fusiform g. showed letter-selective as well as word-selective responses (Figure 1B, area 3). Dissociations between MEG and fMRI may occur because they are sensitive to different aspects of neural activity, and fMRI integrates activity over a longer time period [18]. Nonetheless, MEG confirms a succession in time and space of neural currents distinguishing first letters and then words from their respective controls, confirming the spatial succession shown by hemodynamic measures (Fig. 3).

Figure 3. Task contrasts across different latencies and areas.

Figure 3

A. Equivalent current dipole (ECD) strength in the letterform area responds at an early latency (160ms) to consonant strings (CS, as compared to false fonts, FF), but shows little differential response at either latency to novel words (N) versus CS. putative letterform and wordform areas were defined by fMRI responses in the same subjects. ECD strength is estimated from MEG as the absolute difference between noise-normalized dipole strengths. B. ECD strength in the wordform area shows little differential response to either contrast at the early latency, but responds more to words than CS, at the longer latency (225ms). A supplementary MANOVA for area (letterform, wordform) × latency (160, 225ms) × contrast (CSvFF, NvCS) showed a significant area × latency interaction (p<.05, F(1,11)=5.97). MEG responses were estimated for areas 1 and 3 as shown in Figure 1. Motivated by studies suggesting that very early word-selective responses may be present shortly after ~100ms [50, 51], we also examined MEG responses at this latency in a supplementary t-test but failed to find any differences between conditions.

Intracranial EEG responses

Although providing excellent timing, localizations of MEG generators are always subject to some uncertainty. Unambiguous localization was obtained with LFP recordings from the lpFg surface using electrodes implanted in epileptic patients for the clinical purpose of localizing seizure onset relative to eloquent cortex. Nine patients had electrodes located in the ventral occipito-temporal region of the language dominant hemisphere, and had normal verbal IQ and reading ability (see Supplementary Table S1). Electrode contacts considered for analysis were within 1cm of the group hemodynamic response, were distant from the ultimately determined seizure focus and from brain abnormalities identified with structural imaging, and had normal-appearing background activity with few or no epileptiform spikes or slow waves. Of 34 such contacts, 25 recorded LFP (intracranial ERP) that responded during the task compared to prestimulus baseline. Of these 25 responsive contacts, 14 responded differentially to CS vs FF before 300ms (fig. 4). Since the LFP records essentially the same signal locally that the MEG records at a distance, the LFP responses directly confirm the inferred localization of MEG generators (see Supplementary Fig. S1).

Figure 4. Direct intracranial recordings confirm inferences from non-invasive fMRI and MEG.

Figure 4

A. Intracranial LFP (A) and HGP (B) differentiate between consonant strings (CS) versus false fonts (FF), in an electrode contact (bold white circle, open arrow) centered on fMRI activation to the same contrast in the same patient (C), at the posterior limit of the left fusiform gyrus (D). No HGP response to either CS or FF were recorded by adjacent contacts (E; responses are plotted at the same scale as in B; these adjacent contacts, which are lateral (L) or medial (M) to that in B, are marked in C and D). The HGP response was highly correlated with the number of letters (F), and extended to >140Hz (G). A, B, F, and G display different recordings from the same contact. B. Differential LFP (H) response to CS vs FF in another patient, again recorded over the left posterior fusiform cortex in a location which showed BOLD activation (J) in the same contrast in the same patient. Electrical stimulation between this contact and the medially-adjacent contact (J) disrupted naming performance. This patient also performed MEG with activation (F-values) estimated to the same area at the latency of the LFP response (I, K). C. Differential LFP (L) and HGP (M) responses to CS vs FF over left posterior fusiform cortex (P). Although the same location responds to words vs consonants (N, O), the differential response begins >80ms later. Again, the HGP response extends across all recorded gamma frequencies (Q), and no significant response is observed in adjacent contacts (R; same scale as M). The polarity and morphology of the LFP responses (A,H,L) are highly variable as is typically seen in the vicinity of the LFP generator, presumably reflecting the exact spatial relationship of the electrode to the generator as well as individual differences. Brown rectangles behind waveforms indicate significant condition differences using resampling statistics across individual trials. HGP is in arbitrary units.

High-gamma band power

The polarity of MEG or LFP does not reliably indicate if the underlying population is producing increased or decreased neuronal activity. Such information can be derived from broadband high gamma power (HGP), which arises from summated fast post-synaptic membrane currents and action potentials. The 9 patients were implanted with a total of 1351 electrodes of which 107 (7.9%) contacts exhibited significant task-related HGP. Of these 107, 7 (6.5%) contacts recorded greater activation to CS than FF before 250ms, of which 6 (85%) were in lpFg, thus providing additional evidence that letter-selective activation is mainly localized to this area.

Common response patterns across brain imaging modalities

The locations and timing of the LFP and HGP responses to words, CS, and FF directly recorded from lpFg in patients thus showed a good correspondence to the fMRI and MEG contrasts recorded from healthy controls. In addition, excellent correspondence was observed in one patient studied with fMRI prior to electrode implantation (fig. 4A), and in another patient studied with both fMRI and MEG recordings (fig. 4B). The recording electrode on the cortical location showing CS>FF hemodynamic activation also recorded focal CS>FF LFP and HGP. The HGP response significantly differentiates between CS and FF beginning at ~140–170ms, very close to that observed with MEG in the same subject. The LFP and HGP responses were in most cases highly focal, being absent in the adjacent contacts separated by 6mm (fig. 4 E,R).

Number of letters

Previous studies have found that the number of letters does not affect hemodynamic activation of the VWFA, but does affect the immediately posterior region[19, 20]. We also found that the letter-selective HGP responses increased linearly with the number of letters (fig. 4F). Specifically, in the two subjects with the highest SNR recordings, the average HGP from 200–300ms correlated with number of letters in CS (Pearson’s r=.96, .95; both p<.01) and words (Pearson’s r.94, .88; both p<.05) but not FF (r=−.32, .64; both p>.2; please see Supplemental Materials for details). Thus, this correlation with number of letters does not reflect greater sensory stimulation (since it was not seen with increasing numbers of FF stimuli), and is independent of word frequency or meaning (since CS have neither). When considering the words only, there is no significant correlation with word frequency if the effects of word length are removed (Supplemental materials), unlike what has been reported for the VWFA [21]. These findings show that the processing devoted by the letter-selective area to a stimulus is proportional to the number of letters it contains but is not sensitive to basic lexical properties such as frequency. These characteristics are consistent with its putative role in processing individual letters instead of whole words, and distinguish it from the VWFA.

Temporal dynamics of high-gamma power

Since HGP is highly correlated with hemodynamic activation[22], the HGP responses recorded at the location of hemodynamic responses should indicate the time-course of the neural activity underlying the hemodynamic activations. In the highest SNR HGP recordings, letter-selective activity began at ~150ms after CS onset, peaked at ~200ms, and continued for over 400ms (figs. 4B, 4M). Thus, although activation of the putative letter-form area begins prior to more anterior language areas, it is prolonged and overlaps with word-form, lexical and lexico-semantic processing.

Temporal dynamics of communication between brain regions

In order to obtain additional evidence regarding whether these co-activated areas are communicating, the phase-locking value (PLV) was calculated between active sites [23]. PLV measures the consistency of the relative phase of LFPs in two locations. High PLV indicates consistent synchronization of the synaptic currents in pyramidal apical dendrites between the cortical locations underlying the intracranial sensors. Such inferences are weakened in EEG or MEG by the fact that any two sensors will often record activity from the same cortical location, resulting in spurious correlations [24]. Intracranial LFP are focally sensitive to the underlying cortex and thus are not prone to this confound.

PLV was strongly elevated during word processing from ~170–400ms between the lpFg sites showing letter-selectivity and other locations responding to words (fig. 5). In order to test the generality of this finding, a single-trial estimate of the PLV (PLVi) was calculated for 24 electrode-pairs, each between an lpFg electrode with early CS>FF HGP activation, and another location with temporally overlapping statistically significant differential HGP responses in the same task. Fourteen (58%) showed significantly increased PLVi (8–35Hz; 140–300ms) to words as compared to false fonts (p<.01; please see Supplemental Materials for details). Although the PLV indicated very high levels of phase synchrony during the critical period while reading words, it was at chance levels prior to word onset, or in response to FF (fig. 5). Resting-state fMRI correlations have been reported between the VWFA and other language-related regions [25], but other studies have given apparently contradictory results[26]. In any case, the phase-locking reported here is transient and restricted to reading, and occurs at an about one thousand times higher frequency (8 to 35 Hz for PLV as compared to 01 to .1 Hz for resting-state fMRI correlations), rendering direct comparisons problematic. The high PLV between the putative letterform area and anterior language-related areas suggests that although early processing of the visual word during reading is sequential and modular, later processing is simultaneous and interactive across a widespread network of structures with complementary specializations. Participation by letter-selective regions in the broader language network is also implied by the picture naming deficits induced by electrical stimulation of the contacts recording letter-selective responses in one subject (fig. 4J).

Figure 5. Phase-locking values between co-activated structures suggests sustained interactions.

Figure 5

PLV between the posterior fusiform letter-selective area (large white circle) and other simultaneously active sites (colored circles) is increased from ~160–400ms to words (A), and consonants (B), but not false fonts (C). Columns 1 and 3: Time-frequency plots of PLV between the posterior fusiform and the prefrontal (PF), anterior temporal (AT), lingual (LG), lateral occipitotemporal (LOT), lateral occipital (LO), and medial occipitotemporal (MOT) contacts. Color bar in B applies also to A and C; bar in B to A and C. Thus, the color bar is constant for a given contact-pair across all conditions. Column 2: PLV between the posterior fusiform letter-selective site and all other sites mapped at 10Hz, 225ms. Columns 1 and 3 display the absolute size of the PLV; Column 2 shows the z-score relative to prestimulus baseline, thresholded z>5.

Discussion

This study replicated previous studies showing word-selective hemodynamic activation in lpFg[4], and then demonstrated letter selective activation in the posteriorly adjacent area. Previous studies recording the hemodynamic response to CS and FF have either not directly compared them[27], not reported their comparison[28], found no differences in the lpFg[16], or found only locations with FF>CS[8]. In most cases, these studies used low level tasks in order to prevent the possible confound of differential stimulus processing, but this may have unintentionally biased them against specific letter- or word-form processing. We used a high level task which required reading for meaning and were able to avoid the possible confound by concentrating on first pass processing probed with high temporal resolution electromagnetic techniques. Due to the random stimulus order each stimulus could be a word, and thus had to be processed initially as if it were a word. Eventually, FF were identified as such, attenuating further lexico-semantic processing. However, identification of the stimulus as FF must have occurred after the stage of interest because the stage of interest is exactly that which performs such identification. Due to the high temporal resolution of MEG and ECOG, we observed the activity of each stage without contamination by other stages, and distinguished which anatomical location selectively responded to CS versus FF at the shortest latency, even though many structures eventually showed such effects due to both feedforward and feedback influences at longer latencies.

It is possible that FF could have been determined very rapidly to not be letters and this resulted in fewer resources being devoted to their further processing. Similarly, CS may have been rapidly determined to have no vowels and thus evoked shallower processing than RW. If so, it is possible that our measure of CS processing (CS minus FF) was incomplete, for example in that not all letters were identified during this shallow processing. However, we note that our task, which requires reading for meaning, is more likely to encourage letter identification than the perceptual tasks which strive for identical processing of FF, CS and RW. Indeed, activation by CS of the putative letterform area was proportional to the number of letters in the string, suggesting that all letters were processed. Finally, even if the letters in CS were not completely processed in our task (i.e., as much as letters in RW), the result would be to decrease the effect size that we observed, not change their interpretation.

Several studies have compared responses to letters versus symbols, sometimes finding greater fMRI activation in lpFg with consistent EEG responses [29]. Using a low-level task, Vartiainen et al.[30] did not detect greater fMRI activation to words or letters than to symbols and other controls in lpFg, but were able to fit dipoles with greater activity to letters in lateral temporo-occipital cortex. Other studies have found that this area may show fMRI activation with attention[31] or working memory[32] for single letters as compared to symbols. Differential MEG activity to symbols has also been localized at early latencies to more postero-medial occipital areas[10]. This may correspond to the most posterior lpFg differential fMRI activation noted in the current study (fig. 1B).

A previous intracranial study failed to find any difference in HGP or LFP evoked by FF compared to CS[14]. However, this study also used a perceptual task, and sampled the sulci surrounding lpFg with depth electrodes. We recorded from the ventral surface of the lpFg, where the responses were highly focal. Additional studies are needed to determine if the letterform area requires a reading task for full activation, and if it extends anatomically from the crown of the lpFg into the surrounding sulci. Additional studies are also needed to determine if this area responds to stimuli besides letters and words.

Using the excellent temporal resolution of MEG we found that the letter-selective activation in lpFg precedes the more anterior word-selective activity. We confirmed the timing and anatomical location of the letter-form responses identified with the noninvasive measures with direct intracranial recordings of LFP and HG, and further demonstrated that these responses comprise increased synaptic processing. Our finding that letter-form and word-form processing are arranged sequentially in the lpFg is consistent with previous studies of reading showing relatively greater activation to higher order lexical and ultimately semantic stimulus properties in more anterior locations in humans with fMRI[8, 33], and MEG[34, 35]. Intracranial recordings confirm that the first sweep of activation along the ventral stream extends to Broca’s region[14, 36], and comprises a current sink in layer IV with sharply increased firing[37]. In the anteroventral temporal lobe, first pass activity to words may even be selective for the semantic category of the word [38]. These findings are also consistent with the general posterior-to-anterior gradient in the complexity of visual stimulus processing in the ventral stream demonstrated with single unit recordings in monkeys[39].

Neural activity in the putative letter-form area remained strongly elevated during reading for hundreds of millseconds following the initial letter-selective activation. This later processing could be sensitive to multiple constraints, and preceded the behavioral response. Furthermore, during these later stages, widely distributed areas were activated to words, and their activity became strongly but transiently phase-locked with the lpFg electrodes showing early letter-form responses, especially when reading words. These results resemble the transient phase-locking which occurs between the fusiform face area and more anterior sites in the right hemisphere[40], adding to the many parallels which have been found between face and word recognition[4, 41, 42].

Thus, following the initial feedforward sweep, the current HGP and PLV results strongly support a sustained and interactive co-activation of a network of sites contributing to reading. This could provide the substrate for distributed calculation of word identity and meaning[5], an interpretation that is supported by the disruption of naming by stimulation of the putative letter-form area in one patient. The top-down influences may also underlie the word-superiority effect[3]. Alternatively, it is possible that lpFg stimulation disrupted naming by interfering with remote processing, and that top-down information to the putative letter-form area serves only as a training signal to help refine processing that is essentially sensory pattern recognition. In either case, our results suggest that words are processed first sequentially in stages with increasing complexity[4], and then in parallel in multiple areas encoding complementary properties[43].

Methods

Participants

Twelve healthy right-handed subjects underwent fMRI testing, and a separate group of 12 healthy subjects underwent MEG testing. In addition, we analyzed local field potentials from 9 patients implanted with intracranial electrodes while performing the task (see Supplementary Table 1 for patient characteristics). Electrodes were implanted to localize seizure onset prior to contemplated surgical treatment. One of these patients was also studied with fMRI during the same task prior to surgery, and another with both fMRI and MEG. Subjects gave written informed consent to participate in this study and the study was approved by the NYUMC and UCSD IRBs in accordance with the Declaration of Helsinki.

Semantic Judgment Task

Stimuli were white letters on a black background in Arial font at 4° visual angle, comprising real words (RW), previously presented ‘old’ words (OW), non-pronounceable consonant letter strings (CS), false font stimuli (FF), and 40 target words. FF were alphabet-like characters that matched a real letter in the English alphabet in size, number of strokes, total line length, and curvature (see Table 1). FF strings were each matched to a RW in the number of characters. Subjects pressed a button in response to low-frequency target words representing animals. RW were 4–8 letter nouns, with a written lexical frequency of 3–80 per 10 million[44]. Tasks were programmed using Presentation software (Neurobehavioral Systems, Inc).

Table 1.

Stimuli used for semantic judgment tasks

Categories Examples
targets COBRA
real words BURN
consonants LPBV
false fonts graphic file with name nihms425656t1.jpg

The same design was used for both MEG and iEEG. We presented 400 each RW, OW, CS, and FF, plus 80 targets pseudo-randomly with the constraint that each condition was preceded by every other condition with equal likelihood. Stimulus exposure and stimulus onset asynchrony were both 600ms. Throughout the experiment, each CS and FF stimulus was only presented once. Here we report results on the RW, CS and FF comparisons; later responses to stimulus repetition are reported elsewhere[45]. Subjects detected 83% (stdev: 12.2) of the targets in the MEG task with a mean reaction time of 694 ms (stdev = 92 ms). They detected 78% (stdev: 13.8) during iEEG recordings (chance = 4.8%) with a mean reaction time of 744 ms (stdev = 121 ms). Since the RT often exceeded the SOA, the trials following targets were excluded from averages.

A blocked version of the semantic judgment task was designed for fMRI in order to maximize SNR, with 30 blocks including 5 blocks each of RW, OW, and CS, and 15 blocks of FF. Each block contained 40 words of one stimulus type, plus two targets. Blocks of RW and CS were presented in random order. Subjects detected 84% (stdev: 9.2) of the targets. Mean reaction time was 688 ms (stdev = 76 ms).

MRI analysis

Twelve healthy subjects (6 males, mean age: 23, range 19–36) underwent fMRI testing. Each subject was right-handed and free of neurological impairments. Handedness was assessed with the Edinburgh Handedness Inventory[46]. 3T MRI data were acquired and analyzed using FreeSurfer, FSL, and custom software as previously described[45]. Letter-specific activation was defined as increased BOLD to CS versus FF, since they were closely matched on basic visual features. Similarly, word-specific activity was defined as increased BOLD to RW versus CS. Larger responses to FF are common, with EEG as we: ll as BOLD, especially in the right hemisphere[9, 15]. Since such responses are thought to reflect the novelty of FF rather than template-matching[16], we omitted them from our study. Functional MRI data were preprocessed using FSL (www.fmrib.ox.ac.uk/fsl). For each subject, motion correction was performed using FLIRT [47], and data were spatially smoothed using a 5 mm full width half-maximum (FWHM) Gaussian kernel, grand-mean intensity normalized, high-pass filtered at sigma = 50 sm and pre-whitened using FILM [48]. Functional scans were co-registered to T1-weighted images [47, 49], and analyzed using FMRI Expert Analysis Tool (FEAT) Version 5.90, part of FSLs FMRIB’s software library. BOLD parameter estimates (beta-weights) were averaged across the two runs for each contrast of interest (Real Words > Consonant Strings and Consonant Strings > False Fonts). Percent signal change was calculated in MATLAB (The Mathworks, Natrick, MA) by multiplying the beta-weights by 100 x the regressor height and dividing by the mean functional volume. Individually averaged functional data were then resampled from each volume to each individual’s native surface, then from native surface to spherical atlas space for surface-based group analysis.

MEG analysis

MEG signals were recorded from 204 planar gradiometers as previously described[11]. Distributed source estimates of cortical activity were calculated from gradiometer data using dynamic statistical parametric mapping and cortical dipole constraints derived from each individual’s reconstructed MRI[17]. Peak amplitudes from each subject in fMRI-based ROIs were entered into ANOVA.

Intracranial EEG analysis

Local field potentials were recorded from intracranially implanted subdural electrodes (AdTech medical Instrument Corp., WI, USA) in patients undergoing elective monitoring of medically intractable seizures (see below; see Table S1 for patient demographics), with implant sites over the left ventral occipito-temporal cortex in nine patients. A large number of additional brain areas were sampled, including regions that were subsequently determined to be non-epileptogenic. Patients were native English speaking and left language dominant, with average performance on cognitive, language and reading tests and normal language organization as indicated by cortical stimulation mapping, when available. Only electrode contacts outside the seizure onset zone and with normal interictal activity were included in the analysis. In each case, the source of the patient’s epilepsy was thought to be focal and in an operable brain region. Electrode placement was based entirely on clinical grounds for identification of seizure foci and eloquent cortex during stimulation mapping, and included grid (8×8 contacts), depth (1×8 contacts) and strip (1×4 to 1×12 contacts) electrode arrays with 10 mm inter-electrode spacing center-to-center. Subdural grid and strip contacts were 4mm in diameter; consequently the distance between contacts was 6mm. A large number of brain areas was sampled, with coverage extending widely into regions that were subsequently determined to be non-epileptogenic. All 9 patients met the following strict selection criteria: (1) Left language lateralization as indicated by Wada testing; (2) Cognitive and language abilities in the average range, including language and reading ability, as indicated by formal neuropsychological testing (see Table S1); (3) Native English speaking; (4) Normal language organization as indicated by cortical stimulation mapping, when available; (5) Above 75% performance on the semantic judgment task; and (6) Electrode strips sampling from the left ventral occipito-temporal cortex. In addition, only electrode contacts outside the seizure onset zone and with normal interictal activity were included in the analysis. EEG activity was recorded at 400 Hz with a Nicolet 128 channel clinical amplifier (0.1 Hz to 200 Hz) or at 1000 Hz with a custom-design 256 channel recording system (0.1 Hz to 500 Hz). The precise localization of each electrode was computed by co-registering two T1-weighted MRIs, one obtained preoperatively and one on the day after implant surgery with the electrodes in place. A spatial optimization algorithm was used to integrate known information from the array geometry and intra-operative photos to achieve high spatial accuracy of the electrode locations in relation to the cortical MRI surface. Electrodes were visualized on the reconstructed pial surface from T1-weighted MRI scans using Freesurfer v4.1. For anatomical orientation, the Freesurfer generated cortical parcellations were overlaid onto the reconstructed surface (Figure 4D).

Data were analyzed in Matlab using Fieldtrip and custom routines. Statistical comparison across stimulus types used a nonparametric randomization test with temporal clustering. Phase locking value[23], as well as a single trial analogue (see Supplementary Methods) were calculated between responsive subdural electrode contacts.

Supplementary Material

1

Acknowledgments

We thank Anders Dale and Donald Hagler for analysis tools, and Mark Blumberg and Amy Trongnetrpunya for help with data collection. This research was supported by grants from NIH (NS18741) and FACES.

Footnotes

Competing financial interests: None

References

  • 1.Rosazza C, et al. Qualitatively different forms of pure alexia. Cogn Neuropsychol. 2007;24(4):393–418. doi: 10.1080/02643290701377877. [DOI] [PubMed] [Google Scholar]
  • 2.Patterson K, Kay J. Letter-by-letter reading: psychological descriptions of a neurological syndrome. Q J Exp Psychol A. 1982;34(Pt 3):411–41. doi: 10.1080/14640748208400852. [DOI] [PubMed] [Google Scholar]
  • 3.Grainger J, Jacobs AM. A dual read-out model of word context effects in letter perception: Further investigations of the word superiority effect. Journal of Experimental Psychology: Human Perception and Performance. 1994;20(6):1158–1176. [Google Scholar]
  • 4.Dehaene S, Cohen L. The unique role of the visual word form area in reading. Trends Cogn Sci. 2011;15(6):254–62. doi: 10.1016/j.tics.2011.04.003. [DOI] [PubMed] [Google Scholar]
  • 5.Price CJ, Devlin JT. The interactive account of ventral occipitotemporal contributions to reading. Trends Cogn Sci. 2011;15(6):246–53. doi: 10.1016/j.tics.2011.04.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Binder JR, et al. Tuning of the human left fusiform gyrus to sublexical orthographic structure. Neuroimage. 2006;33(2):739–48. doi: 10.1016/j.neuroimage.2006.06.053. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Glezer LS, Jiang X, Riesenhuber M. Evidence for highly selective neuronal tuning to whole words in the “visual word form area”. Neuron. 2009;62(2):199–204. doi: 10.1016/j.neuron.2009.03.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Vinckier F, et al. Hierarchical coding of letter strings in the ventral stream: dissecting the inner organization of the visual word-form system. Neuron. 2007;55(1):143–56. doi: 10.1016/j.neuron.2007.05.031. [DOI] [PubMed] [Google Scholar]
  • 9.Bentin S, et al. ERP manifestations of processing printed words at different psycholinguistic levels: time course and scalp distribution. J Cogn Neurosci. 1999;11(3):235–60. doi: 10.1162/089892999563373. [DOI] [PubMed] [Google Scholar]
  • 10.Tarkiainen A, et al. Dynamics of letter string perception in the human occipitotemporal cortex. Brain. 1999;122(Pt 11):2119–32. doi: 10.1093/brain/122.11.2119. [DOI] [PubMed] [Google Scholar]
  • 11.Leonard MK, et al. Spatiotemporal dynamics of bilingual word processing. Neuroimage. 2010;49(4):3286–94. doi: 10.1016/j.neuroimage.2009.12.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Allison T, et al. Human extrastriate visual cortex and the perception of faces, words, numbers, and colors. Cerebral Cortex. 1994;4:544–554. doi: 10.1093/cercor/4.5.544. [DOI] [PubMed] [Google Scholar]
  • 13.Gaillard R, et al. Direct intracranial, FMRI, and lesion evidence for the causal role of left inferotemporal cortex in reading. Neuron. 2006;50(2):191–204. doi: 10.1016/j.neuron.2006.03.031. [DOI] [PubMed] [Google Scholar]
  • 14.Mainy N, et al. Cortical dynamics of word recognition. Hum Brain Mapp. 2008;29(11):1215–30. doi: 10.1002/hbm.20457. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Appelbaum LG, et al. The temporal dynamics of implicit processing of non-letter, letter, and word-forms in the human visual cortex. Front Hum Neurosci. 2009;3:56. doi: 10.3389/neuro.09.056.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Tagamets MA, et al. A parametric approach to orthographic processing in the brain: an fMRI study. J Cogn Neurosci. 2000;12(2):281–97. doi: 10.1162/089892900562101. [DOI] [PubMed] [Google Scholar]
  • 17.Dale AM, et al. Dynamic statistical parametric mapping: combining fMRI and MEG for high-resolution imaging of cortical activity. Neuron. 2000;26(1):55–67. doi: 10.1016/s0896-6273(00)81138-1. [DOI] [PubMed] [Google Scholar]
  • 18.Dale AM, Halgren E. Spatiotemporal mapping of brain activity by integration of multiple imaging modalities. Curr Opin Neurobiol. 2001;11(2):202–8. doi: 10.1016/s0959-4388(00)00197-5. [DOI] [PubMed] [Google Scholar]
  • 19.Mechelli A, et al. Differential effects of word length and visual contrast in the fusiform and lingual gyri during reading. Proc Biol Sci. 2000;267(1455):1909–13. doi: 10.1098/rspb.2000.1229. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Schurz M, et al. A dual-route perspective on brain activation in response to visual words: evidence for a length by lexicality interaction in the visual word form area (VWFA) Neuroimage. 2010;49(3):2649–61. doi: 10.1016/j.neuroimage.2009.10.082. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Bruno JL, et al. Sensitivity to orthographic familiarity in the occipito-temporal region. Neuroimage. 2008;39(4):1988–2001. doi: 10.1016/j.neuroimage.2007.10.044. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Mukamel R, et al. Coupling between neuronal firing, field potentials, and FMRI in human auditory cortex. Science. 2005;309(5736):951–4. doi: 10.1126/science.1110913. [DOI] [PubMed] [Google Scholar]
  • 23.Lachaux JP, et al. Measuring phase synchrony in brain signals. Hum Brain Mapp. 1999;8(4):194–208. doi: 10.1002/(SICI)1097-0193(1999)8:4&#x0003c;194::AID-HBM4&#x0003e;3.0.CO;2-C. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Srinivasan R, Nunez P, Silberstein R. Spatial filtering and neocortical dynamics: estimates of EEG coherence. IEEE Trans Biomed Eng. 1998;45(7):814–826. doi: 10.1109/10.686789. [DOI] [PubMed] [Google Scholar]
  • 25.Koyama MS, et al. Reading networks at rest. Cereb Cortex. 2010;20(11):2549–59. doi: 10.1093/cercor/bhq005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Vogel AC, et al. The putative visual word form area is functionally connected to the dorsal attention network. Cereb Cortex. 2011 doi: 10.1093/cercor/bhr100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Price CJ, et al. Brain. 1996;119:919–931. doi: 10.1093/brain/119.3.919. [DOI] [PubMed] [Google Scholar]
  • 28.Ben-Shachar M, et al. Differential sensitivity to words and shapes in ventral occipito-temporal cortex. Cereb Cortex. 2007;17(7):1604–11. doi: 10.1093/cercor/bhl071. [DOI] [PubMed] [Google Scholar]
  • 29.Brem S, et al. Evidence for developmental changes in the visual word processing network beyond adolescence. Neuroimage. 2006;29(3):822–37. doi: 10.1016/j.neuroimage.2005.09.023. [DOI] [PubMed] [Google Scholar]
  • 30.Vartiainen J, et al. Functional magnetic resonance imaging blood oxygenation level-dependent signal and magnetoencephalography evoked responses yield different neural functionality in reading. J Neurosci. 2011;31(3):1048–58. doi: 10.1523/JNEUROSCI.3113-10.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Flowers DL, et al. Attention to single letters activates left extrastriate cortex. Neuroimage. 2004;21(3):829–39. doi: 10.1016/j.neuroimage.2003.10.002. [DOI] [PubMed] [Google Scholar]
  • 32.Libertus ME, Brannon EM, Pelphrey KA. Developmental changes in category-specific brain responses to numbers and letters in a working memory task. Neuroimage. 2009;44(4):1404–14. doi: 10.1016/j.neuroimage.2008.10.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.van der Mark S, et al. Children with dyslexia lack multiple specializations along the visual word-form (VWF) system. Neuroimage. 2009;47(4):1940–9. doi: 10.1016/j.neuroimage.2009.05.021. [DOI] [PubMed] [Google Scholar]
  • 34.Marinkovic K, et al. Spatiotemporal dynamics of modality-specific and supramodal word processing. Neuron. 2003;38(3):487–97. doi: 10.1016/s0896-6273(03)00197-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Solomyak O, Marantz A. Evidence for early morphological decomposition in visual word recognition. J Cogn Neurosci. 2010;22(9):2042–57. doi: 10.1162/jocn.2009.21296. [DOI] [PubMed] [Google Scholar]
  • 36.Sahin NT, et al. Sequential processing of lexical, grammatical, and phonological information within Broca’s area. Science. 2009;326(5951):445–9. doi: 10.1126/science.1174481. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Halgren E, et al. Processing stages underlying word recognition in the anteroventral temporal lobe. Neuroimage. 2006;30(4):1401–13. doi: 10.1016/j.neuroimage.2005.10.053. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Chan AM, et al. First-pass selectivity for semantic categories in human anteroventral temporal lobe. J Neurosci. 2011;31(49):18119–29. doi: 10.1523/JNEUROSCI.3122-11.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Grill-Spector K, et al. Differential processing of objects under various viewing conditions in the human lateral occipital complex. Neuron. 1999;24(1):187–203. doi: 10.1016/s0896-6273(00)80832-6. [DOI] [PubMed] [Google Scholar]
  • 40.Klopp J, et al. Early widespread cortical distribution of coherent fusiform face activity. Hum Brain Mapp. 2000;11(4):286–293. doi: 10.1002/1097-0193(200012)11:4&#x0003c;286::AID-HBM80&#x0003e;3.0.CO;2-R. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Halgren E, et al. Spatio-temporal stages in face and word processing. 1. Depth-recorded potentials in the human occipital, temporal and parietal lobes. Journal of Physiology (Paris) 1994;88:1–50. doi: 10.1016/0928-4257(94)90092-2. [DOI] [PubMed] [Google Scholar]
  • 42.Halgren E, et al. Spatio-temporal stages in face and word processing. 2. Depth-recorded potentials in the human frontal and Rolandic cortices. Journal of Physiology (Paris) 1994;88:51–80. doi: 10.1016/0928-4257(94)90093-0. [DOI] [PubMed] [Google Scholar]
  • 43.Twomey T, et al. Top-down modulation of ventral occipito-temporal responses during visual word recognition. Neuroimage. 2011;55(3):1242–51. doi: 10.1016/j.neuroimage.2011.01.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Francis WN, Kucera H. Frequency Analysis of English Usage: Lexicon and Grammar. Boston: Houghton Mifflin; 1982. [Google Scholar]
  • 45.McDonald CR, et al. Multimodal imaging of repetition priming: Using fMRI, MEG, and intracranial EEG to reveal spatiotemporal profiles of word processing. Neuroimage. 2010;53(2):707–17. doi: 10.1016/j.neuroimage.2010.06.069. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Oldfield RC. Ambidexterity in surgeons. Lancet. 1971;1(7700):655. doi: 10.1016/s0140-6736(71)91594-7. [DOI] [PubMed] [Google Scholar]
  • 47.Jenkinson M, et al. Improved optimization for the robust and accurate linear registration and motion correction of brain images. NeuroImage. 2002;17(2):825–41. doi: 10.1016/s1053-8119(02)91132-8. [DOI] [PubMed] [Google Scholar]
  • 48.Woolrich MW, et al. Temporal autocorrelation in univariate linear modeling of FMRI data. NeuroImage. 2001;14(6):1370–86. doi: 10.1006/nimg.2001.0931. [DOI] [PubMed] [Google Scholar]
  • 49.Jenkinson M, Smith S. A global optimisation method for robust affine registration of brain images. Med Image Anal. 2001;5(2):143–56. doi: 10.1016/s1361-8415(01)00036-6. [DOI] [PubMed] [Google Scholar]
  • 50.Hauk O, et al. The time course of visual word recognition as revealed by linear regression analysis of ERP data. Neuroimage. 2006;30(4):1383–400. doi: 10.1016/j.neuroimage.2005.11.048. [DOI] [PubMed] [Google Scholar]
  • 51.Sereno SC, Rayner K, Posner MI. Establishing a time-line of word recognition: evidence from eye movements and event-related potentials. Neuroreport. 1998;9(10):2195–200. doi: 10.1097/00001756-199807130-00009. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1

RESOURCES