Repetition suppression in occipitotemporal cortex despite negligible visual similarity: Evidence for postperceptual processing?

Aidan J Horner; Richard N Henson

doi:10.1002/hbm.21124

. 2010 Sep 2;32(10):1519–1534. doi: 10.1002/hbm.21124

Repetition suppression in occipitotemporal cortex despite negligible visual similarity: Evidence for postperceptual processing?

Aidan J Horner ¹, Richard N Henson ^1,^✉

PMCID: PMC6870074 PMID: 20814963

Abstract

The reduced neural response in certain brain regions when a task‐relevant stimulus is repeated (“repetition suppression”, RS) is often attributed to facilitation of the cognitive processes performed in those regions. Repetition of visual objects is associated with RS in the ventral and lateral occipital/temporal regions, and is typically attributed to facilitation of visual processes, ranging from the extraction of shape to the perceptual identification of objects. In two fMRI experiments using a semantic classification task, we found RS in a left lateral occipital/inferior temporal region to a picture of an object when the name of that object had previously been presented in a separate session. In other words, we found RS despite negligible visual similarity between the initial and repeated occurrences of an object identity. There was no evidence that this RS was driven by the learning of task‐specific responses to an object identity (“S‐R learning”). We consider several explanations of this occipitotemporal RS, such as phonological retrieval, semantic retrieval, and visual imagery. Although no explanation if fully satisfactory, it is proposed that such effects most plausibly relate to the extraction of task‐relevant information relating to object size, either through the extraction of sensory‐specific semantic information or through visual imagery processes. Our findings serve to emphasize the potential complexity of processing within traditionally visual regions, at least as measured by fMRI. Hum Brain Mapp, 2010. © 2010 Wiley‐Liss, Inc.

Keywords: priming, object recognition, implicit memory, conceptual

INTRODUCTION

Repetition of a stimulus in a given task often results in a decrease in neural activity within certain cortical regions, a phenomenon known as repetition suppression (RS) [Grill‐Spector et al., 2006]. When using fMRI while people categorise familiar visual objects, for example, RS is normally found in higher‐order visual regions within the ventral processing stream [e.g., Koutstaal et al., 2001]. RS may represent a fundamental form of stimulus‐specific neural plasticity, reflecting more efficient neural processing. This neural facilitation may also contribute to analogous behavioral phenomena, such as repetition priming [e.g., faster reaction times to make a categorization; see Henson, 2003; Schacter and Buckner, 1998].

Previous research demonstrating RS within ventral stream regions following repetition of familiar visual objects has normally attributed such effects to facilitation of perceptual processes. These perceptual processes are seen as distinct from the RS often seen in more anterior regions, such as inferior prefrontal regions, which is normally attributed to facilitation of phonological, lexical, and/or semantic processing [Poldrack et al., 1999; Wagner et al., 1997; Wagner et al., 2000]. More specifically, lateral regions of the occipital cortex including the occipitotemporal sulcus, as well as posterior regions of the fusiform gyrus—corresponding to the posterior and anterior portions respectively of what has been called the “Lateral Occipital Complex” (LOC) [Malach et al., 1995]—have been associated with relatively low‐level perceptual processes involved in the extraction of object shape. This is based on findings that RS (or “adaptation” to multiple stimulus repetitions) generalizes across manipulations that maintain object shape, such as changes in retinotopic location and stimulus size [Grill‐Spector et al., 1999; Grill‐Spector and Malach, 2001] and stimulus format [e.g., from line‐drawings to grayscale photographs; Kourtzi and Kanwisher, 2000, 2001] as well as across mirror reflections [Eger et al., 2004]. Conversely, manipulations that disrupt object shape, such as changes in object viewpoint [Andresen et al., 2009; Ewbank et al., 2005; Grill‐Spector et al., 1999; Grill‐Spector and Malach, 2001; though see James et al., 2002] and occlusion of object parts [Hayworth and Biederman, 2006], have been shown to disrupt RS in these regions.

There has also been a suggestion that RS in posterior fusiform cortex—corresponding to the anterior portion of the LOC—demonstrates greater resilience to such changes, consistent with a posterior‐anterior gradient with respect to representational abstraction [Grill‐Spector et al., 1999; see Grill‐Spector and Malach, 2004 for a review of RS in the functionally defined LOC].1 Extending such findings, Koutstaal et al. [ 2001] showed that RS in a left mid‐fusiform region generalised over different exemplars of an object with the same name (e.g., pictures of different umbrellas). However, while Vuilleumier et al. [ 2002] found that RS in this region generalized over different viewpoints of an object, they found no evidence that it generalised over different exemplars, unlike Koutstaal et al. [ 2001]. This lack of generalization of fusifom RS was also unlike the RS that Vuilleumier et al. [ 2002] found in left inferior prefrontal cortex, which did generalize across different exemplars with the same name. On the basis of these findings, Vuilleumier et al. [ 2002] argued that the generalisation of RS found by Koutstaal et al. [ 2001] reflected visual similarity between the exemplars, rather than an abstract representation of an object identity. This claim was later supported by Chouinard et al. [ 2008], who failed to find occipitotemporal RS when controlling for visual similarity between exemplars.

Other studies however have continued to implicate mid‐fusiform regions—particularly in the left hemisphere—with more abstract processing. For example, Simons et al. [ 2003] reported a left fusiform region that showed RS to object pictures that were immediately preceded, and accompanied, by auditory presentation of the name of that object. This implicates this region in lexical or semantic processing. Furthermore, Wheatley et al. [ 2005] and Gold et al. [ 2006] even found RS in left fusiform cortex for semantically related versus unrelated word pairs, implicating this region in semantic processing.

Interpretation of these fMRI RS effects in the ventral visual stream is further complicated by recent evidence that RS may also reflect the “by‐passing” of processing in such regions, owing to the direct retrieval of task‐relevant responses previously associated with a stimulus [Dobbins et al., 2004; Horner and Henson, 2008; Race, Shanker and Wagner, 2009]. The idea behind such stimulus‐response (S‐R) learning is that the response made on the initial presentation of a stimulus becomes bound to that stimulus, such that when the stimulus is repeated, the response can be retrieved quickly, without needing to repeat any detailed perceptual or semantic analysis of the stimulus. Thus the RS observed in the object categorization tasks used by many of the above fMRI studies may not reflect more efficient perceptual processing per se, but rather substantial attenuation (or even abolishment) of such processing. S‐R learning has long been known to exert strong effects on the behavioral priming that is found in speeded categorization tasks [Horner and Henson, 2009; Logan, 1990; Schnyer et al., 2006]. More recently, behavioral effects of S‐R learning have been shown to generalize across different object exemplars [Denkinger and Koutstaal, 2009; though see Schnyer et al., 2007]. Thus it is possible that at least some of the RS effects reviewed above that were used to argue for different levels of object representation in the ventral visual stream actually reflect relatively abstract S‐R learning, rather than facilitation of processes normally involved in visual object recognition.

Dobbins et al. [ 2004] suggested one way of testing for such S‐R learning: by reversing the categorizations between initial and repeated stimulus presentations. On initial presentation, participants were asked “is the object bigger than a shoebox?”. When the same decision was used for repeated objects, RS was found in regions including the fusiform cortex. When the decision was reversed however (i.e., “is the object smaller than a shoebox?”), RS was no longer significant in fusiform cortex. Although other studies have since found RS in fusiform regions despite such task reversal [Horner and Henson, 2008; Race et al., 2009], the comparison between “Same” and “Reverse” task conditions still offers a way to test for S‐R learning, given the large effect it produces on behavioral priming (see Denkinger and Koutstaal, 2009; and Horner and Henson, 2009 for further discussion). We therefore used this Same/Reverse task manipulation in the present experiment.

Given the controversy regarding the degree of abstraction of object processing in ventral visual stream regions, based on evidence from RS paradigms, we asked the following question: can we see RS to visual objects within occipital/temporal regions when repeating an object identity despite negligible visual similarity between its initial and repeated presentation? We tested this by examining the generalization of RS from words (object names) to pictures (of the same objects). The basic study‐test design is shown in Figure 1. Color pictures of everyday nameable objects were always used at Test, one half of which (Repeated condition) depicted objects that were previously encountered at Study, and the other half of which depicted objects not encountered at study (Novel condition). RS was defined as the reduction in mean event‐related fMRI response to pictures in the Repeated relative to Novel condition in the Test phase. In Experiment 1 (shaded rows of Fig. 1), we only used a word‐picture condition; in Experiment 2 we added a picture‐picture condition as well, in order to compare RS from words to that from pictures (i.e., negligible versus full visual similarity).

To test whether RS was affected by S‐R learning, we added an orthogonal manipulation of using either the same task at Study and Test, or reversing the task between Study and Test. We chose to use the “bigger‐than‐shoebox” size‐judgment task because it has been used in numerous behavioral [Horner and Henson, 2009; Schnyer et al., 1996, 2007] and fMRI [Dobbins et al., 2004; Horner and Henson, 2008; Koutstaal et al., 2001; Race et al., 2009; Simons et al., 2003] studies. Although S‐R learning might not be expected when the visual form of the stimulus (S) changes so dramatically (i.e., from a word to a picture), it is possible that a response (R) can become bound to a relatively abstract (amodal) representation of an object identity. This would at least mirror our prior behavioral evidence that the response representations in S‐R learning can be quite abstract (to the level of a particular semantic label, e.g., “bigger” or “smaller”, independently of the yes/no decision or specific motor action, Horner and Henson, 2009). In any case, if the amount of RS in occipitotemporal regions in the word‐picture condition was unaffected by a task reversal, then it is unlikely to reflect S‐R learning, and more likely to reflect facilitation of some post‐perceptual processing of visual objects.

Experiment 1

Experiment 1 was designed to assess whether significant RS could be seen within occipital/temporal regions once we controlled for visual similarity between Study and Test. At Study, word stimuli were presented (e.g., the word “lion” was presented) with participants performing the “bigger‐than‐shoebox” task. At Test, the same identities (object referents) seen at Study (along with novel items) were presented as pictures rather than as words (e.g., a picture of a “lion”), with participants either performing the “bigger‐than‐shoebox” or “smaller‐than‐shoebox” task (see Fig. 1). Thus, item identity was repeated between Study and Test; however, there was no visual similarity between repetitions. These manipulations resulted in a 2 × 2 factorial design crossing the factors Repetition (Novel, Repeated) and Task (Same, Reverse).

MATERIALS AND METHODS

Participants

Participants in both experiments were recruited from the MRC‐CBU subject panel, or from the student population of Cambridge University. All participants had normal or corrected to normal vision and were right‐handed by self‐report. Both experiments were of the type approved by a local research ethics committee (LREC reference 05/Q0108/401).

Eighteen participants (eight male) gave informed consent to participate in Experiment 1. The mean age across participants was 23.1 years (SD = 2.1). Participants were the same as those reported in Experiment 2 of Horner and Henson [ 2008].

Materials

Stimuli were 160 colored images of everyday objects (and their name equivalents), taken from a set used by Dobbins et al. [ 2004]. They were selected so that 50% were bigger than a shoebox and 50% were smaller than a shoebox, according to norms from independent raters [Horner and Henson, 2009]. Each stimulus was randomly assigned to one of 4 groups relating to the four experimental conditions, with each group containing equal numbers of each stimulus classification, resulting in 40 stimuli per group. The assignment of groups to experimental condition was rotated across participants. The scrambled stimuli used during Study blocks (see Procedure) were created from the same set of objects by randomly redistributing the pixels so that a coherent object was no longer visible. None of the stimuli used in the present experiment were presented in the remainder of the scanning visit (i.e., they were not seen in Horner and Henson, 2008—Experiment 2).

Procedure

Experiment 1 was conducted at the end of the same scanning visit as Experiment 2 of Horner and Henson [ 2008]. The experiment consisted of two study‐test cycles, with each cycle lasting ∼ 10 min. During each Study phase 80 stimuli were shown. Forty stimulus identities were presented once as visual words (e.g., the word “lion” was presented rather than a picture of a lion). A further 40 scrambled images (see Materials) were presented once. Words and scrambled images were grouped into mini‐blocks of five stimuli, with each mini‐block lasting 15 s. During word presentation mini‐blocks, participants were required to respond to whether the stimulus was “bigger than a shoebox,” where the comparison referred to the object's size in real life. During scrambled image mini‐blocks, participants were instructed to alternate between right and left key‐presses at stimulus onset. During each Test phase, the 40 stimuli from the Study phase were randomly intermixed with 40 novel stimuli. Crucially, object pictures were presented at Test (e.g., a picture of a lion rather than the word “lion”) such that there was no visual overlap between stimuli seen at Study and Test. Participants were either asked the same question to that at Study (e.g., “is the object bigger than a shoebox?” ‐ the Same condition) or the opposite question (e.g., “is the object smaller than a shoebox?”—the Reverse condition). The order of the two test conditions (tasks) was counterbalanced across participants.

Each trial sequence began with a centrally placed fixation cross presented for 500 ms, followed by a stimulus for 2,000 ms, in turn followed by a blank screen for 500 ms. Images subtended ∼6^° of visual angle. Words were presented in black on a white background with the same pixel dimensions as the object picture stimuli. Participants were able to respond at any point up to the start of a new trial (i.e., the presentation of another fixation cross). Participants responded using a “yes” or “no” key with their right or left index finger, respectively. Prior to entering the scanner, participants were asked to perform a practice session using the “bigger‐than‐shoebox” task.

Behavioral Analyses

Trials in which RTs were less than 400 ms, or two or more standard deviations above or below a participant's mean for a given block (i.e., a separate Study or Test phase), were excluded. Subsequent to this exclusion, accuracy was based on prior norms [Horner and Henson, 2009]. For the RT analyses, trials at Test were further excluded if objects were given an incorrect response at Study. Repetition priming was then calculated as the difference in mean RTs between Novel and Repeated stimuli. All statistical tests had alpha set at 0.05, and a Greenhouse‐Geisser correction was applied to all F‐values with more than one degree of freedom in the numerator. T‐tests were two‐tailed, except where stated otherwise.

fMRI Acquisition

Thirty‐two T2*‐weighted transverse slices (64 × 64 3 mm × 3 mm pixels, TE = 30 ms, flip‐angle = 78^°) per volume were taken using Echo‐Planar Imaging (EPI) on a 3T TIM Trio system (Siemens, Erlangen, Germany). Slices were 3‐mm thick with a 0.75‐mm‐gap, tilted ∼30^° upward at the front to minimize eye‐ghosting, and acquired in descending order. Four sessions of 130 volumes were acquired, with a repetition time (TR) of 2,000 ms. The first five volumes of each session were discarded to allow for equilibrium effects. A T1‐weighted structural volume was also acquired for each participant with 1mmx1mmx1 mm voxels using MPRAGE and GRAPPA parallel imaging (flip‐angle = 9^°; TE = 2.00 s; acceleration factor = 2).

fMRI Analysis

Data were analyzed using Statistical Parametric Mapping (SPM5, http://www.fil.ion.ucl.ac.uk/spm5.html). Preprocessing of image volumes included spatial realignment to correct for movement, followed by spatial normalization to Talairach space, using the linear and nonlinear normalization parameters estimated from warping each participant's structural image to a T1‐weighted average template image from the Montreal Neurological Institute (MNI). These re‐sampled images (voxel size 3 × 3 × 3 mm³) were smoothed spatially by an 8 mm FWHM Gaussian kernel (final smoothness ∼11 × 11 × 11 mm³).

Statistical analysis was performed in a two‐stage approximation to a Mixed Effects model. In the first stage, neural activity was modeled by a delta function at stimulus onset. The BOLD response was modeled by a convolution of these delta functions by a canonical Haemodynamic Response Function (HRF). The resulting time‐courses were down‐sampled at the midpoint of each scan to form regressors in a General Linear Model.

For each Test session (Task), five separate regressors were modeled—the two experimental conditions (Novel, Repeated) were split according to the particular key‐press given (left/right), plus an additional regressor for discarded trials (using the behavioral exclusion criteria outlined earlier). To account for (linear) residual artifacts after realignment, the model also included six further regressors representing the movement parameters estimated during realignment. Voxel‐wise parameter estimates for these regressors were obtained by Restricted Maximum‐Likelihood (ReML) estimation, using a temporal high‐pass filter (cut‐off 128 s) to remove low‐frequency drifts, and modeling temporal autocorrelation across scans with an AR(1) process.

Images of contrasts of the resulting parameter estimates (collapsed across left/right key‐press) comprised the data for a second‐stage model, which treated participants as a random effect. In addition to the 18 subject effects, this model had four condition effects, corresponding to a 2 × 2 (Task × Repetition) repeated‐measures ANOVA. Within this model, Statistical Parametric Maps (SPMs) were created of the T or F‐statistic for the various ANOVA effects of interest, using a single pooled error estimate for all contrasts, whose nonsphericity was estimated using ReML as described in Friston et al. [ 2002]. Unless otherwise stated, all SPMs were height‐thresholded at the voxel‐level at P < 0.05, corrected for multiple comparisons using Random Field Theory, either across the whole‐brain or within regions of interest (ROIs) defined by contrasts from independent data. Stereotactic coordinates of the maxima within the thresholded SPMs correspond to the MNI template.

RESULTS

Behavioral Results

After excluding 5.6% of trials with outlying RTs, the percentages of errors are shown in Table I. A 2 × 2 (Task × Repetition) repeated‐measures ANOVA on errors revealed no significant main effects or interactions (F's < 1.9, Ps > 0.19). A further 3.8% of Repeated trials were excluded from RT analysis due to incorrect responses given at Study (see Methods). Table I displays mean RTs, while Figure 2A shows priming (Novel‐Repeated) of RTs across all conditions. Priming was reliable in the Same condition, t(17) = 1.86, P < 0.05, but did not reach significance in the Reverse condition, t(17) = 0.48, P = 0.32 (one‐tailed). A 2 × 2 (Task × Repetition) ANOVA however showed no evidence of an interaction between the Same/Reverse conditions and priming (F(1, 17) = 0.41, P = 0.53; cf. Experiment 2).II

Table I.

Mean percentage errors and RTs (plus standard deviations) across Task and Repetition for Experiment 1 and Stimulus‐modality, Task, and Repetition for Experiment 2

Stimulus‐type		Picture‐Picture		Word‐Picture
Task		Same	Reverse	Same	Reverse
Errors
Experiment 1	Novel	Picture‐Picture condition not included in Experiment 1		13.5 (5.4)	16.7 (5.4)
	Repeated	Picture‐Picture condition not included in Experiment 1		13.6 (6.4)	14.4 (4.7)
Experiment 2	Novel	12.1 (5.6)	15.3 (5.7)	11.8 (6.1)	15.7 (6.1)
	Repeated	12.2 (4.7)	12.8 (5.4)	12.8 (5.8)	13.1 (5.4)
RTs
Experiment 1	Novel	Picture‐Picture condition not included in Experiment 1		859 (210)	791 (130)
	Repeated	Picture‐Picture condition not included in Experiment 1		827 (200)	780 (134)
Experiment 2	Novel	894 (127)	1005 (146)	905 (147)	990 (129)
	Repeated	772 (103)	921 (133)	847 (132)	976 (144)

Open in a new tab

Results from Experiment 2 are collapsed across Prime‐level for clarity. Note that this factor did not interact significantly with Stimulus‐modality or Task so is of little theoretical interest.

Behavioral priming across Task (Same vs. Reverse) in Experiment 1 (A) and across Task (Same vs. Reverse) and Stimulus‐type (Picture‐Picture vs. Word‐Picture) in Experiment 2 (B). Error bars represent one‐tailed 95% confidence intervals. ***P < 0.001.

Table II.

Mean percentage signal change (and standard deviations) within left LO‐IT (‐51, ‐66, 0) across task and repetition for experiment 1 and stimulus‐type, task and repetition for experiment 2

Stimulus‐type		Picture‐Picture		Word‐Picture
Task		Same	Reverse	Same	Reverse
Experiment 1	Novel	Picture‐Picture condition not included in Experiment 1		−0.18 (0.66)	−0.09 (0.34)
	Repeated	Picture‐Picture condition not included in Experiment 1		−0.42 (0.75)	−0.18 (0.33)
Experiment 2	Novel	0.13 (0.43)	0.11 (0.45)	0.10 (‐0.06)	0.13 (0.39)
	Repeated	−0.05 (0.42)	−0.06 (0.42)	0.05 (0.39)	0.08 (0.46)

Open in a new tab

Results from Experiment 2 are collapsed across Prime‐level for clarity. Percent signal change refers to the peak of the fitted BOLD impulse response, and is relative to the grand mean over all voxels and scans. Note that the baseline level of 0 was not estimated reliably in this design, so only relative patterns across conditions are meaningful.

Finally, to check whether priming differed as a function of Test block or Task order, we conducted a 2 × 2 (Block × Order) mixed ANOVA, where the within‐subject factor Block refers to the Test block 1 or 2 (regardless of Task) and the between‐subject factor Order refers to the task order (i.e., Same‐Reverse or Reverse‐Same). This 2 × 2 ANOVA failed to reveal any main effects of Block or Order, Fs < 0.39, Ps > 0.54, suggesting priming did not vary as a function of block or task order.

fMRI Results

We first sought evidence for significant RS (i.e., a Novel—Repeated one‐tailed T‐contrast). Our initial whole‐brain analysis revealed no significant effects. Given we were interested in regions previously shown to demonstrate significant RS in visual object repetition paradigms, when the same object is shown at Study and Test, we next constrained our search using the “main effect” of RS (i.e., the corrected‐thresholded map for the RS T‐constrast) from Experiment 2 of Horner and Henson [ 2008], which contained 1,980 voxels. This T‐contrast was derived from independent data taken from the same participants in the same scanning visit as the present experiment; our voxel selection for this small‐volume correction (SVC) is therefore not biased in favor of finding a significant RS effect. This T‐contrast map covered bilateral occipital/temporal cortex, including lateral occipital and fusiform cortex, as well as distinct clusters in the left inferior prefrontal gyrus. Two regions survived SVC: (1) a region in the left inferior frontal gyrus—pars opercularis—henceforth referred to as the posterior prefrontal cortex (pPFC) (−48, +3, +24) and (2) a region in the left hemisphere on the lateral surface starting in the middle occipital gyrus and descending into the posterior inferior temporal gyrus—henceforth referred to as lateral occipital/inferior temporal cortex (LO‐IT) (−51, −66, 0) (see Fig. 3A). RS in the LO‐IT region was numerically greater in the Same than Reverse condition (Fig. 3B; also see Table II for mean percentage signal change across all conditions), reflected by a trend for an interaction F(1, 17) = 4.04, P = 0.06. Nonetheless, residual RS still appeared reliable in the Reverse condition, suggesting that this RS was not dependent on S‐R retieval (error bars in Fig. 3B reflect 95% confidence intervals, though note that these simple effects of repetition are biased by the prior selection of the region to show a main effect of repetition). Assessing Test block and Task order (see behavioral analysis), a 2 × 2 (Block × Order) mixed ANOVA on RS showed no main effects of Block or Order, Fs < 3.0, Ps > 0.10, suggesting that RS, like behavioral priming, was unaffected by test block or task order.

A: Voxels demonstrating significant repetition suppression (RS) in the Word‐Picture condition in Experiment 1 across sagittal, coronal and axial slices; P < 0.05 small‐volume corrected. RS in left LO‐IT (−51, −66, 0) across Task (Same vs. Reverse) in Experiment 1 (B) and across Task (Same vs. Reverse) and Stimulus‐type (Picture‐Picture vs. Word‐Picture) in Experiment 2 (C). Error bars represent one‐tailed 95% confidence intervals. ***P < 0.001, *P < 0.05.

A whole‐brain search for the interaction between Repetition and Task, to further investigate possible effects of S‐R learning, did not reveal any regions that survived either whole‐brain correction, or small‐volume correction for the main effect of RS. Lastly, searching for regions showing significantly greater activation for repeated than novel items (i.e., repetition enhancement) revealed several clusters that survived whole‐brain correction (see Supporting Information Table I). Given our present focus on RS within posterior occipitotemporal regions however we do not discuss these results further (see Horner and Henson, 2008 for a discussion of this issue).

DISCUSSION

Experiment 1 demonstrated that significant RS can be seen in prefrontal and occipital/temporal regions despite negligible visual similarity between Study and Test stimuli, and without apparent contributions from S‐R learning. One possibility is that RS within these regions can reflect facilitation of “higher‐level” processes, such as phonological or lexical processes (associated with naming the objects) or possibly semantic processing associated with the conceptual task (see General Discussion).

Our pPFC results support previous research suggesting that RS within left inferior PFC regions reflects improved phonological and/or semantic processing [Poldrack et al., 1999; Wagner et al., 1997]. Indeed, given RS in the present experiment was confined to more dorsal regions of the inferior frontal gyrus—pars opercularis—it is likely our results reflect repetition of phonological processes [Poldrack et al., 1999]. Furthermore, the lack of reliable difference between our Same and Reverse tasks (see also Experiment 2 and Fig. 4A) suggests that PFC RS is not necessarily always related to S‐R learning [Dobbins et al., 2004]. One possibility is that, while S‐R learning effects might generalize across visually‐similar pictures of different exemplars of an object [Denkinger and Koutstaal, 2009], they do not generalize across visually dissimilar stimuli [Schnyer et al., 2007], such as between words and pictures, as in the present study. We return to this issue in Experiment 2.

Repetition suppression (RS) across Task (Same vs. Reverse) and Stimulus‐type (Picture‐Picture vs. Word‐Picture) in Experiment 2 in left posterior prefrontal (pPFC) cortex (A), right lateral occipital/inferior temporal (LO‐IT) cortex (B) and left and right fusiform cortex (C and D, respectively). Error bars represent one‐tailed 95% confidence intervals. ***P < 0.001, **P < 0.01, *P < 0.05.

The significant RS within left LO‐IT however was a surprise. Some generalization of RS across stimuli has been found previously in the ventral visual pathway, such as across view‐point and/or exemplars of objects [e.g., Koutstaal et al., 2001; Vuilleumier et al., 2002], from names to objects like here [though when the name immediately preceded and was concurrent with the object; Simons et al., 2003] and even for semantically related vs. unrelated words [Wheatley et al., 2005]. However this generalization has been found in more anterior (left) mid‐fusiform regions. RS in more posterior and lateral regions of the occipital cortex, like the LO‐IT region here, has tended to be highly sensitive to changes in object view‐point [Andresen et al., 2009; Ewbank et al., 2005], suggesting that these regions support relatively low‐level shape processing. We return to this point in the General Discussion.

Finally, we found the RS effect only within left and not right LO‐IT. This tendency for left‐lateralization has been reported previously [Koutstaal et al., 2001; Simons et al., 2003], but more often in fusiform cortex. This finding is consistent with the hypothesis that the left hemisphere processes more abstract visual object representations than does the right hemisphere [e.g., Burgund and Marsolek, 2000; Marsolek, 1999], though it is also consistent with possible linguistic causes of our RS (such as naming), which are known to be left‐hemisphere dominant. Note, however that finding a simple effect in the left but not right hemisphere is not sufficient to conclude a difference in laterality. To test for such an effect, RS within homologous regions needs to be contrasted statistically, where those regions are selected in an unbiased manner. We do this in Experiment 2.

Experiment 2

In Experiment 1, we presented only words at study. In Experiment 2, we compared RS from either pictures or words at Study (maintaining only pictures at Test) (see Fig. 1). We could therefore attempt to replicate our surprising RS from words to pictures (the Word‐Picture condition) in LO‐IT, and furthermore compare the size of this RS with that obtained when repeating pictures (i.e., with perceptual as well as conceptual overlap between Study and Test; the Picture‐Picture condition). The presentation of both word and picture stimuli at Study also allowed us to evaluate overall activation levels for each type of stimulus within the region that demonstrated significant word‐to‐picture RS in Experiment 1. For example, does the left LO‐IT region seen in Experiment 1 show greater activation for word than picture stimuli?

Given we were primarily interested in whether the significant RS seen in Experiment 1 was replicable, we used the peak RS co‐ordinates from Experiment 1 (i.e., from an independent data set) in an ROI analysis to assess RS in the Word‐Picture and Picture‐Picture condition. This unbiased ROI selection also allowed us to test for a laterality effect given the RS effect in Experiment 1 was only seen in the left hemisphere. To test for concurrent signs of S‐R learning, we again included a Task manipulation (Same vs. Reverse judgment), as well as adding a manipulation of “Prime‐level”, whereby stimuli at Study were either seen once (Low‐primed) or three times (High‐primed), which has previously been demonstrated to modulate the effects of S‐R learning on behavioral priming [Horner and Henson, 2009]. This resulted in a 2 × 2 × 2 × 2 pseudofactorial design with factors Stimulus‐type (Picture‐Picture, Word‐Picture), Task (Same, Reverse), Repetition (Repeated, Novel), and Prime‐level (Low‐primed, High‐primed); where Novel items were randomly assigned to each Stimulus‐type and Prime‐level.