Skip to main content
The Journal of Neuroscience logoLink to The Journal of Neuroscience
. 1998 Jun 1;18(11):4244–4254. doi: 10.1523/JNEUROSCI.18-11-04244.1998

Induced γ-Band Activity during the Delay of a Visual Short-Term Memory Task in Humans

Catherine Tallon-Baudry 1, Olivier Bertrand 1, Franck Peronnet 1, Jacques Pernier 1
PMCID: PMC6792803  PMID: 9592102

Abstract

It has been hypothesized that visual objects could be represented in the brain by a distributed cell assembly synchronized on an oscillatory mode in the γ-band (20–80 Hz). If this hypothesis is correct, then oscillatory γ-band activity should appear in any task requiring the activation of an object representation, and in particular when an object representation is held active in short-term memory: sustained γ-band activity is thus expected during the delay of a delayed-matching-to-sample task. EEG was recorded while subjects performed such a task. Induced (e.g., appearing with a jitter in latency from one trial to the next) γ-band activity was observed during the delay. In a control task, in which no memorization was required, this activity disappeared. Furthermore, this γ-band activity during the rehearsal of the first stimulus representation in short-term memory peaked at both occipitotemporal and frontal electrodes. This topography fits with the idea of a synchronized cortical network centered on prefrontal and ventral visual areas. Activities in the α band, in the 15–20 Hz band, and in the averaged evoked potential were also analyzed. The γ-band activity during the delay can be distinguished from all of these other components of the response, on the basis of either its variations or its topography. It thus seems to be a specific functional component of the response that could correspond to the rehearsal of an object representation in short-term memory.

Keywords: γ-band, 40 Hz, oscillations, cell assembly, visual short-term memory, delayed-matching-to-sample task, vision, human, EEG


How is a particular visual object represented in the brain? It has been proposed that the specific combination of attributes corresponding to one object could be coded by an assembly of neurons distributed in different functional areas, the distinctive sign of neurons belonging to the same assembly being the synchronization of their discharge on an oscillatory mode (Milner, 1974; von der Malsburg and Schneider, 1986; Singer and Gray, 1995).

Epochs of synchronization on an oscillatory mode in the γ-band (20–80 Hz) have been observed in various species and neural structures (for review, see Singer and Gray, 1995; Engel et al., 1997). Moreover, in support of the above hypothesis, evidence for enhanced γ-band synchronization in response to a visually coherent object has been obtained in anesthetized (Eckhorn et al., 1988; Gray et al., 1989;Engel et al., 1991; Freiwald et al., 1995; Brosch et al., 1997) and awake (Gray and Di Prisco, 1997) cats and awake monkeys (Kreiter and Singer, 1992, 1996), as well as in EEGs in humans (Lutzenberger et al., 1995; Müller et al., 1996, 1997; Tallon-Baudry et al., 1996).

Oscillatory synchronization in the γ-band thus could underlie the feature binding process. Another experiment in humans suggests that an oscillatory assembly is also activated when the internal representation of an object is needed (Tallon-Baudry et al., 1997): when subjects were required to find an hidden object in a picture (e.g., they had to activate the representation of the object searched for), enhanced γ-band activity was observed. Altogether, these results suggest that the neural correlate of the activation of a visual object representation is the oscillatory synchronization of an assembly, no matter whether the activation of this representation is triggered by an external input (or bottom-up process), as in feature binding, or induced by an internal event (or top-down process), as in the visual search task described above.

If the hypothesis of a correspondence between the activation of an object representation and the oscillatory synchronization of a neural assembly is correct, then induced γ-band activity should be elicited in any task requiring the activation of an object representation. The experiment presented here was designed to test whether such an oscillatory activity could be detected when an object representation is held active in short-term memory. Thirteen subjects performed a delayed-matching-to-sample task (see Fig. 1A). During the delay, subjects had to hold the representation of the first stimulus S1 in memory, and sustained γ-band activity is expected. A control task was designed (dimming condition; see Fig.1B) in which S1 does not have to be memorized.

Fig. 1.

Fig. 1.

Experimental design. In the memory condition, two stimuli, S1 and S2, were presented for 400 msec separated by a 800 msec delay. Subjects were to detect matching S2 (20% of the trials). In the dimming condition, no second stimulus appeared. Instead, the fixation cross could either dim (80% of the trials) or remain the same (20%), in which case subjects had to respond. In this control condition, S1 does not have to be memorized and acts only as a warning stimulus. These two conditions were presented in two successive recording sessions. We expected to find induced γ-band activity reflecting S1 representation rehearsal during the delay in the memory condition but not in the dimming condition.

Finally, it should be emphasized that in all the studies quoted above, the functionally relevant epochs of oscillatory synchronization are not strictly triggered by stimulus onset; rather, they appear with a jitter in latency from one trial to the next. Our analysis will thus focus on so called “induced” or “nonphase-locked to stimulus onset” activities.

MATERIALS AND METHODS

Subjects. Thirteen right-handed subjects (five females, mean age 24 years) gave their informed and written consent to participate in the study. All of the subjects had normal or corrected to normal vision.

Procedure. The experiment was divided into two conditions: a memory condition and a dimming condition (Fig.1). Before the recordings, subjects were instructed on which task they were to perform. The order in which the conditions were presented was counterbalanced between subjects. The shapes used were smooth, without any sharp angle, to avoid any explicit spatial description of the stimulus; rather, we hoped the subjects would hold active a “photographic” representation of S1 during the delay. Stimuli were presented in black on a light gray background on a video monitor (refresh rate 106 Hz). They were positioned within a disk with an inner radius of 0.88° and an outer radius of 1.23°, at a viewing distance of 1.5 m. In the memory condition, one trial consisted of (Fig. 1A) (1) red fixation cross for 800 msec, (2) S1 presentation for 400 msec, (3) a blank screen where only the red fixation cross remained for 800 msec, and (4) S2 presentation for 400 msec. S2 could be either different (no-go trial) or identical (go trial) to S1. In the dimming condition, one trial consisted of (1) red fixation cross for 800 msec, (2) S1 presentation for 400 msec, (3) red fixation cross for 800 msec, and (4) fixation cross (400 msec) of a dimmed red (no-go trials) or the same red (go trials). In both conditions, the intertrial interval was randomized between 2 and 3 sec. The proportion of go trials was 20%, with no more than two consecutive go trials. The difficulty of the task was adjusted to the subject’s performance (see below). Subjects were instructed to press a mousekey with the right finger on go trials and to answer accurately rather than fast. They were also instructed to blink only during intertrial intervals.

Stimuli construction. Stimuli were strictly parametrized by 12 anchor points of polar coordinates (rii), with θi = 2π/12(i − 1) andri being randomly chosen at each trial betweenrmin and rmax, corresponding to visual angles of 0.88° and 1.23°, respectively. The interpolation function between four successive points (r11)..(r44) is given by Lagrange’s polynomial:

r(θ)=i riji (θθj)ji (θiθj),

where i and j vary between 1 and 4. Only the inner segment, between (r22) and (r33), is drawn. The same procedure is applied until the 11 segments of the curve are drawn.

Adjustment of the difficulty of the task to the subject’s performance. When S2 does not match S1, it is derived from S1 by modulating the radii of the 12 anchor points:ri(S2) = ri(S1) × (1 ± coeff). The sign of the modulation (increase or decrease ri) is chosen randomly and independently for each anchor point. The value ofcoeff is set at 12% at the beginning of a block of recording and constantly modified according to the subject’s performance computed on the last 20 trials. If this moving average performance falls below 90%, coeff is increased by 1%, and the task becomes easier. On the contrary if the performance is above 90%, coeff is decreased by 1%, and the task becomes more difficult. The upper and lower limit values of coeff were set at 20% and 4%, respectively.

In the dimming task, no second shape appears, but the intensity of the red fixation cross may decrease. The difficulty of this task is also modulated according to the subject’s performance. When the moving average performance falls below 90%, the intensity decrement of the screen red channel is increased by one, and the task becomes easier; if the value of the intensity decrement is decreased by one, the task becomes more difficult.

Recordings. EEG was recorded continuously at a sampling rate of 1000 Hz (0.1–320 Hz analog bandwidth) from 17 Ag-AgCl electrodes referenced to the nose. Electrode impedances were kept below 5 kΩ. Their locations, according to the international 10–20 system, are Iz, P7, O1, O2, P8, POz, P3, Pz, P4, C3, Cz, C4, F3, Fz, and F4. Two electrodes (OM1, OM2) were placed halfway between Iz and the mastoids. Electrode placement on the head was computer-assisted (Echallier et al., 1992). Horizontal eye movements were monitored (electrode Yh), and a rejection threshold was set for each subject at one-fourth of the potential value corresponding to a saccade of 4°. Four blocks of ∼70 trials (56 no-go and 14 go trials) each were recorded in each condition. Epochs containing artifacts (EEG > 100 μV or EOG > threshold) were rejected off-line.

Time–frequency (TF) transformation of the data. We were interested in the identification and characterization of oscillatory activities induced by a stimulation. Because neither latency nor frequency of these oscillatory bursts was known a priori, a method that preserves both types of information was chosen: the time–frequency (TF) representation based on a wavelet transform of the signals (Tallon-Baudry et al., 1996, 1997). The main advantage of this approach, compared with the short-term Fourier transform approach (Makeig, 1993), is that the duration of the window of analysis depends on the frequency band: the higher the central frequency, the shorter the window duration and the wider the frequency band. This method thus provides a better compromise between time and frequency resolutions (Sinkkonen et al., 1995).

Using this method, induced activities can be analyzed. So-calledinduced activities appear with a jitter in latency from one trial to the next; thus, they tend to disappear on the classic averaged evoked potential. When the TF energy is computed on each single trial and averaged (TF energy averaged across single trials), induced activities can be analyzed, provided their signal-to-noise ratio is high enough. The TF transformation can also be applied to the averaged evoked potential (TF energy of the evoked potential). Induced activities tend to disappear on these plots, whereas information on oscillatory bursts that are phase-locked to stimulus onset (e.g., appearing exactly at the same latency in each trial) can be obtained.

The signal was convolved with complex Morlet’s waveletsw(t, f0) (Kronland-Martinet et al., 1987) having a Gaussian shape both in the time domain (SD ςt) and in the frequency domain (SD ςf) around its central frequencyf0: w(t, f0) = A.exp(−t2/2ςt2) × exp(2iπf0t), with ςf = 1/2πςt. Wavelets were normalized so that their total energy was 1, the normalization factor A being equal to (ςt✓π)−1/2. A wavelet family is characterized by a constant ratio (f0f), which should be chosen in practice greater than ∼5 (Grossmann et al., 1989). The wavelet family we used was defined byf0f = 7, withf0 ranging from 8 to 100 Hz in 1 Hz steps. At 8 Hz, this leads to a wavelet duration (2ςt) of 278 msec and to a spectral bandwidth (2ςf) of 2.3 Hz; at 20 Hz, to a wavelet duration of 111.4 msec and to a spectral bandwidth of 5.8 Hz; and at 100 Hz, to a duration of 22.2 msec and a bandwidth of 28.6 Hz. The time resolution of this method thus increases with frequency, whereas the frequency resolution decreases. The time-varying energy E(t,f0) of the signal in a frequency band around f0 is the squared norm of the result of the convolution of a complex wavelet w(t,f0) with the signals(t): E(t,f0) = ‖w(t,f0) ×s(t)‖2. A family of wavelets will provide a TF representation of the energy of the signal. The mean TF energy of the prestimulus (between −300 and −50 msec) is considered as a baseline level and subtracted from the prestimulus and poststimulus TF energy. This correction is done separately in each frequency band.

Data analysis. On average, approximately 140 correct no-go trials and 50 correct go trials were included in the analysis in each condition after artifact rejection. Because the subjects’ performance was maintained at about 90%, there were not enough incorrect trials to analyze them. Go and no-go trials were pooled together to improve the signal-to-noise ratio for the analysis of the responses to S1 and of the activities during the delay. Go and no-go trials were processed separately to analyze the responses to S2 in the memory condition.

Because the TF energy values were far from having a Gaussian distribution, the nonparametric test of Wilcoxon for matched pairs was used for all statistical analysis.

RESULTS

Behavioral results

Performance was on average 88% in both conditions. Reaction times measured on correct go trials from S1 onset were shorter in the memory (1992 ± 23 msec) than in the dimming condition (2049 ± 54 msec). Although small, this difference is significant (p = 0.039; Wilcoxon test for matched pairs).

Induced γ-band responses

The TF energy averaged across single trials, where oscillatory activities induced by the stimulus (e.g., appearing with a jitter in latency) can be analyzed, is presented in Figure2A for both the memory and dimming condition. Two peaks of enhanced activity around 30 Hz can be observed in both conditions: a first one at ∼280 msec after S1 onset, called the ON response, and a second one at ∼680 msec (e.g., 280 msec after S1 offset) that we called an OFF-induced response. A third area of enhanced γ-band activity can also be observed during the delay in the memory condition (700–1000 msec). These three peaks of γ-band activity disappear on the TF representation of the averaged evoked potential (Fig. 2B, white boxes). Hence, they do correspond to induced activities. The frequency band between 24 and 60 Hz was chosen for further analysis of these phenomena. The choice of the lower limit of this frequency band (24 Hz) was made to avoid mixing these enhancements of γ-band energy with the decreases in energy that can be observed at lower frequencies (Fig. 2A).

Fig. 2.

Fig. 2.

A, Time–frequency (TF) representation of the energy averaged across single trials at electrode C3, grand average across subjects, in both conditions. Time is presented on the x-axis. Frequency is presented on the y-axis on a logarithmic scale. The energy level is coded on a color scale: yellow areas show an enhancement of energy compared with prestimulus level, andred areas show decrease. Three areas of enhanced high-frequency activity can be observed (white boxes): (1) an ON response, peaking at ∼280 msec and 30 Hz, higher in the memory than in the dimming condition; (2) an OFF response, peaking at ∼680 msec (e.g., 280 msec after S1 offset), similar in both conditions; and (3) a γ-band activity during the delay, in the memory condition only. B, Time–frequency (TF) representation of the energy of the averaged evoked potential at electrode C3, grand average across subjects, in both conditions. Only activities phase-locked to stimulus onset (e.g., appearing at a fixed latency from one trial to the next) can be observed on these representations. The three areas of enhanced high-frequency activity in A thus correspond to induced activities (e.g., appearing with a latency jitter from one trial to the next).

S1 ON and OFF γ-band responses

The first peak of γ-band activity appears at ∼280 msec. It was analyzed by averaging the 24–60 Hz TF energy (Fig.3A) between 230 and 330 msec: the ON response is higher in the memory than in the dimming condition. This effect is significant at electrodes OM1, O1, and P3 (p < 0.04). Although there is a difference in energy, the ON response is most prominent at posterior electrodes in both conditions (Fig. 3B). Still, it is lateralized on the left in the memory condition only. When the 24–60 Hz, 230–330 msec energy across OM1, O1, P7, and P3 was averaged and compared with the energy averaged across the corresponding electrodes over the right hemisphere (OM2, O2, P8, and P4), a significant difference was revealed in the memory condition (p = 0.039) but not in the dimming condition (p = 0.39).

Fig. 3.

Fig. 3.

A, Energy of the mean activity between 24 and 60 Hz, grand average across subjects, in the memory (thick line) and in the dimming (thin line) conditions. A first peak of enhanced γ-band activity appears at ∼280 msec (ON). It is significantly stronger in the memory than in the dimming condition. An OFF response can be observed at ∼680 msec (e.g., 280 msec after S1 offset); it does not show any significant difference between conditions. Later on, during the delay, a sustained γ-band activity appears in the memory condition only, mainly at left posterior electrodes and bilaterally at frontal electrodes. It tends to decrease before the end of the delay.B, Topographical maps of the 24–60 Hz energy (left, back, and right views of the head) averaged between 230 and 330 msec (ON response), and between 630 and 730 msec (OFF response). In both conditions, the ON response is maximum over occipital electrodes and decreases smoothly until the frontal sites. It is not only enhanced but also lateralized over the left hemisphere in the memory condition. The topography of the OFF response is less clear-cut; it tends to be maximum over occipital sites also. No tendency for lateralization of the OFF response could be found in any condition.

The OFF-induced γ-band response appears at ∼680 msec, e.g., 280 msec after S1 offset (Fig. 2A). Comparing the 24–60 Hz energy averaged between 630 and 730 msec at each electrode does not reveal any significant differences between conditions (p > 0.46). The topography of this OFF response (Fig. 3B) seems more widespread than the topography of the ON response, and no significant effect of lateralization can be found.

S1 ON and OFF evoked 0–25 Hz responses

After S1 onset, the P1 (105 msec) and N1 (155 msec) components of the averaged evoked potential filtered between 0 and 25 Hz can be observed in both conditions (Fig. 4). They are not affected by condition type, neither in peak latency nor in peak amplitude. The first difference in the evoked potential can be observed in the 200–240 msec time window (P2 component), where the mean amplitude is significantly enhanced in the dimming condition compared with the memory condition at electrodes OM1 (p = 0.013), Iz (p = 0.046), OM2 (p = 0.033), and P7 (p = 0.046). Later on, between 300 and 550 msec, a sustained activity is higher in the memory condition at electrodes POz and Pz (p < 0.04).

Fig. 4.

Fig. 4.

Filtered (0–25 Hz) averaged evoked potentials, grand average across subjects, in the memory (thick line) and in the dimming (thin line) conditions. The first significant difference (*1) between the two conditions occurs at 200–240 msec: the posterior P2 component is more pronounced in the dimming condition. Later on (300–550 msec), a sustained positivity appears at electrodes POz (not shown) and Pz (*2) in the memory condition. During the delay, a negativity appears at left occipitotemporal sites (O1, *3) in the memory condition and tends to decrease in the end of the delay, whereas a parietocentral negativity rises (Cz, *4). See also the maps of the 0–25 Hz evoked potential during the delay in Figure5B.

An N1 OFF component appears in the evoked potential 155 msec after S1 offset, at 555 msec (Fig. 4). It does not vary with condition type at any electrode (amplitude averaged between 500 and 600 msec;p > 0.25).

Induced γ-band responses during the delay

As depicted in Figure 3A, sustained 24–60 Hz activity is observed in the memory condition, whereas in the dimming condition, the level of 24–60 Hz energy is often negative (e.g., below prestimulus level). It should be noted that this sustained activity in the memory condition tends to decrease before the end of the delay. Table 1 shows the electrodes at which the difference between the two conditions during the delay is significant, on 200-msec time windows shifted by steps of 100 msec. The enhancement in the memory condition is highly significant from 750 to 1050 msec at several electrodes; at the end of the delay (950–1150 msec), it remains significant only at electrode C3.

Table 1.

Electrodes at which a significant effect between the two conditions can be observed, in the 24–60 Hz energy (left), in the evoked potential (middle), and in the 15–20 Hz band (right) in 200 msec time windows shifted by steps of 100 msec.

24–60 Hz band Evoked potential 15–20 Hz band
750–950 850–1050 950–1150 750–950 850–1050 950–1150 750–950 850–1050 950–1150
Left
 OM1 ** ** *
 P7 ** * * ** **
 O1 ** ** ** *
 P3 ** ** ** ** ** * *
 C3 ** ** ** *
 F3 ** *
Central
 Iz * * ** *
 POz ** * * * **
 Pz * **
 Cz * ** **
 Fz * ** *
Right
 OM2 ** **
 P8 ** ** **
 O2 * ** *
 P4 * * *
 C4 ** * *
 F4 *
EOG
 Yh

−, Nonsignificant.

*0.02 < p < 0.05.

**p < 0.02; nonparametric Wilcoxon test for matched pairs.

The topography of the 24–60 Hz activity during the delay is depicted in Figure 5A in both conditions. In the memory condition, two regions of enhanced γ-band activity can be observed, one at left occipitotemporal electrodes and another one bilaterally at frontocentral electrodes. The occipitotemporal maximum is lateralized on the left side in the memory condition (comparison of the 24–60 Hz energy averaged across electrodes OM1, O1, P7, and P3 vs OM2, O2, P8, and P4; 750–950 msec time window: p = 0.003 in the memory condition, 0.91 in the dimming condition; 850–1050 msec: p = 0.008 in the memory condition, 0.25 in the dimming condition; 950–1150 msec: not significant in both conditions). The frontocentral activity shows no significant preference for an hemisphere (comparison of the 24–60 Hz energy averaged across electrodes C3 and F3 vs C4 and F4 in the time windows 750–950 msec, 850–1050 msec, and 950–1150 msec:p > 0.25 in both conditions).

Fig. 5.

Fig. 5.

Summary of the activities observed during the delay. A, γ-band (24–60 Hz) energy in the memory (top row) and in the dimming condition (bottom row), averaged on three overlapping 200-msec time windows (750–950, 850–1050, and 950–1150 msec). Topographical maps of left, back, and right views of the head are displayed for these three time windows. Enhanced γ-band activity appears in the memory condition at left posterior electrodes, and bilaterally at more frontal sites (arrows). Differences between conditions are more significant at the beginning of the delay (750–950 and 850–1050 msec time windows) than in the end (950–1150 msec). B, Topographical maps of the evoked potential in both conditions, at three latencies (700, 900, and 1050 msec). Two negativities with different time courses and topographies can be observed in the memory condition (arrows): a left posterior negativity is prominent at the beginning of the delay and decreases in the end of the delay, whereas a parietocentral negativity rises. C, Energy (15–20 Hz) averaged in three 200 msec time windows in both conditions. An occipital enhancement, with a tendency for being lateralized on the right, is observed in the memory condition. Another peak of 15–20 Hz activity is observed in the memory condition at the midline frontal electrode. D, Energy in the α-band (8–12 Hz), averaged between 750 and 1150 msec. No difference between the two conditions could be detected in this frequency band during the delay.

Evoked (0–25 Hz) responses during the delay

The evoked potential during the delay was studied on 200 msec time windows shifted by steps of 100 msec. Electrodes at which a significant difference between the two conditions could be found appear in Table 1. Two components are enhanced in the memory condition (Fig. 4): a sustained left-sided negativity, restricted to the posterior electrodes and more prominent at the beginning of the delay, and a parietocentral negativity that rises later on. On topographical maps (Fig.5B), the balance between these two components results in a shift of the maximum of voltage from left posterior electrodes (750 msec) to parietocentral sites (1050 msec). It should be noted that no significant difference between conditions could be found at frontal electrodes in the 0–25 Hz evoked potential during the delay.

Induced activity in other frequency bands during the delay

As can be seen in Figure6A, in addition to the increase in the γ-band, an enhancement of the energy at lower frequencies (15–20 Hz) is observed in the memory condition at occipital electrodes during the delay. This peak of enhanced energy disappears on the TF representation of the evoked potential (Fig.6B); it thus corresponds to induced activity. The 15–20 Hz band was chosen for further analysis.

Fig. 6.

Fig. 6.

A, Time–frequency energy averaged across single trials at electrode O2, in the memory (top) and in the dimming (bottom) conditions. Sustained 15–20 Hz activity is observed during the delay in the memory condition only (arrows). B, Time–frequency energy of the averaged evoked potential at electrode O2, in the memory condition. The 15–20 Hz energy observed during the delay disappears: it is an induced activity.

The 15–20 Hz energy during the delay was averaged on 200 msec time windows shifted by steps of 100 msec. Electrodes at which a significant difference between the two conditions could be found are shown in Table1. Significant effects are found at both occipital and frontal sites. The occipital enhancement of the 15–20 Hz activity in the memory condition remains significant through all the delay, although the energy level in the memory condition tends to decrease in the end of the delay. The frontal enhancement in the memory condition is significant at Fz and F4 only within the 850–1050 msec time interval. Topographical maps of this frequency band are shown in Figure5C.

As opposed to the 24–60 Hz energy in the same latency range, the occipital component of the 15–20 Hz activity is not lateralized on the left side. A tendency for being lateralized on the right side in the memory condition can even be observed (Fig. 5C), although it does not reach the significance level [comparison of the averaged 15–20 Hz energy at electrodes O1 (left) and O2 (right); time window 850–1050 msec: p = 0.13 in the memory condition and 0.86 in the dimming condition; 950–1150 msec: p = 0.1 in the memory condition and 0.55 in the dimming condition]. The frontal 15–20 Hz activity is not lateralized (comparison of F3 vs F4; time interval 850–1050 msec: p = 0.22 in the memory condition and 0.80 in the dimming condition; 950–1150 msec:p > 0.35 in both conditions).

Finally, we studied the variations in the α band (8–12 Hz) during the delay. TF energy values averaged between 8 and 12 Hz and 750 and 1150 msec did not reveal any significant difference between conditions during the delay at any electrode (p > 0.34). The topography of the α band in the 750–1150 msec time interval is shown in Figure 5D.

Baseline level

Up to now, the TF energy values used for statistical analysis were baseline-corrected by subtracting the averaged value of the prestimulus level between −300 and −50 msec. For each of the frequency bands studied (24–60, 15–20, and 8–12 Hz), we thus examined whether there was any significant difference between conditions of the prestimulus level. Comparison of the averaged energy between −300 and −50 at each electrode did not reveal any significant difference between conditions in any frequency band (24–60 Hz band: p > 0.25; 15–20 Hz: p > 0.27; 8–12 Hz: p > 0.31).

Responses to S2

Go and no-go trials were pooled together in the analysis of the responses to S1 and the responses during the delay to improve the signal-to-noise ratio. To study the response to S2 in the memory condition, we separated the trials with matching S2 (20%) from the trials with nonmatching S2 (80%). The averaged evoked potentials for matching and nonmatching S2 at electrode O1 are depicted in Figure7A: matching and nonmatching S2 differ at ∼1410 msec, e.g., 210 msec after S2 onset. The mean amplitude, measured in a 40 msec time window centered at 1410 msec, differs significantly between matching and nonmatching S2 at electrode O1 and O2 (p = 0.013) and P8 and POz (p = 0.039). Before this latency, neither the P1 nor the N1 components vary, estimated on a 40 msec time window centered at their peaking latency (P1: p > 0.2 at all electrodes; except O1 and OM1: p > 0.1; N1:p > 0.2).

Fig. 7.

Fig. 7.

Responses to the matching and nonmatching S2 in the memory condition. A, Evoked (0–25 Hz) potentials at electrode O1, for matching (thin line) and nonmatching (thick line) S2. The first significant difference occurs around 1410 msec (*), e.g., 210 msec after S2 onset. B, Time–frequency representation of the energy averaged across single trials at electrode O1, in the memory condition, trials with nonmatching S2. A small γ-band response to S2 can be observed at ∼1500 msec (300 msec after S2 onset); it is much smaller than the ON response to S1. C, Topographical maps at 28 Hz (back views) of the ON response to S1 (top), of the ON response to nonmatching S2 (bottom left), and of the ON response to matching S2 (bottom right). In both cases (matching or nonmatching S2), the ON response to S2 is much smaller than the response to S1. Nevertheless, it shows the same left occipital maximum.

In the γ-band, a small ON induced response to S2 can be observed at ∼1500 msec (Fig. 7B), e.g., 300 msec after S2 onset and later than the difference between matching and nonmatching S2 in the evoked potentials. It is much smaller than the ON response to S1, but with a similar topography (Fig. 7C). Whether or not S2 matches, S1 does not seem to influence this small γ-band response.

DISCUSSION

γ-band activity during the delay

This experiment was designed to test the existence of induced γ-band activity when one has to hold active an object representation in short-term memory. Such an activity was indeed observed in the memory condition. It appears at left occipitotemporal electrodes and bilaterally at left and right frontal sites. In the dimming condition, where there is no need to memorize S1, no γ-band activity is found. This finding meets with the hypothesis of a representation of visual objects through the oscillatory synchronization of a distributed neural assembly.

This enhancement of γ-band activity during the delay in the memory condition appears at both occipitotemporal and frontal electrodes. The occipitotemporal component seems to be already present in the ON γ response to the first stimulus. This ON occipital γ response in previous studies (Tallon-Baudry et al., 1996, 1997) was associated with the building of an object representation. A possible interpretation of our data is that during the delay the occipital S1 representation is being rehearsed. The additional frontal γ-band activity could be necessary to maintain the occipitotemporal activity. Indeed, there may be some cortical interplay between the ventral visual areas and the prefrontal areas that have both been involved in stimulus retention in short-term memory in animals (for review, see Goldman-Rakic, 1995;Desimone, 1996; Fuster, 1997) and humans (for review, see Ungerleider, 1995) (also see Swartz et al., 1995; Courtney et al., 1996). Although the idea of a synchronized cortical network involving prefrontal and ventral visual areas fits with current hypotheses both on short-term visual memory and on synchronized cell assembly, this interpretation goes beyond our results. Indeed, because scalp recordings do not allow one to localize directly the active neural sources, we cannot be sure that the γ-band activity appearing at left occipitotemporal and frontal electrodes is generated within the underlying cortical areas. Furthermore, we do not know whether the γ-band activities appearing at frontal and occipitotemporal electrodes are synchronized or whether they behave as two independently oscillating ensembles.

We thus interpret our data in terms of neural activity related to memory processes. We cannot rule out a possible contribution of muscle activity to our results. Nevertheless, it seems unlikely that muscle activity alone can explain our data. Indeed, it would mean that muscle activity is task-dependent (memory/dimming condition), has a different topography according to the processing stage of the stimulus in the memory condition (occipitotemporal when the stimulus is being encoded, occipitotemporal plus frontal when it is being rehearsed), and is lateralized on the left. Another argument is the frequency distribution of the induced γ-band activity compared with the frequency distribution of muscle activity. Indeed, a control experiment in one subject (Fig. 8) shows that the ON induced γ response observed in the memory condition is confined to the 24–46 Hz band, whereas muscle activity extends up to 100 Hz.

Fig. 8.

Fig. 8.

Power spectrum of the ON induced γ response compared with muscle activity (one subject, electrode O1 referenced to the nose). The subject was first recorded while performing the memory task. Comparison of the spectrum of the prestimulus (−350 to −50 msec) to the spectrum of the ON induced γ-band response (270–320 msec) reveals a difference around 35 Hz (gray area), whereas above 55 Hz, both spectra are similar. The ON induced γ response is thus confined to a narrow frequency band. The subject was then recorded while simply fixating on the screen, both at rest and when contracting different muscles (neck, jaw, eyebrow). The power spectrum at rest was lower than during the prestimulus of the memory condition, but with a similar profile. To get rid of scaling effects, the power spectrum at rest was normalized with respect to the 24–46 Hz band of the power spectrum of the prestimulus. Similarly, the power spectrum of muscle activity was normalized with respect to the 24–46 Hz band of the ON induced response power spectrum. It clearly appears that the effect of muscle activity is not restricted to this 24–46 Hz band, but extends up to 100 Hz. The same type of spectrum is observed when the different muscles (neck, jaw, eyebrow) are activated separately.

Because our subjects had to produce a motor response, one can also wonder whether our results are not related to γ activity elicited by motor preparation (Pfurtscheller and Neuper, 1992; Kristeva-Feige et al., 1993; Nashmi et al., 1994; Pfurtscheller et al., 1994). This does not seem to be the case because (1) motor responses were given both in the memory and the dimming conditions, whereas we observe γ-band activity during the delay in the memory condition only; and (2) if the γ activity we observe was related to motor preparation, one would expect it to increase until movement onset, whereas it tends to decrease even before the end of the delay.

The temporal course of the γ-band activity during the delay is indeed surprising: the energy level decreases before S2 onset. Why does not it remain high until the end of the delay? The duration of the delay was constant for all the trials, so it may be that S2 onset was anticipated by the subject. Such an anticipation phenomena was already described byKlimesch (1996) in the α band.

A number of other activities were analyzed during the delay, namely the averaged evoked potential and induced activities in the 15–20 Hz band and in the α band. Do all these activities belong to the same functional ensemble?

Evoked potentials (0–25 Hz)

The 0–25 Hz filtered evoked potential shows two enhanced components in the memory condition: a left posterior negativity and a parietocentral negativity. It should be emphasized that no differences between conditions were detected at frontal electrodes in the evoked potential. The left occipitotemporal negativity may be related to the sustained firing of inferotemporal neurons observed in similar tasks in animal (Fuster and Jervey, 1981; Miyashita and Chang, 1988; Fuster, 1990; Miller and Desimone, 1993). It may be tightly linked to the γ-band activity appearing at the same left occipitotemporal electrodes. A possible functional role for this component is to prepare a speedy matching process between S1 and S2, because later on the evoked potentials reflect the result of this process by indicating a difference between matching and nonmatching S2. This difference between matching and nonmatching S2 in the evoked potential was already observed in other studies in the same latency range (Begleiter et al., 1993; Sugita, 1994). Furthermore, if this difference means that the comparison process between S1 and S2 is completed 210 msec after S2 onset, it may account for the reduced γ-band response 300 msec after S2 onset: there is no need for building a representation of S2 because the matching task is over.

The other component in the evoked potential during the delay is a rising parietocentral negativity. It is greater in the memory than in the dimming condition. Both the topography (parietocentral) and the time course (increases until S2 onset) of this negativity suggest that it belongs to the family of contingent negative variation waves, which have been observed repeatedly in humans in paired S1–S2 paradigms, especially when the contingency between the two stimuli is strong (for review, see Tecce and Cattanach, 1987).

Activity in other frequency bands during the delay

During the delay, an enhancement of induced activity in the 15–20 Hz band in the memory condition can also be observed at some electrodes. It thus shows the same functional variations with conditions as the activity in the γ-band. Furthermore, its temporal course is quite similar to the temporal course of the γ-band activity during the delay, with a tendency to decrease before S2 onset. Still, the topographies of these two activities are different: at least partially different areas would be involved in the generation of 24–60 and 15–20 Hz activities. The difference between the 24–60 and 15–20 Hz topographies may also partly justify the choice of these two frequency bands: they do not reflect the same phenomena.

Activity in the α-band (8–12 Hz) during the delay did not show any significant difference between conditions during the delay, which seems to rule out the possibility (Jürgens et al., 1995) that either the γ-band or the 15–20 Hz activities are simply harmonics of α activity, at least in this experiment. In other studies (Gevins et al., 1997; Klimesch et al., 1997), energy in the α-band was found to correlate with memory load. Still, both the protocols (continuous matching task or words learning and recognition) and the long time windows that were studied (>1 sec) in these experiments are very different from what we used.

Responses to S1

Responses to S1 differ in the memory and in the dimming tasks. Evoked potentials show a decrease of the P2 component in the memory condition, which could be associated with feature selection processes (for review, see Anllo-Vento and Hillyard, 1996). It is followed by a positive wave that may be related to the memory encoding process. In the γ-band, the ON induced response appearing at ∼280 msec after S1 onset is reduced in the dimming condition, where S1 probably acts only as a warning stimulus. Because our experimental protocol was designed to study the delay, during S1 presentation many behavioral parameters differ between conditions (spatial attention more focal in the dimming task, feature selective attention to shape and memory encoding in the memory task, etc.). It is thus difficult to give a precise interpretation of the responses to S1.

Finally, OFF responses to S1 have been observed not only in the evoked potential but also in the γ-band. To our knowledge, such a response had not been described so far in the γ-band. Its latency (280 msec after S1 offset) seems to indicate that it is indeed an inducedOFF response. Because it does not vary between conditions in this experiment, its functional significance remains unclear.

Conclusion

Both the existence and the topography of the γ-band activity during the delay in the memory condition fit the hypothesis of an oscillatory network centered on prefrontal and visual areas that would ensure the rehearsal of the first stimulus representation in memory. This experiment does not prove definitively that this interpretation is the right one. In particular, we have not ruled out definitively a possible contribution of muscle activity to our data. In any case, oscillatory γ-band activity seems to be a specific component of the response: it can be distinguished from the averaged evoked potential and from activities in other frequency bands (8–12, 15–20 Hz) either by its topography or by its variations with conditions. Finally, the topography of γ-band activity is modified depending on whether S1 is being encoded (left occipitotemporal) or rehearsed (left occipitotemporal plus frontal). This suggests that depending on the task to be performed, different functional areas can be recruited to participate in an oscillatory γ-band ensemble.

Footnotes

This work was supported by grants from the Human Frontier Science Program and the Rhône-Alpes region. We thank J. F. Echallier and P. E. Aguerra for helpful assistance.

Correspondence should be addressed to Catherine Tallon-Baudry, INSERM U280, 151 cours Albert Thomas, F-69003 Lyon, France.

REFERENCES

  • 1.Anllo-Vento L, Hillyard SA. Selective attention to the color and direction of moving stimuli: electrophysiological correlates of hierarchical feature selection. Percept Psychophys. 1996;58:191–206. doi: 10.3758/bf03211875. [DOI] [PubMed] [Google Scholar]
  • 2.Begleiter H, Porjesz B, Wang W. A neurophysiologic correlate of visual short-term memory in humans. Electroenceph Clin Neurophysiol. 1993;87:46–53. doi: 10.1016/0013-4694(93)90173-s. [DOI] [PubMed] [Google Scholar]
  • 3.Brosch M, Bauer R, Eckhorn R. Stimulus-dependent modulations of correlated high-frequency oscillations in cat visual cortex. Cereb Cortex. 1997;7:70–76. doi: 10.1093/cercor/7.1.70. [DOI] [PubMed] [Google Scholar]
  • 4.Courtney SM, Ungerleider LG, Keil K, Haxby JV. Object and spatial visual working memory activate separate neural systems in human cortex. Cereb Cortex. 1996;6:39–49. doi: 10.1093/cercor/6.1.39. [DOI] [PubMed] [Google Scholar]
  • 5.Desimone R. Neural mechanisms for visual memory and their role in attention. Proc Natl Acad Sci USA. 1996;93:13494–13499. doi: 10.1073/pnas.93.24.13494. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Echallier JF, Perrin F, Pernier J. Computer assisted placement of electrodes on the human head. Electroenceph Clin Neurophysiol. 1992;82:160–163. doi: 10.1016/0013-4694(92)90161-a. [DOI] [PubMed] [Google Scholar]
  • 7.Eckhorn R, Bauer R, Jordan W, Brosch M, Kruse W, Munk M, Reitboeck HJ. Coherent oscillations: a mechanism of feature linking in the visual cortex? Biol Cybern. 1988;60:121–130. doi: 10.1007/BF00202899. [DOI] [PubMed] [Google Scholar]
  • 8.Engel AK, König P, Singer W. Direct physiological evidence for scene segmentation by temporal coding. Proc Natl Acad Sci USA. 1991;88:9136–9140. doi: 10.1073/pnas.88.20.9136. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Engel AK, Roelfsema PR, Fries P, Brecht M, Singer W. Role of the temporal domain for response selection and perceptual binding. Cereb Cortex. 1997;7:571–582. doi: 10.1093/cercor/7.6.571. [DOI] [PubMed] [Google Scholar]
  • 10.Freiwald WA, Kreiter AK, Singer W. Stimulus dependent intercolumnar synchronization of single unit responses in cat area 17. NeuroReport. 1995;6:2348–2352. doi: 10.1097/00001756-199511270-00018. [DOI] [PubMed] [Google Scholar]
  • 11.Fuster JM. Inferotemporal units in selective visual attention and short-term memory. J Neurophysiol. 1990;64:681–697. doi: 10.1152/jn.1990.64.3.681. [DOI] [PubMed] [Google Scholar]
  • 12.Fuster JM. Network memory. Trends Neurosci. 1997;20:451–459. doi: 10.1016/s0166-2236(97)01128-4. [DOI] [PubMed] [Google Scholar]
  • 13.Fuster JM, Jervey JP. Inferotemporal neurones distinguish and retain behaviorally relevant features of visual stimuli. Science. 1981;212:952–955. doi: 10.1126/science.7233192. [DOI] [PubMed] [Google Scholar]
  • 14.Gevins A, Smith ME, Mcevoy L, Yu D. High-resolution EEG mapping of cortical activation related to working memory: effects of task difficulty, type of processing, and practice. Cereb Cortex. 1997;7:374–385. doi: 10.1093/cercor/7.4.374. [DOI] [PubMed] [Google Scholar]
  • 15.Goldman-Rakic PS. Cellular basis of working memory. Neuron. 1995;14:477–485. doi: 10.1016/0896-6273(95)90304-6. [DOI] [PubMed] [Google Scholar]
  • 16.Gray CM, DiPrisco GV. Stimulus-dependent neuronal oscillations and local synchronization in striate cortex of the alert cat. J Neurosci. 1997;17:3239–3253. doi: 10.1523/JNEUROSCI.17-09-03239.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Gray CM, König P, Engel AK, Singer W. Oscillatory responses in cat visual cortex exhibit inter-columnar synchronization which reflects global stimulus properties. Nature. 1989;338:334–337. doi: 10.1038/338334a0. [DOI] [PubMed] [Google Scholar]
  • 18.Grossmann A, Kronland-Martinet R, Morlet J. Reading and understanding continuous wavelets transforms. In: Combes JM, Gossmann A, Tchamitchian P, editors. Wavelets, time-frequency methods and phase space. Springer; Berlin: 1989. pp. 2–20. [Google Scholar]
  • 19.Jürgens E, Rösler F, Hennighausen E, Heil M. Stimulus-induced gamma oscillations: harmonics of alpha activity? NeuroReport. 1995;6:813–816. doi: 10.1097/00001756-199503270-00027. [DOI] [PubMed] [Google Scholar]
  • 20.Klimesch W. Memory processes, brain oscillations and EEG synchronization. Int J Psychophysiol. 1996;24:61–100. doi: 10.1016/s0167-8760(96)00057-8. [DOI] [PubMed] [Google Scholar]
  • 21.Klimesch W, Doppelmayr M, Schimke H, Ripper B. Theta synchronization and alpha desynchronization in a memory task. Psychophysiology. 1997;34:169–176. doi: 10.1111/j.1469-8986.1997.tb02128.x. [DOI] [PubMed] [Google Scholar]
  • 22.Kreiter AK, Singer W. Oscillatory neuronal responses in the visual cortex of the awake macaque monkey. Eur J Neurosci. 1992;4:369–375. doi: 10.1111/j.1460-9568.1992.tb00884.x. [DOI] [PubMed] [Google Scholar]
  • 23.Kreiter AK, Singer W. Stimulus-dependent synchronization of neuronal responses in the visual cortex of the awake macaque monkey. J Neurosci. 1996;16:2381–2396. doi: 10.1523/JNEUROSCI.16-07-02381.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Kristeva-Feige R, Feige B, Makeig S, Ross B, Elbert T. Oscillatory brain activity during a motor task. NeuroReport. 1993;4:1291–1294. doi: 10.1097/00001756-199309150-00001. [DOI] [PubMed] [Google Scholar]
  • 25.Kronland-Martinet R, Morlet J, Grossmann A. Analysis of sound patterns through wavelet transforms. Int J Patt Recogn Art Intell. 1987;1:273–302. [Google Scholar]
  • 26.Lutzenberger W, Pulvermüller F, Elbert T, Birbaumer N. Visual stimulation alters local 40 Hz responses in humans: an EEG study. Neurosci Lett. 1995;183:39–42. doi: 10.1016/0304-3940(94)11109-v. [DOI] [PubMed] [Google Scholar]
  • 27.Makeig S. Auditory event-related dynamics of the EEG spectrum and effects of exposure to tones. Electroenceph Clin Neurophysiol. 1993;86:283–293. doi: 10.1016/0013-4694(93)90110-h. [DOI] [PubMed] [Google Scholar]
  • 28.Miller EK, Li L, Desimone R. Activity of neurons in anterior inferior temporal cortex during a short-term memory task. J Neurosci. 1993;13:1460–1478. doi: 10.1523/JNEUROSCI.13-04-01460.1993. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Milner PM. A model for visual shape recognition. Psychol Rev. 1974;81:521–535. doi: 10.1037/h0037149. [DOI] [PubMed] [Google Scholar]
  • 30.Miyashita Y, Chang HS. Neuronal correlate of pictorial short-term memory in the primate temporal cortex. Nature. 1988;331:68–70. doi: 10.1038/331068a0. [DOI] [PubMed] [Google Scholar]
  • 31.Müller MM, Bosch J, Elbert T, Kreiter AK, Sosa MV, Sosa PV, Rockstroh B. Visually induced gamma-based responses in human electroencephalographic activity: a link to animal studies. Exp Brain Res. 1996;112:96–102. doi: 10.1007/BF00227182. [DOI] [PubMed] [Google Scholar]
  • 32.Müller MM, Junghofer M, Elbert T, Rochstroh B. Visually induced gamma-band responses to coherent and incoherent motion: a replication study. NeuroReport. 1997;8:2575–2579. doi: 10.1097/00001756-199707280-00031. [DOI] [PubMed] [Google Scholar]
  • 33.Nashmi R, Mendonça AJ, Mackay WA. EEG rhythms of the sensorimotor region during hand movements. Electroenceph Clin Neurophysiol. 1994;91:456–467. doi: 10.1016/0013-4694(94)90166-x. [DOI] [PubMed] [Google Scholar]
  • 34.Pfurtscheller G, Neuper C. Simultaneous EEG 10 Hz desynchronization and 40 Hz synchronization during finger movements. NeuroReport. 1992;3:1057–1060. doi: 10.1097/00001756-199212000-00006. [DOI] [PubMed] [Google Scholar]
  • 35.Pfurtscheller G, Flotzinger D, Neuper C. Differentiation between finger, toe and tongue movement in man based on 40 Hz EEG. Electroenceph Clin Neurophysiol. 1994;90:456–460. doi: 10.1016/0013-4694(94)90137-6. [DOI] [PubMed] [Google Scholar]
  • 36.Singer W, Gray CM. Visual feature integration and the temporal correlation hypothesis. Annu Rev Neurosci. 1995;18:555–586. doi: 10.1146/annurev.ne.18.030195.003011. [DOI] [PubMed] [Google Scholar]
  • 37.Sinkkonen J, Tiitinen H, Näätänen R. Gabor filters: an informative way for analyzing event-related brain activity. J Neurosci Methods. 1995;56:99–104. doi: 10.1016/0165-0270(94)00111-s. [DOI] [PubMed] [Google Scholar]
  • 38.Sugita Y. Electrophysiological correlates of human visual recognition memory. Neurosci Lett. 1994;178:151–154. doi: 10.1016/0304-3940(94)90312-3. [DOI] [PubMed] [Google Scholar]
  • 39.Swartz BE, Halgren E, Fuster JM, Simpkins F, Gee M, Mandelkern M. Cortical metabolic activation in humans during a visual memory task. Cereb Cortex. 1995;5:205–214. doi: 10.1093/cercor/5.3.205. [DOI] [PubMed] [Google Scholar]
  • 40.Tallon-Baudry C, Bertrand O, Delpuech C, Pernier J. Stimulus specificity of phase-locked and non-phase-locked 40 Hz visual responses in human. J Neurosci. 1996;16:4240–4249. doi: 10.1523/JNEUROSCI.16-13-04240.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Tallon-Baudry C, Bertrand O, Delpuech C, Pernier J. Oscillatory γ-band (30–70 Hz) activity induced by a visual search task in humans. J Neurosci. 1997;17:722–734. doi: 10.1523/JNEUROSCI.17-02-00722.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Tecce JJ, Cattanach L. Contingent negative variation (CNV). In: Niedermeyer E, Lopes da Silva F, editors. Electroencephalography, Ed 2. Urban & Schwarzenberg; Baltimore: 1987. pp. 657–679. [Google Scholar]
  • 43.Ungerleider LG. Functional brain imaging studies of cortical mechanisms for memory. Science. 1995;270:769–775. doi: 10.1126/science.270.5237.769. [DOI] [PubMed] [Google Scholar]
  • 44.von der Malsburg C, Schneider W. A neural cocktail-party processor. Biol Cybern. 1986;54:29–40. doi: 10.1007/BF00337113. [DOI] [PubMed] [Google Scholar]

Articles from The Journal of Neuroscience are provided here courtesy of Society for Neuroscience

RESOURCES